CIS 2010 Final
predictive analytics
extracts information from data and uses it to predict future trends and identify behavioral patterns
data dictionary
compiles all of the metadata about the data elements in the data model
utility software
provides additional functionality to the operating system, used to maintain and repair the computer
primary key
A field (or group of fields) that uniquely identifies a given entity in a table
Management Information Systems (MIS)
A business function, like accounting and human resources, which moves information about people, products, and processes across the company to facilitate decision-making and problem-solving
cryptocurrency
A digital asset designed to work as a medium of exchange
data warehouse
A logical collection of information, gathered from many different operational databases, that supports business analysis activities and decision-making tasks
foreign key
A primary key of one table that appears as an attribute in another table and acts to provide a logical relationship between the two tables
ETL (extraction, transformation, and loading)
A process that extracts information from internal and external databases, transforms the information using a common set of enterprise definitions, and loads the information into a data warehouse
information scrubbing
A process that weeds out and fixes or discards inconsistent, incorrect, or incomplete information (i.e. missing records, redundant records, missing key, etc.)
assignment
A statement that assigns a value to a variable
bitcoin
A worldwide cryptocurrency and digital payment system called the first decentralized digital currency, as the system works without a central repository or single administrator
virtualization
Creates multiple "virtual" machines on a single computing device
Business Intelligence (BI)
Information collected from multiple sources such as suppliers, customers, competitors, partners, and industries that analyzes patterns, trends, and relationships for strategic decision making
software updates
Occurs when the software vendor releases updates to software to fix problems or enhance features.
entity relationship diagram
a technique for documenting the entities and relationships in a database environment
stored attribute
The attribute that is used to derive the attribute
decentralization
The process of distributing or dispersing functions, powers, people or things away from a central location or authority
analytics
The use of math and statistics to derive meaning from data
conditional statement
This is the ability to test a variable against a value and act in one way if the condition is met by the variable or another way if not
miners
Transactions are authenticated by a network of these who complete complex mathematical problems to arrive at the same unique solution to verify the transaction
data redundancy
When the same data exists in more than one place in a database
digital ledger
a bookkeeping list of assets (i.e. money, property, ideas), identified ownership, and transactions that record the transfer of ownership among participants, all transactions are recorded with a date, time, participant names and other information
data artist
a business analytics specialist who uses visual tools to help people understand complex data
field
a characteristic of a table
Big Data
a collection of large, complex data sets, including structured and unstructured data, which cannot be analyzed using traditional database methods and tools
system
a collection of parts that link to achieve a common purpose
attribute
a column or specific field of the data elements associated with an entity
information integrity
a measure of the quality of information, occurs when a system produces incorrect, inconsistent, or duplicate data
data types
a particular kind of data item, as defined by the values it can take, the programming language used, or the operations that can be performed on it (i.e. real, integer or Boolean)
format
a pre-established layout for data
record
a row or a collection of related data elements
subsystem
a self-contained system within a larger system (i.e. hardware within an entire computer system)
systems thinking
a way of monitoring the entire system by viewing multiple inputs being processed or transformed to produce outputs while continuously gathering feedback on each part
(1) Any well-connected node in the Bitcoin blockchain can determine, with certainty, whether a transaction does or does not exist in the data set (2) A prohibitively high cost to attempt to rewrite or alter transaction history
advantages of bitcoin
(1) transparency (2) reduced transaction costs (3) faster transaction settlement (4) decentralization (5) user-controlled networks
advantages of blockchain
(1) increased flexibility (2) increased scalability and performance (3) increased information integrity (4) increased information security (5) reduced information redundancy
advantages of relational databases
multitasking
allows more than one piece of software to be used at a time (i.e. using excel and simultaneously printing a document)
double-spend
asset transferred more than once
null-valued attribute
assigned to an attribute when no other value applies or when a value is unknown
derived attribute
attribute can be calculated using the value of another attribute
many-to-many relationship
between two entities in which an instance of one entity is related to many instances of another and one instance of the other can be related to many instances of the first entity
one-to-many relationship
between two entities in which an instance of one entity can be related to many instances of a related entity
one-to-one relationship
between two entities in which an instance of one entity can be related to only one instance of a related entity
composite attribute
can be divided into smaller components, which represent more basic attributes that have their own meanings
simple attribute
cannot be broken down into a smaller component
(1) digital ledger (2) hash/digital signature (3) miners (4) decentralized
central components of blockchain
internal databases
collection of consumer and market information within a company network describing items of interest
JOIN
command used to combine two or more tables
AND
command used to display a record (i.e. message) if all the conditions separated by the function are true
FROM
command used to specify the table that you're getting the data from
information cube
common term for the representation of multidimensional information
table
composed of rows and columns that represent an entity
process
computer program that processes the data
data mart
contains a subset of data warehouse information
feedback loops
controlling every step, at each step you observe the output then adjust the process as required
system software
controls how the various technology tools work together along with the application software
operating system software
controls the application software and manages how the hardware devices work together, supports multitasking
database management system (DBMS)
creates, reads, updates, and deletes data in a database while controlling access and security
inputs
data that is entered in a computer
external databases
databases supplied by organizations outside the firm such as online information databases
business rule
defines how a company performs certain aspects of its business and typically results in either a yes/no or true/false answer
data visualization
describes technologies that allow users to "see" or visualize data to transform information into a business perspective
metadata
details about data
dirty data
erroneous or flawed data
peer-to-peer
every user is allowed to connect to the network, send new transactions to it, verify transactions, and create new blocks
(1) excel (2) email (3) twitter (4) skype (5) photoshop
examples of application software
(1) iOs (2) Android (3) Windows (4) MacOS (5) Linux
examples of operating system software
(1) backup (2) encryption (3) antivirus (4) uninstaller (5) anti-spam (6) screen savers
examples of utility software
cardinality
expresses the specific number of instances in an entity
data analytics
extensive use of data, statistical, and quantitative analysis, explanatory and predictive models, and fact-based management to drive decisions and actions
data scientist
extracts knowledge from data by performing statistical analysis, data mining, and advanced analytics on big data to identify trends, market changes, and other relevant information
(1) entity (table) (2) field (3) record (4) attribute (field) (5) primary key (6) foreign key (7) composite key
features of relational databases
single-valued attribute
having only a single value of each attribute of an entity (i.e. person cannot have more than one age)
hash
how computer science and advanced mathematics protect the blockchain's integrity and anonymity, includes digital signatures from the existing blockchain as well as the new transaction
data model
logical data structures that detail the relationships among data elements using graphics or pictures
database
maintains information about various types of objects (inventory), events (transactions), people (employees), and places (warehouses)
Multi-valued attribute
means having the potential to contain more than one value for an attribute at any given time (i.e. person can have more than one college degree)
feed-forward loops
monitor input variations to adjust the process to compensate
blockchain
new way to store and record transaction, blocks are linked together cryptographically to ensure they are temper-proof
software upgrades
occurs when the software vendor releases a new version of the software, making significant changes to the program
(1) duplicate data (2) incorrect data (3) violates business rules
problems with dirty data
distributed computing
processes and manages algorithms across many machines in a computing environment
integrity constraints
rules that help ensure the quality of information
tools
something that helps a process move forward
barriers
something that holds a process back
entity
stores information about a person, place, thing, transaction, or event
fast data
the application of big data analytics to smaller data sets in near-real or real-time in order to solve a problem or create business value
data aggregation
the collection of data from various sources for the purpose of data processing
information redundancy
the duplication of data, or the storage of the same data in multiple places
output
the resulting information from the computer program
data element
the smallest or basic unit of information(i.e. customer's name, address, email, discount rate, preferred shipping method, product name, etc.)
(1) simple vs. composite (2) single-valued vs. multivalued (3) stored vs. derived (4) null-valued
types of attributes
(1) many-to-many relationship (2) one-to-many relationship (3) one-to-one relationship
types of entity relationships
prescriptive analytics
use techniques that create models indicating the best decision to make or course of action to take
application software
used for specific information processing needs, including payroll, customer relationship management, project management, training, and many others
Structured Query Language (SQL)
users write lines of code to answer questions against a database
open system
where the inputs are converted to the output through a process