Chapter 6. Data: Business Intelligence
Name three items for the data elements associated with an entity.
1. Attributes 2. Columns 3. Fields
Which high-quality information term matches this question? Is there an incorrect value in the information?
Accurate
What data mining activity matches this definition? Determines the relationship between variables and the nature of the relationships.
Affinity grouping analysis
Mathematical formulas placed in software that performs an analysis on a data set. What term matches this definition?
Algorithms
What occurs when the user goes into an emotional state of over-analyzing (or over-thinking) a situation so that a decision or action is never taken, in effect paralyzing the outcome?
Analysis paralysis
Using data about people's behaviors to understand intent and predict future actions.
Behavioral analysis
________ data is a collection of large, complex data sets, including structured and unstructured data, which cannot be analyzed using traditional database methods and tools.
Big
________-critical integrity constraints enforce business rules vital to an organization's success and often require more insight and knowledge than relational integrity constraints.
Business
What tracks corporate metrics such as critical success factors and key performance indicators and includes advanced capabilities such as interactive controls allowing users to manipulate data for analysis?
Business intelligence dashboards
What defines how a company performs certain aspects of its business and typically results in either a yes/no or true/false answer?
Business rule
What data mining phase matches this definition? Gain a clear understanding of the business problem that must be solved and how it impacts the company.
Business understanding
What data mining activity matches this definition? Assigns records to one of a predefined set of classes.
Classification analysis
________ analysis is a technique used to divide information sets into mutually exclusive groups such that the members of each group are as close together as possible to one another and the different groups are as far apart as possible.
Cluster
What is the collection of data from various sources for the purposes of data processing?
Data aggregation
What compiles all of the metadata about the data elements in the data model?
Data dictionary
What is an interactive website that uses a database to constantly update in order to remain relevant to the needs of its customers?
Data-driven website
What uses a variety of techniques to find patterns and relationships in large volumes of information that predict future behavior and guide decision making?
Data-mining tools
What is the term for this example? While a database has only one physical view, it can easily support multiple logical views that provide flexibility.
Database
What data mining phase matches this definition? Deploy the discoveries to the organization for work in everyday business.
Deployment
What data mining activity matches this definition? Determines values for an unknown continuous variable behavior or estimated future value.
Estimation analysis
What data mining phase matches this definition? Analyze the trends and patterns to assess the potential for solving the business problem.
Evaluation
Identifies patterns in data, including outliers, uncovering the underlying structure to understand relationships between the variables.
Exploratory data analysis
What reason why business analysis is difficult using operational databases matches this definition? Every department had its own method for recording data so when trying to share information, data did not match and users did not get the data they really needed.
Inconsistent data definitions
What reason why business analysis is difficult using operational databases matches this definition? Most data stored in operational databases did not allow users direct access; users had to wait to have their queries or questions answered by MIS professionals who could code SQL.
Ineffective direct data access
What occurs when the same data element has different values?
Information inconsistency
What focuses on how individual users access information to meet their own particular business needs?
Logical view of information
What analyzes such items as websites and checkout scanner information to detect customers' buying behavior and predict future behavior by identifying affinities among customers' choices of products and services?
Market basket analysis
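As a rough illustration of identifying affinities among product choices, the sketch below counts how often two products appear in the same basket. The baskets and the `pair_counts` helper are made up for illustration; real market basket tools use richer measures than a raw count.

```python
from collections import Counter
from itertools import combinations

# Hypothetical checkout-scanner data: each set is one customer's basket.
baskets = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"beer", "chips"},
    {"bread", "butter"},
]

def pair_counts(baskets):
    """Count how many baskets contain each pair of products."""
    counts = Counter()
    for basket in baskets:
        # Sort so that each pair is always stored in the same order.
        for pair in combinations(sorted(basket), 2):
            counts[pair] += 1
    return counts

counts = pair_counts(baskets)
# Pairs that appear together most often come first.
most_common_pairs = counts.most_common(2)
```

Pairs with high counts suggest products that customers tend to buy together.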
________ data management is the practice of gathering data and ensuring that it is uniform, accurate, consistent, and complete, including such entities as customers, suppliers, products, sales, employees, and other critical entities that are commonly integrated across organizational systems.
Master
________ provides details about data.
Metadata
A data value that is numerically distant from most of the other data points in a set of data. What term matches this definition?
Outlier
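One common way to make "numerically distant" concrete is a z-score: how many standard deviations a value lies from the mean. The sketch below assumes a cutoff of 2 standard deviations, which is an illustrative choice and not part of the definition above; the data values are invented.

```python
from statistics import mean, pstdev

def find_outliers(values, threshold=2.0):
    """Return values whose z-score magnitude exceeds the threshold.

    The threshold of 2.0 is an assumption made for this example.
    """
    center = mean(values)
    spread = pstdev(values)  # population standard deviation
    if spread == 0:
        return []  # all values identical, so nothing is distant
    return [v for v in values if abs(v - center) / spread > threshold]

# Hypothetical daily order counts with one unusual day.
daily_orders = [10, 12, 11, 13, 12, 11, 95]
outliers = find_outliers(daily_orders)
```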
The classification or labeling of an identified pattern in the machine learning process.
Pattern recognition analysis
What deals with the physical storage of information on a storage device?
Physical view of information
What reason why business analysis is difficult using operational databases matches this definition? The data, if available, were often incorrect or incomplete. Therefore, users could not rely on the data to make decisions.
Poor data quality
What is a field that uniquely identifies a given record in a table?
Primary key
What are examples of analytical information?
Product statistics, sales projections, future growth, trends
________-time information means immediate, up-to-date information.
Real
Which system provides real-time information in response to requests?
Real-time
What is a data mining algorithm that analyzes a customer's purchases and actions on a website and then uses the data to recommend complementary products?
Recommendation engine
Which of the following are rules that enforce basic and fundamental information-based constraints, such as not creating an order for a nonexistent customer?
Relational integrity constraints
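The "no order for a nonexistent customer" rule can be sketched with a primary key and a foreign key in SQLite, using Python's built-in `sqlite3` module. The table names, column names, and the `try_insert_order` helper are hypothetical.

```python
import sqlite3

# In-memory database; table and column names are hypothetical.
conn = sqlite3.connect(":memory:")
# SQLite only enforces foreign keys when this pragma is switched on.
conn.execute("PRAGMA foreign_keys = ON")
conn.execute(
    "CREATE TABLE customer (customer_id INTEGER PRIMARY KEY, name TEXT NOT NULL)"
)
conn.execute(
    "CREATE TABLE orders ("
    " order_id INTEGER PRIMARY KEY,"
    " customer_id INTEGER NOT NULL REFERENCES customer(customer_id))"
)
conn.execute("INSERT INTO customer (customer_id, name) VALUES (1, 'Ada')")

def try_insert_order(order_id, customer_id):
    """Return True if the order was stored, False if a constraint rejected it."""
    try:
        conn.execute(
            "INSERT INTO orders (order_id, customer_id) VALUES (?, ?)",
            (order_id, customer_id),
        )
        return True
    except sqlite3.IntegrityError:
        return False

accepted = try_insert_order(100, 1)    # customer 1 exists
rejected = try_insert_order(101, 999)  # customer 999 does not exist
order_count = conn.execute("SELECT COUNT(*) FROM orders").fetchone()[0]
```

Here `customer_id` is the primary key of `customer` and appears as a foreign key in `orders`, which is what links the two tables.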
What is a central location in which data is stored and managed?
Repository
What are examples of transactional information?
Sales receipt, airline ticket, packing slip
Analyzes text flowing across the internet, including unstructured text from blogs and messages.
Social media analysis
What identifies the primary location where data is collected?
Source data
The process of analyzing recorded calls to gather information.
Speech analysis
________-series information is timestamped information collected at a particular frequency.
Time
What is timestamped information collected at a particular frequency?
Time-series information
Which high-quality information term matches this question? Is the information current with respect to business needs?
Timely
What term matches this definition? Encompasses all of the information contained within a single business process or unit of work, and its primary purpose is to support daily operational tasks.
Transactional information
True or false: Business-critical integrity constraints tend to mirror the very rules by which an organization achieves success.
True
True or false: Databases today scale to exceptional levels, allowing all types of users and programs to perform information processing and information-searching tasks.
True
What big data characteristic matches this definition? The scale of data; includes enormous volumes of data generated daily; Massive volume created by machines and networks.
Volume
Analyzes unstructured data associated with websites to identify consumer behavior and website navigation.
Web analysis
Which of the following are answers to tough business questions BI can answer? (Choose all that apply). a. Where is the business now? b. Who is the best sales representative? c. Where is the business going? d. Where has the business been? e. What is the best selling product?
a, c, d
A data _________ is a business analytics specialist who uses visual tools to help people understand complex data.
artist
Select the four primary reasons low-quality information occurs in a system. a. Different systems use consistent information entry standards and formats b. Different systems have different information entry standards and formats c. Data-entry personnel enter abbreviated information to save time d. Third-party and external information contains inconsistencies and errors e. Data-entry personnel enter accurate information to receive bonuses f. Online customers intentionally enter inaccurate information to protect their privacy
b, c, d, f
The problem of being data rich and information poor results from an inability to turn business data into ________.
business intelligence
Which statement accurately defines the relationship between entities and attributes in a relational database? a. Each attribute in an entity occupies multiple columns in its respective table b. Each entity is an attribute that occupies one row in its respective table c. Each attribute of an entity occupies a separate column of a table d. Each attribute in an entity occupies multiple rows in its respective table
c
Data ________ refers to the overall management of the availability, usability, integrity, and security of company data.
governance
Many firms complete data ________ audits to determine the accuracy and completeness of their data.
quality
A(n) _______-by-example tool helps users graphically design the answer to a question against a database.
query
Name five accurate statements reflecting the business advantages of a relational database.
1. Increased flexibility 2. Increased information integrity 3. Increased information security 4. Increased scalability and performance 5. Reduced information redundancy
Name two terms that describe the process for weeding out, fixing, or discarding inconsistent, incorrect, or incomplete information.
1. Information cleansing 2. Information scrubbing
Name four primary traits of the value of information.
1. Information type 2. Information timeliness 3. Information quality 4. Information governance
The process of identifying rare or unexpected items or events in a data set that do not conform to other items in a data set.
Anomaly detection
This information type matches with what data mining term? Summary information level
Coarse granularity
Which high-quality information term matches this question? Is a value missing from the information?
Complete
What is a technique for establishing a match, or balance, between the source data and the target data warehouse?
Data map
________ are predictions based on time-series information.
Forecasts
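A very simple forecast over time-series information is a moving average: predict the next value as the mean of the most recent observations. This is only one of many forecasting techniques, chosen here for brevity; the sales figures and the window size of 3 are assumptions.

```python
def moving_average_forecast(series, window=3):
    """Forecast the next value as the mean of the last `window` observations."""
    if len(series) < window:
        raise ValueError("series is shorter than the averaging window")
    recent = series[-window:]
    return sum(recent) / window

# Hypothetical monthly sales collected at a fixed (monthly) frequency.
monthly_sales = [100, 110, 120, 130, 140, 150]
next_month = moving_average_forecast(monthly_sales)
```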
Analyzes unstructured data to find trends and patterns in words and sentences.
Text analysis
Why does a database offer increased information security? a. Various security features of databases allow individuals to access all information contained in the database. b. Various security features of databases ensure that individuals have only certain types of access to certain types of information.
b
A content ________ is the person responsible for creating the original website content.
creator
A(n) ________ manages information about various types of objects (inventory), events (transactions), people (employees), and places (warehouses).
database
A(n) ________ is a statement about what will happen or might happen in the future; for example, predicting future sales or employee turnover.
prediction
A(n) ________ is a collection of related data elements.
record
Data ________ includes the tests and evaluations used to determine compliance with data governance policies to ensure correctness of data.
validation
Virtualization is the creation of a ________ (rather than actual) version of computing resources, such as an operating system, a server, a storage device, or network resources.
virtual
A data ________ is a logical collection of information, gathered from many different operational databases, that supports business analysis activities and decision-making tasks.
warehouse
Name four functions that a database management system can perform on data in a database.
1. Creates data 2. Reads data 3. Updates data 4. Deletes data
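The four functions map onto the SQL statements INSERT, SELECT, UPDATE, and DELETE. The sketch below runs each one against an in-memory SQLite database through Python's built-in `sqlite3` module; the `product` table and its values are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE product (product_id INTEGER PRIMARY KEY, name TEXT, price REAL)"
)

# Create: add a new record.
conn.execute("INSERT INTO product (product_id, name, price) VALUES (1, 'Widget', 9.99)")

# Read: retrieve the record.
read_sql = "SELECT name, price FROM product WHERE product_id = 1"
row_after_create = conn.execute(read_sql).fetchone()

# Update: change a field in the existing record.
conn.execute("UPDATE product SET price = 12.5 WHERE product_id = 1")
row_after_update = conn.execute(read_sql).fetchone()

# Delete: remove the record.
conn.execute("DELETE FROM product WHERE product_id = 1")
remaining = conn.execute("SELECT COUNT(*) FROM product").fetchone()[0]
```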
What are three focus areas of big data?
1. Data mining 2. Data analysis 3. Data visualization
What are three advantages of a data-driven website?
1. Easy to manage content 2. Easy to store large amounts of data 3. Easy to eliminate human errors
What term matches this definition? Encompasses all organizational information, and its primary purpose is to support the performing of managerial analysis tasks.
Analytical information
The science of fact-based decision making. What term matches this definition?
Analytics
What data mining activity matches this definition? Divides information that is similar into groups that are mutually exclusive of one another.
Cluster analysis
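One well-known clustering technique is k-means, which repeatedly assigns each value to its nearest group center and then moves each center to the mean of its group. The one-dimensional sketch below is a simplified illustration, not a full implementation; the spending figures, the starting centers, and the `kmeans_1d` helper are all assumptions.

```python
def kmeans_1d(values, centers, iterations=10):
    """Group numbers around the given starting centers (simplified 1-D k-means)."""
    centers = list(centers)
    for _ in range(iterations):
        # Assignment step: each value joins the group of its nearest center.
        groups = [[] for _ in centers]
        for v in values:
            nearest = min(range(len(centers)), key=lambda i: abs(v - centers[i]))
            groups[nearest].append(v)
        # Update step: move each center to the mean of its group
        # (an empty group keeps its previous center).
        centers = [sum(g) / len(g) if g else c for g, c in zip(groups, centers)]
    return centers, groups

# Hypothetical customer spending amounts with two obvious segments.
spend = [10, 12, 11, 90, 95, 92]
centers, groups = kmeans_1d(spend, centers=[0.0, 100.0])
```

Each value lands in exactly one group, which is the "mutually exclusive" property in the definition.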
Which high-quality information term matches this question? Is aggregate or summary information in agreement with detailed information?
Consistent
Determines a statistical relationship between variables, often for the purpose of identifying predictive factors among variables.
Correlation analysis
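A standard measure of the statistical relationship between two variables is the Pearson correlation coefficient, which ranges from -1 to 1. The sketch below computes it directly from its definition; the advertising and sales figures are invented for illustration.

```python
from math import sqrt

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length (at least 2)")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sum of products of deviations, and sums of squared deviations.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    # Note: a constant sequence has zero variance and would divide by zero.
    return cov / sqrt(var_x * var_y)

# Hypothetical data: sales rise in step with advertising spend.
ad_spend = [1, 2, 3, 4, 5]
sales = [2, 4, 6, 8, 10]
r = pearson(ad_spend, sales)
```

A value near 1 or -1 marks a variable as a candidate predictive factor; correlation alone does not show that one variable causes the other.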
What is the smallest or basic unit of information?
Data element
What data mining phase matches this definition? Apply mathematical techniques to identify trends and patterns in the data.
Data modeling
What are logical data structures that detail the relationships among data elements using graphics or pictures?
Data models
What data mining phase matches this definition? Gather and organize the data in the correct formats and structures for analysis.
Data preparation
What statement accurately describes a situation in which there is too much data to properly understand or make use of it?
Data rich and information poor
What is an organized collection of data?
Data set
Who is responsible for ensuring the policies and procedures are implemented across the organization and acts as a liaison between the MIS and the business?
Data steward
What is the management and oversight of an organization's data assets to help provide business users with high-quality data that is easily accessible in a consistent manner?
Data stewardship
What data mining phase matches this definition? Analyze all current data along with identifying any data quality issues.
Data understanding
What describes technologies that allow users to "see" or visualize data to transform information into a business perspective?
Data visualization
What moves beyond Excel graphs and charts into sophisticated analysis techniques such as pie charts, controls, instruments, maps, time-series graphs, and more?
Data visualization tools
What is erroneous or flawed data?
Dirty data
What processes and manages algorithms across many machines in a computing environment?
Distributed computing
This information type matches with what data mining term? Proceed in reverse through decreasing levels of detail
Drilling up
Which of the following problems are associated with dirty data?
Duplicate data, non-formatted data, and incorrect data
What includes data that change based on user actions?
Dynamic information
What is stored in a dynamic catalog, or an area of a website that stores information about products in a database?
Dynamic website information
What is a process that extracts information from internal and external databases, transforms it using a common set of enterprise definitions, and loads it into a data warehouse?
Extraction, transformation, and loading (ETL)
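The three ETL steps can be sketched in a few lines. In this illustration the "sources" are two in-memory lists that record gender differently, the transform step maps them onto one common set of codes, and the load step writes to an in-memory SQLite table standing in for a data warehouse. All names and data here are hypothetical.

```python
import sqlite3

# Extract: rows pulled from two hypothetical operational sources
# that use different conventions for the same data element.
sales_rows = [{"name": "Ada", "gender": "F"}, {"name": "Bob", "gender": "M"}]
support_rows = [{"name": "Cy", "gender": "male"}, {"name": "Di", "gender": "Female"}]

# Transform: one common enterprise definition for gender codes (assumed).
GENDER_MAP = {"f": "F", "female": "F", "m": "M", "male": "M"}

def transform(row):
    """Standardize one source row; unknown codes become 'U'."""
    code = GENDER_MAP.get(row["gender"].strip().lower(), "U")
    return (row["name"], code)

def load(rows, conn):
    """Load standardized rows into the warehouse table."""
    conn.execute("CREATE TABLE IF NOT EXISTS customer_dim (name TEXT, gender TEXT)")
    conn.executemany("INSERT INTO customer_dim VALUES (?, ?)", rows)
    conn.commit()

warehouse = sqlite3.connect(":memory:")
load([transform(r) for r in sales_rows + support_rows], warehouse)
loaded = warehouse.execute(
    "SELECT name, gender FROM customer_dim ORDER BY name"
).fetchall()
```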
True or false: Information in an organization exists in only a few departments, such as sales and marketing.
False
What is the application of big data analytics to smaller data sets in near-real or real-time in order to solve a problem or create business value?
Fast data
________ present the results of data analysis, displaying the patterns, relationships, and trends in a graphical format.
Infographics
What occurs when a system produces incorrect, inconsistent, or duplicate data?
Information integrity issues
What reason why business analysis is difficult using operational databases matches this definition? Managers need to perform cross-functional analysis using data from all departments, which differed in granularities, formats, and levels.
Lack of data standards
________ information includes fixed data incapable of change in the event of a user action.
Static
What is the tool that consists of lines of code (in contrast to a graphical design) for answering questions against a database?
Structured query language (SQL)
True or false: Gender, for instance, can be referred to in many ways, but it should be standardized in a data warehouse with one common way of referring to each data element that stores gender.
True
Which high-quality information term matches this question? Is each transaction and event represented only once in the information?
Unique
What big data characteristic matches this definition? Different forms of structured and unstructured data; Data from spreadsheets and databases as well as from email, videos, photos, and PDFs, all of which must be analyzed.
Variety
What big data characteristic matches this definition? The analysis of streaming data as it travels around the Internet; Analysis necessary of social media messages spreading globally.
Velocity
What big data characteristic matches this definition? The uncertainty of data, including biases, noise, and abnormalities; Uncertainty or untrustworthiness of data; Data must be meaningful to the problem being analyzed.
Veracity
Integrity ________ are rules that help ensure the quality of information.
constraints
What is a business that collects personal information about consumers and sells that information to other organizations?
Data broker
What is a storage repository that holds a vast amount of raw data in its original format until the business needs it?
Data lake
What is the process of collecting statistics and information about big data in an existing source?
Data profiling
Data-driven ________ management is an approach to business governance that values decisions that can be backed up with verifiable data.
decision
This information type matches with what data mining term? Progress through increasing levels of detail
Drilling down
A content ________ is the person responsible for updating and maintaining website content.
editor
Select a term that refers to the table used to store information about a person, place, thing, transaction, or event.
entity
A(n) ________ key is a primary key of one table that appears as an attribute in another table and acts to provide a logical relationship between the two tables.
foreign
Information ________ refers to the extent of detail within the information (fine and detailed or coarse and abstract).
granularity
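To illustrate fine versus coarse granularity, the sketch below takes detailed daily sales rows and rolls them up into monthly totals. The data and the `roll_up_to_month` helper are hypothetical, and dates are assumed to be ISO-formatted strings (YYYY-MM-DD).

```python
from collections import defaultdict

# Fine granularity: one row per day (hypothetical data).
daily_sales = [
    ("2024-01-05", 100),
    ("2024-01-20", 50),
    ("2024-02-03", 70),
]

def roll_up_to_month(rows):
    """Summarize daily rows into coarser monthly totals."""
    totals = defaultdict(int)
    for date, amount in rows:
        month = date[:7]  # "YYYY-MM" prefix of an ISO date string
        totals[month] += amount
    return dict(totals)

# Coarse granularity: one total per month.
monthly_sales = roll_up_to_month(daily_sales)
```

Going from `daily_sales` to `monthly_sales` moves toward less detail; going back to the daily rows for a given month moves toward more detail.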
What reason why business analysis is difficult using operational databases matches this definition? Users could not get the data they needed; what was collected was not always useful for intended purposes.
Inadequate data usefulness
A data point is an ________ item on a graph or a chart.
individual
What is the common term for the representation of multidimensional information?
Information cube
Information ________ is a measure of the quality of information.
integrity
A data ________ contains a subset of data warehouse information.
mart
Data _________ is the process of analyzing data to extract information not offered by the raw data alone.
mining
One primary goal of a database is to eliminate information redundancy by recording each piece of information in ________ place(s) in the database.
only one
Information ________ is the duplication of data, or the storage of the same data in multiple places.
redundancy
A(n) ________ database management system allows users to create, read, update, and delete data in a relational database.
relational
A(n) ________ database model stores information in the form of logically related two-dimensional tables.
relational
Data ________ is the process of sharing information to ensure consistency between multiple data sources.
replication
A data ________ extracts knowledge from data by performing statistical analysis, data mining, and advanced analytics on big data to identify trends, market changes, and other relevant information.
scientist