module 9 practice questions
Social Media Analysis
Analyzes text flowing across the Internet, including unstructured text from blogs and messages.
Web Analysis
Analyzes unstructured data associated with websites to identify consumer behavior and website navigation.
Text Analysis
Analyzes unstructured data to find trends and patterns in words and sentences
What contains a subset of data warehouse information?
Data mart
What is a process that weeds out and fixes or discards inconsistent, incorrect, or incomplete information?
Information cleansing
The data, if available, were often incorrect or incomplete. Therefore, users could not rely on the data to make decisions.
Poor Data Quality
BI can help managers with __________ monitoring where a company keeps tabs of its competitor's activities on the web using software that automatically tracks all competitor website activities such as discounts and new products.
competitive
A data __________ is a technique for establishing a match, or balance, between the source data and the target data warehouse.
map
Algorithms are mathematical formulas placed in software that performs an analysis on a data set.
true
Infographics (information graphics) present the results of data analysis, displaying the patterns, relationships, and trends in a graphical format.
true
What is the creation of a virtual (rather than actual) version of computing resources, such as an operating system, a server, a storage device, or network resources?
virtualization
Data ___________ describes technologies that allow users to "see" or visualize data to transform information into a business perspective.
visualization
A data ________ is a logical collection of information, gathered from many different operational databases, that supports business analysis activities and decision-making tasks.
warehouse
What is the solution to the problem of being data rich and information poor?
Business intelligence
What tracks corporate metrics such as critical success factors and key performance indicators and include advanced capabilities such as interactive controls allowing users to manipulate data for analysis?
Business intelligence dashboards
Select the statement that accurately describes a situation in which there is too much data to properly understand or make use of it.
Data rich and information poor
What describes technologies that allow users to "see" or visualize data to transform information into a business perspective?
Data visualization
Correlation Analysis
Determines a statistical relationship between variables, often for the purpose of identifying predictive factors among the variables.
__________ computing processes and manages algorithms across many machines in a computing environment.
Distributed
___________, transformation, and loading is a process that extracts information from internal and external databases, transforms it using a common set of enterprise definitions, and loads it into a data warehouse.
Extraction
What is a process that extracts information from internal and external databases, transforms it using a common set of enterprise definitions, and loads it into a data warehouse?
Extraction, transformation, and loading
Exploratory Data Analysis
Identifies patterns in data, including outliers, uncovering the underlying structure to understand relationships between the variables.
Users could not get the data they needed; what was collected was not always useful for intended purposes.
Inadequate Data Usefulness
Every department had its own method for recording data so when trying to share information, data did not match and users did not get the data they really needed.
Inconsistent Data Definitions
Most data stored in operational databases did not allow users direct access; users had to wait to have their queries or questions answered by MIS professionals who could code SQL.
Ineffective Direct Data Access
Select two terms that describe the process for weeding out, fixing, or discarding inconsistent, incorrect, or incomplete information.
Information cleansing Information scrubbing
Managers need to perform cross-functional analysis using data from all departments, which differed in granularities, formats, and levels.
Lack of Data Standards
data identifies the primary location where data is collected.
Source
Pattern Recognition Analysis
The classification or labeling of an identified pattern in the machine learning process.
Speech Analysis
The process of analyzing recorded calls to gather information; brings structure to customer interactions and exposes information buried in customer contact center interactions with an enterprise.
Speech analysis
The process of analyzing recorded calls to gather information; brings structure to customer interactions and exposes information buried in customer contact center interactions with an enterprise.
Behavioral analysis
Using data about people's behaviors to understand intent and predict future actions.
Different forms of structured and unstructured data
Variety
The analysis of streaming data as it travels around the Internet
Velocity
The uncertainty of data, including biases, noise, and abnormalities
Veracity
__________ is the creation of a virtual (rather than actual) version of computing resources, such as an operating system, a server, a storage device, or network resources.
Virtualization
The scale of data
Volume
Data __________ is the collection of data from various sources for the purpose of data processing.
aggregation
What are mathematical formulas placed in software that performs an analysis on a data set?
algorithms
What is the process of identifying rare or unexpected items or events in a data set that do not conform to other items in the data set?
anomaly detection
A data __________ is a business analytics specialist who uses visual tools to help people understand complex data.
artist
A data __________ is a business that collects personal information about consumers and sells that information to other organizations.
broker
A(n) __________ analysis can compare two or more data sets to identify patterns and trends.
comparative
What can compare two or more data sets to identify patterns and trends?
comparative analysis
What occurs when a company keeps tabs of its competitor's activities on the web using software that automatically tracks all competitor website activities such as discounts and new products?
competitive monitoring
What is the common term for the representation of multidimensional information?
cube
Business intelligence __________ track corporate metrics such as critical success factors and key performance indicators and include advanced capabilities such as interactive controls allowing users to manipulate data for analysis.
dashboards
What is the collection of data from various sources for the purpose of data processing?
data aggregation
Who is a business analytics specialist who uses visual tools to help people understand complex data?
data artist
What is a business that collects personal information about consumers and sells that information to other organizations?
data broker
What is a storage repository that holds a vast amount of raw data in its original format until the business needs it?
data lake
What is a technique for establishing a match, or balance, between the source data and the target data warehouse?
data map
What is an individual item on a graph or a chart?
data point
Who extracts knowledge from data by performing statistical analysis, data mining, and advanced analytics on big data to identify trends, market changes, and other relevant information?
data scientist
What is an organized collection of data?
data set
What is a logical collection of information, gathered from many different operational databases, that supports business analysis activities and decision-making tasks?
data warehouse
What is an approach to business governance that values decisions that can be backed up with verifiable data?
data-driven decision making
Data-driven __________ management is an approach to business governance that values decisions that can be backed up with verifiable data.
decision
Anomaly __________ is the process of identifying rare or unexpected items or events in a data set that do not conform to other items in the data set.
detection
What is erroneous or flawed data?
dirty data
What processes and manages algorithms across many machines in a computing environment?
distributed computing
_________ data is the application of big data analytics to smaller data sets in near-real or real-time in order to solve a problem or create business value.
fast
What is the application of big data analytics to smaller data sets in near-real or real-time in order to solve a problem or create business value?
fast data
What presents the results of data analysis, displaying the patterns, relationships, and trends in a graphical format?
infographics
A data __________ is a storage repository that holds a vast amount of raw data in its original format until the business needs it.
lake
A(n) __________ is a data value that is numerically distant from most of the other data points in a set of data.
outlier
What is a data value that is numerically distant from most of the other data points in a set of data?
outlier
A(n) __________ is a central location in which data is stored.
repository
A data __________ extracts knowledge from data by performing statistical analysis, data mining, and advanced analytics on big data to identify trends, market changes, and other relevant information.
scientist
A data __________ is an organized collection of data.
set
What identifies the primary location where data is collected?
source data