Quiz 9 - Chapter 9
The Hadoop Distributed File System layer (HDFS) stores data in blocks of _____ or more.
128 MB
Which of the following is a prime number?
5
Which of the following represents the attribute of an entity in a relational database model?
A column
Which of the following represents an entity in a relational database model?
A row
Which of the following data mining techniques uses a specialized set of algorithms that sorts through data and forms statistical rules about relationships among the items?
Association analysis
Which of the following is a term used to describe data collections that are so enormous and complex that traditional data management software and analysis processes are incapable of dealing with them?
Big data
_____ includes a wide range of applications, practices, and technologies for the extraction, transformation, integration, analysis, interpretation, and presentation of data to support improved decision making.
Business intelligence
_____ tools frequently operate on data stored in a data warehouse or data mart.
Business intelligence
In-memory databases (IMDBs) have become feasible because of the decrease in _____ capacities.
C: random access memory (RAM)
Identify the technique through which an SQL database ensures that other transactions do not modify a database until the first transaction succeeds or fails.
Concurrency control
_____ present a set of key performance indicators about the state of a process at a specific point in time.
Dashboards
_____ involves the overall management of the availability, usability, integrity, and security of the data used in an organization.
Data governance
Which of the following is a minor version of a data warehouse that is scaled down to meet the specific needs of a business unit?
Data mart
_____ is a business intelligence (BI) tool used to explore large amounts of data for hidden patterns to predict future trends and behaviors for use in decision making.
Data mining
Which of the following business intelligence tools will help an organization to make business predictions?
Data mining tools
SoulTree, a software giant, has adopted a business strategy, already employed by its competitor, to market its products. Unfortunately, the adopted strategy does not work for SoulTree although it has been successful for its competitor. Which of the following business intelligence tools can help SoulTree to understand why the adopted strategy failed?
Drill-down analysis
_____ is a powerful tool that enables decision makers to gain insight into the details of business data to better understand why something happened.
Drill-down analysis
Identify the spreadsheet tool that is used to determine the optimal solution to a problem with multiple constraints.
Excel Solver
Which of the following tools in a spreadsheet performs "what if" analysis?
Excel scenario manager
PHQ, a graphics firm, decides to filter its employees according to their dates of joining. Which of the following steps in the extract-transform-load (ETL) process should be taken by the firm to accomplish the task?
Extract
Which of the following steps in an extract-transform-load (ETL) process draws desired data from various data sources to update a data warehouse?
Extract
Zomi, a fashion outlet, wishes to delete the details of orders that cost less than $30. Which of the following steps in the extract-transform-load (ETL) process is used to accomplish this task?
Extract
An SQL database stops abruptly when it encounters an error in the update process.
False
An in-memory database (IMDB) is a database management system that stores an entire database in read-only memory (ROM).
False
Case-based reasoning technique of data mining contains a specialized set of algorithms that sorts through data and forms statistical rules about relationships among the items.
False
In spreadsheets programs, Excel Solver can be used by the end users to perform "what if" analysis for the evaluation of various alternatives.
False
Online analytical processing (OLAP) involves the interactive examination of high-level summary data in increasing detail.
False
Organizations employ data warehouses to store junk data.
False
Unstructured data comes from a single source.
False
Which of the following is true of the neural computing technique in data mining?
Historical data is examined for patterns that are then used to make predictions.
_____ enables the analysis of big data and other challenging data-processing applications.
In-memory database (IMDB)
Identify the industry that implements data governance.
Insurance industry
Which of the following is true of unstructured data?
It can add a depth to data analysis, with the right tools, that could not be achieved otherwise.
Which of the following best describes the application layer of Hadoop?
It enables applications such as internally developed programming languages or purchased third-party business intelligence tools to interact with Hadoop.
Which of the following best describes the relational database model?
It helps organize structured data into collections of two-dimensional tables.
Identify a true statement about drill-down analysis.
It involves the interactive examination of high-level summary data in increasing detail to gain insight into certain elements.
Which of the following is true of a data mart?
It is a smaller version of a data warehouse.
Which of the following is true of Hadoop?
It is an open-source software framework designed for processing large volumes of data by dividing the work into a set of independent tasks.
Which of the following is true of a NoSQL database?
It is highly scalable.
Which of the following best describes an extract-transform-load (ETL) process?
It is used to pull data from disparate data sources to populate and maintain the data warehouse.
Identify a drawback of a NoSQL database.
It lacks strong data consistency.
Which of the following steps in an extract-transform-load (ETL) process creates an updated version of a data warehouse?
Load
Which of the following steps in an extract-transform-load (ETL) process updates an existing data warehouse with the data obtained from the extract and transform steps?
Load
Which of the following databases is designed to store and retrieve data without rigidly implementing the ACID (atomicity, consistency, isolation, and durability) conditions associated with the relational database model?
NoSQL
Which of the following databases should be used by a company that has over 50,000 employees in several locations to efficiently manage its database even when a couple of servers are down?
NoSQL
PindCart, a garment retailer, is not sure of the demand for its products during the festive seasons and ends up with very little profit. Which of the following business intelligence tools can help PindCart to predict the demand and maximize its sales?
Online analytical processing
Which of the following is a method to analyze multidimensional data from different perspectives?
Online analytical processing (OLAP)
_____ enables users to identify issues and opportunities as well as perform trend analysis by analyzing multidimensional data from many different perspectives.
Online analytical processing (OLAP)
_____ are designed to draw data in real time from various sources, including corporate databases and spreadsheets, so decision makers can make use of up-to-the-minute data.
Operational dashboards
Which of the following is the storage medium of an in-memory database (IMDB)?
Random access memory (RAM)
Which of the following enables end users to make their own data requests and format the results without the need for additional help from the information technology organization?
Reporting and querying tools
Which of the following is a special-purpose programming language used for accessing and manipulating data stored in a relational database?
SQL
Identify the feature of a NoSQL database that allows a large database to be distributed across thousands of servers running the same NoSQL database management system.
Scalability
Which of the following business intelligence tools performs operations on data based on formulas created by the end user?
Spreadsheet
_____ can be used to create useful reports and graphs based on given data.
Spreadsheets
Identify the system feature that is improved due to the highly scalable nature of NoSQL databases.
System uptime
Which layer of Hadoop is responsible for data storage?
The Hadoop Distributed File system layer (HDFS)
Which of the following is the reason behind the quick responsiveness of online analytical processing (OLAP)?
The pre-aggregation of detailed data into useful data summaries
Which of the following layers of Hadoop handles job failures and interruptions?
The workload management layer
Which of the following best describes reporting and querying tools?
They present data in a simplified form of graphs and charts, making it easier to understand.
Which of the following is true of dashboards?
They provide rapid access to information in an easy-to-interpret manner.
Hadoop is designed for processing large volumes of data by dividing the work into a set of independent tasks that are executed in parallel on a large number of servers.
True
Online analytical processing (OLAP) enables users to identify issues and opportunities as well as perform trend analysis.
True
The goal of a NoSQL database is to provide very fast performance even as the amount of data that must be processed continues to grow.
True
The columns in a relational database model such as customer name and address represent _____.
attributes
The ability to ensure that an update to data in one part of a database is immediately propagated to all other parts of the database is known as _____.
data consistency
Databases built to support online analytical processing (OLAP) consist of _____.
data cubes
The ACID (atomicity, consistency, isolation, and durability) properties of SQL databases ensure _____.
data integrity
A _____ is a database that stores large amounts of historical data in a form that readily supports analysis and management decision making in an organization.
data warehouse
The effectiveness of a(n) _____ process is essential to ensure the success of a data warehouse.
extract-transform-load
A(n) _____ is used to pull data from disparate data sources to populate and maintain a data warehouse.
extract-transform-load (ETL) process
A(n) _____ is a database management system that stores an entire database in random access memory (RAM).
in-memory database (IMDB)
Metrics that consist of a direction, measure, target, and time frame are known as _____.
key performance indicators
A NoSQL database stores data as highly optimized key-value pairs with the data stored in a _____ table.
simple two-column
In an extract-transform-load (ETL) process, the mined data used to update a data warehouse is converted to a different format in the _____ step.
transform
Data that does not fit nicely into relational databases is known as _____.
unstructured data