ISM4402 CH7 MC
In the financial services industry, Big Data can be used to improve
regulatory oversight and decision making
In a Hadoop "stack," what is a slave node?
a node where data is stored and processed
Using data to understand customers/clients and business operations to sustain and foster growth and profitability is
an increasingly challenging task for today's enterprises
In the Analyzing Disease Patterns from an Electronic Medical Records Data Warehouse case study, what was the analytic goal?
determine differences in rates of disease in urban and rural populations
In a network analysis, what connects nodes?
edges
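A minimal sketch of the node/edge idea, using made-up names rather than data from any case study: each edge is a pair of nodes, and counting the edges that touch a node gives its degree.

```python
# Hypothetical toy network: each tuple is one edge connecting two nodes.
edges = [("ana", "ben"), ("ben", "cy"), ("ana", "cy"), ("cy", "dee")]

# Build an undirected adjacency map: node -> set of directly connected nodes.
neighbors = {}
for a, b in edges:
    neighbors.setdefault(a, set()).add(b)
    neighbors.setdefault(b, set()).add(a)

# Degree = number of edges touching each node.
degree = {node: len(adjacent) for node, adjacent in neighbors.items()}
```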
Which Big Data approach promotes efficiency, lower cost, and better performance by processing jobs in a shared, centrally managed pool of IT resources?
grid computing
Allowing Big Data to be processed in memory and distributed across a dedicated set of nodes can solve complex problems in near-real time with highly accurate insights. What is this process called?
in-memory analytics
In the Alternative Data for Market Analysis or Forecasts case study, satellite data was NOT used for
monitoring individual customer patterns
In the Twitter case study, how did influential users support their tweets?
objective data
In a Hadoop "stack," which node periodically replicates and stores data from the Name Node in case the Name Node fails?
Secondary Node (Hadoop's Secondary NameNode)
Companies with the largest revenues from Big Data tend to be
the largest computer and IT services firms
Traditional data warehouses have not been able to keep up with
the variety and complexity of data
Under which of the following requirements would it be more appropriate to use Hadoop over a data warehouse?
unrestricted, ungoverned sandbox explorations
What is the Hadoop Distributed File System (HDFS) designed to handle?
unstructured and semistructured non-relational data
Data flows can be highly inconsistent, with periodic peaks, making data loads hard to manage. What is this feature of Big Data called?
variability
A newly popular unit of data in the Big Data era is the petabyte (PB), which is
10^15 bytes
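As a quick check of the scale, assuming decimal (SI) prefixes where each step up is a factor of 1,000, a petabyte is 10 to the 15th power bytes:

```python
# Decimal (SI) storage units, in bytes. Binary units (PiB, TiB) would differ.
GIGABYTE = 10 ** 9
TERABYTE = 10 ** 12
PETABYTE = 10 ** 15

# How many smaller units fit in one petabyte.
terabytes_per_petabyte = PETABYTE // TERABYTE
gigabytes_per_petabyte = PETABYTE // GIGABYTE
```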
Which of the following sources is likely to produce Big Data the fastest?
RFID tags
What is Big Data's relationship to the cloud?
Amazon and Google have working Hadoop cloud offerings
How does Hadoop work?
It breaks up Big Data into multiple parts so each part can be processed and analyzed at the same time on multiple computers
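The split-then-process-in-parallel idea can be sketched on a single machine. This is an illustration only, not Hadoop's API: threads stand in for separate computers, and the helper names (split_into_parts, count_words, parallel_word_count) are hypothetical.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def split_into_parts(records, num_parts):
    """Divide the records into num_parts roughly equal parts."""
    return [records[i::num_parts] for i in range(num_parts)]


def count_words(part):
    """Process one part on its own: count the words it contains."""
    counts = Counter()
    for line in part:
        counts.update(line.lower().split())
    return counts


def parallel_word_count(records, num_parts=3):
    """Process all parts at the same time, then merge the partial results."""
    parts = split_into_parts(records, num_parts)
    # Each worker thread plays the role of one computer in the cluster.
    with ThreadPoolExecutor(max_workers=num_parts) as pool:
        partial_counts = list(pool.map(count_words, parts))
    total = Counter()
    for partial in partial_counts:
        total.update(partial)
    return total


records = ["big data big insights", "data nodes store data", "big clusters"]
totals = parallel_word_count(records)
```

Here each part is counted independently and the partial counts are merged at the end, which mirrors the flashcard's description of parts being analyzed at the same time before the results are combined.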
All of the following statements about MapReduce are true EXCEPT
MapReduce runs without fault tolerance