Exam 3 study set
What is Big Data's relationship to the cloud? Hadoop cannot be deployed effectively in the cloud just yet. Amazon and Google have working Hadoop cloud offerings. IBM's homegrown Hadoop platform is the only option. Only MapReduce works in the cloud; Hadoop does not.
Amazon and Google have working Hadoop cloud offerings.
Big Data simplifies data governance issues, especially for global firms. True False
False
For cloud computing to be successful, users must have knowledge and experience in the control of the technology infrastructures. True False
False
In most cases, Hadoop is used to replace data warehouses. True False
False
In the Great Clips case study, the company uses geospatial data to analyze, among other things, the types of haircuts most popular in different geographic locations. True False
False
In the classification of location-based analytic applications, examining geographic site locations falls in the consumer-oriented category. True False
False
Siemens utilizes data sensors to track failure rates in household appliances. True False
False
Users definitely own their biometric data. True False
False
Web-based e-mail such as Google's Gmail are not examples of cloud computing. True False
False
In this model, infrastructure resources like networks, storage, servers, and other computing resources are provided to client companies. SaaS PaaS IaaS DaaS
IaaS
Which of the following allows companies to deploy their software and applications in the cloud so that their customers can use them? SaaS IaaS PaaS AaaS
PaaS
This model allows consumers to use applications and software that run on distant computers in the cloud infrastructure. SaaS PaaS IaaS DaaS
SaaS
Current total storage capacity lags behind the digital information being generated in the world. True False
True
Data as a service began with the notion that data quality could happen in a centralized place, cleansing and enriching data and offering it to different systems, applications, or users, irrespective of where they were in the organization, computers, or on the network. True False
True
For low latency, interactive reports, a data warehouse is preferable to Hadoop. True False
True
In Application Case 7.6, Analyzing Disease Patterns from an Electronic Medical Records Data Warehouse, it was found that urban individuals have a higher number of diagnosed disease conditions. True False
True
In the opening vignette, the Access Telecom (AT), built a system to better visualize customers who were unhappy before they canceled their service. True False
True
Internet of Things (IoT) is the phenomenon of connecting the physical world to the Internet. True False
True
MapReduce can be easily understood by skilled programmers due to its procedural nature. True False
True
One reason the IoT is growing exponentially is because hardware is smaller and more affordable. True False
True
RFID can be used in supply chains to manage product quality. True False
True
Satellite data can be used to evaluate the activity at retail locations as a source of alternative data. True False
True
The quality and objectivity of information disseminated by influential users of Twitter is higher than that disseminated by noninfluential users. True False
True
There is a clear difference between the type of information support provided by influential users versus the others on Twitter. True False
True
GPS Navigation is an example of which kind of location-based analytics? organization-oriented geospatial static approach organization-oriented location-based dynamic approach consumer-oriented geospatial static approach consumer-oriented location-based dynamic approach
consumer-oriented geospatial static approach
In the Analyzing Disease Patterns from an Electronic Medical Records Data Warehouse case study, what was the analytic goal? determine if diseases are accurately diagnosed determine probabilities of diseases that are comorbid determine differences in rates of disease in urban and rural populations determine differences in rates of disease in males v. females
determine differences in rates of disease in urban and rural populations
Which of these is NOT a part of the IoT technology infrastructure? hardware connectivity electrical access software
electrical access
Allowing Big Data to be processed in memory and distributed across a dedicated set of nodes can solve complex problems in near-real time with highly accurate insights. What is this process called? in-memory analytics in-database analytics grid computing appliances
in-memory analytics
In the Twitter case study, how did influential users support their tweets? opinion objective data multiple posts references to other users
objective data
The portion of the IoT technology infrastructure that focuses on how to manage incoming data and analyze it is hardware. connectivity. software backend. applications.
software backend.
Companies with the largest revenues from Big Data tend to be the largest computer and IT services firms. small computer and IT services firms. pure open source Big Data firms. non-U.S. Big Data firms.
the largest computer and IT services firms.
Traditional data warehouses have not been able to keep up with the evolution of the SQL language. the variety and complexity of data. expert systems that run on them. OLAP.
the variety and complexity of data.
What is the Hadoop Distributed File System (HDFS) designed to handle? unstructured and semistructured relational data unstructured and semistructured non-relational data structured and semistructured relational data structured and semistructured non-relational data
unstructured and semistructured non-relational data