DSCI Exam 3
In a network analysis, what connects nodes? edges metrics paths visualizations
edges
Which of these is NOT a part of the IoT technology infrastructure? hardware connectivity electrical access software
electrical access
Why are companies like IBM shifting to provide more services and consulting? Customers see that significant value can be created with the application of analytics, and need help completing these tasks. They can no longer compete in the software market. New regulations forced them into this market. None of these.
Customers see that significant value can be created with the application of analytics, and need help completing these tasks.
This model began with the notion that data quality could happen in a centralized place, cleansing and enriching data and offering it to different systems, applications, or users, irrespective of where they were in the organization, computers, or on the network. SaaS PaaS IaaS DaaS
DaaS
For cloud computing to be successful, users must have knowledge and experience in the control of the technology infrastructures. (T/F)
False
Hadoop and MapReduce require each other to work. (T/F)
False
In the Great Clips case study, the company uses geospatial data to analyze, among other things, the types of haircuts most popular in different geographic locations. (T/F)
False
In the Salesforce case study, streaming data is used to identify services that customers use most. (T/F)
False
In the classification of location-based analytic applications, examining geographic site locations falls in the consumer-oriented category. (T/F)
False
SaaS combines aspects of cloud computing with Big Data analytics and empowers data scientists and analysts by allowing them to access centrally managed information data sets. (T/F)
False
Siemens utilizes data sensors to track failure rates in household appliances. (T/F)
False
Users definitely own their biometric data. (T/F)
False
Web-based e-mail such as Google's Gmail is not an example of cloud computing. (T/F)
False
The portion of the IoT technology infrastructure that focuses on the sensors themselves is hardware. connectivity. software backend. applications.
Hardware
In this model, infrastructure resources like networks, storage, servers, and other computing resources are provided to client companies. SaaS PaaS IaaS DaaS
IaaS
How does Hadoop work? It integrates Big Data into a whole so large data elements can be processed as a whole on one computer. It integrates Big Data into a whole so large data elements can be processed as a whole on multiple computers. It breaks up Big Data into multiple parts so each part can be processed and analyzed at the same time on one computer. It breaks up Big Data into multiple parts so each part can be processed and analyzed at the same time on multiple computers.
It breaks up Big Data into multiple parts so each part can be processed and analyzed at the same time on multiple computers.
All of the following statements about MapReduce are true EXCEPT MapReduce is a general-purpose execution engine. MapReduce handles the complexities of network communication. MapReduce handles parallel programming. MapReduce runs without fault tolerance.
MapReduce runs without fault tolerance.
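A minimal sketch of the split, map, and reduce idea behind the two Hadoop/MapReduce cards above, written in plain Python rather than against any Hadoop API. The function names (split_into_chunks, map_chunk, reduce_counts, word_count) and the sample lines are hypothetical, and worker threads on one machine stand in for the multiple computers of a real cluster. Fault tolerance, which real MapReduce provides, is not modeled here.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def split_into_chunks(records, n_chunks):
    """Break the input into roughly equal parts (the 'break up' step)."""
    size = max(1, -(-len(records) // n_chunks))  # ceiling division
    return [records[i:i + size] for i in range(0, len(records), size)]


def map_chunk(chunk):
    """Map step: count words in one chunk, independently of the others."""
    counts = Counter()
    for line in chunk:
        counts.update(line.lower().split())
    return counts


def reduce_counts(partials):
    """Reduce step: merge the per-chunk counts into a single result."""
    total = Counter()
    for partial in partials:
        total.update(partial)
    return total


def word_count(records, n_workers=3):
    """Split the records, map each chunk in parallel, then reduce."""
    chunks = split_into_chunks(records, n_workers)
    # Assumption: threads stand in for the separate nodes of a cluster.
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        partials = list(pool.map(map_chunk, chunks))
    return reduce_counts(partials)


lines = ["big data big", "data on many nodes", "many nodes many parts"]
result = word_count(lines)
print(result.most_common())
```

Each chunk is processed without knowledge of the others, which is what lets the work be spread across machines. Only the reduce step needs to see all the partial results.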
Which of the following allows companies to deploy their software and applications in the cloud so that their customers can use them? SaaS IaaS PaaS AaaS
PaaS
Which of the following sources is likely to produce Big Data the fastest? order entry clerks cashiers RFID tags online customers
RFID tags
What new geometric data type in Teradata's data warehouse captures geospatial features? NAVTEQ ST_GEOMETRY GIS SQL/MM
ST_GEOMETRY
This model allows consumers to use applications and software that run on distant computers in the cloud infrastructure. SaaS IaaS PaaS AaaS
SaaS
Which of the following is true about the furtherance of homeland security? There is a lessening of privacy issues. There is a greater need for oversight. The impetus was the need to harvest information related to financial fraud after 2001. Most people regard analytic tools as mostly ineffective in increasing security.
There is a greater need for oversight.
Current total storage capacity lags behind the digital information being generated in the world. (T/F)
True
Data as a service began with the notion that data quality could happen in a centralized place, cleansing and enriching data and offering it to different systems, applications, or users, irrespective of where they were in the organization, computers, or on the network. (T/F)
True
For low latency, interactive reports, a data warehouse is preferable to Hadoop. (T/F)
True
From massive amounts of high-dimensional location data, algorithms that reduce the dimensionality of the data can be used to uncover trends, meaning, and relationships to eventually produce human-understandable representations. (T/F)
True
Hadoop was designed to handle petabytes and exabytes of data distributed over multiple nodes in parallel. (T/F)
True
If you have many flexible programming languages running in parallel, Hadoop is preferable to a data warehouse. (T/F)
True
In Application Case 7.6, Analyzing Disease Patterns from an Electronic Medical Records Data Warehouse, it was found that urban individuals have a higher number of diagnosed disease conditions. (T/F)
True
It is important for Big Data and self-service business intelligence to go hand in hand to get maximum value from analytics. (T/F)
True
MapReduce can be easily understood by skilled programmers due to its procedural nature. (T/F)
True
One reason the IoT is growing exponentially is that hardware is smaller and more affordable. (T/F)
True
Satellite data can be used to evaluate the activity at retail locations as a source of alternative data. (T/F)
True
Service-oriented DSS solutions generally offer individual or bundled services to the user as a service. (T/F)
True
Social media mentions can be used to chart and predict flu outbreaks. (T/F)
True
Social networking Web sites like Facebook, Twitter, and LinkedIn are also examples of cloud computing. (T/F)
True
The quality and objectivity of information disseminated by influential users of Twitter is higher than that disseminated by noninfluential users. (T/F)
True
The term "Big Data" is relative as it depends on the size of the using organization. (T/F)
True
There is a clear difference between the type of information support provided by influential users versus the others on Twitter. (T/F)
True
Which Big Data approach promotes efficiency, lower cost, and better performance by processing jobs in a shared, centrally managed pool of IT resources? in-memory analytics in-database analytics grid computing appliances
grid computing
Allowing Big Data to be processed in memory and distributed across a dedicated set of nodes can solve complex problems in near-real time with highly accurate insights. What is this process called? in-memory analytics in-database analytics grid computing appliances
in-memory analytics
In the Alternative Data for Market Analysis or Forecasts case study, satellite data was NOT used for evaluating retail traffic. monitoring activity at factories. tracking agricultural estimates. monitoring individual customer patterns.
monitoring individual customer patterns.
In the Twitter case study, how did influential users support their tweets? opinion objective data multiple posts references to other users
objective data
Services that let consumers permanently enter a profile of information along with a password and use this information repeatedly to access services at multiple sites are called consumer access applications. information collection portals. single-sign-on facilities. consumer information sign on facilities.
single-sign-on facilities.
The portion of the IoT technology infrastructure that focuses on how to manage incoming data and analyze it is hardware. connectivity. software backend. applications.
software backend.
Companies with the largest revenues from Big Data tend to be the largest computer and IT services firms. small computer and IT services firms. pure open source Big Data firms. non-U.S. Big Data firms.
the largest computer and IT services firms.
Traditional data warehouses have not been able to keep up with the evolution of the SQL language. the variety and complexity of data. expert systems that run on them. OLAP.
the variety and complexity of data.
In most cases, Hadoop is used to replace data warehouses. (T/F)
False
In the Quiznos case, the company employed location-based behavioral targeting to narrow the characteristics of users who were most likely to eat at a quick-service restaurant. (T/F)
True
In the opening vignette, Access Telecom (AT) built a system to better visualize customers who were unhappy before they canceled their service. (T/F)
True
Internet of Things (IoT) is the phenomenon of connecting the physical world to the Internet. (T/F)
True
RFID can be used in supply chains to manage product quality. (T/F)
True
The term cloud computing originates from a reference to the Internet as a "cloud" and represents an evolution of all of the previously shared/centralized computing trends. (T/F)
True
Using data to understand customers/clients and business operations to sustain and foster growth and profitability is easier with the advent of BI and Big Data. essentially the same now as it has always been. an increasingly challenging task for today's enterprises. now completely automated with no human intervention required.
an increasingly challenging task for today's enterprises.
What is the Hadoop Distributed File System (HDFS) designed to handle? unstructured and semistructured relational data unstructured and semistructured non-relational data structured and semistructured relational data structured and semistructured non-relational data
unstructured and semistructured non-relational data
Data flows can be highly inconsistent, with periodic peaks, making data loads hard to manage. What is this feature of Big Data called? volatility periodicity inconsistency variability
variability
What is Big Data's relationship to the cloud? Hadoop cannot be deployed effectively in the cloud just yet. Amazon and Google have working Hadoop cloud offerings. IBM's homegrown Hadoop platform is the only option. Only MapReduce works in the cloud; Hadoop does not.
Amazon and Google have working Hadoop cloud offerings.
Big Data simplifies data governance issues, especially for global firms. (T/F)
False
Big Data uses commodity hardware, which is expensive, specialized hardware that is custom built for a client or application. (T/F)
False
Connectivity is not a part of the IoT infrastructure. (T/F)
False
GPS navigation is an example of which kind of location-based analytics? organization-oriented geospatial static approach organization-oriented location-based dynamic approach consumer-oriented geospatial static approach consumer-oriented location-based dynamic approach
consumer-oriented geospatial static approach
In the Analyzing Disease Patterns from an Electronic Medical Records Data Warehouse case study, what was the analytic goal? determine if diseases are accurately diagnosed determine probabilities of diseases that are comorbid determine differences in rates of disease in urban and rural populations determine differences in rates of disease in males vs. females
determine differences in rates of disease in urban and rural populations