Test 2 chap 7

Réussis tes devoirs et examens dès maintenant avec Quizwiz!

22) A newly popular unit of data in the Big Data era is the petabyte (PB), which is A) 109 bytes. B) 1012 bytes. C) 1015 bytes. D) 1018 bytes

C

23) Which of the following sources is likely to produce Big Data the fastest? A) order entry clerks B) cashiers C) RFID tags D) online customers

C

21) Using data to understand customers/clients and business operations to sustain and foster growth and profitability is A) easier with the advent of BI and Big Data. B) essentially the same now as it has always been. C) an increasingly challenging task for today's enterprises. D) now completely automated with no human intervention required.

C

26) Allowing Big Data to be processed in memory and distributed across a dedicated set of nodes can solve complex problems in near-real time with highly accurate insights. What is this process called? A) in-memory analytics B) in-database analytics C) grid computing D) appliances

A

33) In a network analysis, what connects nodes? A) edges B) metrics C) paths D) visualizations

A

25) In the Twitter case study, how did influential users support their tweets? A) opinion B) objective data C) multiple posts D) references to other users

B

29) What is the Hadoop Distributed File System (HDFS) designed to handle? A) unstructured and semistructured relational data B) unstructured and semistructured non-relational data C) structured and semistructured relational data D) structured and semistructured non-relational data

B

31) In a Hadoop "stack," what node periodically replicates and stores data from the Name Node should it fail? A) backup node B) secondary node C) substitute node D) slave node

B

14) In Application Case 7.6, Analyzing Disease Patterns from an Electronic Medical Records Data Warehouse, it was found that urban individuals have a higher number of diagnosed disease conditions.

TRUE

15) For low latency, interactive reports, a data warehouse is preferable to Hadoop.

TRUE

16) If you have many flexible programming languages running in parallel, Hadoop is preferable to a data warehouse.

TRUE

18) It is important for Big Data and self-service business intelligence to go hand in hand to get maximum value from analytics.

TRUE

2) The term "Big Data" is relative as it depends on the size of the using organization.

TRUE

20) Current total storage capacity lags behind the digital information being generated in the world.

TRUE

3) Satellite data can be used to evaluate the activity at retail locations as a source of alternative data.

TRUE

4) Big Data is being driven by the exponential growth, availability, and use of information.

TRUE

5) The quality and objectivity of information disseminated by influential users of Twitter is higher than that disseminated by noninfluential users.

TRUE

36) Under which of the following requirements would it be more appropriate to use Hadoop over a data warehouse? A) ANSI 2003 SQL compliance is required B) online archives alternative to tape C) unrestricted, ungoverned sandbox explorations D) analysis of provisional data

C

24) Data flows can be highly inconsistent, with periodic peaks, making data loads hard to manage. What is this feature of Big Data called? A) volatility B) periodicity C) inconsistency D) variability

D

11) Despite their potential, many current NoSQL tools lack mature management and monitoring tools.

TRUE

27) Which Big Data approach promotes efficiency, lower cost, and better performance by processing jobs in a shared, centrally managed pool of IT resources? A) in-memory analytics B) in-database analytics C) grid computing D) appliances

C

30) In a Hadoop "stack," what is a slave node? A) a node where bits of programs are stored B) a node where metadata is stored and used to organize data processing C) a node where data is stored and processed D) a node responsible for holding all the source programs

C

34) In the Analyzing Disease Patterns from an Electronic Medical Records Data Warehouse case study, what was the analytic goal? A) determine if diseases are accurately diagnosed B) determine probabilities of diseases that are comorbid C) determine differences in rates of disease in urban and rural populations D) determine differences in rates of disease in males v. females

C

38) Companies with the largest revenues from Big Data tend to be A) the largest computer and IT services firms. B) small computer and IT services firms. C) pure open source Big Data firms. D) non-U.S. Big Data firms.

A

35) Traditional data warehouses have not been able to keep up with A) the evolution of the SQL language. B) the variety and complexity of data. C) expert systems that run on them. D) OLAP.

B

37) What is Big Data's relationship to the cloud? A) Hadoop cannot be deployed effectively in the cloud just yet. B) Amazon and Google have working Hadoop cloud offerings. C) IBM's homegrown Hadoop platform is the only option. D) Only MapReduce works in the cloud; Hadoop does not.

B

28) How does Hadoop work? A) It integrates Big Data into a whole so large data elements can be processed as a whole on one computer. B) It integrates Big Data into a whole so large data elements can be processed as a whole on multiple computers. C) It breaks up Big Data into multiple parts so each part can be processed and analyzed at the same time on one computer. D) It breaks up Big Data into multiple parts so each part can be processed and analyzed at the same time on multiple computers.

D

32) All of the following statements about MapReduce are true EXCEPT A) MapReduce is a general-purpose execution engine. B) MapReduce handles the complexities of network communication. C) MapReduce handles parallel programming. D) MapReduce runs without fault tolerance.

D

39) In the financial services industry, Big Data can be used to improve A) regulatory oversight. B) decision making. C) customer service. D) both A & B.

D

40) In the Alternative Data for Market Analysis or Forecasts case study, satellite data was NOT used for A) evaluating retail traffic. B) monitoring activity at factories. C) tracking agricultural estimates. D) monitoring individual customer patterns.

D

10) In most cases, Hadoop is used to replace data warehouses.

FALSE

17) In the Salesforce case study, streaming data is used to identify services that customers use most.

FALSE

19) Big Data simplifies data governance issues, especially for global firms.

FALSE

6) Big Data uses commodity hardware, which is expensive, specialized hardware that is custom built for a client or application.

FALSE

9) Hadoop and MapReduce require each other to work.

FALSE

1) In the opening vignette, the Access Telecom (AT), built a system to better visualize customers who were unhappy before they canceled their service.

TRUE

12) There is a clear difference between the type of information support provided by influential users versus the others on Twitter.

TRUE

13) Social media mentions can be used to chart and predict flu outbreaks.

TRUE

7) MapReduce can be easily understood by skilled programmers due to its procedural nature.

TRUE

8) Hadoop was designed to handle petabytes and exabytes of data distributed over multiple nodes in parallel.

TRUE


Ensembles d'études connexes

ch 11 administration of medication and intravenous therapy

View Set

Adult Development and Aging (Chapter 5)

View Set

Resources - satisfying our wants and needs.

View Set

Problem set 7, 8, 9, 10 (test 2)

View Set

Multiple Choice Questions for Investments quiz, Investments quiz 2 part #2, Investments Quiz #2 Part 1

View Set

BAS 282: Strategic Planning: SmartBook

View Set