13. Business intelligence (BI) can be characterized as a transformation of A) data to information to decisions to actions. B) Big Data to data to information to decisions. C) data to processing to information to actions. D) actions to decisions to feedback to information.
A
20. Big Data often involves a form of distributed storage and processing using Hadoop and MapReduce. One reason for this is A) the processing power needed for the centralized model would overload a single computer. B) Big Data systems have to match the geographical spread of social media. C) centralized storage creates too many vulnerabilities. D) the "Big" in Big Data necessitates over 10,000 processing nodes
A
21. In the opening vignette, the architectural system that supported Watson used all the following elements EXCEPT A) a core engine that could operate seamlessly in another domain without changes. B) massive parallelism to enable simultaneous consideration of multiple hypotheses. C) integration of shallow and deep knowledge. D) an underlying confidence subsystem that ranks and integrates answers.
A
22. All of the following are true about external reports between businesses and the government EXCEPT A) their primary focus is government. B) they can be filed nationally or internationally. C) they are standardized for the most part to reduce the regulatory burden. D) they can include tax and compliance reporting.
A
22. Operational or transaction databases are product oriented, handling transactions that update the database. In contrast, data warehouses are A) subject-oriented and nonvolatile. B) subject-oriented and volatile. C) product-oriented and nonvolatile. D) product-oriented and volatile.
A
23. Which of these applications will derive the LEAST benefit from text mining? A) sales transaction files B) patent description files C) customer comment files D) patients' medical files
A
25. A Web client that connects to a Web server, which is in turn connected to a BI application server, is reflective of a A) three tier architecture. B) one tier architecture. C) two tier architecture. D) four tier architecture.
A
26. All of the following are challenges associated with natural language processing EXCEPT A) dividing up a text into individual words in English. B) understanding the context in which something is said. C) recognizing typographical or grammatical errors in texts. D) distinguishing between words that have more than one meaning
A
26. The data field "salary" can be best described as A) ratio data. B) nominal data. C) ordinal data. D) interval data.
A
27. What application is MOST dependent on text analysis of transcribed sales call center notes and voice conversations with customers? A) CRM B) OLAP C) finance D) ER
A
29. What data discovery process, whereby objects are categorized into predetermined groups, is used in text mining? A) classification B) trend analysis C) association D) clustering
A
29. Which type of visualization tool can be very helpful when a data set contains location data? A) geographic map B) tree map C) bar chart D) highlight table
A
30. In the research literature case study, the researchers analyzing academic papers extracted information from which source? A) the paper abstract B) the paper references C) the main body of the paper D) the paper keyword
A
32. Which data mining process/methodology is thought to be the most comprehensive, according to kdnuggets.com rankings? A) CRISP-DM B) SEMMA C) KDD Process D) proprietary organizational methodologies
A
33. All of the following are benefits of hosted data warehouses EXCEPT A) greater control of data. B) frees up in-house systems. C) better quality hardware. D) smaller upfront investment.
A
33. What is the management feature of a dashboard? A) operational data that identify what actions to take to resolve a problem B) summarized dimensional data to analyze the root cause of problems C) graphical, abstracted data to monitor key performance metrics D) summarized dimensional data to monitor key performance metrics
A
35. Contextual metadata for a dashboard includes all the following EXCEPT A) which operating system is running the dashboard server software. B) whether any high-value transactions that would skew the overall trends were rejected as a part of the loading process. C) whether the dashboard is presenting "fresh" or "stale" information. D) when the data warehouse was last refreshed.
A
35. What does the scalability of a data mining method refer to? A) its ability to construct a prediction model efficiently given a large amount of data B) its ability to overcome noisy data to make somewhat accurate predictions C) its ability to predict the outcome of a previously unknown data set accurately D) its speed of computation and computational costs in using the model
A
37. In data mining, finding an affinity of two products to be commonly together in a shopping cart is known as A) association rule mining. B) decision trees. C) cluster analysis. D) artificial neural networks.
A
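As a rough illustration of the shopping-cart affinity idea in the question above, the sketch below counts how often item pairs co-occur (support) and how often one item appears given another (confidence). The baskets and the function names `pair_support` and `confidence` are invented for this example; real association rule mining typically uses an algorithm such as Apriori rather than brute-force counting.

```python
from collections import Counter
from itertools import combinations

# Hypothetical shopping baskets, invented for illustration only.
baskets = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk", "eggs"},
]


def pair_support(baskets):
    """Return the fraction of baskets that contain each unordered item pair."""
    counts = Counter()
    for basket in baskets:
        # Sort so that each pair has one canonical ordering.
        for pair in combinations(sorted(basket), 2):
            counts[pair] += 1
    return {pair: n / len(baskets) for pair, n in counts.items()}


def confidence(baskets, antecedent, consequent):
    """Estimate how often `consequent` appears in baskets containing `antecedent`."""
    with_antecedent = [b for b in baskets if antecedent in b]
    if not with_antecedent:
        return 0.0
    return sum(consequent in b for b in with_antecedent) / len(with_antecedent)


support = pair_support(baskets)
bread_to_milk = confidence(baskets, "bread", "milk")
```

Here `support[("bread", "milk")]` is the share of baskets holding both items, and `bread_to_milk` is the share of bread baskets that also hold milk.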
37. In text analysis, what is a lexicon? A) a catalog of words, their synonyms, and their meanings B) a catalog of customers, their words, and phrases C) a catalog of customers, products, words, and phrases D) a catalog of letters, words, phrases and sentences
A
37. Why is a performance management system superior to a performance measurement system? A) because measurement alone has little use without action B) because performance management systems cost more C) because performance measurement systems are only in their infancy D) because measurement automatically leads to problem solution
A
38. Which of the following statements is more descriptive of active data warehouses in contrast with traditional data warehouses? A) large numbers of users, including operational staffs B) restrictive reporting with daily and weekly data currency C) detailed data available for strategic use only D) strategic decisions whose impacts are hard to measure
A
39. How does the use of cloud computing affect the scalability of a data warehouse? A) Hardware resources are dynamically allocated as use increases. B) Cloud vendors are mostly based overseas where the cost of labor is low. C) Cloud computing has little effect on a data warehouse's scalability. D) Cloud computing vendors bring as much hardware as needed to users' offices.
A
39. In the Target case study, why did Target send maternity ads to a teen? A) Target's analytic model suggested she was pregnant based on her buying habits. B) Target's analytic model confused her with an older woman with a similar name. C) Target was using a special promotion that targeted all teens in her geographical area. D) Target was sending ads to all women in a particular neighborhood.
A
39. Inputs to speech analytics include all of the following EXCEPT A) written transcripts of calls to service centers. B) recorded conversations of customer call-ins. C) videos of customer focus groups. D) live customer interactions with service representatives.
A
40. In the Blue Cross Blue Shield case study, speech analytics were used to identify "confusion" calls by customers. What was true about these calls? A) They were not documented by customer service reps for speech analytics. B) They took less time than others as frustrated customers hung up. C) They led customers to rely more on self-serve options. D) They were difficult to identify using standard phrases like "I don't get it."
A
40. Which of the following is a data mining myth? A) Data mining requires a separate, dedicated database. B) Newer Web-based tools enable managers of all educational levels to do data mining. C) The current state-of-the-art is ready to go for almost any business. D) Data mining is a multistep process that requires deliberate, proactive design and use.
A
9. Which of the following statements about cognitive limits of organizational decision makers is true? A) Cognitive limits affect both the recall and use of data by decision makers. B) Only top managers make decisions where cognitive limits are strained. C) The most talented and effective managers do not have cognitive limitations. D) All organizational decision-making requires data beyond human cognitive limits.
A
In the Magpie Sensing case study, the automated collection of temperature and humidity data on shipped goods helped with various types of analytics. Which of the following is an example of predictive analytics? A) warning of an open shipment seal B) optimal temperature setting C) real time reports of the shipment's temperature D) location of the shipment
A
Prediction problems where the variables have numeric values are most accurately defined as A) associations. B) regressions. C) computations. D) classifications.
B
17. How are descriptive analytics methods different from the other two types? A) They answer "what to do?" queries, not "what-if?" queries. B) They answer "what-is?" queries, not "what will be?" queries. C) They answer "what-if?" queries, not "how many?" queries. D) They answer "what will be?" queries, not "what to do?" queries.
B
18. Prescriptive BI capabilities are viewed as more powerful than predictive ones for all the following reasons EXCEPT A) prescriptive models generally build on (with some overlap) predictive ones. B) only prescriptive BI capabilities have monetary value to top-level managers. C) understanding the likelihood of certain events often leaves unclear remedies. D) prescriptive BI gives actual guidance as to actions.
B
21. In the Cabela's case study, what types of models helped the company understand the value of customers, using a five-point scale? A) simulation and geographical models B) clustering and association models C) simulation and regression models D) reporting and association models
B
22. According to a study by Merrill Lynch and Gartner, what percentage of all corporate data is captured and stored in some sort of unstructured form? A) 15% B) 85% C) 25% D) 75%
B
23. All of the following statements about data mining are true EXCEPT A) the valid aspect means that the discovered patterns should hold true on new data. B) the process aspect means that data mining should be a one-step process to results. C) the novel aspect means that previously unknown patterns are discovered. D) the potentially useful aspect means that results should lead to some business benefit.
B
23. Kaplan and Norton developed a report that presents an integrated view of success in the organization called A) dashboard-type reports. B) balanced scorecard-type reports. C) metric management reports. D) visual reports
B
24. All of the following statements about metadata are true EXCEPT A) metadata gives context to reported data. B) for most organizations, data warehouse metadata are an unnecessary expense. C) metadata helps to describe the meaning and structure of data. D) there may be ethical issues involved in the creation of metadata.
B
24. In text mining, stemming is the process of A) creating new branches or stems of recorded paragraphs. B) reducing multiple words to their base or root. C) categorizing a block of text in a sentence. D) transforming the term-by-document matrix to a manageable size.
B
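To make the stemming answer above concrete, here is a deliberately crude suffix-stripping sketch. The suffix list and the name `crude_stem` are assumptions made for this example; production stemmers (for instance the Porter stemmer) apply far more careful rules.

```python
# Illustrative suffix list only; not the rule set of any real stemmer.
SUFFIXES = ("ing", "ed", "es", "s")


def crude_stem(word):
    """Strip one common English suffix so related words share a base form."""
    word = word.lower()
    for suffix in SUFFIXES:
        # Keep at least three characters so short words are left alone.
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


stems = {crude_stem(w) for w in ["walking", "walked", "walks", "walk"]}
```

All four input words reduce to the same base, which is the effect the question describes as reducing multiple words to their root.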
26. The Internet emerged as a new medium for visualization and brought all the following EXCEPT A) immersive environments for consuming data. B) new forms of computation of business logic. C) worldwide digital distribution of visualization. D) new graphics displays through PC displays.
B
27. Which broad area of data mining applications analyzes data, forming rules to distinguish between defined classes? A) associations B) classification C) visualization D) clustering
B
27. Which kind of chart is described as an enhanced variant of a scatter plot? A) heat map B) bubble chart C) pie chart D) bullet
B
28. Which broad area of data mining applications partitions a collection of objects into natural groupings with similar features? A) visualization B) clustering C) classification D) associations
B
28. Which data warehouse architecture uses a normalized relational warehouse that feeds multiple data marts? A) federated architecture B) hub-and-spoke data warehouse architecture C) independent data marts architecture D) centralized data warehouse architecture
B
29. The data mining algorithm type used for classification somewhat resembling the biological neural networks in the human brain is A) decision trees. B) artificial neural networks C) association rule mining. D) cluster analysis.
B
3. Which of the following is NOT an example that falls within the four major categories of business environment factors for today's organizations? A) globalization B) fewer government regulations C) increased pool of customers D) increased competition
B
30. Identifying and preventing incorrect claim payments and fraudulent activities falls under which type of data mining applications? A) retailing and logistics B) insurance C) computer hardware and software D) customer relationship management
B
31. When you tell a story in a presentation, all of the following are true EXCEPT A) stories and their lessons should be easy to remember. B) a well-told story should have no need for subsequent discussion. C) a story should make sense and order out of a lot of background noise. D) the outcome and reasons for it should be clear at the end of your story.
B
34. What does the robustness of a data mining method refer to? A) its ability to construct a prediction model efficiently given a large amount of data B) its ability to overcome noisy data to make somewhat accurate predictions C) its speed of computation and computational costs in using the model D) its ability to predict the outcome of a previously unknown data set accurately
B
37. Active data warehousing can be used to support the highest level of decision-making sophistication and power. The major feature that enables this in relation to handling the data is A) nature of the data. B) speed of data transfer. C) country of (data) origin. D) source of the data.
B
39. All of the following statements about balanced scorecards and dashboards are true EXCEPT A) scorecards are less preferred at operational and tactical levels. B) scorecards are best for real-time tracking of a marketing campaign. C) dashboards would be the preferred choice to monitor production quality. D) scorecards are preferred for tracking the achievement of strategic goals.
B
40. All of the following are true about in-database processing technology EXCEPT A) it pushes the algorithms to where the data is. B) it is the same as in-memory storage technology. C) it is often used for apps like credit card fraud detection and investment risk management. D) it makes the response to queries much faster than conventional databases.
B
40. What is Six Sigma? A) a methodology aimed at measuring the amount of variability in a business process B) a methodology aimed at reducing the number of defects in a business process C) a letter in the Greek alphabet that statisticians use to measure process variability D) a methodology aimed at reducing the amount of variability in a business process
B
5. Which of the following activities permeates nearly all managerial activity? A) planning B) decision-making C) directing D) controlling
B
11. For the majority of organizations, a daily accounts receivable transaction is a(n) A) strategic decision. B) managerial control decision. C) structured decision. D) unstructured decision.
C
15. In answering the question "Which customers are likely to be using fake credit cards?", you are most likely to use which of the following analytic applications? A) customer segmentation B) channel optimization C) fraud detection D) customer profitability
C
21. The "single version of the truth" embodied in a data warehouse such as Capri Casinos' means all of the following EXCEPT A) decision makers get to see the same results to queries. B) decision makers have the same data available to support their decisions. C) decision makers have unfettered access to all data in the warehouse. D) decision makers get to use more dependable data for their decisions.
C
25. In text mining, tokenizing is the process of A) transforming the term-by-document matrix to a manageable size. B) reducing multiple words to their base or root. C) categorizing a block of text in a sentence. D) creating new branches or stems of recorded paragraphs.
C
25. Which of the following is LEAST related to data/information visualization? A) statistical graphics B) information graphics C) graphic artwork D) scientific visualization
C
28. Which type of visualization tool can be very helpful when the intention is to show relative proportions of dollars per department allocated by a university administration? A) heat map B) bubble chart C) pie chart D) bullet
C
30. In which stage of extraction, transformation, and load (ETL) into a data warehouse are data aggregated? A) extraction B) load C) transformation D) cleanse
C
31. All of the following statements about data mining are true EXCEPT A) understanding the data, e.g., the relevant variables, is critical to success. B) understanding the business goal is critical. C) building the model takes the most time and effort. D) data is typically preprocessed and/or cleaned before use
C
31. In which stage of extraction, transformation, and load (ETL) into a data warehouse are anomalies detected and corrected? A) load B) transformation C) cleanse D) extraction
C
32. Benefits of the latest visual analytics tools, such as SAS Visual Analytics, include all of the following EXCEPT A) there is less demand on IT departments for reports. B) mobile platforms such as the iPhone are supported by these products. C) they explore massive amounts of data in hours, not days. D) it is easier to spot useful patterns and trends in the data.
C
32. In sentiment analysis, which of the following is an implicit opinion? A) The cruise we went on last summer was a disaster. B) Our new mayor is great for the city. C) The customer service I got for my TV was laughable. D) The hotel we stayed in was terrible.
C
34. What do voice of the market (VOM) applications of sentiment analysis do? A) They examine the "market of ideas" in politics. B) They examine employee sentiment in the organization. C) They examine customer sentiment at the aggregate level. D) They examine the stock market for trends
C
34. What is the fundamental challenge of dashboard design? A) ensuring that the organization has access to the latest web browsers B) ensuring that the organization has the appropriate hardware onsite to support it C) ensuring that the required information is shown clearly on a single screen D) ensuring that users across the organization have access to it
C
34. When representing data in a data warehouse, using several dimension tables that are each connected only to a fact table means you are using which warehouse structure? A) relational schema B) dimensional schema C) star schema D) snowflake schema
C
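A star schema like the one in the question above can be sketched with Python's built-in `sqlite3` module: one fact table whose rows point at several dimension tables, each joined only to the fact table. The table names, columns, and rows below are hypothetical and exist only to show the shape.

```python
import sqlite3

# In-memory database; every table and value here is invented for illustration.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE dim_product (product_id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE dim_store (store_id INTEGER PRIMARY KEY, city TEXT);
    -- The fact table references each dimension; dimensions do not reference each other.
    CREATE TABLE fact_sales (
        product_id INTEGER REFERENCES dim_product(product_id),
        store_id INTEGER REFERENCES dim_store(store_id),
        amount REAL
    );
    INSERT INTO dim_product VALUES (1, 'Tent'), (2, 'Boots');
    INSERT INTO dim_store VALUES (1, 'Denver'), (2, 'Austin');
    INSERT INTO fact_sales VALUES (1, 1, 200.0), (2, 1, 90.0), (1, 2, 150.0);
    """
)

# Total sales per product: join the fact table to one dimension and aggregate.
sales_by_product = conn.execute(
    """
    SELECT p.name, SUM(f.amount)
    FROM fact_sales AS f
    JOIN dim_product AS p ON p.product_id = f.product_id
    GROUP BY p.name
    ORDER BY p.name
    """
).fetchall()
```

A snowflake schema would differ by normalizing the dimensions further, so that a dimension table links to additional tables rather than only to the fact table.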
35. How is objectivity handled in sentiment analysis? A) It is clarified with the customer who expressed it. B) It is incorporated as a type of sentiment. C) It is identified and removed as facts are not sentiment. D) It is ignored because it does not appear in customer sentiment
C
35. When querying a dimensional database, a user went from summarized data to its underlying details. The function that served this purpose is A) slice. B) roll-up. C) drill down. D) dice.
C
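The drill-down operation in the question above, and its opposite, roll-up, can be sketched on a tiny in-memory data set. The sales rows and the helper names `roll_up` and `drill_down` are made up for this example; an actual OLAP tool would run these operations against a cube or dimensional database.

```python
from collections import defaultdict

# Hypothetical (region, city, amount) rows, invented for illustration.
sales = [
    ("East", "Boston", 120),
    ("East", "New York", 300),
    ("West", "Seattle", 200),
    ("West", "Portland", 80),
]


def roll_up(rows):
    """Summarize detail rows to one total per region."""
    totals = defaultdict(int)
    for region, _city, amount in rows:
        totals[region] += amount
    return dict(totals)


def drill_down(rows, region):
    """Go from a region's summary back to its city-level details."""
    return {city: amount for r, city, amount in rows if r == region}


summary = roll_up(sales)
east_detail = drill_down(sales, "East")
```

Starting from `summary`, a user who asks what makes up the East total gets `east_detail`, which is the summarized-to-detailed move the question calls drill down.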
36. Identifying the target of an expressed sentiment is difficult for all the following reasons EXCEPT A) the review may not be directly connected to the target through the topic name. B) sometimes there are multiple targets expressed in a sentiment. C) strong sentiments may be generated by a computer, not a person. D) blogs and articles with the sentiment may be general in nature.
C
36. In estimating the accuracy of data mining (or other) classification models, the true positive rate is A) the ratio of correctly classified positives divided by the sum of correctly classified positives and incorrectly classified negatives. B) the ratio of correctly classified positives divided by the sum of correctly classified positives and incorrectly classified positives. C) the ratio of correctly classified positives divided by the total positive count. D) the ratio of correctly classified negatives divided by the total negative count.
C
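The true positive rate in the question above can be checked with a few lines of code. This is a minimal sketch: the function name `true_positive_rate` and the label lists are hypothetical, and the total positive count is taken to be every actual positive, whether the model caught it or missed it.

```python
def true_positive_rate(actual, predicted, positive=1):
    """Return correctly classified positives divided by the total positive count."""
    true_positives = sum(
        a == positive and p == positive for a, p in zip(actual, predicted)
    )
    total_positives = sum(a == positive for a in actual)
    if total_positives == 0:
        raise ValueError("no positive cases in the actual labels")
    return true_positives / total_positives


# Hypothetical labels: four actual positives, three of which were predicted correctly.
actual = [1, 1, 1, 0, 0, 1]
predicted = [1, 0, 1, 0, 1, 1]
tpr = true_positive_rate(actual, predicted)
```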
38. Third party providers of publicly available datasets protect the anonymity of the individuals in the data set primarily by A) letting individuals in the data know their data is being accessed. B) asking data users to use the data ethically. C) removing identifiers such as names and social security numbers. D) leaving in identifiers (e.g., name), but changing other variables.
C
38. Why is the customer perspective important in the balanced scorecard methodology? A) because customers should always be included in any design methodology B) because companies need customer input into the design of the balanced scorecard C) because dissatisfied customers will eventually hurt the bottom line D) because customers understand best how the firm's internal processes should work
C
4. Organizations counter the pressures they experience in their business environments in multiple ways. Which of the following is NOT an effective way to counter these pressures? A) reactive actions B) anticipative actions C) retroactive actions D) adaptive actions
C
7. Business environments and government requirements are becoming more complex. All of the following actions to manage this complexity would be appropriate EXCEPT A) deploying more sophisticated tools and techniques. B) hiring more sophisticated and computer-savvy managers. C) seeking new ways to avoid government compliance. D) avoiding expensive trial and error to find out what works.
C
8. The deployment of large data warehouses with terabytes or even petabytes of data has been crucial to the growth of decision support. All the following explain why EXCEPT A) data warehouses have enabled the affordable collection of data for analytics. B) data warehouses have assisted the collection of data for data mining. C) data warehouses have enabled the collection of decision makers in one place. D) data warehouses have assisted the collection of data from multiple sources.
C
Understanding customers better has helped Amazon and others become more successful. The understanding comes primarily from A) collecting data about customers and transactions. B) asking the customers what they want. C) analyzing the vast data amounts routinely collected. D) developing a philosophy that is data analytics-centric
C
12. All of the following may be viewed as decision support systems EXCEPT A) an expert system to diagnose a medical condition. B) a system that helps to manage the organization's supply chain management. C) a knowledge management system to guide decision makers. D) a retail sales system that processes customer sales transactions.
D
14. In answering the question "Which customers are most likely to click on my online ads and purchase my goods?", you are most likely to use which of the following analytic applications? A) customer attrition B) channel optimization C) customer profitability D) propensity to buy
D
16. When Sabre developed their Enterprise Data Warehouse, they chose to use near-real-time updating of their database. The main reason they did so was A) to be able to assess internal operations. B) to aggregate performance metrics in an understandable way. C) to provide a 360-degree view of the organization. D) to provide up-to-date executive insights.
D
19. Which of the following statements about Big Data is true? A) MapReduce is a storage filing system. B) Data chunks are stored in different locations on one computer. C) Hadoop is a type of processor used to process Big Data applications. D) Pure Big Data systems do not involve fault tolerance.
D
21. For those executives who do not have the time to go through lengthy reports, the best alternative is the A) last page of the report. B) raw data that informed the report. C) charts in the report. D) executive summary.
D
23. Which kind of data warehouse is created separately from the enterprise data warehouse by a department and not reliant on it for updates? A) sectional data mart B) volatile data mart C) public data mart D) independent data mart
D
24. What is the main reason parallel processing is sometimes used for data mining? A) because the hardware exists in most organizations and it is available to use B) because any strategic application requires parallel processing C) because most of the algorithms used for data mining require it D) because of the massive data amounts and search efforts involved
D
24. Which component of a reporting system contains steps detailing how recorded transactions are converted into metrics, scorecards, and dashboards? A) assurance B) extract, transform and load C) data supply D) business logic
D
25. The data field "ethnic group" can be best described as A) ordinal data. B) ratio data. C) interval data. D) nominal data.
D
26. Which of the following BEST enables a data warehouse to handle complex queries and scale up to handle many more requests? A) Microsoft Windows B) a larger IT staff C) use of the web by users as a front-end D) parallel processing
D
27. Which data warehouse architecture uses metadata from existing data warehouses to create a hybrid logical data warehouse comprised of data from the other warehouses? A) centralized data warehouse architecture B) hub-and-spoke data warehouse architecture C) independent data marts architecture D) federated architecture
D
28. In text mining, which of the following methods is NOT used to reduce the size of a sparse matrix? A) eliminating rarely occurring terms B) using singular value decomposition C) using a domain expert D) normalizing word frequencies
D
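One of the reduction methods listed in the question above, eliminating rarely occurring terms, can be sketched as follows. The documents, the `min_docs` threshold, and the function names are assumptions for illustration; whitespace splitting stands in for real text preprocessing.

```python
from collections import Counter

# Hypothetical mini-corpus, invented for illustration.
docs = ["the claim was denied", "the claim was paid", "refund issued quickly"]


def term_document_counts(docs):
    """Build one term-frequency Counter per document (a sparse term-by-document view)."""
    return [Counter(doc.split()) for doc in docs]


def drop_rare_terms(doc_counts, min_docs=2):
    """Keep only terms that occur in at least `min_docs` documents."""
    doc_freq = Counter()
    for counts in doc_counts:
        doc_freq.update(counts.keys())  # count each term once per document
    keep = {term for term, n in doc_freq.items() if n >= min_docs}
    reduced = [{t: c for t, c in counts.items() if t in keep} for counts in doc_counts]
    return reduced, sorted(keep)


reduced, vocabulary = drop_rare_terms(term_document_counts(docs))
```

The original vocabulary has eight distinct terms; after filtering, only the terms shared by at least two documents remain, so the matrix has fewer columns.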
29. Which approach to data warehouse integration focuses more on sharing process functionality than data across systems? A) enterprise function integration B) enterprise information integration C) extraction, transformation, and load D) enterprise application integration
D
30. Which type of question does visual analytics seek to answer? A) What is happening today? B) What happened yesterday? C) When did it happen? D) Why did it happen?
D
31. Sentiment classification usually covers all the following issues EXCEPT A) range of polarity (e.g., star ratings for hotels and for restaurants). B) classes of sentiment (e.g., positive versus negative). C) range in strength of opinion. D) biometric identification of the consumer expressing the sentiment.
D
32. Data warehouses provide direct and indirect benefits to using organizations. Which of the following is an indirect benefit of data warehouses? A) extensive new analyses performed by users B) simplified access to data C) better and more timely information D) improved customer service
D
33. In the Whirlpool case study, the company sought to better understand information coming from which source? A) delivery information B) customer transaction data C) goods moving through the internal supply chain D) customer e-mails
D
36. Dashboards can be presented at all the following levels EXCEPT A) the static report level. B) the visual dashboard level. C) the self-service cube level. D) the visual cube level.
D
36. Which of the following online analytical processing (OLAP) technologies does NOT require the precomputation and storage of information? A) MOLAP B) SQL C) HOLAP D) ROLAP
D
38. What types of documents are BEST suited to semantic labeling and aggregation to determine sentiment orientation? A) collections of documents B) medium- to large-sized documents C) large-sized documents D) small- to medium-sized documents
D
6. Why are analytical decision making skills now viewed as more important than interpersonal skills for an organization's managers? A) because personable and friendly managers are always the least effective B) because interpersonal skills are never important in organizations C) because analytical-oriented managers tend to be flashier and less methodical D) because analytical-oriented managers produce better results over time
D
In the Magpie Sensing case study, the automated collection of temperature and humidity data on shipped goods helped with various types of analytics. Which of the following is an example of prescriptive analytics? A) location of the shipment B) real time reports of the shipment's temperature C) warning of an open shipment seal D) optimal temperature setting
D
10. For the majority of organizations, evaluating the credit rating of a potential business partner is a(n) A) structured decision. B) unstructured decision. C) managerial control decision. D) strategic decision.
C