MIS4300 Final Exam
According to a study by Merrill Lynch and Gartner, what percentage of all corporate data is captured and stored in some sort of unstructured form? A) 15% B) 75% C) 25% D) 85%
85%
Why do analytics applications have the effect of redistributing power among managers? A) The more information and analysis managers have, the more power they possess. B) Sponsoring an analytics system automatically confers power to a manager. C) New analytics applications change managers' job expectations. D) New analytics systems lead to new budget allocations, resulting in increased power
The more information and analysis managers have, the more power they possess
Analytics integration with other organizational systems makes it harder to identify its impact on the organization True False
True
Articles and auxiliary verbs are assigned little value in text mining and are usually filtered out True False
True
Categorization and clustering of documents during text mining differ only in the preselection of categories. True False
True
Chinese, Japanese, and Thai have features that make them more difficult candidates for natural language processing. True False
True
Cloud computing originates from a reference to the Internet as a "cloud" and is a combination of several information technology components as services True False
True
Content-based filtering approaches are widely used in recommending textual content such as news items and related Web pages True False
True
Current total storage capacity lags behind the digital information being generated in the world True False
True
Current use of sentiment analysis in voice of the customer applications allows companies to change their products or services in real time in response to customer sentiment. True False
True
Data-as-a-service began with the notion that data quality could happen in a centralized place, cleansing and enriching data and offering it to different systems, applications, or users, irrespective of where they were in the organization, computers, or on the network. True False
True
Despite their potential, many current NoSQL tools lack mature management and monitoring tools True False
True
For low latency, interactive reports, a data warehouse is preferable to Hadoop True False
True
From massive amounts of high-dimensional location data, algorithms that reduce the dimensionality of the data can be used to uncover trends, meaning, and relationships to eventually produce human-understandable representations. True False
True
Hadoop was designed to handle petabytes and extabytes of data distributed over multiple nodes in parallel. True False
True
If you have many flexible programming languages running in parallel, Hadoop is preferable to a data warehouse True False
True
In designing analytic systems, it must be kept in mind that the right to an individual's privacy is not absolute True False
True
In sentiment analysis, it is hard to classify some subjects such as news as good or bad, but easier to classify others, e.g., movie reviews, in the same way True False
True
In service-oriented DSS, an application programming interface (API) serves to populate source systems with raw data and to pull operational reports True False
True
In text mining, if an association between two concepts has 7% support, it means that 7% of the documents had both concepts represented in the same document True False
True
In text mining, inputs to the process include unstructured data such as Word documents, PDF files, text excerpts, e-mail and XML files. True False
True
In the BBVA case study, text analytics was used to help the company defend and enhance its reputation in social media True False
True
In the Hong Kong government case study, reporting time was the main benefit of using SAS Business Analytics to generate reports True False
True
In the chapter's opening vignette, IBM's computer named Watson outperformed human game champions on the game show Jeopardy! True False
True
In the financial services firm case study, text analysis for associate-customer interactions were completely automated and could detect whether they met the company's standards True False
True
In the investment bank case study, the major benefit brought about by the supplanting of multiple databases by the new trade operational store was providing real-time access to trading data True False
True
In the life coach case study, Kaggle recently hosted a competition aimed at identifying muscle motions that may be used to predict the progression of Alzheimer's disease True False
True
It is important for Big Data and self-service business intelligence go hand in hand to get maximum value from analytics True False
True
Many analytics tools are too complex for the average user, and this is one justification for Big Data. True False
True
MapReduce can be easily understood by skilled programmers due to its procedural nature True False
True
Regional accents present challenges for natural language processing. True False
True
Service-oriented DSS solutions generally offer individual or bundled services to the user as a service. True False
True
The bag-of-words model is appropriate for spam detection but not for text analytics True False
True
The basic premise behind social networking is that it gives people the power to share, making the world more open and connected True False
True
The data scientist is a profession for a field that is still largely being defined. True False
True
The term "Big Data" is relative as it depends on the size of the using organization True False
True
There is a current undersupply of data scientists for the Big Data market. True False
True
Use of automated decision systems (ADSs) is likely to result in a reduction of middle management True False
True
In the Whirlpool case study, the company sought to better understand information coming from which source? A) customer transaction data B) delivery information C) customer e-mails D) goods moving through the internal supply chain
Customer e-mails
4) A British company called Path Intelligence has developed a system that ascertains how people move within a city or even within a store. What is this system called? A) Pathfinder B) PathMiner C) Footpath D) Pathdata
Footpath
Today, most smartphones are equipped with various instruments to measure jerk, orientation, and sense motion. One of these instruments is an accelerometer, and the other is a(n) A) potentiometer. B) gyroscope. C) microscope. D) oscilloscope.
Gyroscope
Which of the following offers a flexible data integration platform based on a newer generation of service-oriented standards that enables ubiquitous access to any type of data? A) EAI B) EII C) IaaS D) ETL
IaaS
Which component of service-oriented DSS can be described as optimizing the DSS environment use by organizing its capabilities and knowledge, and assimilating them into the business processes? A) information delivery portals B) information services with library and administrator C) extract, transform, load D) data marts
Information services with library and adminstrator
Content-based filtering obtains detailed information about item characteristics and restricts this process to a single user using information tags or A) keywords. B) passphrases. C) key-pairs. D) reality mining
Keywords
Research into managerial use of DSS and expert systems found all the following EXCEPT A) managers spent more of their time planning. B) managers saw their decision making quality enhanced. C) managers spent more time in the office and less in the field. D) managers were able to devote less of their time fighting fires
Manager spent more time in the office and less in the field
Which component of service-oriented DSS can be defined as data that describes the meaning and structure of business data, as well as how it is created, accessed, and used? A) application programming interface B) analytics C) operations and administration D) metadata management
Metadata Management
What kind of location based analytics is real-time marketing promotion? A) organization-oriented geospatial static approach B) organization-oriented location-based dynamic approach C) consumer-oriented geospatial static approach D) consumer-oriented location-based dynamic approach
Organization-oriented location-based dynamic approach
Service-oriented thinking is one of the fastest growing paradigms in today's economy. Which of the following is NOT a characteristic of service-oriented DSS? A) reusability B) substitutability C) extensibility D) originality
Originality
What new geometric data type in Teradata's data warehouse captures geospatial features? A) NAVTEQ B) ST_GEOMETRY C) GIS D) SQL/MM
ST_GEOMETRY
Which of these applications will derive the LEAST benefit from text mining? A) patients' medical files B) patent description files C) sales transaction files D) customer comment files
Sales transactions files
Services that let consumers permanently enter a profile of information along with a password and use this information repeatedly to access services at multiple sites are called A) consumer access applications. B) information collection portals. C) single-sign-on facilities. D) consumer information sign on facilities
Single-sign-on facilities
In text analysis, what is a lexicon? A) a catalog of words, their synonyms, and their meanings B) a catalog of customers, their words, and phrase C) a catalog of letters, words, phrases and sentences D) a catalog of customers, products, words, and phrase
a catalog of words, their synonyms, and their meanings
In the opening vignette, the architectural system that supported Watson used all the following elements EXCEPT A) massive parallelism to enable simultaneous consideration of multiple hypotheses. B) an underlying confidence subsystem that ranks and integrates answers. C) a core engine that could operate seamlessly in another domain without changes. D) integration of shallow and deep knowledge.
a core engine that could operate seamlessly in another domain without changes
All of the following are challenges associated with natural language processing EXCEPT A) dividing up a text into individual words in English. B) understanding the context in which something is said. C) distinguishing between words that have more than one meaning. D) recognizing typographical or grammatical errors in texts
dividing up a text into individual words in English
How is objectivity handled in sentiment analysis? A) It is ignored because it does not appear in customer sentiment. B) It is incorporated as a type of sentiment. C) It is clarified with the customer who expressed it. D) It is identified and removed as facts are not sentiment
it is identified and removed as facts are not sentiment
In text mining, which of the following methods is NOT used to reduce the size of a sparse matrix? A) using a domain expert B) normalizing word frequencies C) using singular value decomposition D) eliminating rarely occurring terms
normalizing world frequencies
In text mining, stemming is the process of A) categorizing a block of text in a sentence. B) reducing multiple words to their base or root. C) transforming the term-by-document matrix to a manageable size. D) creating new branches or stems of recorded paragraphs
reducing multiple words to their base or root
What types of documents are BEST suited to semantic labeling and aggregation to determine sentiment orientation? A) medium- to large-sized documents B) small- to medium-sized documents C) large-sized documents D) collections of documents
small-to medium-sized documents
Identifying the target of an expressed sentiment is difficult for all the following reasons EXCEPT A) the review may not be directly connected to the target through the topic name. B) blogs and articles with the sentiment may be general in nature. C) strong sentiments may be generated by a computer, not a person. D) sometimes there are multiple targets expressed in a sentiment
strong sentiments may be generated by a computer, not a person
In sentiment analysis, which of the following is an implicit opinion? A) The hotel we stayed in was terrible. B) The customer service I got for my TV was laughable. C) The cruise we went on last summer was a disaster. D) Our new mayor is great for the city
the customer service I got for my TV was laughable
In the research literature case study, the researchers analyzing academic papers extracted information from which source? A) the paper abstract B) the paper keywords C) the main body of the paper D) the paper references
the paper abstract
What do voice of the market (VOM) applications of sentiment analysis do? A) They examine customer sentiment at the aggregate level. B) They examine employee sentiment in the organization. C) They examine the stock market for trends. D) They examine the "market of ideas" in politics
they examine customer sentiment at the aggregate level
In the Blue Cross Blue Shield case study, speech analytics were used to identify "confusion" calls by customers. What was true about these calls? A) They took less time than others as frustrated customers hung up. B) They led customers to rely more on self-serve options. C) They were not documented by customer service reps for speech analytics. D) They were difficult to identify using standard phrases like "I don't get it."
they were not documented by customer service reps for speech analytics
Inputs to speech analytics include all of the following EXCEPT A) written transcripts of calls to service centers. B) recorded conversations of customer call-ins. C) live customer interactions with service representatives. D) videos of customer focus groups.
written transcripts of calls to service centers
Text analytics is the subset of text mining that handles information retrieval and extraction, plus data mining True False
False
The Big Data and Analysis in Politics case study makes it clear that the unpredictability of elections makes politics an unsuitable arena for Big Data True False
False
The industry impact of an automated decision system's use is limited to the company's supply chain True False
False
The linguistic approach to speech handles processes elements such as intensity, pitch and jitter from speech recorded on audio True False
False
The trend in the consumption of data analytics is away from in-memory solution and towards mobile devices. True False
False
Web-based e-mail such as Google's Gmail are not examples of cloud computing True False
False
While cloud services are useful for small and midsize analytic applications, they are still limited in their ability to handle Big Data applications True False
False
In most cases, Hadoop is used to replace data warehouses True False
False
In the Luxottica case study, outsourcing enhanced the ability of the company to gain insights into their data True False
False
In the classification of location-based analytic applications, examining geographic site locations falls in the consumer-oriented category. True False
False
Which component of service-oriented DSS includes such examples as optimization, data mining, text mining, simulation, automated decision systems? A) application programming interface B) analytics C) operations and administration D) metadata management
Analytics
Which of the following is considered the economic engine of the whole analytics industry? A) application developers and system integrators B) analytics user organizations C) analytics industry analysts and influencers D) academic providers and certification industries
Analytics user organizations
What data discovery process, whereby objects are categorized into predetermined groups, is used in text mining? A) clustering B) association C) classification D) trend analysis
Classification
GPS Navigation is an example of which kind of location based analytics? A) organization-oriented geospatial static approach B) organization-oriented location-based dynamic approach C) consumer-oriented geospatial static approach D) consumer-oriented location-based dynamic approach
Consumer-oriented geospatial static approach
Sentiment classification usually covers all the following issues EXCEPT A) classes of sentiment (e.g., positive versus negative). B) range of polarity (e.g., star ratings for hotels and for restaurants). C) range in strength of opinion. D) biometric identification of the consumer expressing the sentiment
Biometric identification of the consumer expressing the sentiment
When new analytics applications are introduced and affect multiple related processes and departments, the organization is best served by utilizing A) business flow management. B) multi-department analysis. C) process flow analysis. D) business process reengineering
Business Process Reengineering
What application is MOST dependent on text analysis of transcribed sales call center notes and voice conversations with customers? A) finance B) OLAP C) CRM D) ERP
CRM
In the opening vignette, the CERN Data Aggregation System (DAS), built on MongoDB (a Big Data management infrastructure), used relational database technology True False
False
In the patent analysis case study, text mining of thousands of patents held by the firm and its competitors helped improve competitive intelligence, but was of little use in identifying complementary products. True False
False
Oklahoma Gas & Electric employs a two-layer information architecture involving data warehouse and improved and expanded integration. True False
False
During information extraction, entity recognition (the recognition of names of people and organizations) takes place after relationship extraction. True False
False
ES/DSS were found to improve the performance of new managers but not existing managers True False
False
Hadoop and MapReduce require each other to work. True False
False
IaaS helps provide faster information, but provides information only to managers in an organization. True False
False
In sentiment analysis, sentiment suggests a transient, temporary opinion reflective of one's feelings True False
False
In text mining, creating the term-document matrix includes all the terms that are included in all documents, making for huge matrices only manageable on computers. True False
False
In the Dublin City Council case study, GPS data from the city's buses and CCTV were the only data sources for the Big Data GIS-based application True False
False
In the Great Clips case study, the company uses geospatial data to analyze, among other things, the types of haircuts most popular in different geographic locations. True False
False
All of the following are components in a service-oriented DSS environment EXCEPT A) information technology as enabler. B) data as infrastructure. C) process as beneficiary. D) people as user
Data as infrastructure
Which component of service-oriented DSS can be described as a subset of a data warehouse that supports specific decision and analytical needs and provides business units more flexibility, control, and responsibility? A) information delivery portals B) information services with library and administrator C) extract, transform, load D) data marts
Data marts
Big Data simplifies data governance issues, especially for global firms True False
False
Big Data uses commodity hardware, which is expensive, specialized hardware that is custom built for a client or application True False
False
Detecting lies from text transcripts of conversations is a future goal of text mining as current systems achieve only 50% accuracy of detection True False
False
Which of the following is true of data-as-a-Service (DaaS) platforms? A) Knowing where the data resides is critical to the functioning of the platform. B) There are standardized processes for accessing data wherever it is located. C) Business processes can access local data only. D) Data quality happens on each individual platform
There are standardized processes for accessing data wherever it is located
Which of the following is true about the furtherance of homeland security? A) There is a lessening of privacy issues. B) There is a greater need for oversight. C) The impetus was the need to harvest information related to financial fraud after 2001. D) Most people regard analytic tools as mostly ineffective in increasing security
There is a greater need for oversight
In text mining, tokenizing is the process of A) categorizing a block of text in a sentence. B) reducing multiple words to their base or root. C) transforming the term-by-document matrix to a manageable size. D) creating new branches or stems of recorded paragraphs
categorizing a block of text in a sentence