CIS 3252
In text mining, tokenizing is the process of A) categorizing a block of text in a sentence. B) reducing multiple words to their base or root. C) transforming the term-by-document matrix to a manageable size. D) creating new branches or stems of recorded paragraphs.
categorizing a block of text in a sentence.
In which stage of extraction, transformation, and load (ETL) into a data warehouse are anomalies detected and corrected? transformation extraction load cleanse
cleanse
A large storage location that can hold vast quantities of data (mostly unstructured) in its native/raw format for future/potential analytics consumption is referred to as a(n) extended ASP data cloud data lake relational database
data lake
What type of analytics seeks to recognize what happened or is happening? descriptive prescriptive predictive domain
descriptive
All of the following are challenges associated with natural language processing EXCEPT dividing up a text into individual words in English. understanding the context in which something is said. distinguishing between words that have more than one meaning. recognizing typographical or grammatical errors in texts.
dividing up a text into individual words in English.
When querying a dimensional database, a user went from summarized data to its underlying details. The function that served this purpose is When querying a dimensional database, a user went from summarized data to its underlying details. The function that served this purpose is
drill down
All of the following are benefits of hosted data warehouses EXCEPT smaller upfront investment. better quality hardware. greater control of data. frees up in-house systems.
greater control of data.
All of the following are true about in-database processing technology EXCEPT it pushes the algorithms to where the data is. it makes the response to queries much faster than conventional databases. it is often used for apps like credit card fraud detection and investment risk management. it is the same as in-memory storage technology.
it is the same as in-memory storage technology.
Which of the following BEST enables a data warehouse to handle complex queries and scale up to handle many more requests?: use of the Web by users as a front-end parallel processing Microsoft Windows a larger IT staff
parallel processing
What type of analytics seeks to make decisions to achieve the best performance possible? descriptive prescriptive predictive domain
prescriptive
Which of the following is NOT an example of transaction processing? ATM withdrawal bank deposit sales report cash register scans
sales report
When representing data in a data warehouse, using several dimension tables that are each connected only to a fact table means you are using which warehouse structure? star schema snowflake schema relational schema dimensional schema
star schema
Operational or transaction databases are product oriented, handling transactions that update the database. In contrast, data warehouses are: subject-oriented and nonvolatile. product-oriented and nonvolatile. product-oriented and volatile. subject-oriented and volatile.
subject-oriented and nonvolatile.
A Web client that connects to a Web server, which is in turn connected to a BI application server, is reflective of a: one-tier architecture. two-tier architecture. three-tier architecture. four-tier architecture.
three-tier architecture.
How does the use of cloud computing affect the scalability of a data warehouse? Cloud computing vendors bring as much hardware as needed to users' offices. Hardware resources are dynamically allocated as use increases. Cloud vendors are mostly based overseas where the cost of labor is low. Cloud computing has little effect on a data warehouse's scalability.
Hardware resources are dynamically allocated as use increases.
Which of the following statements about Web site conversion statistics is FALSE? Web site visitors can be classed as either new or returning. Visitors who begin a purchase on most Web sites must complete it. The conversion rate is the number of people who take action divided by the number of visitors. Analyzing exit rates can tell you why visitors left your Web site.
Visitors who begin a purchase on most Web sites must complete it.
Search engine optimization (SEO) is a means by which Web site developers can negotiate better deals for paid ads. Web site developers can increase Web site search rankings. Web site developers index their Web sites for search engines. Web site developers optimize the artistic features of their Web sites. Saturday,
Web site developers can increase Web site search rankings.
Online transaction processing (OLTP) systems handle a company's routine ongoing business. In contrast, a data warehouse is typically the end result of BI processes and operations. a repository of actionable intelligence obtained from a data mart. a distinct system that provides storage for data that will be made use of in analysis. an integral subsystem of OLTP.
a distinct system that provides storage for data that will be made use of in analysis.
What does Web content mining involve? analyzing the universal resource locator in Web pages analyzing the unstructured content of Web pages analyzing the pattern of visits to a Web site analyzing the PageRank and other metadata of a Web page
analyzing the unstructured content of Web pages