Chapter 9
In the regression line Y = a + bX + e, Y is the value of the independent variable. T/F
False
Data governance involves identifying people who are responsible for fixing and preventing issues with data. T/F
True
Data mining is used to explore large amounts of data, looking for hidden patterns that can be used to predict future trends and behaviors. T/F
True
During drill-down, you go from high-level summary data to detailed levels of data. T/F
True
Data within the data cube has been ______ by specific dimensional values. a. drilled into b. correlated c. summarized d. regressed
c. summarized
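Note: a minimal sketch of how a cube summarizes a measure by dimensional values, and how drill-down adds a dimension to reach more detail. The rows, the dimension names, and the `summarize` helper are hypothetical illustrations, not taken from any BI tool.

```python
from collections import defaultdict

# Hypothetical fact rows: (major, gender, gpa). The last field is the measure.
rows = [
    ("MIS", "F", 3.6),
    ("MIS", "M", 3.2),
    ("Finance", "F", 3.4),
    ("Finance", "F", 3.8),
]

def summarize(rows, dims):
    """Average the GPA measure, grouped by the chosen dimension indexes."""
    totals = defaultdict(lambda: [0.0, 0])  # key -> [running sum, row count]
    for row in rows:
        key = tuple(row[i] for i in dims)
        totals[key][0] += row[-1]
        totals[key][1] += 1
    return {key: total / count for key, (total, count) in totals.items()}

# High-level summary: one average per major.
by_major = summarize(rows, dims=[0])

# Drill-down: the same measure at a more detailed level (major and gender).
by_major_gender = summarize(rows, dims=[0, 1])

print(by_major)
print(by_major_gender)
```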
In linear regression, you are to find a line that best fits the data. Sometimes the line may be curved. T/F
False
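Note: a minimal sketch of fitting the straight line Y = a + bX by ordinary least squares, assuming a single independent variable. The `fit_line` helper and the sample data are hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least squares for Y = a + bX; returns (a, b)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # The slope is the co-variation of X and Y divided by the variation of X.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    b = sxy / sxx
    # The fitted line always passes through the point (mean_x, mean_y).
    a = mean_y - b * mean_x
    return a, b

xs = [1, 2, 3, 4]   # independent variable X
ys = [3, 5, 7, 9]   # dependent variable Y
a, b = fit_line(xs, ys)
print(a, b)
```

The result is always a straight line defined by an intercept a and a slope b; a curved fit would need a different model.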
Users still need help from the IT function of the organization to create customer reports using modern reporting tools. T/F
False
A conversion funnel is a visual depiction of a set of words that have been grouped together because of the frequency of their occurrence. T/F
False
Suppose your university administrators wish to summarize student GPA on Major, Gender, High School, Home City and ZIP Code dimensions. This cannot be done with one data cube. T/F
False
For an organization to get real value from its BI efforts, it must have a solid data management program. T/F
True
If you wish to have a visual depiction of relative frequencies of words in a document, a word cloud would be an appropriate option. T/F
True
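Note: a word cloud sizes each word by how often it occurs. A minimal sketch of computing those relative frequencies, assuming simple lowercase tokenization; the `relative_frequencies` helper and the sample sentence are hypothetical.

```python
from collections import Counter
import re

def relative_frequencies(text):
    """Return each word's share of all words in the text."""
    words = re.findall(r"[a-z']+", text.lower())
    counts = Counter(words)
    total = sum(counts.values())
    return {word: count / total for word, count in counts.items()}

freqs = relative_frequencies("data mining finds patterns in data")
print(freqs)
```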
Power Pivot and Power Query are components of Microsoft's Power BI tool. T/F
True
Some insurance companies can detect fraudulent claims using BI software. T/F
True
Suppose you are good at math and statistics. Adding programming to your skill set will be necessary if you want to be a data scientist. T/F
True
Which of the following is NOT a component of a KPI (key performance indicator)? a. format b. measure c. time frame d. direction
a. format
Mastery of which subject is NOT required for being a data scientist? a. Statistics b. English c. Computer science d. Mathematics
b. English
The coefficient of determination, r squared, is ____. a. the same as the slope of the linear regression b. a number that indicates how well data fits the statistical model c. a number that must be close to zero to be useful d. the same as the error term in a linear regression
b. a number that indicates how well data fits the statistical model
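Note: a minimal sketch of computing r squared as 1 minus the ratio of residual variation to total variation. The `r_squared` helper and the sample predictions are hypothetical.

```python
def r_squared(ys, predicted):
    """Coefficient of determination: share of variation in Y the model explains."""
    mean_y = sum(ys) / len(ys)
    ss_res = sum((y - p) ** 2 for y, p in zip(ys, predicted))  # unexplained
    ss_tot = sum((y - mean_y) ** 2 for y in ys)                # total
    return 1 - ss_res / ss_tot

ys = [3, 5, 7, 9]
perfect_fit = [3, 5, 7, 9]   # predictions match the data exactly
mean_only = [6, 6, 6, 6]     # predictions ignore X and always return the mean

r2_perfect = r_squared(ys, perfect_fit)
r2_mean = r_squared(ys, mean_only)
print(r2_perfect, r2_mean)
```

A value near 1 means the model fits the data well; a value near 0 means it explains almost none of the variation.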
Which of these analysis methods describes neural computing? a. a specialized set of algorithms sorts through data and forms statistical rules about relationships among the items b. a mathematical procedure to predict the value of a dependent variable based on a single independent variable c. historical data is examined for patterns that are then used to make predictions d. historical if-then-else cases are used to recognize patterns
c. historical data is examined for patterns that are then used to make predictions
Which of the following is a potential disadvantage with self-service BI? a. Encourages nontechnical end users to make decisions based on facts and analyses rather than intuition. b. Accelerates and improves decision making. c. Gets valuable data into the hands of the people who need it the most—end users. d. Can lead to overspending on unapproved data sources and business analytics tools.
d. Can lead to overspending on unapproved data sources and business analytics tools.
What is the projected shortage of data scientists in the USA, according to a McKinsey & Co. report? a. 200,000 to 500,000 b. more than 500,000 c. 100,000 to 200,000 d. 100,000 or less
c. 100,000 to 200,000
_____ is used to explore large amounts of data for hidden patterns to predict future trends. a. Drill down b. Data governance c. Data mining d. Linear regression
c. Data mining
One of the goals of business intelligence is to _________. a. use the most sophisticated techniques b. ensure the transaction data is properly stored c. present the results in an easy-to-understand manner d. educate the users about business statistics
c. present the results in an easy-to-understand manner
During modeling of the CRISP-DM method, we would ______. a. assess if the model achieves business goals b. clarify business goals for the data mining project c. select a subset of data to be used and prepare it d. apply selected modeling techniques
d. apply selected modeling techniques
