Business Analytics ch. 3, 4 & 5
Which of the following correlation coefficients describes the weakest relationship between two quantitative variables? 0.50 1.29 0.15 0.70 -0.80
0.15
Which of the following correlation coefficients describes the strongest relationship between two quantitative variables? 1.29 0.15 0.25 -0.80 0.85
0.85
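A quick way to check the two correlation cards above: a correlation coefficient always lies between -1 and 1, so 1.29 cannot be a valid answer, and strength is judged by absolute value, not by sign. The sketch below is only an illustration; the helper names are made up for this example.

```python
def is_valid_r(r):
    """A correlation coefficient must lie in the interval [-1, 1]."""
    return -1.0 <= r <= 1.0


def weakest(coefficients):
    """Return the valid coefficient closest to 0 (weakest relationship)."""
    valid = [r for r in coefficients if is_valid_r(r)]
    return min(valid, key=abs)


def strongest(coefficients):
    """Return the valid coefficient farthest from 0 (strongest relationship)."""
    valid = [r for r in coefficients if is_valid_r(r)]
    return max(valid, key=abs)


# Answer choices taken from the two cards above.
weakest_choice = weakest([0.50, 1.29, 0.15, 0.70, -0.80])
strongest_choice = strongest([1.29, 0.15, 0.25, -0.80, 0.85])

print(weakest_choice)    # 0.15
print(strongest_choice)  # 0.85
```

Note that -0.80 is stronger than 0.70 even though it is negative; it loses to 0.85 only because 0.85 is farther from zero.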
From the video we watched in class (the link is provided in the PowerPoint in the Misleading Graphs module), the act of choosing a portion of the axis which leads to telling a misleading story is called: Analytics Visualization Informed Consent Biased Selection Openness Cherry Picking
Cherry picking
Which of the following are included in the Key Points of Storytelling? Eliminate distorted axes Always include a title and data values Include slashes Always start the frequency axis at zero Choose the appropriate graph & tell an accurate story
Choose the appropriate graph & tell an accurate story
Use the Pareto chart given in the lecture PowerPoint to answer this question. What are the "vital few" defect(s) that should be corrected based on the 80/20 Pareto Principle?
Crooked Label, Missing Label, Printing Error, Loose Label
When a company sells your data from a data-sourcing method such as clickstream, this is an example of which data ethics principle? Privacy Openness Currency Ownership Consent
Currency
What is the process of displaying large quantities of data in a meaningful way called? Visual Analytics Storytelling Data Visualization Big Data None of these
Data Visualization
If Amazon wanted to ensure a customer's response to an online survey was confidential and the sampling method was unbiased, which of the following should be implemented? Make sure those customers who volunteer to reply are coded with a random identification number Identify each randomly selected customer by a random identification number Use clickstream to collect the data Send an email to customers with only a first or last name in their userid but not both Use a means of distribution to ensure there is no identification to any of the participants
Identify each randomly selected customer by a random identification number
If a dataset's maximum value is 1500 but the graph only goes to 1000, this is an example of: Misleading Graphs None of these Data Ethics Ownership Data Analytics
Misleading graphs
The Data Ethics principle that supports making datasets available to the public is: Openness Consent Ownership Privacy Currency
Openness
Classify each of the following qualitative variables as ordinal or nominative. (1) Statistics course letter grade: A, B, C, D, F (2) Door choice on Let's Make A Deal: Door #1, Door #2, Door #3 (3) Television show classifications: TV-G, TV-PG, TV-14, TV-MA (4) Personal computer ownership: Yes, No (5) Restaurant rating: *****, ****, ***, **, * (6) Income tax filing status: Married filing jointly; Married filing separately; Single; Head of household; Qualifying widow(er)
(1) Ordinal (2) Nominative (3) Ordinal (4) Nominative (5) Ordinal (6) Nominative
The Kia Plant needs to determine the cumulative frequency of defects in the production of a car door hinge. Which type of graph should they use to accurately present this information?
Pareto chart
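To see what a Pareto chart summarizes, here is a minimal sketch of the table behind one: categories sorted from most to least frequent, with a running cumulative percentage. The defect names and counts are hypothetical, made up for illustration only; they are not the data from the lecture chart or from the Kia example.

```python
def pareto_table(counts):
    """Return rows of (category, count, cumulative percent), largest count first."""
    total = sum(counts.values())
    ordered = sorted(counts.items(), key=lambda item: item[1], reverse=True)
    rows = []
    running = 0
    for name, count in ordered:
        running += count
        rows.append((name, count, round(100 * running / total, 1)))
    return rows


# Hypothetical defect counts (assumed values, not from the source).
hypothetical_defects = {
    "Type C": 10,
    "Type A": 50,
    "Type E": 4,
    "Type B": 30,
    "Type D": 6,
}

table = pareto_table(hypothetical_defects)
for name, count, cumulative in table:
    print(f"{name}: count={count}, cumulative={cumulative}%")
```

With these made-up counts, the first two categories account for 80% of all defects, which is the kind of "vital few" pattern the 80/20 Pareto Principle card refers to.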
The question "Do you use third-party cookies" is an example of what would be asked if a person was concerned about: Data Visualization Principle All of these Openness Data Ethics Principle Privacy Data Ethics Principle Privacy Storytelling Principle
Privacy Data Ethics Principle
Which of the following is not an example of unethical statistical practices? None of the other answers is correct. Using graphs to make statistical inferences Improper sampling Inappropriate interpretation of statistical results Descriptive measures that mislead the user
Using graphs to make statistical inferences
A ________ displays the frequency of each class with qualitative data and a ________ displays the frequency of each class with quantitative data. stem-and-leaf, pie chart histogram, stem-and-leaf display scatter plot, bar chart bar chart, histogram
bar chart, histogram
________ and ________ are used to describe qualitative (categorical) data.
bar charts, pie charts
The Data Ethics principle that ensures a participant is aware of what data is being collected, the purpose, and how the data will be used is
consent
In ________ we select elements because they are easy to sample. judgment sampling probability sampling random sampling convenience sampling
convenience sampling
A Yes or No question is
dichotomous
Which of the following is a type of question used in survey research?
dichotomous, open-ended, multiple-choice
Given a scatterplot, if one quantitative variable increases when the other quantitative variable decreases, the _____ is _______.
direction, negative
Given a scatterplot, if one quantitative variable increases as the other quantitative variable increases, the _____ is _______.
direction, positive
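The two direction cards above describe what you see on a scatterplot. One common numeric summary of the same idea is the sign of the Pearson correlation coefficient, which is what the sketch below uses; that choice, the function names, and the data values are assumptions made for illustration, not something the cards specify.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient for two equal-length lists of numbers."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def direction(xs, ys):
    """Label the direction of a relationship from the sign of r."""
    r = pearson_r(xs, ys)
    if r > 0:
        return "positive"
    if r < 0:
        return "negative"
    return "none"


# Made-up example data.
hours_studied = [1, 2, 3, 4, 5]
exam_score = [52, 60, 61, 70, 75]   # rises as hours rise
price = [1, 2, 3, 4, 5]
units_sold = [90, 80, 65, 50, 45]   # falls as price rises

print(direction(hours_studied, exam_score))  # positive
print(direction(price, units_sold))          # negative
```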
A scatter plot is mainly used to identify outliers. True or False
false
When data are qualitative, the bars should never be separated by gaps.
false
The number of measurements falling within a class interval is called the ________.
frequency
Which of the following divides quantitative measurements into classes and graphs the frequency, relative frequency, or percentage frequency for each class? scatter plot stem-and-leaf display dot plot histogram
histogram
Which one of the following graphical tools is used with quantitative data? Pareto chart bar chart pie chart histogram
histogram
Which of the following is a quantitative variable? whether a person has a charge account a person's gender mileage of a car the manufacturer of a cell phone whether a person is a college graduate
mileage of a car
A(n) ________ variable is a qualitative variable such that there is no meaningful ordering or ranking of the categories. interval ratio nominative ordinal
nominative
When developing a frequency distribution, the class (group) intervals must be
nonoverlapping
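One way to picture nonoverlapping class intervals is to make each class include its lower boundary and exclude its upper boundary, so every measurement falls in exactly one class. The sketch below assumes equal-width classes and uses made-up mileage values; the function name and parameters are hypothetical.

```python
def class_frequencies(values, start, width, n_classes):
    """Count values in classes [start, start+width), [start+width, start+2*width), ..."""
    counts = [0] * n_classes
    for value in values:
        index = int((value - start) // width)
        # Values outside the covered range are ignored in this sketch.
        if 0 <= index < n_classes:
            counts[index] += 1
    return counts


# Made-up mileage measurements.
mileages = [21, 25, 30, 30, 34, 35, 39]

# Classes: [20, 25), [25, 30), [30, 35), [35, 40)
counts = class_frequencies(mileages, start=20, width=5, n_classes=4)
print(counts)  # [1, 1, 3, 2]
```

Because the classes do not overlap, the class frequencies add up to the total number of measurements; a value such as 25 is counted once, in the class that starts at 25.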
An identification of police officers by rank would represent a(n) ________ level of measurement. ordinal nominative interval ratio
ordinal
An unusually large or small observation separated from the rest of the data is a(n) absolute extreme outlier mode quartile
outlier
If we are concerned whether the data was collected confidentially or anonymously, this is an example of which data ethics principle? Currency Consent Ownership Privacy Openness
privacy
percent frequency
relative frequency as a percentage
A plot that allows us to visualize the relationship between two variables is a(n) ________ plot. frequency ogive scatter dot
scatter
A ________ shows the relationship between two quantitative variables. Pareto chart scatter plot pie chart histogram bar chart
scatter plot
If a histogram has a long left or right tail, the data is described as: All of these Approximately Normal Skewed A story Misleading
skewed
What should be included on a graph axis when the scale does not start at zero?
slashes
An example of manipulating a graphical display to distort reality is ________. adding an unbiased caption starting the axes at zero stretching the axes making the bars in a histogram equal widths
stretching the axes
frequency
the number of measurements in a class (the count itself)
relative frequency
the proportion of measurements in a class
A bar chart is a graphic that can be used to depict qualitative data
true
Beginning the vertical scale of a graph at a value different from zero can cause increases to look more dramatic. True or false
true
Statistical inference is the science of using a sample of measurements to make generalizations about the important aspects of a population of measurements. True or false
true
The relative frequency is the frequency of a class divided by the total number of measurements.
true
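The definitions of frequency, relative frequency, and percent frequency in this set can be tied together with a small worked example. This is a minimal sketch with made-up letter grades; the function name is hypothetical.

```python
from collections import Counter


def frequency_table(values):
    """Map each category to (frequency, relative frequency, percent frequency)."""
    counts = Counter(values)
    total = len(values)
    return {
        category: (count, count / total, 100 * count / total)
        for category, count in counts.items()
    }


# Made-up letter grades for eight students.
grades = ["A", "B", "B", "C", "B", "A", "C", "B"]

table = frequency_table(grades)
for category in sorted(table):
    frequency, relative, percent = table[category]
    print(f"{category}: frequency={frequency}, relative={relative}, percent={percent}%")
```

Here grade B has a frequency of 4, a relative frequency of 4/8 = 0.5, and a percent frequency of 50%. The relative frequencies across all classes sum to 1.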
Using a nonrandom sample procedure in order to support a desired conclusion is an example of an unethical statistical procedure. True or false
true
When looking at the shape of the distribution using a histogram, a distribution is skewed to the right when the left tail is shorter than the right tail. True or False
true
What is it called when we combine graphs (visuals) with predictive analytics? Storytelling Data Visualization Graphic Tabulations Visual Analytics None of these
visual analytics
Which of the following is a categorical variable? air temperature bank account balance whether a person has a traffic violation daily sales in a store value of company stock
whether a person has a traffic violation