Business 2600
11.44 94/822 = 11.44%
822 recently purchased books were randomly selected from all recent book purchases over the Internet. The chart below shows the breakdown of the classification of the book type. What percentage of the books in the sample were self-help books?
Undercoverage
Amazon collects random reviews from their customers on the West Coast as the data they use for their nationwide marketing plan. This sample suffers from which sampling bias?
stretching the axes
An example of manipulating a graphical display to distort reality is ________.
Ordinal
An identification of police officers by rank would represent a(n) ________ level of measurement.
Outlier
An unusually large or small observation separated from the rest of the data is a(n) ________.
True
Beginning the vertical scale of a graph at a value different from zero can cause increases to look more dramatic.
Identify each randomly selected customer by a random identification number
If Amazon wanted to ensure a customers' response to an online survey was confidential and the sampling method was unbiased, which of the following should be implemented?
Misleading Graphs
If a dataset's maximum value is 1500 but the graph only goes to 1000, this is an example of:
Privacy
If we are concerned whether the data was collected confidentially or anonymously, this is an example of which data ethic principle?
False
If we examine some of the population measurements, we are conducting a census of the population.
convenience sampling
In ________ we select elements because they are easy to sample.
False
In order to select a stratified random sample, we divide the population into overlapping groups of similar elements.
True
In systematic sampling, the first element is randomly selected from the first (N/n) elements.
Make predictions
In the Analytic Process, the next step after the descriptives (descriptives is the data that has occurred up to the current point in time) have been collected, is to use the data to:
True
It is possible to use a random sample from a population to make statistical inferences about the entire population.
Strata
Nonoverlapping groups of similar elements in a population are called
Cluster Random Sampling by regions nationwide
The 2020 Presidential Election was in November and we saw reports on the percentage of Americans who were registered to vote in the national. If we wanted to sample all registered to vote in the national election. If we wanted to sample all registered voters in the United States and avoid bias, as well as, save time and money, the appropriate sampling method to use would be:
consent
The Data Ethics principle that ensures a participant is aware of what data is being collected, the purpose, and how the data will be used is _________. (one word answer)
Openness
The Data Ethics principle that provide supports all datasets are available to the public is
Frequency
The number of measurements falling within a class interval is called the ________.
relative frequency
The proportion of measurements in a class is called the ________ of that class.
to make statistical inferences about the entire population
The purpose of collecting a random sample from a population is:
Data Ethics
The question "Do you use third-party cookies" is an example of what would be asked if a person was concerned about:
True
Using a nonrandom sample procedure in order to support a desired conclusion is an example of an unethical statistical procedure.
Visual Analytics
What is it called when we combine graphs (visuals) with predictive analytics?
Data Visualization
What is the process of displaying large quantities of data in a meaningful way called?
slashes
What should be included on a graph axis when the scale does not start at zero? (one word answer)
Currency
When a company sells your data from a datasourcing method such as clickstream, this is an example of which data ethic principle?
Conduct another sample to increase the response rate
When a low response bias occurs, what should you do?
Measurement Error
When a person gives an answer that they think you want to hear or is politically correct, this is an example of which bias:
False
When data are qualitative, the bars should never be separated by gaps.
nonoverlapping
When developing a frequency distribution, the class (group) intervals must be ________.
Choose DataSourcing Method-Collect Descriptives-Make Prediction-Ask Next Question
Which of the following describes the steps of the Analytic Process after we know the business problem?
histogram
Which of the following divides quantitative measurements into classes and graphs the frequency, relative frequency, or percentage frequency for each class?
whether a person has a traffic violation
Which of the following is a categorical variable?
milage of a car
Which of the following is a quantitative variable?
mileage of a car
Which of the following is a quantitative variable?
All of the other answers are correct.
Which of the following is a type of question used in survey research?
Which of the following is not an example of unethical statistical practices?
Which of the following is not an example of unethical statistical practices?
Since hacking is so common, are you confident that the information you provide online is secure?
Which of the following questions would result in Response Bias?
histogram
Which one of the following graphical tools is used with quantitative data?
Bar charts, pie charts
________ and ________ are used to describe qualitative (categorical) data.
Undercoverage
________ occurs when some population elements are excluded from the process of selecting the sample.
nominative
A(n) ________ variable is a qualitative variable such that there is no meaningful ordering or ranking of the categories.
36.50 300 mystery or science fiction/fantasy books purchased; 300/822 = 36.5%.
822 recently purchased books were randomly selected from all recent book purchases over the Internet. The chart below shows the breakdown of the classification of the book type. What percentage of the books in the sample were either mystery or science fiction/fantasy?
dichotomous
A Yes or No question is ________.
bar chart, histogram
A ________ displays the frequency of each class with qualitative data and a ________ displays the frequency of each class with quantitative data.
Sample
A ________ is a subset of the units in a population.
Scatterplot
A ________ shows the relationship between two variables.
quantitative
A ________ variable takes on values that are numbers on the real number line.
True
A bar chart is a graphic that can be used to depict qualitative data
Sensor
A commercial shipping company operates a large fleet of trucks. The company needs to collect data on vehicle usage including acceleration, braking, and vehicle maintenance such as fluid levels. The datasourcing method needed to collect the data for this scenerio is
True
A common practice in selecting a sample from a large geographic area is multistage cluster sampling.
Point of Sale
A doctor's office needs to know how many of their services are used by their patients during a one year time period. Before a patient leaves their office, a summary report includes the date, the doctor they saw, the services provided, and the total charge for each service. Which of the following dataourcing methods best describes this data collection scenario?
skewed to the left
A histogram that has a longer tail extending toward smaller values is ________.
convenient
A local store wants to find out if a new item is linked by their customers. They ask the first 10 customers that enter the store and use only this data. This is an example of sampling.
False
A low response rate has no effect on the validity of a survey's findings.
Category / Class A, 84, 0.4, 40% Frequency B, 84, 0.4, 40% RelativeFrequency C, 21, 0.10, 10% Percent Frequency D 21, 0.10, 10%
A multiple choice question on an exam has four possible responses—(A), (B), (C), and (D). When 210 students take the exam, 84 give response (A), 84 give response (B), 21 give response (C), and 21 give response (D). Write out the frequency distribution, relative frequency distribution, and percent frequency distribution for these responses. (Round your relative frequency answers to 2 decimal places.)
Figure A is Correct
A multiple choice question on an exam has four possible responses—(A), (B), (C), and (D). When 250 students take the exam, 100 give response (A), 25 give response (B), 75 give response (C), and 50 give response (D). (a) Write out the frequency distribution, relative frequency distribution, and percent frequency distribution for these responses. (Round your relative frequency answers to 2 decimal places.)
nominative
A person's telephone area code is an example of a(n) ________ variable.
scatter
A plot that allows us to visualize the relationship between two variables is a(n) ________ plot.
True
A population is a set that includes all elements about which we wish to draw a conclusion.
False
A quantitative variable can also be referred to as a categorical variable.
True
A random sample is selected so that every element in the population has the same chance of being included in the sample.
True
A recording error is an error of observation.
Population
A set of all elements we wish to study is called a ________.
qualitative
A(n) ________ variable can have values that indicate into which of several categories of a population it belongs.
Quantitative; dollar amounts correspond to values on the real number line. Quantitative; net profit is a dollar amount. Qualitative; which stock exchange is a category. Quantitative; national debt is a dollar amount. Qualitative; which type of medium is a category.
Below we list several variables. Which of these variables are quantitative and which are qualitative?
False
Business analytics is a new field that does not use traditional statistics to analyze big data.
Systematic
By randomly selecting a starting point then choosing every 50th person, we are conducting a ________ random sample.
True
By taking a systematic sample in which we select every 100th shopper arriving at a specific store, we are approximating a random sample of shoppers.
Letter Grades: Ordinal - each grade from A to F indicates an increasingly lower grade. Door Choices: Nominative - each door is the same except for the number given. For example, Door 1 is not better or worse or higher or lower than Door 3. TV Classifications: Ordinal - each category from TV-G to TV-MA indicates programming appropriate for increasingly older viewers. PC Ownership: Nominative - no ordering of categories. Restaurant Ratings: Ordinal - each rating from 5-star to 1-star indicates an increasingly lower rating. Filing Status: Nominative - no ordering of categories.
Classify each of the following qualitative variables as ordinal or nominative.
.150 (5 + 1) = 6 over 24 miles; 6/40 = .15.
Consider the following data on distances traveled by people to visit the local amusement park and calculate the relative frequency for the distances over 24 miles.
.375 Total of 40 measurements: 15/40 = .375.
Consider the following data on distances traveled by people to visit the local amusement park and calculate the relative frequency for the shortest distance.
False
Convenience sampling is a type of probability sampling in which we select elements to sample because we believe they have the highest probability of responding.
Errors of non-observation occur when data values are recorded incorrectly
Errors of non-observation occur when data values are recorded incorrectly
False
Errors of non-observation occur when data values are recorded incorrectly.
census
Examining all population measurements is called ________.
Ordinal
Possible qualitative data for Stitchfix to collect is a clothing size such as small, medium, and large. This is an example of __________ (ordinal, nominal, discrete, or continuous) variable
True
Sampling error occurs because a characteristic of a random sample may not exactly equal the population characteristic that we are attempting to estimate.
True
Statistical inference is the science of using a sample of measurements to make generalizations about the important aspects of a population of measurements.
stratified
StitchFix has 1000 customers they sampled by age. They should use _________ random sampling to randomly select 50 customers from each age group.
Stratified Random Sampling
StitchFix wants to make sure different specific characteristics, such as all different age groups, are represented in their sample. The appropriate sampling method they should use is:
True
Stratification can at times be combined with multistage cluster sampling to develop an appropriate sample.