Week 1: Introduction to Statistics, Data Collection, and Data Concepts
Which of the following would be the standard deviation for this sample data set: 5, 7, 6, 9, 6, 4, 4, 6, 5, 9, 3? - 5.82 - 1.85 - 1.94 - 2.91
1.94
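As a worked check on the card above, the sketch below computes the standard deviation by hand. It assumes the data are a sample (divide by n - 1); dividing by n instead gives the population value, which matches the 1.85 distractor. The variable names are illustrative only.

```python
import math

data = [5, 7, 6, 9, 6, 4, 4, 6, 5, 9, 3]
n = len(data)
mean = sum(data) / n

# Sum of squared deviations from the mean.
ss = sum((x - mean) ** 2 for x in data)

# Sample standard deviation divides by n - 1; population divides by n.
sample_sd = math.sqrt(ss / (n - 1))
population_sd = math.sqrt(ss / n)

print(round(sample_sd, 2))      # 1.94
print(round(population_sd, 2))  # 1.85
```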
Which of the following would be the mean of this data set: 5, 7, 12, 56, 4, 2, 33, 21 - 18.7 - 8.0 - 140.0 - 17.5
17.5
Which of the following would be the mean of this data set: 5, 17, 22, 56, 4, 2, 33, 21 - 18.00 - 18.06 - 20.00 - 16.90
20.00
Given the following data: 12.1, 56.8, 14.4, 45.2, 14.4, 23.7, 14.4, 21.3, 17.6. Determine the mean. - 12.1 - 17.6 - 14.4 - 24.4
24.4
Given the following data: 12.1, 62.8, 14.4, 45.2, 14.4, 23.7, 14.4, 21.3, 17.6. Determine the mean. - 25.1 - 62.8 - 17.6 - 14.4
25.1
What is the mode of the following data set? 2, 2, 3, 3, 3, 3, 4, 4, 5, 5, 5, 6, 7, 7, 8 - 3 - 2, 4, 5, and 7 - 2 - 3 and 4
3
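One way to confirm the mode is to count how often each value appears and keep the value(s) with the highest count. This is a minimal sketch using the data from the card above; the variable names are illustrative.

```python
from collections import Counter

data = [2, 2, 3, 3, 3, 3, 4, 4, 5, 5, 5, 6, 7, 7, 8]

# Count occurrences of each value, then keep every value tied for the top count.
counts = Counter(data)
top_count = max(counts.values())
modes = sorted(value for value, count in counts.items() if count == top_count)

print(modes, top_count)  # [3] 4
```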
What would be the mean of this data set: 8, 9, 7, 508, 7, 1, 5, 3, 4? - 55.2 - 5.7 - 57.0 - 61.3
61.3
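The mean is the sum of the values divided by how many there are. The sketch below repeats the calculation for the card above and, as an illustration only, recomputes it with the value 508 left out to show how strongly a single extreme value pulls the mean.

```python
data = [8, 9, 7, 508, 7, 1, 5, 3, 4]

# Mean = sum of values / number of values.
mean_all = sum(data) / len(data)

# Same calculation with the extreme value 508 removed (illustration only).
trimmed = [x for x in data if x != 508]
mean_trimmed = sum(trimmed) / len(trimmed)

print(round(mean_all, 1))  # 61.3
print(mean_trimmed)        # 5.5
```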
Which of the following would be the variance of this population data set: 3, 9, 8, 9, 4, 5, 7, 11, 9, 7, 5, 4, 3, 1 - 7.92 - 8.53 - 2.92 - 2.81
7.92
Which of the following would be the variance of this population data set: 3, 9, 8, 9, 4, 5, 7, 11, 9, 7, 5, 4, 3, 1 - 8.53 - 2.81 - 7.92 - 2.92
7.92
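Because the question specifies a population data set, the variance divides by n rather than n - 1. The sketch below uses Python's standard statistics module to show both results; the sample formula gives the 8.53 distractor.

```python
import statistics

data = [3, 9, 8, 9, 4, 5, 7, 11, 9, 7, 5, 4, 3, 1]

# pvariance divides by n (population); variance divides by n - 1 (sample).
population_variance = statistics.pvariance(data)
sample_variance = statistics.variance(data)

print(round(population_variance, 2))  # 7.92
print(round(sample_variance, 2))      # 8.53
```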
A graph shows a number line with a mark for each data value in the data set above its value. This graphic is most likely to be: - a stem-leaf plot - a frequency histogram - a frequency polygon - a dot plot
A dot plot
A graph shows vertical bars with the number of data points on the y-axis and groups on the x-axis. This graphic is most likely to be: - a dot plot - a stem-leaf plot - a frequency histogram - a frequency polygon
A frequency histogram
Stratified sample
A sample where the population is divided into groups and several are randomly sampled from each group.
Sample
A subset, or part, of a population
Observational unit
A train passenger
Population
All adult train passengers from the five cities in Australia
Inferential statistics
Allow for conclusions to be drawn about the population from the observed sample of data.
"Qualitative variable" is another term for a ______ variable. - quantitative - categorical
Categorical
A car's maker is a ______ variable. - quantitative - categorical
Categorical
A house's address is a _____ variable. - quantitative - categorical
Categorical
A house's color is a ______ variable. - quantitative - categorical
Categorical
Population
Collection of all data that is of interest
Draft a policy proposal based on estimates from the sample.
Conclusion
Qualitative data
Consists of attributes, labels, or non numerical entries
To gather information on customer satisfaction, a researcher goes into the local store and interviews six randomly selected customers. This sampling technique is called: - cluster - random - stratified - convenience
Convenience
You need to study the satisfaction of customers of a specific restaurant. You decide to order food and talk to those customers sitting next to you. This would most closely describe which type of sampling technique? - Convenience - Stratified - Systematic - Random
Convenience
Send surveys to a sample of Australian citizens
Data collection
You take a sample of 50 cities and create a bar chart showing the population of each. This is an example of: - nominal data - qualitative data - descriptive statistics - inferential statistics
Descriptive statistics
Generate a map of Australia's population colored by level of satisfaction with transportation.
Descriptive statistics
The average age of students in a statistics class is 28 years. The 28 years would be considered an example of: - descriptive statistics - inferential statistics - a population - qualitative data
Descriptive statistics
The average height of 15 basketball players is an example of: - a population - inferential statistics - qualitative data - descriptive statistics
Descriptive statistics
The difference between cluster sampling and stratified sampling would be: - Each divides the population into groups. Stratified sampling uses some in each group while cluster uses all in selected groups. - Each divides the population into groups. Cluster sampling uses some in each group while stratified uses all in selected groups. - There is no difference. It is two names for the same type of sampling. - Cluster divides the population into groups and uses all in selected group while stratified uses some only in selected groups.
Each divides the population into groups. Stratified sampling uses some in each group while cluster uses all in selected groups.
The difference between cluster sampling and stratified sampling would be: - There is no difference. It is two names for the same type of sampling. - Each divides the population into groups. Cluster sampling uses some in each group while stratified uses all in selected groups. - Each divides the population into groups. Stratified sampling uses some in each group while cluster uses all in selected groups. - Cluster divides the population into groups and uses all in selected group while stratified uses some only in selected groups.
Each divides the population into groups. Stratified sampling uses some in each group while cluster uses all in selected groups.
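The difference can be sketched in a few lines. The population below is hypothetical (four groups of ten members, labelled A to D), and the sample sizes are arbitrary choices made for illustration.

```python
import random

rng = random.Random(0)  # fixed seed so the sketch is repeatable

# Hypothetical population: 4 groups of 10 members each.
groups = {g: [f"{g}-{i}" for i in range(10)] for g in "ABCD"}

# Stratified: randomly sample SOME members from EVERY group.
stratified = [m for members in groups.values() for m in rng.sample(members, 3)]

# Cluster: randomly select SOME groups, then take ALL members of those groups.
chosen_groups = rng.sample(sorted(groups), 2)
cluster = [m for g in chosen_groups for m in groups[g]]

print(len(stratified), len(cluster))  # 12 20
```

Every group appears in the stratified sample, while the cluster sample contains only the two selected groups in full.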
In manufacturing, systematic sampling could be used to determine if the machines are operating correctly. Which of the following best describes this type of sampling? - Every 10th product in the line is selected - Products are put into groups, some groups are selected and all products from each selected group are included in the sample - Samples are randomly selected throughout the day - Products are put into groups and some are randomly selected from each group
Every 10th product in the line is selected
In manufacturing, systematic sampling could be used to determine if the machines are operating correctly. Which of the following best describes this type of sampling? - products are put into groups, some groups are selected and all products from each selected group are included in the sample - samples are randomly selected throughout the day - products are put into groups and some are randomly selected from each group - every 10th product in the line is selected
Every 10th product in the line is selected.
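Systematic sampling can be sketched as taking every k-th item from an ordered list. The production line of 100 numbered products below is hypothetical, and the sketch starts at the 10th item for simplicity; in practice the starting point is often chosen at random.

```python
# Hypothetical production line: products numbered 1 through 100 in order.
products = list(range(1, 101))

k = 10  # sampling interval: select every 10th product

# Slice starting at the k-th item (index k - 1), stepping by k.
systematic_sample = products[k - 1::k]

print(systematic_sample)  # [10, 20, 30, 40, 50, 60, 70, 80, 90, 100]
```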
What type of data collection might be best to estimate whether a new medicine is more effective in curing the common cold than aspirin? - simulation - experiment - survey - observational
Experiment
A pharmaceutical company wants to determine if its new medicine helps heal bruises more quickly than not treating the bruise. What type of study would be best suited to make this determination? - Observational - Experimental - Simulation - Survey
Experimental
Which graph would be best to display the dispersion of ages of movie goers at the latest Marvel Universe picture? - Pie chart - Pareto chart - Histogram - Line plot
Histogram
Based on past experience, it is predicted that 31% of registered voters in Oklahoma will vote in the next primary. The 31% would be considered an example of: - Inferential statistics - Qualitative data - Descriptive statistics - A sample
Inferential statistics
Based on a poll, we estimate that Mickey Mouse will receive 35% of the vote in the most popular cartoon character contest. This is an example of: - qualitative data - a Pareto chart - inferential statistics - descriptive statistics
Inferential statistics
Estimate that the overall average satisfaction is between 45 and 50%.
Inferential statistics
A data set that includes the years in which your hometown team won championships would be classified as what type of data? - nominal - ratio - interval - ordinal
Interval
A meteorologist wants to create a graph that displays the high temperature for each day for the past 30 days. Which of the following graphs might be most effective to use? - Line chart - Pie chart - Pareto chart - Scatter plot
Line chart
A sales manager wants to create a graph that displays the sales figures for each month for the last 12 months. Which of the following graphs might be most effective to use? - Pareto chart - Scatter plot - Line chart - Pie chart
Line chart
Which graph would be best to display movie ticket sales over time in months? - Line plot - Scatter plot - Bar chart - Pie chart
Line plot
A car comes in 5 possible colors: red, gray, brown, black, and white. - nominal - ordinal
Nominal
A form asks a person to indicate a country of birth. - nominal - ordinal
Nominal
A survey asks users to enter a number indicating political affiliation: 1 for Libertarian, 2 for Democratic, 3 for Republican, and 4 for Other. - nominal - ordinal
Nominal
The data set that lists a company's competitors and their locations would be classified as what type of data? - Nominal - Ratio - Ordinal - Interval
Nominal
Cameras are set up to watch an intersection and determine how many cars are let through with each green light interval. This study design would be considered: - survey - experimental - observational - simulation
Observational
What type of data collection might be best to estimate the amount of time individuals spend socializing? - simulation - experiment - survey - observational
Observational
What type of data collection might be best to study how many books students bring into the media center during finals week? - Simulation - Survey - Observational - Experiment
Observational
Collecting data on happiness and money by giving a five-question survey to individuals in a downtown area would be collecting data through an _____. - experiment - observational study
Observational study
Count and size data for trout and salamanders were collected from two types of forest areas, clear-cut and old growth. Data has been collected yearly 1987-2019. This data was collected through an _____. - experiment - observational study
Observational study
A data set that includes the values given by customers to determine their level of satisfaction with their purchases. The scale used is very satisfied, somewhat satisfied, somewhat dissatisfied, and very dissatisfied. The data of the customer responses would be classified as what type of data? - Interval - Ratio - Nominal - Ordinal
Ordinal
A movie has 5 possible ratings: G, PG, PG-13, R, and NC-17. - Nominal - Ordinal
Ordinal
An Amazon product has 5 possible ratings: 1 star, 2 stars, 3 stars, 4 stars, or 5 stars. - nominal - ordinal
Ordinal
A survey of all 35 employees at a small company finds that 89% of them like the recent changes to the company's benefits. Is this percentage a parameter or a statistic and why? - parameter as it represents the sample - parameter as it represents the population - statistic as it represents the sample - statistic as it represents the population
Parameter as it represents the population
A survey of all 35 employees at a small company finds that 89% of them like the recent changes to the company's benefits. Is this percentage a parameter or a statistic and why? - statistic as it represents the population - parameter as it represents the sample - statistic as it represents the sample - parameter as it represents the population
Parameter as it represents the population
A car wash company has locations in five cities. The owner wants to use a graph to display the percentage of total cars washed in each city. The percentages must add up to 100%. Which of the following graphs would be the best for displaying this data? - histogram - dot plot - bar chart - pie chart
Pie chart
Cluster sample
Population is divided into groups, some groups are selected and all observations from each selected group are included in the sample.
Among 500 people at the concert, a survey of 35 showed that 28% found it too loud. What is the population and what is the sample? - population: 500 at that concert; sample: the 35 in the survey - population: all concert goers; sample: the 28% who found it too loud - population: 500 at that concert; sample: the 28% who found it too loud - population: all concert goers; sample: the 500 at that concert
Population: 500 at that concert; Sample: the 35 in the survey.
A survey of 385 people who like wild sweaters found that 74% had a wild holiday sweater. What is the population and what is the sample? - Population: people who like wild sweaters; Sample: the 385 people who like wild sweaters. - Population: people who like wild sweaters; Sample: the 385 people who had a wild holiday sweater. - Population: people who like wild sweaters; Sample: the 74% that had a wild holiday sweater - Population: people who like sweaters; Sample: the 74% that had a wild holiday sweater.
Population: people who like wild sweaters; Sample: the 385 people who like wild sweaters.
Descriptive statistics
Provide ways to explore the observed data through visualizations and numerical summaries.
Classify the data of all the political parties in the world. - classical - quantitative - qualitative - statistics
Qualitative
If a data set included the color of cars in a parking lot, those data would be considered: - interval data - ratio data - quantitative data - qualitative data
Qualitative data
"Numerical variable" is likely another term for a _____ variable. - quantitative - categorical
Quantitative
A car's age is a ______ variable. - quantitative - categorical
Quantitative
A house's square footage is a _____ variable. - quantitative - categorical
Quantitative
Classify the data of the number of customers at a restaurant. - Qualitative - Statistics - Quantitative - Classical
Quantitative
The ages of 20 first graders would be considered: - quantitative data - nominal data - interval data - qualitative data
Quantitative data
A customer relations director needs to know which of three email messaging strategies the company currently uses causes the highest customer satisfaction score. Which data collection strategy should be used to best meet the director's needs? - randomly assign one of the three messaging strategies to a sample of current customers and then collect customer satisfaction data. - collect a random sample of prior customer satisfaction data from customers who received each of the three messaging strategies.
Randomly assign one of the three messaging strategies to a sample of current customers and then collect customer satisfaction data.
A data set that includes the number of products that were produced within each hour by a company would be classified as what type of data? - interval - ordinal - ratio - nominal
Ratio
The annual salaries for all teachers in Ohio would be considered: - ratio data - qualitative data - sample data - interval data
Ratio data
The milligrams of tar in 30 cigarettes would be considered: - interval data - ratio data - ordinal data - nominal data
Ratio data
A medical researcher wants to study if the number of burgers eaten over a year can be used to estimate/predict one's cholesterol level. Which of the following graphs would be the best for displaying this data? - bar chart - scatter plot - frequency polygon - stem and leaf
Scatter plot
We want to see if a person's height can be used to predict their weight. Which of the following graphs would be the best for displaying this data? - bar chart - scatter plot - stem and leaf - frequency polygon
Scatter plot
In a survey of 1000 adults, 34% found they prefer charcoal to gas grills. The 34% would be considered a: - population - parameter - sample - statistic
Statistic
A survey of 481 of your customers shows that 79% of them like the recent changes to the product. Is this percentage a parameter or a statistic and why? - Statistic as it represents the sample. - Parameter as it represents the sample - Parameter as it represents the population - Statistic as it represents the population
Statistic as it represents the sample
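The parameter/statistic distinction can be illustrated with a small simulation. Everything below is hypothetical: a made-up population of 10,000 customers in which roughly 79% like the change, from which 481 are sampled. The parameter is computed from the whole population, the statistic from the sample only, so the two are usually close but not identical.

```python
import random

rng = random.Random(42)  # fixed seed so the sketch is repeatable

# Hypothetical population of 10,000 customers; True means "likes the change".
population = [rng.random() < 0.79 for _ in range(10_000)]

# Parameter: computed from EVERY member of the population.
parameter = sum(population) / len(population)

# Statistic: computed from a random sample of 481 customers only.
sample = rng.sample(population, 481)
statistic = sum(sample) / len(sample)

print(f"parameter = {parameter:.3f}, statistic = {statistic:.3f}")
```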
An analyst creates a visualization of monthly active users on a website. - Step 1 - Step 2 - Step 3 - Step 4
Step 3
A researcher randomly selects and interviews fifty male and fifty female teachers. This sampling technique is called: - stratified - random - cluster - convenience
Stratified
To gather information on customer satisfaction, a researcher goes into each store and interviews six randomly selected customers at each store. This sampling technique is called: - stratified - Convenience - Cluster - Random
Stratified
A personnel director at a large company would like to determine whether the company cafeteria is widely used by employees. She calls each employee and asks them whether they usually bring their own lunch, eat at the company cafeteria, or go out for lunch. This study design would be considered: - Survey - Simulation - Observational - Experimental
Survey
A political campaign wants to know the public's reaction to their candidate's latest policy initiative. What type of study would be best suited to make this determination? - Observational - Experimental - Survey - Simulation
Survey
What type of data collection might be best to estimate how many books individuals read in the past 6 months? - Survey - Experiment - Observational - Simulation
Survey
Every fifth person boarding a plane is asked additional security questions. This sampling technique is called: - random - cluster - stratified - systematic
Systematic
You need to study the satisfaction of customers of a specific restaurant. You ask every 10th customer as they leave after their meal. This would most closely describe which type of sampling technique? - Stratified - Random - Systematic - Convenience
Systematic
Sample
The 7,000 train passengers surveyed
Mean
The average
Data
The city reported by each passenger
Range
The difference between the largest and smallest data points
Which of the following graphs would be a line graph? - This graph has a vertical and horizontal axis. It shows a blue shape generally traveling downwards from left to right. There are three peaks with each peak being lower than the one to the left of it. The first peak is at the far left. Then we go downwards before climbing to the second peak that is lower than the first. Then we go downwards again before climbing to the third peak that is lower than the second peak. After the third peak, we go downwards to the end of the graph. - This graph has a vertical and horizontal axis. It looks like a line that moves generally down to a low point. Then it moves up to a high point. Finally, it begins moving downwards again. - This graph shows 6 contiguous vertical bars with height generally decreasing from left to right. - This graph has a vertical and horizontal axis. It has points plotted in a generally increasing pattern from left to right. There is also a red line of best fit plotted through the middle of the points, also increasing from left to right.
This graph has a vertical and horizontal axis. It looks like a line that moves generally down to a low point. Then it moves up to a high point. Finally, it begins moving downwards again.
Median
The middle value in a dataset
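Finding the middle value means sorting first; with an even number of values, the two middle values are averaged. The helper below is a minimal sketch (the name median_of is illustrative), applied to two data sets that appear earlier in this set.

```python
def median_of(values):
    """Return the middle value of the sorted data (average of the two middle values if n is even)."""
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2

odd_count = [8, 9, 7, 508, 7, 1, 5, 3, 4]   # 9 values
even_count = [5, 7, 12, 56, 4, 2, 33, 21]   # 8 values

print(median_of(odd_count))   # 7
print(median_of(even_count))  # 9.5
```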
Mode
The most frequent number appearing in a dataset
Standard deviation
The square root of the variance
Which of the following graphs would be a dot plot? - This graph has a vertical and horizontal axis. It has points plotted in a generally increasing pattern from left to right. There is also a red line of best fit plotted through the middle of the points, also increasing from left to right. - This graph shows 6 horizontal bars with the lengths of the bars generally decreasing from bottom to top. - This graph shows 6 contiguous vertical bars with height generally decreasing from left to right. - This graph has a horizontal axis with 5 columns of vertical dots.
This graph has a horizontal axis with 5 columns of vertical dots.
Which of the following graphs would be considered a uniform distribution? - This graph is a histogram with 6 bars all of the same height. - This is a histogram with 5 bars. The tallest bar is in the middle. The height of the bars decreases on either side symmetrically with the smallest bars on the far left and far right. - This is a histogram with 6 bars. The tallest bars are on the far-left side of the grid. The bars decrease in height going from left to right with the shortest bars on the far right. - This is a histogram with 6 bars. The tallest bars are on the far-right side of the grid. The bars decrease in height going from right to left with the shortest bars on the far left.
This graph is a histogram with 6 bars all of the same height
Which of the following graphs would be considered normally distributed? - This is a histogram with 6 bars. The tallest bars are on the far-right side of the grid. The bars decrease in height going from right to left with the shortest bars on the far left. - This is a histogram with 5 bars. The tallest bar is in the middle. The height of the bars decreases on either side symmetrically with the smallest bars on the far left and far right. - This is a histogram with 6 bars. The tallest bars are on the far-left side of the grid. The bars decrease in height going from left to right with the shortest bars on the far right. - This graph is a histogram with 6 bars all of the same height.
This is a histogram with 5 bars. The tallest bar is in the middle. The height of the bars decreases on either side symmetrically with the smallest bars on the far left and far right.
Which of the following graphs would be considered right skewed? - This graph is a histogram with 6 bars all of the same height. - This is a histogram with 6 bars. The tallest bars are on the far-left side of the grid. The bars decrease in height going from left to right with the shortest bars on the far right. - This is a histogram with 6 bars. The tallest bars are on the far-right side of the grid. The bars decrease in height going from right to left with the shortest bars on the far left. - This is a histogram with 5 bars. The tallest bar is in the middle. The height of the bars decreases on either side symmetrically with the smallest bars on the far left and far right.
This is a histogram with 6 bars. The tallest bars are on the far-left side of the grid. The bars decrease in height going from left to right with the shortest bars on the far right.
Histogram
Vertical bar chart that shows frequency on the y-axis
Data for a project come from 30 randomly selected apple trees where each tree was assigned one time to be pruned: early-, mid-, or late-season and then the final apple yield for each tree was recorded. The data were collected from an experiment because the _____. - 30 apple trees were randomly selected - prune times were assigned to trees - final yield was collected for each pruned tree
Prune times were assigned to trees