221B EXAM 3
Yes. The distribution of mean differences is normal because the data have been determined to be normal.
A real estate company wants to evaluate two of their home appraisers for consistency in preparation for their annual state certification. It is important that different appraisers be able to give similar estimates for homes and other properties. The company selects a random sample of 10 homes in different neighborhoods around the city and independently asks each appraiser for an estimate of the home value. After they receive the estimates for each home, the company subtracts the value given by the first appraiser from the value given by the second appraiser. They create a Q-Q plot of the differences and see that the points in the plot roughly form a straight line. Are the requirements for using the desired confidence interval satisfied? Explain. Yes. The distribution of mean differences is normal because the sample size is large. Not necessarily, because the company needs to check that the values given by each appraiser are normal, not that the differences are normal. Yes. The distribution of mean differences is normal because the data have been determined to be normal. No. The distribution of mean differences is not normal because the sample size is not large enough.
1. 0.518 2. 0.735 3. 0.109
For the next 3 questions, construct a 95% confidence interval for the slope of the regression line obtained when 90-Days-before-flight ticket prices are used to predict 14-Days-before-flight ticket prices. 1. What is the lower bound of your confidence interval? Round to three decimal places. ____________(with margin: 0.003) 2. What is the upper bound of your confidence interval? Round to three decimal places. __________(with margin: 0.003) 3. Determine Margin of Error for this 95% confidence interval. ________(with margin: 0.003)
1. H0:β1=0 Ha:β1≠0 2. 11.509 3. 0
For the next 4 questions, conduct a hypothesis test to determine if the slope of the true linear regression line is not equal to zero. Question 390 / 1 pts 1. Choose the correct null and alternative hypotheses for this analysis. H0:b0=0 Ha:b0≠0 H0:b1=0 Ha:b1≠0 H0:β0=0 Ha:β0≠0 H0:β1=0 Ha:β1≠0 2. What is the value of the test statistic? Round to three decimal places. __________(with margin: 0.003) 3. What is the P-value? Round to three decimal places. __________(with margin: 0.003)
1. 115.64 2. 0.627
For the next two questions, compute the y-intercept and the slope of the linear regression line used to predict the price of tickets 14 days before a flight based on the price of tickets listed 90 days before the same flight. 1. What is the y-intercept? Round to three decimal places. _____________ (with margin: 0.003) 2. What is the slope? Round to three decimal places. _________(with margin: 0.003)
As percent compliance goes up by 1 percent, on average, 5 pounds are lost in three months.
For the study in the previous question, the b1 of the regression line was 5 pounds. Interpret this statistic. The average weight loss of all participants was 5 pounds. On average, the exercise participants lost 5 pounds more than non-exercisers. As percent compliance goes up by 1 percent, on average, 5 pounds are lost in three months. Not enough information is given to answer this question.
One sample t-test
In the United States, mothers who live in poverty generally have babies with lower birthweight than those who do not live in poverty. The mean birthweight for babies born in the U.S. to mothers living in poverty is approximately 2800 grams. The CDC carries out a study to test the effectiveness of a new prenatal care program in reducing the number of low birthweight babies born. For the study, 30 mothers, all of whom live in poverty, participate in the program and birthweight data is recorded. Which hypothesis test would be most appropriate for this study? One sample z-test One sample t-test Paired samples t-test Independent sample t-test ANOVA Test of one proportion Test of two proportions Chi-Squared test of independence Linear Regression
Yes. The distribution of sample means is normal because the data are normal.
Marine biologists have been studying the effects of acidification of the oceans on weights of male Beluga whales in the Arctic Ocean. One of the studies involves a random sample of 16 Beluga whales. The researchers want to create a 95% confidence interval to estimate the true mean weight of male Beluga whales. Their data follow a normal distribution. The population standard deviation of weights of male Beluga whales is σ=125 kg, and the researchers feel comfortable using this standard deviation for their confidence interval. Are the requirements for the use of a confidence interval met? Explain. Yes. The distribution of sample means is normal because the sample size is large. Yes. The distribution of sample means is normal because the data are normal. No. The fact that the data are normal does not imply that the distribution of sample means is normal. No. The distribution of sample means is not normal because the sample size is small.
0.78
Pressures of supply and demand act directly on the prices of airline tickets. As the seats available on the plane begin to fill, airlines raise the price. If seats on a flight do not sell well, an airline may discount the tickets or even cancel the flight. Allie Henrich, a BYU-Idaho student, WANTED TO PREDICT THE PRICE OF TICKETS 14 DAYS BEFORE A FLIGHT BASED ON THE PRICE OF TICKETS LISTED 90 DAYS BEFORE THE SAME FLIGHT. For her study she compared the prices of one-way flights from London's Heathrow Airport to various destinations in Europe. Using Travelocity.com, she recorded the lowest published fares for 90 nonstop midweek flights 90 days in advance and then found the lowest published price for the same destination again 14 days in advance. The prices (in US dollars) are given in the file DirectFlightCosts.xlsx (Links to an external site.). Notice that for some destinations, flights were not available. Compute the sample correlation coefficient of the price of tickets 14 days before compared with the price of tickets 90 days before. Give your answer accurate to three decimals. __________(with margin: 0.003)
1. An observational study, because the students had no control over the price of the individual cereals. 2. An experiment, because the students had control over which sampling method they used.
Students in an Introductory Statistics class at BYU-Idaho were studying prices of cold cereal at grocery stores in Rexburg. To get a sample of cold cereal prices, they went to Albertson's and rolled a die to decide which box from the left of the top shelf they would start on. They then recorded every 6th cereal after the first, moving from left to right down the shelves, recording the name, size, and price of each cereal in their sample. Is this study an experiment or an observational study, and why? (Choose all that apply) An observational study, because the students had no control over which cereals were selected. An observational study, because the students had no control over the price of the individual cereals. An experiment, because the students had control over which sampling method they used. An experiment, because the students had control over which cereals would be selected for the study.
95% of all 95% confidence intervals actually contain the true mean.
Suppose you create a 95% confidence interval for a mean, and get (-2.346, 8.775). You've been told to report this by saying, "We are 95% confident that the true mean is between -2.346 and 8.775." Exactly what does this mean? There is a 95% chance that the true mean is between -2.346 and 8.775. 95% of all 95% confidence intervals actually contain the true mean. 95% of the data are between -2.346 and 8.775. 95% of the sample means are between -2.346 and 8.775
0.096
Use the following information to answer the next 2 questions. Pressures of supply and demand act directly on the prices of airline tickets. As the seats available on the plane begin to fill, airlines raise the price. If seats on a flight do not sell well, an airline may discount the tickets or even cancel the flight. Allie Henrich, a BYU-Idaho student, wanted to determine if there is a difference in the price of tickets listed 90 days before a flight compared with 14 days before the same flight. For her study she compared the prices of one-way flights from London's Heathrow Airport to various destinations in Europe. Using Travelocity.com, she recorded the lowest published fares for 90 nonstop midweek flights 90 days in advance and then found the lowest published price for the same destination again 14 days in advance. The prices (in US dollars) are given in the file DirectFlightCosts.xlsx (Links to an external site.). Notice that for some destinations, flights were not available. What is the P-value for this test? Round your answer to 3 decimal places. (Example: 0.045) _________(with margin: 0.003)
His actual weight loss was 5.5 pounds higher than his predicted weight loss.
Weight Watchers wanted to predict how much weight a client would lose if they followed a prescribed exercise program in addition to their diet program. Volunteers were randomly divided into two groups, one group dieted but didn't exercise, and the other group dieted and followed the exercise program. For the exercise group, they used linear regression with percent compliance with the exercise program as the explanatory variable and pounds lost in three months as the response variable. One of the clients was told that his residual was 5.5 pounds. What does this mean? His predicted weight loss was 5.5 pounds higher than his actual weight loss. His actual weight loss was 5.5 pounds higher than his predicted weight loss. His actual weight loss was 5.5 pounds higher than it would have been if he didn't exercise. His predicted weight loss was 5.5 pounds higher than it would have been if he didn't exercise.
Reject the null. There is sufficient evidence to suggest that the slope of the true linear regression line does not equal zero. We conclude that there is a linear relationship between 14-day ticket prices and 90-day ticket prices.
What decision do you make based on the P-value and the level of significance (α=0.05), and, thus, what do you conclude? Fail to reject the null. There is sufficient evidence to suggest that the slope of the true linear regression line does not equal zero. We conclude that there is not a linear relationship between 14-day ticket prices and 90-day ticket prices. Fail to reject the null. There is insufficient evidence to suggest that the slope of the true linear regression line does not equal zero. We conclude that there is not a linear relationship between 14-day ticket prices and 90-day ticket prices. Reject the null. There is sufficient evidence to suggest that the slope of the true linear regression line does not equal zero. We conclude that there is a linear relationship between 14-day ticket prices and 90-day ticket prices. Reject the null. There is insufficient evidence to suggest that that the the slope of the true linear regression line does not equal zero. We conclude that there is a linear relationship between 14-day ticket prices and 90-day ticket prices.
1. The residuals are Normally distributed. 2. There is a linear relationship between X and Y. 3. The variance of the error term is constant for all values of X.
When creating a scatter plot, a residuals plot, and a QQ plot, which of the five assumptions about linear regression are being checked? Check all that apply. The X values are measured without error. The residuals are Normally distributed. The observations are independent. There is a linear relationship between X and Y. The variance of the error term is constant for all values of X.
The number of people in a randomly selected classroom on campus.
Which of the following variables is/are quantitative? (Mark all that apply) The telephone number of a randomly selected business in town. The 3 most common brands of car found in the US. The number of people in a randomly selected classroom on campus.
Linear Regression
A business marketing firm specializes in radio advertising. They hope to show there is a linear relationship between sales and the amount of money a client invests in radio advertising. Which procedure would be most appropriate for addressing this question? One sample z-test One sample t-test Paired samples t-test Independent sample t-test ANOVA Test of one proportion Test of two proportions Chi-Squared test of independence Linear Regression
No, because in at least one case n(1−p^)<10.
A study is planned to compare the proportion of men who dislike anchovies with the proportion of women who dislike anchovies. The study seeks to determine if the proportions of men and women who dislike anchovies are different. A sample of 41 men was taken and the p̂ estimate for the true proportion of men who dislike anchovies was determined to be 0.67. A sample of 56 women was also taken and the p̂ estimate for the true proportion of women who dislike anchovies was determined to be 0.84. Are the requirements satisfied to perform this hypothesis test? Why? Yes, because the sample sizes of both groups are greater than 5. Yes, because in both cases np^>10 . Yes, because we know that the populations are normally distributed. No, because in at least one case n(1−p^)<10.
μd≠0
A study was conducted to measure the effectiveness of a diet program that claims to help manage weight. Subjects were randomly selected to participate. Before beginning the program, each participant was given a score based on his or her fitness level. After six months of following the diet, each participant received another score. The study wanted to test whether there was a difference between before and after scores. What is the correct alternative hypothesis for this analysis? μ≠0 μd≠0 μ1≠μ2 p1≠p2
Skewed right.
A team of dermatological researchers were studying skin cancer among 60-70-year-old American men. Suppose you were a part of the research team and take a sample of men from this age group and calculate the mean number of skin cancer spots in the sample. Also, suppose that it is known that the distribution of the number of cancer spots is right-skewed. Suppose the sample size for this study was n=2. What would be the shape of the distribution of sample means? Skewed right. Normal. Skewed left. We cannot be sure until we examine the data.
50
If p = 4/5, what is the smallest value of n that satisfies the requirements for a normally distributed p̂? __________(with margin: 0)
H0:μd=0 Ha:μd≠0
Now suppose the company of the previous question wants to use a hypothesis test to determine whether the estimates given by the two appraisers are different. Which of the following would be the most appropriate pair of hypotheses for such a test? H0:μ1=μ2 Ha:μ1≠μ2 H0:μ1≠μ2 Ha:μ1=μ2 H0:μd=0 Ha:μd≠0 H0:μd≠0 Ha:μd=0
(146.8)
The scores on the LSAT are approximately normal with mean of 150.7 and standard deviation of 10.2. (Source: www.lsat.org.) Queen's School of Business in Kingston, Ontario requires a minimum LSAT score of 157 for admission. Find the 35th percentile of the LSAT scores. Give your answer accurate to one decimal place. Use the applet. (Example: 124.7) __________ (with margin: 0.3)
(2.5)
The time to complete a standardized exam in the BYU-Idaho Testing Center is approximately normal with a mean of 70 minutes and a standard deviation of 10 minutes. Using the 68-95-99.7 rule, approximately what percentage of students will complete the exam in under fifty minutes? Give your answer accurate to one decimal place. (Example: 24.7) ____________(with margin: 0)
1. (2.236) 2. (0.991)
Use the following information to answer the next 2 questions. Tanker trucks are designed to carry huge quantities of gasoline from refineries to filling stations. A factory that manufactures the tank of the trucks claims to manufacture tanks with a capacity of 8550 gallons of gasoline. The actual capacity of the tanks is normally distributed with mean μ=8544 gallons and standard deviation σ=12 gallons. 1. Suppose that a simple random sample of n = 20 tanks is selected and the sample mean capacity is found to be 8550 gallons. What is the z-score corresponding to this sample mean?. Round your answer to three decimal places. (Example: 0.398) ____________(with margin: 0.003) 2. What is the probability that the sample mean capacity of 50 randomly selected tanks will be greater than 8540 gallons? Round your answer to three decimal places. (Example: 0.398) _________ (with margin: 0.003)
Dependent sampling. He needs to select couples for his sample, so whether a particular wife is included depends on whether her husband is included.
A financial economist is studying married couples in which both spouses work. He wants to compare the mean income earned by husbands with the mean income earned by their wives. Should he use independent sampling or dependent sampling, and why? Independent sampling. How he selects the husband is independent of how he selects the wife. Dependent sampling. He needs to select couples for his sample, so whether a particular wife is included depends on whether her husband is included. Independent sampling. The two spouses are separate people with different jobs, so the husband's income doesn't depend on his wife's income. Dependent sampling. The husband's income depends to some degree on his wife's income.
51.2% of the gametes in the sample have the male chromosome.
A geneticist takes a simple random sample of n=140 gamete (sex) cells from the testes of a dog to determine the proportion which contain the chromosomes for development into a male. Assume the (unknown) true proportion of dog gametes that contain the male chromosome is 46%. The geneticist conducts her survey and calculates p̂=0.512. Which of the following is the most correct interpretation of this statistic? There is a 51.2% chance that a randomly selected gamete cell will have the male chromosome. 51.2% of the eggs fertilized with these gametes will produce female puppies. 51.2% of all the gametes in the dog's testes have the male chromosome. 51.2% of the gametes in the sample have the male chromosome.
Test of two proportions
A marriage counselor conducted a study of couples, categorizing each of the couples as "communicative" or "non-communicative". Among other things, the counselor wanted to see whether the percentage of communicative couples whose marriage ended in separation or divorce was the same as the percentage of non-communicative couples whose marriage ended in separation or divorce. Which hypothesis test would be most appropriate for this study? One sample z-test One sample t-test Paired samples t-test Independent sample t-test ANOVA Test of one proportion Test of two proportions Chi-Squared test of independence Linear Regression
0.182
Suppose you're testing H0:p=0.62 against Ha:p≠0.62 and you have calculated the test statistic to be z=1.334. The area to the right of z=1.334 (under the standard normal density curve) is 0.091. Which one of the following is the P-value of your hypothesis test? 0.046 0.091 0.182 0.909
Test of one proportion Test of two proportions
An attorney wants to know whether the percentage of Wal-Mart employees in Nevada who are over 45 years old is more than 30%. Which procedure would be most appropriate in this situation? One sample z-test One sample t-test Paired samples t-test Independent sample t-test ANOVA Test of one proportion Test of two proportions Chi-Squared test of independence Linear Regression
(0.019)
An investor with a stock portfolio worth several hundred thousand dollars sued his broker and brokerage firm because he felt that lack of diversification in his portfolio led to poor performance for many years in a row. In an effort to avoid close public scrutiny, the firm agreed to settle the conflict by an arbitration panel. The arbitration panel compared a sample of 39 months of the investor's returns with the average of the Standard & Poor's 500-stock index for the same period in order to determine whether there was a substantial decrease. The data are recorded in RatesOfReturn.xlsx (Links to an external site.). The S&P has a known population mean return of 0.95. Suppose that you are a member of this arbitration panel. Conduct a hypothesis test to determine if the investor's portfolio performed significantly worse than the performance of the S&P 500. Use a level of significance of α = 0.05. What is the P-value for this test? Give your answer accurate to three decimal places. (Example: 0.034) _________ (with margin: 0.003)
No, because the P-value was greater than the level of significance.
Based on the results of the test, can you say that there is a difference in the cost of airline tickets purchased 90 days in advance compared with 14 days in advance? Use a level of significance of α=0.05. No, because the P-value was greater than the level of significance. Yes, because the P-value was greater than the level of significance Yes, because the P-value was lower than the level of significance. No, because the P-value was lower than the level of significance.
Linear with a fairly strong positive association
Create a scatterplot using these two variables. Using this scatterplot and referring to the correlation coefficient, select the answer that best describes the data illustrated in the scatterplot. Linear with a fairly strong negative association Linear with a fairly strong positive association Linear with a strong negative association Nonlinear
One sample z-test
Engineers at an oil refinery find out how easily gasoline evaporates by measuring its vapor pressure. As they make gasoline, they need to determine whether its vapor pressure is high enough. The standard deviation of the vapor pressure of the gasoline made by the refinery is always 0.03 (pounds per square inch). Which hypothesis test would be most appropriate for this task? One sample z-test One sample t-test Paired samples t-test Independent sample t-test ANOVA Test of one proportion Test of two proportions Chi-Squared test of independence Linear Regression
It is not possible to determine this without the full data set.
For a Math 221 project, one group of students studied the ages of students on the BYU-Idaho campus. They collected data from a random sample of n=100 students. The sample mean was 21.2 and the sample standard deviation was 2.61. An excerpt of their data is given below. The group notices an error in their data. The age of one of the males (ID=3) was entered incorrectly. He is actually 21 years old. When the error is corrected, what will happen to the sample median? The median will increase. The median will decrease. It is not possible to determine this without the full data set. The median will stay the same.
241
Predict the price of tickets listed 14 days before a flight based on the same flight costing $200, 90 days before a flight. Give your answer accurate to 1 decimal place. (Example: 80.4) ___________(with margin: 0.3)
(0.099)
Researchers published a study in which they considered the incidence among the elderly of various mental health conditions such as dementia, bi-polar disorder, obsessive compulsive disorder, delirium, and Alzheimer's disease. In the U.S., 45% of adults over 65 suffer from one or more of the conditions considered in the study. Calculate the probability that fewer than 320 out of the n = 750 adults over 65 in the study suffer from one or more of the conditions under consideration. Give your answer accurate to three decimal places in decimal form. (Example: 0.398) ___________(with margin: 0.005)
Any n will do.
Suppose you are dog breeder and you want to use a confidence interval to estimate the true mean fertility levels of purebred cocker spaniels. It is known that the distribution of fertility levels of cocker spaniels is normal. How many measurements must you have in order to be sure the sampling distribution of x¯ is normal? n≥30 n≥10 n≥5 np≥10 and n(1−p)≥10 np≥5 and n(1−p)≥5 Any n will do.
Approximately 95% of all 95% confidence intervals that could be computed from the population of all adult females in the United States will contain the true mean height. We are 95% confident that the true mean height of adult females in the United States is somewhere in our confidence interval
The heights of young adult females in the United States are said to have a population standard deviation of σ=2.6 inches. A sample was taken of n=85 young adult females at BYU-I and the mean was computed to be x¯=65.4 inches. Which of the following explanations describes the correct way to interpret the 95% confidence interval for this problem? (Mark all that apply) 95% of the heights of female adults in our sample are in our confidence interval. 95% of all of the heights of female adults in the US are in our confidence interval. Approximately 95% of all 95% confidence intervals that could be computed from the population of all adult females in the United States will contain the true mean height. There is a 95% chance that the true mean height of adult females in the United States is in our confidence interval. We are 95% confident that the true mean height of adult females in the United States is somewhere in our confidence interval.
The probability of a type I error.
Which one of the following best describes the notion of "the significance level of a hypothesis test?" The probability of obtaining a test statistic at least as extreme as the one you calculated, assuming the null hypothesis is true. The probability of a type I error. The probability of a type II error. The probability of rejecting H0, whether it's true or not.
No. 40×0.233<10 so the conditions are not satisfied.
Your little brother has a a career batting average of 0.233. You want to determine the probability that he will have a batting average of over 0.250 next season when he will be up to bat 40 times. Are the conditions met so that the sample proportion p̂ will be approximately normal? Yes. There is a large sample (n>30) so the sample proportion must be normal. Yes. 40×0.250≥10 so the conditions are satisfied. No. p^ can only be normal if the sample data are normally distributed. No. 40×0.233<10 so the conditions are not satisfied.