Unit 3 ap stats

Ace your homework & exams now with Quizwiz!

A field researcher who studies lions conjectured that the more time a cub spends playing, the sooner the cub will begin to hunt. Observational data were collected from 20 lion cubs. The researcher recorded how long they spent playing and the age when they began hunting. Because male and female lions have different hunting behaviors, the researcher recorded the data for males and females separately. The two scatterplots show the data for the 10 female lions and the 10 male lions. a) For female cubs only b) For male cubs only c) For both male cubs and female cubs, with equal evidence d) For both male cubs and female cubs, with more evidence for female cubs than for male cubs e) For neither male cubs nor female cubs

A

An agriculturalist working with Australian pine trees wanted to investigate the relationship between the age and the height of the Australian pine. A random sample of Australian pine trees was selected, and the age, in years, and the height, in meters, was recorded for each tree in the sample. Based on the recorded data, the agriculturalist created the following regression equation to predict the height, in meters, of the Australian pine based on the age, in years, of the tree. predicted height = 0.29 + 0.48(age) Which of the following is the best interpretation of the slope of the regression line? a) The height increases, on average, by 1 meter each 0.48 year. b) The height increases, on average, by 0.48 meter each year. c) The height increases, on average, by 0.29 meter each year. d) The height increases, on average, by 0.29 meter each 0.48 year. e) The difference between the actual height and the predicted height is, on average, 0.48 meter for each year.

B

A factory has two machines, A and B, making the same part for refrigerators. The number of defective parts produced by each machine during the first hour of operation was recorded on 19 randomly selected days. The scatterplot below shows the number of defective parts produced by each machine on the selected days. Which statement gives the best comparison between the number of defective parts produced by the machines during the first hour of operation on the 19 days? a) Machine A always produced the same number of defective parts as machine B. b) Machine A always produced fewer defective parts than machine B. c) Machine A always produced more defective parts than machine B. d) Machine A usually, but not always, produced fewer defective parts than machine B. e) Machine A usually, but not always, produced more defective parts than machine B.

D

A botanist found a correlation between the length of an aspen leaf and its surface area to be 0.94. Why does the correlation value of 0.94 not necessarily indicate that a linear model is the most appropriate model for the relationship between length of an aspen leaf and its surface area? a) The value must be exactly 1 or −1 to indicate a linear model is the most appropriate model. b) The value must be 0 to indicate a linear model is the most appropriate model. c) A causal relationship should be established first before determining the most appropriate model. d) The value of 0.94 implies that only 88% of the data have a linear relationship. e) Even with a correlation value of 0.94, it is possible that the relationship could still be better represented by a nonlinear model.

E

The height h and collar size c, both in centimeters, measured from a sample of boys were used to create the regression line cˆ=−94+0.9h. The line is used to predict collar size from height, both in centimeters, for boys' shirt collars. Which of the following has no logical interpretation in context? a)The predicted collar size of a boy with height 140cm b) The h values in the sample c) The c values in the sample d) The slope of the regression line e) The c-intercept of the regression line

E

A researcher studying a specific type of tree creates a least-squares regression line for relating the height and the diameter, both in meters, of a fully grown tree. The results are shown in the following computer output. Which of the following values represents the predicted change in the height of the tree for each one-meter increase in the diameter of the tree? a) 30 b) 5 c) 4 d) 2.5 e) 1/30

a

Clear-cut harvesting of wood from forests creates long periods of time when certain animals cannot use the forests as habitats. Partial-cut harvesting is increasingly used to lessen the effects of logging on the animals. The following scatterplot shows the relationship between the density of red squirrels, in squirrels per plot, 2 to 4 years after partial-cut harvesting, and the percent of trees that were harvested in each of 11 forests. Which of the following is the best description of the relationship displayed in the scatterplot? a) Negative, linear, and strong b) Positive, linear, and weak c) Negative, nonlinear, and strong d) Positive, nonlinear, and weak e) Positive, nonlinear, and strong

a

Exercise physiologists are investigating the relationship between lean body mass (in kilograms) and the resting metabolic rate (in calories per day) in sedentary males. Based on the computer output above, which of the following is the best interpretation of the value of the slope of the regression line? a) For each additional kilogram of lean body mass, the resting metabolic rate increases on average by 22.563 calories per day. b) For each additional kilogram of lean body mass, the resting metabolic rate increases on average by 264.0 calories per day. c) For each additional kilogram of lean body mass, the resting metabolic rate increases on average by 144.9 calories per day. d) For each additional calorie per day for the resting metabolic rate, the lean body mass increases on average by 22.563 kilograms. e) For each additional calorie per day for the resting metabolic rate, the lean body mass increases on average by 264.0 kilograms.

a

A researcher collected data on the latitude, in degrees north of the equator, and the average low temperature, in degrees Fahrenheit, for a random sample of cities in Europe. The data were used to create the following equation for the least-squares regression line.predicted average low temperature=65.5−0.70(latitude) Which of the following is the best interpretation of the slope of the line? a) For each one degree north of the equator increase, the predicted average low temperature increases on average by 0.70 degree Fahrenheit. b) For each one degree north of the equator increase, the predicted average low temperature decreases on average by 0.70 degree Fahrenheit. c) For each 0.70 degree north of the equator increase, the predicted average low temperature decreases on average by 1 degree Fahrenheit. d) For each one degree Fahrenheit increase in average low temperature, the predicted latitude increases on average by 0.70 degree north of the equator. e) For each one degree Fahrenheit increase in average low temperature, the predicted latitude decreases on average by 0.70 degree north of the equator.

b

A restaurant manager collected data to predict monthly sales for the restaurant from monthly advertising expenses. The model created from the data showed that 36 percent of the variation in monthly sales could be explained by monthly advertising expenses. What was the value of the correlation coefficient? a) 0.64 b) 0.60 c) 0.40 d) 0.36 e) 0.13

b

A set of bivariate data was used to create a least-squares regression line. Which of the following is minimized by the line? a) The sum of the residuals b) The sum of the squared residuals c) The sum of the absolute values of the residuals d) The influence of outliers e) The slope

b

A tennis ball was thrown in the air. The height of the ball from the ground was recorded every millisecond from the time the ball was thrown until it reached the height from which it was thrown. The correlation between the time and height was computed to be 0. What does this correlation suggest about the relationship between the time and height? a) There is no relationship between time and height. b) There is no linear relationship between time and height. c) The distance the ball traveled upward is the same as the distance the ball traveled downward. d) The correlation suggests that there is measurement or calculation error. e) The correlation suggests that more measurements should be taken to better understand the relationship.

b

At a large airport, data were recorded for one month on how many baggage items were unloaded from each flight upon arrival as well as the time required to deliver all the baggage items on the flight to the baggage claim area. A scatterplot of the two variables indicated a strong, positive linear association between the variables. Which of the following statements is a correct interpretation of the word "strong" in the description of the association? a) A least-squares model predicts that the more baggage items that are unloaded from a flight, the greater the time required to deliver the items to the baggage claim area. b) The actual time required to deliver all the items to the baggage claim area based on the number of items unloaded will be very close to the time predicted by a least-squares model. c) The time required to deliver an item to the baggage claim area is relatively constant, regardless of the number of baggage items unloaded from a flight. d) The variability in the time required to deliver all items to the baggage claim area is about the same for all flights, regardless of the number of items unloaded from a flight. e) The time required to unload baggage items from a flight is related to the time required to deliver the items to the baggage claim area.

b

Dairy farmers are aware there is often a linear relationship between the age, in years, of a dairy cow and the amount of milk produced, in gallons per week. The least-squares regression line produced from a random sample is Milkˆ=40.8−1.1(Age). Based on the model, what is the difference in predicted amounts of milk produced between a cow of 5 years and a cow of 10 years? a) A cow of 5 years is predicted to produce 5.5 fewer gallons per week. b) A cow of 5 years is predicted to produce 5.5 more gallons per week. c) A cow of 5 years is predicted to produce 1.1 fewer gallons per week. d) A cow of 5 years is predicted to produce 1.1 more gallons per week. e) A cow of 5 years and a cow of 10 years are both predicted to produce 40.8 gallons per week.

b

In a recent survey, high school students and their parents were asked to rate 60 recently released movies. The ratings were on a scale from 1 to 9, where 1 was "horrible" and 9 was "excellent". For each movie, the average rating by the students and the average rating by their parents was calculated and the scatterplot below was constructed.The horizontal axis represents the student rating, and the vertical axis represents the parent rating.Thus, an individual data point would represent the rating of a single movie. Which of the following statements is justified by the scatterplot? a) The movies that the students liked the best also tended to be the movies that the parents liked the best, but the students tended to give lower scores. b) The movies that the students liked the best also tended to be the movies that the parents liked the best, but the students tended to give higher scores. c) The movies that the students liked the best also tended to be the movies that the parents liked the best, but each group tended to give the same scores. d) The movies that the students liked the best tended to be the movies that the parents liked the least, but the students tended to give lower scores. e) The movies that the students liked the best tended to be the movies that the parents liked the least, but the students tended to give higher scores.

b

The computer output below shows the result of a linear regression analysis for predicting the concentration of zinc, in parts per million (ppm), from the concentration of lead, in ppm, found in fish from a certain river. Which of the following statements is a correct interpretation of the value 19.0 in the output? a) On average there is a predicted increase of 19.0 ppm in concentration of lead for every increase of 1 ppm in concentration of zinc found in the fish. b) On average there is a predicted increase of 19.0 ppm in concentration of zinc for every increase of 1 ppm in concentration of lead found in the fish. c) The predicted concentration of zinc is 19.0 ppm in fish with no concentration of lead. d) The predicted concentration of lead is 19.0 ppm in fish with no concentration of zinc. e) Approximately 19% of the variability in zinc concentration is predicted by its linear relationship with lead concentration.

b

The following is a residual plot for a linear regression of y versus x. What is indicated by the plot? a) A linear model is appropriate. b) A linear model is not appropriate. c) Variability in y is constant for all values x. d) At least one point is influential with respect to the regression. e) At least one point is an outlier with respect to the regression.

b

There is a linear relationship between the number of chirps made by the striped ground cricket and the air temperature. A least squares fit of some data collected by a biologist gives the model ŷ = 25.2 + 3.3x 9 < x < 25, where x is the number of chirps per minute and ŷ is the estimated temperature in degrees Fahrenheit. What is the estimated increase in temperature that corresponds to an increase of 5 chirps per minute? a) 3.3 ° F b) 16.5 ° F c) 25.2 ° F d) 28.5 ° F e) 41.7 ° F

b

A researcher in Alaska measured the age (in months) and the weight (in pounds) of a random sample of adolescent moose. When the least-squares regression analysis was performed, the correlation was 0.59. Which of the following is the correct way to label the correlation? a) 0.59 months per pound b) 0.59 pounds per month c) 0.59 d) 0.59 months times pounds e) 0.59 month pounds

c

A restaurant manager collected data on the number of customers in a party in the restaurant and the time elapsed until the party left the restaurant. The manager computed a correlation of 0.78 between the two variables. What information does the correlation provide about the relationship between the number of customers in a party at the restaurant and the time elapsed until the party left the restaurant? a) The relationship is linear because the correlation is positive. b) The relationship is not linear because the correlation is positive. c) The parties with a larger number of customers are associated with the longer times elapsed until the party left the restaurant. d) The parties with a larger number of customers are associated with the shorter times elapsed until the party left the restaurant. e) There is no relationship between the number of customers in a party at a table in the restaurant and the time required until the party left the restaurant.

c

A roadrunner is a desert bird that tends to run instead of fly. While running, the roadrunner uses its tail as a balance. A sample of 10 roadrunners was taken, and the birds' total length, in centimeters (cm), and tail length, in cm, were recorded. The output shown in the table is from a least-squares regression to predict tail length given total length. Suppose a roadrunner has a total length of 59.0 cm and tail length of 31.1 cm. Based on the residual, does the regression model overestimate or underestimate the tail length of the roadrunner? a) Underestimate, because the residual is positive. b) Underestimate, because the residual is negative. c) Overestimate, because the residual is positive. d) Overestimate, because the residual is negative. e) Neither, because the residual is 0.

c

Data were collected on the fiber diameter and the fleece weight of wool taken from a sample of 20 sheep. The data are shown in the following graphs. Graph 1 is a scatterplot of fleece weight versus fiber diameter with the respective least-squares regression line shown. Graph 2 is the associated plot of the residuals versus the predicted values. One point is circled on graph 1. Five points labeled A, B, C, D, and E are identified on graph 2. Which point on graph 2 represents the residual for the circled point on graph 1 ? a) a b) b c) c d) d e) e

c

Researchers observed the grouping behavior of deer in different regions. The following scatterplot shows data collected on the size of the group and the percent of the region that was woodland. The relationship between group size and percent woodland appears to be negative and nonlinear. Which of the following statements explains such a relationship? a) As the percent of woodland increases, the number of deer observed in a group decreases at a fairly constant rate. b) As the percent of woodland increases, the number of deer observed in a group increases at a fairly constant rate. c) As the percent of woodland increases, the number of deer observed in a group decreases quickly at first and then more slowly. d) As the percent of woodland increases, the number of deer observed in a group increases quickly at first and then more slowly. e) As the percent of woodland increases, the number of deer observed in a group remains fairly constant.

c

The height and age of each child in a random sample of children was recorded. The value of the correlation coefficient between height and age for the children in the sample was 0.8. Based on the least-squares regression line created from the data to predict the height of a child based on age, which of the following is a correct statement? a) On average, the height of a child is 80% of the age of the child. b) The least-squares regression line of height versus age will have a slope of 0.8. c) The proportion of the variation in height that is explained by a regression on age is 0.64. d) The least-squares regression line will correctly predict height based on age 80% of the time. e) The least-squares regression line will correctly predict height based on age 64% of the time.

c

An engineer believes that there is a linear relationship between the thickness of an air filter and the amount of particulate matter that gets through the filter; that is, less pollution should get through thicker filters. The engineer tests many filters of different thickness and fits a linear model. If a linear model is appropriate, what should be apparent in the residual plot? a) There should be a positive, linear association in the residual plot. b) There should be a negative, linear association in the residual plot. c) All of the points must have residuals of 0. d) There should be no pattern in the residual plot. e) The residuals should have a small amount of variability for low values of the predictor variable and larger amounts of variability for high values of the predictor variable.

d

An experiment was conducted to investigate the relationship between the dose of a pain medication and the number of hours of pain relief. Twenty individuals with chronic pain were randomly assigned to one of five doses—0.0, 0.5, 1.0, 1.5, 2.0—in milligrams (mg) of medication. The results are shown in the scatterplot below. The data were used to fit a least-squares regression line to predict the number of hours of pain relief for a given dose. Which of the following would be revealed by a plot of the residuals of the regression versus the dose? a) The sum of the residuals is less than 0. b) The sum of the residuals is greater than 0. c) There are outliers associated with the lower doses. d) The variation in the hours of pain relief is not the same across the doses. e) There is a positive linear relationship between the residuals and the dose.

d

Consider n pairs of numbers (x1,y1), (x2,y2), ..., and (xn, yn). The mean and standard deviation of the x-values are x̄ =5 and sx = 4, respectively. The mean and standard deviation of the y-values are ȳ = 10 and sy= 10 respectively. Of the following, which could be the least squares regression line? a) ŷ = -5.0 + 3.0x b) ŷ = 3.0x c) ŷ = 5.0 + 2.5x d) ŷ = 8.5 + 0.3x e) ŷ = 10.0 + 0.4x

d

For a random sample of 20 professional athletes, there is a strong, linear relationship between the number of hours they exercise per week and their resting heart rate. For the athletes in the sample, those who exercise more hours per week tend to have lower resting heart rates than those who exercise less. Which of the following is a reasonable value for the correlation between the number of hours athletes exercise per week and their resting heart rate? a) 0.71 b) 0.00 c) −0.14 d) −0.87 e) −1.00

d

For a specific species of fish in a pond, a wildlife biologist wants to build a regression equation to predict the weight of a fish based on its length. The biologist collects a random sample of this species of fish and finds that the lengths vary from 0.75 to 1.35 inches. The biologist uses the data from the sample to create a single linear regression model. Would it be appropriate to use this model to predict the weight of a fish of this species that is 3 inches long? a) Yes, because 3 inches falls above the maximum value of lengths in the sample. b) Yes, because the regression equation is based on a random sample. c) Yes, because the association between length and weight is positive. d) No, because 3 inches falls above the maximum value of lengths in the sample. e) No, because there may not be any 3-inch fish of this species in the pond.

d

The least-squares regression line yˆ=1.8−0.2x summarizes the relationship between velocity, in feet per second, and depth, in feet, in measurements taken for a certain river, where x represents velocity and y represents the depth of the river. What is the predicted value of y, in feet, when x=5? a) −16 b) −1 c) −0.2 d) 0.8 e) 1.8

d

Which of the following is the best description of a positive association between two variables? a) The values will create a line when graphed on a scatterplot. b) The values will create a line with positive slope when graphed on a scatterplot. c) As the value of one of the variables increases, the value of the other variable tends to decrease. d) As the value of one of the variables increases, the value of the other variable tends to increase. e) All values of both variables are positive.

d

A family would like to build a linear regression equation to predict the amount of grain harvested per acre of land on their farm. They subdivide their land into several smaller plots of land for testing and would like to select an explanatory variable they can control. Which of the following is an appropriate explanatory variable that the family could use to create a linear regression equation? a) The total amount of rainfall recorded at their farm b)The type of crop planted in the plot the previous year c) The average daily temperature at their farm d) The variety of grain planted in the plot e) The amount of fertilizer applied to each plot of land

e

A researcher collected data on the age, in years, and the growth of sea turtles. The following graph is a residual plot of the regression of growth versus age. Does the residual plot support the appropriateness of a linear model? a) Yes, because there is a clear pattern displayed in the residual plot. b) Yes, because about half the residuals are positive and the other half are negative. c) Yes, because as age increases, the residuals increase. d) No, because the points appear to be randomly distributed. e) No, because the graph displays a U-shaped pattern.

e

Suppose a certain scale is not calibrated correctly, and as a result, the mass of any object is displayed as 0.75 kilogram less than its actual mass. What is the correlation between the actual masses of a set of objects and the respective masses of the same set of objects displayed by the scale? a) -1 b) -0.75 c) 0 d) 0.75 e) 1

e

The least-squares regression line Sˆ=0.5+1.1L models the relationship between the listing price and the actual sales price of 12 houses, with both amounts given in hundred-thousands of dollars. Let Lrepresent the listing price and S represent the sales price. Which of the following is the best interpretation of the slope of the regression line? a) For each hundred-thousand-dollar increase in the listing price, the sales price will increase by $1.1. b) For each hundred-thousand-dollar increase in the listing price, the sales price will increase by $110,000. c)For each hundred-thousand-dollar increase in the listing price, the sales price will decrease by $110,000. d) For each hundred-thousand-dollar increase in the listing price, the sales price is predicted to increase by $1.1.. e)For each hundred-thousand-dollar increase in the listing price, the sales price is predicted to increase by $110,000.

e

The table shows several values of x and their corresponding values of y. Which of the following is closest to the correlation between x and y? a) −0.98 b) −0.95 c) 0.20 d) 0.95 e) 0.98

e

For which of the following scatterplots is the correlation between x and y closest to 0 ?

u shape

Which of the following scatterplots could represent a data set with a correlation coefficient of r = -1?

negative linear (diagnal left to right)


Related study sets

Comprehensive Exam - Public Speaking - Chapter 1

View Set

AP US History Chapter 19 Questions

View Set

Business Management Organizational Design Test

View Set

Principles of Data Analytics Quiz #3

View Set

Ch 4 Environments and Strategic Management

View Set

Sling Load Inspector Certification Course

View Set