Stats study sets 2.3-4.6 and 6.7

Réussis tes devoirs et examens dès maintenant avec Quizwiz!

In a large set of data that are approximately normally distributed, r is the value in the data set that has a z-score of -1.00 s is the value of the first quartile, and t is the value of the 20th percentile. Which of the following is the correct order from least to greatest for the values of r, s, and t ?

B. r, t, s

A well-designed experiment should have which of the following characteristics? I. Subjects assigned randomly to treatments II. A control group or at least two treatment groups III. Replication

E. I, II and III

The distribution of assembly times required to assemble a certain smartphone is approximately normal with mean 4.6 minutes and standard deviation 0.6 minute. Of the following, which is closest to the percentage of assembly times between 4 minutes and 5 minutes?

C. 59%

Which of the following scatterplots could represent a data set with a correlation coefficient of r = -1?

A.

There is a linear relationship between the number of chirps made by the striped ground cricket and the air temperature. A least squares fit of some data collected by a biologist gives the model ŷ = 25.2 + 3.3x 9 < x < 25, where x is the number of chirps per minute and ŷ is the estimated temperature in degrees Fahrenheit. What is the estimated increase in temperature that corresponds to an increase of 5 chirps per minute?

B. 16.5 ° F

A random variable X has a mean of 120 and a standard deviation of 15. A random variable Y has a mean of 100 and a standard deviation of 9. If X and Y are independent, approximately what is the standard deviation of X-Y?

B. 17.5

A researcher studying a specific type of tree creates a least-squares regression line for relating the height and the diameter, both in meters, of a fully grown tree. The results are shown in the following computer output. Which of the following values represents the predicted change in the height of the tree for each one-meter increase in the diameter of the tree?

B. 30

Which of the following can be used to show a cause-and-effect relationship between two variables?

B. A controlled experiment

Which of the following is the best estimate of the standard deviation of the distribution shown in the figure above?

B. 10

For which of the following scatterplots is the correlation between x and y closest to 0 ?

E.

The table shows several values of x and their corresponding values of y. Which of the following is closest to the correlation between x and y?

E. 0.98

Carla wants to investigate whether a person's political party affiliation causes the person to be more vocal about political issues. She plans to administer a survey to a large sample of people. Which of the following describes why the method of data collection used will prevent Carla from achieving her goal?

B. Causation cannot be determined from a survey.

A roadrunner is a desert bird that tends to run instead of fly. While running, the roadrunner uses its tail as a balance. A sample of 10 roadrunners was taken, and the birds' total length, in centimeters (cm), and tail length, in cm, were recorded. The output shown in the table is from a least-squares regression to predict tail length given total length. Suppose a roadrunner has a total length of 59.0 cm and tail length of 31.1 cm. Based on the residual, does the regression model overestimate or underestimate the tail length of the roadrunner?

A. Underestimate, because the residual is positive.

In the design of a survey, which of the following best explains how to minimize response bias?

E. Carefully word and field-test survey questions.

The random variable X has a mean 12 and standard deviation 3. The random variable W is defined as W=7+2X. What are the mean and standard deviation of W?

D. The mean is 31, and the standard deviation is 6.

The random variable W has a geometric distribution with p = 0.25. Approximately how far do the values of W/ typically vary, on average, from the mean or the distribution?

D. 3.46

Scientists working for a water district measure the water level in a lake each day. The daily water level in the lake varies due to weather conditions and other factors. The daily water level has a distribution that is approximately normal with mean water level of 84.07 feet. The probability that the daily water level in the lake is at least 100 feet is 0.064. Which of the following is closest to the probability that on a randomly selected day the water level in the lake will be at least 90 feet?

A. 0.29

Zucchini weights are approximately normally distributed with mean 0.8 pound and standard deviation 0.25 pound. Which of the following shaded regions best represents the probability that a randomly selected zucchini will weigh between 0.55 pound and 1.3 pounds?

A.

The commuting time for a student to travel from home to a college campus is normally distributed with a mean of 30 minutes and a standard deviation of 5 minutes. If the student leaves home at 8:25 A.M., what is the probability that the student will arrive at the college campus later than 9 A.M.?

A. 0.16

The distribution of lengths of salmon from a certain river is approximately normal with standard deviation 3.5 inches. If 10 percent of salmon are longer than 30 inches, which of the following is closest to the mean of the distribution?

A. 26 inches

The manager of a public swimming pool wants to compare the effectiveness of two laundry detergents, Detergent A and Detergent B, in cleaning the towels that are used daily. As each dirty towel is turned in, it is placed into the only washing machine on the premises. When the washing machine contains 20 towels, the manager flips a coin to determine whether Detergent A or Detergent B will be used for that load. The cleanliness of the load of towels is rated on a scale of 1 to 10 by a person who does not know which detergent was used. The manager continues this experiment for many days. Which of the following best describes the manager's study?

A. A completely randomized design

A study was conducted to evaluate the impact of taking a nutritional supplement on a person's reaction time. One hundred volunteers were placed into one of three groups according to their athletic ability: low, moderate, or high. Participants in each group were randomly assigned to take either the nutritional supplement or a placebo for six weeks. At the end of the six weeks, participants were given a coordination task. The reaction time in completing the task was recorded for each participant. The study compared the reaction times between those taking the supplement and those taking the placebo within each athletic ability level. Which of the following is the best description of the study?

A. A randomized block design

At a certain clothing store, the clothes are displayed on racks. The clothes on each rack have similar prices, but the prices among the racks are very different. To estimate the typical price of a single piece of clothing, a consumer will randomly select four pieces of clothing from each rack. What type of sample is the consumer selecting?

A. A stratified random sample

A pharmaceutical company manufactures medicine to reduce pain caused by migraine headaches. The company is investigating whether a new medicine is more effective in reducing pain than the current medicine. A random sample of 500 participants who experience migraines was selected, and the participants were randomly assigned to one of two groups of equal size. The first group received the current medicine and the second group received the new medicine. When a participant experienced a migraine, he or she was instructed to take the medicine and, 15 minutes after taking the medicine, to rate the pain relief on a scale from 1 to 10, with 1 being no relief to 10 being complete relief. At the end of six months, the average pain relief for each participant was calculated. Which of the following is the best description of the study?

A. An experiment using a completely randomized design

A botanist found a correlation between the length of an aspen leaf and its surface area to be 0.94. Why does the correlation value of 0.94 not necessarily indicate that a linear model is the most appropriate model for the relationship between length of an aspen leaf and its surface area?

A. Even with a correlation value of 0.94, it is possible that the relationship could still be better represented by a nonlinear model.

Exercise physiologists are investigating the relationship between lean body mass (in kilograms) and the resting metabolic rate (in calories per day) in sedentary males. Based on the computer output above, which of the following is the best interpretation of the value of the slope of the regression line?

A. For each additional kilogram of lean body mass, the resting metabolic rate increases on average by 22.563 calories per day.

A field researcher who studies lions conjectured that the more time a cub spends playing, the sooner the cub will begin to hunt. Observational data were collected from 20 lion cubs. The researcher recorded how long they spent playing and the age when they began hunting. Because male and female lions have different hunting behaviors, the researcher recorded the data for males and females separately. The two scatterplots show the data for the 10 female lions and the 10 male lions. Based on the scatterplots, for which gender does there appear to be evidence that the more time a lion cub spends playing, the sooner the cub is likely to begin hunting?

A. For female cubs only

Clear-cut harvesting of wood from forests creates long periods of time when certain animals cannot use the forests as habitats. Partial-cut harvesting is increasingly used to lessen the effects of logging on the animals. The following scatterplot shows the relationship between the density of red squirrels, in squirrels per plot, 2 to 4 years after partial-cut harvesting, and the percent of trees that were harvested in each of 11 forests. Which of the following is the best description of the relationship displayed in the scatterplot?

A. Negative, linear, and strong

Researchers are investigating the effect of pHpH level in water on the breeding habits of the moon jellyfish. As part of a laboratory experiment, they will randomly assign one of three treatments, low pHpH, medium pHpH, or high pHpH, to the water in the tanks that hold the jellyfish. Which of the following is the best reason for the random assignment of a treatment level to an experimental unit?

A. Randomization tends to minimize the effects of uncontrolled variables, such as water temperature, so that such factors are not confounded with the treatment effects.

A city has designed a survey to collect information about residents' opinions about city services. Which of the following describes a scenario in which nonresponse bias is likely present?

A. Surveys were mailed to 500 people, and 200 of the surveys were completed and returned.

The quality-control manager of a large factory is concerned about the number of defective items produced by workers. Thirty workers at the factory agree to participate in a study of three different incentive plans to help reduce the number of defective items produced. The plans will be randomly assigned to the workers so that 10 workers received each plan. The reduction in the number of defective items produced by each worker will be recorded two weeks after the plans are implemented. Which of the following best describes why a completely randomized design is an appropriate design to use in this situation?

A. There is no blocking variable, and incentive plans will be randomly assigned to the workers.

A chemist for a paint company conducted an experiment to investigate whether a new outdoor paint will last longer than the older paint. Fifty blocks made from the same wood were randomly assigned to be painted with either the new paint or the old paint. The blocks were placed into a weather-controlled room that simulated extreme weather conditions such as ice, temperature, wind, and sleet. After one month in the room, the blocks were removed, and each block was rated on texture, shine, brightness of color, and chipping. The results showed that the blocks painted with the new paint generally had higher ratings than the blocks painted with the old paint. However, an analysis of the results found that the difference in ratings was not statistically significant. What can be concluded from the experiment?

A. There is not enough evidence to attribute the higher ratings to the new paint.

The amount of time required for each of 100 mice to navigate through a maze was recorded. The histogram below shows the distribution of times, in seconds, for the 100 mice.

B. 10 seconds

A company sells concrete in batches of 5 cubic yards. The probability distribution of X, the number of cubic yards sold in a single order for concrete from this company, is shown in the table below. The expected value of the probability distribution of Xis 10 75 and the standard deviation is 5 76 There is a fixed cost to deliver the concrete. The profit Y, in dollars, for a particular order can be described by M= 75X- 100. What is the standard deviation of Y.

B. $432.00

A restaurant manager collected data to predict monthly sales for the restaurant from monthly advertising expenses. The model created from the data showed that 36 percent of the variation in monthly sales could be explained by monthly advertising expenses. What was the value of the correlation coefficient?

B. 0.60

A grocery store receives deliveries of corn from two farms, one in Iowa and the other in Ohio. Both farms produce ears of corn with mean weight 1.26 pounds. The standard deviation of the weights of the ears of corn from the farm in Ohio is 0.01 pound greater than that from the farm in Iowa. A randomly selected ear of corn from the farm in Iowa weighed 1.39 pounds, which has a standardized score of 1.645 for the distribution of weights for the Iowa corn. If an ear of corn from the farm in Ohio weighs 1.39 pounds, how many standard deviations from the mean is the weight with respect to the Ohio distribution?

B. 1.46 standard deviations above the mean

A company ships gift baskets that contain apples and pears. The distributions of weight for the apples, the pears and the baskets are each approximately normal. The mean and standard deviation for each distribution is shown in the table below. The weights of the items are assumed to be independent. Let the random variable W represent the total weight of 4 apples, 6 pears, and 1 basket. Which of the following is closest to the standard deviation of W?

B. 1.97 ounces

Dairy farmers are aware there is often a linear relationship between the age, in years, of a dairy cow and the amount of milk produced, in gallons per week. The least-squares regression line produced from a random sample is Milk=40.8−1.1(Age). Based on the model, what is the difference in predicted amounts of milk produced between a cow of 5 years and a cow of 10 years?

B. A cow of 5 years is predicted to produce 5.5 more gallons per week.

The following is a residual plot for a linear regression of y versus x. What is indicated by the plot?

B. A linear model is not appropriate.

At a large conference of teachers from a variety of subjects, a random sample of 50 mathematics teachers attending the conference was selected. Among the selected mathematics teachers, 28 percent had taken one or more courses in statistics. For which of the following populations is 28 percent a reasonable estimate of the percentage of those who have taken one or more courses in statistics?

B. All mathematics teachers who attended the conference

A florist wanted to investigate whether a new powder added to the water of cut flowers helps to keep the flowers fresh longer than just water alone. For a shipment of roses that was delivered to the store, the florist flipped a coin before placing each rose in its own individual container with water. If the coin landed heads up, the rose was placed in water with the new powder; otherwise, the rose was placed in water alone. Which of the following is the best description of the method used by the florist?

B. An experiment with a completely randomized design

In a study to determine whether miles driven is a good predictor of trade-in value, 11 cars of the same age, make, model, and condition were randomly selected. The following scatterplot shows trade-in value and mileage for those cars. Five of the points are labeled A, B, C, D, and E, respectively. Which of the five labeled points is the most influential with respect to a regression of trade-in value versus miles driven?

B. B

In a certain school, students can choose whether to eat in the school's cafeteria. A reporter working for the school's newspaper polled students on their reactions to changes in the menu at the cafeteria. For each student leaving the cafeteria in one 30-minute time period, the reporter used a coin to determine whether to stop the student and ask how he or she felt about the new menu. In the reporter's article it was stated that a random sample of the students showed that 89 percent of the school's student population was happy with the new menu. Which of the following statements is true?

B. Because students self-selected whether to eat in the cafeteria, the sampling method might be biased and the sample might not be representative of all students in the school.

A researcher wanted to study the effects of a certain chemical on cell growth. The chemical was to be applied at two different doses, high and low, to two different cell types, strain A and strain B. Each combination of dose and cell type was to be replicated ten times. To have consistency from one replicate to the next, the researcher decided to use four lab technicians. One technician would be assigned the high dose with strain A. A second would be assigned the low dose with strain A. A third would be assigned the high dose with strain B. A fourth would be assigned the low dose with strain B. The assignment of lab technician to the replicates for a combination of dose and cell type would be randomized. A statistician told the researcher that the design could be improved by controlling confounding variables. Which of the following is potentially a confounding variable in this study?

B. Lab technician

The computer output below shows the result of a linear regression analysis for predicting the concentration of zinc, in parts per million (ppm), from the concentration of lead, in ppm, found in fish from a certain river. Which of the following statements is a correct interpretation of the value 19.0 in the output?

B. On average there is a predicted increase of 19.0 ppm in concentration of zinc for every increase of 1 ppm in concentration of lead found in the fish.

A large simple random sample of people aged nineteen to thirty living in the state of Colorado was surveyed to determine which of two MP3 players just developed by a new company was preferred. To which of the following populations can the results of this survey be safely generalized?

B. Only people aged nineteen to thirty living in the state of Colorado

A polling firm is interested in surveying a representative sample of registered voters in the United States. The firm has automated its sampling so that random phone numbers within the United States are called. Each time a number is called, the procedure below is followed. • If there is no response or if an answering machine is reached, another number is automatically called. • If a person answers, a survey worker verifies that the person is at least 18 years of age. • If the person is not at least 18 years of age, no response is recorded, and another number is called. • If the person is at least 18 years of age, that person is surveyed. Some people claim the procedure being used does not permit the results to be extended to all registered voters. Which of the following is NOT a legitimate concern about the procedure being used?

B. Registered voters with unlisted telephone numbers may be underrepresented in the sample.

At a large airport, data were recorded for one month on how many baggage items were unloaded from each flight upon arrival as well as the time required to deliver all the baggage items on the flight to the baggage claim area. A scatterplot of the two variables indicated a strong, positive linear association between the variables. Which of the following statements is a correct interpretation of the word "strong" in the description of the association?

B. The actual time required to deliver all the items to the baggage claim area based on the number of items unloaded will be very close to the time predicted by a least-squares model.

In a certain computer card game, the player is awarded 5 points for each card that is moved to a correct position. The player is penalized 10 points for each minute the game is played. Let the random variable X represent the number of cards moved to a correct position, and let the random variable Y represent the number of minutes the game is played. The means and standard deviations of the random variables for a particular laver are shown in the table below. Assume that X and Y are independent. What are the expected value and the standard deviation of the points per game for the player?

B. The expected value is -6.5, and the standard deviation is 65.4.

An agriculturalist working with Australian pine trees wanted to investigate the relationship between the age and the height of the Australian pine. A random sample of Australian pine trees was selected, and the age, in years, and the height, in meters, was recorded for each tree in the sample. Based on the recorded data, the agriculturalist created the following regression equation to predict the height, in meters, of the Australian pine based on the age, in years, of the tree. predicted height = 0.29 + 0.48(age) Which of the following is the best interpretation of the slope of the regression line?

B. The height increases, on average, by 0.48 meter each year.

In a recent survey, high school students and their parents were asked to rate 60 recently released movies. The ratings were on a scale from 1 to 9, where 1 was "horrible" and 9 was "excellent". For each movie, the average rating by the students and the average rating by their parents was calculated and the scatterplot below was constructed.The horizontal axis represents the student rating, and the vertical axis represents the parent rating.Thus, an individual data point would represent the rating of a single movie. Which of the following statements is justified by the scatterplot?

B. The movies that the students liked the best also tended to be the movies that the parents liked the best, but the students tended to give higher scores.

Staff members of a high school newspaper want to obtain an estimate of the average number of years teachers in the state have been teaching. At an educational conference attended by many teachers in the state, the staff members randomly selected 50 conference attendees and asked the attendees how long they have been teaching. Which of the following describes the sample and the population to which it would be most reasonable for the staff members to generalize the results?

B. The sample is the 50 conference attendees, and the population is all conference attendees.

A researcher conducted an experiment to study the effects of an herbal supplement on the duration of the common cold. From a sample of 50 people who had a cold, the researcher assigned 25 people to take the supplement each day. The other 25 people were asked to drink water each day and were not given the supplement. The researcher recorded the number of days the cold lasted for each person. What are the experimental units of the study?

B. The sample of 50 people who had a cold

Some contact lens wearers report problems with dryness in their eyes. A study was conducted to evaluate the effectiveness of a new eye-drop solution to relieve dryness for contact lens wearers. Twenty-five volunteers who wore contact lenses agreed to use the new solution for one month. At the end of the month, 36 percent of the volunteers reported that the new solution was effective in relieving dryness. The company that produced the new eye-drop solution concluded that using the new solution is more effective in relieving dryness than using no solution. Which of the following best explains why the study does not support such a conclusion?

B. The study had no control group.

A set of bivariate data was used to create a least-squares regression line. Which of the following is minimized by the line?

B. The sum of the squared residuals

A tennis ball was thrown in the air. The height of the ball from the ground was recorded every millisecond from the time the ball was thrown until it reached the height from which it was thrown. The correlation between the time and height was computed to be 0. What does this correlation suggest about the relationship between the time and height?

B. There is no linear relationship between time and height.

To compare the effectiveness of two treatments, researchers conducted a well-designed experiment using a randomized block design in which the subjects were blocked by age-group (under 40 years and 40 years or older). Which of the following must be true about the randomized block design of the experiment?

B. Treatments are randomly assigned to subjects within each block.

A researcher in Alaska measured the age (in months) and the weight (in pounds) of a random sample of adolescent moose. When the least-squares regression analysis was performed, the correlation was 0.59. Which of the following is the correct way to label the correlation?

C. 0.59

For a certain online store, the distribution of number of purchases per hour is approximately normal with mean 1,200 purchases and standard deviation 200 purchases. For what proportion of hours will the number of purchases at the online store exceed 1,400 ?

C. 16%

Let X represent the number on the face that lands up when a fair six-sided number cube is tossed. The expected value of X is 3.5, and the standard deviation of X is approximately 1.708. Two fair sided number cubes will be tossed, and the numbers appearing on the faces that land up will be added. Which of the following values is closest to the standard deviation of the resulting sum?

C. 2.415

At a certain store, the distribution of weights of cartons of large eggs is approximately normal with mean 26 ounces (oz). Based on the distribution, which of the following intervals will contain the greatest proportion of cartons of large eggs at the store?

C. 24 oz to 28 oz

The relationship between carbon dioxide emissions and fuel efficiency of a certain car can be modeled by the least-squares regression equation ln(yˆ)=7−0.045x, where xx represents the fuel efficiency, in miles per gallon, and y represents the predicted carbon dioxide emissions, in grams per mile. Which of the following is closest to the predicted carbon dioxide emissions, in grams per mile, for a car of this type with a fuel efficiency of 20 miles per gallon?

C. 446

A certain monthly magazine has both print and online subscribers. Print subscribers are people who pay to have the magazine physically delivered to them each month. Online subscribers are people who pay to have access to the electronic version of the magazine. The editors of the magazine want to study how online subscribers feel about the design of the electronic version, and they will gather data from a sample. Which of the following is a sample of the population of interest?

C. 50 online subscribers

To investigate whether the consumption of beetroot juice enhances exercise performance, a researcher selected a random sample of 5050 student athletes from all the student athletes at a college. The athletes in the sample were randomly assigned to one of two groups. In one group, 2525 athletes were given a daily dose of beetroot juice, and in the other group, the remaining athletes were given a daily dose of a placebo. At the end of six weeks of exercise training, the researcher compared the performances of the two groups. Based on the design of the investigation, which of the following is the largest population to which the results can be generalized?

C. All student athletes at the college

Researchers observed the grouping behavior of deer in different regions. The following scatterplot shows data collected on the size of the group and the percent of the region that was woodland. The relationship between group size and percent woodland appears to be negative and nonlinear. Which of the following statements explains such a relationship?

C. As the percent of woodland increases, the number of deer observed in a group decreases quickly at first and then more slowly.

Data were collected on the fiber diameter and the fleece weight of wool taken from a sample of 20 sheep. The data are shown in the following graphs. Graph 1 is a scatterplot of fleece weight versus fiber diameter with the respective least-squares regression line shown. Graph 2 is the associated plot of the residuals versus the predicted values. One point is circled on graph 1. Five points labeled A, B, C, D, and E are identified on graph 2. Which point on graph 2 represents the residual for the circled point on graph 1 ?

C. C

Which of the following statements about a least-squares regression analysis is true? I. A point with a large residual is an outlier. II. A point with high leverage has a yy-value that is not consistent with the other yy-values in the set. III. The removal of an influential point from a data set could change the value of the correlation coefficient.

C. III only

Measurements of water quality were taken from a river downstream from an abandoned chemical dumpsite. Concentrations of a certain chemical were obtained from 9 measurements taken at the surface of the water, 9 measurements taken at mid-depth of the water, and 9 measurements taken at the bottom of the water. What type of study was conducted, and what is the response variable of the study?

D. An observational study was conducted, and the response variable is the concentration of the chemical.

Researchers wanted to investigate whether babies have a sense of right and wrong. They showed each of 60 babies a puppet show in which a red puppet was trying to open a heavy box lid. A second puppet, called the helper, would try to help the red puppet open the box, while a third puppet, called the hinderer, would try to slam the box lid down. After watching the show, each baby was presented with a tray containing the helper puppet and the hinderer puppet, and the researchers recorded which puppet the baby reached for. The researchers wanted to determine whether the babies would reach for the helper puppet more than for the hinderer puppet.As part of the show, a green puppet and a yellow puppet served as the helper and hinderer. For each baby, a coin was tossed to determine which color would serve which role. Which of the following is the most important reason for the random assignment of color to role in the study?

C. If the same role is played by the same color puppet, the babies might show a preference for the color instead of a preference for the role.

In an observational study treatments are not randomly assigned, whereas in a randomized experiment treatments are randomly assigned.

C. In an observational study treatments are not randomly assigned, whereas in a randomized experiment treatments are randomly assigned.

An experiment will be conducted to determine whether children learn their multiplication facts better by practicing with flash cards or by practicing on a computer. Children who volunteer for the experiment will be randomly assigned to one of the two treatments. Because the children's gender may affect the outcome, there will be blocking by gender. After practice, the children will be given a test on their multiplication facts. Why will it be impossible to conduct a double-blind experiment?

C. The child will know whether he or she used flash cards or the computer.

An agricultural scientist wanted to compare the effect of a new fertilizer to that of three older fertilizers—X, Y, and Z—on the growth of vegetables typically grown in small gardens. Two hundred green bean seedlings were individually planted in identical pots and randomly assigned to one of four groups of 50 each. Seedlings in one group were given the new fertilizer, and the three remaining groups of seedlings were given fertilizers X, Y, or Z, respectively. At the end of four weeks, all seedlings were dried and weighed. The scientist found that the mean weight of the seedlings in the group given the new fertilizer was significantly greater than the mean weights of seedlings in the other three groups. The scientist concluded that the new fertilizer was more effective than the other fertilizers for all vegetables. Why is the scientist's conclusion not appropriate?

C. The experiment only included green beans, so the results cannot be generalized to all vegetables.

The director of a fitness center wants to examine the effects of two exercise classes (spinning and aerobics) on body fat percentage. A six-week spinning class and a six-week aerobics class are offered at the same time and on the same days, so that a person can enroll in only one of them. A new class of each is about to begin, and each class has 25 people in it. Ten people are randomly selected from each class. Each person's body fat percentage is measured at the beginning and again at the end of the six-week class. Using the change in body fat percentage as the response variable and conducting a test at the a = 0.01 level, the director determines that there is a significant difference between the treatment means. Which of the following is a confounding variable in the study?

C. The participants' choice of which class to take.

A restaurant manager collected data on the number of customers in a party in the restaurant and the time elapsed until the party left the restaurant. The manager computed a correlation of 0.78 between the two variables. What information does the correlation provide about the relationship between the number of customers in a party at the restaurant and the time elapsed until the party left the restaurant?

C. The parties with a larger number of customers are associated with the longer times elapsed until the party left the restaurant.

Under which of the following conditions is it preferable to use stratified random sampling rather than simple random sampling?

C. The population can be divided into strata so that the individuals in each stratum are as much alike as possible.

The height and age of each child in a random sample of children was recorded. The value of the correlation coefficient between height and age for the children in the sample was 0.8. Based on the least-squares regression line created from the data to predict the height of a child based on age, which of the following is a correct statement?

C. The proportion of the variation in height that is explained by a regression on age is 0.64.

A certain county school district has 15 high schools. The high school seniors' plans after graduation in each school vary greatly from one school to the next. The county superintendent will select a sample of high school seniors from the district to survey about their plans after graduation. The superintendent will use a cluster sample with the high schools as clusters. A random sample of 5 high schools will be selected, and all seniors at those high schools will complete the survey. What is one disadvantage to selecting a cluster sample to investigate the superintendent's goal?

C. The schools in the cluster sample might not be representative of the population of seniors.

A scatterplot of student height, in inches, versus corresponding arm span length, in inches, is shown below. One of the points in the graph is labeled A.

C. The slope of the least squares regression line increases and the correlation coefficient increases.

Nyasha's financial literacy project involved comparing the annual sales of companies in Canada and companies in the United States that produce software. Using the ratio of 1 Canadian dollar to 0.75 United States dollar, she converted all annual sales from the Canadian companies into United States dollars. For which of the following will the value of the statistic for the annual sales in Canadian dollars be equal to the value of the corresponding statistic in United States dollars?

C. The standardized score of the minimum annual sales

Researchers conducted a study to investigate the effects of soft drink consumption on fat stored in muscle tissue. From a sample of 80 adult volunteers, 40 were randomly assigned to consume one liter of a soft drink each day. The remaining 40 were asked to drink one liter of water each day and not to consume any soft drinks. At the end of six months, the amount of fat stored in each person's muscle tissue was recorded. The people in the group who drank the soft drink had, on average, a higher percentage of fat stored in the tissue than the people who drank only water. Which of the following is the most appropriate conclusion?

C. There is evidence that consuming soft drinks causes more fat storage in muscle tissue than drinking only water, and the conclusion can be generalized to adults similar to those in the study.

A company that makes fleece clothing uses fleece produced from two farms. Northern Farm and Western Farm. Let the random variable X represent the weight of fleece produced b a sheep from Northern Farm. The distribution of X has mean 14.1 pounds and standard deviation 13 pounds. Let the random variable represent the weight of fleece produced by a sheep from Western Farm. The distribution of Y has mean 6.7 pounds and standard deviation 0.5 pound. Assume X and Y are independent. Let W equal the total weight of fleece from 10 randomly selected sheep from Northern Farm and 15 randomly selected sheep from Western farm. Which of the following is the standard deviation, in pounds of W?

C. √10(1.3)^2 + 15(o.5)^2

Let X be a random variable whose values are the number of dots that appear on the uppermost face when a fair die is rolled. The possible values of X are 1, 2, 3, 4, 5, and 6. The mean of X is 7/2 and the variance of X is 35/12. Let Y be the random variable whose value is the difference (first minus second) between the number of dots that appear on the uppermost face for the first and second rolls of a fair die that is rolled twice. What is the standard deviation of Y

C. √35/12 +35/12

The Attila Barbell Company makes bars for weight lifting. The weights of the bars are independent and are normally distributed with a mean of 720 ounces (45 pounds) and a standard deviation of 4 ounces. The bars are shipped 10 in a box to the retailers. The weights of the empty boxes are normally distributed with a mean of 320 ounces and a standard deviation of 8 ounces. The weights of the boxes filled with 10 bars are expected to be normally distributed with a mean of 7,520 ounces and standard deviation of

C. √80 ounces

Which of the following is the best description of a positive association between two variables?

D. As the value of one of the variables increases, the value of the other variable tends to increase.

For a random sample of 20 professional athletes, there is a strong, linear relationship between the number of hours they exercise per week and their resting heart rate. For the athletes in the sample, those who exercise more hours per week tend to have lower resting heart rates than those who exercise less. Which of the following is a reasonable value for the correlation between the number of hours athletes exercise per week and their resting heart rate?

D. -0.87

A distribution of scores is approximately normal with a mean of 78 and a standard deviation of 8.6. Which of the following equations can be used to find the score x above which 33 percent of the scores fall?

D. 0.44 = x−78/8.6

A produce supplier ships boxes of produce to individual customers. The distribution of weights of shipped boxes is approximately normal with mean 36 pounds and standard deviation 4 pounds. Which expression represents the weight, in pounds, at the 75th percentile of the distribution?

D. 0.67(4) + 36

The least-squares regression line yˆ=1.8−0.2x summarizes the relationship between velocity, in feet per second, and depth, in feet, in measurements taken for a certain river, where x represents velocity and y represents the depth of the river. What is the predicted value of y, in feet, when x=5?

D. 0.8

An exponential relationship exists between the explanatory variable and the response variable in a set of data. The common logarithm of each value of the response variable is taken, and the least-squares regression line has an equation of log(yˆ)=7.3−1.5x. Which of the following is closest to the predicted value of the response variable for x=4.8?

D. 1.26

The heights of adult women are approximately normally distributed about a mean of 65 inches with a standard deviation of 2 inches. If Rachael is at the 99th percentile in height for adult women, then her height, in inches, is closest to

D. 70

The distribution of weights of female college cross-country runners is approximately normal with mean 122 pounds and standard deviation 8 pounds. Which of the following is closest to the percent of the runners who weigh between 114 pounds and 138 pounds?

D. 82%

Researchers used two footballs of the same size to examine the effect of helium on kicking distance. One football was filled with air, and the other was filled with helium. Eleven people participated in the study. Each person kicked the football filled with air and the football filled with helium, and the kicking distances, in yards, were recorded. The football that was kicked first was determined by the flip of a fair coin, and the people did not know which football was filled with air and which was filled with helium. What type of study was conducted by the researchers and, of the following, which is the appropriate t-interval for inference?

D. A matched-pairs design and a t-interval for a mean difference

George and Michelle each claimed to have the better recipe for chocolate chip cookies. They decided to conduct a study to determine whose cookies were really better. They each baked a batch of cookies using their own recipe. George asked a random sample of his friends to taste his cookies and to complete a questionnaire on their quality. Michelle asked a random sample of her friends to complete the same questionnaire for her cookies. They then compared the results. Which of the following statements about this study is false?

D. Because George and Michelle used the same questionnaire, their results will generalize to the combined population of their friends.

A program that was intended to cure a person's fear of spiders was offered at a local zoo. Volunteers with a fear of spiders participated in the program, which included holding a spider for 15 minutes. One month after they completed the program, the participants were contacted and surveyed about the program. Over 90 percent of the participants claimed they were cured of their fear of spiders. Based on the description of the program, which of the following statements is true?

D. Because the participants were self-selected, a person's desire to be cured could be a confounding variable.

A dog food company wishes to test a new high-protein formula for puppy food to determine whether it promotes faster weight gain than the existing formula for that puppy food. Puppies participating in an experiment will be weighed at weaning (when they begin to eat puppy food) and will be weighed at one-month intervals for one year. In designing this experiment, the investigators wish to reduce the variability due to natural differences in puppy growth rates. Which of the following strategies is most appropriate for accomplishing this?

D. Block on dog breed and randomly assign puppies to existing and new formula groups within each breed.

An observational study found that the amount of sleep an employee gets each night is associated with job performance. The correlation coefficient was found to be r=0.86. A reader of the study concluded that more sleep causes employees to perform better. Why is such a conclusion not correct?

D. Causation cannot be determined from an observational study.

A compact disc (CD) manufacturer wanted to determine which of two different cover designs for a newly released CD will generate more sales. The manufacturer chose 70 stores to sell the CD. Thirty-five of these stores were randomly assigned to sell CDs with one of the cover designs and the other 35 were assigned to sell the CDs with the other cover design. The manufacturer recorded the number of CDs sold at each of the stores and found a significant difference between the mean number of CDs sold for the two cover designs. Which of the following gives the conclusion that should be made based on the results and provides the best explanation for the conclusion?

D. It is reasonable to conclude that the difference in sales was caused by the different cover designs because the cover designs were randomly assigned to stores.

A factory has two machines, A and B, making the same part for refrigerators. The number of defective parts produced by each machine during the first hour of operation was recorded on 19 randomly selected days. The scatterplot below shows the number of defective parts produced by each machine on the selected days. Which statement gives the best comparison between the number of defective parts produced by the machines during the first hour of operation on the 19 days?

D. Machine A usually, but not always, produced fewer defective parts than machine B.

A researcher conducting a telephone survey is concerned about possible sources of bias. Of the following, which is the best example of nonresponse bias?

D. Many of the people selected to participate in the survey who do not respond might have opinions different from those who do respond.

For a specific species of fish in a pond, a wildlife biologist wants to build a regression equation to predict the weight of a fish based on its length. The biologist collects a random sample of this species of fish and finds that the lengths vary from 0.75 to 1.35 inches. The biologist uses the data from the sample to create a single linear regression model. Would it be appropriate to use this model to predict the weight of a fish of this species that is 3 inches long?

D. No, because 3 inches falls above the maximum value of lengths in the sample.

To check the effect of cold temperature on the elasticity of two brands of rubber bands, one box of Brand A and one box of Brand B rubber bands are tested. Ten bands from the Brand A box are placed in a freezer for two hours and ten bands from the Brand B box are kept at room temperature. The amount of stretch before breakage is measured on each rubber band, and the mean for the cold bands is compared to the mean for the others. Is this a good experimental design?

D. No, because temperature is confounded with brand.

A randomized block design will be used in an experiment to compare two lotions that protect people from getting sunburned. Which of the following should guide the formation of the blocks?

D. Participants within each block should be as similar as possible with respect to how easily they get sunburned.

Consider n pairs of numbers (x1,y1), (x2,y2), ..., and (xn, yn). The mean and standard deviation of the x-values are x̄ =5 and sx = 4, respectively. The mean and standard deviation of the y-values are ȳ = 10 and sy = 10 respectively. Of the following, which could be the least squares regression line?

D. ŷ = 8.5 + 0.3x

The owner of a food store conducted a study to investigate whether displaying organic fruit at the front of the store rather than at the back of the store will increase sales of the fruit. At the beginning of each week, the organic fruit display was randomly assigned to either the front or the back of the store, and sales for the week were recorded. At the end of 12 months, the owner determined that the average weekly sales of organic fruit displayed at the front of the store were greater than the average weekly sales of organic fruit displayed at the back of the store. The difference was statistically significant. What can be concluded from the study?

D. Placing the display at the front instead of the back of the store causes an increase, on average, of weekly sales of the organic fruit.

To estimate the percent of red marbles in a large bag of marbles, Margo will use the following sampling method. She will randomly select a marble, record its color, put it back into the bag, shake the bag to thoroughly mix the marbles, and then repeat those steps. She will perform the procedure many times. What type of sampling method is Margo using?

D. Random sampling with replacement

In a recent poll of 1,500 randomly selected eligible voters, only 525 (35 percent) said that they did not vote in the last election. However, a vote count showed that 80 percent of eligible voters actually did not vote in the last election. Which of the following types of bias is most likely to have occurred in the poll?

D. Response bias

A school nutritionist was interested in how students at a certain school would feel after taking a nutritional supplement. The nutritionist selected a random sample of twenty students from the school to participate in the study. Participants were asked to keep a journal on how well they felt after taking the supplement each day. What possible source of bias is present in the method of data collection?

D. Response bias where responses are self-reported

Researchers will use a well-designed experiment to test the effectiveness of a new drug versus a placebo in relieving symptoms of the common cold. Which of the following will provide evidence that the new drug causes relief of symptoms?

D. The difference between the responses to the new drug and the placebo must be shown to be statistically significant to provide evidence that the new drug causes relief.

As part of a study on facility needs, the administrators of a university wanted to estimate the percent of students who use the exercise facilities on a regular basis. From the 34,000 students who attend the university, a random sample of 370 male students and 400 female students was selected. Of the 770 students selected, 493 students indicated that they use the exercise facilities on a regular basis. What are the population and the sample of the study?

D. The population is the 34,000 students who attend the university, and the sample is the 770 students who were selected.

Mr. Ikeler conducted a study investigating the effectiveness of a new method for teaching a mathematics unit. He recruited 80 students at a college and randomly assigned them to two groups. Group 1 was taught with the new method, and group 2 was taught with the traditional method. Both groups were taught by the same teacher. At the end of the unit, an achievement test was administered and used to make a comparison of the two groups. What is the response variable in the study?

D. The score on the achievement test

The director of a community recreation center conducted a six-week study to examine the effects of four types of exercise—strength training, flexibility training, aerobics, and jogging—on maximal oxygen consumption. From the 40 members who participated, the director randomly assigned 10 members to each exercise type. Maximal oxygen consumption was measured for each member at the beginning of the study and again at the end of the six weeks. The director examined the change in maximal oxygen consumption for each member. Which of the following statements is a correct description of a feature of the study?

D. The treatments in the study are strength training, flexibility training, aerobics, and jogging.

An experiment was conducted to investigate the relationship between the dose of a pain medication and the number of hours of pain relief. Twenty individuals with chronic pain were randomly assigned to one of five doses—0.0, 0.5, 1.0, 1.5, 2.0—in milligrams (mg) of medication. The results are shown in the scatterplot below. The data were used to fit a least-squares regression line to predict the number of hours of pain relief for a given dose. Which of the following would be revealed by a plot of the residuals of the regression versus the dose?

D. The variation in the hours of pain relief is not the same across the doses.

An engineer believes that there is a linear relationship between the thickness of an air filter and the amount of particulate matter that gets through the filter; that is, less pollution should get through thicker filters. The engineer tests many filters of different thickness and fits a linear model. If a linear model is appropriate, what should be apparent in the residual plot?

D. There should be no pattern in the residual plot.

The distribution of the weights of loaves of bread from a certain bakery follows approximately a normal distribution. Based on a very large sample, it was found that 10 percent of the loaves weighed less than 15.34 ounces, and 20 percent of the loaves weighed more than 16.31 ounces. What are the mean and standard deviation of the distribution of the weights of the loaves of bread?

D. µ = 15.93, σ = 0.46

The following scatterplot shows two variables, xx and yy, along with a least-squares model. Which of the following is a high leverage point with respect to the regression?

E. (80, 70)

Suppose a certain scale is not calibrated correctly, and as a result, the mass of any object is displayed as 0.75 kilogram less than its actual mass. What is the correlation between the actual masses of a set of objects and the respective masses of the same set of objects displayed by the scale?

E. 1

According to a survey about how workers get to work in Wyoming. 77 percent of workers get to work by driving alone, 11 percent get to work by carpooling, 4 percent get to work by walking, and 8 percent get to work by other means of transportation. Suppose a sample of 200 Wyoming workers is selected at random. Let the random variable I represent the number of workers in the sample who get to work by driving alone. What is the expected value of D?

E. 154

A recent report indicated that 2 percent of the households in a certain community speak a language other than English at home.A reporter will randomly select a household from the community until the first household that speaks a language other than English at home is selected. Let random variable Q represent the number of attempts needed until the first household that speaks a language other than English at home is selected. The random variable Q has a geometric distribution with p = 0.22

E. 16.1157

For the purpose of determining the value of its end-of-year inventory, a clothing store creates a list at the end of the year of every item currently in stock along with each item's wholesale price. Which of the following is the best description of the end-of-the-year activity?

E. A census, because the wholesale prices of all items are listed.

A researcher selects a simple random sample of 1,200 women who are students at Midwestern colleges in the United States to use for an observational study. Which of the following describes the population to which it would be most reasonable to generalize the results?

E. All women who are students at Midwestern colleges in the United States

Athletes in a particular sport are classified as either offense or defense. The distribution of weights for the athletes classified as offense is approximately normal, centered at 200 pounds, and ranges from 150 pounds to 250 pounds. The distribution of weights for the athletes classified as defense is approximately normal, centered at 300 pounds, and ranges from 250 pounds to 350 pounds. There are 1,000 athletes in each classification. Which of the following is the best description of a histogram of the weights of all 2,000 athletes?

E. Bimodal

The least-squares regression line Sˆ=0.5+1.1L models the relationship between the listing price and the actual sales price of 12 houses, with both amounts given in hundred-thousands of dollars. Let L represent the listing price and S represent the sales price. Which of the following is the best interpretation of the slope of the regression line?

E. For each hundred-thousand-dollar increase in the listing price, the sales price is predicted to increase by $110,000.

A local television news station includes a viewer survey question about a current issue at the beginning of every evening news broadcast. Viewers are invited to use social media to respond to the question. The results of the survey are shared with the audience at the end of each broadcast. In relation to the opinions of the population of the region, which of the following is a possible reason why the results of such surveys could be biased? I. Viewers with strong opinions about the current issue are more likely to respond than are viewers without strong opinions. II. The opinions of viewers of one television station are not necessarily representative of the population of a region. III. Viewers with access to social media are not necessarily representative of the population of a region.

E. I, II and III

The Physicians' Health Study, a large medical experiment involving 22,000 male physicians, attempted to determine whether aspirin could help prevent heart attacks. In this study, one group of about 11,000 physicians took an aspirin every other day, while a control group took a placebo. After several years, it was determined that the physicians in the group that took aspirin had significantly fewer heart attacks than the physicians in the control group. Which of the following statements explains why it would not be appropriate to say that everyone should take an aspirin every other day? I. The study included only physicians, and different results may occur in individuals in other occupations. II. The study included only males and there may be different results for females. III. Although taking aspirin may be helpful in preventing heart attacks, it may be harmful to some other aspects of health.

E. I, II, III

Consider a data set of positive values, at least two of which are not equal. Which of the following sample statistics will be changed when each value in this data set is multiplied by a constant whose absolute value is greater than 1? I. The mean II. The median III. The standard deviation

E. I, II, and III

A local employer asked for help selecting a new type of desk chair. Thirty employees volunteered, and each employee used the new desk chair for two weeks and the current desk chair for two weeks. To determine which chair was used first, a coin was flipped for each employee. Heads represented using the new chair first, and tails represented using the current chair first. At the end of each two-week period, the employees were asked to rate their satisfaction with the new chair. Which of the following best describes this study?

E. It is a well-designed experiment because there is random assignment, replication, and comparison of at least two treatment groups.

A researcher collected data on the age, in years, and the growth of sea turtles. The following graph is a residual plot of the regression of growth versus age. Does the residual plot support the appropriateness of a linear model?

E. No, because the graph displays a U-shaped pattern.

Data were collected on two variables, x and y, to create a model to predict y from x. A scatterplot of the collected data revealed a curved pattern with a possible cubic relationship (y=ax^3, where aa is a constant) between the variables. Which of the following transformations would be most appropriate for creating linearity between the variables?

E. Taking the log of both y and x

An experiment will be conducted in which 20 pepper plants are randomly assigned to two groups. The plants in Group 1 will receive the current fertilizer, Fertilizer A, and the plants in Group 2 will receive a new fertilizer, Fertilizer B. All other growing conditions, including amount of sunlight and water, will be kept the same for the two groups. The growth of the pepper plants will be compared for the two groups. What are the experimental units in this experiment?

E. The 20 plants in the two groups

A family would like to build a linear regression equation to predict the amount of grain harvested per acre of land on their farm. They subdivide their land into several smaller plots of land for testing and would like to select an explanatory variable they can control. Which of the following is an appropriate explanatory variable that the family could use to create a linear regression equation?

E. The amount of fertilizer applied to each plot of land

A bank surveyed all of its 60 employees to determine the proportion who participate in volunteer activities. Which of the following statements is true?

E. The bank does not need to use an inference procedure to determine the proportion of employees who participate in volunteer activities because the survey was a census of all employees.

The height h and collar size c, both in centimeters, measured from a sample of boys were used to create the regression line cˆ=−94+0.9h. The line is used to predict collar size from height, both in centimeters, for boys' shirt collars. Which of the following has no logical interpretation in context?

E. The c-intercept of the regression line

A new restaurant is interested in determining the best time-temperature combination for roasting a five-pound cut of lamb. The times to be tested are 45 minutes, 60 minutes, and 90 minutes at temperatures of 350 degrees Fahrenheit and 425 degrees Fahrenheit for each time, with the exception of the 90 minute - 425 degree combination. That combination is being eliminated because it will overcook the lamb, which leaves five combinations remaining. From 10 identical cuts of lamb, 2 are randomly selected to roast using each of the time-temperature combinations in the same oven. The quality of the finished product is evaluated for each roast. Which of the following is true?

E. The two cuts that are being roasted for each time-temperature combination are an example of replication.

Each person in a simple random sample of 2,000 received a survey, and 317 people returned their survey. How could nonresponse cause the results of the survey to be biased?

E. Those who did respond may differ in some important way from those who did not respond.


Ensembles d'études connexes

CHAPTER 1: FOUNDATIONS AND MODELS

View Set

Chapter 15.1: WI Life & Health State Laws

View Set

Exercise 5; Seed Plants: Gymnosperms & Angiosperms

View Set

Women's Health (Exam Practice questions)

View Set