Test 2

Name a situation in which a scatterplot is most useful for displaying measurement data.

1) FOR DISPLAYING THE RELATIONSHIP BETWEEN TWO MEASUREMENT VARIABLES.

A number of anomalies can cause misleading correlations. Name two problems that can distort correlations.

1) OUTLIERS CAN SUBSTANTIALLY INFLATE OR DEFLATE THEM; 2) GROUPS COMBINED INAPPROPRIATELY MAY MASK RELATIONSHIPS.
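
The first problem can be illustrated with a minimal sketch. The data values below are hypothetical, and `pearson_r` is a helper written for this illustration only; it computes the usual Pearson correlation coefficient by hand.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

# Hypothetical data with almost no linear trend.
xs = [1, 2, 3, 4, 5]
ys = [3, 1, 4, 2, 3]
r_without = pearson_r(xs, ys)

# The same data plus one point far removed from the rest.
r_with = pearson_r(xs + [20], ys + [20])

print(f"without outlier: r = {r_without:.2f}")
print(f"with outlier:    r = {r_with:.2f}")
```

In this made-up example a single added point moves the correlation from weak to very strong, which is why a scatterplot should be checked before a correlation is trusted.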

Most researchers are willing to declare that a relationship is statistically significant if the chances of observing the relationship in the sample when actually nothing is going on in the population are less than what percent? a. 5% b. 50% c. 95% d. None of the above

5%

The Empirical Rule is also known as the ___

68%, 95%, 99.7% Rule

Suppose you took a standardized test and the scores had a bell-shaped distribution. You only need three pieces of information in order to find your percentile in the population of test scores. What are those three pieces of information?

1) YOUR TEST SCORE; 2) THE MEAN OF THE POPULATION OF TEST SCORES; AND 3) THE STANDARD DEVIATION OF THE POPULATION OF TEST SCORES.
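
A short sketch of how those three pieces combine, assuming the scores follow a normal curve. The function name `percentile` and the example score of 612 are hypothetical; the mean of 497 and standard deviation of 115 are the GRE values used elsewhere in this set. `statistics.NormalDist` is part of the Python standard library (3.8 and later).

```python
from statistics import NormalDist

def percentile(score, mean, sd):
    """Percent of a normal population scoring below `score`."""
    return 100 * NormalDist(mu=mean, sigma=sd).cdf(score)

# Hypothetical score exactly 1 standard deviation above the mean.
p = percentile(612, 497, 115)
print(f"A score of 612 is at about the {p:.0f}th percentile")
```

The result is close to 84, which matches the Empirical Rule: 50% of scores lie below the mean and about 34% more lie between the mean and 1 standard deviation above it.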

Name three types of statistical pictures that are used to represent measurement data.

ANY 3 OF THE FOLLOWING ARE OK: 1) HISTOGRAM; 2) STEMPLOT; 3) LINE GRAPH; 4) SCATTERPLOT; OR 5) BOXPLOT.

Give an example where a randomized experiment cannot be done, even though we know that is the best way to try to establish a causal connection between two measurement variables.

ANY REASONABLE ANSWER OK. EXAMPLE: DOES SMOKING CAUSE LUNG CANCER?

The Empirical Rule says that for a normal curve, approximately 68% of the values fall within 1 standard deviation of the mean in either direction, while 95% of the values fall within 2 standard deviations of the mean in either direction. Explain why you don't have twice as many values within 2 standard deviations as you do within 1 standard deviation.

BECAUSE OF THE NORMAL, OR BELL-SHAPED CURVE. THE MAJORITY (68%) FALL CLOSE TO THE MEAN, WHERE THE "BELL" PART OF THE CURVE IS. AS YOU MOVE AWAY, YOU GET INTO THE TAILS OF THE CURVE, WHICH CONTAIN LESS AREA.
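
The areas can be checked numerically. This is a sketch that uses the standard identity that the area of a normal curve within k standard deviations of the mean equals erf(k / sqrt(2)); the helper name `area_within` is made up for this illustration.

```python
from math import erf, sqrt

def area_within(k):
    """Area under a normal curve within k standard deviations of the mean."""
    return erf(k / sqrt(2))

within_1 = area_within(1)
within_2 = area_within(2)
extra_band = within_2 - within_1  # area between 1 and 2 standard deviations

print(f"within 1 SD: {within_1:.1%}")
print(f"within 2 SD: {within_2:.1%}")
print(f"added by going from 1 SD to 2 SD: {extra_band:.1%}")
```

Doubling the width of the interval adds only about 27% more of the values, because the added band lies in the thinner part of the curve.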

A table that displays the number of individuals who fall into each combination of categorical variables is called a(n) ____ table

Contingency

The ___ between two measurement variables is an indicator of how closely their values fall to a straight line.

Correlation

A regression equation relating study time = X and exam score = Y (out of 100 points) is: Y = 21 + 4.5X. Explain clearly what the slope of 4.5 means in this situation.

For each additional hour of study time, the predicted exam score increases by 4.5 points.

Which of the following is not a type of picture for organizing categorical data? A pie chart. A bar graph. A pictogram. A histogram.

Histogram

A regression equation relating study time = X and exam score = Y (out of 100 points) is: Y = 21 + 4.5X. Explain clearly what the y-intercept means in this situation.

If a student does not study at all (X = 0), the predicted exam score is 21.

Assuming there is a statistical relationship between height and weight for adult females, which of the following statements is true? • If we knew a woman's height, we could predict her weight. • If we knew a woman's height, we could determine the exact weight for all women with that same height. • If we knew a woman's height, we could predict the average weight for all women with that same height. • All of the above are true.

If we knew a woman's height, we could predict the average weight for all women with that same height.

Determine whether or not the following statement could be statistically correct. If not, explain why not. "The correlation between tree diameter and weight of fruit harvested was found to be 2.3."

NO. CORRELATION MUST BE BETWEEN -1 AND +1.

Determine whether or not the following statement could be statistically correct. If not, explain why not. "We found a strong correlation between gender and political party."

NO. CORRELATION REFERS TO TWO MEASUREMENT VARIABLES. GENDER AND POLITICAL PARTY ARE CATEGORICAL VARIABLES.

Suppose you are on a jury in a trial someday. How could you encounter Simpson's Paradox? • You could see data that were collected from two different studies, giving you two different results. • One side could present the data using two variables, and the other side could break the same data down by a third variable that reverses the direction of the results. • One side could use counts to summarize the data, and the other side could use percentages or rates, reversing the direction of the relationship. • All of the above.

One side could present the data using two variables, and the other side could break the same data down by a third variable that reverses the direction of the results.
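
A minimal sketch of such a reversal. All counts are hypothetical and were chosen only to produce the effect: options "A" and "B" are the two things being compared, and "easy" and "hard" are the levels of the third variable.

```python
# Hypothetical counts: (successes, total) for each option within each subgroup.
counts = {
    "A": {"easy": (9, 10), "hard": (30, 90)},
    "B": {"easy": (80, 90), "hard": (3, 10)},
}

def rate(successes, total):
    """Success rate as a proportion."""
    return successes / total

def overall_rate(option):
    """Success rate for an option with the subgroups combined."""
    successes = sum(s for s, _ in counts[option].values())
    total = sum(t for _, t in counts[option].values())
    return successes / total

for group in ("easy", "hard"):
    a = rate(*counts["A"][group])
    b = rate(*counts["B"][group])
    print(f"{group}: A = {a:.1%}, B = {b:.1%}")

print(f"overall: A = {overall_rate('A'):.1%}, B = {overall_rate('B'):.1%}")
```

With these numbers A has the higher rate in each subgroup, yet B has the higher rate once the subgroups are combined, because most of A's cases fall in the "hard" subgroup.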

A data point that is far removed from the rest of the data is called a(n) ___

Outlier

GRE scores are normally distributed with a mean of 497 and standard deviation of 115. • Draw a picture of the GRE scores showing the cut off values for the 99.7% of scores.

The picture should show a bell-shaped curve centered at 497, with the left and right ends marked 152 and 842 (99.7% of the area is within 3 standard deviations of the mean).
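
The cutoff values can be worked out as mean plus or minus k standard deviations. The sketch below does that arithmetic for k = 1, 2, 3; the helper name `cutoffs` is made up for this illustration.

```python
MEAN = 497  # GRE mean from the question
SD = 115    # GRE standard deviation from the question

def cutoffs(k):
    """Scores lying k standard deviations below and above the mean."""
    return MEAN - k * SD, MEAN + k * SD

for k, share in ((1, "68%"), (2, "95%"), (3, "99.7%")):
    low, high = cutoffs(k)
    print(f"about {share} of scores fall between {low} and {high}")
```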

A regression equation relating study time=X and exam score = Y (out of 100 points) is: Y= 21+4.5 X Would the correlation between study time and exam score be positive or negative? Explain.

Positive, since Y increases as X increases (slope is >0)

It is very difficult to establish a causal connection between two variables using anything except a ___

Randomized Experiment

A _____ represents the number of standard deviations the observed value or score falls above or below the mean.

STANDARD SCORE (OR Z-SCORE)

A(n) _____ is useful for displaying the relationship between two measurement variables.

Scatterplot

When omitting a third variable masks the relationship between two categorical variables, this phenomenon is called

Simpson's Paradox

Suppose the correlation between two measurement variables is −1. Which of the following statements is not true? • As one of the variables increases, the other decreases. • The data looks the same as when two variables have a deterministic linear relationship. • The correlation between the variables is very weak. • All of the above statements are true.

The correlation between the variables is very weak.

Suppose one individual in a certain population had a z-score of −2. Which of the following is true? • This is a good thing because the individual is above average. • This individual's measurement is 2 standard deviations below the mean. • This individual's original measurement was a negative number. • All of the above are true.

This individual's measurement is 2 standard deviations below the mean.

For any normal curve, almost all of the values will fall within ___ of the mean.

Three Standard Deviations

Which of the following describes a strong statistical correlation? • The value of one measurement variable is always equal to the square of the value of another measurement variable. • One measurement variable has a cause and effect relationship with another measurement variable. • Two measurement variables have a strong linear relationship. • All of the above.

Two measurement variables have a strong linear relationship.

In which case(s) should you be suspicious of a correlation that is presented? • When the data is likely to contain outliers. • When the sample size is small. • When removing one point in the data set actually reverses the direction of the trend. • All of the above

All of the above: when the data is likely to contain outliers, when the sample size is small, and when removing one point in the data set actually reverses the direction of the trend.

A regression equation relating study time = X and exam score = Y (out of 100 points) is: Y = 21 + 4.5X. What is the predicted score for 2 hours of study time?

Y=21+4.5(2)=30 points
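
The same calculation as a small sketch, assuming study time is measured in hours. The names `INTERCEPT`, `SLOPE`, and `predicted_score` are made up for this illustration.

```python
INTERCEPT = 21.0  # predicted score with no study time (X = 0)
SLOPE = 4.5       # predicted points gained per additional hour of study

def predicted_score(hours):
    """Predicted exam score from the regression equation Y = 21 + 4.5X."""
    return INTERCEPT + SLOPE * hours

print(predicted_score(2))                       # score for 2 hours
print(predicted_score(0))                       # the y-intercept
print(predicted_score(3) - predicted_score(2))  # the slope
```

The three printed values are the prediction for 2 hours, the y-intercept, and the slope, which is the change in predicted score for one extra hour.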

If there is no linear relationship between two measurement variables, the correlation is

Zero

Use the Empirical Rule to approximate the percentage of students with GRE scores below 382.

z = (382 − 497) / 115 = −1, and the Empirical Rule states that about 68% of all GRE scores will be within 1 standard deviation of the mean. That leaves 32% for the tails, so about 16% of all scores are below 382 (because of the symmetry of the normal curve).
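
As a check on the approximation, the sketch below compares the Empirical Rule figure with the area given by the normal curve itself, assuming GRE scores are exactly normal with mean 497 and standard deviation 115. The variable names are made up for this illustration.

```python
from statistics import NormalDist

gre = NormalDist(mu=497, sigma=115)

# Empirical Rule: 68% within 1 SD, so half of the remaining 32% is in the lower tail.
approx_below = (1 - 0.68) / 2

# Area under the normal curve to the left of 382.
exact_below = gre.cdf(382)

print(f"Empirical Rule: {approx_below:.0%}")
print(f"Normal curve:   {exact_below:.1%}")
```

The two figures differ by well under one percentage point, so the Empirical Rule is a good approximation here.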

With a frequency curve, to figure out what percentage or proportion of the population falls into a certain range, you have to figure out the ____ under the curve over that range.

area

A ____ can be used to represent two or three categorical variables simultaneously.

bar graph

The bell-shaped frequency curve is so common that if a population has this shape, the measurements are said to follow a _____ distribution.

normal

For a continuous random variable, the area under the curve means ___

probability

A student had a GRE score of 687. Find and interpret the standard score for this student.

z = (687 − 497) / 115 ≈ 1.65. The student scored about 1.65 standard deviations above the mean.

Suppose your score on the GRE (Graduate Records Exam) was at the 90th percentile. What does that mean? • You got 90% of the questions right. • 90% of the other students scored lower than you did. • 10% of the other students scored lower than you did. • None of the above.

• 90% of the other students scored lower than you did.

Which of the following describes the entire area underneath a frequency curve? • The entire area is 1 or 100%. • The entire area is equal to the total number of individuals in the population. • The entire area is equal to the total percentage of individuals in the population with the measurement being studied. • None of the above.

• The entire area is 1 or 100%.

