Statistics

If we compute a correlation on data that come only from the middle of the X distribution (restricted range) rather than from the entire range, the correlation is likely to be ______ the correlation from the entire range

Smaller than

Type of correlation used to find out whether there's a relationship between one interval variable and one ordinal variable?

Spearman

Type of correlation used with any ranks or ordinal scales:

Spearman

coefficient of determination is equal to...

r squared

To know whether there's a relationship between two variables, you draw a line around the outer edges of a scatterplot. You can tell there is no relationship when

the scatterplot is either circular or elliptical, with the ellipse being parallel to the X axis

When there is no relationship between two variables, the value of every Y' is equal to

the value of the Y-intercept

In a correlational analysis, N stands for the

total number of pairs of scores

The sum of the deviations of the true Y scores from the predicted Y' scores is always

zero
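
The zero-sum property can be checked numerically. The sketch below fits a least-squares line to a small set of made-up scores (the data are an assumption, not from the study set) and sums the deviations of each actual Y from its predicted Y'.

```python
# Made-up paired scores, used only for illustration.
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
n = len(xs)

# Least-squares slope and Y-intercept from the computational formulas.
sum_x, sum_y = sum(xs), sum(ys)
sum_xy = sum(x * y for x, y in zip(xs, ys))
sum_x2 = sum(x * x for x in xs)
slope = (n * sum_xy - sum_x * sum_y) / (n * sum_x2 - sum_x ** 2)
intercept = sum_y / n - slope * sum_x / n

# Deviation of each actual Y from its predicted Y'.
residuals = [y - (slope * x + intercept) for x, y in zip(xs, ys)]
residual_sum = sum(residuals)

# Zero up to floating-point rounding.
print(abs(residual_sum) < 1e-9)  # True
```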

A regression line is usually used when

the correlation coefficient is not 0.0

Master's degree-Candice-rainfall and height of ground water- which formula?

The Pearson formula: $r = \dfrac{N(\sum XY) - (\sum X)(\sum Y)}{\sqrt{[N(\sum X^2) - (\sum X)^2][N(\sum Y^2) - (\sum Y)^2]}}$
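
Rainfall and groundwater height are both ratio variables, so the Pearson computational formula applies. A minimal sketch follows; the function name `pearson_r` and the scores are hypothetical, chosen only to illustrate the arithmetic.

```python
import math

def pearson_r(xs, ys):
    """Pearson r from the computational (raw-score) formula."""
    n = len(xs)
    sum_x, sum_y = sum(xs), sum(ys)
    sum_xy = sum(x * y for x, y in zip(xs, ys))
    sum_x2 = sum(x * x for x in xs)
    sum_y2 = sum(y * y for y in ys)
    numerator = n * sum_xy - sum_x * sum_y
    denominator = math.sqrt(
        (n * sum_x2 - sum_x ** 2) * (n * sum_y2 - sum_y ** 2)
    )
    return numerator / denominator

# Hypothetical scores, not the data from the original question.
rainfall = [1, 2, 3, 4, 5]
groundwater = [2, 4, 5, 4, 5]
r = pearson_r(rainfall, groundwater)
print(round(r, 4))  # 0.7746
```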

Which of the following r-values indicates the weakest relationship between two variables?

+0.03

Calculate the appropriate correlation coefficient for the following data where 0 = not depressed and 1 = depressed (reading speed test score (X) and number of books read (Y))

+0.59

Calculate the appropriate correlation for data of runner, runner on race preparedness survey, and rank of race finish

+0.83

Calculate the appropriate correlation coefficient for the following data (Participant, Reading speed test score [X], number of books read [Y])

+0.95

Which of the following r-values indicates the strongest relationship between two variables?

-0.89

Calculating for the appropriate correlation coefficient of Employee, number of units produced, and minutes spent in break room

-0.92

Calculate the appropriate correlation for the following (employee, visits to break room, number of units produced)

-0.97

Y' = -0.85X + 20.93

...

Y' = 1.64X - 3.08

...

When r = 0.0, the slope of the regression line equals...

0

When r = 1.0, then $S_{Y'}$ (the standard error of the estimate) equals

0

The Y-intercept is the value of Y' when X =

0.0

If there is no relationship between two variables, the slope of the regression line will equal

0.0

Student test grocery memorability with images and without images- What is probability of obtaining a sample mean of 10 or higher?

0.0062 (lowest answer)

Which of the following is the criterion that psychologists usually use to determine the likelihood that a sample mean was obtained by chance?

p = 0.05

If the probability of getting a z score between the mean and +1 standard deviation is 0.3413, what is the probability of getting a z score lower than -1 standard deviation?

0.1587 (lowest answer)
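
These areas follow from the symmetry of the normal curve: half the area lies below the mean, so the area below -1 is 0.5 - 0.3413. A short sketch using the standard library's `statistics.NormalDist` (variable names are illustrative):

```python
from statistics import NormalDist

z = NormalDist()  # standard normal: mean 0, standard deviation 1

mean_to_plus1 = z.cdf(1) - z.cdf(0)  # area between the mean and z = +1
below_minus1 = z.cdf(-1)             # area below z = -1
plus1_or_less = z.cdf(1)             # area at or below z = +1

print(round(mean_to_plus1, 4))  # 0.3413
print(round(below_minus1, 4))   # 0.1587
print(round(plus1_or_less, 4))  # 0.8413
```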

Burnout and therapists relationship

0.28

president of college to survey students; probability for freshmen?

0.33

What is the probability of getting a sample mean between 500 and 520 if the population mean is 500 and the standard deviation of the sampling distribution is 20?

0.3413
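
A sample mean of 520 sits one standard error above 500, so this is the area between z = 0 and z = +1. A minimal check, assuming the sampling distribution of means is normal:

```python
from statistics import NormalDist

# Sampling distribution of means: mean 500, standard error 20.
sampling_dist = NormalDist(mu=500, sigma=20)

p_between = sampling_dist.cdf(520) - sampling_dist.cdf(500)
print(round(p_between, 4))  # 0.3413
```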

Your professor told the class that 20% of the class receives ...As...Bs..Cs...Ds..Fs. What is the probability that the student sitting next to you will receive an A or F?

0.35

Advertising executive wanted to know about ad campaign against children smoking...; what is probability of obtaining a sample mean of -0.70 or lower (more negative)?

0.3632

Stress level + test score proportion of variance accounted for is

0.58

If the probability of getting a z score between the mean and +1 standard deviation is 0.3413, what is the probability of getting a z score of +1 or less?

0.8413 (highest answer)

12.43

12.43

A point on the scatter plot represents how many values?

2 (X and Y)

What is the slope of the regression equation: Y' = 0.56X + 2.41?

0.56 (the coefficient of X; 2.41 is the Y-intercept)

What is the slope of the regression equation: Y' = 2.69X - 3.92?

2.69

A "weak" relationship between two variables is represented by

A large spread of Y scores at each X score

How would a statistician define probability?

A mathematical statement indicating the likelihood of an event when we randomly sample a particular population

What kind of relationship is represented by a line slanting downward from left to right?

A negative relationship

"the bigger they are, the harder they fall" describes...

A positive linear correlation

Why can we never be sure that a sample represents a population?

A random sample may poorly represent the population, or it may represent a population that is different

The "error" in a SINGLE prediction is equal to the degree to which a participant's _____ score deviates from the _______.

Actual; corresponding predicted score

When the correlation coefficient representing the relationship between X and Y is intermediate, then all of the following are true except...

All data points fall on the regression line

When rolling a pair of dice, the probability of rolling a total point value of 7 is 0.17. If you rolled a pair of dice 1,000 times and a point value of 7 appeared 723 times, what would you conclude?

Although not impossible, this outcome is so unlikely that the fairness of these dice is questionable
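
To see how unlikely 723 sevens would be, compare the observed count with what fair dice are expected to produce. The sketch below uses the binomial mean and standard deviation; treating the count as roughly normal is an assumption made only to express the gap as a z score.

```python
import math

n_rolls = 1000
p_seven = 6 / 36  # six of the 36 equally likely outcomes total 7
observed = 723

expected = n_rolls * p_seven                               # expected number of sevens
spread = math.sqrt(n_rolls * p_seven * (1 - p_seven))      # binomial standard deviation
z = (observed - expected) / spread

print(round(expected, 1))  # 166.7
print(z > 40)              # True: dozens of standard deviations above expectation
```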

Negative correlation

As one variable increases, the other decreases

Positive correlation

As one variable increases, the other variable increases

Prof. Miller found that the correlation between a "need for affiliation" and number of hours spent watching television is -0.69; he should conclude...

As we observe people with higher and higher need for affiliation, we see a tendency for those people to spend less and less time watching television

One assumption of linear regression is...

At each X, the sample of Y scores should represent an approximately normal distribution

What statistic should be used to find out whether there is a relationship between hours spent participating in sports and GPA?

The Pearson correlation coefficient

Study at state University; negative correlation btwn smoking and lung capacity; after passing appropriate inferential test, what do the researchers do next?

Calculate the linear regression equation

In a non-linear or curvilinear relationship, as the X scores change, the Y scores...

Change consistently, but in more than one direction

If there is a relationship between "amount of coffee consumed" and "nervousness", then as the amount of coffee consumed increases, the amount of nervousness...

Changes in some consistent, predictable manner

What is the basis of all inferential statistics?

Deciding whether or not a sample of scores is representative of a particular population

If you see the notation ΣXY, what should you do?

First multiply each X by its partner Y, then sum the results

If you see the notation (ΣX)(ΣY), what should you do?

First sum the Xs, then sum the Ys, then multiply the sums
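
The order of operations matters: the sum of the products and the product of the sums are generally different numbers. A small illustration with made-up scores:

```python
xs = [1, 2, 3]
ys = [4, 5, 6]

# Sum of XY: multiply each X by its partner Y, then sum the products.
sum_of_products = sum(x * y for x, y in zip(xs, ys))

# (Sum of X)(Sum of Y): sum each list first, then multiply the two sums.
product_of_sums = sum(xs) * sum(ys)

print(sum_of_products)  # 32
print(product_of_sums)  # 90
```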

The mean of the population is 200, variance = 100, probability of 5% only in upper tail...

Since the z value does not fall within the region of rejection, we should not conclude this sample mean represents some other population

A study about college aptitude of seniors... z score of +1.89, critical value for region of rejection is +1.96, what is correct conclusion?

Since the z value does not fall within the region of rejection, we should not conclude this sample mean represents some other population

In general, a positive correlation means that as the values of one variable ______, there is a tendency for the values of the other variable to _______.

Increase; increase

Words comprehended in a sentence; sample mean z = 3.00, critical values of ±1.96, what should the psychologist conclude?

It is an unlikely sample for the population of looking times for other words and probably represents some other population

Which of the following is NOT true of linear regression equation?

It is the equation from which the correlation coefficient is calculated

What does a correlation coefficient do?

It quantifies the pattern in a relationship

A nonlinear correlation looks like what?

Low to high and back to low, like a rainbow!

Formula for slope of the regression line?

$b = \dfrac{N(\sum XY) - (\sum X)(\sum Y)}{N(\sum X^2) - (\sum X)^2}$

what type of relationship does a horizontal line parallel to the X axis represent?

No relationship

Sample mean with z score of -2.00, critical value is ±1.96, what is correct conclusion?

Since the z value falls within the region of rejection, we should conclude this sample mean likely represents some other population

Physical fitness of 65 and older men with sit-ups;

Since z scores falls within the region of rejection, we should conclude this sample mean likely represents some other population

What statistic should be used to find out whether there is a relationship between years of education and annual income?

The Pearson correlation coefficient

The strength of a relationship is indicated by the extent to which ________ paired with each individual value of the _________ variable.

One value of the Y variable is; X

Type of correlation used when both variables are interval or ratio:

Pearson

Professor Johnston and strong correlation between neckties and strokes; claims wearing neckties causes strokes. What error has he made?

Professor Johnston is drawing a causal conclusion from correlational findings

When we square the correlation coefficient to produce r(squared), the result is equal to the

Proportion of variance accounted for

what do we call that portion of the sampling distribution in which values are considered too unlikely to have occurred by chance?

Region of rejection

The best-fitting line through a scatterplot is known as the __________ line.

Regression!

If the correlation coefficient turns out to be a relatively high value, then the value of $S_{Y'}$ (the standard error of the estimate) will be

Relatively low

When knowledge of a relationship is used, the average error remaining after predictions have been made based on the relationship is

$S^2_{Y'}$ (the variance of the Y scores around Y')

Which of the following is not true of the criterion?

Samples that meet the criterion occur more than 5% of the time

__________ occurs when random chance produces a sample statistic that is not equal to the population parameter it represents

Sampling error

The weakest relationship of Study A, Study B, and Study C is...

Study B

When r = 0.0, the value of $S_{Y'}$ is equal to

$S_Y$

Which correlation coefficient should we use if we want to find out whether a relationship exists between two variables that represent pairs of ordinal scores?

The Spearman rank-order correlation coefficient
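
For ranked data with no tied ranks, the Spearman coefficient can be computed from the differences between paired ranks, $r_s = 1 - \dfrac{6\sum D^2}{N(N^2 - 1)}$. A minimal sketch, assuming untied ranks; the function name `spearman_rs` and the ranks are hypothetical.

```python
def spearman_rs(ranks_x, ranks_y):
    """Spearman rank-order correlation for paired ranks with no ties."""
    n = len(ranks_x)
    sum_d_squared = sum((rx - ry) ** 2 for rx, ry in zip(ranks_x, ranks_y))
    return 1 - (6 * sum_d_squared) / (n * (n ** 2 - 1))

# Hypothetical ranks on two ordinal measures for five participants.
ranks_a = [1, 2, 3, 4, 5]
ranks_b = [2, 1, 4, 3, 5]
r_s = spearman_rs(ranks_a, ranks_b)
print(round(r_s, 2))  # 0.8
```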

Linear regression is defined as the procedure for determining

The best-fitting straight line in a linear relationship

"the self-confidence of a group of students is positively correlated with their chances of getting through the course"- what does this statement mean?

The chances of passing the course tend to increase as the self-confidence scores of the students increase

What is the critical value?

The inner edge of the region of rejection

When a sample mean is different from the mean of the sampling distribution, two alternatives must be considered; the sample mean may represent _______, or it may represent _______.

The population poorly; a different population

Relationship between linear and scatter plot diagrams?

The regression line is the best-fitting line through a scatter plot

Population mean = 100, standard error of the sampling distribution = 10, sample mean = 80,

The sample mean does not occur very often by chance in the sampling distribution of means and probably did not come from the given population

What can we conclude when the absolute value of a z-score for a sample mean is larger than the critical value?

The sample mean does not represent the particular raw score population on which the sampling distribution is based

Population mean = 100, standard error of the sampling distribution = 10, sample mean = 110

The sample mean occurs very often by chance in the sampling distribution of means and probably did come from the given population

What can you conclude about a sample mean that falls within the region of rejection?

The sample probably represents some population other than the one on which the sampling distribution was based

The criterion determines

The size of the region of rejection

In the regression equation, the slope summarizes __________ and the Y-intercept indicates_________.

The steepness and direction of the regression line; the value of Y' when X = 0

Professor Helgin has found that the correlation between length of an index finger and the person's IQ is -0.09. He should conclude that...

There is a very weak relationship between the length of the index finger and IQ because r is nearly 0

which relationship is stronger, r = +0.62 or r = -0.62?

There is no difference in the strength of the two relationships

Which of the following is correct regarding means that fall within the region of rejection when the critical values are ±1.96?

They occur with a probability of 5%

What can we conclude about a sample mean that is found to lie in the region of rejection? It is extremely ____ to have occurred by chance, and represents ____

Unlikely; some other population

Statisticians use linear regression to

predict unknown Y scores from known X scores

Which of the following formulas represents the Y intercept of the regression line?

$a = \bar{Y} - b\bar{X}$
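
The slope and Y-intercept are computed in that order, because the intercept formula needs the slope. A sketch under the usual least-squares definitions; the helper name `fit_line` and the scores are illustrative assumptions.

```python
def fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares regression line."""
    n = len(xs)
    sum_x, sum_y = sum(xs), sum(ys)
    sum_xy = sum(x * y for x, y in zip(xs, ys))
    sum_x2 = sum(x * x for x in xs)
    # Slope: b = [N(sum XY) - (sum X)(sum Y)] / [N(sum X^2) - (sum X)^2]
    slope = (n * sum_xy - sum_x * sum_y) / (n * sum_x2 - sum_x ** 2)
    # Y-intercept: a = mean of Y minus slope times mean of X
    intercept = sum_y / n - slope * (sum_x / n)
    return slope, intercept

# Made-up scores for illustration.
slope, intercept = fit_line([1, 2, 3, 4, 5], [2, 4, 5, 4, 5])
print(round(slope, 2), round(intercept, 2))  # 0.6 2.2
```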

Study about relationship between women's age and attitude about marriage using Pearson, what mistake was made?

You only surveyed young women in college causing a restriction of range

When we divide the error remaining after we use the relationship to predict Y scores by the total error when we use the mean to predict the Y scores and then subtract the result from 1, the final result is

proportion of variance accounted for
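
In symbols, this is $1 - S^2_{Y'} / S^2_Y$, which equals $r^2$. A numerical sketch with made-up scores: the predicted values below come from the least-squares line Y' = 0.6X + 2.2 for X = 1 through 5, which is an assumption of this example rather than data from the study set.

```python
actual = [2, 4, 5, 4, 5]
predicted = [2.8, 3.4, 4.0, 4.6, 5.2]  # Y' = 0.6X + 2.2 for X = 1..5
n = len(actual)
mean_y = sum(actual) / n

# Error remaining when the relationship is used to predict Y.
error_with_relationship = sum((y - yp) ** 2 for y, yp in zip(actual, predicted)) / n

# Total error when the mean of Y is used to predict every Y.
error_with_mean = sum((y - mean_y) ** 2 for y in actual) / n

proportion_accounted_for = 1 - error_with_relationship / error_with_mean
print(round(proportion_accounted_for, 4))  # 0.6
```

For these scores r is about 0.7746, and 0.7746 squared is about 0.6, matching the result.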

"The more you save, the less you spend" describes...

a negative linear correlation

Which of the following best describes knowing the relative frequency of every possible event in a population?

a probability distribution

Which of the following accurately describes a theoretical probability distribution? It is based on...

a theoretical model of the relative frequency of events in a population

Using correlation design, a researcher found a relationship between the healthiness of one's heart and the amount of fish oil in one's diet. The researcher should conclude that...

although a relationship exists, one cannot infer that changes in one variable are causing changes in another variable

If we decide to reject the idea that a sample represents a particular population, because the sample mean lies within the region of rejection,

although probability is low, our decision may be wrong

If we decide not to reject the idea...

although the probability is low, our decision may be wrong...

In general, a zero correlation means that...

as the values of one variable increase, there is no tendency for the values of the other variable to change in any consistent, predictable fashion

The standard error of the estimate is defined as the

average spread of actual Y scores around the predicted Y' scores

The "error" in all predictions made from a sample using linear regression is the...

average spread of actual Y scores around the predicted Y' scores

If we calculate a correlation coefficient and we find that there is a relationship between the two variables, we...

cannot conclude that changes in one variable cause changes in the other variable

How can we determine the representativeness of a sample mean for a particular population?

convert the sample mean to a z score and compare the z score to the critical value
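
That decision rule can be written in a few lines. The sketch assumes a two-tailed test with critical values of ±1.96; the function names are hypothetical, and the numbers reuse the population mean of 100 and standard error of 10 from the cards above.

```python
def z_for_sample_mean(sample_mean, pop_mean, standard_error):
    """Locate a sample mean on the sampling distribution of means."""
    return (sample_mean - pop_mean) / standard_error

def in_region_of_rejection(z, critical_value=1.96):
    """Two-tailed rule: reject when |z| is beyond the critical value."""
    return abs(z) > critical_value

z_low = z_for_sample_mean(80, 100, 10)
z_high = z_for_sample_mean(110, 100, 10)

print(z_low, in_region_of_rejection(z_low))    # -2.0 True
print(z_high, in_region_of_rejection(z_high))  # 1.0 False
```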

At a basic level, when deciding whether a sample is representative of a particular population, we

decide against low-probability events in favor of high-probability events

When heteroscedasticity exists, the problem with r is that it..

does not accurately describe the strength of the relationship for all Xs

To predict a Y' score from a given X score using the regression constants, we would

first multiply X by the slope and then add the Y-intercept
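
As a quick illustration, the equation Y' = 2.69X - 3.92 from an earlier card can be applied to a single X score. The X value of 2.0 and the function name `predict_y` are assumptions for the example.

```python
def predict_y(x, slope, intercept):
    """Predicted Y' for a given X: multiply by the slope, then add the intercept."""
    return slope * x + intercept

y_pred = predict_y(2.0, slope=2.69, intercept=-3.92)
print(round(y_pred, 2))  # 1.46
```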

Compared to a strong relationship, a weak relationship between two variables results in

greater prediction error and a larger value of $S_{Y'}$

In a linear relationship, as the X scores increase, the Y scores change...

in only one direction

You roll a die twice, and both times you roll a 6. What type of events are these two rolls?

independent

As a general rule, when statisticians determine the probability of events, they assume that the events are __________ and sampled __________.

independent; with replacement

26 red cards and 26 black cards...

is the same as it has always been if the deck is a fair deck

Linear regression is important because

it is used to predict unknown Y scores based on X scores from a correlated variable

The scores that lie in the tails of a normal distribution have a ____ frequency and a ____ probability of occurring

low; low

The purpose of probability and inferential statistics is to...

make decisions about the population that have a good chance of being correct

When a z score is not in the region of rejection, we should

not reject the idea that the sample represents the raw score population

We calculate the proportion of variance accounted for because it is the statistical basis for evaluation

of the usefulness of a relationship

Regression line is best fitting because

on average, the regression line passes through the center of the various Y scores

The best-fitting line through a scatterplot is known as the

regression line

In an experimental design _________, whereas in a correlational design ________.

researchers assign each person an X score and then measure the score on the Y variable; researchers measure scores on variables that a participant has already experienced

Suppose you take candy out of jar, look at color, & put it back in jar before randomly selecting next piece. This sampling is called...

sampling with replacement

when plotting correlational data, the appropriate graph to use is the

scatterplot!

We should do a scatterplot of the data when we compute a correlation because the scatterplot allows us to

see the nature of the relationship between the two variables

Water quality study; pop = 100, ox= 25, area of 0.05,

since the z value falls within the region of rejection, we should conclude this sample mean likely represents some other population

the slope of a line is a number indicating the

slant of the line and the direction in which it slants

In looking at the regression constants, we know the relationship is negative if the

slope value is negative

standard error of the estimate formula

$S_{Y'} = S_Y\sqrt{1 - r^2}$
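
The formula shows why the standard error of the estimate shrinks as the correlation gets stronger: with r = 0 it equals the standard deviation of Y, and with r = ±1 it is zero. A small sketch with an assumed standard deviation of 10 (the function name is hypothetical):

```python
import math

def standard_error_of_estimate(s_y, r):
    """Standard error of the estimate: S_Y times sqrt(1 - r^2)."""
    return s_y * math.sqrt(1 - r ** 2)

no_relationship = standard_error_of_estimate(10, 0.0)
strong_relationship = standard_error_of_estimate(10, 0.8)
perfect_relationship = standard_error_of_estimate(10, 1.0)

print(no_relationship)                 # 10.0
print(round(strong_relationship, 2))   # 6.0
print(perfect_relationship)            # 0.0
```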

Which correlation coefficient should we use if we want to find out whether a relationship exists between two variables that are both interval or ratio variables?

the Pearson correlation coefficient

Homoscedasticity occurs when

the Y scores at all Xs are spread out to the same degree

Heteroscedasticity occurs when

the Y scores have a different degree of spread at different Xs

which of the following accurately describes an empirical probability distribution? It is based on...

the computed relative frequency of observed events

In general, the greater the proportion of variance accounted for,

the more accurately we can predict behavior

a probability distribution gives us...

the probability of every possible event in a population

Two events are said to be independent when...

the probability of one event is not influenced by the occurrence of the other event
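
Independence can be checked by enumeration. The sketch below lists all 36 outcomes of two rolls of a fair die and compares the probability of a 6 on the second roll with the same probability given that the first roll was a 6.

```python
from fractions import Fraction
from itertools import product

rolls = list(product(range(1, 7), repeat=2))  # all 36 (first, second) outcomes

# P(second roll is 6), ignoring the first roll.
p_second_six = Fraction(sum(1 for _, second in rolls if second == 6), len(rolls))

# P(second roll is 6 | first roll is 6).
first_six = [(first, second) for first, second in rolls if first == 6]
p_second_six_given_first_six = Fraction(
    sum(1 for _, second in first_six if second == 6), len(first_six)
)

# P(both rolls are 6) equals the product of the two separate probabilities.
p_both_six = Fraction(sum(1 for f, s in rolls if f == 6 and s == 6), len(rolls))

print(p_second_six, p_second_six_given_first_six, p_both_six)  # 1/6 1/6 1/36
```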

An event's relative frequency in the population equals...

the probability of the event

How is the relative frequency of an event defined?

the proportion of times an event occurs in the population of events

the coefficient of alienation is interpreted as

the proportion of variance not accounted for

The coefficient of determination is interpreted as

the proportion of variance accounted for

When +1.645 is used as the critical value instead of ±1.96,

the region of rejection is all placed in the positive tail
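
Both critical values come from the same 5% criterion. Splitting the 5% across two tails leaves 2.5% in each, while a one-tailed test puts the whole 5% in one tail. A short check with the standard library's `statistics.NormalDist`:

```python
from statistics import NormalDist

standard_normal = NormalDist()  # mean 0, standard deviation 1
criterion = 0.05

# Two-tailed: 2.5% in each tail, so the upper cutoff leaves 97.5% below it.
two_tailed_critical = standard_normal.inv_cdf(1 - criterion / 2)

# One-tailed: the full 5% in the upper tail, so 95% lies below the cutoff.
one_tailed_critical = standard_normal.inv_cdf(1 - criterion)

print(round(two_tailed_critical, 3))  # 1.96
print(round(one_tailed_critical, 3))  # 1.645
```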

