Research Methods Quiz 5
A helpful tool for visualizing test-retest reliability and interrater reliability is a: a. bar graph. b. Cronbach's alpha. c. correlation coefficient. d. scatterplot.
scatterplot
In which of the following ways are content and face validity similar? a. Both involve subjective judgments. b. Both involve asking participants for their opinions about the measurement. c. Both are preferred by psychologists as ideal measures of validity. d. Both are very difficult to establish.
both involve subjective judgements
Dr. Rodriquez is considering conducting a study examining whether narcissistic people have poorer romantic relationships than those who are not narcissistic. One of her first tasks is to determine which of her participants are narcissistic and which are not. She decides to use the scale created by a colleague, the Donal scale. Question 1 reads, " I tend not to think about other people as much as I think about myself." Question 2 reads, " I do not have a high opinion of myself." Question 3 reads, " I think other people think I am really special." Before using the measure in her study, Dr. Rodriquez gives the measure to a group of participants on the first day of the semester and again on the last day of the semester. She then compares the scores between the two time points. This is a test of which of the following? A. Interrater reliability B. Internal reliability C. Test-retest reliability D. Construct reliability
Test-retest reliability
For her research methods class, Serena plans to interview several teachers about their attitude toward teaching children who have attention-deficit/hyperactivity disorder (ADHD). This is an example of what type of measurement? a. Self-report measurement b. Observational measurement c. Physiological measurement d. Archival measurement
self-report measurement
Classify each operational variable below as categorical or quantitative. If the variable is quantitative, further classify it as ordinal, interval, or ratio. A. Degree of pupil dilation in a person's eyes in a study of romantic couples (measured in millimeters) B. Number of books a person owns C. A book's sales rank on Amazon D. The language a person speaks at home E. Nationality of participants in a cross-cultural study of Canadian, Ghanaian, and French students. F. A student's grade in school.
-Degree of pupil dilation in a person's eyes in a study of romantic couples (measured in millimeters): Ratio -Number of books a person owns: Ratio -A book's sales rank on Amazon: Ordinal -The language a person speaks at home: Categorical -Nationality of participants in a cross-cultural study of Canadian, Ghanaian, and French students: Categorial -Student's grade in school: Interval
Classify each of the following results as an example of internal reliability, interrater reliability, or test-retest reliability. A. A researcher finds that people's scores on a measure of extroversion stay stable over 2 months. B. An infancy researcher wants to measure how long a 3-month-old baby looks at a stimulus on the right and left sides of a screen. Two undergraduates watch a tape of the eye movements of ten infants and time how long each baby looks to the to the right and to the left. Two sets of timings are correlated, r=0.95. C. A researcher asks a sample of 40 people a set of five items that all capture how extroverted they are. The Cronbach's alpha for the five items is found to be 0.85.
-Stable over 2 months: Test-retest reliability -Two sets of timings are correlated: Interrater reliability -Ask sample how extroverted they are: Internal reliability
When using correlation coefficients to evaluate reliability, which of the following is undesirable? A. A correlation coefficient close to one B. A negative correlation coefficient C. A strong correlation coefficient D. It depends on the type of reliability being evaluated
A negative correlation coefficient
Classify each result below as an example of face validity, content validity, convergent and discriminant validity, or criterion validity. A. A professor gives a class of 40 people his five-item measure of conscientiousness (e.g., "I get chores done right away," "I follow a schedule," "I do not make a mess of thing"). Average scores are correlated (r = -0.20) with how many times each student has been late to class during the summer. B. A professor fives a class of 40 people his five-item measure of conscientiousness (e.g, "I get chores done right way," "I follow a schedule," "I do not make a mess of things"). Average scores are more highly correlated with a self-report measure of tidiness (r= 0.50) than with a measure of general knowledge (r=0.09) C. The researcher email his five-item measure of conscientiousness (e.g, "I get chores done right way," "I follow a schedule," "I do not make a mess of things") to 20 experts in personality psychology and ask them if they think his items are a good measure of conscientiousness. D. The researcher emails his five-item measure of conscientiousness (e.g, "I get chores done right way," "I follow a schedule," "I do not make a mess of things") to 20 experts in personality psychology and ask them if they think he has included all the important aspects of conscientiousness.
A. Criterion B. Convergent and discriminant validity C. Face validity D. Content validity
In looking at a scatterplot of interrater reliability, why would a researcher want to see all the dots close to the line of agreement? A. Because it indicates a positive relationship B. Because it indicates that the researcher's two research assistants/raters are making similar measurements. C. Because it indicates that the researcher's measurement is valid D. Because it indicates that the researcher's measurement will also have high test-retest reliability.
Because it indicates that the researcher's two research assistants/raters are making similar measurements.
Dr. Sheffield is a clinical psychologist who specializes in treating pathological gambling. Pathological gambling is defined as being unable to resist impulses to gamble. Bothered by not having a good measure that he can give to clients to determine whether they are suffering from this condition, he creates a new measure of pathological gambling. The measure has 15 questions, and it takes 20 minutes to complete. To test his measure, Dr. Sheffield gives his measure to a group of his clients and at the same time measures how many times they have been gambling in the past month. He predicts that clients who score higher on his measure will also report gambling more times in the past month. This procedure is meant to provide evidence for which of the following? A. Face validity B. Content validity C. Criterion validity D. Discriminant validity
Content validity
If I demonstrate, in a sample of people, that my new self-report measure of extroversion correlates with an observation of the number of conversations each person has in a day, I have demonstrated __________ validity. A. Face B. Content C. Criterion D. Convergent E. Discriminant
Criterion
For which type of validity do we need to collect empirical evidence? A. Face validity B. Criterion validity C. Content validity D. Internal validity
Criterion Validity
A correlation-based statistic called _____________ is commonly used to determine internal reliability. a. Cronbach's alpha b. kappa c. a scatterplot d. Pearson's r
Cronbach's alpha
What does it mean that "reliability is necessary but not sufficient for validity"? A. If a measure is reliable, it is also valid B. If a measure is valid, it is also reliable C. Reliability and validity are unrelated concepts D. Reliability and validity are the same concept
If a measure is valid, it is also reliable
How many subcategories of quantitative variables exist? a. Two b. Three c. Four d. Five
three