Assessment midterm
111.Which is the most common type of scale used in psychological tests? A. continuous B. discrete C. contiguous D. None of these
A
24.The manifestation of behavior associated with a particular psychological trait is to some extent dependent on A. the situation the person is in. B. the financial pressures being exerted. C. an individual's blood pressure at the time. D. one's astrological sign.
A
29.If the standard deviation of a set of test scores is equal to 25, the variance is equal to A. 625. B. 5. C. 50. D. 12.5.
A
13.A 100-item achievement test is administered to 30 students. Students earn 1 point for each correct answer. In the test results there are three scores of 95. All of the other students score between 10 and 30. What measure of central tendency would be MOST representative of this set of scores? A. average deviation B. the median C. the mode D. the standard deviation
B
31.Interpretations of criterion-referenced tests are typically made with respect to A. the total number of items the examinee responded to. B. the material that the examinee evidenced mastery of. C. a comparison of the examinee's performance with that of others who took the test. D. a formula that takes into account the total number of items for which no response was scorable.
B
31.Typically, which of the following is the primary objective of psychological testing and assessment? A. to measure a trait B. to predict future behavior C. to obtain a score on a test D. to observe a sample of behavior
B
32.Test users who treat ordinal data as if they were interval data must be constantly alert to the possibility of A. a highly skewed standard error. B. gross inequality of intervals. C. extreme kurtosis in a graphed distribution of test scores. D. legal challenges from the ACLU.
B
34.Any influence on psychological test scores from factors other than what the test is intended to measure is referred to by measurement specialists as A. spurious artifact. B. error. C. bias. D. construct-irrelevant stimuli.
B
34.As the confidence interval increases, the range of scores into which a single test score falls is likely to A. decrease. B. increase. C. remain the same. D. alternately decrease and increase.
B
37.A psychological trait A. has a real existence. B. is a construct. C. has no relation to observed behavior. D. All of these
B
37.One scale at the deli counter of a local supermarket consistently measures everything that is placed on it exactly one ounce more than its actual weight. An assessment professional would describe this scale as A. valid. B. reliable. C. unreliable. D. not properly normed.
B
41.The term psychometric soundness is typically a reference to a test's reliability and A. utility. B. validity. C. norms. D. cost-effectiveness.
B
42.What type of reliability estimate is appropriate for use in a comparison of "Form A" to "Form B" of a picture vocabulary test? A. test-retest B. alternate-forms C. inter-rater D. internal-consistency
B
44.A test designed to provide information about whether or not an aviator has mastered the ability to fly solo is an example of a test that is A. norm-referenced. B. criterion-referenced. C. trait-referenced. D. All of these
B
54.If the results of an examination are positively skewed, the exam questions were likely A. easy. B. difficult. C. biased. D. part of a make-up examination
B
80.A difference between a ratio and an interval scale is that a ratio scale A. is considered a unit of measurement. B. has an absolute zero point. C. is the most common scale used in psychological measurement. D. has an absolute freezing point
B
84.The main purpose of using statistics is A. to conduct experiments in a replicable fashion. B. to put data into an interpretable form. C. to rank-order data. D. to predict experimental outcomes.
B
98.Which of the following is TRUE of the Pearson r? A. It has a distribution that approximates the tetrachoric r if the data are not linear. B. It is legitimately used only when the two variables are linear. C. Pearson actually had very little to do with its development. D. It can never be larger than the Spearman rho if the data represent two true dichotomies.
B
1.In a research study, subjects are categorized as either "hospitalized" or "never hospitalized." This type of categorization is referred to as A. random. B. continuous. C. discrete. D. prime.
C
21.Observation of behavior in a controlled environment. A. an observable behavior. B. a biological phenomenon. C. a construct. D. what a test measures.
C
26.Research published by Henry H. Goddard supported A. the pro-life movement. B. the anti-whaling movement. C. the eugenics movement. D. the labor union movement.
C
30.From your reading of Chapter 4 in your textbook, which of the following statements is NOT true? A. Overt behavior is observable. B. Constructs are ideas rather than observable actions. C. Constructs are only used when behavior is not observable. D. The existence of a construct can be inferred from overt behavior.
C
33.Which of the following is TRUE about error in psychological measurement? A. As with other fields, error is synonymous with "mistake." B. Error only refers to deliberate misrepresentation of results rather than carelessness. C. Error is an expected component of measurement. D. Error can be eliminated only by care and vigilance.
C
35.Which of the following is the MOST important reason why translating a test into another language is not recommended? A. It can be extremely costly. B. It can be extremely time-consuming. C. Meanings of the items may change. D. Translation must conform to the specific dialect of the testtaker.
C
4.Miles-per-hour is an example of which type of scale? A. nominal B. ordinal C. ratio D. interval
C
40.Which type of reliability is directly affected by the heterogeneity of a test? A. test-retest B. inter-rater C. internal-consistency D. alternate-forms or parallel-forms
C
41.133. By definition, estimates of reliability can range from _______. A. -3.00 to +3.00 B. 1 to10 C. 0 to 1 D. -1 to 1
C
48.The definition of a norm-referenced test as compared to a criterion-referenced test differs primarily with respect to A. the error thought to be present in measurement. B. whether or not the test is theory-based. C. the score to which testtakers are compared. D. the reliability of the test in question.
C
48.The greater the proportion of the total variance attributed to true variance, the more ____________ the test. A. scientific B. variable C. reliable D. expensive
C
54.Most reliability coefficients, regardless of the specific type of reliability they are measuring, range in value from A. -1 to +1. B. 0 to 100. C. 0 to 1. D. negative infinity to positive infinity.
C
59.The fact that young children develop rapidly and in "growth spurts" is a problem when it comes to the estimation which type of reliability for an infant development scale? A. internal-consistency reliability B. alternate-forms reliability C. test-retest reliability D. interrater reliability
C
27.A synonym for inter-scorer reliability is A. inter-judge reliability B. observer reliability C. inter-rater reliability D. All of these
D
45.In theory, the distribution of the normal curve ranges from A. +3 to -3. B. 0 to 100. C. 0 to infinity. D. negative infinity to positive infinity
D
6.As used in your text, test can refer to: A. a paper-and-pencil examination. B. a task. C. an interview with a client. D. All of these
D
59.The term age norms is synonymous with A. age-equivalent scores. B. age-standard scores. C. age-derived scores. D. age-defying scores.
A
76.Most scores on tests that measure psychological variables A. are continuous. B. are discrete. C. are error-free. D. lack discretion.
A
26.Which of the following is the BEST example of a psychological trait? A. eye color B. hair color C. skin color D. None of these
D
104.What is the correlation coefficient of choice when two variables are ordinal? A. the Spearman rho B. the Mooney-O C. the Anna-O D. None of these
A
38."Test-related behavior predicts non-test-related behavior." This statement A. is an assumption on which the assessment enterprise is based. B. is true only if the test-related behavior mimics the non-test-related behavior. C. is true only if the non-test-related behavior mimics the test-related behavior. D. All of these
A
10.Measures of central tendency refer to which part of the frequency distribution? A. the high end of the distribution B. the middle of the distribution C. the low end of the distribution D. the very end of the distribution
B
11.Computer-scorable items have tended to eliminate error variance due to A. item sampling. B. scorer differences. C. content sampling. D. testtakers' reactions to environmental variables.
B
66.The amount of time that passes between the presentation of a word on a computer screen and the reading of a word by a testtaker involves measurement on which type of scale? A. ordinal B. ratio C. interval D. nominal
B
14.For the distribution of test scores 85, 76, 71, 86, and 92, the arithmetic mean is equal to A. 71. B. 85. C. 82. D. 80.
C
15.A distribution of test scores has a three-way tie for the most frequently occurring score. This distribution could be described as A. trimedial. B. kurtotic. C. trimodal. D. skewed.
C
21.Which statistic describes the most frequently occurring test score? A. the mean B. the median C. the mode D. the range
C
43.What index of reliability would be BEST use to compare two evaluators' assessments of a group of job applicants? A. KR-20 B. coefficient alpha C. the Kappa statistic D. the Spearman-Brown correction
C
7.A confidence interval is a range or band of test scores that A. has proven test-retest reliability. B. is calculated using the standard error of the difference. C. is likely to contain the true score. D. None of these
C
15.Which of the following is TRUE for estimates of alternate- and parallel-forms reliability? A. Two test administrations with the same group are required. B. Test scores may be affected by factors such as motivation, fatigue, or intervening events like practice, learning, or therapy. C. Item sampling is a source of error variance. D. All of these
D
50.Which is NOT a possible source of error variance? A. test administration B. test scoring C. test interpretation D. All are possible sources of error variance.
D
7.A key difference between psychological testing and psychological assessment has to do with A. the number of hours it takes to proctor a test session. B. the role of the test user in interpreting the results. C. whether or not the evaluation includes an oral test. D. the utility of the test in a cost versus benefit analysis.
B
70.A school psychologist administers a psychological test to a child. For the purposes of her report, she converts the testtaker's raw scores to percentiles. Age and grade norms are provided in the manual but she declines to cite them because A. percentiles are all that are needed for accurate reporting of results. B. age and grade norms tend to be misinterpreted by the general public. C. she prefers to report results using only means and standard deviations. D. that's what they're expecting her to do.
B
73.If the mean of a distribution is 7 and the standard deviation is 2, what is the z score that is equivalent to a raw score of 3? A. 2 B. -2 C. 3 D. 6
B
78.Rank-ordering individuals on the variable of leadership ability is an example of which type of scale? A. nominal B. ordinal C. interval D. ratio
B
49.A score earned by a testtaker on a psychological test may BEST be viewed as equal to A. the raw score plus the observed score. B. the error score. C. the true score. D. the true score plus error.
D
49.This tool is used to estimate or infer how far an observed score deviates from a true score, and it is called A. a standard deviation. B. a measure of central tendency. C. the variance. D. a standard error of measurement.
D
51.A test is considered standardized if the manual for the test includes A. clearly specified procedures for administration. B. clearly specified procedures for scoring. C. normative data. D. All of these
D
52.A test is said to be standardized if the test's manual contains clearly specified procedures for test A. administration. B. scoring. C. interpretation. D. All of these
D
47.In the formula X = T + E, T refers to A. the true score. B. the time factor. C. the average test score. D. test-retest reliability.
A
79.In a normal distribution of scores, approximately what percentage of test scores falls between +1 and -1 standard deviations from the mean? A. 50% B. 66% C. 75% D. less than 1%
B
8.When using trait terms, it is important to keep in mind that A. there are many synonyms for the trait term that could be used instead. B. trait terms are relative in that unstated comparisons are always being made. C. the trait being exhibited is subject to sudden change as in multiple personality. D. traits, much like most human abilities, are all "in the eye of the beholder."
B
89.129. Skewness provides an indication of the extent to which the shape of the distribution is A. curved. B. symmetrical. C. flat. D. peaked.
B
9.Psychometrics may BEST be defined as A. the science of test development. B. the science of psychological measurement. C. the study and use of correlational techniques. D. the study of psychic phenomena.
B
99.A correlation coefficient that is significant at the p < .01 level A. has a 99% chance of being accurate. B. could have been expected to occur by chance alone one time or less in 100. C. could have been expected to occur by chance alone 99 times or more in 100. D. accounts for about 1% of the variance.
B
1.The term psychometric soundness refers to the A. the general psychiatric health of an assessee. B. mental status of an individual during assessment. C. technical quality of a test or other tool of assessment. D. competence of a defendant to stand trial.
C
11.Of the following parties to the assessment enterprise, which group would be LEAST likely to have read the Standards? A. test developers B. test users C. testtakers D. test publishers
C
9.It is incorrect to conceive of error "creeping into" the measurement process because A. this implies that it appears slowly when in reality it is instantaneous. B. to do so is to imply that error is negative when in reality it is positive. C. error is a well acknowledged part of the measurement process. D. All of these
C
9.Reliability, in a broad statistical sense, is synonymous with A. consistently good. B. consistently bad. C. consistency. D. validity.
C
96.A correlation coefficient equal to -.98 would indicate A. a weak inverse relationship between the two variables. B. a weak direct relationship between the two variables. C. a strong inverse relationship between the two variables. D. a strong direct relationship between the two variables.
C
36.X is to abscissa as Y is to A. oblongata. B. kudos. C. kurtotic. D. ordinate.
D
59.What is the relationship between the mean, the median, and the mode for a distribution of scores that is normally distributed? A. The mean and the median are larger than the mode. B. The mean and the median are smaller than the mode. C. The mean and the mode are larger than the median. D. The mean, the median, and the mode are equal.
D
5As used by your textbook authors, the term psychological assessment applies to A. clinical settings only. B. self-administered tests only. C. employment, clinical, and educational settings only. D. the use of tests and other tools of evaluation.
D
6.According to your text, which of the following assumptions inherent in the assessment enterprise is repeatedly emphasized in the codes of ethics of associations of assessment professionals? A. Psychological traits and states can be quantified. B. Various sources of error are a part of the process. C. Test-related behavior predicts non-test-related behavior. D. Tests and other tools have strengths and weaknesses.
D
6.Which would NOT be useful in estimating a test's inter-item consistency? A. Cronbach's alpha B. the Kuder-Richardson formulas C. the average proportional distance D. a coefficient of equivalence
D
68.The number corresponding to a grade of A on your next psychological measurement exam is an example of what type of score? A. raw score B. standard score C. percentile rank D. cut-off score
D
69.If a person scores at the median on a test, and if the scores on the test are normally distributed, the individual will be in which stanine? A. the first B. the fifth C. the ninth D. the fifteenth
B
16. It's a group of people whose performance on a particular test is analyzed for reference in evaluating the performance of future testtakers. This group is called A. a normative sample. B. a test validation group. C. a stratified random sample. D. a "back to the future" group.
A
1.According to a study cited in the text, random error affecting a score on a personality test is best illustrated with reference to the A. weather on the day that the personality test was administered. B. testtaker's general self-concept on the day that the personality test was administered. C. amount of the specific personality trait that actually exists. D. All of these
A
1.Numbers are assigned to each player on the university basketball team. This use of numbers could BEST be characterized as A. a nominal scale. B. an ordinal scale. C. an interval scale. D. a ratio scale.
A
10.Although the question "Is this test fair?" is frequently raised, the underlying and more pertinent moral question is, A. "What do we, as a society, wish to accomplish by the use of this test?" B. "What offensive cultural stereotypes are inherent in the use of this test?" C. "Can school districts in impoverished areas afford to use this test?" D. "Does this test discriminate any one or any group in a demonstrable way?"
A
10.In everyday practice, responsibility for appropriate test administration, scoring, and interpretation lies with: A. test users. B. test developers. C. elected representatives. D. test publishers.
A
102.What is the relationship between the coefficient of determination and the correlation coefficient? A. The larger the correlation coefficient, the larger the coefficient of determination. B. The larger the correlation coefficient, the smaller the coefficient of determination. C. The relationship between them would have a correlation coefficient of zero exactly. D. The larger the correlation coefficient, the more variance can be attributed to error or chance.
A
106.A scatterplot of the relationship between two variables is graphed upward and sloping to the right. This is indicative of A. a strong positive relationship. B. a strong negative relationship. C. a weak inverse relationship. D. a purely Platonic relationship.
A
107.Which of the following statements is TRUE concerning a correlation coefficient? A. A restricted range in either correlated variable makes the correlation lower. B. A correlation coefficient provides information regarding causation. C. No meaning is attached to the sign of a correlation coefficient, since the coefficient is ultimately squared. D. Restriction of range is now illegal in the 48 contiguous states.
A
108.Among school-age children, as age increases, so do reading skills. This relationship between two variables illustrates A. a positive correlation between two variables. B. a negative correlation correlations between two variables. C. a zero correlation. D. None of these
A
109.A correlation coefficient is equal to .30. Using the concept of coefficient of determination, the variance accounted for by chance, error, and other unexplained factors would be: A. approximately 91%. B. approximately 30%. C. approximately 3%. D. None of these
A
12.The mean, the median, and the mode are all A. measures of central tendency. B. measures of variability. C. measures of dispersion. D. standard scores.
A
15.A psychologist writing a journal article is looking for a critical review of an intelligence test that was published 4 years ago. Which of the following sources would be BEST to consult? A. The Mental Measurements Yearbook B. Tests in Print C. Psychological Assessment D. Men's Health
A
17. Norms come in many different forms. Which of the following is NOT correctly referred to by the term "norms"? A. trait norms B. age norms C. grade norms D. percentile norms
A
17.Which typically provides the most objective evaluation of a test? A. a published review of the test in a journal B. the publisher's test catalogue C. the Standards for Educational and Psychological Tests D. a manual of use for the test itself
A
18.As discussed in the text, examinations in ancient China for civil service positions were, for the most part, A. only open to men. B. only open to members of affluent families. C. open to all citizens. D. open to all citizens and immigrants.
A
2.In the test-retest method to estimate reliability A. the time frame between interviews must be relatively short. B. separate interviews are conducted by certified raters. C. a minimum of two re-tests are required. D. All of these
A
20. Measurement can be defined as A. the assignment of numbers or symbols to characteristics of objects/people. B. the administering and scoring of psychological tests to members of a defined population. C. the process of determining the reliability and validity of tests. D. the A psychological trait can BEST be described as
A
20.Which of the following positions would Galton support? A. Genius runs in families. B. Environment is the most important determinant of genius. C. Genius ruins families. D. Darwin's theory was overstated.
A
21.Which of the following is usually minimized when using split-half estimates of reliability as compared with test-retest or parallel/alternate-form estimates of reliability? A. time and expense B. reliability and validity C. reliability only D. time spent in scoring and interpretation
A
22.For which type of data is the mode most frequently used? A. nominal data B. ordinal data C. interval data D. ratio data
A
23.Which is the only measure of central tendency that can be used for all nominal, ordinal, interval, and ratio scales of measurement? A. the mode B. the median C. the mean D. the standard deviation
A
25.A trait is typically viewed as "relative." This means that inherent in its definition, a comparison is being made to: A. other people. B. only people who are totally lacking in the trait. C. only blood relatives. D. only distant relatives
A
26.Which statistic conveys the LEAST precise measure of dispersion? A. the range B. the variance C. the standard deviation D. the semi-interquartile range
A
27.The mental ability evaluations conducted at Ellis Island could best be characterized as A. psychological testing. B. psychological assessment. C. case-study techniques. D. role-play.
A
30.If all scores in a set of test scores were the same, the variance would be equal to A. zero. B. one. C. two. D. None of these
A
30.Which type(s) of reliability estimates would be most appropriate for a measure of heart rate? A. test-retest B. alternate-form C. parallel form D. internist consistency
A
32.Which of the following is NOT a part of the formula for the standard error of measurement for a particular test? A. the validity of the test B. the reliability of the test C. the standard deviation of the group of test scores D. Both the reliability of the test and the standard deviation of the group of test scores
A
34A raw score is so called because it is A. a straightforward, unmodified accounting of performance, usually numerical in nature. B. an untreated estimate of performance on a test prior to placement in a frequency distribution. C. ratio level measurement that has not yet been converted into any sort of graphic form. D. None of these
A
35.An assumption inherent in classical test theory (CTT) is that A. each test-taker has a true score that would have been obtained if not for measurement error. B. testtakers can be ranked according to the level of the trait or ability that they exhibit in much the same way that rankings can be made by height. C. measurement can be perfectly accurate if the measuring tool is well-constructed. D. All of these
A
36.According to a study cited in the text, random error affecting a score on a personality test is best illustrated with reference to the A. weather on the day that the personality test was administered. B. testtaker's general self-concept on the day that the personality test was administered. C. amount of the specific personality trait that actually exists. D. All of these
A
36.If the standard deviations of two tests are identical but the reliability is lower for Test A as compared to Test B, then the standard error of measurement will be ________ for Test A as compared with Test B. A. higher B. lower C. the same D. hard to tell
A
37.A histogram is a A. graph with vertical lines drawn at each class interval. B. frequency distribution with horizontal lines at each class interval. C. graphic illustration from history with ratio level data. D. surgical procedure once performed on women to treat depression.
A
37.What is the difference between alternate forms and parallel forms of a test? A. Alternate forms do not necessarily yield test scores with equal means and variances. B. Alternate forms are designed to be equivalent only with regard to level of difficulty. C. Alternate forms are different only with respect to how they are administered. D. There are no differences between alternate and parallel forms of a test.
A
38.On a bar graph comparing men to women with regard to test scores, one would expect to find the gender variable listed on the A. abscissa. B. ordinate. C. frequency polygon. D. appendix.
A
4.Which is TRUE of measurement error? A. Like error in general, measurement error may be random or systematic. B. Unlike error in general, measurement error may be random or systematic. C. Measurement error is always random. D. Measurement error is always systematic.
A
42.The range is a measure of variation that is simple enough to calculate, but its value is A. greatly affected by extreme scores in the distribution. B. not affected enough by extreme scores in the distribution. C. extremely limited when a distribution is relatively platykurtic. D. too exaggerated for use with a normal distribute
A
43.There are 100 scores in a distribution of test scores and the average deviation (AD) is 12. The AD of 12 tells us that the 100 scores in this distribution varied, on average A. 12 points from the mean. B. 12 points from each other. C. 12 points from 100. D. None of these
A
45.An applicant for a job with the U.S. Postal Service scores in the bottom 5% of all applicants on a test that measures the ability to sort mail. This is an example of A. norm-referenced testing. B. criterion-referenced testing. C. stress-tolerance testing. D. an individual who may one day "go postal."
A
48.A z score scale has sometimes been referred to as a "zero plus or minus one" scale because a z score scale A. has a mean set at 0 and a standard deviation set at 1. B. has a mean set at 0 and a standard deviation set at -1. C. has a standard deviation set at 0, and a mean set at 1. D. None of these
A
5."Test-related behavior predicts non-test-related behavior." This statement A. is an assumption on which the assessment enterprise is based. B. is true only if the test-related behavior mimics the non-test-related behavior. C. is true only if the non-test-related behavior mimics the test-related behavior. D. All of these
A
50.Today, when someone tells you what their recently measured "IQ" is, the value quoted is most likely A. a standard score. B. a z-score. C. a T-score. D. a quotient.
A
51.A standard score obtained by a linear transformation is one in which A. a direct numerical relationship to the original score is retained. B. only the directionality of the relationship to the original score is retained. C. the relationship to the original score has been completely erased. D. None of these
A
55.If the results of an examination are negatively skewed, the exam questions were likely: A. easy. B. difficult. C. biased. D. quite novel in many respects.
A
55.Subgroup norms can best be described as a normative sample A. that has been segmented in some way. B. that is not representative of the population sampled. C. with scores below the 60th percentile on a test. D. for which test administration conditions were not optimal.
A
57.Kurtosis refers to steepness in the ________ of the distribution. A. center B. positive end C. negative end D. outliers
A
61.Because of the unique problems in assessing very young children, which of the following would be the BEST practice when attempting to estimate the reliability of tests designed to measure cognitive and motor abilities in infants? A. Use relatively short test-retest intervals. B. Use relatively long test-retest intervals. C. Do not use the test-retest method for estimating reliability of the test. D. Use only inter-scorer reliability estimates.
A
62.In general, criterion-referenced testing A. is useful for communicating information regarding the mastery of basic skills such as reading and writing. B. has been found to be extremely valuable in conveying information about a student's level of creativity relative to other students. C. is not a well-accepted method of testing in areas such as mastery of word processing programs. D. is a contradiction in terms due to the lack of a valid criterion.
A
64.A scatterplot of the relationship between two variables is graphed upward and sloping to the right. This is indicative of A. a strong positive relationship. B. a strong negative relationship. C. a weak inverse relationship. D. a purely Platonic relationship.
A
66."Number of items answered correctly on an achievement test" defines what type of score on that test? A. a raw score B. a standard score C. a percentile rank D. a cut-off score
A
71.In a norm-referenced approach to testing, a testtaker's scores are compared with A. the scores of other individuals who were administered the test. B. a standard of excellence determined by experts in the area. C. Both the scores of other individuals who were administered the test and a standard of excellence determined by experts in the area. D. None of these
A
72.Which of the following is NOT typical of an interval scale? A. an absolute zero point B. continuous variables C. rank-orderings D. an interval
A
74.Which of the following statements is TRUE concerning a correlation coefficient? A. A restricted range in either correlated variable makes the correlation lower. B. A correlation coefficient provides information regarding causation. C. No meaning is attached to the sign of a correlation coefficient, since the coefficient is ultimately squared. D. Restriction of range is now illegal in the 48 contiguous states.
A
75.Among school-age children, as age increases, so do reading skills. This relationship between two variables illustrates A. a positive correlation between two variables. B. a negative correlation between two variables. C. a zero correlation between two variables. D. why "correlation" is synonymous with "causation".
A
75.Kate received a z score of 1 on a reading test. What do we know about Kate's performance, assuming that the reading test scores are distributed normally? A. She scored better than 84% of other students. B. She scored better than only 2/3 of the other students. C. She scored worse than only 2/3 of other students. D. She scored worse than 84% of other students.
A
76.A correlation coefficient is equal to .30. Using the coefficient of determination, the variance accounted for by chance, error, and other unexplained factors is A. approximately 91%. B. approximately 30%. C. approximately 3%. D. None of these
A
77.A diagnostic classification listed in a manual as "3.18 Attention Deficit Hyperactivity Disorder" is an example of: A. a nominal scale. B. an ordinal scale. C. an interval scale. D. a ratio scale.
A
8.Frequency distributions may be illustrated in a variety of ways. Which of the following illustrations is the MOST popular of illustrating a frequency distribution? A. a histogram B. a scatterplot C. a pie chart D. a radio dial
A
91.133. Which of the following is NOT true of the normal curve? A. The sides taper and touch the x-axis. B. The mean, the median, and the mode have the same value. C. The highest point is the center. D. It is perfectly symmetrical, with no skewness.
A
93.Correlation coefficients range from -1 to: A. +1. B. 0. C. + infinity. D. +10.
A
1.According to Gil et al. (2016), which of the following is a source of error in scores on psychological tests? A. whether or not the examiner has a beard B. whether the testtaker's country is at war or peace C. the body weight of the testtaker two weeks prior to the test D. None of these
B
101.The coefficient of determination is calculated by A. multiplying the correlation coefficient by 100. B. squaring the correlation coefficient and multiplying by 100. C. multiplying the correlation coefficient by the sample size. D. squaring the mean of each of the variables and then summing them.
B
12.The term psychometric soundness is typically a reference to a test's reliability and A. utility. B. validity. C. norms. D. cost-effectiveness.
B
15. Norm-referenced testing and assessment is both a method of evaluation and a way of deriving meaning from tests scores. Individual testtaker scores are evaluated against A. the scores achieved by a matched group of testtakers on a comparable test. B. the scores of a comparison group of testtakers. C. scores from a demographically more diverse group of testtakers. D. the scores achieved by a guy named norm.
B
16.When graphing ratio data, why is it preferable to set the ordinate of the y-axis at 0? A. It is simply the traditional way of doing things, and has no inherent benefit. B. Setting the ordinate to other values may yield an exaggerated impression of the changes in the variable. C. Doing so is the best protection against statistic-based challenges regarding methodology, findings, and conclusions. D. Ratio-level data has a theoretical range from 0 to plus or minus infinity.
B
17.Which of the following statistics is the preferred measure of central tendency for a skewed distribution? A. the mean B. the median C. the mode D. None of these
B
18. In order to conduct research using human subjects, a university requires researchers to complete an online ethics course, and then correctly respond to all of the items of the test on that material. This test process could BEST be characterized as A. norm-referenced testing. B. criterion-referenced testing. C. culturally informed assessment. D. authentic assessment.
B
18.The mean for the set of scores 8, 9, and 7 is A. 7. B. 8. C. 9. D. 24.
B
18.Which of the following types of reliability estimates is the most expensive due to the costs involved in test development? A. test-retest B. parallel-form C. internal-consistency D. Spearman's rho
B
19.What term refers to the degree of correlation between all the items on a scale? A. inter-item homogeneity B. inter-item consistency C. inter-item heterogeneity D. parallel-form reliability
B
2.A psychological trait A. has a real existence. B. is a construct. C. has no relation to observed behavior. D. All of these
B
2.Psychological tests share commonalities. For example, they all A. include an analysis of a naturally occurring behavior. B. include an analysis of a sample of behavior. C. include paper-and-pencil and oral responses. D. All of these
B
20.A distribution of test scores is: 75, 92, 96, 88, 75, 62, and 88. This distribution can be characterized as: A. unimodal with a mode of 75. B. bimodal with the modes of 75 and 88. C. unimodal with a mode of 88. D. trimodal with the modes of 96, 92, and 62.
B
22.According to your textbook, psychological traits A. can be seen and touched. B. can only be inferred. C. do not exist. D. exist only in theory.
B
22.Who coined the term mental test in 1890? A. Binet B. Cattell C. Wundt D. Galton
B
24.Typically, adding items to a test will have what effect on the test's reliability? A. Reliability will decrease. B. Reliability will increase. C. Reliability will stay the same. D. Reliability will first increase and then decrease.
B
27.Which of the following is the BEST example of a psychological trait? A. eye color B. aggressiveness C. skin color D. None of these
B
27.Which quartile may also be referred to as the median? A. the first quartile B. the second quartile C. the third quartile D. the fourth quartile
B
28.The median and the interquartile range are __________ in nature. A. reciprocal B. ordinal C. interval D. opposite
B
28.Which BEST conveys the meaning of an inter-scorer reliability estimate of .90? A. Ninety percent of the scores obtained are reliable. B. Ninety percent of the variance in the scores assigned by the scorers was attributed to true differences and 10% to error. C. Ten percent of the variance in the scores assigned by the scorers was attributed to true differences and 90% to error. D. Ten percent of the test's items are in need of revision according to the majority of the test's users.
B
29.An approach to personality assessment that does not employ self-report methods is referred to as A. a reflective method. B. a projective method. C. a factorial method. D. a nonempirical method.
B
3.Assuming the Sensation Seeking Scale (SSS) is a valid measure, how would you expect someone who goes parachuting once a month to score on the SSS? A. average B. above average C. below average D. impossible to say
B
32.Assessment professionals who use tests without understanding the limitations of the tests they use are A. violating state and federal laws which mandate such understanding. B. violating provisions of their profession's ethical codes. C. well meaning but bumbling in their every day work. D. psychiatrists rather than psychologists.
B
33.Who coined the term "mental test"? A. Alfred Binet B. James Cattell C. Victor Henri D. Charles Spearman
B
35.A simple frequency distribution is labeled as such to indicate that the data in it A. occurs with no particular frequency. B. have not been grouped. C. are not particularly complex. D. have only been averaged using the arithmetic mean.
B
35.As the reliability of a test increases, the standard error of measurement A. increases. B. decreases. C. remains the same. D. alternately increases, then decreases.
B
36Test reliability refers to A. how accurately a test measures what it purports to measure. B. how consistently a test measures what it purports to measure. C. the "depth" of measurement of a particular construct. D. the "bandwidth" of measurement of a particular construct.
B
39.If a student received a score of 50 on a math test with a standard error of measurement of 3, which of the following statements would be TRUE of the "true score"? A. In 68% of the cases, the "true score" would be expected to be between 44 and 56. B. In 68% of the cases, the "true score" would be expected to be between 47 and 53. C. In 95% of the cases, the "true score" would be expected to be between 47 and 53. D. In 95% of the cases, the "true score" would be expected to be between 44 and 56.
B
4.Psychological testing A. is typically more lengthy than assessment. B. may be one component of the process of assessment. C. is characteristically broader in scope than assessment. D. tends to be less accurate than assessment.
B
67.When test scores are found to be normally distributed, they take on the shape of the familiar "bell curve." In these kinds of graphs, which variable is on the y (vertical) axis? A. the test score B. the frequency C. the deviation from the mean D. the standard deviation
B
40.When "putting tests to the test" and researching the validity of a particular instrument for a particular purpose, test users A. will typically consult multiple issues of popular magazines. B. may need to research which combination of tests best suits a particular purpose. C. can be satisfied they are using the right instrument if the validity coefficient found is satisfactorily high. D. All of these
B
41.When "putting a test to the test" and researching the suitability of a particular test for a particular objective, test users will typically research A. the reliability of the test. B. the validity of the test. C. the utility of the test. D. All of these
B
46.A test containing 100 items is revised by deleting 20 items. What might be expected to happen to the magnitude of the reliability estimate for that test? A. It will be expected to increase. B. It will be expected to decrease. C. It will be expected to stay the same. D. It cannot be determined based on the information provided.
B
46.If a controversy exists over the definition of the trait being measured by a psychological test, questions will MOST likely be raised relative to A. the reliability of the test. B. the validity of the test. C. the standardization of the test. D. All of these
B
47.A test that yields information about a testtaker's relative standing in a group is referred to as A. criterion-referenced. B. norm-referenced. C. standard-referenced. D. None of these
B
47.Raw scores may be converted to standard scores A. because raw scores are more readily interpretable than standard scores. B. to better understand a test-taker's performance relative to others. C. Both because raw scores are more readily interpretable than standard scores and to better understand a test-taker's performance relative to others. D. None of these
B
50.The statistic known as the standard error of measurement is BEST associated with which of the following? A. validity B. reliability C. test standardization D. test administration
B
52.A nonlinear transformation of test scores into standard scores may be required when the test data under consideration: A. are normally distributed. B. are not normally distributed. C. were originally converted into A scores. D. were not obtained under standardized conditions
B
53.Manuel earns a 90 on a standardized math test. The standard error of measurement for this test is 5. Approximately 95% of the scores fall between ______________. A. 85 and 95. B. 80 and 100. C. 80 and 100. D. Cannot determine based on the information provided.
B
54.A distribution of raw scores from a test administration is normally distributed. The test user wishes to summarize these data using percentiles. A potential problem here is that score differences in the middle area of the distribution A. may be minimized. B. may be exaggerated. C. may not be accurate if the test is "pass/fail." D. will only be accurate if scored in a timely fashion.
B
55.All indices of reliability provide an index that is a characteristic of a particular A. test. B. group of test scores. C. trait. D. approach to measurement.
B
56.If few scores fall on the negative side of the distribution, the distribution is ________ skewed. A. positively B. negatively C. symmetrically D. asymmetrically
B
56.The precise amount of error inherent in the reliability estimate published in a test manual will vary with A. the purchase price of the test (the more expensive, the less the error). B. the sample of test-takers from which the data were drawn. C. the population of test user actually using a published test. D. All of these
B
57.Different types of reliability coefficients A. all reflect the same sources of error variance. B. may reflect different sources of error variance. C. never reflect the same source of error variance. D. reflect on error variance during leisure activities.
B
58.A test of infant development contains three scales: (1) Cognitive Ability, (2) Motor Development, and (3) Behavior Rating. Because these three scales are designed to measure different characteristics (that is, they are not homogeneous), it would be inappropriate to combine the three scales in calculating estimates of the test's A. alternate-forms reliability. B. internal-consistency reliability. C. test-retest reliability. D. interrater reliability.
B
6.A frequency distribution typically includes A. the average score and a measure of diversion around it. B. each possible score and how often it occurs. C. an estimate of how spread out the scores are. D. an index of how "popular" a particular frequency is.
B
60.Approximately what percentage of scores in a normal distribution falls between the mean and 1 standard deviation above and below the mean? A. 34% B. 68% C. 95% D. less than 1%
B
61.A norm-referenced test yields which type of information? A. information indicating that the examinee has the eye-hand coordination skills necessary to operate a particular piece of dangerous machinery B. information indicating that the examinee scored above 90% of individuals who took the test C. Both information indicating that the examinee has the eye-hand coordination skills necessary to operate a particular piece of dangerous machinery and information indicating that the examinee scored above 90% of individuals who took the test D. None of these
B
62.What is an advantage of T scores over z scores? A. z scores have no negative numbers B. T scores have no negative numbers C. T scores are more precise D. a T score has greater statistical flexibility
B
63.T scores have a mean of ________ and a standard deviation of ________. A. 20; 10 B. 50; 10 C. 100; 15 D. 100; 10
B
63.Which is TRUE about reliability in the psychometric sense? A. reliability is an all-or-none measurement B. a test may be reliable in one context and unreliable in another C. a reliability coefficient may not be derived for personality tests D. alternate forms reliability may not be derived for personality tests
B
65.What is the primary advantage of normalization of a skewed distribution? A. greater validity B. easier comparability to other scales C. greater reliability D. eliminates negative numbers
B
68.Which of the following values could be a stanine score? A. 0 B. 6 C. 7.6 D. All of these
B
103.The correlation coefficient of choice when both sets of measurements are in rank-order form and when fewer than 30 pairs of measurements are involved is A. the Pearson r. B. the tetrachoric r. C. the Spearman rho. D. the ROTC.
C
11.In calculating the mean of a distribution of test scores, the person analyzing the data takes account of A. only the extreme scores in the distribution. B. only the middle scores in the distribution. C. every score in the distribution. D. the standard scores in the distribution.
C
112.Measuring temperature with a Fahrenheit thermometer entails the use of which type of scale? A. ratio B. ordinal C. interval D. nominal
C
12.Which type of reliability estimate is obtained by correlating pairs of scores from the same person (or people) on two different administrations of the same test? A. a parallel-forms estimate B. a split-half estimate C. a test-retest estimate D. an au-paire estimate
C
13.Which type of reliability estimate would be appropriate only when evaluating the reliability of a test that measures a trait that is presumed to be relatively stable over time? A. parallel-forms B. alternate-forms C. test-retest D. split-half
C
16.The current edition of the Mental Measurements Yearbook contains: A. sample personality tests. B. sample mental ability tests. C. reviews of psychological tests. D. All of these
C
16.Which of the following is TRUE for parallel forms of a test? A. The means of the observed scores are equal for the two forms. B. The variances of the estimated scores are equal for the two forms. C. The means and variances of the observed scores are equal for the two forms. D. The means and variances of the estimated scores are equal for the two forms.
C
19.T score is to 50 as: A. z score is to 10. B. percentile is to 100. C. stanine is to 5. D. stanine is to 9.
C
2.An instructor rank-orders students in her measurement class based on their performance on a quiz of Chapter 3 in the textbook. In this instance, the instructor is using which type of scale? A. nominal B. interval C. ordinal D. comparative
C
21.How did the work of Wundt differ from that of Galton, Binet, and James McKeen Cattell? A. Wundt used standardized psychological tests. B. Wundt utilized humans and not animal research subjects. C. Wundt focused on how individuals were the same rather than different. D. Wundt focused on how individuals were different rather than the same.
C
23.Much of 19th-century psychological measurement focused on A. intelligence. B. ethics and values. C. sensory abilities. D. personality traits.
C
25.Projective tests may be viewed as remedying a deficiency of which other type of psychological test? A. intelligence tests B. proficiency tests C. self-report tests D. neurological tests
C
28.Psychological test data gathered from immigrant assessees at Ellis Island by Henry Goddard was A. cited extensively in Goddard's best-seller Welcoming Europe's Gifted. B. used to argue against compulsory sterilization for the "feeble-minded." C. cited to support anti-immigration arguments and legislation. D. compiled to determine the incidence of feeble-mindedness worldwide.
C
28.How do states different from traits? A. A trait is biologically determined, whereas a state in environmentally determined. B. A trait is measurable, whereas a state is not. C. A trait is relatively enduring, whereas a state is relatively short-lived. D. A trait is unchangeable, whereas a state is easily influenced.
C
30.Which of the following was the first personality test to be developed after the first world war? A. the Bernreuter Personality Inventory B. the Mooney Problem Checklist C. the Personal Data Sheet D. the MMPI
C
32.Which behavioral scientist viewed individual differences as a source of error in experimentation? A. Cattell B. Darwin C. Wundt D. Witmer
C
33.The standard error of measurement of a particular test of anxiety is 8. A student earns a score of 60. What is the confidence interval for this test score at the 95% level? A. 52-68 B. 40-68 C. 44-76 D. 36-84
C
39.When measuring a particular psychological trait, the term error variance refers to A. the degree to which impression management may influence the obtained score. B. the variations in score possible if another instrument was employed. C. the component of a test score attributable to sources other than the trait. D. All of these
C
4.An assumption in psychological testing and assessment is that psychological traits can be measured. For this to be true, A. the reference group used must be people who exhibit and do not exhibit the trait. B. the test must take into account changes in the trait that can occur over time. C. test developers must clearly define the trait constructs the test purports to measure. D. All of these
C
42.A norm group is a group of testtakers A. for whom a particular test is deemed inappropriate. B. taking a particular test for the very first time. C. that is typically described in the test manual. D. Only taking a particular test for the very first time and that is typically described in the test manual.
C
43.The term norms refers to the A. average score of all test-takers within a standardization sample. B. the typical performance of the norm group. C. scores with which the results of subsequent measurement can be compared. D. transformed scores used to create and improve test items.
C
44.The U.S. Navy is highly selective when it comes to applications for Navy Seal training. A distribution of test scores on a Navy Seal Qualifying and Screening Examination administered to a class of high school seniors would be expected to yield A. a normal distribution of test scores. B. a negatively skewed distribution of test scores. C. a positively skewed distribution of test scores. D. a trimodal distribution.
C
44.Which of the following is TRUE of the standard error of measurement? A. The larger the standard error of measurement, the better. B. The standard error of measurement is inversely related to the standard deviation (that is, when one goes up, the other goes down). C. The standard error of measurement is inversely related to reliability (that is, when one goes up, the other goes down). D. A low standard error of measurement is indicative of low validity.
C
46.The tail portion of a normal curve is the area of the curve between A. 2 and 3 standard deviations above the mean. B. -2 and -3 standard deviations below the mean. C. Both 2 and 3 standard deviations above the mean and -2 and -3 standard deviations below the mean. D. None of these
C
49.The T in T-scores came about because this variety of standard score was named after A. Thurstone. B. Titchener. C. Thorndike. D. Terman.
C
5.The more homogeneous a test is, the A. less inter-item consistency it can be expected to have. B. more utility the test has for measuring multifaceted variables. C. more inter-item consistency it can be expected to have. D. None of these
C
5.Which is the only type of scale that has an absolute zero point? A. nominal B. ordinal C. ratio D. interval
C
51.A goal of a test developer is to A. maximize error variance. B. minimize true variance. C. maximize true variance. D. minimize stress for testtakers.
C
52.In general, approximately what percentage of scores would be expected to fall within two standard deviations above or below the standard error of measurement of the "true score" on a test? A. 85% B. 90% C. 95% D. 99%
C
58.A score at the 20th percentile on a high school history test is A. the score at or below which 80% of the scores in the distribution fall. B. that point in a distribution at which 80% of the test questions were answered correctly. C. the score at or below which 20% of the scores in the distribution fall. D. grounds for termination from a varsity team in most high schools.
C
58.Which of the following describes the shape of the normal curve? A. positively skewed B. negatively skewed C. symmetrical D. bimodal
C
60.A third-grade student who earned a grade-equivalent score of 5.0 a standardized test of mathematics A. has the same mathematics ability as the average fifth-grade student in that same school. B. should not be enrolled in a fifth-grade math class. C. performed similarly to a hypothetical fifth-grade student. D. will most probably earn a grade of A in the course.
C
60.In the language of psychological testing and assessment, reliability BEST refers to A. how well a test measures what it was originally designed to measure. B. the complete lack of any systematic error. C. the proportion of total variance that can be attributed to true variance. D. whether or not a test publisher consistently publishes high quality instruments.
C
61.What does the "tail" of a normal distribution refer to? A. the area of the normal curve between 2 and 3 standard deviations above the mean B. the area of the normal curve between 2 and 3 standard deviations below the mean C. the extremes of the distribution D. the bottom of the distribution
C
62.The directions for scoring a particular motor ability test instruct the examiner to "Give credit if the child holds his hands open most of the time." Because what constitutes "most of the time" is not specifically defined, directions such as these could result in lowered reliability estimates for A. test-retest reliability. B. alternate-form reliability. C. inter-rater reliability. D. parallel forms reliability.
C
63.It is an extremely atypical score that can sometimes provide a hint regarding some deficiency in the testing or scoring procedures. It is A. a nonlinear plot point. B. a standard error. C. an outlier. D. an error.
C
64.What standard score has a mean of 5 and a standard deviation of approximately 2? A. decile B. T C. stanine D. z
C
65.Raw scores may be converted to norms A. so that the scores are more easily interpreted. B. so that scores are easily communicated to others. C. Both so that the scores are more easily interpreted and so that scores are easily communicated to others. D. None of these
C
7.When measuring a particular psychological trait, the term error variance refers to A. the degree to which impression management may influence the obtained score. B. the variations in score possible if another instrument was employed. C. the component of a test score attributable to sources other than the trait. D. All of these
C
7.Which is the MOST common factor influencing a decision as to the size of class interval in a grouped frequency distribution? A. the number of scores above the mean B. the size of the mean C. convenience D. the size of the mean and the standard deviation
C
71.What do ordinal and nominal scales have in common? A. Both contain continuous variables. B. Both contain equal units of measurement. C. Both permit classifications. D. Both contain mutually exclusive variables.
C
73.Regarding norm-referenced and criterion-referenced testing A. norm-referenced testing is recommended. B. criterion-referenced testing is recommended. C. each can be appropriate depending on testing objectives. D. the two approaches can always be used interchangeably.
C
74.Which of the following is always located between the first quartile (Q1) and the third quartile (Q3)? A. the mean B. the range C. the median D. None of these
C
8.The term psychometrics A. was derived from the Latin for "to confuse and befuddle." B. is used to refer collectively to test catalogues, manuals, and reports. C. may be defined as the science of psychological measurement. D. All of these
C
81.Which of the following is the equivalent T score for an IQ score of 115 on an IQ test that has a mean of 100 and a standard deviation of 15? A. 40 B. 50 C. 60 D. 7
C
82.Which of the following is an advantage of the stanine score over other standard scores? A. It has greater reliability because it is a single digit. B. It has greater validity because of its three decimal places. C. It is easily manipulated because it is a single digit. D. It has greater precision because of its three decimal places
C
83.A testtaker who scores at the 5th stanine is scoring A. above average. B. below average. C. within the average range. D. in an unspecifiable range; it depends on the test.
C
86.The median is not an appropriate measure of central tendency for A. ratio data. B. interval data. C. nominal data. D. ordinal data.
C
88.The purpose of deriving a standard deviation of a distribution is A. to average all the scores in a distribution. B. to determine the central value of the scores in a distribution. C. to determine the dispersion of scores around the mean of a distribution. D. to determine the range of scores in each of the quartiles of a distribution.
C
94.Generally, what type of correlation exists between hours of study time spent studying for an achievement test and the student's performance on the test? A. it depends what the subject of the test is B. a negative correlation C. a positive correlation D. zero correlation
C
10.A source of error variance may take the form of A. item sampling. B. testtakers' reactions to environment-related variables such as room temperature and lighting. C. testtaker variables such as amount of sleep the night before a test, amount of anxiety, or drug effects. D. All of these
D
100. If the correlation coefficient is equal to .30, the coefficient of determination is equal to A. .90. B. .999. C. 9. D. None of these.
D
105.If the calculated value of the correlation coefficient for two variables is 0, what would the resulting scatterplot look like? A. upward, sloping to the left B. downward, sloping to the right C. upward, sloping to the right D. None of these
D
11.Students may fantasize about a world without tests. But in reality, a world without tests would be more of a nightmare than a dream. This is so because A. nepotism in hiring and promotions would be rampant. B. professionals would not be properly credentialed. C. educational difficulties would be more difficult to diagnose. D. All of these
D
110.Statistical tools are used for A. describing numbers. B. making inferences about numbers. C. drawing conclusions about numbers. D. All of these
D
12A test is set into large type for a visually impaired testtaker. This is an example of A. a physical environment modification. B. an interpersonal environment modification. C. a compromise. D. an accommodation.
D
13. A prospective test user may ask many questions about a test's validity. Which of the following is NOT a validity-related question? A. Do the test items adequately sample the range of areas that should be sampled? B. What do high scores on this test really tell us about the targeted construct? C. What do low scores on this test really tell us about the targeted construct? D. As a tool of measurement, is this test consistent?
D
13.If an accommodation is made for the purpose of administering a test, then A. the meaning of the test scores may not be clear. B. a notation should be made on the test report. C. a need for alternate assessment clearly existed. D. All of these
D
14. "What's a good test?" In part, a good test is one that A. trained examiners can administer with a minimum of difficulty. B. is useful in the sense that it yields actionable results. C. will ultimately benefit individual testtakers or society as a whole. D. All of these
D
14.In the language of psychological testing and assessment, scoring refers to assigning evaluative numbers, codes or statements to performance on A. tests. B. tasks. C. interviews. D. All of these
D
14.Which of the following might lead to a decrease in test-retest reliability? A. the passage of time between the two administrations of the test. B. coaching designed to increase test scores between the two administrations of the test. C. practice with similar test materials between the two administrations of the test. D. All of these
D
17.Which source of error variance affects parallel- or alternate-form reliability estimates but does not affect test-retest estimates? A. fatigue B. learning C. practice D. item sampling
D
19. In selecting a test for use, the responsible test user does some advance research on A. how appropriate the available norms are for use with the contemplated testtakers. B. how appropriate the reading level of the test is for use. C. cultural factors as they may relate to the administration and scoring of the test. D. All of these
D
19.When a test must be administered with an aid of an interpreter, A. subtle nuances of meaning may be lost. B. pre-training for the interpreter is desirable. C. pre-training for the assessor is desirable. D. All of these
D
20.Test-retest estimates of reliability are referred to as measures of ________, and split-half reliability estimates are referred to as measures of ________. A. true scores; error scores B. internal consistency; stability C. inter-scorer reliability; consistency D. stability; internal consistency
D
22.Which of the following factors may influence a split-half reliability estimate? A. fatigue B. anxiety C. item difficulty D. All of these
D
23.Psychological traits are present and exhibited to others A. 100% of the time. B. 75% of the time. C. 50% of the time. D. None of these
D
23.The Spearman-Brown formula is used for A. correcting for one half of the test by estimating the reliability of the whole test. B. determining how many additional items are needed to increase reliability up to a certain level. C. determining how many items can be eliminated without reducing reliability below a predetermined level. D. All of these
D
24.Which of the following would be LEAST likely to be used as an item on a projective test of personality? A. ink spilled on paper B. a painting of a rowboat on a lake C. a cloud D. a digital clock
D
24.Which statement is TRUE regarding this distribution of scores? 0, 11, 10, 0, 0, 5, 4, 3, 2, 1, 1, 1 A. There is no mode. B. The mode is 0. C. The mode is 1. D. There are two modes, 0 and 1.
D
25.Which of the following is NOT an acceptable way to divide a test when using the split-half reliability method? A. Randomly assign items to each half of the test. B. Assign odd-numbered items to one half and even-numbered items to the other half of the test. C. Assign the first-half of the items to one half of the test and the second half of the items to the other half of the test. D. Assign easy items to one half of the test and difficult items to the other half of the test.
D
25.Which statement is TRUE regarding this distribution of scores? 1, 2, 2, 3 A. The arithmetic mean is not an integer value. B. The distribution is bimodal. C. There is no mode. D. The arithmetic mean is equal to the mode.
D
26.Which of the following is, generally speaking, the preferred statistic for obtaining a measure of internal-consistency reliability? A. KR-20 B. KR-21 C. Kendall's Tau D. coefficient alpha
D
29.If a test is homogeneous A. it is functionally uniform throughout. B. it will likely yield a high internal-consistency reliability estimate compared with a test-retest reliability estimate. C. it would be reasonable to expect a high degree of internal consistency. D. All of these
D
29.Referring to a psychological trait as a construct means that it A. is an idea whose time has come, gone, and come back again. B. has few practical "real-world" applications and is used for academic purposes. C. was developed by naturalistic observation rather than any theory. D. is a scientific concept developed to describe or explain behavior.
D
3. In classical test theory, an observed score on an ability test is presumed to represent the testtaker's A. true score. B. true score less the variance. C. true score combined with extraneous factors. D. the testtaker's true score and error.
D
3.Psychological tests may differ with respect to A. content. B. format. C. administration. D. All of these
D
3.Yards gained by running backs during a football game is an example of which type of scale? A. nominal B. ordinal C. interval D. ratio
D
31.Users of psychological tests are frequently tempted to treat ordinal data as if it were interval data. This is the case because of the A. difficulties that would be encountered if the data were treated as ratio data. B. frequent need to do more than simply rank order test scores. C. data manipulation capabilities given the equal intervals between points measured. D. added flexibility of interval level data for statistical manipulation.
D
31.Which of the following statements is NOT true about Henry Goddard? A. He raised questions about how meaningful intelligence tests were for people of diverse backgrounds. B. He used intelligence test data to argue against capital punishment for the "feebleminded." C. He advocated for the institutionalization or sterilization of the mentally retarded to prevent future generations from having low intelligence. D. He developed a culturally sensitive intelligence test that measured culturally specific aspects of intelligence common to East Asian immigrants.
D
33.To make data more manageable, it is sometimes converted to graphs or tables. Graphs or tables can be created from A. nominal level data. B. ordinal level data. C. ratio level data. D. All of these
D
34.An intelligence test originally written in English is to be administered to a group of Japanese immigrants who do not speak English. In order to obtain an accurate measure of intelligence and attempt to eliminate any possible effects due to language, the test administrator should A. have a professional translator read the test to the group, simultaneously translating the items word-for-word. B. have a friend or family member of the group who is fluent in English and Japanese read the test to the group, simultaneously translating the items word-for-word. C. have a teacher fluent in Japanese and English conduct a brief tutorial in English prior to administering the test in English, with specific attention given to the meaning of the wording of key items and corresponding responses. D. None of these
D
38.For criterion-referenced tests, which of the following reliability estimates is recommended? A. test-retest reliability estimates B. alternate-form reliability estimates C. split-half reliability estimates D. None of these
D
38.Test validity refers to how A. "true" the score is in terms of true score theory. B. rigorous the process of norming the test was. C. "deep" the depth of trait measurement is. D. well a test measures what it purports to measure.
D
39.A distribution of test scores can be described by A. a measure of central tendency. B. a measure of deviation. C. a graph. D. All of these.
D
39.When "putting a test to the test" and debating whether or not a particular instrument should be used for a particular objective, relevant questions to be asked include A. What is the objective in using this test? B. Who is this test designed for use with? C. How is what the test measures defined? D. All of these
D
40.Students may fantasize about a world without tests. But in reality, a world without tests would be more of a nightmare than a dream. This is so because A. nepotism in hiring and promotions would be rampant. B. professionals would not be properly credentialed. C. educational difficulties would be more difficult to diagnose. D. All of these
D
41.Unlike the arithmetic mean of scores in a distribution, the mode A. may be totally atypical of other scores in the distribution. B. may lie at the extreme end of the distribution. C. is determined by counting the scores and determining which occurs most frequently. D. All of these
D
42.A prospective test user may ask many questions about a test's validity. Which of the following is NOT a validity-related question? A. Do the test items adequately sample the range of areas that should be sampled? B. What do high scores on this test really tell us about the targeted construct? C. What do low scores on this test really tell us about the targeted construct? D. As a tool of measurement, is this test consistent?
D
45.What type of reliability estimate is obtained by correlating pairs of scores from the same person on two different administrations of the same test? A. parallel-forms B. split-half C. interrater D. test-retest
D
53.A student received a z score of 1 on a test of English as a Second Language and the distribution of test scores on that test was normal. This same student received a z score of 1 on another test of English as Second Language and the distribution of test scores on this second test were highly skewed. In all probability, A. these two standard scores mean the same thing. B. the T-scores on both test would be equal to 80. C. the student speaks English better than many native-born Americans. D. these two standard scores do not mean the same thing.
D
53.The terms percentile and percentage correct A. are, for all intents and purposes, synonymous. B. have in common that neither is a converted raw score. C. both entail calculation by multiplying by 100 and dividing by the number of items. D. None of these
D
56.If the calculated value of the correlation coefficient for two variables is 0, what would the resulting scatterplot look like? A. upward, sloping to the left B. downward, sloping to the right C. upward, sloping to the right D. None of these
D
57.Percentiles divide a distribution into ________ equal parts. A. 5 B. 10 C. 1,000 D. 100
D
67.A raw score of 0 on a test suggests to the test user that the testtaker A. did not possess any of the construct/ability being assessed by the test. B. did not understand the instructions. C. was not physically capable of responding appropriately. D. All of these
D
69.104. The number corresponding to a "Pass" on a state teacher certification examination is an example of what type of score? A. raw score B. standard score C. percentile rank D. cut-off score
D
70.The nominal scale is a type of measurement that uses A. an absolute zero point. B. continuous variables. C. rank-ordering. D. None of these
D
72.Unlike norm-referenced interpretation, criterion-referenced interpretation A. requires that the test-taker's score be compared with a matched sample. B. describes test performance in terms of a test-taker's relative standing in a group. C. relies on percentile as opposed to grade or age norms. D. describes a test-taker's performance in terms of predetermined standards.
D
77.In general, samples of behavior may be obtained by A. direct observation. B. self-report. C. paper-and-pencil tests. D. All of these
D
8.The standard error of measurement is A. used to infer how far an observed score is from the true score. B. also known as the standard error of a score. C. is used in the context of classical test theory. D. All of these
D
85.The mean should be chosen as the measure of central tendency when the distribution is A. skewed in a generally positive direction. B. skewed in a generally negative direction. C. approximately j-shaped in nature. D. approximately symmetrical in nature.
D
87.Which of the following statistics is derived by calculating the difference between the highest and lowest scores in a distribution? A. average deviation B. variance C. standard deviation D. range
D
9.In a grouped frequency distribution A. the total of the frequency column is equal to the total number of scores in the distribution. B. test-score intervals replace the actual test scores. C. each test score must fall in only one test-score interval. D. All of these
D
90.131. Kurtosis refers to this characteristic of a graphed distribution. The characteristic is A. dispersion. B. smoothness. C. symmetry. D. steepness.
D
92.In a normal curve, approximately 68% of all scores fall A. above the mean. B. below the mean. C. between the mean and 1 standard deviation below the mean. D. None of these
D
95.A perfect correlation is indicated by A. +1.00. B. -1.00. C. 0. D. Both +1.00 and -1.00.
D
97.Charles Spearman is credited with A. developing a type of correlation coefficient. B. developing a way to predict the accuracy of a test. C. developing factor analysis. D. All of these
D
