Testing & Measurement Exam 2
In 2002, the _____ of the Graduate Record Examination Test (GRE) was (were) changed from a multiple-choice format. A. Analytical Reasoning section B. Subject Tests C. Verbal section D. Quantitative section
A. Analytical Reasoning section
The standardization sample of the 1916 Stanford-Binet scale was inadequate in that A. it was comprised exclusively of white children from California B. it was obtained in France but was used in testing American children C. It was comprised exclusively of children from rural areas D. Only children between ages of 6 and 12 were represented
A. It was comprised of white children from California
Which of the following Wechsler subtests is one of the most stable measures of general intelligence? A. Digit Span B. Vocabulary C. Comprehension D. Information
B. Vocabulary
The ability test score of a ten-year-old Stacy is most likely to be highest if her examiner: A. is familiar to her, does not initiate conversation, and does not comment on her performance B. is unfamiliar to her, does not initiate conversation and does not comment on her performance C. is familiar to her, initiates conversation, and praises her for her performance D. is unfamiliar to her, initiates conversation, and praises her performance
C. is familiar to her, initiates conversation, and praises her for her performance
The Columbia Mental Maturity Scale-3rd edition (CMMS) is a measure of A. intellectual functioning among individuals with sensory, physical, or language deficits B. nonintellectual factors such as motivation and anxiety that influence intelligence test performance C. cognitive and achievement deficits associated with learning disabilities among children D. mild, moderate, sever, and profound levels of mental retardation
A. intellectual functioning among individuals with sensory, physical, or language deficits
Questions such as, "A man sells twelve apples at 25 cents apiece. How much money does he make?" might be found on the _____ subtest of the ______ scale of the WAIS-IV. A. Arithmetic; Performance B. Digit Span; Performance C. Arithmetic; Verbal D. Digit Span; Verbal
C. Arithmetic; Verbal
The Law School Admissions Test (LSAT) contains all of the following types of problems except A. reading comprehension B. analytical reasoning C. ethical reasoning D. logical reasoning
C. Ethical reasoning
The Illinois Test of Psycholinguistic Abilities (ITPA-3) is based on the notion that learning problems can occur as a result of problems in any one of three stages of information processing, including A. encoding, storage, and retrieval B. sensation, perception, and action C. input, information analysis, and response D. Stimulus, response, modification
C. Input, information analysis, and response
Which of the following is FALSE with regard to the psychometric properties of the 2003 (Fifth) edition of the Stanford-Binet? A. Internal consistency reliabilities for the three IQs are all above 0.90 B. Test-retest reliabilities are strong, but vary according to age level and time interval C. Interscorer agreement was relatively low with average coefficients of 0.50 - 0.60 D. Adequate convergent validity with other intelligence test has been established
C. Interscorer agreement was relatively low with average coefficients of 0.50-0.60
If Dr. Hamline wants to know whether the most difficult items on the statistics test she created were answered correctly only by the top-performing students in her class, she would need to assess: A. item difficulty B. the guessing threshold C. Item discriminability D. the antimode
C. Item discriminability
Which of the following is NOT an advatage of many alternative individual ability tests over the Wechsler and Binet scales? A. nonverbal test administration is possible B. less influenced by scholastic achievement C. stronger standardization samples D. better at assessing people with sensory deficits
C. Stronger standardization samples
Which of the following is NOT a potential problem created by the inclusion of ineffective distractors on a test? A. They can decrease the reliability of a test B. They can give clues to examinees about the correct response C. They can decrease the time examinees spend on items D. They can decrease the validity of the test
C. They can decrease the time examinees spend on items
Which of the following tests is an individual scale of achievement that purports to measure reading, spelling, and arithmetic? A. Kaufman Assessment Battery for Children-2nd edition (KABC-II) B. Leiter International Performance Scale-Revised (LIPS-R) C. Wide Range Achievement Test-4 (WRAT-4) D. McCarthy Scales of Children's Abilities (MSCA)
C. Wide Range Achievement Test-4
Which of the following factors is NOT measured by the Torrance Test of Creativity? A. flexibility B. originality C. fluency D. spontaneity
D. Spontaneity
For which group of students does the SAT and other college entrance test have difficulty predicting college grades? A. Students who score in the lower ranges B. Students who score in the middle ranges C. Students who score in the higher ranges D. All students
Students who score in the middle ranges
Which of the following concepts did Binet incorporate into the first two versions (1905 and 1908) of the Binet-Simon scale of intelligence? A. Intelligence quotient B. Mental age C. Crystallized intelligence D. Deviation IQ
B. Mental age
Which of the following is NOT an example of a visiographic test? A. Benton Visual Retention Test B. Peabody Picture Vocabulary Test C. Bender Visual Motor Gestalt Test D. Memory-for-Designs Test
B. Peabody Picture Vocabulary Test
Which of the following is NOT true of the most recent version of the SAT? A. It is 45 minutes shorter than the original SAT B. The verbal section of the SAT emphasizes reading comprehension C. The test may disadvantage students with learning disabilities such as ADHD D. The writing section consists of both an essay and multiple choice questions
C. The test may disadvantage students with learning disabilities such as ADHD
The knowledge you have acquired through your academic studies would best be described in terms of A. crystallized intelligence B. g C. IQ D. fluid intelligence
A. .crystallized intelligence
On all the Wechsler scales, subtest scores have a mean of ___ and a standard deviation of ___, whereas the FUll Scale IQ has a mean of _______ and a standard deviation of _____. A. 10, 3; 100, 15 B. 15, 5; 500, 100 C. 100, 15; 10, 3 D. 10, 3; 50, 10
A. 10, 3: 100, 15
A test comprised of items to which responses can vary on a 6-point scale ranging from "Strongly agree" to "strongly disagree" is an example of.... A. a ploytomous format B. a Likert-scale format C. a category format D. a dichotomous format
B a likert-scale
Researchers have found that a ___________ scale on a test employing a category format is sufficient for discriminating among individuals. A. 5 point B. 10 point C. 20 point D. 100 point
B. 10 point
The WISC-IV is designed for children between the ages of approximately A. 3 and 18 B. 5 and 18 C. 6 and 16 D. 3 and 12
B. 5 and 18
Dr. Marcus analyzed items on his physics test by calculating the proportion of students who got each item correct. Dr. Marcus was examining: A. item difficulty B. the guessing threshold C. item discriminability D. the antimode
C item discriminability
On a multiple choice test with four response options, the optimum item difficulty level is about: A. 0.250 B. 0.500 C. 0.625 D. 0.875
C. 0.625
The current version of the SAT Reasoning Test was first administered in A. 1994 B. 2000 C. 2005 D. 2007
C. 2005
Which of the following subtests is included on the WPPSI-III, but not the WAIS-III or the WISC-IV? A. Object assembly B. Coding C. Animal Pegs D. Symbol Search
C. Animal Pegs
A study of 22 graduate students found that scoring errors on the WAIS-R diminished only after the students had completed ____ or more practice sessions. A. 3 B. 5 C. 8 D. 10
D. 10
If a 10-year-old child was foudn to have a mental age of 5 on the 1916 Stanford-Binet scale, the child's intelligent quotient (IQ) would be: A. 200 B. 150 C. 100 D. 50
D. 50
Both the SAT and the GRE have standard mean scores of ___ and the standard deviations of ___. A. 40; 2 B. 50; 10 C. 100; 15 D. 500; 100
D. 500; 100
Which of the following characteristic curves represents the best item? A. A curve that rises steadily to the midpoint of test performance and then falls steadily to the highest performance levels (i.e., an inverted U-shaped curve) B. A curve that is flat to the midpoint of test performance, rises sharply at the midpoint, and then flattens out at the highest performance levels C. A curve that drops steadily to the midpoint of test performance and then rises steadily to the highest performance levels (i.e., a U-shaped curve) D. A curve that rises steadily and smoothly to the highest performance levels
D. A curve that rises steadily and smoothly to the highest performance levels
A test comprised of items such as "I usually feel rested when I wake up in the morning" to which examinees must respond "True" or "False" is an example of: A. A ploytomous format B. A Likert-scale format C. A category format D. A dichotomous format
D. A dichotomous format
A central problem of the 1937 revision of the Stanford-Binet scale was that: A. each age group in the standardization sample was comprised of 30 or fewer children B. The age range of examinees for whom the test was appropriate decreased significantly C. The reliability coefficients were approximately the same across age groups. D. Different age groups showed significant differences in the standard deviation of IQ
D. Different age groups showed significant differences in the standard deviation of IQ
IDEA provides for A. free intelligence testing for low income and disadvantaged children B. access to achievement test preparation programs for all children C. free, daily, private tutoring for children with learning disabilities D. Free and appropriate special education services for all learning disabled children
D. Free and appropriate special education services for all learning disabled children
Which of the following is NOT a group test of intelligence? A. The Cognitive Abilities Test (COGAT) B. The Hemnon-Nelson Test (H-NT) C. Miller Analogies Test (MAT) D. Kuhlman-Anderson Test (KAT)
D. Kuhlman-Anderson Test (KAT)
Reactivity results in A. more accurate observer ratings B. less accurate observer ratings C. More test-taker deception D. Less test-taker deception
A. More accurate observer rating
In one study, graduate students were told that ambiguous responses to intelligence test items were given by either "bright" or "dull" people. In fact, the responses were exactly the same. When they scored the tests, the graduate students gave more credit to responses they believed were from bright examinees. The findings of this study illustrate: A. expectancy effects B. reactivity C. Incentive effects D. Drift
A. Expectancy effects
Which of the following tests provides both an ability/intelligence score and a corresponding achievement score? A. Kaufman Assessment Battery for Children-2nd Edition (KABC-II) B. Leiter International Performance Scale-Revised (LIPS-R) C. Illinois Test of Psycholinguistic Abilities (ITPA) D. McCarthy Scales of Children's Abilities (MSCA)
A. Kaufman Assessment Battery for Children-2nd Edition
Which of the following is NOT correct with regard to the psychometric properties of the WAIS-III? A. Test-retest reliabilities of the Verbal, Performance, and Full Scale IQs are all very strong B. Test-retest reliabilities of the subtests are all very strong C. The standard error of measurement (SEM) for the Full Scale and Verbal IQs are smaller than the SEM for the Performance IQ D. Internal consistency reliabilities for the non-speeded subtests are very strong
B. Test-retest reliabilites of all the subtests are all very strong
Which of the following is considered an advantage of group ability tests over individual ability tests? A. They allow the examiner to closely observe the examinee's behavior in a standardized setting B. They have more objective and reliable scoring procedures C. They provide considerable information about examinees beyond the test scores D. They require a high level of skill and training to adminster
B. They have more objective and reliable scoring procedures
Although Hal was trained in a standardized method of behavioral observation six months ago, his ratings of the same behavior have changed over time. This illustrates the problem of observer _____________. A. expectancies B. drift C. reactivity D. bias
B. drift
Tests based on Item Response Theory (IRT): A. yield scores reflecting the level of difficulty of items examinees were able to answer correctly B. Are less adaptable to computer administration than a traditional test C. May be more biased toward examinees who are slow in completing tests D. Define total test scores in term of the number of items examinees answered correctly
A. yield scores reflecting the level of difficulty of items examinees were able to answer correctly
In the context of psychological assessment, the halo effect is an example of a A. subject variable B. scoring or rating error C. Standardized procedure D. self-serving bias
B. scoring or rating error
________ is a standard score with a mean of 100 and a standard deviation of 16 (later 15) that was first introduced in the 1960 revision of the Stanford-Binet. A. The intelligence quotient (IQ) B. Mental age (MA) C. The deviation IQ D. g
C. the deviation IQ
The concept of g refers to A. the degree to which intelligence is genetically determined B. The level of giftedness demonstrated by examinees on an intelligence test C. The view that one general mental ability factor underlies all intelligent behavior D. the notion that gradation of performance are reflected in intelligence test scores
C. the view that one general mental ability factor underlies all intelligent behavior
Research on the effects of the examiner's race on children's IQ test performance indicates: A. substantial negative effects when the examiner is African-American and the child is white B. substantial negative effects when the examiner is white and the child is African-American C. substantial positive effects when the examiner and the child are the same race D. Minimal, if any, effects
D. Minimal, if any, effects
A recent innovation in the WISC-IV is the use of _________ to identify item biases. A. opinions of experts in the field of intelligence testing B. judges who were trained to recognize bias C. Content analysis D. Statistical analysis
D. Statistical analysis
Administration of the modern Stanford-Binet requires examiners to continue testing until A. the examinee passes all items on the routing tests B. the examinee fails all items on the routing tests C. The examinee's basal level is reach D. The examinee's ceiling is reached
D. The examinee's ceiling is reached
One of the primary concerns about the use of integrity tests is A. their remarkable ability to predict future criminal behavior B. they all require very expensive equipment and extensive training C. that scores on these tests are highly correlated with IQ D. the number of false positives identified by the test
D. The number of false positives identified by the test
Wechsler's criticisms of Binet scales related to A. the Binet scale's use of a point scale rather than an age scale B. the lack of validity of Binet scale items designed for children C. the failure of Binet to include any speeded (or timed) items D. the Binet scale's inadequate measurement of adult intelligence
D. the Binet scale's inadequate measurement of adult intelligence
Computer-assisted test administration A. is generally less reliable than traditional assessment B. Requires that the same items be presented in the same order to test takers. C. Typically yields test scores somewhat lower than those on paper-and-pencil tests D. yield lower rates of scoring errors than traditional assessment procedures
D. yield lower rates of scoring errors than traditional assessment procedures
On the WAIS-IV, Digit Span and Arithmetic comprise the _____ Index, whereas Block Design, Matrix Reasoning, and Visual Puzzles comprise the ______ Index. A. Working Memory; Perceptual Reasoning B. Verbal Comprehension; Working Memory C. Processing Speed; Perceptual Reasoning D. Perceptual Reasoning, Verbal Comprehension
A. Working Memory; Perceptual Reasoning
If a correction for guessing is used on a test, it means that: A. test scores will be adjusted to take into account the percentage of items examinees are expected to answer correctly by chance alone. B. Examinees who guess on items they do not know the answer to will receive lower test scores than if they had simply left the unknown items blank C. Test scores are curved to reflect the percentage of examinees who are expected to guess on 50% or more of test items D. Test scores of examinees who make random guesses will be artificially inflated compared to the scores of those who do not make random guesses.
A. test cores will be adjusted to take into account the percentage of items examinees are expected to answer correctly by chance alone
Lucy is being administered a nonverbal ability test consisting of a series of designs or patterns with missing parts. She must select the part that fits the design or completes the pattern from as many as eight choices. Lucy is being administered: A. the Raven Progressive Matrices (RPM) B. the Armed Services Vocational Aptitude battery (ASVAB) C. The General Aptitude Test Battery (GATB) D. the Goodenough-Harris Drawing Test
A. the Raven Progressive Matrices (RPM)
The gf-gc theory of intelligence is A. the basis of the original and all subsequent revisions of the Binet scales of intelligence B. a hierarchical model on which only the later versions of the Stanford-Binet are based C. a single-factor model introduced by Spearman and used as the basis of the 2003 revision of the Stanford-Binet D. no longer a viable model of intelligence in contemporary times
B. A hierarchical model on which only the later versions of the Stanford-Binet are based
Which of the following has NOT been found with regard to the effects of reinforcement on test performance? A. Culturally-relevant verbal reinforcement has a significant effect on IQ test scores of African-American children. B. Children's socioeconomic class and gender are related to effects of reinforcement on test performance C. Effects of tangible reinforcements such as money and candy are significantly greater than effects of verbal praise on test performance D. None of the above reflects the research findings on the effects of reinforcement on test performance
B. Children's socioeconomic class and gender are related to effects of reinforcement on test performance
Which of the following scales of intelligence in infants and young children employs a developmental quotient (DQ) which is evaluated by assessing the presence of absence of behavior associated with maturation? A. Brazelton Neonatal Assessment Scale (BNAS) B. Gessell Developmental Schedules (GDS) C. Bayley Scales of INfant Development - 3rd edition (BSID-III) D. Cattell Infant Intellignece Scale (CIIS)
B. Gessell Developmental Schedules (GDS)
Which of the following is true with regard to the psychometric properties of the GRE? A. Test-retest and internal consistency reliabilities are less than adequate B. Predictive validity regarding first-year graduate school grades is less than adequate C. The achievement of older students is over-predicted D. The achievement of younger students is under-predicted
B. Predictive validity regarding first-year graduate school grades is less than adequate
An examiner administrating the WAIS-IV asks Julie a series of questions such as "In what way are the sun and the moon alike?" The examiner is administering the ______ subtest. A. Comprehension B. Similarities C. Vocabulary D. Information
B. Similarities
Which of the following scales is a measure of ability in children that has been employed in studies of early intervention of at at-risk children, the effects of mothers' smoking and exposure to lead on children's abilities, and cognitive abilities among Mexican-American children? A. Kaufman Assessment Battery for Children - 2nd Edition (KABC-II) B. Leiter International Performance Scale-Revised (LIPS-R) C. Wide Range Achievement Test-4 (WRAT-4) D. McCarthy Scales of Children's Abilities (MSCA)
D. McCarthy Scales of Children's Abilities