Assessment and Testing
A counselor who fears the client has an organic, neurological, or motoric difficulty would most likely use the
Bender Gestalt II
The Binet stressed age-related tasks. Utilizing this method, a 9-year-old task would be one which
50% of the 9-year-olds could answer correctly.
In most instances, who would be the best qualified to give the Rorschach Inkblot Test?
A clinical psychologist.
The NCE and the CPCE would be examples of a(n) ________ test.
forced choice
A short answer test is a(n) ________ test.
free choice
A new IQ test which yielded results nearly identical to other standardized measures would be said to have
good concurrent validity.
IQ means
intelligence quotient
Appraisal can be defined as
the process of assessing or estimating attributes
Interest inventories are positive in the sense that
they are reliable and not threatening to the test taker.
J. P. Guilford isolated 120 factors which added up to intelligence. He also is remembered for his
thoughts on convergent and divergent thinking.
Both the Rorschach and the Thematic Apperception Test (TAT) are projective tests. The Rorschach uses 10 inkblot cards while the TAT uses
pictures
An aptitude test is to ________ as an achievement test is to ________.
potential; what has been learned
In constructing a test you notice that all 75 people correctly answered item number 12. This gives you an item difficulty of
1.0
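A minimal sketch of the arithmetic behind this answer, assuming item difficulty is simply the proportion of test takers who answered the item correctly. The function name and the response list are illustrative, not part of any published scoring tool.

```python
def item_difficulty(responses):
    """Proportion of test takers who answered the item correctly (0.0 to 1.0)."""
    if not responses:
        raise ValueError("need at least one response")
    return sum(1 for r in responses if r) / len(responses)


# All 75 people answered item 12 correctly, so the proportion is 75/75.
item_12 = [True] * 75
difficulty_item_12 = item_difficulty(item_12)
print(difficulty_item_12)
```

An item that only one person in four answers correctly would come out at .25 under the same calculation.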
The mean on the Wechsler and the Stanford-Binet Intelligence scales (SB5) is ________ and the standard deviation is ________.
100; 15 (earlier editions of the Stanford-Binet used a standard deviation of 16)
The standard error of measurement tells you
how accurate or inaccurate a test score is.
A good practice for counselors is to
never generalize on the basis of a single test score.
A reliable test is ________ valid.
not always
The National Counselor Exam (NCE) is a(n) ________ test because the scoring procedure is specific.
objective
________ did research and concluded that intelligence was normally distributed like height or weight and that it was primarily genetic.
Galton
Most counselors would agree that
more public education is needed in the area of testing.
Infant IQ tests are
more unreliable than those given later in life.
In a projective test the client is shown
neutral stimuli
You are uncertain whether a test is intended for the population served by your not-for-profit agency. The best method of researching this dilemma would be to
read the test manual included with the test.
One major criticism of interest inventories is that
they emphasize professional positions and minimize blue-collar jobs.
Today, the Stanford-Binet IQ test is
a standardized measure.
Test bias primarily results from
a test being normed solely on White middle-class clients
In a cyclical test
you have several sections which are spiral in nature
In a spiral test
the items get progressively more difficult
Which measure would yield the highest level of reliability?
a very accurate postage scale
An achievement test measures maximum performance or present level of skill. Tests of this nature are also called attainment tests, while a personality test or interest inventory measures
typical performance
The group IQ test movement began
with the Army Alpha and Army Beta in World War I.
Clients should know that
a test is merely a single source of data and not infallible.
Lewis Terman
Americanized the Binet
A valid test is ________ reliable.
always
A counselor created an achievement test with a reliability coefficient of .82. Because many clients felt it was too long, the counselor shortened the test but logically assumed that the reliability coefficient would now
be lower than .82
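The card does not name a formula, but the Spearman-Brown prophecy formula is one common way to estimate how reliability changes with test length. The sketch below assumes the test was cut to half its length; that factor is a hypothetical chosen for illustration, since the card does not say how much was removed.

```python
def spearman_brown(reliability, length_factor):
    """Estimate reliability after changing test length by length_factor.

    length_factor = new length / old length (0.5 means the test was halved).
    """
    return (length_factor * reliability) / (1 + (length_factor - 1) * reliability)


original = 0.82
shortened = spearman_brown(original, 0.5)  # assumed: half the items removed
print(round(shortened, 2))
```

Any length factor below 1 yields an estimate lower than the original coefficient, which is consistent with the answer above.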
One problem with interest inventories is that the person often tries to answer the questions in a socially acceptable manner. Psychometricians call this response style phenomenon
social desirability (the right way to feel in society).
A counselor doing research decided to split a standardized test in half by using the even items as one test and the odd items as a second test and then correlating them. The counselor
was testing reliability via the split-half correlation method.
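The odd/even procedure can be sketched as follows. The score matrix is made-up data (five people, six items scored 1 for correct and 0 for incorrect), and the helper names are illustrative.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation between two equal-length lists of numbers."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)


def split_half(scores):
    """Correlate each person's total on odd items with their total on even items."""
    odd_totals = [sum(row[0::2]) for row in scores]   # items 1, 3, 5, ...
    even_totals = [sum(row[1::2]) for row in scores]  # items 2, 4, 6, ...
    return pearson(odd_totals, even_totals)


# Hypothetical data: one row per person, one column per item.
scores = [
    [1, 1, 1, 1, 1, 1],
    [1, 1, 1, 0, 1, 0],
    [1, 0, 1, 1, 0, 0],
    [0, 1, 0, 0, 1, 0],
    [0, 0, 0, 0, 0, 0],
]
r_split_half = split_half(scores)
print(round(r_split_half, 2))
```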
In the field of testing, validity refers to
whether the test really measures what it purports to measure
The NCE is
an achievement test
A client who takes a normative test
can legitimately be compared to others who have taken the test.
Counselors often shy away from self-reports since
clients often give inaccurate answers.
One major testing trend is
computer-assisted testing and computer interpretations
According to Public Law 93-380, also known as the Buckley Amendment, a 19-year-old student attending college
could view her record, which included test data; could view her daughter's infant IQ test given at preschool; and could demand a correction she discovered while reading a file.
A true/false test has ________ recognition items.
dichotomous
Most experts would agree that the Wechsler IQ tests gained popularity, as the Binet
didn't seem to be the best test for adults.
In a culture-fair test
Items are known to the subject regardless of their culture
IQ stands for intelligence quotient, which is expressed by
MA/CA × 100
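A short worked example of the ratio formula, where MA is mental age and CA is chronological age. The ages used are hypothetical.

```python
def ratio_iq(mental_age, chronological_age):
    """Ratio IQ: mental age divided by chronological age, times 100."""
    if chronological_age <= 0:
        raise ValueError("chronological age must be positive")
    return mental_age / chronological_age * 100


# A child with a mental age of 10 and a chronological age of 8.
print(ratio_iq(10, 8))
# Mental age equal to chronological age gives the average score.
print(ratio_iq(9, 9))
```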
Group IQ tests like the Otis-Lennon, the Lorge-Thorndike, and the California Test of Mental Abilities are popular in school settings. The advantage is that
group tests are quicker to administer.
A job test which predicted future performance on a job very well would
have high criterion/predictive validity.
A colleague of yours invents a new projective test. Seventeen counselors rated the same client using the measure and came up with nearly identical assessments. This would indicate
high reliability
A counselor peruses a testing catalog in search of a test which will repeatedly give consistent results. The counselor
is interested in reliability
A counseling test consists of 300 forced response items. The person taking the test can take as long as he or she wants to answer the questions.
this is most likely a power test
The most critical factors in test selection are
validity and reliability
In an ipsative measure the person taking the test must compare items to one another. The result is that
you cannot legitimately compare two or more people who have taken an ipsative test.
Your client, who is in an outpatient hospital program, is keeping a journal of irrational thoughts. This would be
an informal assessment technique.
The word psychometric means
any form of mental testing
The WAIS-IV is given to 100,000 individuals in the United States who are picked at random. A counselor would expect that
approximately 68% would score between 85 and 115.
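The 68% figure can be checked with a normal curve, assuming scores follow a normal distribution with a mean of 100 and a standard deviation of 15. This is a sketch using Python's standard library, not a description of how the test is actually normed.

```python
from statistics import NormalDist

# Assumed model: scores are normally distributed, mean 100, SD 15.
iq = NormalDist(mu=100, sigma=15)

# Share of people within one standard deviation of the mean (85 to 115).
share_within_one_sd = iq.cdf(115) - iq.cdf(85)
expected_count = round(100_000 * share_within_one_sd)

print(f"{share_within_one_sd:.1%} of scores, about {expected_count} people")
```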
The ________ index indicates the percentage of individuals who answered each item correctly.
difficulty
Simon and Binet pioneered the first IQ test around 1905. The test was created to
discriminate children without an intellectual disability from children with an intellectual disability.
A test format could be normative or ipsative. In the normative format
each item is independent of all other items.
Construct validity refers to the extent that a test measures an abstract trait or psychological notion. An example would be
ego strength
One method of testing reliability is to give the same population alternate forms of the identical test. Each form will have the same psychometric/statistical properties as the original instrument. This is known as
equivalent or alternate forms reliability.
Which is more important, validity or reliability?
validity
Your supervisor wants you to find a new personality test for your counseling agency. You should read
professional journals; the Buros Mental Measurements Yearbook; and classic textbooks in the field as well as test materials produced by the testing company.
The counselor who favors projective measures would most likely be a
psychodynamic clinician
Short answer tests and projective measures utilize free response items. The NCE and the CPCE use forced choice or so-called ________ items.
recognition
The 16 PF reflects the work of
Raymond B. Cattell
Today the Stanford-Binet is used from age 2 to adulthood. The IQ formula has been replaced by the
SAS (standard age score)
A test battery is considered
a horizontal test
The Myers-Briggs Type Indicator reflects the work of
Carl Jung
The MMPI-2 is
a standardized personality test.
A new IQ test has a standard error of measurement (SEM) of 3. Tom scores 106 on the test. If he takes the test a lot, we can predict that about 68% of the time
Tom will score between 103 and 109.
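The range in this answer is the obtained score plus or minus one SEM. A minimal sketch, with an illustrative function name; the optional `z` parameter is an assumption added to show how a wider band could be formed.

```python
def score_band(score, sem, z=1):
    """Range of score minus/plus z standard errors of measurement."""
    return (score - z * sem, score + z * sem)


# Tom scores 106 on a test with an SEM of 3; one SEM covers about 68% of retests.
low, high = score_band(106, 3)
print(low, high)
```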
The best IQ test for a 22-year-old single male would be the
WAIS-IV
The best intelligence test for a sixth-grade girl would be the
WISC-IV
The best intelligence test for a kindergartner would be the
WPPSI-IV
________ would be an informal method of appraisal.
a checklist
One future trend which seems contradictory is that some experts are pushing for
a greater reliance on tests while others want to rely on them less.
A reliability coefficient of 1.00 indicates
a perfect score which has no error.
A word association test would be an example of
a projective test
Tests are often classified as speed tests versus power tests. A timed typing test used to hire secretaries would be
a speed test
The same test is given to the same group of people using the test-retest reliability method. The correlation between the first and second administration is .70. The true variance (i.e., the percentage of shared variance or the level of the same thing measured in both) is
49%
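The 49% comes from squaring the correlation, which gives the proportion of shared variance. A one-line check:

```python
r = 0.70
shared_variance = r ** 2  # proportion of variance the two administrations share

print(f"{shared_variance:.0%}")
```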
An excellent psychological or counseling test would have a reliability coefficient of
.90
One method of testing reliability is to give the same test to the same group of people two times and then correlate the scores. This is called
test-retest reliability.
A researcher working with a personality test discovers that the test has a reliability coefficient of .70 which is somewhat typical. This indicates that
70% of the score is accurate while 30% is inaccurate.
A counselor who had an interest primarily in testing would most likely be a member of
AARC
Face validity refers to the extent that a test
looks or appears to measure the intended attribute
The first intelligence test was created by
Alfred Binet and Theodore Simon.
A career counselor is using a test for job selection purposes. An acceptable reliability coefficient would be ________ or higher.
.80
Francis Galton felt intelligence was
a unitary faculty.
A counselor can utilize psychological tests to help secure a ________ diagnosis if third-party payments are necessary.
DSM or ICD
You want to admit only 25% of all counselors to an advanced training program in psychodynamic group therapy. The item difficulty on the entrance exam for applicants would be best set at
.25
The Black versus White IQ controversy was sparked mainly by a 1969 article written by ________.
Arthur Jensen
An aptitude test predicts future behavior while an achievement test measures what you have mastered or learned. In the case of a test like the ________ the distinction is unclear.
GRE
Which method of reliability testing would be useful with an essay test but not with a test of algebra problems?
Inter-rater/inter-observer.
The ________ are examples of aptitude tests.
O*NET Ability Profiler and the MCAT
An interest inventory would be least valid when used with
an eighth-grade male with an IQ of 136.
When a counselor tells a client that the Graduate Record Examination (GRE) will predict her ability to handle graduate work, the counselor is referring to
predictive validity
A test can be defined as a systematic method of measuring a sample of behavior. Test format refers to the manner in which test items are presented. The format of an essay test is considered a(n) ________ format.
subjective
A counselor is told by his supervisor to measure the internal consistency reliability (i.e., homogeneity) of a test but not to divide the test in halves. The counselor would need to utilize
the Kuder-Richardson coefficients of equivalence.
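As a sketch of one such coefficient, the KR-20 formula works on items scored right or wrong and needs no split into halves. The data below are hypothetical, and this version assumes the population variance of total scores; some texts use the sample variance instead, which changes the result slightly.

```python
from statistics import pvariance


def kr20(scores):
    """KR-20 internal consistency for items scored 1 (correct) or 0 (incorrect).

    scores: one row per person, one column per item.
    """
    n_people = len(scores)
    n_items = len(scores[0])
    total_variance = pvariance([sum(row) for row in scores])
    if total_variance == 0:
        raise ValueError("total scores have no variance")
    # Sum of p * q across items, where p is the proportion answering correctly.
    sum_pq = 0.0
    for item in range(n_items):
        p = sum(row[item] for row in scores) / n_people
        sum_pq += p * (1 - p)
    return (n_items / (n_items - 1)) * (1 - sum_pq / total_variance)


# Hypothetical data: five people, six items.
scores = [
    [1, 1, 1, 1, 1, 1],
    [1, 1, 1, 0, 1, 0],
    [1, 0, 1, 1, 0, 0],
    [0, 1, 0, 0, 1, 0],
    [0, 0, 0, 0, 0, 0],
]
kr20_value = kr20(scores)
print(round(kr20_value, 3))
```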
In a counseling research study, two groups of subjects took a test with the same name. However, when they talked with each other they discovered that the questions were different. The researcher assured both groups that they were given the same test. How is this possible?
the researcher gave parallel forms of the same test