Psych Assessment Chapter 4-6

Ace your homework & exams now with Quizwiz!

A

"Test related behavior predicts non-test related behavior." This statement A. Is an assumption on which the assessment enterprise is based. B. is true only if the test-related behavior mimics the non-test-related behavior C. Is true only if the non-test related behavior mimics the test-related behavior D. All of these

D

"What's a good test?' A good test is one that A. Trained examiners can administer with a minimum of difficulty. B. Is useful in the sense that it yields actionable results C. Will ultimately benefit individual test takers or society as a whole. D. All of these

A

A basic assumption of all scaling is that A. traits and states can be quantified and measured. B. traits and states can best be understood by those who experience them. C. traits and states are only measurable in human beings. D. measures developed in one culture have no meaning in other cultures.

C

A confidence interval is a range or band of test scores that A. has proven test-retest reliability B. is calculated using the standard error of the difference C. is likely to contain the true score

D

A prospective test user may ask many questions about a test's validity. Which of the following is NOT a validity-related question? A. Do the test items adequately sample the range of areas that should be sampled? B. What do high scores on this test really tell us about the targeted construct? C. What do low scores on this test really tell us about the targeted construct? D. As a tool of measurement, is this test consistent

D

A prospective test user may ask many questions about a test's validity. Which of the following is not a validity related question? A. Do the test items adequately sample the range of areas that should be sampled? B. What do high scores on this test really tell us about the targeted construct C. What do low scores on this test really tell us about the targeted construct D. As a tool of measurement, is this test consistent?

A

A psychological trait can BEST be described as A. An observable behavior B. A biological phenomenon C. A construct D. What a test measures

B

A psychological trait is a A. Has a real existence B. is a construct C. Has no relation to observed behavior D. All of these

B

A rater systematically assigns ratings in the middle range, this avoiding extremely positive and negative characterizes this rater's ratings A. Leniency error B. central tendency error C. Severity Error

B

A test designed to provide information about whether or not an aviator has mastered the ability to fly solo is an example of a test that is A. norm-referenced B. Criterion-referenced C. Trait referenced

C

A test is considered bias if A. 50% of the test-takers fail the test B. one group, such as males, consistently performs better than another group. such as females C. A factor inherent in the test systematically prevents accurate measurement

D

All Validity evidence can be interpreted as _____ Validity A. Content B. Criterion-related C. Predictive D. Construct

A

Among school-age children, as age increases, so do reading skills. This relationship between two variables illustrates A. A positive correlation between two variables B. A negative correlation between two variables C. A zero correlation between two variables

A

An applicant for a job with the U.S. Postal service scores in the bottom 5% of all applicants on a test that measures the ability to sort mail. This is an example of A. Norm-referenced testing B. Criterion-referenced testing. C. Stress-Tolerance testing

C

An assumption in psychological testing and assessment is that psychological traits can be measured. For this to be true, A. The reference group must be people who exhibit and do not exhibit the trait B. The test must take into account changes in the trait that can occur over time C. Test developers must clearly define the trait constructs the test purports to measure D. All of these

D

An estimate of test retest reliability is often referred to as coefficient of stability when the time interval between the test and retest is more than A. 30 days B. 60 days C. 3 months D. 6 months

C

Any definition of test fairness as used in a psychometric context would be likely to include reference to A. The percent of items answered correctly by members of different groups B. The mean scores earned by various groups on a particular test C. The degree to which a test is used in an impartial, just and equitable way.

B

Any influence on psychological test scores from factors other than what the test is intended to measure is referred to by measurement specialists as? A. Spurious artifact B. Error C. Bias D. Construct-irrelevant stimuli

B.

As the reliability of a test increases, the standard error of measurement A. Increases B. Decreases C. Remains the same

B

Assessment professionals who use tests without understanding the limitations of the tests they use are A. Violating state and federal laws which mandate such understanding. B. Violating provisions of their professions' ethical code. C. Well meaning but bumbling in their everyday work. D. Psychiatrists rather than psychologists

A

Before constructing a comprehensive final examination that covers everything you have studied since Day 1 of your course Your instructor reviews the objectives of the course, the textbook and all the lecture notes. Your instructor is clearly making a diligent effort to maximize the _____ validity of the final examination. A. Content B. criterion related C. Predictive

C

Coefficient alpha is appropriate to use with all of the following test formats EXCEPT A. True-False B. short-answer for which partial credit is awarded. C. Essay exam with no partial credit awared

C

Face validity refers to A. the most preferred method for determining validity B. Another name for content validity C. the appearance of relevancy of the test items

A

How do states different from traits? A. A trait is biologically determined, whereas a state in environmentally determined. B. A trait is measurable, whereas a state is not. C. A trait is relatively enduring, whereas a state is relatively short-lived. D. A trait is unchangeable, whereas a state is easily influenced.

A

If a newly developed test designed to measure happiness correlates with other tests of happiness but not with tests of sadness, this is referred to as _______ and _______ evidence of validity, respectively A. Convergent; discriminant B. Discriminant; convergent C. Homogeneous; concurrent

c

If new predictors explain something about a predicted score that was not already explained by existing predictors, the new predictor might be praised for its A. test-retest reliability B. incremental validity C. Construct validity

B

In an undergraduate measurement course, an instructor announces that the first examination will cover the topics of reliability and validity. One student in class, Jaiyan, publicly predicts that only questions on reliability will be posed. As it turns out, true to Jaiyan's predictions all of the test questions are only on the topic of reliability. Given this background, which of the following is the most reasonable conclusion that Jaiyam's fellow students could draw. A. The first examination lacked concurrent validity B. The first examination lacked content validity. C. The first examination lacked face validity

B

In psychological testing and assessment, Bias BEST refers to A. random variation in test performance attributable to covert prejudice on the part of the test developer B. Systematic variation in test performance that is unrelated to the construct that the test is intended to be measured C. A test or testing practice that systematically favors the performance of one group of testtakers over another

D

In selecting a test for use, the responsible test user does some advance research on A. How appropriate the available norms are for use with the contemplated testtakers. B. How appropriate the reading level of the test is for use. C. Cultural factors as they may relate to the administration and scoring of the test. D. All of these

A

In the test-retest method to estimate reliability A. the time frame between interviews must be relatively short B. Separate interviews are conducted by certified raters C. A minimum of two re-tests are required

A

It's a group of people whose performance on a particular test is analyzed for reference in evaluating the performance of future testtakers. This group is called A. a normative sample. B. a test validation group. C. a stratified random sample. D. a "back to the future" group.

C

It's a measure of validity that arrived by a comprehensive analysis of how scores on the test relate to other test scores" This statement is a reference to A. Face validity B. Content validity C. Construct validity

D

Item response theory is to latent trait theory as observer reliability is to A. Generalizing theory B. Domain sampling theory C. Odd-even reliability D. Inter-scorer reliability

B

Norm-referenced testing and assessment is both a method of evaluation and a way of deriving meaning from tests scores. Individual testtaker scores are evaluated against A. the scores achieved by a matched group of testtakers on a comparable test. B. the scores of a comparison group of testtakers. C. scores from a demographically more diverse group of testtakers. D. the scores achieved by a guy named norm.

A

Norms come in many different forms. Which of the following is NOT correctly referred to by the term "norms"? A. trait norms B. age norms C. grade norms D. percentile norms

B

Predictive and concurrent validity can be sumsumed under A. Content validity B. Criterion-related validity C. Face Validity

D

Referring to a psychological trait as a construct means that it A. is an idea whose time has come, gone, and come back again. B. has few practical "real-world" applications and is used for academic purposes. C. was developed by naturalistic observation rather than any theory. `D. is a scientific concept developed to describe or explain behavior.

B

Sampling all different subgroups within a population is an example of A. Random sampling B. Stratified sampling C. Incidental sampling D. Purposive sampling

C

Students may fantasize about a world without tests. But in reality, a world without tests would be more of a nightmare than a dream. This is because A. Nepotism in hiring and promotions would be rampant B. Professionals would not be properly credentialed C. Educational difficulties would be more difficult to diagnose D. All of these

B

Test reliability refers to A. How accurately a test measures what it purports to measure B. How consistently a test measures what it purports to measure C. The depth of measurement of a particular construct

C

Test validity refers to how A. True the score is in terms of true score theory B. rigorous the process of norming the test was. C. well a test measures what it purports to measure

C

The definition of a norm-referenced test as compared to a criterion-referenced test differs primarily with respect to A. the error thought to be present in measuremet B. Whether or not the test is theory based C. The score to which testtakers are compared

A

The form of criterion-related validity that reflects the degree to which a test score correlates with a criterion measure that was obtained some time subsequent to the test score is known as A. Predictive validity B. Constructive validity C. Concurrent validity

C

The form of criterion-related validity that reflects the degree to which a test score is correlated with a criterion measure obtained at the same time that the test scores was obtained is known as A. Predictive validity B. Constructive validity C. Concurrent Validity

C

The more homogenous a test is the A. less inter-item consistency it can be expected to have B. more utility the test has for measuring multifaceted variables C. More inter-item consistency it can be expected to have

C

The multiple-choice test items on this examinations are all examples of A. Dichotomous test items B. Latent trait test items C. Polytomous test items

D

The standard error of measurement is A. used to infer how far an observed score is from the true score B. Also known as the standard error of a score C. is used in the context of classical test theory D. All of htese

B

The statistic known as the standard error of measurement is BEST associated with which of the following? A. Validity B. Reliability C. Test standardization

C

The term norms refers to the A. Average score of all test-takers within a standardization of sample B. The typical performance of the norm group C. Scores with which the results of subsequent measure can be compared

B

The term psychometric soundness is typically reference to test's reliability and A. Utility B. Validity C. Norms D. Cost effectiveness

A

The term test heterogeneity BEST refers to the extent to which test items measure A. Different factors B. The same factor C. A unifactorial trait

A

The type of sample that is convenient or available for use is referred to as A. An incidental sample B. A stratified sample C. Coincidental sample

d

This tool is used to estimate or infer how far an observed score deviates from a true score A. Standard deviation B. A measure of central tendency C. The variance D. A standard error of measurement

B

This variety of error has also been referred to as "noise" It is A. Systematic error B. Random error C. measurement error

B

What is the relationship between the correlation coefficient and the standard error of estimate? A. it is positive relationship B. It is an inverse relationship C. No relationship exists

B

What term refers to the degree of correlation between all the items on a scale A. Inter-item homogeneity B. Inter-item consistency C. Inter-item heterogeneity

C

When measuring a particular psychological trait, the term error variance refers to A. The degree to which impression management may influence the obtained score. B. The variations in score possible if another instrument was employed. C. The component of a test score attributable to sources other than the trait. D. All of these

c

Which is NOT a method of evaluating the validity of a test? A. Evaluating scores on the test as compared to scores obtained on other tests B. Evaluating the content of a test C. Evaluating the percentage of passing and failing grades on the test

A

Which is an example of a false positive A. A test identifies a client as schizophrenic when the client is not. B. test correctly identifies a client as schizophrenic C. A test correctly identifies a client as not having schizophrenia

A

Which is term is used to refer to the tendency of a rater to evaluate ratees higher than they objectively deserve because of the rater's inability to discriminate between aspects of the ratee's behavior? A. Halo Effect B. Random error C. Generosity error D. Severity error

C

Which of the following constitutes a recommended sampling technique for use in obtaining a standization sample? A. Volunteer sampling B. Remanded sampling C. Purposive sampling

D

Which of the following is TRUE for estimates of alternate and parallel-forms reliability A. Two test administrations with the same group are required B. Test scores may be affected by factors such as motivation, fatigue, or intervening events like practice, learning or therapy C. Item sampling is a source of error variance D. All of these

C

Which of the following is TRUE for parallel forms of a test A. The means of the observed scores are equal for the two forms B. The variances of the estimated scores are equal for the two forms C. The means and variances of estimated scores are equal for the two forms

B.

Which of the following is the BEST example of a psychological trait? A. Eye color B. Aggressiveness C. Skin color D. None of these

D

Which qualifies as a contruct? A. Depression B. Intelligence C. Mechanical aptitude D. All of these

D

Which source of error variance affects parallel- or alternate form reliability estimates but does not affect test-retest estimates A. fatigue B. learning C. practice D. Item sampling

C

Which type of reiability estimate is obtained by correlating pairs of scores from the same person on the two different administrations of the same test? A. parallel forms estimate B. Split half estimate C. A test retest estimate

A

Which would NOT be useful in estimating a test's inter-item consistency? A. Cronbach's Alpha B. The kuder-richardson formulas C. The average proportional distance D. A coefficient of equivalence


Related study sets

Chapter 9: The Early Middle Ages

View Set

Chapter 8 Gestalt Therapy Practice Questions

View Set

Chapter 13 Test: Endocrine System [PT-1]

View Set