Intro to Psy testing Quizzes

¡Supera tus tareas y exámenes ahora con Quizwiz!

What was the first published tests of mental ability? 1)Wechsler-Bellevue Intelligence Scale 2)The Rorschach Inkblot Test 3)Binet-Simon Scale 4)The Personal Data Sheet

Binet-Simon Scale

When experts meet to rate how essential each test question is to the test, their data can be used to provide evidence of validity by calculating a 1) content validity ratio 2) correlation coefficient 3) mean rating 4) frequency table of ratings

Content validity ratio

According to the text, how many steps are involved in demonstrating evidence of validity based on test content during test development? 1) 6 2) 4 3) 3 4) 5

Most measurement experts think in terms of how many levels of measurement? 1) 1 2) 4 3) 2 4) 3

What is a sample? 1) A group of respondents who may receive the survey 2) All respondents in the target audience 3) A representative subset of the population 4) Those respondents who return the survey

A representative subset of the population

If you were to categorize demographic data (such as grouping people by gender, race, or place of residence), what level of measurement was used? 1) Ordinal scales 2) Ratio scales 3) Nominal scales 4) Equal interval scales

Nominal scales

What is a method for defining a construct by illustrating its relation to as many other constructs and behaviors are possible? 1) Nomological network 2) Construct explication 3) Multitrait-multimethod design 4) Correlation matrix

Nomological network

What provides us with a standard against which we can compare individual test scores? 1) Norms 2) Descriptive statistics 3) z scores 4) Percentiles

Norms

Which one of the following criteria in an organizational setting is objective? 1) Number of excused absences 2) Sales judgments 3) Supervisor ratings 4) Customer satisfaction surveys

Number of excused absences

Which of the following is observable and measurable, such as the number of accidents on the job, days absent, or disciplinary problems in a month? 1) Objective criterion 2) Internal criterion 3) Validity criterion 4) Subjective criterion

Objective criterion

What do we call changes in test scores resulting from the sequence in which the tests were taken? 1) Practice effects 2) Alternate effects 3) Order effects 4) Fatigue effects

Order effects

A time interval must elapse between the test administration and the criterion measurement when using which method for establishing evidence of validity? 1) Predictive 2) Test-retest 3) Concurrent 4) Content

Predictive

Which of these are smaller-scale assessments in which candidates complete a job-related task such as building a sawhorse or designing a doghouse? 1) Assessment center 2) Work sample 3) Integrity test 4) Behavioral interview

Work sample

Both Walter Dill Scott and Hugo Münsterberg proposed methods for validating tests prior to which war? 1) World War II 2) Vietnam War 3) World War I 4) Korean War

World War I

The slope (b) of the regression line is the expected 1) change in a for each unit of b 2) reliability 3) change in X for every one unit change in Y 4) change in Y for every one unit change in X

change in Y for every one unit change in x

If there was the formula: GPA= 1.4856 +.0014SATV + .0002SATM + .0084HS, what type of equation is this? 1) Cross-validation regression 2) Logistic regression 3) Multiple regression 4) Linear regression

chapter 11

If we administer a test at another time following a validation study to confirm the results of the validation study, what are we trying to do to the test? 1) Confirm differential validity 2) Confirm single group validity 3) Acculturate the test 4) Cross-validate the test

chapter 11

Which of these groups is NOT included in 360˚ feedback? 1) Managers 2) Peers 3) Direct reports 4) None. All of these are included in 360˚ feedback.

chapter 14

In the equation Y' = a + bX, what does "X" represent? 1) Predicted criterion score 2) Slope 3) individual's score 4) Intercept

individual's score

Specific requirements for the test administration location such as privacy, quiet, and comfortable chairs, tables, or desks should be part of the instructions for 1) test takers 2) test administrators 3) test developers 4) scorers

test administrators

Content validity ratios can range from 1) 10 to +10 2) 1 to 0 3) 0 to +1 4) -1 to 1

-1 to 1

A content validity ratio can range between: 1) -.75 to .75 2) -1.25 to 1.25 3) -.50 to .50 4) -1.00 to 1.00

-1.00 to 1.00

If a correlation between a test and criterion is .30, when what would be the coefficient of determination? 1) 9 2) .03 3) 3 4) .09

.09

The Standards describe how many sources of validity evidence? 1) 3 2) 9 3) 5 4) 7

Who coined the term "dragnet empiricism"? 1) Sherer 2) Cronbach 3) Campbell 4) Binet

Cronbach

Which of these is not a part of the Five-Factor Theory of Personality? 1) Openness 2) Neuroticism 3) Conscientiousness 4) Depression

Depression

What mathematical operation, when applied to the reliability of the test, will set the upper limit of the validity coefficient of the test? 1) Square root 2) Cube root 3) Square 4) Cube

Square root

Test Administrators

The person who responds to test questions or whose behavior is being measured is called a: 1) Psychologist 2) Test Taker 3) Test purchaser 4) Test user

Test Taker

When psychological tests are used in clinical and counseling settings, the results 1) are combined with information that is gathered in other ways 2) can be interpreted without additional information from other sources 3) my be freely shared with others because the tests are commercially available 4) can only be reviewed by personnel who have a MD degree

are combined with information that is gathered in other ways

Certification and licensure are credentials individuals obtain to demonstrate they 1) have met the requirements for earning an advanced degree 2) are qualified to administer psychological tests and assessments 3) are qualified to practice in their field 4) are certified to protect the health and safety of the public

are qualified to practice in their field

In survey research, survey validity refers to the degree to which the survey 1) assesses the concept a researcher is measuring 2) correlates with another similar measure 3) has convergent evidence of validity 4) results predict an expected outcome

assesses the concept a researcher is measuring

The criterion of success in college is most often students' 1) classroom test scores 2) grade point average 3) community service activities 4) professors' ratings

grade point average

Items for which the p value falls in the range of .90 to 1.00 are usually considered 1) too difficult 2) somewhat difficult 3) somewhat easy 4) too easy

too easy

A multi-trait multi-method correlation matrix is shown below. What is the correlation of Test of Trait 2/Essay with Test of Trait 3 /Interview and what is its type? 1) .08; heterotrait-heteromethod 2) .40; heterotrait-monomethod 3) .40; heterotrait-heteromethod 4) .08; heterotrait-monomethod

.08; heterotrait-heteromethod

The mean of a distribution of test scores will always have a z score of: 1) 1 2) 2 3) 3 4) 0

If the correlation (r) between a test and a criterion is .04, then the coefficient of determination (r2) is 1) 2 2) 0.02 3) 0.0016 4) 0.16

0.0016

When assessing item difficulty, which p value would yield distributions of test scores with the most variation? Question options: 1) 0.5 2) 0.2 3) 0.9 4) 0.7

0.5

In what year did Walter Dill Scott publish "The Scientific Selection of Salesmen?" 1) 1915 2) 1920 3) 1925 4) 1918

1915

When was the Polygraph Protection Act passed? 1) 1988 2) 1998 3) 2008 4) 1978

1988

During what period of time did portfolios start to become a popular assessment tool in education? 1) 2000s 2) 1990s 3) 1980s 4) 1970s

1990s

According to the text, how many important differences are there between a survey and psychological test? 1) 6 2) 4 3) 8 4) 2

Some scholars believe that psychological tests can be traced back to: 1) 500 AD 2) 1000 AD 3) 1000 BCE 4) 2200BCE

2200 BCE

According to the text, how many steps are involved in construct explication? 1) 5 2) 3 3) 6 4) 4

How many major types of factor analysis procedures are presented in your text? 1) 4 2) 3 3) 2 4) 1

3) 2

T scores always have a mean of: 1) 150 2)50 3) 100 4) 200

If a content validity ratio has a number of 0.00, then what percentage of experts agree that the item is essential? 1) 100% 2) 50% 3) 25% 4) 75%

50%

The mean of a normal distribution always has a percentile of 1) 50. 2) 0. 3) 10. 4) 100.

50.

According to the text, how many primary differences are there among psychological tests? 1) 8 2) 2 3) 4 4) 6

The text lists what they consider to be the most important assumptions for psychological tests. How many are there? 1) 8 2) 2 3) 6 4) 4

How much variance do a test and criterion share if the coefficient of determination is .09? Question 1) 81% 2) .09% 3) 3% 4) 9%

If you wanted an expert to look at your test related to 5th grade level math, who might be the best pick? 1) An I/O psychologist with two children in the 5th grade 2) The principal of the school 3) A 12th grade teacher with 20 years of experience 4) A 5th grade teacher with 10 years of experience

A 5th grade teacher with 10 years of experience

The reliability of a test derives from which factors? 1) A. Reliability within the test 2) B. Reliability across time 3) C. The validity of the test 4) A and B

A and B

When calculating a test of significance, what may be challenging when looking for statistically significant results? Question options: 1) A. Small sample sizes 2) B. Criterion contamination 3) C. A and B 4) D. None of the above

A and B

Which of these is a step involved in gathering theoretical evidence for construct validity? 1) Establish a nomological network 2) Propose experimental hypotheses 3) A and B 4) None of the above

A and B

Which of the following is a benefit of developing competency models? 1) A competency model is most useful for developing high stakes selection programs for police and firefighters 2) A competency model can often be used for multiple different jobs within an organization 3) A competency model will describe the specific tasks that are performed on a job 4) A competency model will focus on the technical skills necessary to perform a job

A competency model can often be used for multiple different jobs within an organization

What is the MMPI-2? 1) A comprehensive, clinically oriented self-report test used for diagnosis and making treatment plans 2) A symptom checklist used for self-diagnosis of minor mental health concerns 3) A pre-employment test used to identify which potential employees have the best mental health functioning 4) A projective, clinically oriented personality test used in residential treatment centers for adolescents

A comprehensive, clinically oriented self-report test used for diagnosis and making treatment plans

What is the SCID-I? 1) A semi-structured clinical interview that covers a wide range of disorders 2) A semi-structured clinical interview that covers a single disorder 3) A structured interview for diagnosing obsessive compulsive disorder 4) A structured interview used to evaluate candidates for jobs

A semi-structured clinical interview that covers a wide range of disorders

If a clinical practitioner used a tool where all of the questions that had to be answered were predetermined, which kind of tool would it be? A) A prognosis B) A structured clinical interview C) A semistructured clinical interview D) A nondirective clinical interview

A structured clinical interview

Which one of the following research studies could yield information about evidence of validity based on the content of a test? 1) A study that involves performing a series of systematic steps during test development 2) A study that involves correlating test scores with another previously identified test 3) A study where test takers provide their opinion on how well test questions relate to the test topic 4) A study that involves correlating test scores with a previously identified performance measure

A study that involves performing a series of systematic steps during test development

What rating scale would be most appropriate for responding to a survey question that asks, "How much time should a student study each week for an undergraduate college course?" 1) A: Not much; B: A little; C: An average amount; D: A lot; E: Almost always 2) A: 0 to 30 minutes; B: 31 minutes to 60 minutes; C: 61 minutes to 120 minutes; D: More than 120 minutes 3) A: 0 to 1 hours; B: 1 to 2 hours; C: 2 to 3 hours; D: More than three hours 4) A: Below average; B: Average; C: Above average; D: No opinion

A: 0 to 30 minutes; B: 31 minutes to 60 minutes; C: 61 minutes to 120 minutes; D: More than 120 minutes

The degree to which an immigrant or a minority member has adapted to a mainstream culture is called: 1) Generalizability 2) Acculturation 3) Differential validity 4)Cultural assimilation

Acculturation

Evidence of validity based on content is most appropriate for what types of tests? 1) Comparative tests that measure personality characteristics 2) Concrete tests that measure aptitude and achievement 3) Achievement tests that measure concrete attributes 4) Personality tests that measure absolute attributes

Achievement tests that measure concrete attributes

Test and survey developers should balance items for which the correct response would be positive with an equal number of items for which the correct response would be negative to control for which bias? 1) Social desirability 2) Faking 3) Random responding 4) Acquiescence

Acquiescence

Which response set is a problem when the test taker has a tendency to agree with any ideas or behaviors presented? 1) Random responding 2) Faking 3) Social desirability 4) Acquiescence

Acquiescence

What could a test developer do to increase internal consistency of a test? 1) Add questions that measure the same concept 2) Adjust the reliability coefficient using the KR-20 formula 3) Accurately measure the test-retest reliability of the test 4) Add well-written questions to each test form

Add questions that measure the same concept

If specialists or administrators use the results of tests to determine where money should be spent and what programs should be implemented to improve the achievement scores of a school, district, state, or nation, what kind of decision has been made? 1) Instructional 2) Program and curriculum 3) Diagnostic 4) Administrative policy

Administrative policy

A child is brought to a clinician because of concerns that the child might be suffering from an autism spectrum disorder. Which one of the following tests might the clinician give the child to aid in the diagnosis? 1) House-Tree-Person 2) Draw a Person 3) Ages and Stages Questionnaire 4) Beck Depression Inventory

Ages and Stages Questionnaire

According to the text, which of these is NOT a "Mode of Induction" according to Albert Bandura's theory? 1) Aggressiveness 2) Live Modeling 3) Participant Modeling 4) Attribution

Aggressiveness

A test manual should include which kinds of information? 1) Information that the test purchaser can use to evaluate the psychometric characteristics of the test 2) Detailed information about proper administration and scoring procedures 3) Information about how to compare test scores with those of norm groups 4) All of the above

All of the above

Generally speaking, which of these could be considered a criterion? Question options: 1) Accidents on the job 2) Absenteeism 3) Disciplinary problems 4) All of the above

All of the above

In a multitrait-multimethod design, which of these is a way that the design collects evidence for construct validity? Question options: 1) Establish evidence for reliability 2) Convergent evidence of validity 3) Divergent evidence for validity 4) All of the above

All of the above

People from multicultural backgrounds could belong to various minority groups based on: 1) Race 2) Family unit 3) Primary language 4) All of the above

All of the above

People from multicultural backgrounds could belong to various minority groups based on: Question options: 1) Race 2)Family unit 3) Primary language 4) All of the above

All of the above

To calculate a 95% confidence interval, what components are needed? 1) An individuals observed test score 2) Plus and minus 1.96 3) The standard error of measurement for the test 4) All of the above

All of the above

What are some reasons why people believe that integrity tests should not be used? 1) They believe that they are not reliable or valid 2) They believe that they are an invasion of privacy 3) They believe that they have a different and more inhibiting effect on minorities 4) All of the above

All of the above

What is a way to help reduce scoring errors? 1) Providing training to raters 2) Checking questions with unusually high numbers of correct or incorrect answers for mistakes in scoring 3) Frequent checking of computations 4) All of the above

All of the above

What kind of educational settings would use tests? 1) Middle schools 2) High schools 3) Universities 4) All of the above

All of the above

When defining the target audience, what is a factor that needs to be considered? 1) The reading level of the audience 2) Whether the audience has any characteristics that would require special test administration or interpretation 3) Whether test takers will be motivated enough to answer test questions honestly 4) All of the above

All of the above

Which of these actions is the ethical responsibility of test publishers? 1) Make test manuals available before tests users purchase their tests 2) Make validity information available before test users purchase their tests 3) Refuse to provide test materials to persons who do not have proper testing credentials 4) All of the above

All of the above

Which of these is a benefit of an assessment center? 1) Research suggests that they are one of the best predictors of job performance 2) There is a relative lack of sex bias 3) There is a relative lack of race bias 4) All of the above

All of the above

Which of these is a way to gather psychometric evidence for construct validity? 1) Experimental interventions 2) Evidence of validity based on content 3) Evidence of validity based on external criteria 4) All of the above

All of the above

Which of these is an abstract construct? 1) Intelligence 2) Beauty 3) Self-Esteem 4) All of the above

All of the above

Which of these is an exercise that you may find in an assessment center? 1) In-basket exercise 2) Role-play exercise 3) Leaderless group discussion 4) All of the above

All of the above

Which of these methods can be used to determine internal-consistency reliability? 1) Spearman-brown formula 2) KR-20 formula 3) Coefficient alpha 4) All of the above

All of the above

Which of these statements related to expectations of self-efficacy is true: 1) Expectations personal efficacy determine whether coping behavior will be initiated 2) Expectations of personal efficacy determine how much effort will be expanded 3) Expectations of personal efficacy determine how long effort will be sustained in the face of obstacles and aversive experiences 4) All of the above

All of the above

Which one of the following test required test takers to follow directions and perform a series of ballet movements that were confusing and distracting? 1) Army's assessment center 2) Stanford-Binet test 3) Wechsler Intelligence Test 4) Alpha and Beta tests

Alpha and Beta tests

What type of reliability is being estimated when someone administers two forms of the test to the same people? 1) Alternate or Parallel forms 2) Internal consistency 3) Interrater reliability 4) Test-retest

Alternate or Parallel forms

The State Board of Education where Joy went to college decided to give every graduate student in a science program with a combined GRE score more than 1400 a $5,000 education grant. What type of decision was this? 1) An absolute individual decision 2) A comparative institutional decision 3) A comparative individual decision 4) An absolute institutional decision

An absolute institutional decision

What is a portfolio? 1) Letters of reference and other external judgments about an individual's work 2) A compilation of an individual's test scores and other performance data for a specific purpose 3) An assortment of an individual's work completed for a specific time period 4) An objective catalog of information about an individual

An assortment of an individual's work completed for a specific time period

Which of the following should scoring instructions include? 1) A test key 2) Scoring procedures 3) An explanation of how the test scores relate to the construct the test measures 4) All of the above

An explanation of how the test scores relate to the construct the test measures

Which of these is NOT a stage involved in the psychological testing process: 1) An individual reports the reliability and validity of the test to the test taker 2) An individual administers a test to the test taker 3) An individual scores the test 4) An individual or group determines a need for psychological testing

An individual reports the reliability and validity of the test to the test taker

What does X stand for in a regression equation? 1) An individual's score on the predictor 2) The intercept on the Y axis 3) The correlation coefficient 4) The number on the Y axis that the linear regression coefficient predicts

An individual's score on the predictor

Which of these statements is true? 1) Reliability and validity are not related 2) An instrument can be valid, but not reliable 3) An instrument that is reliable is always valid 4) An instrument can be reliable, but not valid

An instrument can be reliable, but not valid

Which one of the following characteristics of a test is included in the test plan? 1) The planned date of publication 2) The content of each question on the test 3) An operational definition of the construct 4) A plan for validation

An operational definition of the construct

What is the fourth step in a scientific approach to survey design? 1) Construct the survey 2) Communicate the survey findings 3) Analyze the survey data 4) Administer the survey

Analyze the survey data

Julio conducted a research survey that involved asking students about their academic standards and integrity. He asked the students to not put their name or any other identifying information on the survey. What was he providing the students? 1) Confidentiality 2) Obscurity 3) Reliability 4) Anonymity

Anonymity

Which one of the following statements about the characteristics of a normal probability distribution is FALSE? 1) The curve can continue to infinity, and therefore the right and left tails of the curve will never touch the baseline. 2) Approximately 68.2% of the population will score between the mean and 1 standard deviation. 3) Most people will score near the middle of the distribution, making the center of the distribution the highest point. 4) Most test scores cluster or fall near the middle of the distribution, forming what we refer to as the average or the central tendency.

Approximately 68.2% of the population will score between the mean and 1 standard deviation.

Which test is currently primarily used by the armed forces to help qualify people for certain military branches and certain jobs? 1) Army Alpha 2) Armed Services Vocational Aptitude Battery 3) General Aptitude Test Battery 4) Army Beta

Armed Services Vocational Aptitude Battery

What mental test designed for group testing was developed for test takers who were unable to read, write, or speak English? 1) Army Omega test 2) Army Alpha test 3) Army Beta test 4) Army Alpha-Beta test

Army Beta test

Which of these would have large-scale simulations of the job that would require a test taker to solve typical job problems by role-playing or to demonstrate proficiency at job functions such as making presentations and fulfilling administrative duties? 1) Integrity tests 2) Behavioral interviews 3) Assessment centers 4) Work samples

Assessment centers

Which developmental disability affects communication and social interaction often involving restricted interests and stereotyped, repetitive patterns of behavior? A) Alzheimer's Disease B) Comorbid Disorders C) Autism Spectrum Disorders D) Obsessive Compulsive Disorder

Autism Spectrum Disorders

The Ages and Stages questionnaire is used to diagnose which of the following conditions? 1) Depression 2) Obsessive compulsive disorder 3) Autism spectrum disorders 4) Alzheimer's disease

Autism spectrum disorders

If is test is administered the same way to all people, this demonstrates which characteristic of psychological tests? 1) Behaviors thought to measure an attribute or thought to predict an outcome are representatively sampled 2) Behavior samples are obtained under standardized conditions 3) There are rules for scoring 4) None of the above

Behavior samples are obtained under standardized conditions

Which of these methods would use on-the-job behaviors as anchors for ratings scales that represent job dimensions? 1) Forced ranking 2) Behaviorally anchored rating scales 3) Forced distribution 4) Graphic rating scales

Behaviorally anchored rating scales

Which one of the following is NOT a consequence a test user might experience by violating an ethical standard? 1) Negative consequences to individuals 2) Negative consequences to organizations 3) Expulsion of a member from an association/organization 4) Being tried or sued in a court of law

Being tried or sued in a court of law

Which test scoring model places test takers in a particular group or class? 1) Ipsative model of scoring 2) Categorical model of scoring 3) Cumulative model of scoring 4) Projective model of scoring

Categorical model of scoring

When raters only use the middle of the rating scale and ignore the highest and lowest scale categories, this is called: 1) Halo effect 2) Severity error 3) Central tendency error 4) Leniency error

Central tendency error

Which of these validity methods is used to determine whether a test indeed predicts what it was designed to predict? 1) Construct validity 2) Face validity 3) Content validity 4) Criterion-related validity

Chapter 6

Which one of the following is a difference between clinical and counseling psychologists? 1) Clinical psychologists must have an MD degree, while counseling psychologists have a PhD degree 2) Clinical psychologists usually treat more serious problems, while counseling psychologists usually treat everyday problems 3) Clinical psychologists can prescribe medications, but counseling psychologists cannot 4) Clinical psychologists are covered by a different set licensing requirements than counseling psychologists

Clinical psychologists usually treat more serious problems, while counseling psychologists usually treat everyday problems

In which of these settings would one use psychological tests to help diagnose psychological disorders, plan treatment programs, and determine the effectiveness of treatment programs? 1) Educational settings 2) Clinical settings 3) Organizational settings 4) All of the above

Clinical settings

According to the text, Walter Dill Scott's selection system provided the basis for what type of salespeople? 1) Car 2) Clothing 3) Life insurance 4) Weapons

Clothing

Judy wants to estimate the internal consistency of a survey on which the respondent marks (1) Not at all, (2) Sometimes, (3) Most of the time, or (4) Always. Which one of the following is most appropriate for her to use? 1) KR-20 2) Spearman Brown 3) Coefficient alpha 4) Cohen's kappa

Coefficient alpha

Which method has seen the strongest increase over time? 1) Spearman-brown formula 2) KR-20 formula 3) Coefficient alpha 4) None. They are all used in equal amounts by researchers.

Coefficient alpha

Mental retardation, learning disabilities, and traumatic brain injuries are considered to be: 1) Sensory impairments 2) Motor Impairments 3) Cognitive impairments 4) None of the above

Cognitive impairments

Which of these types of tests measure the test taker's mental capabilities, such as general mental ability tests, intelligence tests, and academic skills tests? 1) Mentality tests 2) Cognitive tests 3) Personality tests 4) Polygraph tests

Cognitive tests

Which one of the following are assessments that measure the test taker's mental capabilities such as general mental ability tests, intelligence tests, and academic skills tests. 1) General pre-employment tests 2) Cognitive tests 3) Personality tests 4) Skills tests

Cognitive tests

According to the text, a typical job analysis would NOT include which of the following job factors? 1) Skills 2) Competencies 3) Knowledge 4) Abilities

Competencies

What is a powerful test that clinical psychologists can use if they want to gather information about the personality and emotional functioning of their patient? A) Projected Clinically Oriented Personality Tests B) Structured Interviews C) Comprehensive Clinically Oriented Self-Report Tests D) Behavior Rating Scales

Comprehensive Clinically Oriented Self-Report Tests

When pretesting a survey, you ask respondents to describe their thoughts when answering survey questions. What method are you using? 1) Retrospective "think aloud" interview 2) Confidence ratings 3) Paraphrasing 4) Concurrent "think aloud" interview

Concurrent "think aloud" interview

What is a method for establishing evidence of validity based on a test's relationships with other variables in which test administration and criterion measurement happen at roughly the same time? 1) Divergent evidence of validity 2) Concurrent evidence of validity 3) Predictive evidence of validity 4) Convergent evidence of validity

Concurrent evidence of validity

What is the method for establishing evidence of validity based on a test's relationships with other variables in which test administration and criterion measurement happen at roughly the same time? 1) Concurrent evidence of validity 2) Divergent evidence of validity 3) Convergent evidence of validity 4) Predictive evidence of validity

Concurrent evidence of validity

Which method of demonstrating evidence of validity do test developers use when the test scores and criterion scores are collected at approximately the same time? 1) Predictive method 2) Concurrent method 3) Convergent method 4) Content method

Concurrent method

What is the statistical procedure that would be used when researchers consider a theory associated with a test and propose a set of underlying factors that they expect the test will contain? 1) Multitrait-multimethod design 2) Confirmatory factor analysis 3) Exploratory factor analysis 4) Dragnet empiricism

Confirmatory factor analysis

Which one of the following processes requires researchers to first specify a hypothesis? 1) Content validity study 2) Exploratory factor analysis 3) Multitrait-multimethod study 4) Confirmatory factor analysis

Confirmatory factor analysis

Which of these validity methods refers to evidence that demonstrates that a test is measuring the underlying hypothetical (not directly observable) concept that the test was designed to measure? 1) Construct validity 2) Content validity 3) Criterion-related validity 4) Face validity

Construct validity

Which of the following are considered "traditional" terms that are no longer used in the most recent version of the Standards for Educational and Psychological Testing? 1) Content and criterion-related validity 2) Reliability and precision 3) Internal consistency and test-retest reliability 4) Test developer and test publisher

Content and criterion-related validity

Evidence of validity that reflects the extent to which the questions of a test are representative of the attribute being measured is called: 1) Criteria 2) Criterion-related validity 3) Construct validity 4) Content validity

Content validity

Which type of validity is carried out as the test is developed? 1) Construct validity 2) Content validity 3) Criterion-related validity 4) External validity

Content validity

What is a method for establishing evidence of validity based on a test's relationships with other variables in which test administration and criterion measurement happen at roughly the same time? 1) Predictive evidence of validity 2) Divergent evidence of validity 3) Concurrent evidence of validity 4) Convergent evidence of validity

Convergent evidence of validity

What is the method for establishing evidence of validity that show that other or similar constructs that theoretically should be related to the construct measured by a test in question are indeed related? 1) Predictive evidence of validity 2) Concurrent evidence of validity 3) Convergent evidence of validity 4) Divergent evidence of validity

Convergent evidence of validity

When the scores on a test correlate strongly with scores on other tests that measure the same construct, we say the test has 1) Discriminant evidence of validity 2) Evidence of validity based on relations with external criteria 3) Convergent evidence of validity 4) Evidence of validity based on content

Convergent evidence of validity

Which two validation strategies below help us to establish evidence of construct validity? 1) Construct explication and nomological networks 2) Coefficient alpha and test-retest reliability 3) Convergent evidence of validity and discriminant evidence of validity 4) Analysis of variance and multi-trait, multi-method analysis

Convergent evidence of validity and discriminant evidence of validity

When calculating the slope of a regression line, the letter r represents the: 1) Beta 2) Mean distribution of X 3) Intercept 4) Correlation coefficient

Correlation coefficient

What is the table in which the same tests and measures are listed in the horizontal and vertical headings and correlations of the tests are shown in the body of the table? 1) Correlation matrix 2) Nomological network 3) Factor analysis table 4) None of the above

Correlation matrix

Which of these is NOT considered to be a concrete attribute? 1) Creativity 2) Playing the piano 3) Writing a paper 4) Kicking a ball

Creativity

Outcomes that a test has been designed to predict (such as job performance or success in a training program) are called: 1) Criteria 2) Reliability 3) Constructs 4) Validity

Criteria

The measure of performance that is correlated with test scores is called the: 1) Predictor 2) Validity coefficient 3) Reliability coefficient 4) Criterion

Criterion

If a criterion measures more dimensions than those measured by the test, what is probably present? 1) Criterion contamination 2) Criterion deficiency 3) Restriction of range 4) None of the above

Criterion contamination

What do we call an evaluative standard that researchers use to measure performance, attitude, or motivation? 1) Utility measure 2) Criterion measure 3) Reliability measure 4) Validity measure

Criterion measure

Which one of the following types of tests will you take to obtain your driver's license? 1) Norm-referenced 2) Criterion-referenced 3) Objective 4) Subjective

Criterion-referenced

If a statement said "Zachary has learned 50% of the material he needs to know to demonstrate proficiency in third-grade mathematics," this test is an/a: 1) Norm-referenced test 2) Authentic test 3) Difference test 4) Criterion-referenced test

Criterion-referenced test

Which test-scoring model assumes that the more the test taker responds in a particular fashion, the more the test taker exhibits the attribute being measured? 1)Cumulative model of scoring 2) Ipsative model of scoring 3) Projective model of scoring 4) Categorical model of scoring

Cumulative model of scoring

According to the text, when a test developer makes a list of the characteristics of persons who will take the test, particularly those characteristics that affect how test takers will respond to the test questions, they are helping to: 1) Define the target audience 2) Define the test purpose 3) Define the testing universe 4) Develop a test plan

Define the target audience

According to the text, when a test developer is preparing a working definition of the construct that the test will measure, they are helping to: 1) Develop a test plan 2) Define the test purpose 3) Define the target audience 4) Define the testing universe

Define the testing universe

What is the first step in test development? 1) Defining the testing universe, audience, and purpose 2) Writing the administration instructions 3) Developing a test plan 4) Composing the test items

Defining the testing universe, audience, and purpose

Which test measures the four developmental areas of: personal-social, fine motor, language, and gross motor skills? A) Denver Developmental Screening Test B) Gesell Developmental Schedules C) Ages and Stages Questionnaire D) Bayley Scales of Infant and Toddler Development

Denver Developmental Screening Test

Two 1st-year college students, Carlos and Carl, are used to taking different types of academic tests. Carlos has mostly taken essay tests, and Carl has mostly taken multiple choice tests. What problem may arise when they take the same tests in college? 1) Depending on the type of tests given, there may be construct bias present in the test scores 2) Depending on the type of tests given, there may be method bias present in the test scores 3) Depending on the type of tests given, there may be method bias present in the test scores 4) Depending on the type of tests given, there may be reliability bias present in the test scores

Depending on the type of tests given, there may be method bias present in the test scores

Which one of the following is most likely to be the criterion measure for establishing evidence of validity based on relations with external criteria for a clinical psychological test? 1) Academic ability 2) Job performance ratings 3) Diagnoses made by two or more clinicians 4) Intelligence quotient

Diagnoses made by two or more clinicians

If a teacher has used a test and determines that a student can write a sentence but only a simple sentence, then the teacher has made what kind of decision? 1) Instructional 2) Counseling and guidance 3) Placement 4) Diagnostic

Diagnostic

What is a challenge with diagnosing autism disorders? A) Finding licensed psychologists who have certification in diagnosing autism B) Determining whether the child's parent has anything to do with their child's slow developmental progression C) Finding a test that evaluates autism while still being valid and reliable D) Differentiating between normal and abnormal development in social interaction, communication, and interests

Differentiating between normal and abnormal development in social interaction, communication, and interests

If you correlate scores on one test that that measures a particular construct with scores from another test thought to measure a different construct, what type of construct validity evidence have you tried to demonstrate? 1) Evidence of validity based on content 2) Establish evidence for reliability 3) Discriminant evidence of validity 4) Convergent evidence of validity

Discriminant evidence of validity

What is the method for establishing evidence of validity that shows that constructs that should not be related to the construct measured by the test in question are in fact not related? 1) Convergent evidence of validity 2) Concurrent evidence of validity 3) Discriminant evidence of validity 4) Predictive evidence of validity

Discriminant evidence of validity

In a multiple-choice question, what is another name for the incorrect responses that are listed? 1) Distractors 2) Stems 3) Erroneous responses 4) All of the above

Distractors

The scores on Dr. Katz's tests always had a mean of 50 and a standard deviation of 10. What statement best explains this phenomenon? 1) Dr. Katz converts his students' raw scores to T scores. 2) The same descriptive statistics for each test is a coincidence. 3) Dr. Katz grades his tests using a normal curve. 4) Dr. Katz converts his students' raw scores to z scores.

Dr. Katz converts his students' raw scores to T scores.

Which of these is NOT an effective way to gather evidence for construct validity? 1) Establishing reliability 2) Using a multimethod-multitrait design 3) Establishing a nomological network 4) Dragnet empiricism

Dragnet empiricism

Which of these learning disabilities is related to math? 1) Dyslexia 2) Dysgraphia 3) Dyscalculia 4) None of these

Dyscalculia

One's ability to understand one's own feelings and the feelings of others and to manage one's emotions is called: 1) Behavior 2) Emotional intelligence 3) Feelings intelligence 4) Psychological intelligence

Emotional intelligence

A Likert scale is often assumed to be representative of what level of measurement? 1) Ordinal scales 2) Equal interval scales 3) Nominal scales 4) Ratio scales

Equal interval scales

According to your textbook, which one of the following test formats do students prefer because it allows them to focus on demonstrating what they have learned rather than limiting them to answering specific questions? 1) True/false 2) Multiple choice 3) Essay 4) Forced choice

Essay

If a test developer wanted to develop a question that assesses a higher-order skill such as analysis, synthesis, or evaluation, what might be the best type of test question to use? 1) True/False 2) Interview 3) Essay 4) Multiple choice

Essay

Which one of the following types of questions provides a freedom of response that facilitates assessing higher cognitive behaviors such as analysis, synthesis, and evaluation? 1) Essay 2) Forced choice 3) Multiple choice 4) True/false

Essay

Which of these is NOT an objective test format? 1) Fill-in-the blank 2) Multiple choice 3) Essay question 4) True/False questions

Essay question

Problems for which there are no clear or agreed-on moral solutions are called: 1) Ethical principles 2) Ethical standards 3) Ethical dilemmas 4) Ethical values

Ethical dilemmas

The American Psychological Association has a set of professional practice guidelines or codes know as: 1) Ethical standards 2) Ethical dilemmas 3) APA rules and regulations 4) APA values

Ethical standards

What refers to issues or practices that influence the decision-making process in terms of "doing the right thing?" 1) Anonymity 2) Certification 3) Confidentiality 4) Ethics

Ethics

The 2014 Standards for Educational and Psychological Testing recognize five sources of evidence of validity. Which one of the following is NOT one of those sources? 1) Evidence based on relations to other variables 2) Evidence based on test content 3) Evidence based on face validity 4) Evidence based on response processes

Evidence based on face validity

Which one of the following types of evidence of validity can be gathered without using correlations? 1) Concurrent evidence of validity 2) Evidence based on test content 3) Predictive evidence of validity 4) Evidence based on relations with a construct

Evidence based on test content

If a test's questions are a representative sample of a well-defined domain, what type of evidence for construct validity has been demonstrated? 1) multitrait-multimethod design 2) Evidence for validity based on content 3) Evidence of validity based on relationships with external criteria 4) Establish evidence for reliability

Evidence for validity based on content

Which one of the following statements about validity is FALSE? 1) Evidence of validity based on content involves examining the questions on the test to see if they represent the domain being measured 2) Evidence of validity based on a test's relations with a criterion can be gathered via the concurrent and predictive methods 3) Evidence of validity based on a test's relations with other variables involves correlating test scores to a measure of performance or another test 4) Evidence of validity based on content involves correlating test scores to a criterion

Evidence of validity based on content involves correlating test scores to a criterion

What is statistical procedure that would be used when researchers do not propose a formal hypothesis, but instead use the procedure to broadly identify underlying components? 1) Dragnet empiricism 2) Exploratory factor analysis 3) Multitrait-multimethod design 4) Confirmatory factor analysis

Exploratory factor analysis

Which of the following is concerned with how test takers perceive the attractiveness and appropriateness of a test? 1) Content validity 2) Construct validity 3) Face validity 4) Concurrent validity

Face validity

Which of the following is not an acceptable means of determining the TRUE validity of a test? 1) Criterion-related validity 2) Construct validity 3) Face validity 4) Content validity

Face validity

What are the underlying concepts or constructs that the tests or groups of test questions are measuring? 1) Factors 2) Predictors 3) Criteria 4) None of the above

Factors

Which of these is a method that is NOT recommended by management consultants because an employee may be placed in a category that does not accurately describe his or her performance? 1) Forced distribution 2) Behavioral anchored rating scale 3) 360˚ feedback 4) Graphic rating scales

Forced distribution

Which of these is a method that is NOT recommended by management consultants because an employee may be placed in a category that does not accurately describe his or her performance? 1) Forced distribution 2) Graphic rating scales 3) Behavioral anchored rating scale 4) 360˚ feedback

Forced distribution

1 in 5 Fortune 500 companies use: 1) Forced distribution 2) Forced ranking 3) Behaviorally anchored rating scales 4) Graphic rating scales

Forced ranking

Professor Washington asks the students in his class to take a short quiz at the end of every two-week period during the semester. He assures the students that the quiz results are for him to better understand what they are and are not learning-not for grading purposes. He is using the test results as what type of assessment? 1) Diagnostic 2) Formative 3) Placement 4) Summative

Formative

Assessments that help teachers determine what information students are and are not learning during the instructional process are called? 1) Summative assessments 2) Formative assessments 3) Diagnostic assessments 4) Placement assessments

Formative assessments

When conducting a job analysis, what area group of activities that allow the incumbent to accomplish one of the primary objectives of the job? 1) Other characteristics 2) Functions 3) Abilities 4) Skills

Functions

We can divide the process of establishing evidence of construct validity into two parts. Name them. 1) Collecting data and analyzing the data 2) Proposing a hypothesis and proving that 3) Gathering theoretical evidence and gathering psychometric evidence 4) Interviewing study participants and testing those participants

Gathering theoretical evidence and gathering psychometric evidence

When a test can be expected to produce similar results even though it has been administered at different locations, we can now say that the test is: 1) Generalizable 2) Acculturated 3) Cross-validated 4) Repeatable

Generalizable

Jane is considering using two different tests of mechanical aptitude, both of which have good evidence of validity, to help select her auto mechanics. From prior research she knows that the two tests are correlated at .93. Which of the following recommendations would be best to make to Jane? 1) Find another test that has a higher correlation than .93 with either of the other two tests 2) Give one test or the other, but not both 3) Use neither test because testing is not a good way to select mechanics 4) Give both tests to all applicants

Give one test or the other, but not both

Which of these is NOT considered to be a subjective criterion? 1) Self-reports 2) Grade point average 3) Ratings of performance 4) Teacher's recommendations

Grade point average

If a teacher uses the results of a test to assign points to students, what kind of decision is being made? 1) Administrative policy 2) Selection 3) Instructional 4) Grading

Grading

According to the text, the most popular method for rating employee performance is using: 1) Behavioral anchored rating scales 2) Graphic rating scales 3) Forced distribution 4) Forced ranking

Graphic rating scales

Which one of the following was developed by a psychologist in the years prior to World War I? 1) Performance appraisals 2) Group tests for personnel selection 3) Assessment centers 4) Job analyses

Group tests for personnel selection

What occurs when raters let their judgment on one dimension influence judgments on another dimension? 1) Halo effect 2) Severity error 3) Leniency error 4) Central tendency error

Halo effect

Which one of the following rater errors occurs when raters let their judgment on one dimension influence judgments on other dimensions? 1) Leniency error 2) Halo effect 3) Central tendency error 4) Severity error

Halo effect

Imagine you were a testing professional delivering a presentation on test taker responsibilities to a group of students preparing to take the SAT. Which one of the following would you NOT include as a test taker responsibility in your presentation? 1) Carefully read or listen to, as well as follow, test administrator instructions 2) Request testing accommodations if they have a physical condition 3) Protect test security and copyrights by not sharing test content with others 4) Have your test scored and results delivered within a reasonable period of time

Have your test scored and results delivered within a reasonable period of time

Which type of test replicates job settings as realistically as possible? 1) Integrity tests 2) Low-fidelity tests 3) Polygraph tests 4) High-fidelity tests

High-fidelity tests

A bar graph used to represent frequency data in statistics is called a: 1) Histogram 2) Normal probability distribution 3) Frequency distribution 4) Frequency graph

Histogram

If a teacher uses the results of a test to determine the pace of courses, what kind of decision is being made? 1) Instructional 2) Administrative policy 3) Selection 4) Grading

Instructional

Relating to positive practices when providing instructions to test takers, which of these statements is false? 1)Instructions on where to respond should be given 2) Instructions on how to respond should be given 3) Instructions should be written at a high reading level 4) None. All of these aspects of providing instructions should be in effect.

Instructions should be written at a high reading level

Which one of the following criteria used in educational settings is subjective? 1) Instructors' letters of recommendation 2) Number of courses completed 3) Grade point average 4) Number of dismissals or withdrawals

Instructors' letters of recommendation

Which type of test is used to measure individual attitudes and experiences toward concepts such as honesty and dependability? 1) Integrity tests 2) Intelligence tests 3) Achievement tests 4) Aptitude tests

Integrity tests

In a regression equation, the letter a represents the: 1) Intercept 2) Slope of the regression line 3) Mean distribution of Y 4) Correlation coefficient

Intercept

The place where the regression line crosses the y-axis is called the: 1) Correlation coefficient 2) Slope of the regression line 3) Intercept 4) Mean distribution of Y

Intercept

Cohen's kappa is an index of what? 1) Internal consistency 2) Interrater reliability 3) Test-retest reliability 4) Interrater agreement

Interrater agreement

Which one of the following levels of measurement involves assigning numbers to rank-order data, the distance between the numbers is judged to be equal, and there is no absolute zero point? 1) Ratio scale 2) Nominal scale 3) Interval scale 4) Ordinal scale

Interval scale

What is the line in this graph called? 1) Normal curve 2) Regression line 3) Item characteristic curve 4) Probability curve

Item characteristic curve

What is the name of the line that results when one graphs the probability of answering an item correctly with the level of ability on the construct being measured? 1) Item discrimination curve 2) Item characteristic curve 3) Item probability curve 4) Item difficulty curve

Item characteristic curve

Which one of the following statistics can be used to make decisions about retaining or discarding an item based on how well the item discriminates between high- and low-scoring test takers? 1) Item-total correlation 2) Inter-item correlation 3) Difficulty level 4) Discrimination index

Item-total correlation

The Psychological Corporation, an organization that continues today under the name Harcourt Assessment as a unit of Pearson, was organized by who? 1) Alfred Binet 2) Millicent Pond 3) J. McKeen Cattell 4) Walt Dill Scott

J. McKeen Cattell

Which one of the following is an abstract construct? 1) Job turnover 2) Job satisfaction 3) Number of customer complaints in one week 4) Days absent in a 30-day period

Job satisfaction

Jonathan wanted to estimate the internal consistency of a multiple choice test. Which one of the following would be most appropriate for him to use? 1) Spearman Brown 2) KR-20 3) Coefficient alpha 4) Cohen's kappa

KR-20

Which one of the following is used for calculating internal consistency for tests whose questions can be scored as either right or wrong? 1) Coefficient alpha 2) KR-20 formula 3) Split halves method 4) Test-retest method

KR-20 formula

Which of these is an exercise where the test taker is placed in a small group with the goal of discussing or solving a job-related issue? 1) Role-play exercise 2) In-basket exercise 3) Leaderless group discussion 4) Interviews

Leaderless group discussion

When raters give all employees better ratings than they deserve, this is called: 1) Severity error 2) Central tendency error 3) Halo effect 4) Leniency error

Leniency error

When we are interested in predicting criterion (Y') from a set of test scores, we can use what statistical procedure? 1) Linear regression 2) Predictive validity 3) Correlation 4) Multiple regression

Linear Regression

Which one of the following procedures would be best to use to answer the question, "If a student scores 85 on the academic abilities test, what course grade would we expect the student to receive?" 1) Linear regression 2) Multiple regression 3) Test of significance 4) Correlation

Linear Regression

What is the statistical process called when we use one set of test scores to predict one set of criterion scores? 1) Linear regression 2) Multiple regression 3) A test of significance 4) None of the above

Linear regression

Which of these is NOT specified in a test plan: 1) Scoring procedures 2) The operational definition of a construct 3) Literature review procedures 4) The format for the questions

Literature review procedures

What type of test simulates tasks using a written, verbal, or visual description? 1) High-fidelity tests 2) Polygraph tests 3) Low-fidelity tests 4) Integrity tests

Low-fidelity tests

Which one of the followings test is a comprehensive clinically oriented self-report test used for diagnosis and making treatment plans? 1) MMPI-2 2) Beck Depression Inventory 3) Symptom Checklist 90 4) TAT

MMPI-2

If a test developer wants to "make all items independent," then they should try to : 1) Make sure that cues for the correct response in one time are not found anywhere else on the test 2) Make sure all items are reliable and valid 3) Make sure that each item has its own factor 4) Make sure that items are homogeneous

Make sure that cues for the correct response in one time are not found anywhere else on the test

Variations or inconsistencies in the measurements yielded by a test or survey are called: Question options: 1) Variation error 2) Inconsistency error 3) Measurement error 4) None of the above

Measurement error

How do professional societies and associations decide what to put in their standards? 1) Members propose their codes and agree to abide by them only after the codes are accepted by consensus 2) The members form a committee that adapts their codes from those of other professional organizations 3) Members vote on and adopt the standards after extensive discussion and debate 4) Organizations derive their ethical codes from governmental agencies and guidelines

Members vote on and adopt the standards after extensive discussion and debate

Which of these is NOT a source of self-efficacy in Albert Bandura's theory? 1) Mental experience 2) Performance accomplishments 3) Verbal persuasion 4) Vicarious experience

Mental experience

If you were a testing professional, which one of the following would you NOT do when testing individuals with disabilities? 1) Provide special accommodations, if necessary, to individuals with disabilities 2) Modify test scores to take into account a test taker's disability 3) Rely on more than test scores when testing individuals for diagnostic purposes 4) Ensure the intended test outcomes indicate the intended attributes

Modify test scores to take into account a test taker's disability

If we wanted to predict job performance using the results of a personality test and an intelligence test, which of the following statistical techniques would be best to use? 1) Linear regression 2) Correlation 3) Coefficient of determination 4) Multiple regression

Multiple Regression

Jane is putting together a battery of tests to select employees for her company. She is already using a test of cognitive ability and a test of numerical reasoning. She is now considering adding a personality test as well. Which one of the following would be the best statistical procedure to help make a decision about whether to add the personality test? 1) Multiple regression 2) Cohen's kappa 3) Simple linear regression 4) Correlation

Multiple regression

What is the statistical process called when we use more than one set of tests scores to predict one set of criterion scores? 1) Linear regression 2) Multiple regression 3) A test of significance 4) None of the above

Multiple regression

Which one of the following would we use in the denominator of the standard deviation formula when calculating the standard deviation of a population of test scores? 1) n - 1 2) N -1 3) n + 1 4) N

When choosing which items to put in the final draft of the test, which of the following should NOT be considered? 1) Evidence of the item's validity 2)Interritem correlation 3) Item difficulty 4) None. All of these should be considered

None. All of these should be considered

Which of these is NOT considered to be a step in helping to demonstrate evidence of validity based on test content during test development? 1) Defining the testing universe 2) Developing test specifications 3) Constructing test questions 4) None. They all are steps involved in the process.

None. They all are steps involved in the process.

If you were to receive the results of a test and you saw that you were in the 85th percentile, what kind of test did you take? 1) Criterion-referenced 2) Percentile test 3) Authentic test 4) Norm-referenced

Norm-referenced

What type of test would you design if the items were constructed to distinguish high from low performers? 1) Norm-referenced 2) Criterion-referenced 3) Portfolio assessment 4) Authentic assessment

Norm-referenced

For which test are test takers more likely to earn the "average" score, no matter how well they perform? 1) Authentic test 2) Criterion-referenced test 3) Average test 4) Norm-referenced test

Norm-referenced test

An academic achievement test where the test taker who achieves the highest score will receive a scholarship is an example of which approach to test interpretation? 1) Normative 2) Criterion-related 3) Construct 4) Predictive

Normative

Which one of the following was a criterion for the validity study for the Suicide Probability Scale conducted at Father Flanagan's Boys Home in Nebraska? 1) Number of self-destructive behaviors 2) Test scores on the Suicide Probability Scale 3) Number of prior attempts of suicides 4) Number of suicides

Number of self-destructive behaviors

Psychological assessment involves what two types of components? 1) Group and individual 2) Strong and weak 3) objective and subjective 4) Structured and unstructured

Objective and subjective

Juanita asks children to line up in order of their height, placing the shortest child first and the tallest child last. She then assigns the first child "1," the second a "2," and so on. What type of scale is Juanita using? 1) Equal interval scale 2) Ratio scale 3) Ordinal scale 4) Nominal scale

Ordinal scale

If a teacher asked 20 children to line up in order of height, then assigned numbers to the children based on how tall they were (e.g., the shortest child receives a "1" all the way up to the tallest child, who receives a "20"), what level of measurement was used? 1) Ordinal scales 2) Equal interval scales 3) Ratio scales 4) Nominal scales

Ordinal scales

A human resource professional would most likely use a psychological test in a/an: 1) Organizational setting 2) Educational setting 3) Clinical setting 4) None of the above

Organizational setting

In which setting do Industrial/Organizational practitioners use tests to measure applicants' knowledge, skills, abilities, and personalities? 1) Educational settings 2) Clinical settings 3) Organizational settings 4) None of the above

Organizational settings

If you are rated by your colleagues or equals, then you are being rated by your: 1) Supervisors 2) Direct reports 3) Peers 4) Customers

Peers

Which of these is NOT a linear transformation? 1) Standard deviation units 2) Percentiles 3) Percentages 4) Z-scores

Percentiles

An interitem correlation matrix would be comprised of what type of coefficients? 1) Phi coefficients 2) Point-biserial coefficient 3) Pearson product-moment correlation coefficient 4) None of the above

Phi coefficients

Which one of the following two general categories do assessments for integrity fall into? 1) Polygraphs and interviews 2) Assessment centers and interviews 3) Physiological measures and paper-and-pencil tests 4) Simulations and polygraphs

Physiological measures and paper-and-pencil tests

Assessments that are used to determine whether students have the skills or knowledge necessary to understand new material and to determine how much information students already know about new material are called? 1) Placement assessments 2) Diagnostic assessments 3) Formative assessments 4) Summative assessments

Placement assessments

What do we call tests used to determine whether students have the skills or knowledge necessary to understand new material and how much of the new material students already know? 1) Formative assessments 2) Diagnostic assessments 3) Placement assessments 4) Summative assessments

Placement assessments

In the equation Y' = a + bX, what does "Y'" represent? 1) Slope 2) Intercept 3) Individual's score 4) Predicted criterion score

Predicted criterion score

What is a method for establishing evidence of validity based on a test's relationships with other variables that shows a relationship between test scores obtained at one point in time and a criterion measured at a later point in time? 1) Convergent evidence of validity 2) Concurrent evidence of validity 3) Divergent evidence of validity 4) Predictive evidence of validity

Predictive evidence of validity

What is the method for establishing evidence of validity based on a test's relationships with other variables that shows a relationship between the test scores at one point in time and a criterion measured at a later point in time? 1) Divergent evidence of validity 2) Convergent evidence of validity 3) Predictive evidence of validity 4) Concurrent evidence of validity

Predictive evidence of validity

Which one of the following is evidence that a test can predict future behavior? 1) Content-based evidence of validity 2) Predictive evidence of validity 3) Concurrent evidence of validity 4) Face evidence of validity

Predictive evidence of validity

When it is important to show a relationship between test scores and a future behavior, researchers establish evidence of validity using what method? 1) Predictive method 2) Content method 3) Test-retest method 4) Concurrent method

Predictive method

What is it referred to as when a patient is displaying signs of dementia or Alzheimer's disease, but actually only has depression? A) Major depressive disorder B) A stroke C) Pseudodementia D) Parkinson's disease

Pseudodementia

Which of these methods can be used to establish face validity? 1) Qualitative Analysis 2) Acculturation 3) Quantitative Analysis 4) Cross-Validation

Qualitative Analysis

When test developers ask test takers to complete a questionnaire about how they viewed the test and how they answered the question, this is called: 1) Acculturation 2) Quantitative analysis 3) Qualitative analysis 4) Cross-validation

Qualitative analysis

The coefficient of multiple determination is also known as: 1) The square root of r 2) r2 3) r3 4) R2

Imagine you were a testing professional delivering a presentation on test taker rights to a group of students preparing to take the SAT. Which one of the following would you NOT indicate as a test taker right? 1) Have your test administered and your test results interpreted by appropriately trained individuals who follow professional codes of ethics 2) Receive a written or oral explanation of your test results within a reasonable amount of time after testing and in commonly understood terms 3) Respectfully request a change in the testing environment or conditions if you believe the conditions will negatively affect your results 4) Be treated with courtesy, respect, and impartiality, regardless of your age, disability, ethnicity, gender, national origin, religion, sexual orientation, or other personal characteristics

Respectfully request a change in the testing environment or conditions if you believe the conditions will negatively affect your results

When a validity coefficient is lowered because some people are dropped from a validity study, it may be possible that what has occurred? 1) Restriction of range 2) Predictive evidence of validity 3) Concurrent evidence of validity 4) None of the above

Restriction of range

What right includes the concept that individuals are entitled to full explanations of why they are being tested, how the test data will be used, and what the test results mean? 1) Right to know and understand test results 2) Right to privacy 3) Right to informed consent 4) Right to confidentiality

Right to informed consent

What is the right that a participant is entitled to a nontechnical explanation of the scores of all tests? 1) Right to protection from stigma 2) Right to know and understand results 3) Right to privacy 4) Right to informed consent

Right to know and understand results

If you wanted to learn more about the personal and legal responsibilities of test takers, what two resources would you benefit most from reading? 1) Standards for Educational and Psychological Testing and Code of Fair Testing Practices in Education 2) Rights and Responsibilities of Test Takers and Code of Fair Testing Practices in Education 3) Standards for Educational and Psychological Testing and Ethical Principles of Psychologists and Code of Conduct 4) Rights and Responsibilities of Test Takers and Standards for Educational and Psychological Testing

Rights and Responsibilities of Test Takers and Standards for Educational and Psychological Testing

In an effort to improve the credibility of psychological testing and establish psychology as a true scientific movement, who promoted the use of mental testing during World War I? 1) Walter Lippman 2) Robert Yerkes 3) Thomas Thorndike 4) Sigmund Freud

Robert Yerkes

Who developed the Army Alpha and Beta tests? 1) Asa Hilliard 2) Robert Yerkes 3) Walter Lippmann 4) Arthur Jensen

Robert Yerkes

Which exercise is one where the test taker interacts with one or more persons (trained professionals) to solve problems encountered on the job? 1) Leaderless group discussion 2) In-basket exercise 3) Interviews 4) Role-play exercise

Role-play exercise

Which one of the following tests measures personality by exploring the unconscious? 1) Rorschach Inkblot Test 2) Hogan Personality Inventory 3) Minnesota Multiphasic Personality Inventory 4) Myers-Briggs Type Indicator

Rorschach Inkblot Test

Today, what is the most widely used standardized college admission test? Question options: 1) SAT 2) ACT 3) DAT 4) LSAT

SAT

Scorer reliability and agreement concerns the consistency of what? 1) Test scores 2) Different test forms 3) Scorer judgments 4) Test takers' performance

Scorer judgments

Albert Bandura suggested the construct that is called: 1) Beauty 2) Intelligence 3) Self-efficacy 4) Self-esteem

Self-efficacy

What type of question is the following: Please circle the number that best represents how you are currently feeling? Happy 1 2 3 4 5 Sad 1) Ranking 2) Semantic differential 3) Free choice 4) Forced choice

Semantic differential

When raters give all employees worse ratings than they deserve, this is called: 1) Central tendency error 2) Leniency error 3) Halo effect 4) Severity error

Severity error

In cross-validation, what is the phenomenon when the correlation decreases due to chance factors? 1) Curtailment 2) Attrition 3) Shrinkage 4) Reduction

Shrinkage

Which one of the following guidelines did your textbook authors NOT suggest when printing a survey? 1) Stapling in the upper left corner for longer surveys 2) Spiral binding or printing in booklet format for extremely long surveys 3) Single spacing, eliminating the cover page 4) Printing on two sides of a sheet of paper for surveys less than three pages

Single spacing, eliminating the cover page

In the equation Y' = a + bX, what does "b" represent? 1) Slope 2) Predicted criterion score 3) Intercept 4) Individual's score

Slope

In a regression equation, the letter b represents the: 1) Correlation coefficient 2) Intercept 3) Mean distribution of Y 4) Slope of the regression line

Slope of the regression line

The expected change in one unity of Y for every change in X is called the: 1) Slope of the regression line 2) Correlation coefficient 3) Intercept 4) Mean distribution of Y

Slope of the regression line

What is the tendency of some test takers to provide or choose answers that are perceived to be most acceptable or that present the test taker in a favorable light called? 1) Social desirability 2) Social acquiescence 3) Social faking 4) Social responding

Social desirability

Which response set is a problem where some test takers want to choose answers that present themselves in a favorable light? 1) Random responding 2) Social desirability 3) Acquiescence 4) Faking

Social desirability

What type of tests' primary purpose concerns the intersection between mental health and the law? A) Specialized Forensic Tests B) Neuropsychological Tests C) Projective Personality Tests D) Cognitive Tests

Specialized Forensic Tests

As the reliability/precision of a test decreases, which one of the following items increases? 1) Difficulty 2) Standard deviation 3) Standard error of measurement 4) Internal consistency

Standard error of measurement

What is an index of how much an individual's test score is likely to differ from the individual's true score? 1) Confidence interval 2) Reliability index 3) Standard error of measurement 4) Cohen's kappa

Standard error of measurement

Which one of the following resources with professional practice standards would you need to purchase because the resource is not freely available on the Internet? 1) Principles for the Validation and Use of Personnel Selection Procedures 2) Ethical Principles of Psychologists and Code of Conduct 3) Standards for Educational and Psychological Testing 4) Code of Fair Testing Practices in Education

Standards for Educational and Psychological Testing

Most people make distinctions between objective and subjective tests based on their: 1) Content 2) Scoring procedures 3) Administration and format 4) Structure

Structure

Race, sex, and age could be considered to be what type of norms? 1) Biased norms 2) Differential norms 3) Discrimination norms 4) Subgroup norms

Subgroup norms

Questions that have no apparent relation to the test purpose or criterion are called: 1) Questionable items 2) Unrelated items 3) Subtle questions 4) Empirical questions

Subtle questions

How is the range of a distribution calculated? Question options: 1) Substract the lowest score in the distribution from the highest score 2) Average all of the scores in a distribution and multiply it by the number of scores in the distribution 3) Average all of the scores in a distribution 4) Subtract the highest score in the distribution from the lowest score

Subtract the lowest score in the distribution from the highest score

When conducting a job analysis, actions taken by the job incumbent that accomplish a job function are called: 1) Other characteristics 2) Functions 3) Skills 4) Tasks

Tasks

When conducting a job analysis, actions taken by the job incumbent that accomplish a job function are called: 1)Tasks 2)Skills 3) Other characteristics 4) Functions

Tasks

If a room is too hot, this may introduce error because of what kind of factor: 1) Test administration 2) Homogeneity 3) Scoring 4) Test length

Test administration

What test is designed to distinguish between legitimate memory impairment and memory impairment that is being faked? A) Test of Memory Malingering B) MMPI-2 C) Thematic Apperception Test (TAT) D) WISC-IV

Test of Memory Malingering

Lack of which one of the following can contribute to invalidating test results and result in psychological injury to individuals? 1) Confidentiality 2) Self-determination 3) Anonymity 4) Test security

Test security

What do we call a documented plan containing details about a test's content? 1) Content area 2) Test specifications 3) Testing universe 4) Test format

Test specifications

What is a documented plan containing details about a test's content? 1) Practical test 2) Content validity 3) Test specifications 4) Testing universe

Test specifications

The person who responds to test questions or whose behavior is being measured is called a: 1) Psychologist 2) Test taker 3) Test purchaser 4) Test user

Test taker

If a test developer gives the same test to the same group of test takers on two different occasions, then correlates the scores from the first and second administrations, what kind of reliability is the test developer trying to estimate? 1) Intrascorer reliability 2) Test-retest 3) Internal consistency 4) Scorer reliability

Test-retest

The increasing diversity of the population in the United States makes which one of the following processes important? 1) Using established norms to interpret test scores of minority test takers 2) Increasing the number of questions that require specific knowledge of various cultural minorities 3) Declining to test those who do not speak Standard English 4) Testing assessment tools to ensure they are valid for the minority population who will be the test takers

Testing assessment tools to ensure they are valid for the minority population who will be the test takers

The circumstances under which the test is administered is called the: 1) Testing surroundings 2) Testing location 3) Testing setting 4) Testing environment

Testing environment

If a test administrator uses the results of a test designed to measure student learning to assess teaching quality, which one of the following American Psychological Association recommendations for ensuring meaningful assessment is the test administrator violating? 1) No subgroup of students should be disadvantaged by the test or test-taking conditions 2) No important decisions should be made based on one test score 3) Tests used should have evidence of validity or provide useful information for the intended purpose 4) Students must have a fair opportunity to learn the curriculum they will be tested on

Tests used should have evidence of validity or provide useful information for the intended purpose

What are criterion-referenced tests? 1) Tests with evidence of validity based on relations with other variables 2) Tests where scores are interpreted by comparing a score with an objectively stated standard of achievement 3) Tests where scores are interpreted by comparing a score with a large group of test takers 4) Tests that distinguish between high and low achievers

Tests where scores are interpreted by comparing a score with an objectively stated standard of achievement

If you wanted to contact the publisher of the Uniform Guidelines on Employee Selection Procedures, whom would you contact? 1) The U.S. Human Testing Rights Association 2) The U.S. Equal Employment Opportunity Commission 3) The U.S. Department of Labor 4) The U.S. Bureau of Labor Statistics

The U.S. Equal Employment Opportunity Commission

The coefficient of determination indicates what? 1) How valid the test is 2) The amount of variance that the test and the criterion share 3) How confident we can be about the validity coefficient 4) The amount of test variance that is reliable

The amount of variance that the test and the criterion share

What is a polygraph? 1) The best-known physiological measure associated with evaluating truthfulness 2) The graph that shows the plots of multiple personality assessments for one test taker 3) A valid measure of truthfulness and honesty used for choosing job applicants for hire 4) A valid neuropsychological measure of truthfulness and honesty

The best-known physiological measure associated with evaluating truthfulness

Commonly, test practitioners use item-total correlations as an alternative to what? 1) Qualitative analysis 2) Acculturation 3) The discrimination index 4) Phi-coefficients

The discrimination index

How did the APA traditionally define construct validity? 1) The extent to which the test measures a theoretical construct 2) The extent to which the test represents all parts of the test domain 3) The strength of the relationship between a test and a criterion 4) The strength of the relationship between the test and itself

The extent to which the test measures a theoretical construct

The percentage of test takers who respond correctly is known as: 1) The discrimination index 2) The phi coefficient 3) The item difficulty 4) Acculturation

The item difficulty

What are raw scores? 1) Points on an interval or ratio scale. 2) Numbers assigned to a norm group. 3) The correct answer to a test question. 4) The most basic scores calculated from a test.

The most basic scores calculated from a test.

According to the text, which of these is NOT a myth about pre-employment testing? 1) Pre-employment testing is too time-consuming 2) Pre-employment tests cost too much 3) Pre-employment testing does not work 4) The organization will not be sued

The organization will not be sued

A large group of people takes a test at one point in time, waits for a period of time, and then their scores are correlated with some aspect of performance. What is the original test called that the large group of people took 1) The criterion 2) The reliability coefficient 3) The predictor 4) The validity coefficient

The predictor

Why is the following survey question inappropriate? "Did you enjoy watching and listening to the CD of the concert?" 1) The question is not in context 2) The question is not in correct syntax 3) The question is not specific about which concert 4) The question is a double-barreled question

The question is a double-barreled question

Which of these provides a quantitative estimate of a test's consistency of measurement? 1) The predictor 2) The reliability coefficient 3) The criterion 4) The validity coefficient

The reliability coefficient

If you were delivering a presentation about the American Psychological Association (APA)'s Code of Fair Testing Practices in Education, which one of the following facts would you NOT include in your presentation? 1) The standards directly apply to all types of tests in education, including teacher-made tests 2) The standards in the Code make clear the obligations of educational test developers and users 3) While consistent with the 1999 Standards, it is not consistent with the 2014 Standards 4) The APA originally published the Code in 1988 and revised it in 2004

The standards directly apply to all types of tests in education, including teacher-made tests

Why might some psychological tests not require strict procedures for test security? 1) The tests are developed by instructors to use in the classroom 2) The test developer intends the tests to be used for research purposes only 3) The tests are exempt from the APA's ethical principles 4) The tests are projective tests and do not have one correct answer

The test developer intends the tests to be used for research purposes only

Which of these does NOT need to be included in a test manual? 1) Validity information 2) The test itself 3) Norm information 4) Reliability information

The test itself

According to the Standards for Educational and Psychological Testing, who should provide test users, before purchasing a psychological test, the information needed to evaluate the psychometric characteristics of the test? 1) The test publisher 2) The American Psychological Association 3) The test developer 4) The test taker

The test publisher

Which statement regarding the testing environment is false? 1) The environment should be standardized 2) The testing environment has no effect on test takers 3) A proper test environment decreases variation 4) Instructions for administering the test should be specific and concise

The testing environment has no effect on test takers

According to Ghiselli, which of the following is NOT a condition that must be met to claim that two forms of the same test are comparable? 1) The tests should yield different distributions of scores 2) The scores from each test must correlate equally well with the external criteria 3) The individual rank-ordering of scores should be identical 4) The tests must yield identical scores

The tests should yield different distributions of scores

Which of the following is TRUE about objective criteria? 1) They are often based on a person's judgment 2) Their scope is often quite narrow 3) They are often expressed as ratings 4) They are often based on personal experience

Their scope is often quite narrow

Relating to objective criteria, which of these statements is false? Question options: 1) They often contain a narrow scope 2) They are observable and measureable 3) They are based on a person's judgment 4) If well-defined, these criteria usually contain less error

They are based on a person's judgment

Which one of the following is TRUE about authentic assessments? 1) They focus more on selecting a response than traditional assessment 2) They are more contrived than traditional assessment 3) They are more student-structured than traditional assessment 4) They are more focused on recall and recognition than traditional assessment

They are more student-structured than traditional assessment

Relating to subjective criteria, which of these statements is false? 1) They are observable and measurable 2) Ratings may be biased 3) They may take into account non-measureable factors, such as motivation 4) They are based on a person's judgment

They are observable and measurable

Which one of the following statements is TRUE about nonstandardized tests? 1) They provide more detailed test administration and test taking instructions than standardized tests 2) They have multiple standardization samples and norms 3) They are usually constructed by a teacher or trainer in a less formal manner for a single administration 4) They are projective tests administered in a less formal manner for multiple administrations

They are usually constructed by a teacher or trainer in a less formal manner for a single administration

Which of these statements related to behaviorally anchored rating scales is false? 1) They are difficult to develop 2) They require concentrated rater training due to their complexity 3) They do not provide more accurate ratings of employee behavior than do traditionally anchored rating scales 4) They are time consuming to develop

They do not provide more accurate ratings of employee behavior than do traditionally anchored rating scales

Which one of the following statements is FALSE about aptitude tests? 1) They assess an individual's potential for performing a new job or task 2) They measure the product of cumulative life experiences 3) They measure an individual's progress in learning a skill or task 4) They measure the maximum performance we can expect from individuals

They measure an individual's progress in learning a skill or task

All test users have the ethical responsibility to abide by what professional practice standards? 1) Those that apply to all professions and at their discretion those within their profession 2) Those that apply to all professions 3) Those within their profession and those that apply to all professions 4) Those within their profession and at their discretion those that apply to all professions

Those within their profession and those that apply to all professions

In 1964, which act that was passed by Congress stimulated great interest in fair employment practices? 1) Title VII of the Civil Rights Act 2) Age Discrimination and Employment Act 3) American Disabilities Act 4) None of the above

Title VII of the Civil Rights Act

What is the purpose of behavior coding? 1) To improve communication among multiple researchers when they interview groups 2) To classify respondents based on their behaviors 3) To inform respondents of their rights and responsibilities 4) To identify the types and frequencies of interviewer and respondent behaviors

To identify the types and frequencies of interviewer and respondent behaviors

According to the text, the Hogan Personality Inventory uses what kind of test item format? 1) Forced choice 2) True/False 3) Multiple choice 4) Essay

True/False

If you were delivering a presentation about the Standards for Educational and Psychological Testing, which one of the following facts would you NOT include in your presentation? 1) Two organizations, the American Educational Research Association and the American Psychological Association, collaborate to publish the Standards 2) The Standards are available as a 230-page book, organized into three parts: Foundations, Operations, and Testing Applications 3) Unless there is a very sound professional reason, anyone who develops and/or uses standardized psychological tests and assessments should follow the Standards 4) The Standards provide those who develop and use standardized psychological tests and assessments with criteria for evaluating tests and testing practices

Two organizations, the American Educational Research Association and the American Psychological Association, collaborate to publish the Standards.

Which government entity developed the General Aptitude Test Battery? 1) U.S. Employment Service 2) Equal Employment Opportunity Commission 3) The United States Test Development Service 4) Office of Personnel Management

U.S. Employment Service

Which one of the following is NOT a test taker right? 1) Be informed of your rights and responsibilities as a test taker 2) Have your test results kept confidential to the extent allowed by law 3) Be treated with courtesy, respect, and impartiality 4) Understand and accept the consequences of not taking a test

Understand and accept the consequences of not taking a test

When test developers work at all stages of test development to ensure that the only factor that will influence a test score is the individual's standing on the construct that the test is designed to measure, they are demonstrating concern for what? 1) Reliability/precision 2) Criterion contamination 3) Content validity 4) Universal design

Universal Design

Which one of the following statements is FALSE? 1) Criteria must be representative of the events they are supposed to measure 2) Unlike tests, reliability or consistency is unimportant for criteria 3) Evidence for validity of a criterion can be gathered using a similar process to that used when establishing evidence of validity of a test based on content 4) A criterion is valid to the extent that it matches or represents the events in question

Unlike tests, reliability or consistency is unimportant for criteria

Regarding test development, which of these is NOT a suggested best practice based on the test development literature: 1) Be sure that each item is based on an important learning objective or topic 2) Use colloquial language 3) Make all items independent 4) Write each item in a clear and direct manner

Use colloquial language

In order to detect and decrease rater bias in essays, and also create scorer reliability, it may be wise to: 1) Prepare and answer key 2) Consider the time necessary to generate a response 3) Use one person to score the essay 4) Use multiple independent scorers

Use multiple independent scorers

Why was Florida's IQ cutoff score of 70 ruled unconstitutional for determining intellectual disability? A) Using IQ scores is not a valid measure to use when determining intellectual disabilities B) IQ should not be considered in the death penalty process C) Using a fixed IQ score of 70 is too rigid D) Cutoff scores are never appropriate to use with any test

Using a fixed IQ score of 70 is too rigid

Evidence that the interpretations that are being made from the scores on a test are appropriate for their intended purpose is called: 1) Validity 2) Reliability 3) A construct 4) Criteria

Validity

A statistic used to infer the strength of the evidence of validity that the test scores might demonstrate in predicting job performance is called the: 1) Reliability coefficient 2) Validity coefficient 3) Predictor 4) Criterion

Validity coefficient

Which tells us whether individual scores tend to be similar to or substantially different from the mean? 1) Variance 2) Correlation coefficient 3) Mode 4) Range

Variance

What was one reason the Educational Testing Service gave for the decline in SAT scores of graduating high school seniors during the 1960s and 1970s? 1) Changing attitudes of parents leading to lack of confidence in education 2) Increasing numbers of minority students taking the SAT 3) Student perceptions that scoring well on standardized tests was not "cool." 4) Weaker academic records and increased diversity of student backgrounds

Weaker academic records and increased diversity of student backgrounds

Which test was developed because of the dispute over the definition of intelligence? 1) Stanford-Binet Intelligence Scales 2) Binet-Simon Scale 3) Wechsler-Bellevue Intelligence Scale 4) Personal Data Sheet

Wechsler-Bellevue Intelligence Scale

Which of the following statements is FALSE? 1) It is inappropriate for a publisher to simply state that their test is valid 2) A psychological test can be reliable, but not valid 3) When a test is reliable, it is automatically valid as well 4) Reliability is a characteristic of the test itself; validity depends on how the test is used

When a test is reliable, it is automatically valid as well

Relating to pilot testing, which of these statements is false? 1) During a pilot test, one should choose a sample of people who resemble or are a part of the test's target audience 2) The purpose of a pilot test is to study how well the test performs 3) When setting up pilot studies, developers follow the Standards for Educational and Pilot Testing 4) The pilot test setting should mirror the planned test setting

When setting up pilot studies, developers follow the Standards for Educational and Pilot Testing

R2 Change is a statistic from a multiple regression which is useful in determining what? 1) The place where a regression line crosses the Y axis in a regression 2) Whether adding an additional test to a test battery is justified 3) The correlation coefficient between two tests 4) The amount of measurement error present in a test score

Whether adding an additional test to a test battery is justified

Which one of the following is TRUE about certification and licensure? 1) Certification and licensure do not require passing an exam 2) Certification and licensure are credentials related to protecting the health and safety of the public 3) While state-level governments establish licensure, certification is not regulated 4) While licensure is mandatory for some professions, certification is voluntary

While licensure is mandatory for some professions, certification is voluntary

What practice is being used if a test user administers the same test to every test taker, but scores the test differently according to the race of the test taker? 1) Individual norming 2) Independent group norming 3) Within-group norming 4) Non-discriminatory norming

Within-group norming

When a calculated correlation is greater than the critical value shown in the table, we can infer that the probability of finding our correlation by chance is less than five chances out of 100. Therefore, we assume that there is 1) no relationship, and we refer to the correlation coefficient as "significant." 2) a true relationship, and we refer to the correlation coefficient as "significant." 3) no relationship, and we refer to the correlation coefficient as "not significant." 4) a true relationship, and we refer to the correlation coefficient as "not significant."

a true relationship, and we refer to the correlation coefficient as "significant."

Incremental validity addresses the question of whether 1) adding another test to a test battery already in use would improve its predictive capability 2) the reliability of the combined test battery would be improved by the addition of another test 3) the client would believe that the additional expense of another test would be justified 4) the addition of another test would make the client feel that his condition was getting worse

adding another test to a test battery already in use would improve its predictive capability

Which of these is an assumption about psychological tests? 1) Individuals understand test items similarly 2) Individuals will report their thoughts and feelings honestly 3) Individuals will report accurately about themselves 4) All of the above

all of the above

When psychological tests are used in clinical settings the results are to diagnose patients A) used in future research projects that help B) always combined with information that is gathered in other ways C) solely relied upon D) distributed to other psychologists who help

always combined with information that is gathered in other ways

The U.S. Department of Defense uses ASVAB scores to determine 1) an individual's qualifications for certain military occupations 2) enlisted personnel's developmental coaching needs 3) enlisted personnel's promotion potential 4) an individual's likelihood to succeed in the military

an individual's qualifications for certain military occupations

If you administered a test without obtaining information on the identity of the test takers, you would be following the practice of 1) anonymity 2) protection from stigma 3) confidentiality 4) informed consent

anonymity

Which one of the following indicates the expected change in Y for every one unit change in Xi, when all the other predictors in the equation do not vary or remain constant? 1) df 2) R 3) X 4) b

A subjective criterion is 1) observable and measurable 2) easily calculated leaving little chance of disagreement 3) based on a person's judgment 4) the number of days of absence from work in a year

based on a person's judgment

A test user is anyone who participates in 1) administering, interpreting, or using test results 2) using test results to make important decisions 3) buying, administering, interpreting, or using test results 4) designing a test or piloting the test

buying, administering, interpreting, or using test results

During World War II, who was the chief psychologist of the War Department, and supervised the development of the Army General Classification Test used to place U.S. Army recruits? 1) Hugo Münsterberg 2) Walter Dill Scott 3) Walter Bingham 4) Millicent Pond

chapter 14

Which of these methods is a graph for rating employees' performance that represents a dimension, such as quality or quantity of work, divided into categories defined by numbers, words, or both? 1) Behaviorally anchored rating scales 2) Forced distribution 3) Graphic rating scales 4) Forced ranking

chapter 14

Evidence that a test relates to other tests and behaviors as predicted by a theory is referred to as: Question options: 1) Face validity 2) Criterion-related validity 3) Content validity 4) Construct validity

chapter 8

Meta-analysis is a statistical technique used to 1) estimate the reliability of a test 2) determine the proper cut score of a test 3) combine the results of multiple studies into a single result 4) reduce the amount of data necessary to conduct a validation study

combine the results of multiple studies into a single result

Watson, Detra, Fox, Ewing, Gearhart, and DeMotts (1996) administered two self-report alcoholism measures to 118 volunteers recruited from a Veterans Administration Medical Center. At approximately the same time, they asked the volunteers to complete the Diagnostic Interview Schedule. The study is an example of 1) multiple regression study 2) predictive evidence of validity study 3) restriction-of-range study 4) concurrent evidence of validity study

concurrent evidence of validity study

In organizational settings, researchers often use _____ studies as alternatives to _____ studies, because employers do not want to hire applicants with low test scores. 1) predictive; content 2) concurrent; predictive 3) predictive; concurrent 4) concurrent; content

concurrent; predictive

If you assure individuals that all personal information they disclose will be kept private, you are ensuring their 1) anonymity 2) confidentiality 3) informed consent 4) protection from stigma

confidentiality

When individuals are promised that all personal information they disclose will be kept private and will not be disclosed without their explicit permission, they are being assured of 1) confidentiality 2) anonymity 3) ethics 4) integrity

confidentiality

Defining the testing universe is an important step for ensuring a test has evidence of validity based on its 1) constructs 2) content 3) reliability 4) relations with variables

content

The process of competency modeling would most likely be used during test development to gather evidence for validity based on a test's 1) response processes 2) relations with a criteria 3) constructs 4) content

content

If test scores correlate with measures of constructs that an underlying theory says are related, then we say the test has 1) convergent evidence of validity 2) evidence of validity based on content 3) discriminant evidence of validity 4) concurrent evidence of validity

convergent evidence of validity

If the criterion measures more dimensions than those measured by the test, we say there is 1) criterion enhancement 2) criterion contamination 3) criterion failure 4) criterion success

criterion contamination

When subjective test formats are used and the test universe covers a wide range of topics, evidence of validity based on the test's content usually 1) increases significantly 2) increases somewhat 3) remains stable 4) decreases

decreases

When educational administrators use test scores for accountability purposes, they are using test scores to 1) inform public policy and evaluation programs 2) determine positive or negative consequences for individuals 3) substantiate initiating, continuing, modifying, terminating, or expanding programs 4) provide benchmark data and monitor student progress over time

determine positive or negative consequences for individuals

Dividing the number of persons who answered correctly by the total number of persons who responded to the question is a measure of an item's 1) discrimination index 2) phi coefficient 3) difficulty 4) bias

difficulty

The percentage of test takers who respond correctly to a test item is a measure of the item's 1) difficulty 2) bias 3) ability to discriminate 4) item-total correlation

difficulty

If the correlation between social studies test scores and statistics test scores is zero (or not statistically significant), then the social studies test has demonstrated 1) discriminant evidence of validity 2) convergent evidence of validity 3) evidence of face validity 4) concurrent evidence of validity

discriminant evidence of validity

When test scores do not correlate with unrelated constructs, there is 1) evidence of face validity 2) convergent evidence of validity 3) evidence of validity based on relations with external criteria 4) discriminant evidence of validity

discriminant evidence of validity

According to your textbook, one of the largest and most deeply rooted controversies about psychological testing pertains to 1) educating the public 2) errors and bias in scoring 3) use of projective testing 4) discrimination against protected classes

discrimination against protected classes

Norm-referenced tests allow test users to 1) assume evidence of reliability and validity for the test scores 2) distinguish low performers from high performers 3) assess students who match the norm group 4) hold all students to high standards of achievement

distinguish low performers from high performers

According to your textbook, activists who believe that intelligence is determined primarily by environment have worked for years to _____ what they consider to be the unfair use of such tests. 1) enhance 2) eliminate 3) protest 4) encourage

eliminate

When psychologists use the predictive method to establish evidence of validity for a pre-employment test, they ask the employer to Question 1) hire all the applicants who pass the test 2) hire all the applicants who take the test 3) postpone hiring until the criterion is measured 4) use the test on all people employed by the company in a similar job

hire all the applicants who take the test

Estimating reliability/precision using methods of internal consistency is appropriate only for tests that are 1) unstandardized 2) standardized 3) heterogeneous 4) homogeneous

homogeneous

A test's purpose includes not only what the test will measure but also 1) the test's reliability and validity 2) the type of setting in which the test will be used 3) the theory the test relies on 4) how the test users will use the test scores

how the test users will use the test scores

Symptom checklists and self-report tests which clients complete themselves are most likely to be used 1) when a person wants to make a diagnosis about their own condition 2) in the early stages of mental health treatment to evaluate the level of distress 3) by legal authorities as part of a court proceeding 4) during a person's visit to an emergency room for a mental health complaint

in the early stages of mental health treatment to evaluate the level of distress

When someone is interested in determining whether adding an additional test to an existing battery of tests makes good sense, we can say they are most likely interested in the 1) reliability/precision of the test 2) incremental validity of the test 3) face validity of the test 4) content validity of the test

incremental validity of the test

In the equation Y' = a + bX, what does "a" represent? 1) Predicted criterion score 2) Intercept 3) Individual's score 4) Slope

intercept

The Fahrenheit and Celsius temperature scales are both 1) ordinal scales. 2) ratio scales. 3) interval scales. 4) nominal scales.

interval scales.

One advantage of empirically based tests is that 1) they have strong validity coefficients 2) their internal reliability is high 3) test takers prefer them to other types of tests 4) it is more difficult for test takers to fake responses

it is more difficult for test takers to fake responses

Evidence of validity for the structured interview is established by developing questions using 1) information from professional organizations 2) tests that measure skills required by the job 3) standardized tests 4) job analysis or competency modeling

job analysis or competency modeling

A "p" value < .01 tells the test user that the likelihood that the relationship being measured was found by chance was 1) less than 5 chances out of 100 2) significant 3) not significant 4) less than 1 chance out of 100

less than 1 chance out of 100

The item characteristic curve can provide a picture of an item's 1) distribution of responses 2) inter-item and item-total correlation 3) level of difficulty and discrimination 4) reliability and validity

level of difficulty and discrimination

Y' = a + bX is the 1) predictive validity equation 2) correlation equation 3) multiple regression equation 4) linear regression equation

linear regression equation

Having a restricted range of test scores means that the observed validity coefficient is likely to be 1) higher 2) lower 3) unaffected 4) unpredictable

lower

The validity coefficients that result when a test is cross-validated are usually expected to be 1) the same as the validity coefficients found in the original validation study 2) lower than the validity coefficients found in the original validation study 3) higher than the validity coefficients found in the original validation study 4) unrelated to the validity coefficients found in the original validation study

lower than the validity coefficients found in the original validation study

Evidence of construct validity is most appropriate when a test measures an abstract construct such as 1) marital satisfaction 2) English grammar 3) mathematical computation 4) the IRS tax code

marital satisfaction

Studies of the PREParation for Marriage Questionnaire (PREP-M) described in your text suggest that the questionnaire predicts marital satisfaction and stability 1) poorly 2) weakly to moderately 3) nearly perfectly 4) moderately to strongly

moderately to strongly

Whereas _____ tests measure broad knowledge and skills that come from academic textbooks and teacher syllabi and that are based on the judgments of curriculum subject matter experts, _____ tests measure narrow knowledge and skills taught in a specific educational curriculum or unit of instruction, as defined by specific instructional objectives. 1) norm-referenced; portfolio 2) portfolio; criterion-referenced 3) norm-referenced; criterion-referenced 4) criterion-referenced; norm-referenced

norm-referenced; criterion-referenced

When researchers want to measure test-retest reliability/precision, they must assume test takers' ability will 1) not change between the first administration and the second administration 2) decrease between the first administration and the second administration 3) increase between the first administration and the second administration 4) be affected by the order they took the tests

not change between the first administration and the second administration

Anonymity is important because researchers believe that people who complete surveys or tests anonymously 1) often provide more honest information about themselves 2) are more likely to follow directions and cooperate with the test administrator 3) will be more likely to follow recommendations based on the test results 4) often speak more positively about psychological testing

often provide more honest information about themselves

The Uniform Guidelines apply to tests used by 1) classroom teachers to measure student learning 2) organizations to make hiring decisions 3) educational institutions for screening applicants for educational programs 4) clinical psychologists and psychiatrists to make diagnoses

organizations to make hiring decisions

According to the National Institute of Neurological Disorders and Strokes, learning disabilities are disorders that affect a person's ability to do all EXCEPT which one of the following? 1) understand or use spoken or written language 2) do mathematical calculations 3) coordinate movements 4) perceive movement

perceive movement

All psychological tests require a person to 1) answer all test questions 2) correctly answer as many questions as possible 3) complete a written test 4) perform an observable and measurable action

perform an observable and measurable action

Upon completing psychological testing of a person referred from another source, a written report is prepared for the 1) person who made the referral 2) client who received treatment 3) client's family 4) psychologist who provided the treatment

person who made the referral

Tests in which the test taker is required to make up stories about a set of ambiguous pictures would be considered to be 1) objective tests 2) subjective tests 3) projective tests 4) essay tests

projective tests

Ethical standards are 1) laws and doctrines established by governmental bodies 2) published guidelines that guide the work of those who practice in a field 3) regulations established by one or persons more a community authorizes 4) rules of conduct or procedures recognized by a city or state

published guidelines that guide the work of those who practice in a field

A behavior validation survey is most likely to be used when developing 1) a personality test 2) an intelligence test 3) an aptitude test 4) questions to use on a job interview

questions to use on a job interview

Which one of the following symbols is used to indicate a validity coefficient? 1) a 2) R 3) b 4) r

The correlation coefficient is represented by 1) c. 2) sd. 3) r. 4) x.

The coefficient of determination is also known as: 1) r2 2) r3 3) square root of r 4) R2

In a regression equation, the intercept is the place where the 1) regression line crosses the y axis 2) x and y axes cross 3) slope is zero 4) regression line crosses the x axis

regression line crosses the y axis

When test scores correlate with independent behaviors, attitudes, or events, we say the test's scores have evidence of validity based on the test's 1) content 2) calculated reliability 3) relations with an external criteria 4) face validity

relations with an external criteria

Researchers must be careful to include participants in their studies who represent the entire possible distribution of performance on both the test and the criterion to avoid 1) practice effects 2) multicultural bias 3) restriction of range 4) criterion contamination

restriction of range

The Principles for the Validation and Use of Personnel Selection Procedures reflects the official statement from the Society of Industrial and Organizational Psychology regarding professionally accepted practices for 1) selecting, developing, and using test instruments to make employment-related decisions 2) training new test developers how to develop testing instruments using commonly accepted best practices 3) using test instruments in clinical, educational, and organizational settings 4) developing test instruments to make selection and development-related decisions

selecting, developing, and using test instruments to make employment-related decisions

If you explained to a group of test takers why they were being tested and the purpose of the test, you would be ensuring the test takers had the right to 1) anonymity 2) self-determination 3) privacy 4) confidentiality

self-determination

Squaring the validity coefficient between a test and a criterion will tell us about the amount of 1) shared variance 2) validity 3) homogeneity 4) reliability

shared variance

People who have minority status in terms of disabilities or ethnicity are referred to as 1) norm groups 2) test users 3) special populations 4) test takers

special populations

Many shortcomings of the traditional interview, including its low reliability and validity, can be overcome by 1) asking different questions to each candidate 2) preparing the applicant for the interview 3) interviewing only one applicant for each job 4) structuring the interview and the interviewing procedure

structuring the interview and the interviewing procedure

Joe designs and conducts surveys and analyzes survey results. According to your text, Joe is a 1) survey specialist 2) project analyst 3) scientific method researcher 4) survey researcher

survey researcher

The concept of accessibility in testing is most associated with 1) test fairness 2) test reliability 3) test quality 4) test length

test fairness

When a test developer gives the same test to the same group of test takers on two different occasions, the developer is gathering evidence of 1) scorer reliability 2) internal consistency 3) internal reliability 4) test-retest reliability

test-retest reliability

What do we call the measure of performance that we expect to correlate with test scores? 1) An intercept 2) The criterion 3) The predictor 4) A standardized test

the criterion

Evidence that the interview or test is job-related is especially important when 1) choosing candidates for upper management positions 2) seeking to hire honest and motivated employees 3) the hiring employer is sued for adverse impact of their selection system 4) employees of the hiring organization belong to a union

the hiring employer is sued for adverse impact of their selection system

When we ask, "Are the inferences being made from a set of test scores appropriate?" we are referring to 1) validity 2) internal consistency 3) reliability 4) face validity

validity

If a test includes questions that are a representative sample of the material covered in a training course, the test demonstrates evidence of 1) reliability 2) validity based on a relation with other variables 3) validity based on content 4) face validity

validity based on content

The predictive and concurrent methods provide two types of evidence for 1) validity based on relations with a construct 2) face validity 3) validity based on relations with a criteria 4) validity based on content

validity based on relations with a criteria

When test developers correlate test scores with the criterion scores, the resulting number is called the 1) validity coefficient 2) reliability coefficient 3) content coefficient 4) criterion coefficient

validity coefficient

According to the American Psychological Association, what are "ethics"? 1) Practices that influence the decision-making process to do the right thing 2) Values members of a profession must follow 3) A community's laws that govern "right" or "wrong" practices 4) A person's moral conscience based on his or her background and upbringing

values members of a profession must follow

Ver todos los conjuntos de estudio

Intro to Psy testing Quizzes

Conjuntos de estudio relacionados

Module 8: Network Layer

Mircoeconomics Final Exam

ABFM ITE 2018

MGMT 1 Chapter 13

9.1 - 9.9 Virtualization, Cloud Secuirty, and Securing Mobile Devices

MAN 4720 Exam 2

ECONOMICS EXAM 2

Business 101 Chapter 8

Entrepreneurship and Starting a Small Business

MA Chapter 9 quiz questions

5.1 Smartbook

Module 5 Topic Mastery 30 question Practice Test

PHY 101 - Astronomy Exam 2 Study Guide

Chapter 16: Defenses: Misrepresentation and Non Disclosure

unit 3 quiz questions genetics

AWS Cloud Practioner 3rd set

neuro 2

Taxes, Retirement, and Other Insurance Concepts

CompTIA Security+ 2023 (SY0-601) Chapter 4

Audience, Purpose & Tone