Psych Testing exam 1
The variance is determined to be 36. What would be the standard deviation?
6
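As a quick check on the card above: the standard deviation is the square root of the variance. A minimal Python sketch, where the function name `sd_from_variance` is illustrative and not from the course material:

```python
import math


def sd_from_variance(variance: float) -> float:
    """Return the standard deviation, i.e. the square root of the variance."""
    if variance < 0:
        raise ValueError("variance cannot be negative")
    return math.sqrt(variance)


print(sd_from_variance(36))  # 6.0
```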
The test manual reports a reliability coefficient (r) of .92, which means:
92% of the variance in scores is explained by real differences
The SEM for an achievement test is 2.45. Jonny scores 100, and we assume that 68% of the time his true score falls within ±1 SEM of that score. This means the confidence interval would be between:
97.55 and 102.45
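The band in the card above is the observed score plus and minus one SEM. A small sketch of that arithmetic follows; `sem_band` and its `n_sem` parameter are hypothetical names used only for illustration:

```python
def sem_band(observed: float, sem: float, n_sem: float = 1.0) -> tuple[float, float]:
    """Return (lower, upper) bounds of observed score +/- n_sem * SEM."""
    half_width = n_sem * sem
    return (observed - half_width, observed + half_width)


low, high = sem_band(100, 2.45)
print(f"{low:.2f} to {high:.2f}")  # 97.55 to 102.45
```

Passing `n_sem=2` would widen the band to roughly the 95% level under the usual normal-curve assumption.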
Which of the following is the best example of a nonstandardized test?
A multiple-choice test created by a teacher to assess how well her students learned the material covered throughout the semester
When comparing the four scales of measurement, what distinguishes the interval scale from the ratio scale?
Absolute zero
Advantages of using computer-administered assessment instruments include which of the following?
All of the above
A researcher wants to measure content-sampling error and has two versions of an achievement test available. What measure of estimating reliability would be best in this situation?
Alternate forms
In studying the correlation between the number of hours studied and scores on the comprehensive exam, the researcher found a correlation of .30. According to Cohen's guidelines, what would this mean regarding the strength of the relationship?
Medium
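Cohen's commonly cited benchmarks for a correlation are about .10 (small), .30 (medium), and .50 (large). The sketch below applies them to the absolute value of r. The function name `cohen_label` and the "negligible" label for values under .10 are assumptions made for illustration:

```python
def cohen_label(r: float) -> str:
    """Classify the strength of a correlation using Cohen's benchmarks."""
    size = abs(r)  # direction does not affect strength
    if size >= 0.50:
        return "large"
    if size >= 0.30:
        return "medium"
    if size >= 0.10:
        return "small"
    return "negligible"  # label assumed here, not part of Cohen's three categories


print(cohen_label(0.30))  # medium
```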
An ______________________ test contains selected-response items, each of which contains a single correct or best answer.
Objective
A group of researchers is attempting to design an instrument that will measure self-esteem. The researchers have an extensive list of traits that may contribute to this construct. Which analysis may assist the research team in narrowing the list of traits into a few dimensions?
Factor analysis
__________ can help clinicians in the verbal or written communication of test results.
Qualitative descriptions of test scores
__________ is a quick process, usually involving a single procedure or instrument.
Screening
A researcher wants to measure internal consistency in a test that measures two different constructs (self-esteem and depression) without subdividing the items into the two construct groupings. Which of the following would be the best method to use in measuring internal consistency?
Split-half reliability
Which type of normalized standard score is widely used in education and categorizes test performance in nine broad units?
Stanines
In a follow-up study, a researcher reported that the correlation is .85. In the initial study, the correlation was .78. This means there is a(n):
Stronger correlation between the variables in the follow-up study
You are attempting to account for time-sampling error and decide to administer the test a second time. In discussing reliability, you report this as what method of estimating reliability?
Test-retest
Which statement is correct?
Testing is only one part of the overall assessment process
Complaints about test use include all EXCEPT:
Tests do not demonstrate mastery of competencies; we must always rely on grades and diplomas
The first major personality assessment was also developed for use during
World War I
A student conducting a research study was told by her professor to use a scatterplot in conjunction with calculating the correlation coefficient. She discovered that the data points clustered along a straight line. This means:
a linear relationship exists between variables
In addition to tests, professionals may also gather client information from
all of the above
In recent years, the prevailing political philosophy in the United States has changed from:
all of the above
Which of the following is an important guideline for a successful interview?
all of the above
Grade equivalents are useful because they
can easily provide a basis of comparison of a student's performance with other students at a given grade level.
When interviewing test takers who had taken an achievement test on three different occasions, participants reported that they had remembered some of the answers from the previous test administration. This is known as:
carryover effect
A type of standard score with a mean of 100 and a standard deviation of 15 is
deviation IQs.
The first and most important step in the assessment process is to:
identify the client's problems to be addressed and the reason for assessment
In order to convert a raw score to a Z score, the following values are needed:
(1) the raw score of the test taker, (2) the mean of the raw scores, and (3) the standard deviation of the raw scores.
A researcher is looking for a specific study report, in which he knows the kurtosis value. He reads the following kurtosis value in various reports. Which one supports a mesokurtic distribution?
0
In a test which results in a report that describes how well the test taker does in comparison with other individuals, it is important to consider whether the
norm group is clearly defined.
The teacher has a small class with only 7 students. The teacher grades their homework and reports scores of: 10, 7, 8, 12, 9, 11, and 13. What is the median?
10
A teacher reports that the class scores are generally distributed according to a bell curve. This means
Nearly 100% (about 99.7%) of the scores fall between -3 and +3 standard deviations from the mean
The teacher calculates the highest score as being 97 and the lowest score as being 75. What is the range?
22
A T score has a fixed mean of __________ and a fixed standard deviation of ___________.
50; 10
If the researcher knows that the mean is 60 and the standard deviation is 6, then the majority of the scores falling between +1 or -1 standard deviation of the mean fall between:
54 and 66
If the reliability coefficient of a test is determined to be .27, what percentage is attributed to random chance or error?
73%
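The reliability coefficient is read as the share of score variance due to real differences, so the error share is simply one minus that value. A minimal sketch, with `error_share` as a made-up helper name:

```python
def error_share(reliability: float) -> float:
    """Proportion of score variance attributed to random error (1 - r)."""
    if not 0.0 <= reliability <= 1.0:
        raise ValueError("reliability must be between 0 and 1")
    return 1.0 - reliability


print(f"{error_share(0.27):.0%}")  # 73%
```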
There are 12 participants who agree to take the test for a study focused on wellness. The total of all the participants' scores is 96. What is the mean?
8
The teacher grades the papers and determines the following set of scores: 90, 85, 87, 85, 92, 90, 83, 85, 98. What is the mode?
85
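The median, range, mean, and mode cards above can be checked with Python's standard `statistics` module. This is only a verification sketch using the numbers given in those cards; the variable names are illustrative:

```python
import statistics

# Median: sort the scores and take the middle value.
homework = [10, 7, 8, 12, 9, 11, 13]
median_score = statistics.median(homework)

# Range: highest score minus lowest score.
score_range = 97 - 75

# Mean: sum of scores divided by the number of participants.
mean_score = 96 / 12

# Mode: the most frequently occurring score.
papers = [90, 85, 87, 85, 92, 90, 83, 85, 98]
mode_score = statistics.mode(papers)

print(median_score, score_range, mean_score, mode_score)  # 10 22 8.0 85
```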
The group scores to which each individual is compared are referred to as
norms
Keisha receives a 79 on the test. This is her:
observed score
The primary purpose of an interview is to
obtain relevant information and determine the interviewee's problem
The first group intelligence test used in the United States military service was the
Army Alpha Test
Testing began
Around 2000 years ago
Which of the following statements is the most accurate?
Assessment occurs throughout the course of the helping relationship
One of the first scales to differentiate between children who could or could not function in a regular classroom was developed by:
Binet
A researcher wants to measure content-sampling error with a Likert scale test. Which of the following methods would be best?
Coefficient Alpha
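Coefficient alpha is usually computed as alpha = k/(k-1) * (1 - sum of item variances / variance of total scores), where k is the number of items. The sketch below assumes that standard formula; the function name and the Likert responses are invented for illustration and are not data from the course:

```python
import statistics


def coefficient_alpha(responses: list[list[float]]) -> float:
    """Cronbach's alpha; rows are respondents, columns are items."""
    k = len(responses[0])
    if k < 2:
        raise ValueError("alpha needs at least two items")
    items = list(zip(*responses))  # transpose: one tuple per item
    item_var_sum = sum(statistics.variance(col) for col in items)
    total_var = statistics.variance([sum(row) for row in responses])
    return (k / (k - 1)) * (1 - item_var_sum / total_var)


# Hypothetical 5-point Likert responses: 5 respondents x 4 items.
likert = [
    [4, 5, 4, 4],
    [2, 2, 3, 2],
    [5, 4, 5, 5],
    [3, 3, 3, 4],
    [1, 2, 1, 2],
]
print(round(coefficient_alpha(likert), 2))
```

Because alpha works with item variances, it handles multi-point items such as Likert responses, not only right/wrong items.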
A teacher analyzes the scores from a recent test and determines a positively skewed curve. She infers that the majority of students knew:
only a few of the answers due to low scores
Early interest in measuring intelligence dates back to the late 19th century when __________________ applied Darwin's evolutionary theory to attempt to demonstrate a hereditary basis for intelligence.
Galton
In comparing Spearman's Rho to Phi Coefficient, one would generally prefer to use Spearman's Rho when correlating:
Likert scale responses
__________ tests are used to appraise some aspect of a person's knowledge, skills, or abilities.
Maximum-performance
__________ is an assessment method that involves watching and recording behavior.
Observation
A research team designed a demographic questionnaire to collect information about participants. Which of the following variables identified on the questionnaire provides an example of an ordinal scale variable?
Rank in the military
A researcher wants to know if IQ predicts success in college. Which analysis is most appropriate?
Regression
Which of the following is true about an unstructured interview?
The interviewer is free to ask questions about whatever he or she feels is relevant
Denise took an aptitude test that was first taken by a large group of male engineers, on whom the test was standardized. Denise is a nurse. What might be a concern when interpreting her test scores?
The norm group is not relevant.
Percentile scores
are commonly used to compare the relative ranking of an individual's test score.
Tests that are non-evaluative are most likely to measure an individual's ___?
attitudes
In reviewing a newly developed test instrument, the evaluator noticed that some of the items did not appear to reflect the construct being measured. He reported there was:
content-sampling error
A raw score can be transformed into any type of standard score by first
converting the raw score to a Z score.
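The conversion described in these cards can be sketched in two steps: compute z = (raw - mean) / SD, then rescale z to the target score's mean and standard deviation (50 and 10 for T scores, 100 and 15 for deviation IQs). The function names and the raw score of 66 are hypothetical; the mean of 60 and SD of 6 echo an earlier card:

```python
def z_score(raw: float, mean: float, sd: float) -> float:
    """Convert a raw score to a z score."""
    if sd <= 0:
        raise ValueError("standard deviation must be positive")
    return (raw - mean) / sd


def rescale(z: float, new_mean: float, new_sd: float) -> float:
    """Convert a z score to a standard score with the given mean and SD."""
    return new_mean + z * new_sd


z = z_score(66, 60, 6)           # one SD above the mean
t_score = rescale(z, 50, 10)     # T score scale
deviation_iq = rescale(z, 100, 15)  # deviation IQ scale
print(z, t_score, deviation_iq)  # 1.0 60.0 115.0
```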
In __________________ test interpretation, content domain is an important consideration.
criterion-referenced
Jacqueline, a school psychologist, is observing a child in her classroom. Jacqueline is completing a structured checklist of the child's disruptive behaviors while the child is interacting with peers. What best describes the type of observation Jacqueline is doing?
direct, contrived, and obtrusive
Which of the following is NOT an advantage of a computer-based test?
duration recording
Sue, a school counselor, is observing a student's behavior in the classroom. She is monitoring how often the student gets out of his seat while working on a class activity. In order to record this behavior, Sue makes a check mark on a tally sheet and counts how many times the student got out of his seat. Which observation recording method is Sue using?
event recording
A test should not be used for purposes not specifically recommended by the test developer unless
evidence is obtained to support an alternative use.
A school interested in measuring students' progress in academic subjects throughout the school years might choose to use
growth scale values (GSV).
Identify the correct order of the steps of the assessment process
identify the problem; select and implement assessment methods; evaluate the assessment information; report assessment results and make recommendations
A researcher determines that there is a positive correlation between sleep and test scores. This means as the amount of sleep is increased then test scores:
increase
Jose has developed a test that has poor reliability. He can seek to increase reliability by:
increasing the number of test questions
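The effect of lengthening a test is commonly estimated with the Spearman-Brown prophecy formula, r_new = k*r / (1 + (k-1)*r), where k is the factor by which the test length changes. The card does not name this formula, so treat the sketch below as an illustration under that assumption; the starting reliability of .60 is made up:

```python
def spearman_brown(reliability: float, length_factor: float) -> float:
    """Predicted reliability after changing test length by length_factor."""
    return (length_factor * reliability) / (1 + (length_factor - 1) * reliability)


# Doubling the number of comparable items on a test with reliability .60:
print(round(spearman_brown(0.60, 2), 2))  # 0.75
```

The prediction assumes the added items are comparable in quality and content to the existing ones.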
A test that measures an individual's verbal ability, abstract reasoning, and memory would be best described as a(n)
intelligence test
An administrator and the school psychologist were observing a child to assess for behavioral problems. An error may occur in reviewing what the two observers notice. This is reported as:
interrater differences
A client is assessed to determine a course of action that would improve his or her concerns or problems. In this situation, the purpose for assessment is:
intervention planning
The primary purpose of _____________ is to gather background information about the client relevant to the reason for assessment.
interviews
Aptitude, intelligence, and achievement tests are all examples of
maximum-performance tests.
Assessment involves selecting and utilizing __________ __________ of data collection.
multiple methods
The most thorough way that counselors should assess an individual is by using
multiple methods
Evadne, a teacher at an elementary school, is completing a rating scale for one of her students. The teacher has often complained about the student to the school counselor, leaving a bad impression about the student on the school counselor. Which of the following rating scale errors is most likely to occur?
negative halo
When looking at a list of students' test scores, the teacher notices that one test score is far lower than the majority of the scores. This is known as a(n):
outlier
Criterion referenced test interpretation is commonly used with
professional licensure or certification tests.
Several test takers complained that items on the test were vague and confusing. This creates concern for
quality of test items
Raw scores
represent the original test results that describe the number of correctly answered questions.
Miguel's score on a reading comprehension test is in the 80th percentile. This means that Miguel
scored higher than 80 percent of the students who took the test.
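Following the card's wording, a percentile rank can be sketched as the percentage of scores in the group that fall below a given score. Some texts also credit half of any tied scores; this sketch does not. The function name and the class scores are hypothetical:

```python
def percentile_rank(score: float, scores: list[float]) -> float:
    """Percentage of scores in the group that are below the given score."""
    if not scores:
        raise ValueError("scores must not be empty")
    below = sum(1 for s in scores if s < score)
    return 100.0 * below / len(scores)


class_scores = list(range(1, 11))  # hypothetical scores 1 through 10
print(percentile_rank(9, class_scores))  # 80.0
```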
All of the following are forms of collateral sources of information except:
self-monitoring
You are reading about reliability of a test in the test manual and notice that the researchers report using a Spearman-Brown coefficient. You can infer that internal consistency reliability was measured using:
split-half reliability
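The link between the two terms in this card: a split-half estimate correlates scores on two halves of the test, and the Spearman-Brown correction, 2r / (1 + r), steps that half-test correlation up to full length. A sketch assuming an odd/even item split follows; the function names and response data are invented for illustration:

```python
import math


def pearson(x: list[float], y: list[float]) -> float:
    """Pearson correlation between two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def split_half_reliability(responses: list[list[float]]) -> float:
    """Odd/even split-half reliability with the Spearman-Brown correction."""
    odd_half = [sum(row[0::2]) for row in responses]   # items 1, 3, 5, ...
    even_half = [sum(row[1::2]) for row in responses]  # items 2, 4, 6, ...
    r_half = pearson(odd_half, even_half)
    return 2 * r_half / (1 + r_half)


# Hypothetical right/wrong (1/0) answers: 5 test takers x 6 items.
answers = [
    [1, 1, 1, 1, 1, 0],
    [1, 0, 1, 1, 0, 0],
    [0, 0, 1, 0, 0, 0],
    [1, 1, 1, 1, 1, 1],
    [0, 1, 0, 0, 1, 0],
]
print(round(split_half_reliability(answers), 2))
```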
Z scores, T scores, and deviation IQs are examples of
standard scores
Karen is a mental health counseling student who is working on a research project with one of her professors. As part of this research, she is required to interview prospective participants. Karen has a list of fifteen specific questions that she must ask in the same order with each participant. What type of interview is Karen most likely using?
structured
A researcher is concerned with measuring internal consistency reliability and has decided to use the Kuder-Richardson formulas with a Likert scale test. This is a problem because the:
test does not have dichotomous test items
An agency director who is concerned with the agency's budget is most likely to consult the _______ when evaluating the cost of a specific test?
test publisher's catalogue
Which of the following is most likely to be concerned with using the results of a test to make clinical decisions?
test user
A test was administered to a group of students the morning after homecoming. Several of the students appeared tired and some were coughing and sneezing. These factors may result in what type of error:
test-taker variables
One reason for choosing either norm-referenced or criterion-referenced interpretation involves
the breadth of the construct being measured.
When the researcher interprets the reliability coefficient, the closer the coefficient is to 0:
the more the scores represent random error
The researcher determines that the reliability coefficient is .65. This means the reliability is:
the spread of scores of a single individual if he/she took a test repeated times
While selecting a test to use in his private practice, Kent discovered that a particular test was not very consistent or stable over time. In other words, a test-taker's score varied each time he or she took it. Kent can most likely infer that:
the test has poor evidence of reliability
A teacher determines that the correlation between eating breakfast and doing well on the test is zero. This means:
there is an absence of a correlation
A researcher administers an achievement test to the same group of participants on three different occasions. In reporting the results, he describes the error that occurs from repeatedly testing the same individuals. This is called:
time-sampling error
Which of the following is the most frequent type of interview used by practicing counselors?
unstructured
When a test accurately measures what it is intended to measure it is said to have sound
validity