ED 401 Midterm

Ace your homework & exams now with Quizwiz!

Which one of the following statements could be technically correct? "The test is face-valid." "The test is definitely valid." "The test-based inference is valid." "The test is consequentially valid."

"The test-based inference is valid."

Reliability coefficients range from​ ________________. -1.00 to​ +1.00 0 to 1.00 -100.00 to​ +100.00 -10.00 to​ +10.00

-1.00 to​ +1.00

Formative assessment is a planned process in which assessment-elicited evidence of students' status is used by teachers to adjust their ongoing instructional procedures, or by students adjust their current learning tactics

.

According to the author, formative assessments serve _______purposes.

2

The author provides a _________step process for building a learning progression.

4

If educators wish to accurately estimate the likelihood of consistent decisions about students who score at or near a​ high-stakes test's previously determined​ cut-score, which of the following indicators would be most useful for this​ purpose? A standard error of measurement for the entire test A traditionally computed​ internal-consistency reliability coefficient A conditional standard error of measurement​ (near the​ cut-score) A standard deviation signifying the degree of score spread among​ test-takers

A conditional standard error of measurement​ (near the​ cut-score)

Which is the most appropriate description of a learning​ progression? A learning progression is an ordered sequence of the stuff a teacher must teach so as to achieve a significant curricular outcome. None of these appropriately describe a learning progression. A learning progression is an ordered sequence of the stuff a student must learn in order to perform at a high level on a summative assessment. A learning progression is an ordered sequence of the stuff a student must learn so as to achieve a significant curricular outcome.

A learning progression is an ordered sequence of the stuff a student must learn so as to achieve a significant curricular outcome.

As defined by the author, formative assessment is A formal attempt to determine a student's status with respect to an educational variable of interest. Assessments like exit tickets, 3-2-1, and mini whiteboards. A simple and quick assessment where teachers use the results to inform their future planning. A planned process in which assessment-elected evidence of students' status is used by teachers to adjust their ongoing instruction procedures or by students to adjust their current learning tactics. A test administered periodically during a school year intended to predict students' performance on standardized assessments.

A planned process in which assessment-elected evidence of students' status is used by teachers to adjust their ongoing instruction procedures or by students to adjust their current learning tactics.

Which of the following is the most useful indicator of the consistency of an individual student's test performance? A polytomous item-analysis A dichotomous item-analysis A standard error of measurement An alternate-form reliability coefficient

A standard error of measurement

Please select the one answer that most accurately identifies the particular​ item's "quoted" reason for teachers to know about assessment. ​"Wishing that students will make progress does not guarantee that students actually will do so. And this is why I believe teachers have a fundamental responsibility to monitor their​ students' progress throughout the school year. I try to administer informal​ progress-monitoring quizzes every few weeks to make sure my instruction is​ "taking." If my instruction is not working as well as I want it to​ work, then I can make modifications in my upcoming teaching plans.​ Assessment-based monitoring of​ students' progress is so very sensible that​ it's hard for me to understand why it is not more widely used. This​ is: A traditional reason for teachers to know about assessment Neither a traditional reason nor one of​ today's reasons for teachers to know about assessment Both a traditional reason and one of​ today's reasons for teachers to know about assessment One of ​today's reasons for teachers to know about assessment

A traditional reason for teachers to know about assessment

Please select the one answer that most accurately identifies the particular​ item's "quoted" reason for teachers to know about assessment.​ "Teachers need to give classroom assessments in order to assign grades to students indicating how well each student has attained the learning outcomes set for​ them." This​ is: Both a traditional reason and one of​ today's reasons for teachers to know about assessment Neither a traditional reason nor one of​ today's reasons for teachers to know about assessment A traditional reason for teachers to know about assessment One of ​today's reasons for teachers to know about assessment

A traditional reason for teachers to know about assessment

Which of the following is the best definition for the concept of accessibility for students with​ disabilities? Accessibility refers to the notion that some test takers must have an unobstructed opportunity to demonstrate their status with respect to the​ construct(s) being measured by an educational test. While most students have an unobstructed​ opportunity, educators only need to focus on a small few. Accessibility refers to the notion that all test takers must have an unobstructed opportunity to demonstrate their status with respect to the​ construct(s) being measured by an educational test. This is a key component of supporting individuals with disabilities. None of these are reasonable definitions. Accessibility refers to the notion that most test takers must have an unobstructed opportunity to demonstrate their status with respect to the​ construct(s) being measured by an educational test. It is impossible to achieve this for​ everyone, but is achievable for most students.

Accessibility refers to the notion that all test takers must have an unobstructed opportunity to demonstrate their status with respect to the​ construct(s) being measured by an educational test. This is a key component of supporting individuals with disabilities.

Which of the following types of assessment targets a​ student's attitudes,​ interests, and​ values? Standardized assessment Cognitive assessment Psychomotor assessment Affective assessment

Affective assessment

Which of the following terms refers to the degree to which there is a meaningful agreement between two or more of the​ following: curriculum,​ instruction, and​ assessment? Continuity Alignment Standardization Common core

Alignment

Which of the following views regarding the assessment of English language learners (ELL) is most defensible? ELL students, if tested with regular English-language tests, will often outperform students whose native language is English. The use of assessment accommodations for ELL students typically leads to more valid test-based inferences about those students. Almost all groups charged with the assessment of ELL students urge the use either of translated tests or, failing that, the provision of interpreters for ELL students. It is relatively easy to develop alternate tests for ELL students in those students' native la

Almost all groups charged with the assessment of ELL students urge the use either of translated tests or, failing that, the provision of interpreters for ELL students.

Which of the following is typically recommended for use with students who have the most serious cognitive disabilities? Assessment accommodations Regular assessments Alternate assessments No assessments whatsoever

Alternate assessments

Which statement best characterized this nation's current use of formative assessment? Although research-supported, formative assessment is not widely used. Only a five-strategies approach to formative assessment is often encountered. Most teachers now employ the formative-assessment process in their classes. A four-levels approach to formative assessment is used in the lower grades.

Although research-supported, formative assessment is not widely used.

Please assume you are a​ middle-school English teacher​ who, despite this​ chapter's urging that you​ rarely, if​ ever, collect reliability evidence for your own​ tests, stubbornly decides to do so for all of your midterm and final exams. Although you wish to determine the reliability of your tests for the group of students in each of your​ classes, you only wish to administer the tests destined for such reliability analyses on one​ occasion, not two or more. Given this​ constraint, which of the following coefficients would be most suitable for your​ reliability-determination purposes? A standard error of measurement A​ test-retest reliability coefficient An​ internal-consistency reliability coefficient An​ alternate-forms reliability coefficient

An​ internal-consistency reliability coefficient

Which of the following is a reasonable explanation for why the word assessment may be used over the word​ testing? Teachers would rather avoid the word testing as it creates situations where classroom management becomes difficult. The word testing produces anxiety in students. Assessment is a broader descriptor of the type of measurement practices in which teachers engage. Students are no longer asked to participate in testing given the more nuanced measurement approaches required by federal legislation.

Assessment is a broader descriptor of the type of measurement practices in which teachers engage.

If a​ teacher's students include children with disabilities or children who are English language​ learners, which assertion about assessment bias is most​ defensible? In view of the inherent difficulties that students with disabilities and English language learners are bound to experience in completing most educational​ tests, meaningful relaxations should be allowed in the levels of assessment challenges given to those two groups of students. Students with disabilities and English language learners are at no greater risk for experiencing assessment bias. Because assessment bias erodes the validity of inferences derivative from​ students' test​ performances, even greater effort should be made to reduce assessment bias when working with these two distinctive populations. Given the atypical nature of these two groups of students and the inability of most test accommodations to adequately level the playing​ field, there is really no need for​ test-developers to give any extra attention to​ bias-reduction for either of these two special groups of students.

Because assessment bias erodes the validity of inferences derivative from​ students' test​ performances, even greater effort should be made to reduce assessment bias when working with these two distinctive populations.

Why do some members of the measurement community prefer to use the phrase​ "absence-of-bias" rather than​ "assessment bias" when quantitatively reporting the degree to which an educational test appears to be​ biased? None of the provided answer choices is accurate. Because both reliability and​ validity, two key attributes of educational​ tests, are positive qualities.​ "Absence-of-bias" is a positive quality to be sought in educational tests. Because​ "absence of​ bias" does not include the qualifier​ "assessment" and,​ thus, is more comprehensive in its applicability. Because the quantitative indices typically employed when describing the extent to which a test is biased are most commonly conceptualized in a negative rather than a positive fashion.

Because both reliability and​ validity, two key attributes of educational​ tests, are positive qualities.​ "Absence-of-bias" is a positive quality to be sought in educational tests.

Please select the one answer that most accurately identifies the particular​ item's "quoted" reason for teachers to know about assessment. ​"I was quite surprised when our​ state's department of education insisted that each of the​ state's teachers collect accurate evidence of their​ students' growth because such evidence was to be used in evaluating all of the​ state's teachers. I​ have, for my entire​ career, collected pretest and posttest evidence of my​ students' achievement status because this helps meirrespective of what the state wants me to dodetermine which​ changes, if​ any, are needed during next​ year's instruction." This​ is: Neither a traditional reason nor one of​ today's reasons for teachers to know about assessment A traditional reason for teachers to know about assessment Both a traditional reason and one of​ today's reasons for teachers to know about assessment One of ​today's reasons for teachers to know about assessment

Both a traditional reason and one of​ today's reasons for teachers to know about assessment

Considering the knowledge​ you've gained regarding formative assessment in this​ chapter, which of the following characteristics of a class discussion could yield formative assessment​ information? Class discussions that get all students involved. Class discussions that are about important topics. Class discussions that show what students are thinking. None of these types of class discussions would yield formative assessment information.

Class discussions that show what students are thinking.

How does "classification consistency" differ conceptually from more traditional indicators of test reliability? Classification-consistency approaches are focused more on capturing the degree of students' consistent categorizations rather than supplying only numerical indices of students' score consistency. Classification-consistency approaches do not employ numerical indicators of an assessment's consistency, unlike traditional reliability procedures. In contrast to traditional reliability approaches, if there are no actual decisions linked to a test's use, it is impossible to determine a test's classification consistency. Whereas the three traditional forms of reliability evidence are largely interchangeable, classification-consistency approaches to reliability are truly distinctive.

Classification-consistency approaches are focused more on capturing the degree of students' consistent categorizations rather than supplying only numerical indices of students' score consistency.

These​ standards, released by the Council of Chief State School Officers and the National Governors Association for Best Practices were an attempt to establish continuity and consistency across varying state curricular aims. Common Core State Standards Individuals with Disabilities Education Act No Child Left Behind Every Student Succeeds Act

Common Core State Standards

Which one of the following best describes reliability? Numerical indicator of what-ever type of evidence has been assembled Improvement in an item's quality based on a student's performance on the item Consistency of results produced by measurement devices Degree to which evidence supports the accuracy of test-based inferences about students for a test's specific purpose

Consistency of results produced by measurement devices

Suppose that the developers of a new science achievement test had inadvertently laden their​ test's items with​ gender-based stereotypes regarding the role of women in science​ and, when the new test was​ given, the test scores of girls were markedly lower than the test scores of boys. Which of the following deficits most likely accounts for this gender disparity in​ students' scores? Predictive invalidity Concurrent invalidity Construct-irrelevant variance Construct underrepresentation

Construct-irrelevant variance

One of the most commonly misused terms in educational jargon is the word​ "standards." In​ reality, there is no​ singular, all-encompassing concept of a​ standard, but rather more specific subtypes of educational standards. Which of the following subtypes of standards could best be described as​ "the knowledge or skills that educators want students to​ learn"? Teaching standard Content standard Academic standard Performance standard

Content standard

Ms. Brown attempted to design an assessment that assessed the reading comprehension of her students in regard to their ability to comprehend informative text. After reviewing her drafted​ assessment, she realized she had relied on other kinds of text and had not included an adequate amount of informative text. Which phrase best describes Mrs.​ Brown's mistake? Construct-irrelevant variance Test bias Content underrepresentation Inaccurate assessment formation

Content underrepresentation

When students' test scores are-as predicted-correlated positively with those students' scores on a test aimed at a similar measurement mission, of what is this an example? Convergent validity evidence Divergent validity evidence Construct underrepresentation Construct-irrelevant variance

Convergent validity evidence

Which term best describes the type of measurement that would yield the following​ feedback: Jonathan mastered 92 percent of the tested​ content? Classroom-created measurement Assessment Norm-referenced measurement Criterion-referenced measurement

Criterion-referenced measurement

Which of the following was a major shift in the 2014 AERA-APA-NCME Standards for Educational and Psychological Testing? Adding a complete chapter on affective assessment for individual students. Deleting a significant chapter of previous standards dealing with "fairness." Defining assessment validity as inference-accuracy for a specific purpose. Formally sanctioning both the construct and the label: "Consequential Validity."

Defining assessment validity as inference-accuracy for a specific purpose.

Which one of the following best describes validity? Consistency of results produced by measurement devices Degree to which evidence supports the accuracy of test-based inferences about students for a test's specific purpose Improvement in an item's quality based on a student's performance on the item Numerical indicator of what-ever type of evidence has been assembled

Degree to which evidence supports the accuracy of test-based inferences about students for a test's specific purpose

Traditional Reasons for Assessments

Determining Students' Current Status Determining One's Own Instructional Effectiveness Monitoring Students' Progress Assigning Grades

Which one of the following pairs of validity evidence most frequently revolves exclusively around judgments focused on test​ content? Clearly explicated test​ developers' statements about what is to be assessed and correlations between​ test-takers' scores and their performances on external variables Careful analyses of a​ test's internal structure and​ test-developers' descriptions of the care with which a test was built Developmental-care documentation and external content reviews by nonpartisan judges Descriptions of developmental care and correlations of​ test-takers' performances with their scores on similar tests

Developmental-care documentation and external content reviews by nonpartisan judges

Which of the following answer choices depicts the chronologically accurate development of assessment legislation in the United​ States? ESSA, ESEA, NCLB ESEA, ESSA, NCLB NCLB, ESSA, ESEA ESEA, NCLB, ESSA

ESEA, NCLB, ESSA

Which of the following pieces of federal legislation attempts to install greater degrees of flexibility so that states and districts can particularize their programs for implementing chief provisions of the current successor to​ ESEA? ESSA NCLB IDEA ESEA

ESSA

Which of the following best captures the testing comparison between NCLB and ESSA? ESSA permits greater state-determinations of testing than NCLB. ESSA demands more stringent assessment than NCLB. ESSA emphasizes more appropriate testing of female students than NCLB. ESSA eliminates NCLB's focus on fairly assessing special subgroups of students.

ESSA permits greater state-determinations of testing than NCLB.

Classroom assessment in public schools is often a function of federal legislation. Which of the following pieces of legislation is historically considered to have had the greatest impact on public school testing​ policies? Elementary and Secondary Education Act​ (ESEA) of 1965 No Child Left Behind Every Student Succeeds Act Individuals with Disabilities Education Act of 2004

Elementary and Secondary Education Act​ (ESEA) of 1965

Differential item functioning (DIF) is employed in connection with which of the following approaches to bias-detection? Judgmental approaches Neither empirical nor judgmental approaches Both empirical and judgmental approaches Empirical approaches

Empirical approaches

The Common Core State Standards were an attempt to outline what students should know at each grade level in which of the following​ subjects? English Language Arts and Science English Language Arts and Mathematics Science and Mathematics Mathematics and Social Studies

English Language Arts and Mathematics

Assume a​ state's education authorities have recently established a policy​ that, in order for students to be promoted to the next grade​ level, those students must pass a​ state-supervised English and language arts​ (ELA) exam. Administered near the close of grades​ three, six, and​ eight, the three new​ grade-level exams are intended to determine a​ student's mastery of the official​ state-approved ELA curricular targets for those three grades. As state authorities set out to provide support for these​ promotion-denial exams, which one of the following sources of validity evidence are they likely to rely on most​ heavily? Evidence based on the test​ scores' relationship to other variables Evidence based on responses of students as they take a​ grade-level test Evidence based on test content Evidence based on internal structure of each of the tests

Evidence based on test content

Which of the following is not a step in the four steps for creating a learning​ progression? Identify all requisite precursory subskills and bodies of enabling knowledge. Arrange all the building blocks in an instructionally sensible sequence. Form a basic and introductory understanding of a target curricular aim. Determine the measurability of each preliminary identified building block.

Form a basic and introductory understanding of a target curricular aim.

Which represents the most appropriate definition of formative​ assessment? Formative assessment is a testing structure in which teachers assess unplanned student activities and use the results to inform their instruction. Formative assessment is an unplanned process in which​ assessment-elicited evidence of​ students' status is used by teachers to adjust their ongoing instructional procedures or by students to adjust their current learning tactics. Formative assessment is a testing structure in which student evidence is used by teachers to assign quantitative scores to student work. Formative assessment is a planned process in which​ assessment-elicited evidence of​ students' status is used by teachers to adjust their ongoing instructional procedures or by students to adjust their current learning tactics.

Formative assessment is a planned process in which​ assessment-elicited evidence of​ students' status is used by teachers to adjust their ongoing instructional procedures or by students to adjust their current learning tactics.

Which of the following questions is not an element in a research-supported conception of formative assessment? Formative assessment should be used only by teachers to adjust their ongoing instructional activities. Formative assessment must be carefully planned. Formative assessment is a process, not a test. Formative assessment calls for the use of assessment-elicited evidence in making adjustment decisions.

Formative assessment should be used only by teachers to adjust their ongoing instructional activities.

Which of the following represents the most appropriate strategy by which to support the validity of​ score-based interpretations for specific​ uses? Generation of an​ evidence-laden validity argument in support of a particular​ usage-specified score interpretation Isolation of as much as possible​ inference-linked validity​ evidence, both positive and​ negative, regarding how to interpret​ test-takers' scores Assembly of​ test-users' personal perceptions regarding the accuracy of their own​ score-based interpretations of​ test-takers' performances Collection of​ validity-relevant evidence, as well as​ validity-irrelevant evidence, regarding the best and the worst ways to give meaning to​ students' test scores

Generation of an​ evidence-laden validity argument in support of a particular​ usage-specified score interpretation

Today's Reasons for Assessments

Helping Evaluate Teachers Clarifying Teachers' Instructional Intentions Influencing Public Perceptions of Educational Effectiveness

Which of the following assertions most accurately captures the relationship between disparate impact of a test and assessment bias? A test that is biased against either gender group will almost certainly be biased against ethnic groups. If a test has a disparate impact on different student subgroups, the test is not necessarily biased. If a test has a disparate impact on different student subgroups, the test is a priori biased. If a test is biased, it will only rarely have a disparate impact on different student subgroups.

If a test has a disparate impact on different student subgroups, the test is not necessarily biased.

Which of the following terms best describes the means teachers employ in their attempt to promote​ students' achievement of the curricular ends being​ sought? Curriculum Instruction Measurability Assessment

Instruction

Cronbach's coefficient alpha and the Kuder-Richardson reliability formulae are examples of: Test-retest coefficients Alternate-form coefficients Internal-consistency coefficients None of the above

Internal-consistency coefficients

If Mr. Higgins, a fourth-grade teacher, tries to evaluate his major exams by ascertaining the degree to which his test's items are functioning in a similar manner, what kind of test-evaluative evidence is this? Test-retest reliability evidence Alternate-form reliability evidence Internal-consistency reliability evidence None of the above

Internal-consistency reliability evidence

Which of the following approaches to bias-elimination is it most reasonable to expect classroom teachers to use? Neither empirical nor judgmental approaches are reasonable for classroom teachers to use. Both empirical and judgmental approaches are equally reasonable for classroom teachers to use. Judgmental approaches Empirical approaches

Judgmental approaches

Please select the one answer that most accurately identifies the particular​ item's "quoted" reason for teachers to know about assessment.​ "Just as physicians need to know about​ patients' blood pressure and what it​ indicates, teachers need to know about educational testing. It is simply part of what a solid educational professional needs to​ understand." This​ is: Both a traditional reason and one of​ today's reasons for teachers to know about assessment A traditional reason for teachers to know about assessment Neither a traditional reason nor one of​ today's reasons for teachers to know about assessment One of ​today's reasons for teachers to know about assessment

Neither a traditional reason nor one of​ today's reasons for teachers to know about assessment

Which of the following pieces of federal legislation had notably been an attempt at reversing the growing achievement gap that left poor and minority students in failing schools while requiring a​ "show us" approach to student​ evaluation? No Child Left Behind Individuals with Disabilities Education Act Every Student Succeeds Act Elementary and Secondary Education Act

No Child Left Behind

The standard error of measurement is focused chiefly on: Relations to other variables evidence of validity Response processes evidence of validity None of the above Test content evidence of validity

None of the above

Which term best describes the type of measurement that would yield the following​ feedback: Jonathan scored within the 92nd percentile on the​ SAT? Norm-referenced measurement Classroom-created measurement Assessment Criterion-referenced measurement

Norm-referenced measurement

Which of the following is one of the five general item-writing guidelines? Not provide students with unintentional clues regarding appropriate responses Have an approximately equal number of items representing the two response categories Provide students with the approximate time to be expended on each item as well as each item's value Randomly assign correct answers to alternative positions

Not provide students with unintentional clues regarding appropriate responses

What should be the two major fairness concerns of a classroom teacher who wishes to eliminate bias in the teacher's assessment instruments? Offensiveness and absence-of-bias in items in the teacher's tests Unfair penalization and reliability of items in the teacher's tests Offensiveness and unfair penalization of items in the teacher's tests Disparate impact and offensiveness of items in the teacher's tests

Offensiveness and unfair penalization of items in the teacher's tests

Please select the one answer that most accurately identifies the particular​ item's "quoted" reason for teachers to know about assessment. ​"Just a year​ ago, the voters in our school district voted favorably in a huge​ school-levy election that brought in substantial tax dollars for our schools. Most of the​ district's teachers are convinced that this positive support for the schools was based on our​ schools' consistently high rankings on the​ state's annual accountability​ tests." This​ is: A traditional reason for teachers to know about assessment One of ​today's reasons for teachers to know about assessment Both a traditional reason and one of​ today's reasons for teachers to know about assessment Neither a traditional reason nor one of​ today's reasons for teachers to know about assessment

One of ​today's reasons for teachers to know about assessment

Suppose that you and several other teachers in a middle school were trying to construct a new test intended to be predictive of​ high-school students' subsequent scores on the SAT and ACT college admissions exams.​ Moreover, suppose that you were in no particular hurry to assemble validity evidence in support of the accuracy of those inferred predictions. Which source of validity evidence would supply the most compelling support for the validity of your anticipated​ predictions? Validity evidence based on the content of your new test Concurrent validity evidence based on the new​ test's relation to other variables Predictive validity evidence based on the new​ test's relation to other variables Validity evidence based on the new​ test's internal structure

Predictive validity evidence based on the new​ test's relation to other variables

Which of the following provide validity evidence? Internal consistency Relation to other variables Test-retest Alternate-form Test content Response processes Internal structure

Relation to other variables Test content Response processes Internal structure

What kind of evidence is most eagerly sought by the commercial testing firms that develop academic aptitude tests? Evidence that a test is regarded by some as face-valid Test content evidence of validity Relations to other variables evidence of validity Evidence that the consequences of a test's use will be appropriate

Relations to other variables evidence of validity

One important group of students in need of protection from assessment bias is English language Learners​ (ELLs). Therefore, it is important to understand the qualifying categories for an ELL student. Which is not a category of students considered to be English language learners​ (ELLs)? Students who are fluent in​ English, but prefer to speak another primary language. Students whose first language is not English and know​ little, if​ any, English. Students who are beginning to learn English but could benefit from school instruction. Students who are proficient in English but could use additional assistance in academic or social contexts.

Students who are fluent in​ English, but prefer to speak another primary language.

Based on the 2014 edition of the Standards for Educational and Psychological Testing​, and on common​ sense, which one of the following statements about​ students' test results represents a potentially appropriate phrasing​ that's descriptive of a set of​ students' test​ performances? The​ students' test scores are invalid. The​ students' test scores are valid for unlimited educational purposes. Students' scores on the test permit valid interpretations for this​ test's use. The scores are valid​ if, and only​ if, they were elicited by a valid test.

Students' scores on the test permit valid interpretations for this​ test's use.

Major Outcomes To Be Attained by Reading the Textbook

Teach students to master what's assessed in classroom and high-stakes tests Construct and evaluate their own classroom tests Interpret results of standardized tests

Which of the following is one of the most common mistakes teachers make in developing a learning progression? Teachers often disregard learning progressions and refer to pacing guides. Teachers often include too many building blocks. Teachers often refer to teacher manuals for advice. Teachers often include the higher-order cognitive skills and ignore the lower-level cognitive skills.

Teachers often include too many building blocks.

Which strategy seems most suitable for teachers to use when trying to detect and eliminate assessment bias in their own​ teacher-made tests? Teachers should pay particular attention to the possibility that assessment bias may have crept into their​ teacher-made tests and should strive to rely on their best judgments about the presence of such bias on all of their classroom testsbut especially on their most significant classroom assessments. Because teachers spend so much time focusing on avoiding​ bias, it is highly unlikely that​ teacher-produced exams contain bias. Because decisions based on hard evidence will almost always be more defensible than decisions based dominantly on human​ judgment, teachers should identify potentially biased items in their​ teacher-made tests by using empirical methodsand only thereafter confirm those identifications using human judgment. Given that​ students' self-identification of potentially biased items can prove remarkably illuminating to teachers regarding the items in their​ teacher-made tests, all significant classroom assessments should provide an opportunity for students themselves to indicate that they regarded an item as biasedand to indicate the nature of this biaswhile actually completing a​ test's items.

Teachers should pay particular attention to the possibility that assessment bias may have crept into their​ teacher-made tests and should strive to rely on their best judgments about the presence of such bias on all of their classroom testsbut especially on their most significant classroom assessments.

Which of the following sources of validity evidence are teachers most likely to collect? Response processes Relations to other variables Internal structure Test content

Test content

Classroom teachers are most apt to focus on which of the following? Evidence of alternate-form reliability Internal structure evidence of validity Evidence of internal-consistency reliability Test content evidence of validity

Test content evidence of validity

A history teacher, Mrs. Scroggins, tries to determine the consistency of her tests by occasionally re-administering them to her students, then seeing how much similarity there was in the way her students performed. What kind of reliability evidence is Mrs. Scroggins attempting to collect? Test-retest Alternate form Internal consistency Precision

Test-retest

Which of the following provide reliability evidence? Response processes Relation to other variables Internal structure Test content Test-retest Internal consistency Alternate-form

Test-retest Internal consistency Alternate-form

If a multistate assessment consortium has generated a new performance test of​ students' oral communication skills and wishes to verify that​ students' scores on the performance test remain relatively similar regardless of the time during the school year when the test was​ completed, which of the following kinds of consistency evidence would be most​ appropriate? Test-retest evidence of reliability Internal-consistency evidence of reliability Alternate-form evidence of reliability The standard error of measurement

Test-retest evidence of reliability

Stability data regarding assessment consistency is an instance of: Test-retest reliability Precision reliability Internal-consistency reliability Alternate-form reliability

Test-retest reliability

Which of the following should be most influential in guiding those who create educational tests? The AERA-APA-NCME Standards for Educational and Psychological Testing The Common Core State Standards Test specifications of the Smarter Balanced Assessment Consortium Test specifications of the PARCC Assessment Consortium

The AERA-APA-NCME Standards for Educational and Psychological Testing

Which of the following​ non-profit organizations played a significant role in the rapid adoption of the Common Core State Standards​? The Council for Exceptional Children The National Education Association The Bill and Melinda Gates Foundation The American Federation of Teachers

The Bill and Melinda Gates Foundation

Which of the following is a consequence of a collaborative effort by the National​ Governor's Association​ (NGA) and the Council of Chief State School Officers​ (CCSSO)? The Common Core State Standards​ (CCSS) Elementary and Secondary Education Act Every Student Succeeds Act No Child Left Behind

The Common Core State Standards​ (CCSS)

Which of the following conceptions of assessment validity is most constant with the Standards for Educational and Psychological Testing released in 2014 by AERA, APA, and NCME? How consistently a test measures whatever it measures The accuracy of score-based interpretations for intended test uses The accuracy of score-based inferences about test-takers The appropriateness of a test's usage

The accuracy of score-based interpretations for intended test uses

Please review the following item for assessment bias. It was used to assess the basic computation mathematics aims being pursued by an​ inner-city, elementary​ school's staff in a Midwestern state. Ramon Ruiz is sorting out empty tin cans he found in the neighborhood. He has four piles based on different colors of the cans. He thinks he has made a mistake in adding up how many cans are in each pile. Please identify​ Ramon's addition statement that is in error. a. 20 bean cans plus 32 cans​ = 52 cans. b. 43 bean cans plus 18 cans​ = 61 cans. c. 38 bean cans plus 39 cans​ = 76 cans. d. 54 bean cans plus 12 cans​ = 66 cans The assessment item does not appear to be biased. These are not two major causes of assessment biased encountered in typical educational tests. The assessment item appears to be biased in favor of recyclers. The assessment item appears to be biased against Americans of Latino backgrounds.

The assessment item appears to be biased against Americans of Latino backgrounds.

Review the following mathematics item for assessment bias. Amy Johnson has a large collection of Barbie dolls.​ Originally, she had 49.​ Recently, she somehow lost 12 Barbies. How many Barbies does Amy have​ left? (Show your​ work.) a. 37 Barbies b. 61 Barbies c. 27 Barbies The assessment item does not appear to be biased. This assessment item is biased against students who struggle with mathematics. The assessment might offend people who view girls as having much broader interests than playing with dolls. The assessment item incorporates a discernible stereotype regarding socioeconomic status.

The assessment might offend people who view girls as having much broader interests than playing with dolls.

Which of the following is generally conceded to be a key component of formative assessment? The framework provided by a learning progression's building blocks A heavy emphasis on using students' classroom test results as a dominant factor in determining students' grades A teacher's exclusive reliance on the collection of data using constructed-response tests. Data obtained via standardized achievement tests

The framework provided by a learning progression's building blocks

Validity evidence can be collected from a number of sources. For​ instance, suppose that a mathematics test has been built by a school​ district's officials to help identify those​ middle-school students who are unlikely to pass a statewide​ 11th-grade high-school diploma test. The new test will routinely be given to the​ district's seventh-grade students. To secure evidence supporting the validity of this kind of predictive​ application, the new test will be administered to current​ seventh-graders, and the​ seventh-grade tests will also be given to the​ district's current​ eleventh-graders. This will permit the​ eleventh-graders' two sets of test results to be compared. Which best describes this source of validity​ evidence? Test content evidence The relationship of​ 11th-graders' performances on the two tests The internal structure of the​ seventh-grade test Students' responses observed during the two​ test-taking experiences

The relationship of​ 11th-graders' performances on the two tests

Which of the following statements best describes the relationship among the three sanctioned forms of reliability evidence? All three types of evidence are essentially equivalent. Est-retest reliability evidence is more important than either internal-consistency evidence or alternate-form evidence. The three forms of evidence represent fundamentally different ways of representing a test's consistency. The three forms of evidence differ in their significance because internal-consistency evidence of a test's reliability is a necessary condition for the other two types of consistency.

The three forms of evidence represent fundamentally different ways of representing a test's consistency.

The Elementary and Secondary Education Act contains various subsections referred to as​ "titles." Which of the following​ "titles" gets the most attention​ (and funding) from​ policymakers? Title III Title I Title IX Title II

Title I

Which of the following is not a traditional reason that teachers assess students? To determine instructional effectiveness To clarify instructional intentions To assign grades to students To monitor students' progress

To clarify instructional intentions

Which source of validity evidence should be of most interest to teachers when evaluating their own​ teacher-made tests? Validity evidence based on the internal structure of the​ teacher-made tests Validity evidence based on response processes of​ students' taking the tests Validity evidence based on test content Validity evidence based on relationships between​ students' scores on​ teacher-made tests and those​ students' performances on other variables

Validity evidence based on test content

What is the chief function of validity evidence when employed to confirm the accuracy of​ score-based interpretations about​ test-takers' status in relation to specific uses of an educational​ test? To explore the full range of potential uses for educational tests originally built to support one​ or, at​ most, two specific uses of​ test-takers' results To confirm rival hypotheses that might legitimately challenge a proposed interpretation of a​ test-taker's performance To support relevant propositions in a validity argument​ that's marshaled to determine the defensibility of certain​ score-based interpretations To underscore the importance of consequential validity​ that's intended to verify that the particular uses for​ test-takers' scores are legitimate

To support relevant propositions in a validity argument​ that's marshaled to determine the defensibility of certain​ score-based interpretations

Which of the following is not one of the three types of reliability​ evidence? Internal consistency Validity Alternative form Test-retest

Validity

________is the most significant concept in assessment.

Validity

A district chooses a commercial test to provide information about the social studies skills and knowledge that the students seem to be having difficulty in mastering. A relatively elaborate series of alignment studies will be carried out early in the school year in an attempt to provide validity evidence to confirm this instructionally supportive usage. Which source of validity evidence will a person supervising the alignment studies rely​ on? Validity evidence based on internal structure of the social studies test Validity evidence based on test content Validity evidence based on response processes Validity evidence based on relationships between​ students' test scores and other variables

Validity evidence based on test content

Which of the following descriptions of validity is most accurate? Validity refers to the consistency with which a test measures whatever it is measuring. Validity describes the degree to which a test's usage leads to appropriate consequences for students. Validity refers to the accuracy of score-based interpretations for specific purposes. Validity describes the legitimacy of the decision to which a test-based inference will be put.

Validity refers to the accuracy of score-based interpretations for specific purposes.

Which of the following statements most accurately reflects the relationship between students' aptitude and their achievement? Actually, achievement is little more than an operationalization of aptitude. Whereas aptitude tends to reflect potential, achievement tends to reflect prior learning. Both aptitude and achievement are equivalent to a traditional conception of intelligence. The level of a student's aptitude can never exceed the level of the student's achievement.

Whereas aptitude tends to reflect potential, achievement tends to reflect prior learning.

The National Assessment of Educational Progress (NAEP): sets forth prescribed assessment frameworks at three grade levels, such frameworks to be followed by the 50 states in framing their own state content standards. was first established in 1996 by the U.S. Congress to serve as an accountability oriented nation's "report card." assesses national samples of U.S. students at three grade levels every few years in certain academic subjects. is a mandatory examination in reading and mathematics that must be taken annually by a sample of students in each state.

assesses national samples of U.S. students at three grade levels every few years in certain academic subjects.

A prominent procedure to minimize assessment bias for students with disabilities is to employ​ ________________. individualized education programs classroom modifications positive behavior supports assessment accommodations

assessment accommodations

Reliability refers to the​ ___________________. consistency of the test scores relevancy of the test scores usefulness of the test scores None of the provided answer choices

consistency of the test scores

According to Popham, which phrases below are included in the definition of educational assessment? (multiple answers) administered, scored, and interpreted determine students' status requires a student's response from "scratch" variables of interest formal attempt

determine students' status variables of interest formal attempt

Proper formative assessment is conducted​ ___________________. during the instructional process at the conclusion of the instructional process prior to the beginning of the instructional process when summative assessment does not provide clear evaluative information

during the instructional process

Decisions linked to classroom assessments should be made​ _____________. after thorough review in advance during initial review of the exam at the conclusion of the assessment

in advance

Public Law​ 94-142 installed the use of an ​ ___________________ to outline the educational processes for students with disabilities. individualized education plan assessment monitoring protocol accommodating education plan individualized education program

individualized education program

When educators collect​ test-based evidence to inform decisions about already completed instructional activities they are engaging in a basic form of​ _____________________. formative assessment instructional assessment standardized assessment summative assessment

summative assessment

Formative assessment is considered to be an assessment strategy as well as an instructional strategy. True False

true


Related study sets

cell bio section 3 practice/quiz questions

View Set

Diesel Engines: Lesson D 5: Timing Geartrains, Camshafts, Tappets, Rockers, and Cylinder Valves

View Set

Globalization Trends - Quick Check

View Set