Classroom Assessment Exam
Which of the following is an important issue to consider when you accommodate students with disabilities?
All of them are right
What does a percentile rank of 22 mean?
22% of the group tested scored below the student.
Alex obtained a raw score of 72 in an algebra test. The standard error of measurement for the test is 4.0. What is the 68% uncertainty interval for Alex's score on the test?
68 through 76
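The arithmetic behind this answer can be sketched as follows. This is a minimal illustration, assuming the usual convention that a 68% uncertainty interval spans one standard error of measurement (SEM) on either side of the observed score; the function name `uncertainty_interval` is hypothetical, not from the source.

```python
def uncertainty_interval(observed_score, sem, n_sems=1.0):
    """Return (low, high): the observed score minus/plus n_sems SEMs.

    Assumption: n_sems=1 corresponds to roughly a 68% interval.
    """
    margin = n_sems * sem
    return observed_score - margin, observed_score + margin


# Alex: raw score 72, SEM 4.0
low, high = uncertainty_interval(72, 4.0)
print(f"68% interval: {low:g} through {high:g}")
```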
Anthony obtained a score of 75 on a test which has a standard deviation of 5 and a mean of 70. What is his SS-score?
60 (z = (75 - 70) / 5 = 1, and SS = 50 + 10z)
Components          Term paper   Exam 1   Exam 2   Total
Teacher's weight    40%          30%      30%      100%
Alva's score        45           40       35       120
Maximum score       50           50       50       150
Refer to the table above. Suppose a total points method is used to create a composite grade. What is Alva's weighted composite grade on the 0-100 percentage scale?
80%
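A small sketch of the calculation, using the scores from the table. It assumes the total points method simply divides points earned by points possible. The second function, which applies the teacher's 40/30/30 weights to each component's percentage, is included only for contrast and is not what the question asks for. Both function names are hypothetical.

```python
scores = {"Term paper": 45, "Exam 1": 40, "Exam 2": 35}
maximums = {"Term paper": 50, "Exam 1": 50, "Exam 2": 50}
weights = {"Term paper": 0.40, "Exam 1": 0.30, "Exam 2": 0.30}


def total_points_percentage(scores, maximums):
    """Total points method: points earned over points possible, as a percent."""
    return 100 * sum(scores.values()) / sum(maximums.values())


def weighted_percentage(scores, maximums, weights):
    """Contrast case: weight each component's percent-correct, then add."""
    return 100 * sum(weights[c] * scores[c] / maximums[c] for c in scores)


print(total_points_percentage(scores, maximums))            # 120 / 150 as a percent
print(round(weighted_percentage(scores, maximums, weights), 1))
```

Because all three components here have the same maximum (50 points), the total points method effectively weights them equally, which is why its result differs slightly from the explicitly weighted figure.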
A teacher had students switch papers and grade each other's quizzes. Then she called roll and entered each student's quiz grade in the gradebook program as the student called it out. What principle has been violated in this scenario?
Students' right to privacy.
What is a good source of reading passages for a reading comprehension test in third grade?
A third-grade level book that students have not read
Which of the following learning objectives can NOT be assessed using experiment-interpretation items?
Ability to carry out an experiment in the lab and explain the results.
A parent noticed that the teacher made an error in grading his child's exam. He went to the teacher the next day to complain. What should the teacher's reaction be?
Accept the parent's discovery and rescore the paper immediately.
A teacher of geography was interested in categorizing his students as masters and non-masters of what they were taught. He constructed five essay items to match what he taught the class and administered the items. What factor(s) could account for errors in his classification of students as masters and non-masters?
All of the above
After the day's lesson, a science teacher asked students to write on a 3x5 card one question they felt like they still needed to have answered in order to understand the water cycle. How might the teacher best use the information on the cards?
All of the above
For which of the following purposes can a teacher use standardized achievement tests?
All of the above
In general, using the results of a single assessment for purposes of grading a student is less valid than using multiple assessment results because any single assessment
All of the above
When planning a semester's instruction for a class, which of the following should be used?
All of the above
When you construct an analytic scoring rubric for essay-type items, you should specify
All of the above
Which of the following abilities can be assessed by fill-in-the-blank items?
All of the above
Which of the following decisions requires educators to use quality assessment information?
All of the above
Which of the following should you prepare before a parent conference begins?
All of the above
Which of the following is NOT a potential source of feedback for students?
All of the above are potential sources of feedback.
Which of the following are useful tools for supporting peer feedback?
All of the above are useful tools for supporting peer feedback.
The validity of grades depends on the
All of the above.
Which of the following statements is characteristic of the best answer item?
All options have some degree of correctness.
What is a context-dependent item?
An item that is based on some information or material that precedes it
Which of the following would most likely ensure that an essay test emphasizes the proper learning objectives?
Ask a colleague to review the items in light of the curriculum
A social studies teacher wanted to determine how reliably she had scored her students' research reports. Which of the following procedures would be BEST for her to use?
Ask another social studies teacher to score the reports and correlate the two sets of scores
At the beginning of the class period, students receive feedback on their first drafts of a graphic novel project. Which strategy is likely to be most effective for improvement?
Ask students to look at their feedback and make revisions as part of the same lesson.
Which of the following is the best way to assess the concept "bicycle"?
Ask the student to identify the bicycles among pictures of a number of different objects.
Among the following, which is the BEST way for a teacher to improve the validity of classroom assessment results?
Assess students on the same learning outcome using several different procedures.
Which of the following does NOT increase the student-centeredness of a classroom assessment?
Assessing students with tasks that are closely aligned with learning objectives
Which of the following activities is NOT a responsible professional action on the part of the teacher whose students ranged widely in socioeconomic status?
Assigning an out-of-class project that required purchasing art supplies
A sixth-grade class was studying a certain kind of math problem, and they were going to have a test on Tuesday. Which of the following is an example of student self-regulation of learning?
Both A and B
The relevance of a learning unit assessment-planning chart is in its use in planning to
Both A and B
When should a teacher NOT use the multiple-choice format for classroom assessments?
Both A and B
What type of students benefit from feedback?
Both high and low achieving students
Providing timely feedback to students after assessing them is important because students need to
Both know their grades and correct their mistakes to improve
The table below shows the results (fraction of items correct) of a diagnostic test for five primary school students.
        Addition   Subtraction   Multiplication   Division
Anita   12/12      14/15         14/16            13/20
Brad    9/12       15/15         14/16            18/20
Cora    11/12      14/15         13/16            12/20
Doug    11/12      13/15         13/16            10/20
Refer to the table above. According to the profile of content strengths and weaknesses approach to diagnosis, which student has a weakness in addition?
Brad
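One way to see why Brad stands out is to convert each fraction to a proportion and find each student's lowest area. This is a rough sketch, assuming that a "weakness" in the content-profile sense means the area where a student's proportion correct is lowest relative to that student's other areas. The source does not state a specific decision rule, and `weakest_area` is a hypothetical helper.

```python
# (items correct, items given) per content area, copied from the table
results = {
    "Anita": {"Addition": (12, 12), "Subtraction": (14, 15),
              "Multiplication": (14, 16), "Division": (13, 20)},
    "Brad":  {"Addition": (9, 12), "Subtraction": (15, 15),
              "Multiplication": (14, 16), "Division": (18, 20)},
    "Cora":  {"Addition": (11, 12), "Subtraction": (14, 15),
              "Multiplication": (13, 16), "Division": (12, 20)},
    "Doug":  {"Addition": (11, 12), "Subtraction": (13, 15),
              "Multiplication": (13, 16), "Division": (10, 20)},
}


def weakest_area(profile):
    """Return the content area with the lowest proportion correct."""
    return min(profile, key=lambda area: profile[area][0] / profile[area][1])


for student, profile in results.items():
    print(student, "->", weakest_area(profile))

# Students whose lowest area is addition
weak_in_addition = [s for s, p in results.items() if weakest_area(p) == "Addition"]
print(weak_in_addition)
```

Under this assumption, Brad is the only student whose lowest proportion falls in addition (9/12 = 0.75); the other three are lowest in division.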
Which of the following grading approaches is most compatible with standards-based approaches to teaching?
Criterion-referenced grading
A state requires new nursing school graduates to pass a test before they can be officially recognized as nurses. For what type of decision is this test used?
Certification decision
Which of the following kinds of comments distract students from their learning objectives?
Comments about the student as a person
Which of the following will be the best technique for the summative assessment of students' work?
Comprehensive tests
An elementary school teacher decided to differentiate her writing instruction based on a pretest, in which she asked students to write a descriptive paragraph. She used the results to group students into two groups. One group would work on new writing skills and the other group would review writing skills already taught. Which of the following reliability concerns is most important in this scenario?
Consistency of classification
Which of the following is NOT a way that technology has changed writing assessment?
Content understanding is no longer as important
For the following, select the type of technology use illustrated by the question below. A college professor posts a video on her online course, then has students submit a one-paragraph response and take a quiz on the material, also on the online course site.
Creating new learning networks
For the following, select the type of grading illustrated by the question below. A teacher compares the student's performance with the learning objectives the student is expected to attain during the marking period. Grades are awarded according to how many of the learning objectives were mastered.
Criterion-referenced grading
Homework results are most useful for which of the following classroom decisions?
Deciding if students are ready to move on to a new activity
Which of the following situations involves a placement decision?
Deciding who should get into Honors English
For the following, select the type of technology use illustrated by the question below. Middle school classes in two different states are studying ecosystems. Students build terrariums, then post pictures and descriptions of their learning. Students ask each other questions about their posts.
Developing collaborative technologies
The table below shows the results (fraction of items correct) of a diagnostic test for five primary school students.
        Addition   Subtraction   Multiplication   Division
Anita   12/12      14/15         14/16            13/20
Brad    9/12       15/15         14/16            18/20
Cora    11/12      14/15         13/16            12/20
Doug    11/12      13/15         13/16            10/20
Refer to the table above. According to the mastery approach, what is Doug's weak area?
Division
Which of the following kinds of feedback is generally most effective for both computer-based and paper-pencil feedback?
Elaborative feedback
For what level students would extended response essay items be most problematic?
Elementary students
Which of the following formats best assesses higher-order thinking processes students use to solve problems?
Essay items
Which one of the following formats assesses the ability to interpret statements effectively?
Experiment-interpretation and statement-and-comment formats
A 50-item multiple-choice test on the addition of three-digit numbers was administered to a group of students. The KR20 reliability estimate will yield a higher coefficient than the split-halves procedure.
FALSE
A distractor is not functioning properly if no one in the upper group selected it.
FALSE
A multiple marking system reports students' progress in different curriculum areas but NOT in noncognitive areas.
FALSE
A single grade that combines both achievement and nonachievement factors is likely to be more valid than the one that reflects only achievement factors.
FALSE
A student who uses random guessing on a test comprised only of true-false items has a good chance of getting a good score.
FALSE
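A quick binomial calculation illustrates why blind guessing rarely produces a good score. The numbers here are assumptions chosen for illustration (a 20-item true-false test and a "good score" of at least 70% correct); neither appears in the source, and `prob_at_least` is a hypothetical helper.

```python
from math import comb


def prob_at_least(k_min, n_items, p_correct=0.5):
    """Probability of getting at least k_min of n_items right by guessing."""
    return sum(
        comb(n_items, k) * p_correct**k * (1 - p_correct) ** (n_items - k)
        for k in range(k_min, n_items + 1)
    )


# Assumed scenario: 20 true-false items, 14 or more correct counts as "good"
p = prob_at_least(14, 20)
print(f"Chance of 14+ correct out of 20 by guessing: {p:.3f}")
```

With these assumed numbers the chance is under 6%, and it shrinks further as the test gets longer. The expected score from guessing stays at 50%.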
A student's true score in a subject does not change with the test questions the teacher asks.
FALSE
A student's true score on a test is not related to the specific questions appearing on the test.
FALSE
A test is said to be standardized if it serves as a standard against which students' performance is compared.
FALSE
Below is a section of a norm-table for a fifth-grade standardized achievement test. The table shows raw scores, grade-equivalents (GE), and percentile ranks (PR). Assume the test matches the fifth-grade school curriculum, norms were developed in March, and the local fifth grade students were tested in March.
Raw     Vocabulary   Reading    Language   Arithmetic
Score   GE    PR     GE    PR   GE    PR   GE    PR
20      4.8   43     4.6   40   5.0   30   5.0   43
30      5.7   50     5.0   45   5.7   52   5.6   51
40      7.1   73     5.6   51   7.2   72   6.1   72
50      8.0   84     6.2   74   8.1   81   7.1   91
70      8.7   95     7.1   90   9.3   94   8.3   98
Refer to the chart above. Ernie's grade-equivalent scores in Language and Arithmetic are both equal to 5.0. Therefore, Ernie is equally able in these two subjects.
FALSE
Below is a section of a norm-table for a fifth-grade standardized achievement test. The table shows raw scores, grade-equivalents (GE), and percentile ranks (PR). Assume the test matches the fifth-grade school curriculum, norms were developed in March, and the local fifth grade students were tested in March.
Raw     Vocabulary   Reading    Language   Arithmetic
Score   GE    PR     GE    PR   GE    PR   GE    PR
20      4.8   43     4.6   40   5.0   30   5.0   43
30      5.7   50     5.0   45   5.7   52   5.6   51
40      7.1   73     5.6   51   7.2   72   6.1   72
50      8.0   84     6.2   74   8.1   81   7.1   91
70      8.7   95     7.1   90   9.3   94   8.3   98
Refer to the chart above. Sue's raw scores are Vocabulary = 30, Reading = 40, Language = 30, and Arithmetic = 30. This means she is more able in reading than in the other subjects.
FALSE
Even though different publishers of standardized achievement batteries use different techniques for creating developmental scales, the scales are quite closely equivalent.
FALSE
Experienced teachers seldom put much effort into preparing for parent-teacher conferences.
FALSE
Giving feedback in the form of a general letter grade on assessments of student writing is an example of integrating assessment and instruction.
FALSE
Grades should be used for formative assessment purposes.
FALSE
Half of a school's sixth graders have grade-equivalent scores below their grade placement. This means that instructional quality in this school is generally poor.
FALSE
Harry's grade-equivalent score is 3.3. This means that he has mastered three tenths of the third grade curriculum.
FALSE
If a student has a good command of a learning objective, you should expect that the student would answer a test item about that objective correctly even if the item were worded rather ambiguously.
FALSE
If you use the tests that come with the curriculum materials and textbooks you have adopted, you can usually be assured that their content will match what was covered in class.
FALSE
In classroom assessment, fill-in-the-blank items are scored more reliably than multiple-choice items.
FALSE
In classroom assessment, it is acceptable if a few test items are focused on minor points of content.
FALSE
In classroom assessment, it is of very little importance if students use their knowledge of flaws in the items to answer them correctly.
FALSE
It is appropriate to include on a test a few questions that cover material you had taught but which you did not tell students would appear on the test as a check on whether the better students have studied.
FALSE
Mr. Jared's mathematics performance task requires students to develop and explain two or more ways to solve a mathematics problem. Because students will give many different but correct solutions to the problem, Mr. Jared's scoring rubric needs only general statements about how the alternative solutions will be marked.
FALSE
One sound reason for keeping the length of an assessment procedure short is that the reliability of the students' scores is increased.
FALSE
One way to lower the reliability and validity of the results of questioning students during instruction is to supplement the results with test results.
FALSE
Other things being equal, an extended response essay is more likely to be reliably scored than a restricted response essay.
FALSE
Penalizing students who turn in assignments late by deducting points from the score increases the validity of the grade.
FALSE
Researchers have demonstrated that most teachers know how to evaluate the quality of test items without having to take special courses in assessment.
FALSE
Restricted response essay items can be used to assess only recall and comprehension of factual information.
FALSE
Small deviations from the instructions for administering a standardized achievement test are acceptable provided that they make test administration more practical than was suggested by the publisher.
FALSE
Test scores that have a high degree of reliability (e.g., .90) are also valid.
FALSE
The appropriate time to use the response categories "Right" and "Wrong" is when a true-false variety item asks a direct question.
FALSE
The best pre-instructional assessment is a pretest on a unit's learning objectives.
FALSE
The best way to evaluate a student's writing skill is to mark grammar and punctuation (conventions) and penmanship (presentation) on the students' first drafts.
FALSE
The best way to use writing standards and rubrics in the classroom to assess students is to use them primarily as summative evaluation tools.
FALSE
The content and thinking processes that are the focus of a best-works portfolio should be decided after all the entries are complete.
FALSE
The criteria for writing good reports are easily generalized to become criteria for constructing good multimedia reports.
FALSE
The directions on the cover page of a test booklet render any directions of a subsection of the test redundant or unnecessary.
FALSE
The legal issue of whether use of a test causes harm to a particular subgroup of the population focuses on technical aspects of testing.
FALSE
The primary advantage of using naturally occurring events to assess students is that they give all students the opportunity to demonstrate their ability to apply their learning in nearly identical natural settings.
FALSE
The validity of grades you assign remains the same regardless of which stakeholder uses them.
FALSE
The validity of the results from a restricted response essay item will tend to be lower than those from an extended response essay item.
FALSE
When crafting a completion item, it is best to use a statement taken verbatim from the textbook, delete an important word, and replace the word with a blank.
FALSE
When crafting completion items for elementary students, it is best to provide clues by making the length of the blanks proportional to the length of the omitted word.
FALSE
When deciding whether to use performance assessments the first thing you must consider is how many times you want to assess students during the marking period.
FALSE
You have no obligation to teach students test-taking skills that might raise their marks.
FALSE
You can use simple statistical analyses to identify the major types of errors students make in answering essay items.
FALSE
A diagnostic test of deficits in mathematics could also be used to assign students' grades in mathematics.
False
A good strategy for teaching students how to do self-assessment is to pass out a simple checklist and ask them to use it on one of their assignments.
False
A school expects teachers to report students' standing on certain dimensions such as science achievement, study habits, motivation to learn, and leadership skills. This means that teachers are expected to assign letter grades to students on each dimension.
False
Achievement tests that accompany curriculum materials are usually of higher quality than teacher-made items.
False
All standardized achievement tests are multilevel survey batteries.
False
Any performance activity by students in the classroom is a useful assessment task.
False
Authentic performance tasks align better to learning objectives than do paper-and-pencil tests.
False
Context-dependent assessment material should be arranged so that the items appear before the interpretive material.
False
Different standardized achievement test batteries overlap most schools' curricula to a very high degree.
False
Formative assessment results from quizzes, homework, and classroom activities should play a major role in determining a student's marking period grade.
False
Grading on the curve is compatible with self-referencing at the formative evaluation stages.
False
If you assess the same set of learning objectives with two different summative assessments, together they should have about twice as much weight in the students' grades as a set of learning objectives that was assessed only once.
False
If you use tests that come with your curriculum materials and textbooks you can usually be assured that you have very good quality test items.
False
If your school district has a poor grading policy, your best course of action is to ignore it and use a better system of grading.
False
In a certain elementary school, female students score significantly higher than male students on the state English Language Arts test every year. This is strong evidence for bias in the assessment.
False
In order to be flexible, good teachers seldom identify at the beginning of a marking period the types of assessment results that will be used for assigning grades.
False
It is a professional assessment responsibility of teachers to change their students' test results to reflect the teachers' belief about what the students are capable of doing.
False
It is acceptable to use two different publishers' national norms interchangeably because their large norm-samples overlap quite a bit.
False
It is better to use a single, but highly reliable, comprehensive assessment than to use a combination of several moderately reliable assessments covering the same material.
False
Requiring a high score to pass a prerequisite skills test is more important when teaching in a linear, hierarchical sequence than when teaching in a spiral sequence.
False
The analysis of knowledge structure approach to diagnosis requires the student to understand her own error patterns.
False
The argument-based approach to validation implies that rhetoric is just as important as evidence in demonstrating the validity of interpretations and uses of assessment results.
False
The parent of an adult college student has the legal right to obtain the student's academic record, as long as it is the parent who is paying the college tuition.
False
To assess problem-solving skills, you should present to the student a novel situation.
False
Well-written rubrics are all students need for self-evaluation of projects and other performance assessment.
False
You should report to parents your evaluation of students on all variables for which you have information.
False
Multilevel survey test publishers describe how well their products match local curriculum frameworks.
False
If a school uses Response to Intervention effectively, what result is expected?
Fewer students are placed in special education.
What type of assessment is most useful for planning for differentiated instruction?
Formative assessment
Which of these scoring practices is NOT responsible professional behavior?
Giving extra points on a project to students who showed a lot of effort
Which of the following actions is recommended as an ethical practice?
Giving students the scoring rubric for a performance assessment when the assignment is made
What is the most common testing accommodation for students with disabilities?
Giving the student extra time
Which of the following would you recommend to a history teacher who wants to assess students' knowledge of the order of events during the American Civil War?
Greater-same-less format
If you wanted to know how a specific fourth-grade teacher's class did on multiple-step mathematics problems compared to the average of all fourth-grade teachers in a particular school, which standardized test computer report would you consult?
Group item analysis (cluster) report
Which of the following standardized test computer reports would be most useful for an elementary school teacher to use when planning the mathematics teaching sequence for the upcoming year?
Group item analysis (cluster) report
Mr. Jones wanted his seventh-grade students to understand how to do argument-based writing effectively. Which of the following strategies would be the most effective?
Have students brainstorm criteria for good arguments from examples over a range of quality.
Your school district is looking for a standardized test to assist with program evaluation. What is the most important consideration for the test selection committee?
How closely the test specifications match the district's curriculum
Which of the following is NOT part of a learning unit assessment plan?
How this unit's assessment fits into the year's overall assessment strategy
In selecting published material to assess students in your classroom, which of the following is the most important criterion?
How well the tasks match what you taught
What does the trait voice mean in the context of writing assessment?
How well the writing shows the personality of the writer
Which of the following is a learning target for one lesson?
I can set up a long division problem so it is easy to do.
On which trait should you focus your assessment at the prewriting stage of the writing process?
Ideas
Which of the following would be the LEAST appropriate classroom use of standardized test results?
Identifying students' strengths and weaknesses
What is the most important reason you might want to use a completion variety of fill-in-the-blank item instead of the direct question variety?
If students might infer from a direct question that a long answer is required, use a completion item
College students in an Educational Psychology class were asked to participate in a research study about cooperative learning. Each student was asked to sign a release form giving the researchers permission to use data from observations, interviews, and assignments. This is an illustration of seeking
Informed Consent
In general, which of the following assessment alternatives would yield the most valid results for purposes of formative assessment of students?
Interviewing students individually
When empirical evidence shows differential item functioning for boys and girls of the same ability, the item is biased.
It cannot be determined.
What does a learning progression do?
It describes typical development of understanding of a particular concept or skill.
What is the effect when you use experiment-interpretation items to assess students' recall of a correct interpretation of experimental results?
It discourages them from learning skills required in interpreting experimental results.
What is a major problem associated with the content area strengths and weaknesses approach in diagnostic assessment?
It doesn't supply enough detail to support instructional planning.
Why is it considered unethical to give students practice on the same test items that they are going to be administered later?
It reduces the achievement inferences you can make from the test results
For which diagnostic approach would student think-alouds be most useful?
Knowledge structure analysis
Which of the following is a criterion-referenced interpretation?
Laura can spell ninety percent of the words in her spelling book.
When formative assessment strategies are used effectively, what is improved?
Learning and motivation
How does the textbook advise you to present the table and the list of responses in matrix items?
List of responses first, followed by the blank table
A social studies teacher wants to check the validity of her end-of-year report card grades. Which of these strategies would it be BEST for her to use?
Look for consistency in performance across several different assessments throughout the year.
When crafting best answer items, which of the following errors should the teacher guard against?
Making the best answer the longest option
A civics teacher assesses students by presenting them with a problem. The students must tell what type of problem it is and what other civics problems are like it, identify aspects of the problem that make it difficult to solve, and come up with at least two ways to solve the problem.
Modeling the problem, identifying obstacles, and describing multiple strategies
What type of item requires students to select the correct answer from a number of possible responses?
Multiple choice
What is one clear advantage of using multiple true-false items over traditional multiple-choice items?
Multiple true-false items have higher reliability than comparable regular multiple-choice items.
A school superintendent released a data file containing students' state test scores (without identification) for a research study. Did the superintendent violate students' privacy?
NO
Can a school send students' records to another school when a student moves, without the permission of the parents?
NO
Ms. Jones' students keep all of their week's seatwork and homework assignments in a folder. At the end of the week they bring it home. Is Ms. Jones using a portfolio assessment method?
NO
Which of the following tasks is the least structured?
NOT Asking students to create a word problem that requires division to solve and explaining why it does Presenting a math word problem requiring division, and asking students to solve it, show their work, and explain their reasoning
Which of the following projects is most likely to encourage plagiarism?
NOT A project where students work together to make potato clocks and describe how they work a term paper investigating different proposals about tax reform and the positions behind them
Refer to the Item Analysis Information Chart above. What is the item discrimination index for Item 1?
NOT .40
Refer to the Item Analysis Information Chart above. What is the difficulty index for Item 2?
NOT .52 NOT .4
As part of their science grade, students watered lima bean seeds and watched them grow. Each day they recorded the length of the sprout and took a picture. After two weeks, they organized their data into a report describing how lima bean seeds germinate. What type of assessment task is this?
NOT A structured on demand performance a demonstration
The table below shows the results (fraction of items correct) of a diagnostic test for five primary school students.
        Addition   Subtraction   Multiplication   Division
Anita   12/12      14/15         14/16            13/20
Brad    9/12       15/15         14/16            18/20
Cora    11/12      14/15         13/16            12/20
Doug    11/12      13/15         13/16            10/20
Refer to the table above. According to the mastery approach, what is Brad's weak area?
NOT Addition
A perfectly reliable test is
NOT All of the above
What does scaffolding mean in performance assessment?
NOT Alternate strategies students use to complete a task
How are extended response essay items best used in the classroom?
NOT As a one-class-period, on-demand assessment
When should you tell students about the weights each assessment will count toward their final grade?
NOT At the beginning of the year
A teacher used the following scales to mark each of the components going into the marking period grade in social studies: each homework (0-10), each quiz (0-20), one map drawing (1-6), end-of-unit test (0-100). What is the first thing the teacher should do before she puts them together for the report period grade?
NOT Average each kind of score (homework, quiz, etc.) and then combine the result
The table below shows the results (fraction of items correct) of a diagnostic test for five primary school students.
        Addition   Subtraction   Multiplication   Division
Anita   12/12      14/15         14/16            13/20
Brad    9/12       15/15         14/16            18/20
Cora    11/12      14/15         13/16            12/20
Doug    11/12      13/15         13/16            10/20
Refer to the table above. According to the mastery approach to diagnosis, which student has a weakness in addition?
NOT Brad
Which of the following comments is an example of self-referenced feedback?
NOT Can you find and fix three spelling mistakes?
An examiner wants to determine whether an assessment task (test item) functions differentially for females and males. How should the examiner proceed?
NOT Compare total-test performance for females and males and use judgment to decide whether the task in question contributed to any differential functioning. NOT Determine if the population of females scores higher or lower than males on the task.
Which of the following methods of reporting student progress is the most responsive to parents' questions?
NOT Detailed checklist of objectives mastered
You teach seventh grade social studies. You would like your students to give each other peer feedback on the first drafts of their reports on the American Revolution. Which of the following lessons is most likely to prepare students to do this effectively?
NOT Develop a checklist of peer editing strategies and have students role-play good and poor examples of each.
What are commonly recommended item difficulty and discrimination indices for a norm-referenced test?
NOT Difficulty indices between .16 and .84, discrimination indices greater than .30
The table below shows the results (fraction of items correct) of a diagnostic test for five primary school students.
        Addition   Subtraction   Multiplication   Division
Anita   12/12      14/15         14/16            13/20
Brad    9/12       15/15         14/16            18/20
Cora    11/12      14/15         13/16            12/20
Doug    11/12      13/15         13/16            10/20
Refer to the table above. According to the mastery approach to diagnosis, which student has a weakness in subtraction?
NOT Doug
Which of the following is NOT an important quality dimension for designing performance assessments?
NOT Effective Communication
Students should be assessed by requiring them to reproduce what was taught in class.
NOT False
Which of the following actions violate(s) the teacher professional responsibility in assessment?
NOT Giving students information about an upcoming assessment NOT Both A and B are violations of professional responsibility.
What does a negatively discriminating item do on a classroom test?
NOT Indicates an item that is too hard NOT TOO EASY
Which of the following characteristics should be possessed by a good analytic scoring rubric for writing skills?
NOT It should be focused on one specific task that involves writing.
Refer to the Item Analysis Information Chart above. Which is the easiest item?
NOT Item 1
Refer to the Item Analysis Information Chart above. Which is the poorest functioning item?
NOT Item 2
Refer to the Item Analysis Information Chart above. Which item discriminates best?
NOT Item 2
Refer to the Item Analysis Information Chart above. Which item contains ambiguous alternatives?
NOT Item 2 NOT ITEM 1
You have interpreted the item analysis results for your unit test. You notice there seems to be one mis-keyed item, several items that were more difficult than anticipated, and a couple of negatively discriminating items. What should your immediate next step be?
NOT Make plans to reteach the material covered by the difficult items.
Which of the following is an advantage of the matching set over multiple-choice items?
NOT Matching sets measure more complex learning outcomes.
Which of the following could be included in a sixth-grade student's electronic portfolio for a unit on water in science class?
NOT Only A and B
Refer to the Item Analysis Information Chart above. Which of the following is the poorest functioning distractor?
NOT Option D of Item 1
Which of the following measures is most likely to be used for RTI progress monitoring in an elementary mathematics class?
NOT Proficiency level on the state mathematics test
Which of the following is most important when deciding on the weights of different components in a report card grade?
NOT Reliability of scoring an assessment
What is the effect of rater drift on essay scores?
NOT Scores from different raters move closer together
A middle school social studies class did individual reports on the contributions of different explorers to European expansion into the New World. Which of these scenarios uses timing most effectively?
NOT Students get feedback on a first draft of their final reports.
Which of the following comments is an example of cognitive feedback?
NOT Tell me what you were thinking when you wrote this.
A state wants to have a standardized test that specifically assesses the standards the state has adopted. What is the most likely way this test would be developed?
NOT The state would develop it in-house using the staff it hires
A teacher used the following assessment tasks while teaching a unit. Choose the correct weight for the assessment tasks based on the description below. An in-depth essay question covering 1 of the 8 unit learning objectives and marked with a well-defined scoring rubric vs. A test comprised of short-answer and completion items covering 7 of the 8 unit learning objectives
NOT The two tasks should have equal weight in determining the unit grade
Why do most current textbooks recommend that teachers NOT use norm-referenced grading?
NOT There is a big ceiling effect in norm-referenced grading.
What does "a balanced assessment system" look like?
NOT There is the same amount of formative assessment as summative assessment.
Which of the following factors is MOST important in deciding how many tasks to include on a summative classroom performance assessment?
NOT Time at the teacher's disposal for administering the assessment
What is the purpose of best works portfolio?
NOT To monitor learning and show development
You are writing pairs of "true" and "false" items. For one item, you can only write a "true" item. You can't figure out a way to write a good "false" item except to insert a "not" before the verb. What should you do?
NOT Use the "true" version.
Which of the following factors is MOST likely to lower the reliability of students' essay scores in an exam?
NOT Using rubrics for scoring
Which of the following kinds of concepts is NOT based on a definition?
NOT abstract concept
Who is supposed to be "distracted" by distractors?
NOT both high and low scoring students
Criterion-referenced grading bases grades on
NOT comparing a student's achievement with other students in the class
When your assessment poses a few tasks to students, and you want to draw conclusions about how the student would do over the whole domain of possible tasks, which type of reliability is the most important consideration?
NOT consistency over time/testing occasion
The validity of a student's grades is a matter of his or her
NOT consistency.
The general approach to problem-solving strategies is almost always more powerful than approaches based on specific domains.
NOT false
What is the problem with putting "blanks" at the beginning of a completion item?
NOT increases the difficulty of the item
A teacher has achievement test results showing that a student is only partially mastering skills in reading. The student has a borderline "A/B" grade in English. Giving the student the lower grade would be
NOT invalid and educationally ill-advised.
Which of the following grading approaches compares students to expectations for their own progress?
NOT norm referenced grading
You should use caution in interpreting scores from a standardized multilevel achievement battery because your students may
NOT not match the national norm group. NOT not match the local norm group.
On which trait should you focus your assessment at the editing stage of the writing process?
NOT organization
Which of the following grading approaches is most compatible with the theory that intelligence is fixed?
NOT self-referenced grading
Grading on the curve is a method that
NOT shows which students have mastered your report period learning objectives
The most discriminating true-false items tend to be those
NOT that have the word NOT in them.
Are the grade-equivalents scores you obtain from the reading test of one publisher pretty much the same as you would obtain from the reading tests of other publishers?
No
Are the national norms provided by different standardized achievement test publishers essentially equivalent?
No
Do students with disabilities have to pay for accommodation provided for their assessment?
No
In a certain school district, the policy is to assign grades according to how well a student achieves the state's standards. A teacher gave Sophia a slightly higher grade even though she didn't meet all the standards, because she really tried very hard. Has the teacher acted in a manner consistent with the school policy?
No
Marlon continually picked fights with his classmates, called out in class, and interfered with instruction. Would Marlon's teacher be justified in failing him?
No, because achievement should be measured separately from behavior
A student reported being bored in class. She and her parents requested that she be considered for the gifted and talented program in the school. The school psychologist administered an intelligence test and, based upon the score, informed her parents that she did not qualify to get into the gifted and talented program. Is this an example of responsible use of assessment for decision-making?
No, because the pupil should have been given a number of assessments before classification
Dan's parents blamed his poor standardized test performance on the teacher. Is it appropriate to hold the teacher responsible for Dan's poor standardized test performance?
No, because the teacher is only one of a number of factors that influence Dan's learning
Which of the following activities is (are) OPTIONAL for a teacher when preparing to administer a standardized achievement test?
None of the above is optional.
Which of the following conditions is sufficient for the validation of a classroom test in Social Studies?
None of the above is sufficient.
For the following, select the type of grading illustrated by the question below. A teacher compares the performance of each student with the performances of other students in the class. Grades are awarded according to the student's rank relative to others.
Norm-referenced grading
On an arithmetic test taken by a class of 32 pupils, Rowan got 90. How well did Rowan perform?
Not enough information is given to answer this question.
Which of the following types of scores would be best to use to explain to parents how well a student has achieved the learning objectives in a particular curriculum area?
PR in the norm group on the test
What is the percentile rank of a student whose T-score is 50?
PR= 50
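The arithmetic behind this answer can be sketched as follows. This is a minimal illustration, assuming T-scores are scaled to a mean of 50 and a standard deviation of 10 and that scores are normally distributed; the function name is hypothetical.

```python
from statistics import NormalDist


def t_score_to_percentile_rank(t_score: float) -> float:
    """Convert a T-score to a percentile rank.

    Assumes T-scores have mean 50 and SD 10, and that the
    underlying score distribution is normal.
    """
    z = (t_score - 50) / 10          # T-score -> z-score
    return 100 * NormalDist().cdf(z)  # area below z, as a percentage


print(round(t_score_to_percentile_rank(50)))  # 50
```

A T-score of 50 sits exactly at the mean, so half of a normal distribution falls below it.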
Which of the following methods of reporting student progress is likely to be most reliable?
Percentage grades (99, 98, 97, etc.)
Which type of score is recommended for charting student achievement over time?
Percentile Ranks (PR)
What type of scores would be best to use to explain to parents a child's growth in learning in a particular curriculum area?
Percentile ranks on tests in the area over the past few years
Which of the following would be an effective formative assessment technique for a class doing individual seatwork in mathematics?
Red light/green light teacher alerts
A teacher wants to use a norm-referencing framework to interpret Burt's standardized test score. Burt is a fifth-grader in a private school in a high socioeconomic status community in the Southeast United States. Which of the following norm groups should the teacher give the first priority for interpreting Burt's score?
Regional norms for all schools in the Southeast
Joseph took a standardized, norm-referenced achievement test with the accommodations of extra time and the use of a Spanish-English dictionary. When you report his results, you should
Report his norm-referenced scores and explain the accommodations used
Which of the following strategies is LEAST appropriate for assessing knowledge of "personification"?
Require students to write a definition of personification.
What is the most important reason to prefer tests developed using empirical research over tests that were not developed using empirical research?
Research-based tests are of higher quality.
What is a unique disadvantage of the greater-same-less item format?
Restricts assessment to pairs of relationships
An eighth grade Social Studies class had to write an analysis of Lincoln's Gettysburg Address. A low achieving middle school student turned in an assignment that was not acceptable. Which of the following feedback is most likely to help the student?
Return the paper with one specific suggestion for improvement and ask the student to revise.
Reviewing the learning objectives, students' abilities, and past performance
Review students' class assignments and homework results against the intended learning outcomes.
Which of the following set of assessment activities will best facilitate a teacher's instructional planning?
Reviewing the learning objectives, students' abilities, and past performance
When writing multiple-choice items, you discover that several items have the same options. What is the best thing to do?
Rewrite the items as a matching exercise.
Suppose correctness of spelling is not part of the learning objective being assessed with an essay test. How should spelling be handled when scoring the essays?
Score spelling separately but give it a weight of zero in the students' total score.
A teacher administers an in-class test containing five restricted response essays. How would you advise her to score the tests?
Score the same essay for all students before moving on to the next essay.
What is the most likely source of measurement error in a teacher-made essay test?
Scorer (grader) inconsistency
Which of the two methods of scoring tabular items provides a more reliable result?
Scoring each cell according to the correctness of the elements in it
For the following, select the type of grading illustrated by the question below. A teacher compares a student's achievement during the marking period with the learning ability she thinks the student has. The grades are awarded according to how well the student has lived up to his capacity.
Self-referenced grading
Ms. Barnes observed her band students playing the school song in order to determine what the students needed to learn to play it better. Which of the following best describes what Ms. Barnes did?
She assessed her students' performance.
Which of the following assessment formats would likely yield the most valid results for use in a pre-instructional assessment procedure?
Short answer items
Which of the following is most likely to help students learn to use feedback to improve?
Show students some examples of how other students revised their work.
Which of the following assessment alternatives yields results that have the lowest validity for assigning summative grades to students?
Standardized tests
Which of the following is the BEST source of good general learning goals for a particular unit of instruction?
State standards for student achievement
Which part of a multiple-choice item should present a problem to be solved?
Stem
What action should a teacher take when, during seatwork, a primary student turns up the "sad face" on her desk?
Stop at the desk and offer assistance.
If you wanted to know the confidence interval around a student's score in each area of an assessment battery, which standardized test computer report would you consult?
Student (home) report
For which of the following assessment methods is the validity of results most affected by individual differences among teachers?
Student classroom discussions
States are said to use a standards-referenced framework when they align state test items with state standards and set the performance levels that qualify as "basic," "proficient," and so on. Why is this NOT simply an example of "criterion-referencing"?
Student performance data is used in the standard setting process
To whom is the teacher's primary responsibility when using assessments and making educational decisions?
Students and their learning
Which of the following is a high-stakes use of assessment information for students?
Students must pass a test in order to graduate from high school.
Which of the following is a general learning goal?
Students should be able to measure things accurately.
What, if anything, is wrong with the following item? "The instrument that measures barometric pressure is called a ________."
Students who do not know the content will be clued by the wording of the item.
What do true-false items assess?
Students' ability to judge the correctness of verbal propositions
Mr. Green wanted to give his students choices about which essays they would answer on his unit test. He wanted to have them choose three out of five questions to answer. You want to discourage him from this plan. What reason should you give him for not allowing this kind of student choice on his exam?
Students' scores on the test will not all mean the same thing.
What information does diagnostic assessment seek to acquire?
Students' specific strengths and weaknesses
For what kind of subject matter is the prerequisite knowledge and skills approach most appropriate?
Subject areas with established learning trajectories
Which of the following is a recommended component of effective feedback?
Suggestions for next steps in learning
For the following, select the type of technology use illustrated by the question below. A high school uses scanning and scoring software to score final examinations and report the results.
Supporting routine assessment tasks
The correlation between scores on Math Test A and mathematics grades is -.90. The correlation between scores on Math Test B and mathematics grades is +.85. Which of the two test scores will give a better prediction of a person's mathematics grades?
TEST A
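The reasoning here is that predictive strength depends on the magnitude of the correlation, not its sign. A minimal sketch, using the two correlations from the question and a hypothetical helper name:

```python
def variance_explained(r: float) -> float:
    """Proportion of variance in the criterion accounted for by the predictor."""
    return r ** 2


test_a = variance_explained(-0.90)  # Test A: r = -.90
test_b = variance_explained(0.85)   # Test B: r = +.85

# The sign drops out when squaring, so the larger |r| predicts better.
better = "Test A" if test_a > test_b else "Test B"
print(better)
```

Squaring gives about .81 for Test A and about .72 for Test B, so Test A is the stronger predictor even though its correlation is negative.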
Test A and Test B each have the same value for their standard deviations, 9.3. The reliability coefficients of the two tests are .77 and .92, respectively. Which, if any, of the two tests has the smaller standard error of measurement (SEM)?
TEST B
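This answer follows from the usual classical test theory formula, SEM = SD × sqrt(1 − reliability). A short sketch with the values from the question (the function name is an illustrative assumption):

```python
import math


def standard_error_of_measurement(sd: float, reliability: float) -> float:
    """Classical test theory SEM: SD * sqrt(1 - reliability)."""
    return sd * math.sqrt(1 - reliability)


sem_a = standard_error_of_measurement(9.3, 0.77)  # Test A
sem_b = standard_error_of_measurement(9.3, 0.92)  # Test B

print(round(sem_a, 2), round(sem_b, 2))
```

With equal standard deviations, the test with the higher reliability coefficient (Test B) has the smaller SEM.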
A major problem with assessing students using portfolios is that two teachers often assign quite different grades to the same student's portfolio.
TRUE
A poorly functioning multiple-choice item can often be rewritten into two or more good true-false items.
TRUE
A table of specifications should include information on the weights to assign to different parts of the assessment.
TRUE
A teacher who wants to assess students' general writing skills should use extended response writing prompts rather than restricted response writing prompts.
TRUE
A test-retest reliability coefficient measures consistency of scores over time.
TRUE
Assigning students' grades helps schools to be accountable for student learning.
TRUE
Before you teach a unit, it is useful to understand students' current thinking about the concepts you intend to teach.
TRUE
Classroom tests should be maximum performance assessment procedures.
TRUE
Even if a teacher disagrees with using standardized tests, the teacher must explain to parents the meaning of scores (percentiles, grade-equivalents) that the tests provide.
TRUE
Grades do not motivate those students who believe the attainment of important learning objectives is beyond their ability.
TRUE
If your students differ greatly in their test-wiseness skills, the validity of their scores on your classroom assessment results is likely to be lower than if they all had equal test-wiseness skills.
TRUE
In crafting a summative classroom assessment, it is appropriate to have the number of points for each task be proportional to the amount of time you spent on teaching each topic.
TRUE
In general, combining the results from several assessments of a student yields scores of higher validity than using the result of only one assessment.
TRUE
In the development of assessment blueprints, the number of learning objectives in each topical area helps determine the weighting given to the particular area in the assessment.
TRUE
It is acceptable if a student can answer a test item correctly by using his or her partial knowledge of the learning objective.
TRUE
One major disadvantage of performance assessment procedures is that they are more time-consuming for the teacher and the student.
TRUE
Standardized achievement tests provide useful information that may be part of a study of whether a local district needs to change its curriculum.
TRUE
Students are likely to study specific details when preparing for an essay test if they know the teacher will require knowing them.
TRUE
Students benefit from changing their answers to multiple-choice items when the changes are based on thoughtful reconsideration of the item.
TRUE
Students think multiple true-false items do a better job of assessing knowledge than do comparable traditional multiple-choice items.
TRUE
The depth of students' learning depends both on how students are taught and on how they are assessed.
TRUE
The letters assigned as students' grades are supposed to represent the levels of quality of students' performance.
TRUE
The primary value of true-false items is to assess a student's ability to judge the correctness of verbal propositions.
TRUE
When evaluating the quality of a classroom test item, you should give more weight to its discrimination index than to its difficulty index.
TRUE
When grades are used to compare students, they are unlikely to motivate students.
TRUE
When teaching students to write, it is entirely appropriate to use only the ideas and organization rubrics to evaluate students' first drafts.
TRUE
You are likely to confuse students when the grade you assign combines your evaluation of the students' achievement and their classroom deportment.
TRUE
You should always start a parent conference by reviewing the agenda you prepared ahead of time.
TRUE
You will improve the reliability of your students' fill-in-the-blank test scores if you use an answer key to score their responses.
TRUE
You would generally expect to increase the validity of your students' scores on your completion items if you changed them into direct questions.
TRUE
What information is specified in a learning objective?
Tasks students can do after instruction
Which of the following is an inappropriate way of using assessments to motivate students?
Telling students the upcoming test will be very hard
Three multiple-choice tests were constructed to cover the same curriculum area. Test A had 20 items, Test B had 40 items, and Test C had 60 items. What is the most accurate statement that can be made about the reliability of these tests?
Test C will have the highest reliability and Test A will have the lowest reliability.
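One common way to illustrate why longer tests tend to be more reliable is the Spearman-Brown prophecy formula. The sketch below assumes the added items are comparable in quality to the originals, and the starting reliability of 0.60 for the 20-item test is a hypothetical value, not one given in the question.

```python
def spearman_brown(reliability: float, length_factor: float) -> float:
    """Predicted reliability when test length is multiplied by length_factor."""
    return (length_factor * reliability) / (1 + (length_factor - 1) * reliability)


base_reliability = 0.60  # hypothetical reliability of the 20-item Test A

for name, n_items in [("Test A", 20), ("Test B", 40), ("Test C", 60)]:
    k = n_items / 20  # length relative to Test A
    print(name, n_items, round(spearman_brown(base_reliability, k), 3))
```

Under these assumptions the predicted reliability rises with each increase in length, which matches the ordering in the answer.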
For the following, select the type of reliability estimate described in the assessment data collection statements below. Students were assessed on Monday and on Friday in the same week. The test was the same each time. The teacher wanted to find out whether students' relative ranking in the group remained the same.
Test-retest reliability estimation
Which of the following decision-making situations is the LEAST high-stakes?
Testing for placement in a sequence of mathematics courses
What must be TRUE about a masterlist item set in order for it to assess analytical reasoning?
The examples in the stems must be new to the students.
A teacher used the following assessment tasks while teaching a unit. Choose the correct weight for the assessment tasks based on the description below. An essay question requiring students to evaluate a new situation by using criteria taught in class vs. An essay question requiring students to recall explanations and reasons that were taught in class.
The first task should have more weight in determining the unit grade.
Which of the following is the most discernible advantage of the greater-less-same format item type over standard multiple-choice items?
The greater-less-same format is relatively easy to craft.
When a test undergoes a validation process, what is "validated"?
The interpretation and use of test results
What is the chief advantage of using local norms for interpreting standardized test scores?
The local norm group includes students with similar educational backgrounds and abilities
What is wrong with this MAZE item? Ms. Smith went to the grocery store. She took her baby with her. The baby ________. A. cried. B. giggled. C. screamed.
The passage does not contain all the information needed to answer correctly.
What is missing from the following statement of a specific learning objective? "The student should understand the functions of the parts of a flower."
The performance
Which of the following statements best illustrates the concept evaluation?
The performance of the school's students on the ACT exam was excellent.
Which of the following is the major difference between the original Bloom's taxonomy and its revision in 2001?
The revision separates learning into two dimensions; the original used one dimension.
A teacher used the following assessment tasks while teaching a unit. Choose the correct weight for the assessment tasks based on the description below. Quizzes, the content of which assessed 3 of the 6 unit learning objectives, vs. A performance assessment that assessed 5 of the 6 unit learning objectives
The second task should have more weight in determining the unit grade
A teacher used the following assessment tasks while teaching a unit. Choose the correct weight for the assessment tasks based on the description below. Student essays that were marked without using a scoring rubric vs. Student compositions that were marked with a well-defined scoring rubric
The second task should have more weight in determining the unit grade.
Which of the following is it LEAST important for you to specify in an essay item intended to assess students' ability to evaluate a debate in which the debaters express their opinions?
The side you expect students to take in the debate
Which of the following best illustrates an appropriately stated learning objective?
The student will explain how airplanes fly.
The standard error of measurement of a test is 2. A student obtained a score of 50 on the test. How should the student's score be interpreted?
The student's true score probably lies between 48 and 52.
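The band in this answer is the obtained score plus and minus one SEM, which is conventionally read as roughly a 68% interval. A minimal sketch, assuming measurement errors are approximately normal (the function name and the `n_sems` parameter are illustrative):

```python
def score_band(obtained_score, sem, n_sems=1):
    """Return (lower, upper) bounds of a band of n_sems SEMs around a score."""
    return obtained_score - n_sems * sem, obtained_score + n_sems * sem


lower, upper = score_band(50, 2)
print(lower, upper)  # 48 52
```

Passing `n_sems=2` would give a wider band (46 to 54 here), conventionally read as about a 95% interval under the same assumption.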
When a teacher selects a taxonomy of cognitive skills for use in planning instruction, which of the following is probably the LEAST important consideration?
The taxonomy can be used to prepare official government reports.
What is the best way to align instruction with state standards?
The teacher breaks each state standard down into specific learning objectives for lessons.
A parent brought to the attention of a teacher that she had mis-marked their child's test. What would you recommend to the teacher?
The teacher should re-score the test, change the grade, and thank the parent for bringing it to her attention.
Which of the following is a necessary condition for a highly valid criterion-referenced interpretation of test scores?
The test items should sample a well-defined domain of learning objectives
One ten-point quiz consists only of traditional true-false items. Another ten-point quiz, assessing the same content, consists only of multiple true-false items. How does the reliability of the scores on the two quizzes compare?
The traditional true-false quiz is likely to be less reliable.
Suppose a student is given the same math test on two consecutive days. Which score will remain the same?
The true score will be the same.
On a state test, some items were found to assess standards the students had not had an opportunity to learn. Which aspect of the test would most likely be challenged in court?
The validity
Why are problem-solving and critical-thinking abilities often assessed together?
These abilities are often used together in both real-life and academic situations.
Which of the following is considered to be a disadvantage when using stanines to interpret scores?
They do not precisely pinpoint a student's position in a group.
What is the most important reason you should teach students to ask open-ended questions?
They give evidence of student thinking.
Why are describing work and making suggestions for improvement usually the most powerful feedback?
They help students understand their own learning.
What do context-dependent item sets do?
They provide an opportunity for students to think about new information or situations.
Which of the following is the function of percentile ranks?
They specify a student's expected performance on a test.
Why are most standardized achievement tests not very useful for providing diagnostic information to the teacher?
They usually cover a very broad range of learning objectives.
A standardized test was standardized between March 10th and March 24th. Assume that your school cannot test in March. On which of the following dates would it be best to schedule your school's testing in order to provide the most accurate estimate of a student's standing in the norm group?
Third week in February
Which of the following is an example of evaluative feedback?
This is poor-quality work.
Why do you need to plan the weight each assessment contributes to a report card grade?
To enhance validity
What is the most important reason for rewriting introductory materials for items assessing the ability to use reference materials?
To ensure that the material is suitable for the purpose it is to be used
What is the main purpose of the Standards for Teacher Competence in Educational Assessment of Students or other statements defining standards for teachers' assessment literacy?
To guide teachers in identifying what is most important to know about assessment
Benjamin is a middle school student. His teacher has watched him slip farther and farther behind in his learning and wonders if he may need some special education services. Which of the following would be the teacher's most appropriate first step?
To inform the school principal or the school psychologist
What is the purpose of a growth and learning progress portfolio?
To show change over time on selected learning outcomes
For which purpose would you more likely use a multilevel criterion-referenced test than a multilevel survey battery?
To understand students' performance on learning goals in one curriculum area
For which of the following purposes would a best-works portfolio be best suited?
To use as evidence at a parent conference to show parents what their student has learned
In a marking period there are 200 possible points to earn. A teacher allowed a maximum of 20 points for homework and quizzes, 80 points for the midterm, and 100 points for the end-of-unit exam. At the end of the marking period, the teacher summed up the scores for each student. On the basis of the final sum, the teacher awarded grades to the students. What type of criterion-referenced grading is the teacher using?
Total points method
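The total points method described in the question can be sketched as below. The point maximums come from the question; the student's earned points are hypothetical values chosen only for illustration.

```python
def total_points_percentage(earned: dict, possible: dict) -> float:
    """Sum earned points across components and express them as a percentage."""
    total_earned = sum(earned.values())
    total_possible = sum(possible.values())
    return 100 * total_earned / total_possible


# Maximum points per component, as stated in the question (200 in total).
possible = {"homework_and_quizzes": 20, "midterm": 80, "end_of_unit_exam": 100}

# Hypothetical scores for one student.
earned = {"homework_and_quizzes": 18, "midterm": 64, "end_of_unit_exam": 85}

print(total_points_percentage(earned, possible))  # 83.5
```

Each component's weight is implied by its maximum points, so the end-of-unit exam counts for half of the grade without any separate weighting step.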
A task that assesses understanding of a concrete concept should require a student to demonstrate how the concept is related to other concepts that the student knows.
True
All major standardized achievement test batteries report students' percentile ranks, grade-equivalent scores, and expanded standard scores.
True
An advantage of using the tests that come with your curriculum materials is that you shorten the time needed to craft assessments.
True
At the secondary school level, use of a standardized achievement test should be guided, among other things, by the consideration of continuous educational growth over the grades.
True
It is definitely wrong to set an educational goal that expects all students to be at or above grade level in a subject area determined by a standardized test's national norms.
True
Most of the major standardized achievement test batteries include options for performance tasks as well as multiple-choice items.
True
Often, different standardized achievement tests have subtests with the same name (e.g., reading) that contain items assessing quite different content and skills.
True
One benefit of having students predict and graph their own progress during the memorization of facts (math facts, spelling words, states and capitals, etc.) is that it aids metacognition in what otherwise can be a rote activity.
True
Reporting a school's standardized test results to the school board without describing the other characteristics of the school and its students is an unethical assessment practice.
True
Sally, a grade six student, performed very poorly on a standardized achievement test she took in the fall. Based only on this performance, her teacher decided to put her in remedial programs. The action of the teacher constitutes inappropriate use of standardized achievement tests.
True
Standardized achievement tests are a supplementary source of information to support teachers' observations of students' educational development.
True
The major advantage of crafting your own assessment tasks instead of using published assessment tasks is that your assessment will better match what you taught.
True
The typical multilevel achievement battery uses different test items for each grade level.
True
Very often the validity of your interpretations of students' test scores is lower when you use tests that come with your curriculum materials than when you craft your own assessment tasks.
True
When selecting a standardized test for a secondary school, school officials should focus on locating a test that assesses educational development in a few broad areas instead of more specific curriculum subjects.
True
You should base a student's report card grade on summative assessment information.
True
For which item type is the chance of guessing correctly the highest?
True-false
Which of the following is able to best assess students' breadth of content knowledge?
True-false
Which of the following learning objectives can be assessed more appropriately using objective items rather than essay items?
Understanding the properties of the periodic table of the elements in a chemistry course
A state has its own mandated accountability test. Your school district wants to use an additional standardized test. Which of the following is a recommendation by the Brookhart and Nitko textbook as the school district deliberates what test to adopt?
Use a test that reflects the concerns of the community
In selecting a particular technique to use in assessing students, to what should the teacher pay attention first?
Use to which the assessment result will be put
Which step in the assessment process is most often problematic?
Using information to improve learning
States who apply for a waiver from NCLB requirements must convince the U.S. Department of Education that they have high expectations for students, have ways of identifying school quality, can support effective instruction and leadership, and reduce unnecessary reporting. What is the most common way states have met the requirement for high expectations for students?
Using the Common Core State Standards
Which of the following is the most important characteristic of a good criterion measure?
Validity of the scores in assessing the tasks to be predicted
A school principal wants to compare her school building's sixth grade students' average standardized test scores in each subject area with those of other school buildings. She accurately calculates the average scores but erroneously looks them up in the individual student norms tables instead of the school averages norms tables. She finds that the averages are near the 75th percentile in all subjects. Suppose the principal had looked up the averages properly in the school averages norm table. What would the results show?
We cannot determine how the results will compare to the 75th percentile because we do not have the school averages norms tables.
Which of the following questions is open-ended?
What are some ways we can clean up our atmosphere?
What can you learn from carefully facilitated class discussions?
What students are thinking
On which trait should you focus your assessment at the revision stage of the writing process?
Word choice
Which of the following is an example of a narrative writing prompt?
Write about the funniest thing that happened to you last week.
Which sequence of steps is the best for crafting a masterlist set of items?
Write the options, write the items, write the directions for the students.
When crafting tabular items, which of the following should be the last step performed by the teacher?
Writing the directions to students
Jane and John both took the same science achievement test. The standard error of measurement (SEM) for the scores is 2.8. Jane obtained a score of 67 and John obtained a 65. Is it reasonable to conclude that Jane and John have the same level of science achievement?
YES
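The reasoning behind this answer can be sketched in code. This is only an illustration, assuming the usual convention of a band of one SEM on either side of each observed score; the function names are hypothetical and not from the textbook.

```python
def score_band(observed, sem, n_sem=1.0):
    """Return the (low, high) band of observed score plus/minus n_sem standard errors."""
    return (observed - n_sem * sem, observed + n_sem * sem)


def bands_overlap(band_a, band_b):
    """True when two (low, high) intervals share at least one point."""
    return band_a[0] <= band_b[1] and band_b[0] <= band_a[1]


sem = 2.8
jane = score_band(67, sem)  # roughly 64.2 to 69.8
john = score_band(65, sem)  # roughly 62.2 to 67.8

# Overlapping bands mean the 2-point difference is within measurement error.
print(bands_overlap(jane, john))
```

Because the two bands overlap, the observed difference does not by itself justify concluding that the students differ in achievement.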
Should you keep records of formative, ungraded assessment information?
Yes
Which of the following comments is an example of descriptive feedback?
You used a lot of details from the story.
What is an open-response assessment task?
a flexible assignment type in which learners answer questions that might not have definite answers
Which of the following scoring options is most appropriate to evaluate students' responses to a task intended to assess students' problem-solving abilities?
a scoring rubric
For instructional purposes, it is best if teachers use
a single set of traits to describe good writing across many content areas and audiences
Who should get a true-false item correct?
a student who knows the content
Which of the following describes a derived score?
all of the above
True-false items are able to assess the
all of the above abilities
Which of the following can be meaningfully used as introductory material for context-dependent item sets? 1. formulas 2. extracts from weekly magazines 3. drawings 4. topographical maps
all of these
According to the Brookhart and Nitko textbook, a student who does NOT turn in an assignment should be given
an incomplete grade
Which of the diagnostic approaches is implied when a teacher says, "I'm interested in finding out why my students confuse the rotation and revolution of the planets."
analysis of knowledge structure
State-mandated assessment programs that hold schools accountable for test results
are focused on improvement over time.
Which of the following procedures is the best way to assess rule-governed thinking?
ask students to state the consequences of applying the rule
What does a T-score of 50 mean?
average performance
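A short sketch of why 50 marks average performance, assuming the standard T-score scale (mean 50, standard deviation 10) computed from a z-score. The raw scores, mean, and standard deviation below are made-up illustration values.

```python
def t_score(raw, mean, sd):
    """Convert a raw score to a T-score: T = 50 + 10 * z."""
    z = (raw - mean) / sd  # z-score: distance from the mean in SD units
    return 50 + 10 * z


# A raw score equal to the group mean has z = 0, so T = 50.
print(t_score(70, mean=70, sd=5))
# A raw score one SD above the mean lands at T = 60.
print(t_score(75, mean=70, sd=5))
```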
A teacher used a unit test to grade her students. After the test, she reviewed students' performance in terms of their difficulties and strengths. This helped the teacher plan her instruction for the next unit. The unit test served as
both A and B
Distractors for best answer items of an objectively scored format should be obtained from
both A and B.
English Language Learners should receive feedback on
both A and B.
A teacher says, "Ariel is able to add three-digit numbers without carrying and is now ready to learn how to add two-digit numbers with carrying." Which diagnostic approach is implied?
both B and C
Classroom teachers can use derived scores to describe their students' performance when they want to
both a and b
The masterlist set of items is especially useful in assessing students' ability to
classify instances of concepts.
Which of the following assessment methods is most closely associated with formative evaluation of students?
classroom questioning strategies
For meaningful feedback to students on their essays, the teacher should
comment on only a few errors that are relevant for subsequent work.
How can you get good ideas for effective distractors for multiple choice items?
common mistakes in students' class work
When crafting her test items for the end-of-term examination in social studies, Ms. Morrison checked each item to see if it matched the material that she taught the class.
content representativeness
Which of the following diagnostic approaches uses a norm-referencing framework to interpret students' needs?
content strengths and weaknesses
Which of the following approaches is most likely to help a first grader improve his sentencing skills?
correcting missing commas and periods and asking the student to copy his work over
What index quantifies the degree to which two variables are related?
correlation coefficient
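For readers who want to see the index computed, here is a minimal sketch of the Pearson correlation coefficient written from its definition. The two score lists are invented data, and `pearson_r` is a hypothetical helper name.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation: covariance divided by the product of the spreads."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


reading = [1, 2, 3, 4, 5]
writing = [2, 4, 6, 8, 10]

# Scores that rise together give a coefficient near +1.
print(round(pearson_r(reading, writing), 2))
# Reversing one list makes high scores pair with low ones, giving a value near -1.
print(round(pearson_r(reading, writing[::-1]), 2))
```

The coefficient describes how strongly two sets of scores move together; it does not show that one causes the other.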
A best answer variety multiple choice item is most useful for assessing critical thinking when the
criteria for evaluating the best answer have been explicitly taught.
Which of the following frameworks is most useful for prerequisite knowledge and skills assessment?
criterion-referencing
As a general rule, feedback should ________ students' work.
describe
Generalization of assessment results refers to
drawing conclusions about student achievement in a domain of learning from the test items the student actually took
Multiple true-false items are least appropriate to assess
elementary students.
Which of the following item formats is the best to use to assess "Evaluate" types of learning objectives?
essay items
Which of the following item formats is most appropriate for assessing students' ability to organize personal thoughts?
extended response
A science teacher wants to assess students' ability to select relevant information about global warming and organize their ideas into a comprehensive analysis of the issue. What type of essay task should she use for this purpose?
extended response essay
A middle school teacher gave her low-achieving students a blueprint for their upcoming unit test to help them study. Students who were not low-achieving did not get a blueprint. The teacher called this "differentiation." Which of the responsibilities of teachers has she violated?
fairness to all students
A good way to assess students' knowledge of state capitals would be to construct a matching exercise with the 50 states as premises
false
All the distractors in multiple-choice test items should be grammatically consistent with the stem except for the correct answer.
false
Failure to ensure recency of norms usually results in lower student scores in relation to the norms.
false
Immediate feedback works best for learning all different kinds of knowledge and skills.
false
In organizing a set of masterlist items, the set of items is put before the masterlist of options.
false
In the construction of multiple-choice items, it does not matter whether the options are arranged in some logical order.
false
In writing a masterlist item set, you should construct one item stem for each of the options in the masterlist.
false
It does not make any difference whether the interpretive materials, the masterlist of options, and the set of items appear on the same page or on different pages of the test.
false
It is acceptable to have the correct answer to one multiple-choice item depend on choosing the correct answer to another item.
false
Items written in the greater-same-less or similar format tend to be self-explanatory and do not require directions to students.
false
Options in a multiple-choice test function better when placed in the middle of the stem rather than at the end of the stem.
false
Providing feedback in online learning environments is very different from providing feedback in face-to-face classrooms.
false
Sally does not turn in two homework assignments. Billy turns in the same two assignments, but his work is all wrong. According to Brookhart and Nitko's textbook, the teacher should give both students the same failing mark.
false
Students who receive peer feedback on written work must make suggested changes before they turn in their finished assignment.
false
The correct answer variety of multiple-choice item is appropriate for directly assessing students' patterns of reasoning.
false
The longer the list of premises and responses, the more homogeneous the matching exercise is likely to be.
false
The norm data reported in a test manual cannot have a normal distribution unless students' raw scores naturally have a normal distribution.
false
The probability of randomly guessing the correct answer to a multiple-choice item increases as the number of alternatives increases.
false
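A quick arithmetic check of this item, assuming a blind guess is equally likely to land on any alternative (a simplification, since real students often eliminate some options first). The helper name is hypothetical.

```python
from fractions import Fraction


def chance_of_guessing(num_alternatives):
    """Probability of a correct blind guess when every alternative is equally likely."""
    return Fraction(1, num_alternatives)


# The chance shrinks as alternatives are added: 1/2, 1/3, 1/4, 1/5.
for k in (2, 3, 4, 5):
    print(k, chance_of_guessing(k))
```

With two alternatives, as in a true-false item, the chance is 1/2, which is why that format has the highest guessing rate.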
When a multiple-choice item is crafted using a complex sentence structure rather than simple language, it is likely that the validity of the results will increase.
false
When writing a matching exercise, it is important to ensure that the number of responses is equal to the number of premises.
false
What is a common problem teachers face when giving feedback to struggling students?
feedback looks like a long list of things to "fix."
Which type of item is best to use when assessing recall of factual information?
fill in the blank items
Which of the following decisions requires the highest reliability coefficient for the assessment results?
granting admission to a college
In administering a standardized achievement test, a teacher should be most familiar with the
guidelines in the manual for the test administration.
Ms. Noble covered up her students' names before she scored their essays. What problem is she most likely trying to avoid?
halo effect
Why is it difficult to craft items to assess understanding of a defined concept by requiring students to classify several examples?
The examples require a number of assumptions to make them true.
The most important use of students' scores on a standardized multilevel achievement battery is to
identify students' strengths and weaknesses in curriculum areas.
A multilevel achievement battery tests students
in several different curriculum areas.
Other things being equal, adding more test items will ________ the reliability of a test.
increase
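One common way to quantify this effect is the Spearman-Brown prophecy formula, which is not named in the item above and is used here as an assumption. It also assumes the added items are comparable in quality to the existing ones. The starting reliability of 0.60 is an invented value.

```python
def spearman_brown(reliability, length_factor):
    """Predicted reliability when a test is lengthened by length_factor.

    Formula: r_new = k * r / (1 + (k - 1) * r)
    """
    k = length_factor
    return k * reliability / (1 + (k - 1) * reliability)


original = 0.60
# Doubling the number of comparable items raises the predicted reliability.
print(round(spearman_brown(original, 2), 2))
# Tripling raises it further, with diminishing returns.
print(round(spearman_brown(original, 3), 2))
```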
Evidence on the internal structure of an assessment instrument concerns how the tasks on the instrument are
interrelated in measuring the same construct.
What is a strength of the prerequisite deficits approach to diagnostic assessment?
it helps teachers plan instruction that follows a learning trajectory.
Any deviation by the teacher from directions for administering a standardized achievement test would
lead to the reduction of the validity of the results of the test.
A social studies teacher wants to assess students' knowledge of countries' capital cities. Which item format is the best for this purpose?
matching items
Which of the following types of knowledge is LEAST meaningfully assessed by using multiple-choice items?
metacognitive knowledge
What is (are) the flaw(s) in the following item? "________ is the highest court in the United States."
misplacement of blank
Which of the following should NOT be reflected in a report card grade?
motivation to learn mathematics
Reliability is a(an) ________ condition for validity of the results of a test.
necessary
Which type of multiple-choice item asks students to identify non-examples?
negative variety
On a certain Science test, students have the following percentile ranks: Allie, 40; Maura, 45; Robert, 60; Baird, 65; Samantha, 90; Emma, 95. For which pair of students is the difference in raw scores likely to be largest?
none
Two teachers were reprimanded by the principal because of the poor performance of their students on a standardized test. What would you recommend to other teachers who face a situation like these two colleagues?
none of the above
Which of the following best assesses the rule-governed thinking students use when constructing an evidence-based argument from text?
not ask the students to describe how to write an evidence-based argument from text
The standard error of measurement refers to the standard deviation of
persons' observed score about their true score.
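In classical test theory the SEM is usually estimated from the score standard deviation and the reliability coefficient. The sketch below assumes that standard formula; the standard deviation of 10 and reliability of 0.84 are illustration values only.

```python
from math import sqrt


def standard_error_of_measurement(sd, reliability):
    """Classical estimate: SEM = SD * sqrt(1 - reliability)."""
    return sd * sqrt(1 - reliability)


# Higher reliability leaves less spread of observed scores around true scores.
print(round(standard_error_of_measurement(10, 0.84), 2))
print(round(standard_error_of_measurement(10, 0.96), 2))
```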
Report card grades can be used for all of the following purposes EXCEPT
punishing students for classroom misbehavior.
Which of the following essay-scoring factors is LEAST likely to lower the validity of social studies test scores?
quality of the ideas
Questions that yield the most useful assessment information for planning next steps in learning are typically
questions that require students to explain their thinking.
In order for a school system to make good decisions about students, they must
regularly update students' records.
Quality information in decision-making refers to information that is
relevant and reliable for the specific use to which it is to be put.
The sequence of the Cognitive Process dimension of the revised Bloom's taxonomy is
remember, understand, apply, analyze, evaluate, and create.
A schema can be described best as the way we
represent information in networks of related concepts.
You are a middle school mathematics teacher. Your students are learning about exterior and interior angles. One student has all the problems wrong on his homework assignment. What should you do?
reteach
A principal wanted to award prizes to students who demonstrated the best sense of citizenship. The principal asked the students' teachers and their immediate past teachers to rate the students. With which type of reliability should the principal be most concerned?
scorer reliability
The statement-and-comment format can be used to assess the students' comprehension of
selected lyrics of a poem
To assess the ability to write about one's comprehension of a given statement, which variety of the statement-and-comment items should the teacher use?
short answer version
Which of these procedures would yield the highest reliability coefficient on a test comprised of 50 heterogeneous items?
split halves
Which of the following methods of estimating reliability would be most affected by the speededness of the test?
split-half
For what type of student is example feedback usually most appropriate?
struggling students
Tasks on a summative assessment should yield information about
student achievement of intended learning outcomes.
Your students have studied Shakespeare's sonnets. They have learned about sonnet form, rhythm and meter, Shakespeare's use of imagery, and some of the recurring themes in his sonnets. You are choosing a sonnet to use for an assessment, and you want to make sure the assessment taps higher-order thinking. You should choose one of Shakespeare's sonnets that
students have not read before.
For feedback to students to be most effective, the teacher should ensure that
students review their performance to correct their mistakes.
What is another name for "test blueprint"?
table of specifications
In assessing students' comprehension of words and their meanings using a matching exercise, what should serve as the premise?
the meanings
A strength of teacher-made tests is that they can assess
the specific knowledge and skills taught in a unit of instruction.
What guides students most in adopting study strategies when preparing for essay or objective tests?
thinking skills to be covered
One major purpose of pre-assessment is
to help plan a unit of instruction.
What is the function of the "table" in a tabular matching exercise?
to record more than one set of responses
A good assessment task should help you to distinguish clearly between less knowledgeable and knowledgeable students.
true
A possible way to directly assess the ability to ask and answer challenging questions is by observing a student during his/her presentation and during someone else's presentation.
true
Before your school purchases a computer-based instructional program, it is important to find out how it provides feedback on students' work.
true
Best answer items may lack objectivity if the correct answer varies from teacher to teacher.
true
Distractors in experiment-interpretation items should be based on the typical misconceptions students have about why an experimental result came about.
true
If you want to assess students' critical thinking dispositions, you should construct a rating scale to be used throughout a marking period.
true
In classroom assessment, it is acceptable to include on a test multiple-choice test items with different numbers of alternatives.
true
In classroom assessment, matching exercises are most useful when learning objectives involve classification of some sort.
true
In tabular or matrix items, the students' task is to organize heterogeneous responses into homogeneous groups in a table.
true
In writing best answer items, it is inappropriate to include "none of the above" as one of the options.
true
In writing the multiple-choice version of statement-and-comment items, it is appropriate to use students' past responses on a short-answer version of the test as a basis for distractors.
true
Multiple-choice items are appropriate for indirectly assessing students' ability to make inferences.
true
One advantage of the multiple-choice format over the true-false format is that a student's chances of guessing the correct answer are smaller.
true
Perfect matching in matrix items is not as much of a concern as in traditional matching exercises.
true
The classroom learning environment affects how students use feedback.
true
The difficulty of a multiple-choice item can be increased by using distractors which are very similar to the correct option.
true
When crafting multiple-choice items, it is important to ensure that the distractors are plausible to students who do not know the answer.
true
When the correct answers to multiple-choice items on a test form a pattern, the validity of the test is lowered.
true
You can assess a student's ability to induce and to judge induction by using either the response-choice or the constructed-response item format.
true
With regard to the revised Bloom's taxonomy of learning objectives, the statement, "The student will be able to distinguish between simile and metaphor," is an example of what cognitive process?
understand
Which of the following is likely to be an appropriate student goal?
I want to learn my nine-times tables.
Which of the following is a concrete concept?
watermelon
To best participate in a formative assessment cycle, students need to first understand
what it is they are trying to learn
During the time that students are taking a standardized achievement test the teacher should ensure that all students
work on the correct pages and record their answers properly
Refer to the table above. What is the probability that a student with an aptitude test score of 80 or higher would perform below average in college math?
.15
Refer to the table above. What percentage of students with aptitude scores less than 10 were above average in college math performance?
14%
Refer to the table above. Suppose all students who obtained aptitude scores below 10 were refused admission to college level math courses. For what percentage of them would this have been a poor decision?
37%
The No Child Left Behind Act of 2001 is considered to have high stakes for schools because schools that fail to make adequate yearly progress toward student proficiency on state standards can receive the following sanctions.
All of the above are possible consequences of failure to make adequate yearly progress.
A school district wants to use assessments to evaluate its social studies program. Which of the following would be potentially valid types of assessment instruments?
All of the above would be potentially valid sources of assessment instruments for program evaluation.
With regard to the revised Bloom's taxonomy of learning objectives, the statement, "The student will be able to use the Pythagorean theorem to solve right triangle problems," is an example of what cognitive process?
Apply
What does the term "reliability" mean in regard to test scores?
Consistency of scores across various factors
Evidence that a social studies test adequately samples a specific area of the social studies curriculum is called
Content-related evidence
A state department of education wants to identify some schools that are doing exemplary work with students whose first language is not English, to serve as models for other schools. Which of the following data about schools might best help the state identify a set of schools for further consideration for this purpose?
Disaggregated state test results
Which type of validity evidence is weighted most heavily in college admissions assessment?
External structure
According to educators, one of the major advantages of the No Child Left Behind Act of 2001 is that it forces schools and teachers to focus only on the important material included in the state test.
FALSE
For planning assessments, state performance standards are more relevant than state content standards.
FALSE
Spelling test results that are valid for ranking students in spelling contests are likely to be equally valid for evaluating the types of word patterns students can spell.
FALSE
Statements of mastery learning objectives are broader than statements of developmental learning objectives.
FALSE
Test scores that are highly reliable are necessarily valid.
FALSE
The hierarchical nature of taxonomies of learning objectives implies that each teacher should teach lower level skills before moving on to teach higher level skills.
FALSE
To evaluate students' performance in mathematics, a teacher must measure the students in mathematics.
FALSE
Your primary concern in selecting techniques to assess a learning objective or objectives should be classroom practicality and efficiency.
FALSE
For the following, select the type of reliability estimate described in the assessment data collection statements below. A group of students was administered a different but similar set of assessment tasks, one set in the morning and one later in the day. The assessor wanted to find out whether the relative ordering of the students was similar on both sets of assessment tasks.
NOT Alternate form (different occasions) reliability estimation
For the following, select the type of reliability estimate described in the assessment data collection statements below. Students took an essay test comprised of 10 short essay questions. Each student's scores were recorded for each of the 10 essays. The assessor wanted to know whether students responded in a similar way from one question to another.
NOT Alternate form (same occasion) reliability estimation
What should be included as evidence to support the claim that the results of an assessment procedure validly reflect important mathematics thinking skills and processes?
NOT Expert judgments on the thinking processes and skills involved in students' solutions of the tasks BOTH A AND B
A science teacher correlates her students' lab grades and science fair project grades. Which of the following is she investigating?
NOT Internal consistency evidence Concurrent validity evidence
Which of the following is LEAST important for a criterion measure in a validation study to have?
NOT Relevance to long-term performance NOT Freedom from bias against individuals or groups High reliability
A teacher wants to know if students' scores on similar assessment tasks are consistent over time. Which of the following procedures of estimating reliability would be appropriate for this purpose?
NOT Test-retest reliability (different occasions)
A teacher administered a multiple-choice math readiness test to two of her classes. All the students in one class have nearly the same level of mathematical background and current achievement. The students in the other class differ widely in their mathematics background and current achievement. If the readiness test scores are correlated with the students' math course grades, for which group would a higher correlation be obtained?
NOT the class in which students are very similar in mathematics background and achievement
Marie had several failing grades in second grade, and her principal and teachers decided she was not ready for third grade work. The school suggested that she repeat second grade. What type of decision was the school making?
Placement decision
Which statement best shows the relationship between placement and selection?
Placement does not involve rejection, but selection does.
Two different schools in the same district gave their students the same mathematics test two times. The students were of similar ability and achievement levels. In School A, students took the tests one week apart. In School B, students took the tests at the beginning and the end of a report period. Which groups' test-retest scores will have a lower correlation?
School B
According to the legislation, the major purpose of the No Child Left Behind Act of 2001 is to help all students achieve high standards.
TRUE
Any time a teacher measures the performance of students, the teacher tests the students.
TRUE
In a certain high school, the correlation between students' scores on a reading test and a writing test is 0.89. This means that high performance in reading causes high performance in writing.
FALSE
One of the factors that positively affects the correlation coefficient between two tests is the similarity of the constructs being measured by the assessments.
TRUE
One reason why teachers have to formulate learning objectives before teaching is to ensure that each student is taught and achieves the most important points of the lesson.
TRUE
Test scores that are highly valid are necessarily reliable.
TRUE
The correlation between scores on a certain problem-solving test and scores on a certain spatial reasoning test is -0.75. This means that a student's problem-solving score is likely to be high if his spatial reasoning score is low.
TRUE
When you measure students' performance, you are assessing them.
TRUE
You know that your classroom assessment matches the corresponding mastery learning objective when the assessment requires students to do everything stated in the learning objective.
TRUE
Which criterion for stating specific learning objectives is missing in the following statement of a learning objective? "The student should be able to solve problems."
The content
Which of the following can be used as evidence to judge the external structure of test results?
The correlation coefficient between the test and a criterion
Which of the following is the best explanation of why the results of a grade three standardized achievement test in mathematics cannot be validly used by a grade three teacher for assigning grades at the end of the semester?
The items are not likely to emphasize what the teacher taught the students.
Which of the following is LEAST desirable for a list of learning objectives for a unit you plan to teach?
The list includes all desirable outcomes for the course, including minor ones.
Which of the following best explains the importance of using statements of learning objectives in the assessment of students?
The teacher knows the specific outcomes students should attain and develops appropriate assessment procedures to assess them.
In order to judge the validity of assessment results, one should look primarily for
a combination of sound research and logical explanation demonstrating the appropriateness of the interpretations.
One of the important teaching uses for taxonomies of thinking skills is to
check whether what is to be taught and assessed covers the most important cognitive skills.
An educational taxonomy is a tool used to ________ learning goals and assessment tasks.
classify
For the following, select the type of validity evidence illustrated by the question below. An English language instructor gave a verbal aptitude test to a class of 50 students. Later, the instructor compared these scores to their grades at the end of the first term.
external structure of the test results
For the following, select the type of validity evidence illustrated by the question below. After administering a test of 100 multiple-choice items to a class of 50 students, the teacher correlated the students' item scores with their total scores on the test.
internal structure of the task scores
A table of specifications is a table showing the
content areas and skills covered in a test.
A major reason for stating learning objectives at the planning stage of teaching is to make sure
the level of achievement students should attain by the end of the instructional period is clear.
Which of the following statements is an example of a developmental learning objective?
to write a persuasive essay
Which of the following is most likely to help a teacher score classroom performance assessments reliably?
use scoring rubrics for scoring