KIN 3502 FINAL -- ch 11, kin hhhh, KIN 3502 - Sport Skills & Motor Abilities (Chap. 11), KIN 3502 - Final - Chap. 12 Quiz (Psychological Measures), KIN 3502 - Exam 2 - Chap. 9 Quiz, KIN 3502 final -- ch 12, Kin 3502 Final- Psychological Measurements

अब Quizwiz के साथ अपने होमवर्क और परीक्षाओं को एस करें!

Title: Regression 89. If a regression analysis uses age, sum of skinfolds (SS), SS2, and gender to better understand body density, the analysis is called

*b. multiple regression

Title: Regression 102. A regression analysis yields the equation Y = 125.0 − 50(X). What is the value of the correlation coefficient?

*b. negative

Title: Correlation 18. The percentage of variance in one variable accounted for or shared by another variable is denoted by what statistic?

*b. r^2

Title: Correlation 39. Which of the following statistics is a measure of the amount of variance accounted for between two variables?

*b. r^2

Title: Correlation 42. What does a negative value of a Pearson correlation coefficient mean?

*b. that high scores on one variable are associated with low scores on a second variable

AAHPERD tests are designed primarily for _______________ performers

*beginning-level* performers

Title: Prediction 12. Which of these correlation coefficients has the least predictive value?

*c. .17

Title: Regression 95. A regression analysis yields the equation Y = 3.0 − 20(X) − 30(Z). Considering an X value of 5 and a Z value of 10, what is the Y intercept?

*a. 3

Title: Correlation 14. Which of the following statements is true?

*a. A correlation of +.80 is just as strong as a correlation of -.80.

Title: Correlation 15. Which of the following is true if a correlation of -1.00 exists between two variables?

*a. A high score on one variable is associated with a low score on the other variable.

Title: Variables 52. What can you do if two variables are correlated?

*a. Estimate one from the other.

Title: Correlation 20. If two variables have a correlation coefficient of .85, it can be inferred that

*b. a relationship exists

Title: Correlation 26. The relationship between the coordinate points (1,2), (3,3), (4,5), and (5,5) is

*b. direct

Title: Correlation 107. A correlation coefficient of .96 depicts a strong indirect relationship.

*b. false

Title: Correlation 111. Multiple correlation is when you analyze more than 1 correlation coefficient.

*b. false

Title: Regression 110. An adjusted R square is useful when interpreting a simple linear regression analysis.

*b. false

Title: Prediction 30. In simple linear prediction, the residual scores are

*b. found by subtracting the predicted Y from the measured Y

Title: Correlation 69. If values of X tend to increase as their paired values of Y tend to decrease, the relationship between X and Y will be

*b. indirect

Title: Regression 105. A regression analysis yields the equation Y = 125.0 − 50(X). What is the relationship between X and Y?

*b. indirect

Title: Prediction 33. Which of the following correlation coefficients provides the most predictive power?

*e. -.94

What score represents P16 if the mean of the distribution is 300 and the standard deviation is 10? a. 295 b. 290 c. 275 d. 265

290

In the Self-Motivation Inventory, if you're doing physical activity infrequently, you are in Stage...

The z score of -.50 represents what percentile? a. 30.85 b. 19.15 c. 38.30 d. 50.00

30.85

A pupil attains a raw score of 82 on a test with a mean of 100 and a standard deviation of 12. What is the corresponding T score?

Assume that a teacher wants to grade on the curve. The average grade will be a C. She will give 10% of the class an A and 10% an F. What is the T-score cutoff to pass the test (i.e., earn a grade of D or better)? Round to the nearest whole number. (Hint: Find the z score from the table of normal-curve areas.)

In the Self-Motivation Inventory, if you're active consistently for less than 6 months, you are in Stage...

What is the mean of a T score?

30. Which of the following test items is not common to all youth fitness batteries? a. body composition b. aerobic capacity c. flexibility d. strength

33. A high school volleyball coach determines that each player must be able to serve 8 out of 10 serves into the court in overhand fashion. This is an example of a(n) a. judgmental standard b. normative standard c. empirical standard d. combination standard

85. When developing a psychomotor test, you should analyze the sport to determine the skills or abilities to be measured. a. true b. false

total body movement tests

(*speed tests*) *assess the speed* at which a performer *completes a task* that involves *movement* of the *whole body* in a *restricted area*

Examples of *Product*-oriented assessments

- MPH - # of homeruns - distance/heigh jumped - time elapsed in a 40 yd. dash - # of levels completed on PACER

Primary *Subdomains* of *Human Performance*

- muscular strength - speed - agility - anaerobic power - flexibility - balance - kinesthetic perception (ability to perceive a position in space)

Purposes of Human Performance Analysis

- selection - classification - diagnosis - prediction

what should be considered when using "other tests" for a skills test?

- time - facilities - # of testers - # of tests being used - type of tests being used (*only 1 trait should be measured at a time*)

13. What name is given to specifically written goals with directions for attaining them? a. cutoff scores b. criterion-referenced tests c. behavioral objectives d. instructional aims

...

14. What name is given to a test score indicating that a person actually met the criterion although the field test indicated he did not? a. false positive b. false negative c. master d. nonmaster

...

16. An epidemiologist tests whether meeting national physical activity guidelines relates to health outcomes. What are the predictor and outcome variables in this study? a. Both the predictor and outcome are continuous. b. The predictor is continuous and the outcome is categorical. c. The predictor is categorical and the outcome is continuous. d. Both the predictor and outcome are categorical.

...

17. A mastery test is a type of norm-referenced measure. a. true b. false

...

18. A mastery test is a type of _______ measure. a. norm-referenced b. criterion-referenced c. standardized d. empirical

...

19. Which of the following contains compatible words? a. mastery test, norms, cutoff score b. mastery test, criterion, cutoff score c. achievement test, norms, cutoff score d. achievement test, criterion, cutoff score

...

2. Which of the following best demonstrates the concept of a criterion-referenced standard? a. Allison's percent body fat is nearer the mean than Melba's. b. Scott was able to achieve a VO2max of 65 ml · kg-1 · min-1 on the treadmill. c. Christi scored a 95% on the final examination. d. Elaine could not swim well enough to enter intermediate swimming. e. Jessica's score was 3 standard deviations above the mean.

...

27. What approach to setting a cutoff score would most likely compare the test results of individuals known to have a certain attribute to individuals known not to have the attribute? a. judgmental b. normative c. empirical d. combination

...

3. What type of exam is the Red Cross lifesaving test? a. norm-referenced test b. criterion-referenced test c. contains elements of norm-referenced and criterion-referenced measurement d. neither norm- nor criterion-referenced

...

4. Which analysis is generally not used with criterion-referenced testing? a. chi square b. phi c. percent agreement d. t test e. kappa

...

If everyone in a classroom scores the same on an exam, the variance is a. 0 b.1 c. −1 d. unable to determine

What is the mean of a z score distribution? a. 0 b. 1 c. 10 d. 50

What area of the normal curve lies between a z score of .48 and a T score of 42? a. 0.00 b. 10.00 c. 18.44 d. 31.56 e. 50.00

0.00

Assume that resting heart rates are normally distributed with a mean of 80 beats per minute and a standard deviation of 8 beats per minute. What is the probability of randomly selecting a person with a heart rate of 95 beats per minute or greater? a. 0.000 b. 0.030 c. 0.500 d. 0.970

0.030

The z score at the 70th percentile is

0.52

Fat mass is

0.90 g/cm cubed

In a normal distribution, the mean divided by the median equals a. 0.00 b. 100 c. 1 d. a figure that cannot be calculated

In the Self-Motivation Inventory, if you are not even thinking about changing, you are in Stage...

What are the three major components of health-related physical fitness?

1) aerobic capacity 2) body composition 3) musculockeletal fitness

5 suggestions for *Improving Rating Scales*

1) develop well-constructed scales 2) thoroughly train raters to use scales 3) explain common rating errors to the raters 4) allow raters ample time to observe behaviors 5) use multiple raters whenever possible (improves reliability & validity)

What is the T score for the 43rd percentile? a. 48.20 b. 48.52 c. 49.30 d. 49.82

48.20

Consider the following scores: 2, 3, 3, 4. What is the standard deviation? Round to the nearest tenth.

What is the T score associated with the 97.5th percentile

If a normal distribution of scores has a mean of 70 and a standard deviation of 10, what score does it take to be 2 standard deviations above the mean? a. 50 b. 60 c. 80 d. 90

What % of athletes use imagery?

103. A score on an inventory designed to assess a person's beliefs about diet and nutrition is associated with which domain? a. affective b. cognitive c. effective d. psychosocial e. psychomotor

104. True score variance can be found by a. subtracting error score variance from observed score variance b. subtracting error score variance from true score variance c. subtracting total score variance from observed score variance d. grading the exam

107. AAHPERD's basketball skills test battery includes a. speed spot shooting b. blocking out c. layups d. offensive movement

118. Subjective rating of performances on a fixed scale is a(n) a. absolute scale b. discrete scale c. relative scale d. percentile scale

152. When a correlation coefficient is calculated between two different trials, it is a form of a. reliability b. objectivity c. equivalence d. criterion validity

156. The SEM is used for reliability, whereas the SEE is used for validity. a. true b. false

157. If a correlation coefficient is calculated for a test-retest reliability study, a paired t test should also be calculated. a. true b. false

158. An individual's true score is never known. a. true b. false

160. The reliability between two different people scoring a test is called interrater reliability. a. true b. false

161. Validity of a test refers to its relevance. a. true b. false

162. What happens to the standard error of measurement as the standard deviation decreases? a. It goes down. b. It goes up. c. It depends on the type of test. d. It depends on the validity.

26. The FITNESSGRAM includes a. two approaches to the assessment of physical activity b. a variety of approaches to the assessment of physical activity c. no techniques to assess physical activity d. only a written test of physical activity

4. A student pushes as hard as possible against a fixed-lever arm on a dynamometer. What type of measurement is involved? a. isometric b. isokinetic c. isotonic d. isogenic

48. Which of the following would be considered a test of power? a. 50-yard dash b. curl-ups to exhaustion c. curl-ups in 1 minute d. pull-ups

50. The principal youth physical fitness program in the United States is the a. FITNESSGRAM b. YRBS c. Eurofit d. Kid's FIT

61. A lower limit of the "Healthy Fitness Zone" of the FITNESSGRAM test battery a. reflects a minimal accepted level for health b. reflects a maximal accepted level for health c. reflects a minimal accepted level for performance d. reflects a maximal accepted level for performance

63. PACER is a test that estimates a. aerobic capacity b. body composition c. lower-body strength d. power

64. Curl-ups measure a. abdominal strength b. flexibility c. upper-body strength d. lower-body strength

64. What is the standard error of measurement for a test having a reliability of .64 and a standard deviation of 5? a. 3.0 b. 3.2 c. 5.0 d. 6.4

70. The ACTIVITYGRAM uses a previous-day physical activity recall approach to assessing physical activity levels in youth. a. true b. false

71. A skill is a learned trait based on the ____________ that a person has. a. abilities b. attitudes c. knowledge d. behaviors

74. Why is test reliability so important? a. It is an indication of the reproducibility of a student's ability in an area. b. It is necessary for comparison of students' results to criterion-referenced norms. c. It is necessary for comparison of students' results to norm-referenced standards. d. It is an indication of the test's ability to measure what it reports to measure. e. If a test is reliable, then it is valid.

76. Statistical validity coefficients range from −1.00 to +1.00. a. true b. false

94. What is the relationship between error variance and true score variance? a. They are inversely related. b. They are positively related. c. They add up to less than observed variance. d. They add up to more than observed variance.

95. It is important to reevaluate an instrument from time to time, because students may change over time. a. true b. false

96. The measurement issues of reliability and validity are important in skills testing. a. true b. false

Highly correlated with maximum strength

Absolute endurance

blood pressure

An example of continuous quantitative data

number of days spent in ICU

An example of discrete quantitative data

1. The Educational Testing Service establishes the reliability of parallel forms of a test to measure students' knowledge in physical education. What is the appropriate coefficient for this type of reliability? a. standard error of measurement b. equivalence reliability c. alpha coefficient reliability d. Spearman-Brown formula reliability

105. The common notation for reliability is a. rxy b. rxx′ c. r2xx′ d. r2xy

11. Ms. Goodtester wants to study the variance among three 30 s trials of a tennis wall volley test. What procedure will she probably use? a. content validity b. intraclass reliability coefficient c. interclass reliability coefficient d. uncertain from the information given

111. The potential problem of repetitive-performance tests is that they a. might not be suitable for large groups of subjects b. might not transfer into actual game performance c. might not be appropriate when time is an issue d. are not easy tests

112. Total-body movement tests are often called a. max-out tests b. speed tests c. torso tests d. power tests

120. Checklists are appropriate if both ____________ and _____________ are being evaluated. a. student; instructor b. process; outcome c. knowledge; skills d. quality; quantity

120. The following coefficients are classified as intraclass reliability coefficients. a. alpha and omega b. alpha and KR20 c. correlation and omega d. correlation and KR20 e. alpha and correlation

121. Guidelines for the development of skills tests state that skills tests should a. be of simple difficulty b. be interesting and meaningful to the performer c. not be concerned with proper form d. include extraneous variables as much as possible

122. The tendency for a rater to give high ratings to a subject because of bias is called the a. Hawthorne effect b. halo effect c. size effect d. Avis effect

123. The tendency of raters not to use the extremes of a rating scale is called the a. absolute tendency error b. central tendency error c. relative tendency error d. variance tendency error

130. The SEM is calculated by multiplying the square root of 1 minus the reliability coefficient by the a. average of the test b. standard deviation of the test c. variance of the test d. validity coefficient

132. What is the 95% interval that you expect the true scores to fall in, given a score of 80 and SEM of 6.32? a. 73.68 to 86.32 b. 67.36 to 92.64 c. 61.04 to 98.96 d. 6.32 to 80

136. Validity can be subdivided into three types: content, criterion, and a. consistency b. construct c. reliability d. truth

14. In comparison to a basic ability, a specific skill is a. dynamic, common, and innate b. dynamic, specific, and developed c. static, common, and innate d. static, specific, and developed

14. The principal weakness of fitness tests for special children is a. low reliability and validity b. limited information on reliability and validity c. lack of feasibility d. lack of relevance

142. Correlating heart rate and blood pressure as measures of cardiovascular fitness is a form of which type of validity? a. concurrent b. construct c. content d. predictive

144. One type of criterion validity is a. construct b. concurrent c. formative d. summative

166. If there is no observed score variance, reliability will be high. a. true b. false

167. A reliability of .98 is larger than a reliability of −.98. a. true b. false

17. Which of the following tests measures components of health-related physical fitness? a. dips and 440-yard (402.3 m) dash b. flexed arm hang and sit-ups c. 1-mile run and standing broad jump d. 12-minute run and 50-yard (45.7 m) dash

46. The National Center for Health Statistics has shown __________ in the percentage of overweight children and adolescents from the 1960s to 2000s. a. a decline b. an increase c. stability d. inconsistency

47. What index does the correlation of test 1, form A and test 1, form B represent? a. test-retest b. equivalence c. internal consistency d. split-halves reliability

48. The AAHPER Youth Fitness Test included a. bench press b. shuttle run c. curl-ups d. leg squats

58. An exam covering a single subject that grades as either pass or fail is considered a. a norm-referenced exam b. a criterion-referenced exam c. a standardized exam d. an achievement exam

58. What is the most important aspect of a test administered in the psychomotor domain? a. reliability b. validity c. objectivity d. length

59. The four basic approaches used to develop criterion-referenced standards for human performance tests are judgmental, normative, empirical, and a. achievement b. combination c. formative d. summative

71. A test can have high criterion-related validity if it is both relevant and reliable, even if the criterion lacks reliability. a. true b. false

71. The primary tool for the statistical analysis of CRTs is a. ANOVA b. a contingency table c. regression d. t tests

72. Abilities are more _________ and _________ than skills. a. diverse; holistic b. innate; general c. learned; specific d. learned; general

72. If data from a CRT are continuous, they must first be __________ before they are analyzed. a. averaged b. categorized c. ranked d. standardized

73. Which of the following procedures would be least likely to increase the reliability of a test? a. Increase the number of items on the test. b. Give the test to a more homogeneous group of students. c. Increase the discriminating power of the items. d. Reword the items to reduce ambiguity.

75. A skills test should involve many performers if possible. a. true b. false

75. To establish reliability of a CRT, you must first decide if you're concerned with stability or a. averaging b. equivalence c. dependability d. validity

78. What is the general relationship between the variance error of measurement and test reliability? a. They are positively related. b. They are negatively related. c. They are unrelated. d. It depends on the reliability. e. It depends on the validity of the test in the situation.

78. _____________ tests usually provide higher reliability and validity coefficients than _____________ tests. a. Sport skills; motor skills b. Sport skills; written c. Written; motor skills d. Written; sport skills

88. Constructing test items that are practically important is not a major concern when developing a psychomotor test. a. true b. false

88. The phi coefficient is a special case of a a. Pearson chi square b. Pearson correlation c. Fisher chi square d. McNemar chi square

93. What happens to the standard error of estimate as the correlation between X and Y goes up? a. It goes up. b. It goes down. c. It depends on the reliability. d. It depends on what is being measured.

94. A Kappa value of .6 is considered to be moderate. a. true b. false

95. Which form of validity evidence would be most likely to rely on both logical and statistical procedures? a. concurrent b. construct c. content d. predictive

what can built environments contain?

Buildings and parks or green space for neighborhoods

101. Of the following scores, the one that represents the difference between the observed measurement and the correct value is the a. average score b. observed score c. error score d. true score

106. Reliability can be defined as the proportion of total variance that is a. error score variance b. observed score variance c. true score variance d. true score and error score variance

125. When calculating Cronbach's alpha, the denominator of the right-side term represents the a. number of trials b. sum of the variance of each trial c. variance for the sum across all trials d. grand mean

126. The square root of the reliability coefficient is called a. Cronbach's alpha b. KR20 c. the index of reliability d. SEM

127. If the reliability coefficient is .75, then the index of reliability is a. .56 b. .75 c. .87 d. .89

13. Which of the following types of reliability least belongs with the other two? a. KR20 b. alpha c. split halves with Spearman-Brown correction

131. If a test has a standard deviation of 20 and a reliability of .90, what is the standard error of measurement? a. .32 b. .95 c. 6.32 d. 18.97

139. Correlating a criterion measure with another field measure is an attempt to show a. content validity b. construct validity c. criterion validity d. internal consistency

140. Statistical validity is another term for a. content validity b. construct validity c. criterion validity d. internal consistency

141. Correlational validity is another term for a. content validity b. construct validity c. criterion validity d. internal consistency

18. What would be the best measure of bowling achievement? a. Have the student bowl one line. b. Record the number of strikes and spares made in five games. c. Compute a five-game bowling average. d. Obtain experts' ratings of bowling ability.

19. Health-related physical fitness is based on which of the following constructs? a. aerobic capacity b. motor fitness c. functional health d. body composition

30. What conditions would result in the largest value for the standard error of measurement? a. high reliability and a heterogeneous group b. high reliability and a homogeneous group c. low reliability and a heterogeneous group d. low reliability and a homogeneous group

46. Which test is most likely to be high in objectivity? a. one that requires training sessions for the tester b. one involving expensive equipment c. one that discriminates well among examinees d. one with simple directions

62. For what type of validity does the following experiment offer evidence? When a test designed to measure fitness knowledge was given to a group of students rated high in fitness knowledge and another group rated low in fitness knowledge, the group rated high in fitness knowledge scored better on the examination. a. predictive validity b. concurrent validity c. construct validity d. content validity

62. The approach used to develop criterion-referenced standards that is based on the experience or beliefs of experts is a. combination b. empirical c. judgmental d. normative e. standardization

63. Which of the following is a concern when you have several raters? a. Pearson product-moment correlation b. reliability coefficient c. objectivity coefficient d. validity coefficient

74. An analysis of students scoring above and below a standard on two different testing periods would best be analyzed using a(n) a. t test b. ANOVA c. chi square d. Pearson product moment e. intraclass reliability

Which is NOT a factor on a semantic differential instrument? A) activity B) evaluation C) feeling D) potency

C) feeling is NOT a factor (*evaluation, potency, & activity* are all factors of a semantic differential instrument)

reliable

Changing the number of free throws from 5 to 15 has to do with making the test more

79. A chi-square test of association can be used in a _________ study. a. reliability b. mean change c. validity d. both a and c

79. Testing of sport skills and motor abilities comes from the ________________ domain. a. affective b. cognitive c. effective d. psychomotor

8. The determination of a child's fitness depends on a. the fitness test b. the standard used in evaluation c. the test administrator d. a and b

80. The contingency table of a chi-square test assessing the association between passing and failing a field test performed on two different days would have what dimensions? a. 1 × 1 b. 1 × 2 c. 2 × 1 d. 2 × 2 e. 3 × 3

84. For a measure to be valid, what must be true? a. It is consistent. b. It contains a large amount of true score. c. It contains little measurement error. d. It measures what it is supposed to measure.

85. When calculating the proportion of agreement, the number of agreements can be found by adding the a. row totals b. column totals c. row and column totals d. diagonal

90. What is the alpha coefficient also known as? a. Pearson product-moment correlation b. a reliability estimate c. KR20 d. two of these

97. The degree to which repeated measurements of the same trait are reproducible under the same conditions refers to a. affective domain b. cognitive domain c. formative evaluation d. reliability e. validity

98. The degree to which a test pertains to its objectives relates to a. affective domain b. cognitive domain c. formative evaluation d. relevance e. research

What is the appropriate procedure for weighting the score on the final examination to count twice as much as the midterm examination score?

Double the T score for the final and add this to the T score for the midterm.

14. Which of the following factors will ensure that a test reflects objectivity? a. Include a large number of trials on the test. b. Have many instructors score the test. c. Administer the test on more than one occasion. d. Use the alpha coefficient when determining the intertester reliability. e. Create a well-defined scoring system for the test.

84. The proportion of agreement can be computed by dividing the number of agreements by the a. first-row total b. second-row total c. first-column total d. second-column total e. grand total

99. AAHPERD's tennis skills test battery includes all except a. ground stroke b. forehand and backhand c. serve d. volley e. scoring

two parallel or equivalent forms of an exam are given to participants

Equivalence

basketball free throws

Example of accuracy test?

Increased interest due to percent of population that is obese/overweight

Exercise motivation

___ motivation is on a continuum from intrinsic to extrinsic

Extrinsic

Which of the following is an inferential statistic? a. mode b. F ratio c. standard deviation d. range

F ratio

T/F -- current affective states do not influence scores on trait responses

FALSE -- do influence

Assessment allows all students, regardless of gender, ethnicity, or background, equal opportunity to do well

Fairness

occurs throughout the learning process and is useful for understanding current progress

Formative assessment

What is fat around the hips called

Gynoid obesity

In what tests are patients are given a grade based on their ability to move through their range of motion against gravity

HHD

Most often used in clinical situations to assess muscle strength

HHD testing

repetitive-performance tests usually have *HIGH* _________, but may *not replicate game situations*, causing __________ ___________

HIGH *reliability*; DECREASED *validity*

2 things you need to measure BMI

Height and weight

directional

Hypothesis: On average males 1 RM are higher than females.

unlikely that the difference is due to chance

If a p-value is closer to 0, what does that mean?

Psychological Skills Training usually includes ___, ___, ___, ___, etc.

Imagery Self Talk Concentration Goal Setting

What type of hypothesis does competitive anxiety have?

Inverted-U idk wtf that is but

Muscle generates force at a constant speed through a range of motion

Isokinetic contraction

Golden Rule #8

Keep *records* (no records = no point in testing = no valid information)

Who has developed a taxonomy for the psychomotor domain

Krathwohl

Gold standard of flexibility

Laboratory assessment of the ROM of a specific joint

Field methods for muscular strength and endurance assessed by

Lifting external weights or the repetitive movement of the body

Scale used to assess the degree of agreement or disagreement with statements

Likert scale

What is a built environment

Manmade surroundings that provide the setting for human activity

Precursor to HHD testing

Manual muscle test

1RM is used for

Measuring maximum strength

Waist/hip ratio// men should be ___ women should be___

Men should be less than 102 cm, Women less than 88 cm

Immediate setting within which individuals interact

Microsystems

The force that can be generated by the musculature that is contracting

Muscular strength

Is there a test that provides values for total body flexibility?

No Because flexibility is joint specific, determining ROM of a few joints does not necessarily provide an indicator of flexibility in other joints

BMI=kg/m^2

Normal BMI is 18.5 to 24.9, Overweight is 25-29.9, Obese class I is 30-34

Uses imagery, self-talk, concentration, goal setting, etc.

PST

Intended to improve performance and enhance athletes' enjoyment and satisfaction

PST (psychological testing)

Two main areas of sport and exercise psychology

Performance enhancement, Mental health

validity

Performance should have high correlation with activity and test performance

Six general purposes of measurement and evaluation

Placement, Diagnosis Prediction Motivation Achievement Program evaluation

What are the 5 stages in The Stages of Change for Exercise and Physical Activity thing?

Precontemplation Contemplation Preparation Action Maintenance

Measurement of repetitive performance related to maximum strength

Relative endurance

Mesosystems can include

School, neighborhood

Most widely used flexibility test in physical programs

Sit and reach test

these types of measures provide more accurate and reliable predictor of behavior

Situation specific

flexibility will vary depending on which muscle and joint are being evaluated

Specificity

___ is a rapidly developing field that encompasses a wide scope

Sport and Exercise Psychology

occurs at the end of the learning process and is used to assess student achievement

Summative assessment

. Which of the following measure represents the highest degree of relative performance? a. percentile of 90 b. T score of 72 c. raw score of 60 with mean of 50 and standard deviation of 10 d. z score of 1.5

T score of 72

T/F -- relationship btwn Trait vs State measures may not always be concrete

TRUE

describe and summerize data; outliers

The first step in any analysis is to ______, which is an oppurtunity to look for _____.

Intrinsic motivation to know, to move toward accomplishments, and to experience stimulation

Tridimensional

Items to consider with muscular testing

Unlike body comp and CR endurance, there is no single measurement that provides assessment of muscular strength or endurance

Four things for selecting the best test

Validity Reliability Objectivity Feasibility

Who is related to the Exercise Motivation Scale?

Vallerand

What are the 6 subscales of the Profiles of Mood States?

Vigor Confusion Anxiety Tension Anger Fatigue

essential prior to assessing flexibility, should begin with some whole body aerobic exercise such as walking or cycling

Warmup

feasibility

What a disadvantage of power and distance test?

golf, bowling, archery

What are good sports that measure performance?

dependent (paired) t-test (pre test to post test?)

What test would you use to see if two groups of related score are different?

independent t-test (did males and females differ in 1 RM bench press)

What test would you use to see if two samples differ?

probability

What uses two socres to predict % of scores above/below score based on mean and SD?

testing one person

When would it be best to use an essay test?

objectivity

Which is not a criterai for skill test?

ordinal

Which is not a level of data?

dependent t test

Which test would you use for group of individs improving form pre test to post test

different scales & measurements in different units?

Why can't you add up raw scores to assess overall fitness?

The Self-Motivation Inventory was designed to measure ___

a person's self-motivation to persist

What is summation notation? a. a series of Greek symbols b. an extension of scales of measurement c. the software used within PASW to generate results d. a shorthand method of describing mathematical steps

a shorthand method of describing mathematical steps

Test-retest—

a single test is administered twice to participants

what is used in qualitative measurement interviews?

a tape recorder or notes

The Stages of Change for Exercise and Physical Activity thing was developed as framework to describe phases involved in...?

acquisition and maintenance of a behavior

involved in behavior change

action

Guidelines for builders, designers, city planners to improve healthy behaviors in a community

active design guideline

adjectives that describe action

activity

non linear regression

arousal - performance relationhips would be ?

distance or power performance tests

assess participant's ability to project an object for *maximum displacement* or *force*

performance-based testing

assessment of the actual performance; concrete criterion to compare performance against

realistic goals will help students realize what they are able to achieve and build self-confidence and self-efficacy

attainable

Title: Prediction 29. In a prediction or regression equation, the standard error of estimate is a function of

b. the standard deviation of the dependent variable (Y) c. the correlation between X and Y *e. b and c

Behavioral regulation in exercise questionnaire

based on SDT

Which of the following methods is used to test the association between two nominal variables? a. chi square b. t test for two dependent groups c. one-way ANOVA d. t test for two independent groups

chi square

Which of the following is not a PASW command that you can use to summarize your data? a. descriptives b. frequencies c. histograms d. cluster

cluster

ANOVA

compare three mean values

analysis

comparing and contrasting ideas

What does the Sport Competitive Anxiety Test measure?

competitive trait anxiety

Early research of built communities focused on ___ with supervised exercise programs in relation to proximity to facilities

compliance

PA surveys have become more _____, allowing assessment of specific behaviors such as walking or cycling for both recreational and transportation purposes

comprehensive

*issue in skills testing*: may need to *compromise* btw ______________ & ________________

compromise btw *choosing an extremely objective & reliable test* & one that is *more game-like* or real

noted as the most impt psychological factor in sport. has significant influence on performance. shown to manifest differently in sport than in other environments

confidence

Learning and practicing effective self talk can enhance ___, ___, ___, etc.

confidence focus performance

Reliability—

consistency of measurement

intention to change behavior

contemplation

The variable weight (in kg) would be considered a. continuous b. interval c. ordinal d. nominal

continuous

The variables age (in years), sport, and college major would be considered _______, _______, and _______, respectively. *a. continuous; nominal; nominal b. ordinal; nominal; ordinal c. continuous; nominal; continuous d. continuous; continuous; ordinal

continuous: nominal ; nominal

The variables height (in inches), gender, and race would be considered _______, _______, and _______, respectively. a. nominal; nominal; nominal b. ordinal; nominal; ordinal *c. continuous; nominal; nominal d. continuous; nominal; ordinal

continuous; nominal ; nominal

summary values

convienent way to summerize large amounts of data

reliability

criteria in tests adhere to standardized procedures

validity

criteria in which performance on test should have high correlation to performance in the activity

feasibility

criteria in which test can be administered practically

qualitative measurement provides

depth and detail

3 types of performance based assessments

diagnostic, formative, summative

Each strength testing device produces ___ results

different

qualitative measurement observation involves _, _, and _

direct observation coding videotaping

Which pair of scores represents the same level of measurement? a. football jersey numbers and height b. eye color and distance c. height and eye color d. distance and height

distance and height

multimodal

distribution with multiple peaks

bimodal

distribution with two peaks

Who did best relative to his or her classmates? That is, who had the highest percentile? a. Timmy, whose percentile was 97% on a fitness test b. Jessica, who scored 45 correct on a test with 50 questions c. Shawn, whose z score was 0 on a written health test d. Allison, whose T score was 75 on a soccer test e. unable to determine from data given

e. unable to determine from data given

PA surveys have become more comprehensive, allowing assessment of specific behaviors such as walking or cycling for both ____ purposes

ecreational and transportation

procedure for psychomotor testing are the same as those for written tests; 3 phases

effective testing procedures

what does mental health focus on?

enhancing mental health and well-being through participation in physical activity, sport, and exercise

The Likert scale assume ___ intervals between responses

equal

What type of fat is in the heart, lungs, etc

essential

Which type of fat is normal for bodily function

essential

prediction

estimate a person's socre on one measure

the degree of goodness you attribute to concept (SD scales)

evaluation

semantic differential scales measure 3 factors --

evaluation potency activity

variability

extent to which the individual data values deviate from location

The ability to move body parts through a wide range of motion without undue strain to the articulations and muscle attachments

flexibility

Most accurate tests of flexibility are those in which a ____ is used to measure the actual degrees of rotation of various joints

goniometer

What are people doing in the Contemplation stage?

have an intention to change behavior

what 2 things does the interviewer do in qualitative measurement interviews?

have good rapport with the participant remain non-judgmental

ratio (length and weight)

highes level of classificiation, common unit of measurement and true zero point

Which of the following graphs is the best choice in determining the shape of a distribution? a. bar chart b. histogram c. pie chart d. frequency table

histogram

The orientation of goals in Goal Setting is...?

how success is defined

total body movement tests focus on what 2 factors?

how well participant *controlled movement* & how *fast* participant *completed movement*

One of the precautions for using psychological testing in sport and exercise is that it should not be used to determine...

if an athlete should be drafted onto a team or chosen for a certain position

Which psychological skill is visualization and mental rehearsal/practice?

imagery

visualization, mental rehearsal/practice. 90%+ of athletes use this. helps prepare the mind for successful execution of skills.

imagery

Psychological Skills Training is intended to improve ___ and enhance ___

improve performance enhance athletes' enjoyment and satisfaction

quantitative measurement includes what type of variables?

independent and dependent

Which of the following terms least belongs with the others? a. criterion variable b. independent variable c. dependent variable d. response variable

independent variables

Sports analytics provides the __________; it does NOT make the ___________

information; decisions (massive amounts of data available -- but what's useful?)

Start to a screening process:

informed consent

what type of approach is the trait vs. state measurement?

interactionalist

can be open-ended, semi-structured or structures. good rapport with participant. interviewer must remain non-judgmental. tape recorder or notes

interviews

the Behavioral Regulation in Exercise Questionnaire is on a continuum from ___ to ___

intrinsic to extrinsic

What are people doing in the Action stage?

involved in behavior change

Who did best relative to his or her classmates? That is, who had the highest percentile? a. John, whose percentile was 97% on a fitness test b. Karen, who scored 45 correct on a test with 50 questions c. Mike, whose z score was 3.0 on a written health test d. Allison, whose T score was 75 on a soccer test e. It is impossible to determine from what is given.

it is impossible to determine from what is given

Why is the range not often used for statistical analysis? a. It is too large. b. It is the easiest measure to calculate. c. It is not very reliable. d. It takes too long to figure out.

it is not very reliable

is a state stable or unstable?

it is transitory and can potentially change rapidly

Which of the following tests will carry the greatest weight if you simply add the total number of correct items to obtain a total score for the course grade? a. items = 200; mean = 140; standard deviation = 10 b. items = 200; mean = 130; standard deviation = 20 c. items = 200; mean = 150; standard deviation = 30 d. items = 200; mean = 120; standard deviation = 40

items = 200; mean = 120; standard deviation = 40

There is something in S/E psych called trait vs. state measures

just so you know

Transportation and city planning researchers were studying the relationship of

land-use patterns to walking and cycling for transportation

positively skewed distribution

larger tail on right

skills

learned traits based on the abilities that a person has (more sport specific)

negatively skewed distribution

longer tail on left

Sit and reach test popularity associated with the belief that limitations are associated with ____ and may predispose one to injury

low back pain

Flexibility decreases the chance of ___

lower back pain

nominal (gender)

lowest level of classification, mutually excusive

sustained behavior change

maintenance

Surgeon general's report of PA concluded that

many Americans do not get enough PA to promote health and lower the risk of a variety of chronic diseases

Descriptive statistics provide a. mathematical summaries of data b. inferences about population parameters c. generalizations about distributions d. predictions regarding future trends

mathematical summaries of data

The most stable measure of central tendency with interval data is the

mean

Three of the following four things have something in common. Which does not belong with the other three? a. mean b. range c. standard deviation d. variance

mean

When you sum all of the X values of interest and divide by the number of values that you summed, you have calculated a a. mean b. median c. mode d. variance

mean

a goal needs to have the ability to be measured so that you can determine if you are achieving what you set out to do

measurable

standard deviation

measure of degree that individual observations deviate from the mean

standard deviation

measure of variability used with mean

mean

measures of central tendancy that includes all the scores?

Which measure of central tendency is most appropriate for ordinal scale measurements? a. mode b. median c. mean d. either the mode or the mean

median

Which of the following is not affected by the value of every score in the set on which it is calculated? a. mean b. standard deviation c. median d. variance

median

ranking scores on a test from highest to lowest facilitates finding which of the following measures?

median and mode

enhancing the psychological effects of participation in PA, sport, exercise. our minds have an impt effect on our bodies and vice versa.

mental health

Location where interactions among microsystems take place

mesosystem

Which statistic can be used with either continuous, nominal, or ordinal data? a. mean b. median c. average d. mode

mode

if this is for personal use, use what kind of test?

modify standardized procedures consistently

abilities

more innate than skills (more general)

How can a standard z score be changed to a T score?

multiply by 10 and add 50

is performance enhancement restricted to elite athletes?

na fam

What is the relationship btw physical fitness level and poor health outcomes or risk?

negative (-)

A distribution that has many more high scores than low scores is

negatively skewed

How does one describe a distribution that has many more high scores than low scores? a. positively skewed b. negatively skewed c. normal d. rectangular

negatively skewed

With a mean of 12 and a median of 14, we can infer that the distribution is a. positively skewed b. negatively skewed c. normal d. rectangular

negatively skewed

Political party affiliation (Republican, Democrat) is an example of which measurement scale? a. nominal scale b. ordinal scale c. interval scale d. ratio scale

nominal

The numbers on the shirts of football referees represent what level of measurement? a. nominal b. ordinal c. interval d. ratio

nominal

What is another name for the z score table? a. percentile table b. T-score table *c. normal curve table d. symmetrical table

normal curve table

quantitative measurement is ___ in nature

numerical

standard error

one rater uses a rating standard different from that of other raters (errors due to rating on a different standard than other raters; not as common)

qualitative measurement interviews can be _, _, or _

open-ended semi-structured structured

Lining students up according to height without actually measuring height represents what level of measurement? a. nominal b. ordinal c. interval d. ratio

ordinal

The level of measurement that, at best, can rank subjects is

ordinal

the variable BMI group (low, normal, high, very high) would be considered a. continuous b. interval c. ordinal d. nominal e. ratio

ordinal

Which of the following is an alternative name for the term dependent variable? a. outcome variable b. manipulated variable c. X variable d. treatment variable

outcome variable

trials-to-criterion

participant performs a skill until he or she reaches a certain criterion performance; good method for reducing test administration time (esp. useful in a teaching situation bc allows teacher more time to help unskilled people)

In the previous question, what is or are the dependent variable(s)? a. percent body fat b. gender and age c. age and percent body fat d. age

percent body fat

Which of the following does not belong with the other two? a.Tscore=70 b.zscore=2.0 c. percentile = 95

percentile 95

strength of concept

potency

the strength of concept (SD scales)

potency

the mixed methods approach provides what kind of measurement?

precise

no intention to change behavior

precontemplation

stages of change --

precontemplation contemplation preparation action maintenance

preparing for action

preparation

What are people doing in the Preparation stage?

preparing for action

qualitative measurement is focused on

process not product

major measure of mood, providing a link btwn PA and mental health. can be used as state or trait measure. 6 subscales

profile of mood states.

Psychological Skills Training Imagery Self Talk and Goal Setting are all....

psychological skills

intended to improve performance and enhance athletes' enjoyment and satisfaction. usually includes imagery, self-talk, concentration, goal setting and confidence.

psychological skills training

numerical in nature. usually experimental and correlational designs. objectively measured independent and dependent variables. psychological states and traits associated by reliable and valid inventories

quantitative

how does the mixed methods approach provide precise measurement?

quantitative investigation and qualitative data

likert scales are used to assess the degree of agreement or disagreement with statements. this is an example of

quantitative methods

What is the simplest measure of variability to calculate

range

ordinal (olympics)

ranked (higher, lower - no commone measurement between scores)

Reaction-time tests are what level of measurement?

ratio

To make statements such as 8 units is twice as much as 4 units, what level of measurement is required? a. nominal b. ordinal c. interval d. ratio

ratio

Which of the following allows for fractional comparison? a. nominal b. ordinal c. interval d. ratio

ratio

Which of the following is the highest (most sophisticated) scale of measurement? a. nominal b. ordinal c. interval d. ratio

ratio

You want to make the norms as easy as possible for students, parents, administrators, and teachers to understand. How would you report your results?

raw scores and percentiles

benefits of PA on mental health --

reduces stress alleviates anxiety better sleep improve self-esteem decrease depression

A distribution around a given mean and standard deviation that is mesokurtic with no skewness is said to be

regular, bell shaped , normal

chi quare test (ex: gender and drink preference; treatment and death)

relationship between two categorical variables

Golden Rule #9

remember the *Max - Min - Con Principle* (maximizing effort of testing & minimizing error in variance; let participants know specific requirements for testing (do they need to eat before test, etc.))

application

requires using the knowledge you have

Performance tests results should be meaningful to __________ & _____________

researcher & participant

What is work

result of physical effort, W = Fd

general vs. sport-specific is another type of measurement in S/E psych

sah dude

relative scale

scale scored by comparing performance to that of others in the same group (*norm-ref.* approach)

absolute scale

scores are evaluated on a fixed scale; performance is compared with a predetermined standard (*criterion-ref.* approach)

confidence includes --

self-belief self-efficacy

Psychological tests are ___ and ___ specific

situation and sport

If a distribution of test scores has an unusually high score in comparison to the other scores, what shape does it have? a. normal b. skewed left c. skewed right d. uniform

skewed right

Which words might be used to describe the shape of a distribution? a. skewness b. ICC c. variance d. range

skewness

the statistical term for the shape (or symmetry) of a distribution is __________ and the peakedness of a curve is referred to as _______. a. kurtosis; skewness *b. skewness; kurtosis c. kurtosis; normal d. normal; skewness 141

skewness; kurtosis

a learned trait based on the abilities that a person has.

skill

sport-specific

skill

multiple prediction

skinfolds for % body fat would be ?

Microsystems include

social and physical characteristics, Individual, classroom, home

Social environment: more crime, more active or less? Research shows that residents of neighborhoods with more ___ are ___ active

social disorder, less active

What does the Sport Anxiety Scale measure?

somatic and cognitive anxiety

uses a single test to determine reliability by splitting the test into parts

split-halves

does general measurement or sport-specific provide a more accurate/reliable predictor of behavior?

sport-specific

Which of the following represents the nominal scale of measurement? a. achievement on the SA T test b. student enrollment (ID) number c. percentage of body fat from skinfold calipers d. age of the students

student enrollment (ID) number

Which of the following represents the nominal scale of measurement? a. body fat b. test score c. student number d. vertical jump

student number

Imagery helps prepare the mind for...?

successful execution of skills

What are people doing in the Maintenance stage?

sustaining the behavior change

What is the appropriate inferential statistical test for examining the difference between two means? (Assume the data are interval or ratio.) a. t test b. multiple regression c. one-way ANOVA d. Pearson correlation coefficient

t test

synthesis

taking apart info and putting it back together in a way

Golden Rule #2

test bc you believe it will *make a difference* (must have a purpose to test)

Golden Rule #1

test for *things that make sense*

Golden Rule #3

test with a *performance-focused goal* (short-term goals = during training/practice long-term goals = at the end of training program)

qualitative measurement is ___ in nature

textual

4. The most popular youth fitness batteries include health-related fitness tests. Which test battery includes a test of motor fitness? the President's Challenge

the President's Challenge

p > .05 indicates that a. the null hypothesis is rejected b. the alternative hypothesis H1 is accepted c. a type I error has occurred *d. the null hypothesis is retained

the null hypothesis is retained

subjective ratings

the value a rater places on a skill or performance *based on personal observation*

The standard deviation is an indication of

the variability of a set of test scores around the mean

Which of the following is a characteristic of ratio scale measures? a. They indicate rank-only order. b. They have absolute zero values. c. They have arbitrary zero values. d. They indicate only group membership.

they have absolute zero values

Which of the following is a characteristic of ordinal scale measures? a. They indicate rank order. b. They can express ratios. c. They have arbitrary zero values. d. They indicate only group membership.

they indicate rank order

quantitative vs. qualitative measurement is another one in S/E psych

this sucks bye

how is a trait acquired?

through learning or genetics

what is "the fundamental aspect of personality"?

trait

stages of change for exercise and PA is also called

transtheoretical model

Which of the following is an alternative term for independent variable? a. outcome variable b. response variable c. Y variable d. treatment variable

treatment variables

hypothesis testing

used when some comparison is to be made

accuracy-based tests

usually involve the skill of throwing or hitting an object (ex. free throws, goals in soccer, archery, golf putting)

Well-designed sampling methods are most important for obtaining what type of results? a. precise b. interesting c. valid d. reliable

valid

A test cannot be ___ if it is not ___

valid, reliable

truthfulness of a measurement

validity

two continuous variables

value on one variable vary with value on another

With muscular testing performance can ___ with technique

vary

A researcher investigated the relationship between vitamin C (none, 500 mg, 1000 mg) and workers (office, outdoors) on the frequency of colds. What is or are the independent variable(s)? a. colds b. vitamin C c. colds and workers d. vitamin C and workers

vitamin c and workers

Toby thinks that VO2max is affected by age, gender, weight, and physical activity. What is or are Toby's dependent variable(s)? *a. VO2max b. age and gender c. age, gender, and weight d. VO2max and physical activity

vo2 max

location

where on average the data lies

how are psychological states and traits assessed in quantitative measurement?

with reliable and valid inventories

Essential fat is higher for ___ than ___

women than men

One of the precautions for using psychological testing in sport and exercise is that you have to have self-awareness of...

your own qualifications and limitations

Title: Correlation 71. If 30 students take a pretest on the first day of class and 20 of those students take the posttest on the last day of class, what is the correlation sample size (n)?

*a. 20

50. Noting cases at a particular time or place is known as a a. cohort b. case series c. case control d. randomized clinical trial e. community trial

52. Rank-order rating scales are an example of a(n) a. absolute rating scale b. relative rating scale c. numerical rating scale d. checklist

52. The AAHPERD Health-Related Physical Fitness Test was published with a. average norms b. percentile norms c. set cutoff scores d. p values

62. A higher passing standard of the FITNESSGRAM test battery a. serves to reward and acknowledge students b. serves to motivate and challenge students c. emphasizes fitness knowledge d. emphasizes well-being

56. A criterion-referenced test is usually concerned with a. average scores b. cutoff scores c. standardized scores d. percentiles

59. What do accuracy tests typically involve? a. speed b. an object c. power d. total-body movement

59. Which type of validity is most similar to face validity? a. prediction b. content c. construct d. concurrent

64. The approach used to develop criterion-referenced standards that relies on the availability of an external measure of the criterion attribute is a. combination b. empirical c. judgmental d. normative e. standardization

153. When a correlation coefficient is calculated between two different raters, it is a form of a. reliability b. objectivity c. equivalence d. criterion validity

155. The reliability of a test can generally be improved if the number of items is decreased. a. true b. false

159. Objectivity is a special kind of validity. a. true b. false

163. The mean of the error score is always positive. a. true b. false

165. For a measure to be reliable, it must be valid. a. true b. false

66. CRTs are __________ the proportion of people in the population that meet the standard. a. dependent on b. independent of c. based on d. related to

67. The standing long jump measures a. agility b. power c. lower-body strength d. upper-body strength

5. In the process of constructing a sport skill test, why might some of the tentative test items be administered twice? a. to check administration procedures b. to assess reliability c. to check validity d. to provide intercorrelational data

5. What effect does fatigue have on reliability? a. Fatigue increases reliability. b. Fatigue decreases reliability. c. It depends on the length of the test. d. Fatigue has no major effect on test reliability. e. It depends on whether the test is a written test or a motor test.

116. A maximal-effort softball throw is an example of a(n) a. accuracy test b. distance or power test c. repetitive-performance test d. total-body movement test

34. How does the ACTIVITYGRAM differ from the FITNESSGRAM? a. It is easier to use. b. It can be used for all ages. c. It measures typical daily activity. d. The ACTIVITYGRAM is part of the FITNESSGRAM.

36. For which of the following sports would a battery of skill tests be most appropriate? a. archery b. bowling c. golf d. rock climbing

37. If you are going to use a subjective rating scale, which of the following will improve the reliability of your scale? a. Use several judges. b. Rate the student several times. c. Both of the above d. Neither of the above

38. In examining passing rates for the FITNESSGRAM 1-mile (1.6 km) run test, which of the following is true? a. Girls' passing rates improve as they get older. b. Girls' passing rates tend to be higher than boys'. c. Boys' passing rates are above 50% across the age range. d. Girls' and boys' passing rates are similar during the high school years.

39. A physical education teacher correlates the results of an in-class tournament with selected skill tests. This is an example of what type of validity evidence? a. content b. predictive c. concurrent d. construct

39. The reason criterion-referenced standards tend to vary across tests purportedly measuring the same trait is that a. setting a cutoff is not possible b. tests are not the same c. there are number of ways to set a cutoff and experts disagree on the exact level to establish a cutoff d. the real reason is not stated

52. On what principle is the Spearman-Brown prophecy formula based? a. A valid test is generally a reliable test. b. the interclass reliability model c. Longer tests are generally more reliable. d. Objectivity is a special case of reliability.

21. What factors are important in defining physical fitness? a. exercise level and heredity b. physical activity history c. types of fitness testing d. the population to be evaluated

22. Which method would be used to estimate the reliability of four trials of a throwing test? a. interclass b. Pearson product moment c. repeated measures d. alpha

23. Increasing the number of trained raters when measuring a sport skill can most positively affect which measurement concern? a. validity b. feasibility c. relevance d. objectivity

28. Which of the following concerns is all inclusive regarding the use of a particular skill test? a. discrimination b. appropriateness to student group c. measuring important abilities d. validity

3. Physical inactivity in elementary school is said to lead to cardiovascular disease in adulthood. Which type of validity is described? a. content validity b. concurrent validity c. construct validity d. predictive validity

30. Allowing students to practice a skill test will improve a. feasibility b. objectivity c. relevance d. reliability

32. On an agility test, the correlation between the first and second administrations of the test given on two separate days was .60. What does this correlation assess? a. objectivity b. equivalence c. internal consistency d. stability

33. What key concept should be remembered when attempting to test balance, flexibility, and strength? a. sex differences b. general motor ability c. kinesthesis d. specificity

34. For which of the following sports would a battery of skill tests be least appropriate? a. volleyball b. basketball c. tennis d. bowling

35. Wall volley tests correlate well with tournament rankings. To what conclusion does this fact lead? a. Tournament rankings are good tests. b. Wall volley tests are reliable. c. Wall volley tests are objective. d. Wall volley tests are valid.

35. Why is the segmented day used with the ACTIVITYGRAM? a. It improves fitness assessment. b. It is similar to results from most other fitness tests; thus comparisons can be easily made. c. It helps to determine if the student is in the healthy fitness zone on the FITNESSGRAM. d. It helps students recall what they did.

6. What aspect of measurement will be most affected by using three raters instead of one? a. difficulty b. discrimination c. criterion d. validity

154. When a correlation coefficient is calculated between a test and a criterion test, it is a form of a. reliability b. objectivity c. equivalence d. validity

16. Motor educability tests would be most highly related to a. basketball skills b. dancing skills c. football skills d. tumbling skills

61. Rating scales used in the psychomotor domain are most like a. reliability estimates b. norm-referenced evaluation c. criterion-referenced evaluation d. authentic assessment

63. The approach used to develop criterion-referenced standards that uses norm-referenced data to set a standard is a. combination b. empirical c. judgmental d. normative e. standardization

66. An analytic rubric for a performance a. negates the halo effect b. prevents a standard error c. minimizes the effect of central tendency d. reduces the subjectivity of the evaluator

104. The grading process should begin with __________ and end with _________. a. test selection; results compared to standards b. test selection; final grades c. objectives of instruction; results compared to standards d. objectives of instruction; final grades

106. Which grading component makes for a valid instructional outcome? a. attitude b. effort c. leadership d. skills tests

107. Generally, it is desirable for reliability to be at or above a. .05 b. .25 c. .65 d. .80 e. 1.00

109. Objective skills testing includes all but a. accuracy tests b. total-body movement tests c. distance or power performance tests d. knowledge tests

35. If you could obtain only one piece of information about a test, what would give you the most information? a. Pearson product-moment correlation coefficient b. reliability coefficient c. objectivity coefficient d. criterion coefficient e. validity coefficient

36. What is a major difference between norm-referenced reliability and validity and criterion-referenced reliability and validity? a. the size of the values obtained b. the number of students involved c. the interpretation of reliability d. the interpretation of validity e. the types of variables (X and Y) used

categorical independent variable; continuous dependent variable

Differences between means

49. Randomly assigning whole groups to treatments or exposures is known as a a. cohort b. case series c. case control d. randomized clinical trial e. community trial

performance based assessment that occurs before the start of the learning process and is useful for understanding prior knowledge

Diagnostic

8. Which of the following would best describe the criterion measure when examining validity? a. optimal b. consistent c. logical d. concurrent e. truthful

Summative evaluation

Occurs at the end

Air left in the lungs—what is it?

Residual volume

2 tailed (different - less than or greater than)

What's the nondirectional hypothesis of an independent t-test?

1. measure progress 2. provide feedbac 3. assign grade

What's the purposes of a knowledge test?

accuracy, wall volley, power and distance, total bodily movement

What are the four ways to test a sports skill?

skill test, rating scales, performance

What are the three ways of measuring a sports skill test?

evaluating athletes, pre-employment tests

What are the two applications for the Theory of Basic Abilities?

One of the precautions for using psychological testing in sport and exercise is that you have to be able to evaluate...

a test's validity for the sample and situation

sport-specific multidimensional inventories measure...?

a variety of psychological skills for performance success

The term *obesity* is used loosely in everyday conversation; what is an accurate definition of obesity? a) overweight b) overfat c) high body density d) low BMI

b) *overfat*

more innate than skills (general)

abilities

overarm pattern is considered an --

ability

measurements in mental health --

anxiety depression self-esteem self-concept mood anger

What is the Exercise Motivation Scale based on?

the self-determination theory continuum

What is the ultimate goal of evaluation?

to make intelligent decisions

One of the precautions for using psychological testing in sport and exercise is that coaches should not...

be the person giving or interpreting the psychological test (unless they have specific training)

Ecological models are most powerful when they are—

behavior specific

knowledge

being able to answer a question and recall something you ahve memorized is what part of Bloom's taxonomy?

Semantic differential scales use

bipolar adjectives

in semantic differential scales, participants respond to...

bipolar adjectives

Trait / State is stable

trait

Forced categories should not be created because of the potential loss of information. a. true b. false

true

The area to the left of 1 standard deviation from the mean in a normal distribution represents 84% of the total area of the curve. a. true b. false

true

Total score variance minus error score variance leaves you with a. absolute score variance b. no variance c. true score variance d. observed variance

true score variance

The total area to the left of -1 standard deviation from the mean in a normal distribution represents approximately 84% of the total area of the curve. a. true b. false

false

What are people doing in the Precontemplation stage?

have no intention to change behavior

11. A test battery contains the following three items: distance runs, skinfolds, and trunk extension. What does the test battery measure?

health-related physical fitness

Profile of mood link btw:

link between PA and mental health

measurements in performance enhancement --

attentional focus confidence pre competitive anxiety self-motivation imagery

What major advantage do field tests have over laboratory tests? a) greater reliability b) greater feasibility c) greater objectivity d) greater validity

b) *greater feasibility* (field test are more feasible than laboratory test bc they generally do NOT require expensive equipment like lab tests; but this makes lab tests more reliable & valid!; also field test are more subjective bc they are normally more norm-referenced)

Which of the following methods is used when one group of subjects is measured on only two separate occasions? a. chi square b. t test for two dependent groups c. one-way ANOVA d. t test for two independent groups

t test for two dependent groups

Exercise Motivation Scale is based on the ____ theory continuum

self-determination

designed to measure a person's self-motivation to persist. developed to be used with exercise adherence

self-motivation inventory

can be positive or negative. learning and practicing effective ___________ can enhance confidence, focus, performance, etc

self-talk

Measures evaluation, potency, and activity (adjectives)

semantic differential scales

Participants respond to bipolar adjectives what type of scale test

semantic differential scales

clearly defines the goal you are hoping to achieve, and providing ample detail on each component of your goal

specific

SMART =

specific measurable attainable relevant timely

Two things for assessing flexibility requirements

specificity and warm up

Which interclass reliability uses spearman-brown prophecy formula?

split-halves

the key to using __________ ___________ efficiently is to develop a system that works for a given team

sports analytics

Which measure of variability is normally reported with the mean? a. range b. variance c. standard deviation d. correlation

standard deviation

you are discussing the heterogeneity of your tennis class results with one of your fellow teachers. Which of the following terms will you probably use in the discussion? a. standard deviation b. intervally scaled c. percentile d. grouped frequency distribution

standard deviation

if you want to make comparisons with other groups, use what kind of test?

standardized

Trait / State is always changing

state

situational. function of situation or environment in which a person is placed. transitory, potentially rapidly changing

state

what is "the function of the situation or environment in which a person is placed"?

state

The Profiles of Mood States can be used as a ___ measure

state vs. trait

regression

statistical model used to predict performance on one variable from another

1. One problem with many agility tests is that the students' scores are not scattered vary widely. Why is this a problem? a. This indicates low validity for these tests. b. This makes obtaining scores difficult. c. This makes discriminating between good and poor performance difficult. d. This makes constructing norms for the agility tests difficult.

10. All other things being equal, which of the following represents the most valid test? a. R = .90 b. r = .80 c. r2 = .85 d. R2 = .82

100. An evaluation conducted during a training program is called a. affective b. cognitive c. formative d. psychomotor e. summative

108. Which of the following makes for greater reliability? a. zero observed score variance b. large error score variance c. large true score variance d. zero total score variance

4 primary classifications of *objective skills tests*

(1) *accuracy*-based skills tests (2) *repetitive*-performance tests (3) *total body movement* tests (4) *distance* or *power* performance tests (some tests may be combinations of 2 of these)

2 problems with *subjective ratings*

(1) *defining criteria* (2) ensuring *consistency* among *raters* (single raters may be biased! vs. several raters increase reliability, but decrease feasibility (have to hire more raters = more costly test))

consider the following data: 8, 10, 12, 14, 7, 5, 9, 10. What is the mode? a. 9.0 b. 9.4 c. 9.5 d. 10.0

10.0

Title: Correlation 22. What is the general relationship between skinfold measures and hydrostatically determined percent body fat?

*a. high positive

Title: SEE 55. What does the standard error of estimate tell you?

*a. how much error there is in prediction

Title: Regression 101. Four different variables were separately examined to investigate their predictive relationship to Y. Which of the following models shows a better fit to the data?

*a. model 1: 1 − R^2 = .09

Title: Regression 98. Four different variables were separately examined to investigate their predictive relationship to Y. Which of the following models shows a better fit to the data?

*a. model 1: SEE = .009

Title: Regression 100. Four different variables were separately examined to investigate their predictive relationship to Y. Which of the following models shows a better fit to the data?

*a. model 1: sum of squared residuals = 200

Title: Regression 31. Multiple regression indicates

*a. multiple predictor variables

Title: Regression 82. Simple linear regression refers to ______ X variable(s) and ______ Y variable(s).

*a. one; one

Title: Correlation 41. Which symbol is commonly used to indicate a Pearson correlation coefficient?

*a. r

Title: Correlation 24. A positive correlation between two variables indicates that

*a. students scoring low on one variable score low on the other variable

Title: Correlation 58. The Dallas Cowboys computed correlations between players' ratings (determined by the coaches) and four specific performance tests. The correlations with the ratings were as follows: test A = -.87; test B = .75; test C = .62; test D = .57. In selecting potential players, the Cowboys would do best to use which test?

*a. test A

Title: Correlation 67. The sign (plus or minus) of a correlation coefficient indicates

*a. the direction of the relationship

Title: Correlation 106. If a correlation coefficient is negative, the relationship is indirect.

*a. true

Title: Correlation 113. A scatterplot is useful for identifying unusual points in the data.

*a. true

Title: Regression 108. A small SEE would indicate a relatively good fit.

*a. true

Title: Regression 109. Subtracting the coefficient of determination from 1 is helpful if you want to better understand how much lack of fit is in your model.

*a. true

Title: Regression 112. The sum of the residuals is always equal to zero.

*a. true

Title: Correlation 25. An indirect relationship would indicate

*d. a negative correlation

Title: SEE 36. In general, which of the following types of relationship is best to obtain?

*d. a relationship with a low standard error of estimate

Title: Regression 87. The standard error of estimate for the regression line of X and Y can be calculated with the standard deviation of Y and the

*d. correlation coefficient of X and Y

Title: Variables 54. What is the difference between the actual and the estimated variable called?

*d. error

Title: Correlation 63. The PPM correlation coefficient is an index of the

*d. linear relationship between two variables

Title: Regression 99. Four different variables were separately examined to investigate their predictive relationship to Y. Which of the following models shows a better fit to the data?

*d. model 4: R^2 = .98

Title: Correlation 11. What statistic is computed when correlating more than two variables with the criterion?

*d. multiple R

Title: Correlation 64. An indirect relationship is the same as a

*d. negative relationship

Title: Prediction 47. How well does a test of athletic ability with a .12 correlation with batting average in baseball players predict batting success?

*d. not very well

Title: Regression 77. Estimating one variable from another variable is called

*d. prediction

Title: Correlation 65. The most appropriate graph for displaying an X,Y relationship is a

*d. scatterplot

Title: Regression 81. The amount of change in Y for a unit change in X is evaluated with the ___________ of a regression line.

*d. slope

Title: Correlation 44. What does a correlation coefficient of 1.0 mean?

*d. that those who did the best on the first test also did the best on the second test

Title: Correlation 7. What is the major distinction between simple correlation and multiple correlation?

*d. the number of predictors used for the correlation

Title: Variables 51. What does the coefficient of determination tell you?

*d. the shared variance between two variables

Title: Correlation 38. Which of the following is the primary purpose of correlational research?

*d. to study the relationships among variables

Title: Regression 96. A regression analysis yields the equation Y = 3.0 + 20(X) − 30(Z). Considering an X value of 5 and a Z value of 10, what is the predicted value of Y?

*d. −197

Golden Rule #7

*don't* necessarily *rely on previously developed tests*; develop your own (previous test may have been for a specific sport other than the one you are testing for now)

Title: Regression 90. If a regression analysis uses age, sum of skinfolds (SS), SS2, and gender to better understand body density (BD), the dependent variable(s) is or are

*e. BD

Title: Correlation 28. In detecting a strong curvilinear relationship,

*e. a correlation near 0 would be expected

Title: Correlation 79. With correlation,

*e. it does not matter which variable is the dependent variable

Title: Regression 1. What is the major difference between simple regression and multiple regression?

*e. number of predictors used

Title: Correlation 10. If there is a negative relationship between two physical measures,

*e. the value of one measure increases as the value of the other measure decreases

Title: Correlation 66. If values of X are all positive and values of Y are all negative, the relationship between X and Y will be

*e. unapparent until a correlation is computed

Title: Regression 85. The correlation between regression residual and Y should be

*e. zero

Title: Correlation 62. Which of the following correlations displays the strongest relationship?

*e. −.99

Golden Rule #10

*educate* athletes about *testing* (let participants know why they are being tested & if maximal effort is needed for testing)

2 important factors to consider in skills testing (in addition to reliability & validity) is ________ & ________

*feasibility* & *best way to evaluate the skill*

*one problem with distance or power performance tests* is ensuring these tests are performed in a ________-_______ ____________; this will determine whether the test requires or accounts for any correction for ____________

*game-like* manner; *accuracy*

Analysis depends on the purpose of testing: - if individual performance is tested = use ________ - if comparing performance to standard is tested = use __________

*individual* performance = *norm-referenced* analysis performance compared to a *standard* = *criterion-referenced* analysis

Kenyon's Attitudes Toward Physical Activity (ATPA) illustrates the _____________ of attitudes.

*multidimensionality*

What are the two main classifications of techniques for assessing physical activity?

*objective* & *self-report*

___________ measures are *ALWAYS BETTER* than ____________ measures

*objective* > subjective

___________ & ____________ are qualitative questions in nature.

*observation* & *interview* = qualitative

for total body movement tests, _____________ __________ can be used to *reduce or eliminate the speed problem*

*performance ratio*

*subjective ratings* can be developed for individual skills, specifically _______-________ _________; most of the *objective tests* use _________-_________ _____________

*process-oriented components* = subjective ratings *product-oriented components* = objective tests

Likert questions are what type of questions? qualitative or quantitative?

*quantitative*

Badminton serve & volleyball spike both involve similar overarm patterns & are specific _________; but overarm pattern is considered a(n) __________.

*skills*; *ability*

When administering an instrument in the affective domain, you should remember that instruments are often _________ _________ rather than _____________ to all activities.

*sport specific* rather than *generalizable* to all activities

*pretest* duties include...

- *planning* (tester should be familiar with test, items administered, facilities, & equipment/markings) - *expose examinee to test & allow practice* - *order of testing administration* (don't make 2 long or strenuous tests on the same day)

*testing* duties include...

- *prepare test location* as early as possible (reserve location) - address *safety* concerns (bring towels if it rains) - allow *warm-ups* for physical performance tests (avoids injury) - *"hints"* should be given to EVERYONE, if available (avoids bias/standardization of results) - perform actual test

*posttest* duties include...

- *transcribing* the test results & *analyzing* the scores - each time results are transcribed, another person should *proofread* them - analysis depends on the *purpose of testing*

total body movement tests have *HIGH* _____________ bc of large amount of _____-___________ _____________

HIGH *reliability* bc of large amount of *inter-participant variability*

median will always be smaller than the mean

What is always true in a positively skewed distribution

testing

What is not a part of Bloom's taxonomy?

biomechanical

What is not for athletes, but for pre-employment?

Knowledge Comprehension Application Analysis Synthesis Evaluation

What is the acronym for Blooms taxonomy?

it includes every score

What is the advantage of the mean?

not affected by outliers

What is the advantage of the median?

mode

What is the best measurement for categorical (nominal) data?

Psychophsyical (rate frequency) Biomechanical (stress on spine) Physiological (cardio response) Validation (criterion, content, construct)

What is the pre-employment test methodology?

confidence interval

What is the predicted score +/- SE?

to find the probability of seeing the observed difference just from chance

What is the purpose of a p-value?

can never prove the truth of hypothese; provides evidence to support or refute it

What is true of statistical analysis?

no difference in comparison

What is true of the null hypothesis?

coefficient of determination (r^2)

What measures the amount of vairability in one measure that is explained by the other measure?

Assent from minors Consent from parent

What must you get when including minors?

one group t-test (ACT scores of KIN majors vs. ACT scores of LSU pop'l)

What test would you use to find if a sample differs from population or from zero?

reliability

What's an advantage of power and distance test?

throwing, kicking, striking

What's an example of power and distance test?

dribbling

What's an example of total bodily movement tests?

1 tailed (less or more)

What's the directional hypothesis of an independent t-test?

application

What's true about deciding which felxibiilty test to use?

do not reject null (there is no difference/relationship)

If test statistic is greater than your given significance (.05), you....?

reject null and accept alternative (there is a difference)

If test statistic is less than your given significance (.05), you...?

68% CI

mean +/- 1 SE

95% CI

mean +/- 2 SE

99% CI

mean +/- 3 SE

The mean on a test is 200 and the variance was 400 points. What percentage of people scored less than 170? Round to the nearest whole number. a. 7 b. 13 c. 43 d. 87

The *application of statistical analyses to sports* does what 2 things?

(1) *saves decision makers time* by providing all relevant info needed for player evaluation in an efficient manner (2) *provides insight to novel ways* (new methods) when determining the best player suited for their system

3 essential components of *Sports Analytics*

(1) data mangement (2) predictive models (3) information systems

*performance tests* should have what 3 factors?

(1) minimally acceptable *validity & reliability* (2) *low cost* (3) relatively *easy to administer*

3 phases of *Effective Testing Procedures*

(1) pretest duties (2) testing duties (3) posttest duties

2 types of rating scales

(1) relative scale (2) absolute scale

total body movement tests can be administered *quickly*, but have 2 inherent problems which are?

(1) test must *approximate game performance* (max speed is not always required in the game; avoid going out of bounds) (2) *different valid criterions* for *evaluation* & for *performance efficiency*

Halo effect

(most common error) rater elevates or reduces a person's score because of bias (judge will have a predetermined conception of how the performer will perform (usually done more when the judge knows the performer))

Which of the following is used in Likert-type responses? A) multiple-choice responses B) responses that range from strongly agree to strongly disagree C) short-answer responses D) responses that can be perceived as appropriate in sport & nonsport settings

*B) responses that range from strongly agree to strongly disagree*

10 Golden Rules for Testing ____________ _______________:

*Competitive Athletes*

What is the relationship found btw *VO2max* & *time* to complete a *1.5-mile run*?

*HIGH negative (-)*

In your testing and exercise population course, what kind of relationship would you expect to see btw skinfold measures & hydrostatically determined percent fat?

*HIGH positive (+)*

__________ of ________ is critical to keep in mind when examining *domains of human performance*.

*Specificity* of *task*

_________ vs. ___________ responses are what differentiate *state* & *trait* measures.

*Task-specific* vs. *general* responses

__________ of the data is the biggest concern when interpreting survey results in the affective domain.

*Usefulness* of the data

Which distribution is the most homogeneous given the following: (1) mean = 100 and variance = 25; (2) mean = 120 and variance = 36; (3) mean = 90 and variance = 49.

*a. (1)

Title: Correlation 72. If there are 10 values of X and 10 paired values of Y, what is the sample size (n) of the correlation coefficient?

*a. 10

Title: Prediction 5. Scotty is predicted to have a score of 75 on a test. The correlation between the predictor variable (X) and the predicted variable (Y) is .80. Both variables have standard deviations of 10. How likely is it that Scotty's actual score on the Y variable is above 81?

*a. 16%

Title: Prediction 35. Mary is predicted to score 150 on an achievement test in which the mean is 130 and the standard deviation is 20. Approximately how likely is it that her actual score will be above 158 if the coefficient of determination between the predictor and the achievement test is .84?

*a. 2.5%

Title: Regression 97. A regression analysis yields the equation Y = 3.0 + 20(X) − 30(Z). What would the predicted value of Y be if X and Z were both zero?

*a. 3

Title: PASW 74. In PASW, to calculate a correlation coefficient, go to Analyze, Correlate, then

*a. Bivariate

Title: Correlation 21. If a correlation of -.70 were found between body weight and the number of push-ups one can do, which of the following statements would be correct?

*a. Generally, heavy people can do few push-ups.

Title: Regression 83. In PASW, to run a straight-line regression analysis, go toAnalyze, Regression, then

*a. Linear

Title: Prediction 32. Which of the following describes the general relationship between the SEE and the coefficient of determination?

*a. They are inversely (negatively) related.

Title: Correlation 6. The correlation between X and Y is .90. What can be said with certainty about Y in relation to X?

*a. They are related.

Title: Regression 78. If X is found to be highly and positively correlated to Y, all of the following are true except

*a. X causes Y

Title: Regression 92. A regression analysis yields the equation Y = 3.0 + 20(X) − 30(Z). What is or are the dependent variable(s)?

*a. Y

Title: Correlation 27. A correlation of .8 is considered

*a. a high correlation

Title: Correlation 23. The correlation between percent body fat by skinfolds and underwater weighing is .90. A scatterplot of this relationship would look like

*a. a pickle

Title: Correlation 68. The square of the correlation coefficient is called the

*a. coefficient of determination

Title: Prediction 60. A test serves a prediction function well if it

*a. has a strong statistical relationship with a criterion

Title: Correlation 46. Which of the following relationships would be most precisely described with a correlation statistic?

*a. height and weight

Title: Correlation 3. Which of the following best describes the correlation between the12-minute run and VO2max?

*a. high and positive

Title: Prediction 4. Maureen scored 140 on an achievement test in which the mean was 80 and the standard deviation was 20. What might you predict she would score on another test if the correlation between achievement and the other test were .91?

*a. well above the mean

Title: Regression 80. The Y-intercept of a regression line is the value of Y when X is equal to

*a. zero

Title: PPM 13. Between what limits can the Pearson product-moment correlation coefficient vary?

*b. -1.0 and +1.0

Title: Correlation 2. Which of the following correlation coefficients reflects the greatest linear relationship?

*b. .98

Title: Prediction 34. You want to predict success on your final examination based on your midterm score. The prediction equation is Y' = 5(X) + 20. What is your predicted score on the final if you scored 10 on the midterm? The mean on the final is 70, and the standard deviation of the final is 10.

*b. 70

Title: Regression 88. The following can be used for determining regression fit except

*b. SEM

Title: Correlation 19. If a correlation coefficient of 1.15 is found, this would indicate

*b. a computational error

Title: Correlation 56. A negative correlation coefficient indicates low relationship.

*b. false

Title: Correlation 40. What does a correlation coefficient describe about the relationship between two variables?

*b. the direction and strength

Title: Scatterplot 50. What does a scatterplot tell you?

*b. the general nature of the relationship between variables

Title: PPM 53. What does the Pearson product-moment correlation coefficient tell you?

*b. the linear relationship between two variables

Title: Regression 91. If a regression analysis uses age, sum of skinfolds (SS), SS2, and gender to better understand body density, how many predictors are there?

*c. 3

Title: Prediction 37. If predicting Y from X and r = .95 which of the following is most likely to be true?

*c. If you scored high on one variable, you probably scored high on the other variable.

Title: Regression 8. Skinfolds are often used in multiple regression equations to estimate body fatness. What does multiple regression equation refer to?

*c. More than one predictor variable is used.

Title: Correlation 57. The correlation between test A and test B is .34. The correlation between test A and test C is .68. Which of the following statements is correct?

*c. The relationship between tests A and C is higher than that between tests A and B.

Title: Scatterplot 16. The shape of a scatterplot indicating the relationship between shoe size and performance in a graduate statistics class would resemble

*c. an orange

Title: Correlation 76. The PPM correlation coefficient cannot

*c. be interpreted for curvilinear relationships

Title: Correlation 61. If no relationship exists between two variables, a plot of matched points would look like a

*c. circle

Title: Correlation 73. A correlation coefficient describes the ___________ and the ___________ of a linear relationship.

*c. magnitude; direction

Title: Correlation 70. If values of X tend to be random as their paired values of Y tend to decrease, the relationship between X and Y will be

*c. nearly zero

Title: Variance 59. The residual variance best represents

*c. nonpredicted variance

Title: Regression 75. The coefficient of determination can be interpreted as a(n)

*c. percent

Title: Correlation 17. r^2xy indicates the

*c. percentage of variance common to X and Y

Title: Regression 94. A regression analysis yields the equation Y = 3.0 + 20(X) − 30(Z). What is the correlation between the predictor and predicted variable?

*c. positive

Title: Regression 86. The standard error of estimate is also called the

*c. standard error of prediction

Title: Correlation 9. The square of the correlation coefficient indicates

*c. the percentage of variance in common with the two variables

Title: Variables 49. What does bivariate mean?

*c. two variables

in repetitive-performance tests, it is extremely important to make sure test participants are using _________ _________ bc this *helps reinforce game-like situations*

*correct form*

Title: Correlation 45. Which correlation coefficient best describes the relationship between height and weight in healthy individuals across ages 12 to 80?

*d. .80

Title: Regression 103. A regression analysis yields the equation Y = 125.0 − 50(X). Considering an X value of −10, what is the Y-intercept?

*d. 125

Title: Regression 104. A regression analysis yields the equation Y = 125.0 − 50(X). Considering an X value of −10, what is the predicted value of Y?

*d. 625

Title: Correlation 43. Which conclusion should be drawn if the correlation between a physical fitness test and a rating of basketball playing ability is 0.90?

*d. Basketball playing ability and physical fitness are strongly associated.

Title: Correlation 48. What can we conclude if the correlation between length of marriage and marital happiness is .82?

*d. Long marriages tend to be happy ones.

Title: Regression 93. A regression analysis yields the equation Y = 3.0 + 20(X) − 30(Z). What is or are the independent variable(s)?

*d. X and Z

Title: Regression 84. The amount of inaccuracy of regression formula is not represented by

*d. Y-intercept

process-oriented assessments

- focuses on *HOW* movement is performed - development occurs at different times to different body parts - *segmental* or *component* approach

product-oriented assessments

- focuses on performance *outcomes* (completion of movement as a whole) - *norm-referenced* (results are compared to others' results)

3 common errors in rating scales

- halo effect - standard error - central-tendency error

Examples of *Process*-oriented assessments

- hip rotation - arm/leg action - angle of takeoff in a jump - golf swing form - foot placement when kicking a ball

The mean on a test was 200 and the standard deviation was 20 points. What percent of people scored less than 170? Round to the nearest whole number. a. 7 b. 13 c. 43 d. 87

*Beneficial Functions of Performance Testing* (MacDougall & Wenger 1991)

- indicates athletes' *strengths & weaknesses* within sport & provides baseline data for individualized training - provides *feedback on effectiveness of training* - provides information about *current performance status* - educational process to help *better monitor performance*

Skills Test Classifications

- objective tests (ALWAYS BEST!) - subjective ratings (diving, gymnastics) - other tests (measure one trait at a time)

AAHPERD test selection guide indicates what 4 factors?

- skills tests should include *primary skills of the sport* (*no more than 4* skills should be tested) - acceptable reliability & validity - NO *high inter-correlations* (lots of overlap in skills = measuring essentially the same thing) - be able to validly discriminate among performance levels (*construct validity*)

1. Which of the following is preferred when cutoffs are important? a. norm-referenced tests b. criterion-referenced tests c. Both are acceptable. d. It depends on the variability of the score.

...

10. Which of the following are evaluated in a categorical manner? a. norm-referenced tests b. criterion-referenced tests c. standardized tests d. achievement tests

...

11. According to Cureton and Warren, which is not a limitation of CRT? a. Cutoff scores always involve some subjectivity. b. They are evaluated in a categorical manner. c. Misclassifications can be severe. d. Students attaining cutoff may be motivated to improve.

...

12. What is the order of the basic steps in mastery learning? a. pretesting, establishing behavioral objectives, instruction, and evaluation b. establishing behavioral objectives, instruction, pretesting, and posttesting c. establishing behavioral objectives, pretesting, instruction, and posttesting d. pretesting, instruction, establishing behavioral objectives, and posttesting

...

15. The reliability of CRT is best established with indices of a. association b. variability c. dependability d. consistency

...

20. ________ comes to mind when you think of a criterion-referenced evaluation, and ________ comes to mind when you think of a norm-referenced evaluation. a. ratio; ordinal b. percentile; passing c. percentile; people d. minimum; percentile e. minimum; ordinal

...

21. Before allowing students to participate in a gymnastics unit, a teacher administered a safety rules test with the requirement that everyone score above 80%. This is an example of what type of test? a. achievement b. criterion-referenced c. norm-referenced d. standardized

...

22. CRT validity can be established by examining the overlap of two _________ groups. a. approximately equal b. divergent c. somewhat similar d. both a and c

...

23. False positives result when subjects truly meet the criterion, but the field test indicates they did not. a. true b. false

...

24. What statistic is often calculated with criterion-referenced reliability and validity? a. t test b. ANOVA c. chi square d. Pearson product moment e. intraclass reliability

...

25. A criterion-referenced test is used to detect individual differences in ability. a. true b. false

...

26. What level of measurement is most often associated with criterion-referenced measurement? a. nominal b. ordinal c. interval d. ratio

...

5. In its construction, which type of test uses experts' opinions, empirical data, and norms? a. norm-referenced b. criterion-referenced c. achievement test d. standardized test

...

6. An odds ratio can be calculated for a _________ table. a. 1 × 1 b. 1 × 2 c. 2 × 1 d. 2 × 2 e. 3 × 2

...

7. Which of the following is not a basic approach for developing criterion-referenced standards? a. behavioral b. empirical c. judgmental d. normative

...

8. Which of the following statements about CRT is false? a. It is used to classify subjects as master or nonmaster. b. It is used when specific, well-defined goals are identifiable. c. It is limited to nominal measurement. d. It is well suited for programmed instruction centered on behavioral objectives.

...

9. How should criterion-referenced standards be specified? a. by setting an arbitrary percentage for the cutoff score b. by defining a class or domain of tasks that should be performed by the individual c. by developing standard scores based on the mean and standard deviation d. by rank-ordering the scores and establishing cutoffs based on rankings

...

Kelly's T score was 60. What is her z score?

1.0

Lean tissue average density

1.10 g/cm cubed

Karen's T score was 65. What is her z score? a. 0.5 b. 1.5 c. 2.0 d. need more information

1.5

Assume that resting heart rates are normally distributed with a mean of 80 beats per minute and a standard deviation of 8 beats per minute. What is the z score of a heart rate of 95 beats per minute? a. −1.88 b. 1.88 c. −2.00 d. 2.00

1.88

After administering a test, your instructor decided to develop norms for it. Year1:N=60;ΣX=560;ΣX2 =7880;SD=1;M=14. Year 2: N = 40; ΣX = 450; ΣX2 = 4905; SD = 3; M = 10. What is the mean of the combined classes? Round to the nearest tenth. a. 10.1 b. 11.1 c. 12.0 d. 13.0

10.1

The best estimate of the median of the measurements 5, 10, 7, 11,8,18,13,and12is a. 11 b. 11.5 c. 10 d. 10.5

10.5

if a normal distribution of scores has a mean of 47 and a standard deviation of 11, what range of values would contain 99.74% of the scores? a. 14 to 80 b. 25 to 69 c. 30 to 64 d. 36 to 58

14-80

The average number of points scored by NBA teams last year was 100 per game. The standard deviation was 30 points. Ninety-five percent of the games had NBA teams scoring fewer than how many points? Round to the closest whole number.

150

PA guidelines for Americans (moderate activity, big activity, muscle)

150 min/week moderate activity, 75 min/week vigorous activity, 2 days of muscle strengthening per week

Approximately what percentile is associated with a raw score located 1 standard deviation below the mean in a normal distribution? a. 40 b. 35 c. 22 d. 16

If a normal distribution of scores has a mean of 70 and a standard deviation of 10, approximately what percentage of students scored below 60? a. 2.5 b. 16.0 c. 50.0 d. 84.0 e. 97.5

16.0

What is the variance of a distribution having the following statistics: N = 25; ΣX = 125; ΣX2 = 1025? a. 13 b. 15 c. 17 d. 19

In the Self-Motivation Inventory, if you're giving change a thought now and then but not doing activity, you are in Stage...

In a fairly large, normal distribution with a mean of 15.0, two- thirds of the cases fall between the score points 12.5 and 17.5. The standard deviation can be estimated to be close to a. 1.0 b. 2.5 c. 4.0 d. 6.25

2.5

The mean on a test was 50 and the standard deviation was 20. What percent of people scored between a z score of .80 and a raw score of 68? a. 2.78 b. 18.41 c. 21.19 d. 28.81

2.78

Consider the following data: 8, 10, 12, 14, 7, 5, 9, 10. What is the standard deviation? a. 2.8 b. 8.0 c. 9.0 d. 10.0

2.8

How many students out of 100 would be between the z scores of .33 and .94?

Consider the following: mean = 196, standard deviation = 14, and N = 17. What is the raw score for a z score of 0.35? a. 207 b. 205 c. 203 *d. 201

201

Using the scores 1, 3, 2, 1, 3, find (ΣX2)/2 + (ΣX)2 × 2

212

What score from a distribution having a mean of 30 and a standard deviation of 5 corresponds to a z score of -1.5? a. 22.50 b. 25.00 c. 30.00 d. 37.50

22.50

Scholastic Aptitude Test (SAT) scores are normally distributed with a mean of 500 and a standard deviation of 100. A student's SAT score is 430. What is the student's percentile? Round to the nearest whole number.

T score = 43. What is the percentile? a. 24.20 b. 25.80 c. 34.13 d. 43.06

24.20

What percentage of the cases would you find in a distribution between the median and Q3? a. 25% b. .34% c. .50% d. unable to determine from given information

25%

what percent of the cases would you find in a distribution between the median and P75?

25%

What is the standard deviation of a distribution having the following statistics: N = 25; ΣX = 125; ΣX2 = 1025? a. 2 b. 3 c. 4 d.5

What T score value is associated with a score of 15 from a distribution having a mean and standard deviation of 25 and 10, respectively? a. 35 b. 40 c.45 d. 60

On a knowledge test, a score of 10 is at the 40th percentile. What does this indicate

40 percent of the test scores were at or below 10

. SAT scores are normally distributed with a mean of 500 and a standard deviation of 100. A student's SAT score was at the 30th percentile. What was her raw score? Round to the nearest whole number. a. 416 b. 426 c. 448 d. 466 e. 486

448

Put score limits on 95 percent of the scores when the 84th percentile is a score of 60 and the 16th percentile is a score of 50. a. 50 to 60 b. 45 to 65 c. 40 to 70 d. 35 to 75

45 to 65

With a mean of 27.1 and a standard deviation of 3.9, the T scores for the raw scores 26 and 32 are a. 36 and 65 b. 42 and 59 c. 46 and 61 d. 47 and 63 e. 49 and 60

47 and 63

In the Self-Motivation Inventory, if you have maintained the new habit for more than 6 months, you are in Stage...

What is the standard deviation of the combined classes? Year 1: N=60;ΣX=560;ΣX2 =7880;SD=1;M=14.Year2:N=40;ΣX= 450; ΣX2 = 4905; SD = 3; M = 10. Round to the nearest whole number. a. 2.0 b. 3.0 c. 4.0 d. 5.0

5.0

PMC starts to develop around age

For a 100-point test, the cutoff for an F is 60 points. With a mean of 74 and standard deviation of 10, how many students out of 100 would receive an F? a. less than 1 b. 8 c. 15 d. more than 20

Consider the following data: 8, 10, 12, 14, 7, 5, 9, 10. What is the variance? a. 2.8 b. 8.0 c. 9.0 d. 10.0

8.0

If IQ scores are normally distributed with a mean of 100 and a standard deviation of 15, what percentage of the population has IQs between 85 and 130? a. 34.13 b. 68.26 c. 81.85 d. 95.44

81.85

What percentage of the normal curve lies between -1.53 and +1.37 standard deviation units from the mean?

85.17

Consider the following scores: X = 11, 10, 9, 8, 7, 6, 5; and f = 4, 5, 5, 4, 3, 3, 1. What is P50? a. 7 b. 8 c. 8.5 d. 9 e. 9.5

Consider the following data: 8, 10, 12, 14, 7, 5, 9, 10. What is the range? a. 2.8 b. 8.0 c. 9.0 d. 10.0

9.0

Consider the following data: 8, 10, 12, 14, 7, 5, 9, 10. What is the mean? a. 9.0 b. 9.4 c. 9.5 d. 10.0

9.4

Consider the following data: 8, 10, 12, 14, 7, 5, 9, 10. What is the median? a. 9.0 b. 9.4 c. 9.5 d. 10.0

9.5

The z score of 1.56 is what percentile? a. 44.06 b. 94.06 c. 45.71 d. 92.71

94.06

What percent of observations lies between -1.96 and +1.96 standard deviations?

95%

What percentage of the normal curve lies between -2.0 and +2.0 standard deviation units from the mean? a. 47.72 b. 68.26 c. 86.64 d. 95.44

95.44

The average athlete is able to begin activity 90 days after having a knee operation. The standard deviation is 20 days. Sixty-eight percent of athletes are able to participate within how many days? Round to the nearest day.

99 days

Given that variance = 100 and P16 = 985, which of the following is probably the mean? a. 965 b. 975 c. 995 d. need more information

995

performance ratio = ________________ / ________________

= *performance* time / *movement* time (ex. = dribbling time / time to finish course)

1. What are criterion-referenced standards for health-related youth fitness tests based on? a. analysis of health-related empirical data b. tester's opinion of age-related health status c. minimum skill levels d. comparison among peers

108. Accuracy is a valid objective for skills testing. a. true b. false

112. The three types of interclass reliability are a. test-retest, equivalence, and split-halves b. test-retest, equivalence, and standard error of measurement c. Spearman-Brown, equivalence, and standard error of measurement d. Spearman-Brown, standard error of measurement, and split-halves

113. Giving subjects two trials of the same test is an example of a. test-retest b. equivalence c. split-halves d. internal consistency

117. The resulting reliability coefficient from a split-halves procedure should be a. adjusted using the Spearman-Brown formula b. adjusted using the intraclass coefficient c. adjusted using the standardized formula d. interpreted as is

12. Ms. Goodtester relates tennis tournament standings to the three 30 s trials of the tennis wall volley. For what is she collecting evidence? a. concurrent validity b. content validity c. interclass reliability d. uncertain from the information given

12. What is the best reason for examining pertinent literature before constructing a performance test? a. to determine if a suitable test already exists b. to determine the abilities to be measured c. to locate tentative test items d. to locate norms for comparison

126. Which of the following is not a component of health-related fitness? a. speed b. aerobic capacity c. musculoskeletal function d. all of the above are components of health-related fitness

127. The primary purposes of human performance analysis are selection, classification, and diagnosis. a. true b. false

128. A skill test involves passing a volleyball over a rope 8 feet above the floor and into a square 4 by 4 feet. A student is allowed 10 passes that are scored 0 or 1. The biggest problem of this test is a. feasibility b. objectivity c. relevance d. reliability

138. The logical decision to include certain items in a test is an application of a. content validity b. construct validity c. criterion validity d. internal consistency

148. A theoretical construct a. can be indirectly measured b. cannot be measured c. should be measured twice and then averaged d. can be directly measured

149. Sport leadership is a construct. a. true b. false

15. Which of the following types of validity evidence is based almost entirely on rational analysis? a. content b. predictive c. concurrent d. construct

16. A test battery contains the following three items: distance runs, skinfolds, and trunk extension. What does that test battery measure? a. health-related physical fitness b. cardiovascular efficiency c. muscular endurance d. motor fitness

164. Error scores contribute relatively little to the observed variation. a. true b. false

168. Interclass reliability measures consistency between scores. a. true b. false

169. Intraclass reliability measures consistency both between and within scores. a. true b. false

170. Test-retest reliability is an interclass method. a. true b. false

172. The Kuder-Richardson Formula 20 (KR20) is used to measure internal consistency of a test when items are scored right or wrong (0,1). a. true b. false

174. The standard error of measurement is used for estimating a score range for a. an individual b. a sample c. a population d. a specific group

18. What major advantage(s) do intraclass reliability models have over interclass models? a. unlimited trials b. can use more people c. both a and b d. neither a nor b

18. Which of the following is the major factor preventing the measurement of maximal oxygen uptake in a high school physical education class? a. feasibility b. objectivity c. reliability d. validity

19. Which method of determining the reliability should be used with a six-item softball throw for distance administered on one day? a. alpha coefficient b. equivalence reliability c. test-retest d. stability

20. What is the general relationship between the standard error of measurement and test reliability? a. They are inversely related. b. They are directly related. c. They are unrelated.

20. Which sport skill would be most efficiently assessed through having students participate in the sport, score the performance in the usual manner, and obtain an average score? a. archery b. tennis c. volleyball d. soccer

23. What is the relationship between the true score variance and the reliability of a test? a. They are directly related. b. They are inversely related. c. They are unrelated. d. unable to determine

23. Which type of test is least likely to appear in a test battery for health-related fitness? a. agility b. endurance c. flexibility d. strength

27. What assumption underlies the use of general motor ability tests? a. Certain basic motor skills are involved in a variety of physical tasks. b. Motor skills are specific to different physical tasks. c. Students can be classified on the basis of specified skills. d. The potential for learning motor skills can be measured.

27. What is the term that refers to using SAT scores in making decisions for academic eligibility for college athletes? a. predictive validity b. generalizability (from high school to college grades) c. relevance d. construct validity

29. Subjects were weighed underwater and then skinfold measures were taken to determine the regression equation for calculating percent body fat. The Pearson product-moment correlation between actual body fat (as determined from underwater weight) and the skinfold estimate of percent body fat represents what type of validity evidence? a. concurrent b. construct c. predictive d. logical

29. What does kappa consider that proportion of agreements (P) does not? a. chance agreement b. level of measurement c. uneven column marginals d. uneven row marginals

3. Adequate practice trials by children taking a fitness test will improve a. reliability b. relevance c. validity d. feasibility

33. The Spearman-Brown prophecy formula is used in conjunction with which method of calculating reliability coefficients? a. split-halves b. test-retest c. equivalent forms d. Kuder-Richardson formula 20

39. Insufficient planning and organization before and during the administration of a test will have what effect? a. lower test validity and reliability b. lower test reliability and objectivity c. lower test validity and objectivity d. impossible to determine from what is given

39. What was the difference between the National Children and Youth Fitness Studies I and II? a. The studies produced normative data on different ages of children. b. The studies produced criterion-referenced standards for different ages of children. c. NCYFS I examined only males. d. NCYFS II examined only males.

43. What is a major problem with skill tests found in the back of skill textbooks? a. They often have not been well validated. b. They do not have norms associated with them. c. They are too time consuming. d. They are not objective in nature.

43. Which of the following is the most important to remember when selecting a test? a. A valid test will have a degree of reliability. b. A reliable test will have a degree of validity. c. Interdependency of test items increases test validity. d. A subjective test tends to be reliable.

44. AAHPER stands for a. American Alliance of Health, Physical Education and Recreation b. American Association of Health, Physical Education and Recreation c. Associated Alliance of Health, Physical Education and Recreation d. Associated American of Health, Physical Education and Recreation

44. What aspect of a test would increase its predictive ability? a. It has a strong statistical relationship with a criterion. b. It contains items that appeal to common sense. c. It is stable over time and situations. d. It contains items that correlate highly with one another

45. The National Children and Youth Fitness Study was conducted by the a. Office of Disease Prevention and Health Promotion b. National Center for Health Statistics c. President's Council on Physical Fitness, Sports & Nutrition d. Cooper Institute for Aerobics Research

45. What best describes construct validation? a. relating a measure with numerous theoretically related measures b. relating a measure with a particular criterion c. examining the content representativeness of the items d. accurately estimating the percentage of students who complete a unit of instruction

47. The Youth Risk Behavior Survey is conducted by the a. Centers for Disease Control and Prevention b. National Center for Health Statistics c. President's Council on Physical Fitness, Sports & Nutrition d. Cooper Institute for Aerobics Research

48. A highly valid written test would most likely display which of the following characteristics? a. accurate measurement of the ability being appraised b. consistent measurement of the ability being appraised c. internal consistency of the test items d. freedom from subjective factors

50. The most appropriate order for constructing a flowchart for motor performance tests is a. review criteria of good tests, review literature, select test items, establish procedures b. analyze sport to be tested, review criteria of good tests, establish procedures, pilot study c. review literature, analyze sport to be tested, select test items, establish procedures d. review criteria of good tests, select test items, analyze sport to be tested, establish procedures

51. Longitudinal, generally long-term tracking of populations is known as a a. cohort b. case series c. case control d. randomized clinical trial e. community trial

53. What are norm-referenced standards most appropriate for? a. comparing students b. identifying students' weaknesses c. motivating students d. predicting students' performance

53. Which of the following terms refers to the ratio of risks between the exposed or unexposed populations? (This statistic is calculated with incidence measures.) a. relative risk b. odds ratio c. absolute risk d. attributable risk

55. President's Challenge and FITNESSGRAM use health-related criterion-referenced fitness standards. a. true b. false

55. The use of a criterion measure is involved in the assessment of which two types of validity? a. predictive and concurrent b. construct and content c. concurrent and content d. predictive and construct

56. Statistical is to criterion-related validity as logical is to ______-related validity. a. content b. predictive c. concurrent d. construct

60. How are wall volley tests typically validated? a. using concurrent validity b. using construct validity c. using predictive validity d. using content validity

60. The President's Challenge test battery includes a. 1-mile run b. PACER c. trunk lift d. bench press

60. Two different instructors administer the same test to the same group of students (at different times), and the results of the two test administrations are compared. What is being measured? a. objectivity b. regression c. correlation d. validity

61. A criterion-referenced test is constructed to yield measurements that are directly interpretable in terms of specific performance standards. a. true b. false

66. The Presidential Active Lifestyle Award is based on a daily activity goal of 60 minutes per day, at least 5 days of week, for 6 weeks. a. true b. false

66. Which of the following is the basis for developing the other three coefficients? a. Pearson product-moment correlation coefficient b. reliability coefficient c. objectivity coefficient d. validity coefficient

68. CRT validity is usually established with some type of a. concurrent criterion b. confounding variable c. moderating variable d. stability

68. Describe the general relationship between the SEE and the coefficient of determination. a. They are inversely (negatively) related. b. They are directly (positively) related. c. There is no relationship between them. d. It can be high or it can be low, depending on the situation.

68. The rationale for excluding the 20-free-throws test in a basketball skill battery is based on which of the following criteria? a. feasibility b. objectivity c. reliability d. content validity e. construct validity

69. Cutoff scores involve some subjective judgment. a. true b. false

69. The Brockport Physical Fitness Test is a health-related physical fitness test for youth with various disabilities. a. true b. false

71. The ACTIVITYGRAM uses questionnaire items from the Youth Risk Factor Behavior Survey. a. true b. false

72. If the variability of the scores for males exceeds that of females, but both have identical reliabilities, what is true about the standard errors of measurement for males and females? a. The SEM for males will be greater. b. The SEM for females will be greater. c. The SEMs will be identical. d. One needs to know means to determine the SEMs.

74. A skills test should require a sufficient number of trials to obtain a reasonable measure of performance. a. true b. false

76. When developing a skills test to use for comparisons, a standardized test should be considered. a. true b. false

76. With CRTs, the equivalence is whether the two tests result in equivalent classifications for the individuals being assessed. a. true b. false

77. Skills tests should yield scores that provide for diagnostic interpretation whenever possible. a. true b. false

77. Which type of reliability comes to mind when you think of the Pearson product-moment correlation? a. interclass b. alpha coefficient c. split-halves d. equivalent forms

78. The phi coefficient is the same as Pearson product-moment correlation between two dichotomous variables. a. true b. false

8. The appropriate development of a rating scale that assesses students' sport skills helps reduce what potential measurement error? a. subjectivity b. unequal environmental factors c. motivation d. learning

80. What happens to the standard error of measurement as the test reliability goes up? a. It goes down. b. It goes up. c. It depends on the type of test. d. It depends on the validity.

81. A contingency table of a chi-square test assessing the association between passing and failing a field test performed on two different days would determine a. reliability b. mean change c. validity d. objectivity

83. If the purpose of the skills test is to compare an individual's performance with a standard, use ____________-referenced analysis. a. criterion b. norm c. standardized d. achievement

83. PASW can produce chi-square, phi, and Kappa values by running the ____________ procedure. a. crosstabs b. correlation c. descriptives d. regression

85. Which type of validity evidence is normally assessed qualitatively? a. content b. concurrent c. predictive d. construct

87. Ensuring that test items are representative of the performances to be analyzed is important when developing a psychomotor test. a. true b. false

89. Having other experts review your test battery can be very beneficial when developing a psychomotor test. a. true b. false

89. The phi coefficient ranges from a. −1 to +1 b. 0 to 1 c. 0 to 100 d. 0 to 1000 e. It does not have a specific range.

89. What happens to test reliability as the true score variance increases? a. It goes up. b. It goes down. c. It depends on the variables being analyzed. d. Observed score increases. e. Error score increases.

9. The most popular youth fitness batteries include health-related fitness tests. Which test battery includes a test of motor fitness? a. Eurofit b. FITNESSGRAM c. Healthy People 2010 d. Canadian Standardized Test of Fitness

9. When constructing a basketball skill test, the teacher develops 10 items and wants to reduce this number to 4. Which items should be retained for the final test? a. those that correlate highest with the criterion and lowest among themselves b. those that correlate lowest with the criterion and highest among themselves c. those that correlate highest with the criterion and highest among themselves d. those that correlate lowest with the criterion and lowest among themselves

91. The Kappa coefficient is similar to phi but corrects for a. chance agreement b. sample size c. number of items d. correlation

92. The Kappa coefficient ranges from a. −1 to +1 b. 0 to 1 c. 0 to 100 d. 0 to 1000 e. It does not have a specific range.

92. What happens to the standard error of measurement as error variance goes up? a. It goes up. b. It goes down. c. It depends on the reliability. d. It depends on what is being measured.

94. After developing a criterion-referenced test, it is useful to establish a. cutoffs b. items c. norms d. percentiles

Which is NOT a level in the transtheoretical stages of the change model? A) attitude B) contemplation C) action D) precontemplation

A) attitude is NOT a level (*precontemplation* --> *contemplation* --> *preparation* --> *action* --> *maintenance*)

severity of disease (mild, moderate, severe)

An example of ordered qualitative data

ventilary suppport (noninvasive, oscillatory)

An example of unordered qualitative data

100. Of the following scores, the one that is actually known is the a. average score b. observed score c. error score d. true score

102. Grading on knowledge of diseases in a health class is associated with which domain? a. affective b. cognitive c. effective d. psychosocial e. psychomotor

105. An instructional objective a. is discriminating b. allows students equal chance to demonstrate their ability c. does not allow mastery d. is very difficult

113. Total-body movement tests usually have _______ reliability because a large amount of interindividual variability is associated with timed performances. a. low b. high c. undetermined d. inconsistent

114. If the period between a test and a retest is long and the reliability is good, the test is said to be a. equivalent b. stable c. valid d. not valid

115. Giving subjects two different trials of two different but parallel tests is an example of a. test-retest b. equivalence c. split-halves d. KR20

115. If movement time is measured by the performance ratios, slower players are penalized. a. true b. false

124. When calculating Cronbach's alpha, the numerator of the right-side term represents the a. number of trials b. sum of the variance of each trial c. variance for the sum across all trials d. total variance

125. Motor educability indicates the ability to learn a specific motor skill. a. true b. false

128. The best estimate of a person's true score is the a. error score b. obtained score c. highest score d. lowest score

146. Correlating past scores from an alcohol consumption questionnaire with future scores from a graded exercise test is an application of a. content validity b. predictive validity c. concurrent validity d. construct validity

15. The FITNESSGRAM uses a. one standard of achievement for each test b. two standards of achievement for each test c. three standards of achievement for each test d. percentiles e. standard scores

150. Upper-body strength is a construct. a. true b. false

17. Three different predictive validity tests were administered to a group of prospective students. Scores on each of the tests were subsequently correlated with a measure of students' performance. The results were as follows: test 1 with performance r = .13; test 2 with performance r = .48; test 3 with performance r = .68. How do the tests rank in predictive validity? a. Test 1 is best; test 2 is next; test 3 is poorest. b. Test 3 is best; test 2 is next; test 1 is poorest. c. Test 3 is best; test 1 is next; test 2 is poorest. d. Test 1 is best; test 3 is next; test 2 is poorest.

171. Cronbach's alpha and KR20 will always yield the same validity coefficient when items are scored right or wrong (0,1). a. true b. false

173. Both the standard error of measurement and the standard error of estimate are reliability statistics. a. true b. false

175. The validity of .90 is better than the validity of −.90. a. true b. false

2. A test contains 60 items and has a reliability of .50. What would the reliability of the test be if there were 180 items? a. .66 b. .75 c. .80 d. .90

2. Tests such as sit-ups and push-ups a. are positively related to body weight b. are negatively related to body weight c. are unrelated to body weight d. are positively related to BMI e. are unrelated to BMI

2. What aspect of a test designed to measure softball ability could be most seriously questioned if another student pitches the ball to the examinee? a. feasibility b. reliability c. norm values d. validity

21. Which of the following is not a prescribed use of tests of basic abilities? a. diagnosis b. grading c. prediction d. program evaluation

22. A comparison of health educators' and high school students' scores on a sex education test revealed that the mean score for the educators was higher than that of the students. What type of validity evidence does this support? a. concurrent validity b. construct validity c. content validity d. face validity

24. Motor fitness and physical fitness are essentially the same construct. a. true b. false

25. The domain of balance is _____ and _____. a. multidimensional; task general b. multidimensional; task specific c. unidimensional; task general d. unidimensional; task specific

26. What test item is most often found in sport skill tests for sports that use some type of ball? a. serving speed b. wall volley test c. dash test d. passing test

27. What is the ACTIVITYGRAM based on? a. a survey instrument provided by the ACSM b. a segmented-day approach c. participation in physical education classes d. accelerometer monitoring

3. What is the primary reason for grouping students of similar abilities together? a. It increases positive attitudes. b. It increases teaching efficiency. c. It decreases the number of teachers needed. d. It decreases equipment costs.

31. Which of the following test items would most likely be included on a motor educability test? a. flexibility test b. novel stunt c. written question d. balance task

32. Which of the following activities is more frequently evaluated with rating scales? a. badminton b. dance and rhythms c. softball and baseball d. volleyball e. golf

33. Which source is the best for items related to testing fitness levels in special children? a. FITNESSGRAM b. Brockport Physical Fitness Test c. Rikli and Jones' fitness test

34. Which of the following terms is not normally associated with criterion-referenced measurement? a. categorical b. percentile c. index of dependability d. absolute

35. Which of the following terms is not normally associated with criterion-referenced measurement? a. absolute b. performance comparison c. standards d. misclassification

36. The FITNESSGRAM Physical Activity Questionnaire is based on the following questions except a. strength activity b. anaerobic activity c. aerobic activity d. flexibility

37. In grades 9 through 12 the majority of children are enrolled in physical education courses. a. true b. false

38. The relationship between scoring average in high school and college is calculated for a group of basketball players. This is an example of what type of validity evidence? a. content b. predictive c. concurrent d. construct

38. Which of the following is the primary weakness of judges' evaluating skill performance? a. feasibility b. objectivity c. reliability d. criterion

4. What type of validity evidence is probably described for skinfold assessment? a. content b. concurrent c. construct d. predictive

40. You are interested in developing an "easier" way of testing one's intelligence (without having to take a 1-hour intelligence examination). Which type of validation procedure should you use? a. content validity b. concurrent validity c. construct validity d. predictive validity

42. The name of the president who established the President's Council on Physical Fitness, Sports & Nutrition is a. Washington b. Eisenhower c. Clinton d. Bush

42. Which of the following types of reliability cannot be calculated with criterion-referenced tests? a. stability b. internal consistency c. equivalence d. they can all be calculated

44. Disease information in an epidemiological study is referred to as a. mortality b. morbidity c. incidence d. prevalence

44. Of the following, which is least likely to appear in a large number of tests that assess sport skills? a. assessment of accuracy b. measure of balance c. total-body movement test d. measure of power

5. Award structures for physical fitness tests were traditionally based on ______ but are now often based on ______ a. performance; circumstances b. percentiles; predetermined criterions c. people; correlations d. performance; conduct

50. What problem is caused by tests that produce scores having very little variability? a. The scores lack objectivity. b. The results do not discriminate between good and poor performance. c. Constructing norms for the test is difficult. d. Determining poor items on the test is difficult.

54. Judge C consistently rates the performers higher than judges A and B do. This is an example of which common error in rating scales? a. halo effect b. standard error c. central tendency error d. numerical error

54. Which of the following terms refers to the estimate of relative risk used in prevalence studies? a. relative risk b. odds ratio c. absolute risk d. attributable risk

54. _________________ was the first nationally recognized fitness test battery that used health-related criterion-referenced fitness standards. a. AAHPERD Health-Related Physical Fitness Test b. FITNESSGRAM c. President's Challenge d. Kid's FIT

56. Female track athletes are placed into a specific track event based on their performance in motor performance tests. This is an example of a. selection b. classification c. diagnosis d. prediction

56. Normative data on youth fitness scores are useful for all but a. evaluating a program b. identifying excellence in achievement c. identifying the current status of individuals d. establishing health risk behavior

57. Criterion-referenced tests are used to make __________ decisions. a. averaged b. categorical c. comparative d. standardized e. long-term

60. The FITNESSGRAM Healthy Fitness Zone standards were established based on norm-referenced standards. a. true b. false

64. Which of the following is not a prescribed use of tests of basic abilities? a. diagnosis b. grading c. classification d. selection

65. Which of the following physical fitness awards from the President's Challenge program recognizes children who reach a high standard of performance (85%) on all five test items? a. National b. Presidential c. Participant d. Champion

68. To increase the reliability of fitness testing of youth, you should not allow students to practice the skills involved with the test. a. true b. false

7. A skill test involves passing a volleyball over a rope 8 feet above the floor and into a square 4 by 4 feet. A student is allowed 10 passes that are scored 0 or 1. This test has a high degree of a. feasibility b. objectivity c. relevance d. reliability

79. Consider the following: Mike's score = 100; mean score = 150; Sy = 100; rxx' = .91. What range of scores would contain Mike's true score 68% of the time? a. 0 to 200 b. 70 to 130 c. 40 to 160 d. 150 to 250

82. Posttest duties include a. recording the data b. analyzing the data c. developing standardized instructions d. consulting with experts

86. The proportion of agreement ranges from a. −1 to +1 b. 0 to 1 c. 0 to 100 d. 0 to 1000

86. When developing a psychomotor test, you should not waste time reviewing the literature. a. true b. false

87. The phi coefficient can be computed for a. averaged data b. dichotomous data c. continuous data d. any type of data

87. Which reliability coefficient would indicate the greatest measurement reliability? a. 0.0 b. 1.0 c. 10.0 d. 100

88. What value will a reliability coefficient have when no measurement error is present? a. 0.0 b. 1.0 c. 10.0 d. 100

90. Pilot testing, after a test is finalized, is an important step in developing a psychomotor test. a. true b. false

90. The Kappa coefficient can be computed for a. averaged data b. dichotomous data c. continuous data d. any type of data

91. Validity estimates are appropriate only for the _________ being tested. a. abilities b. groups c. skills d. knowledge

91. What is true about the index of reliability? a. It will be close to zero with high reliability. b. It indicates the relationship between observed and true scores. c. It is the same as the standard error of measurement. d. It can be used only with the interclass reliability.

93. Kappa can be used for interobserver agreement, test-retest, and a. average agreement b. criterion agreement c. objectivity d. none of these

11. Which of the following statements is most correct regarding general motor ability tests? a. A general motor ability test is a good predictor of athletic success. b. General motor ability test scores correlate moderately with specific sport skill test scores. c. General motor ability test items measure specific sport skills. d. The composite score criterion is an excellent way to validate general motor ability tests.

110. The challenge of accuracy tests is reliability and _______________. a. feasibility b. stability c. validity d. measurement

110. The two types of reliability coefficients are a. intravert and extravert b. interclass and extraclass c. interclass and intraclass d. intraclass and extraclass

111. The two types of reliability coefficients are based on a. correlation coefficients and regression analysis b. t tests and analysis of variance c. correlation coefficients and analysis of variance d. t tests and regression analysis

114. A performance ratio is a method used in adjusting performance times for subject _____________. a. age b. height c. speed d. weight

116. Correlating the results of the odd items with the even items is an example of a. test-retest b. equivalence c. split-halves d. validity

117. __________ is scored by comparing performance to that of others in the same group. a. Absolute scale b. Discrete scale c. Relative scale d. Percentile scale

119. If you want to determine the reliability of a test administered three different times, you should calculate a(n) a. correlation coefficient b. interclass coefficient c. intraclass coefficient d. determination coefficient

12. What type of test was the AAHPER Youth Fitness Test? a. health-related fitness b. motor reducibility c. motor fitness d. a and c

121. Cronbach alpha is used as a measure of a. test-retest reliability b. construct validity c. internal consistency d. validity

122. When scores are dichotomous (0,1), Cronbach alpha is equivalent to a. Pearson correlation coefficient b. intraclass correlation coefficient c. Kuder-Richardson formula 20 d. total variance

123. When calculating Cronbach's alpha, k represents the a. number of subjects b. number of independent variables c. number of trials d. number of dependent variables

13. Selecting tests that can be given to many people at the same time by using partners is helpful for what reason? a. It reduces learning time. b. It increases objectivity. c. It increases feasibility. d. It reduces discipline problems

13. What is the main reason that many different test batteries have been constructed for measuring physical fitness? a. No national group exists to coordinate the many efforts to construct a measure of fitness. b. Physical fitness levels vary for different groups. c. A lack of agreement exists concerning a definition of physical fitness. d. The facilities and equipment available for taking measurements vary widely.

143. With concurrent validity, the criterion is measured _______ the alternative measure. a. after b. before c. at the same time as d. two times more than

145. Correlating scores on a push-up test with scores on a maximal weight bench press to measure upper-body strength is an application of a. content validity b. predictive validity c. concurrent validity d. construct validity

15. Of the following, which is the best test of whole-body agility? a. stork stand b. soccer kick c. squat thrusts d. repeated throws in softball

16. Which of the following statements could not possibly be true for a test? a. Though it has little face validity, it shows substantial predictive validity. b. Though it is judged to have high content validity, it has very low reliability. c. Though it has little reliability, it has substantial criterion-related validity. d. Though it has zero predictive validity, its reliability is quite low. e. Though it has high concurrent validity, its predictive validity is low.

17. Using the vertical jump in a volleyball skill test battery is based on the criterion of a. feasibility b. objectivity c. relevance d. reliability

21. Consider the following: Mike's score = 100; mean score = 150; Sy = 100; rxx' = .91; SEM = 30. You can be 95% confident that Mike's true score is between what two values? a. 0 and 200 b. 70 and 130 c. 40 and 160 d. 150 and 250

22. Which statement is correct concerning the fitness levels of U.S. children? a. The majority of children are unfit. b. The majority of children are obese. c. There is no consensus on children's fitness levels. d. Most children are fit, but the percentage of unfit children is increasing.

25. Which type of reliability comes to mind when you think of KR20? a. Pearson product moment b. interclass c. alpha coefficient d. test-retest e. equivalent forms

28. Physical activity behaviors during childhood are a. unrelated to adult physical activity behaviors b. useful in the prediction of academic performance c. related to adult physical activity behaviors d. useful in the prediction of adult athletic success

28. What statistical test used with CRTs is actually the number of agreements divided by the total number of classifications made? a. chi square b. kappa c. proportion of agreement d. phi coefficient

28. With which group of students would you probably obtain the highest reliability on a softball throw for distance? a. 10th-grade boys b. 10th-grade girls c. 10th-grade boys and girls d. impossible to determine

29. Edwin Fleishman was best known for his work in the area of a. specificity b. motor educability c. theory of basic abilities d. physical fitness

31. A teacher administers six trials of a wall volley test in tennis. What is the proper coefficient to use to estimate the reliability of this test? a. Pearson product-moment correlation b. interclass correlation c. alpha coefficient d. Spearman-Brown formula

31. In what way does the FITNESSGRAM differ from most other youth fitness batteries? a. Lower reliabilities are reported. b. The computer program is different. c. It includes the Healthy Fitness Zone Concept. d. The types of items used to test fitness are different

31. What test statistic would be evaluated by determining how many students met the cutoff score and how many did not meet the cutoff score on two different tests of cardiovascular fitness? a. content validity b. concurrent validity c. equivalence reliability d. stability reliability

32. The data in a contingency (2 × 2) table were used to obtain a P (proportion of agreement), and the value obtained was .625. Which of the following could be the K (kappa) value for the same data? a. .875 b. .750 c. .525 d. .650

32. What does the fact that different numbers of students are reportedly physically fit based on different tests indicate? a. different reliabilities b. different validities c. different standards d. different subjects

34. Assume that you have a test with 25 items and you want to increase it to 100 items. What is the predicted reliability for the new test if the current reliability is .50? a. .67 b. .75 c. .80 d. .85

40. A criterion-referenced test for firefighting is being able to scale a 5-foot fence. This is an example of a(n) ___________approach to establishing a cutoff. a. judgmental b. normative c. empirical d. combination

41. On the FITNESSGRAM, if a student scores above the healthy fitness zone, a. the student should reduce his or her level of activity b. the student should increase his or her level of physical activity c. the student has a fitness level higher than is necessary for good health d. the student should be retested

43. The National School Population Fitness Survey is conducted by the a. Centers for Disease Control and Prevention b. National Center for Health Statistics c. President's Council on Physical Fitness, Sports & Nutrition d. Cooper Institute for Aerobics Research

43. Which of the following statements is not true about epidemiology? a. Epidemiologic research is a tool that is becoming increasingly popular in human performance measurement. b. Epidemiology is the study of the distribution and determinants of health-related states and events in populations and the applications of this study to the control of health problems. c. Norm-referenced measurement is fundamental to the type of research questions studied by epidemiologists. d. Descriptive epidemiology seeks to describe the frequency and distribution of mortality and morbidity according to time, place, and person.

46. The number, proportion, rate, or percentage of new cases of mortality and morbidity is referred to as a. marginals b. cohort c. incidence d. prevalence

48. A comparison of known cases of mortality or morbidity to matched noncases is known as a a. cohort b. case series c. case control d. randomized clinical trial e. community trial

51. In youth fitness, an important change has been from _________-referenced fitness standards to _________-referenced fitness standards. a. criterion; norm b. norm; standardized c. norm; criterion d. standardized; norm

51. Subjective rating scales are most appropriate for skills that are a. movement oriented b. distance or power oriented c. process oriented d. skill oriented

51. What type of validity evidence is provided by examining the difference among groups that you predict should differ on the test? a. content validity b. concurrent validity c. construct validity d. predictive validity

52. Which of the following terms refers to the risk (proportion, percentage, rate) of mortality or morbidity in a population that is exposed or not exposed to a risk factor? a. relative risk b. odds ratio c. absolute risk d. attributable risk

55. Not using extremely high or extremely low ratings results in which type of error? a. halo effect b. standard error c. central tendency error d. numerical error

57. An individualized conditioning program is developed for all the members of the women's basketball team based on the results of motor performance tests. This is an example of a. selection b. classification c. diagnosis d. prediction

57. The NCYFS I and II resulted in high-quality __________ data on youth fitness. a. incidence b. convenience c. normative d. prevalence

6. The National Children and Youth Fitness Studies I and II used a. criterion-referenced standards b. a large convenience sample of children c. a national probability sample of children d. tests of motor fitness e. tests of athletic fitness

61. Which of the following statements could be true about the relationships among objectivity, reliability, and validity of a test? a. It can be valid but not reliable and objective. b. It can be valid and objective but not reliable. c. It can be reliable and objective but not valid. d. Neither can exist without the other.

62. According to Fleishman, performance on a specific skill may be explained in terms of a. general motor capacity b. motor educability c. basic abilities d. specific skills

63. Which of the following statements accurately describes the measurement of flexibility? a. The objectivity of flexibility tests tends to be low. b. One test validly measures flexibility. c. There are several specific types of flexibility. d. Tests of flexibility are objective but tend to lack reliability.

65. A coach designs a 9-point rating scale for rating players. This is done to a. overcome the halo effect b. prevent a standard error c. minimize the effects of central tendency d. control the subjectivity of the grading

65. The consistency with which performances are categorized across trials most specifically refers to a. objectivity b. rigidity c. proportion of agreement d. validity

65. What is or are the difference(s) between interclass and intraclass correlation? a. Interclass is more appropriate when measuring two different variables. b. Intraclass takes more sources of error into account. c. Intraclass can accommodate more than two trials per subject. d. Intraclass is easier to interpret.

67. For elementary and junior high school boys and girls, the correlation between age and tests that involve running, jumping, and throwing is a. negative b. not known c. positive d. zero

67. Specific diagnostic evaluation based on CRT can be made to improve ______________ if standards are not met. a. associations b. comparability c. performance d. rankings

70. The most serious weakness of accuracy-type skill tests is a. feasibility b. objectivity c. reliability d. content validity e. construct validity

70. What does residual variance best represent? a. true score variance b. experimental variance c. nonpredicted variance d. common variance

73. Guidelines for development of skills tests state that skills tests should a. not require testing of reliability and validity b. be complex to administer and to take c. have instructions that are easy to understand d. require either expensive or extensive equipment

73. The sum of observations in a specific column are called a. averages b. group means c. marginals d. total sample size

77. Statistical techniques used for assessment of reliability and validity of CRT include chi square, proportion of agreement, Kappa, and a. ANOVA b. delta c. phi coefficient d. regression

81. The three phases of effective testing procedures are a. attitudes, abilities, and skills b. cognitive, motor, and psychosocial c. pretest, test, and posttest d. planning, testing, and reflection

82. A contingency table of a chi-square test between a field test and a criterion lab test would determine a. reliability b. mean change c. validity d. objectivity

9. Which of the following statement(s) is or are true? a. A low relationship between a test and its criterion indicates poor reliability. b. Close agreement between two judges scoring an event indicates good validity. c. Consistency is the chief quality of reliability. d. A reliable test is a valid test. e. More than one of these is true.

93. After developing a norm-referenced test, it is useful to establish a. cutoffs b. items c. norms d. averages

95. Which reliability measure is not associated with criterion-referenced measurement? a. Kappa b. phi c. Theta d. chi square

97. A simple but highly reliable skills test may be a more appropriate choice than a more gamelike test if a. the tester needs to test a small group of athletes b. the tester has enough time for testing c. the tester is testing beginners d. the tester is testing advanced athletes

98. A gamelike skills test may be a more appropriate choice than a simple but highly reliable test if a. the tester needs to test a large group of athletes b. the tester has limited time for testing c. the tester is testing advanced athletes d. the tester is testing beginners

99. An observed score is the sum of the true score and the a. average score b. estimated score c. error score d. total score

41. Which of the following statistics is not used in evaluating criterion-referenced tests? a. chi square b. Cohen's kappa c. proportion of agreement (P) d. intraclass R

41. The rationale for including the 20-free-throws test in a basketball skill test battery is based on which of the following criteria? a. difficulty b. feasibility c. objectivity d. relevance

Muscle generates force as it shortens

Concentric contraction

Test that is constructed to yield measurements that are directly interpretable in terms of specific performance standards

Criterion-Referenced test

10. Of the following, what is the best way to reliably assess playing ability in a sport? a. Use a rating scale. b. Use a standardized skill test. c. Have many items on the test on ability. d. Assess playing ability on a number of occasions.

109. Which is not true concerning reliability? a. small error score variance yields greater reliability b. large true score variance yields greater reliability c. longer tests yield greater reliability d. quicker tests yield greater reliability

11. To evaluate the fitness of special children, it is necessary to understand their disabilities. The necessary knowledge for that understanding a. will be developed from this course b. will be developed from studies in motor behavior c. will be developed from studies in biomechanics d. will be developed from studies in adapted physical education

118. The Spearman-Brown formula can be used to estimate the reliability of a test that a. has doubled in size b. has not changed in size c. has been reduced in size d. both a and c

124. Tests for evaluation of skill performance include all of the following except a. subjective ratings b. trials-to-criterion testings c. performance-based testings d. criterion-related testing

129. The degree to which a person's observed score fluctuates as a result of errors of measurement is represented by a. standard deviation of X b. standard deviation of Y c. standard error of estimate d. standard error of measurement

133. Reliability of a test can change depending on all but a. how the test is administered b. whom the test is administered to c. when the test is administered d. the software used

134. All of the following factors can affect a test's reliability except a. fatigue b. practice c. subject variability d. test validity

135. The extent to which a test measures what it's supposed to measure is its a. internal consistency b. test-test stability c. reliability d. validity

137. Which one of the following is based on having a true criterion measure? a. internal consistency b. construct validity c. objectivity d. concurrent validity

147. Criterion measures for validity can be obtained by all of the following except a. actual participation b. known valid criterion c. expert judges d. any correlated variable

151. Determining differences in measurement between groups with known differences is a form of a. content validity b. predictive validity c. concurrent validity d. construct validity

19. Which of the following statements most accurately describes the interrelationship of skill and ability? a. Ability and skill are negatively related. b. Ability and skill are not related. c. Ability is limited by skill. d. Skill is limited by ability.

20. What type of test item is least likely to appear in a test battery measuring health-related physical fitness? a. muscular endurance b. strength c. cardiovascular endurance d. balance

24. A test of which of the following would least likely be included in a general motor ability test battery? a. eye-hand coordination b. power c. reaction time d. strength

25. In assessment of physical activity in children, Sallis and colleagues found a. self-reports to be very unreliable b. self-reports to have consistent reliability and validity across all ages of children c. that the reliability but not the validity of self-reports increased as the age of the children increased d. that the reliability and validity of self-reports increased as the age of the children increased

26. A newscaster reported that the local football team "played an easy schedule" of opponents this year. The coach stated that he did not believe that to be truthful. What aspect of the statement was the coach questioning? a. reliability b. criterion c. construct d. validity

29. What is a test battery? a. a youth fitness test b. an adult fitness test c. a valid test d. a series of test items

30. What test statistic would be examined by determining how many students met the cutoff score and how many did not meet the cutoff score for a test on day 1 and again on day 2? a. construct validity b. concurrent validity c. equivalence reliability d. stability reliability

36. Assume you have the following data from a recent test: trial 1 mean = 150 and s2 = 10; trial 2 mean = 155 and s2 = 10; trial 3 mean = 147 and s2 = 20; total mean = 452 and s2 = 100. What is the alpha reliability of the test? a. .60 b. .70 c. .80 d. .90

37. Proportion of agreement is used for what type of reliability estimation? a. stability b. test-retest c. norm-referenced d. criterion-referenced

37. The reliability of a test is .91. Place 68% confidence limits around a person's observed score of 60 on the test if the standard deviation of the test is 20 points. a. 51 to 69 b. 40 to 80 c. 52 to 68 d. 54 to 66

4. What aspect of a test designed to measure softball hitting ability could be most seriously questioned if the test involves batting the ball from a batting tee? a. objectivity b. reliability c. norm values d. validity

40. A teacher decides to use a rating scale to evaluate achievement in modern dance skills. What is she trying to increase if she explicitly defines the skill to be rated and the point values to be assigned? a. the correlation of the measurement b. the mean of the measurement c. the standard error of the measurement d. the objectivity of the measurement

40. The PACER test of the FITNESSGRAM is a measure of a. running agility b. abdominal endurance c. dynamic balance d. aerobic capacity e. anaerobic capacity

41. Why is test validity so important? a. It is an indication of the reproducibility of a student's ability in an area. b. It is necessary for comparison of students' results to criterion-referenced norms. c. It is necessary for comparison of students' results to norm-referenced standards. d. It is an indication of the test's ability to measure what it reports to measure. e. If a test is reliable, then it is valid.

42. What are the two major aspects measured in golf tests? a. accuracy and speed b. distance and loft c. loft and speed d. accuracy and distance

42. Your instructor is interested in improving the reliability of the first class test. He decides to do so by increasing the number of items. What is the appropriate procedure to use to determine an estimate of the new reliability? a. Pearson product-moment correlation b. intraclass correlation c. alpha coefficient d. Spearman-Brown formula

45. The number, proportion, rate, or percentage of total cases of mortality and morbidity is referred to as a. marginals b. cohort c. incidence d. prevalence

45. Which volleyball skill is least likely to be measured in a battery of volleyball tests? a. volleying b. passing c. serving d. spiking

46. Objectives are to content validity as criterion is to a. internal consistency b. construct validity c. objectivity d. concurrent validity

47. Randomly assigning subjects to treatments or exposures is known as a a. cohort b. case series c. case control d. randomized clinical trial e. community trial

47. What procedure represents the most serious misuse of norms accompanying a performance test? a. using the norms for general reference b. using the norms as a basis for determining students' achievement levels c. using the norms after administering the test to students whose ability levels closely match those for whom the norms were established d. using the norms after administering the test under different conditions than those for which the norms were established

49. The AAHPERD Health-Related Physical Fitness Test included all but a. distance runs b. skinfolds c. 1-minute sit-up test d. leg squats

49. What method is used to determine the reliability of a two-trial test administered on the same day? a. equivalence b. relevance c. parallel forms d. test-retest

53. FITNESSGRAM was created by the a. Centers for Disease Control and Prevention b. National Center for Health Statistics c. President's Council on Physical Fitness, Sports & Nutrition d. Cooper Institute for Aerobics Research

53. Which of the following is not an example of an absolute rating scale? a. numerical scale b. descriptive scale c. paired comparison scale d. checklist

54. What is the minimum reliability and validity for a test to be useful? a. reliability = .80; validity = .90 b. reliability = .70; validity = .80 c. reliability = .60; validity = .70 d. It depends on the situation.

55. Which of the following terms refers to the risk of mortality and morbidity directly related to a risk factor? a. relative risk b. odds ratio c. absolute risk d. attributable risk

57. If there is little variability in a measure from time to time, it is said to be a. valid b. homogeneous c. modal d. reliable e. typical

58. The NCYFS I and II results were considered high quality because a. the children represented the population of U.S. children b. a probability sample was used c. a large convenience sample was used d. a and b

58. The reliability of a test is reported to be .80. What is the standard error of measurement for this test in z score form (i.e., SD = 1.00)? a. .15 b. .25 c. .35 d. .45 e. unable to determine

59. The FITNESSGRAM test battery includes all but a. PACER b. skinfolds c. curl-ups d. leg squats

6. A researcher administered six different tests of balance to a group of subjects and found low intercorrelations among all six tests. This experiment offers evidence for what hypothesis? a. generality of motor tasks b. positive transfer among motor tasks c. reminiscence of motor tasks d. specificity of motor tasks

67. A test constructed to measure badminton knowledge is administered to a group of students along with another test known to be a valid measure of this same knowledge. A correlation between the resulting scores is evidence of a. reliability b. content validity c. face validity d. concurrent validity

69. Using the medicine ball throw in a volleyball skill test battery may be questioned based on a. feasibility b. objectivity c. reliability d. content validity e. construct validity

69. Which prediction equation should be used? Each was obtained on a different sample. a. r = .71; SEE = 101 b. r = .73; SEE = 103 c. r = .75; SEE = 105 d. r = .79; SEE = 107

7. Fitness tests for children can be divided into which two types based on the objectives of the test? a. motor and athletic b. skill and athletic c. health-related and age-related d. health-related and motor

7. Which of the following tests is probably the most reliable measure of running speed? a. 20-yard dash b. 30-yard dash c. 40-yard dash d. 50-yard dash

70. The statistical analyses used for CRTs are _______ those used for NRTs. a. different from b. the same as c. based on the same principles as d. both a and c

75. Subjects were weighed underwater and then skinfold measures were taken to determine the regression equation for calculating percent body fat. The Pearson product-moment correlation between body fat as determined from underwater weight and from the skinfolds was .95. What type of validity coefficient is this? a. content b. construct c. predictive d. criterion

80. The order of test administration is important when _____________ is a possible factor. a. time b. fatigue c. test place d. both a and b

81. Assume that each of the following tests has the same variance. Which test has the smallest SEM? a. rxx' = .00 b. rxx' = .50 c. rxx' = .80 d. rxx' = .90 e. unable to determine from what is given

82. All things considered, what is the most important characteristic of a test? a. objectivity b. reliability c. relevance d. validity e. variance

83. Assume that you see a correlation coefficient presented in a published report. What determines whether it is a correlation coefficient (rxy), a reliability coefficient (rxx'), or a validity coefficient (rxy)? a. the number of variables being correlated b. the size of the reported correlation coefficients c. the number of coefficients reported d. how the variables are used in the analysis

84. Assume you develop a new skills test for your students or clients. What is the most important characteristic of this test? a. its length; if it's too long then people will not use it b. the number of trials; there must be enough trials to be reliable c. its objective nature; it must be objective d. its validity

86. When measuring reliability by dividing a test in half, how is the reliability of the whole test estimated? a. by calculating the standard error of the mean b. by calculating the standard error of the estimate c. by calculating the standard error of the measurement d. by calculating the Spearman-Brown value

92. The stability of motor performance tests is established with __________ coefficients that involve __________ measures. a. validity; fixed b. reliability; fixed c. reliability; validity d. reliability; repeated

96. The consistency of an observation specifically relates to a. affective domain b. cognitive domain c. formative evaluation d. reliability

categorical independent variable; categorical dependent variable

Differences in categorization

10. Which of the following youth fitness test items is supported primarily by content validation procedures? a. 12-minute run b. 5-mile (8.0 km) run c. skinfolds d. 1-mile run (1.6 km) e. sit-and-reach

101. Grading on skill level in a beginning tennis class is associated with which domain? a. affective b. cognitive c. effective d. psychosocial e. psychomotor

102. Error scores cannot be a. negative b. positive c. small d. large e. numerical

103. The mean of the error score is always a. large b. small c. negative d. positive e. zero

119. An absolute scale can be all of the following except a a. numerical scale b. descriptive scale c. graphic d. checklist e. rank scale

24. What is the difference between reliability and objectivity? a. There is no real difference between reliability and objectivity. b. Reliability relates to time, whereas objectivity relates to relevance. c. Reliability relates to relevance, whereas objectivity relates to scorers. d. Reliability relates to scorers, whereas objectivity relates to time. e. Reliability relates to time, whereas objectivity relates to scorers

38. Proportion of agreement is used for what type of validity estimation? a. content b. concurrent c. predictive d. norm-referenced e. criterion-referenced

49. Which of the following is not a guideline for skills test development established by AAHPERD? a. must have at least minimal acceptable reliability and validity b. require neither expensive nor extensive equipment c. are reasonable in terms of preparation and administration time d. exclude extraneous variables as much as possible e. must use multivariable statistical techniques

Used to evaluate the entire performance with a single score

Holistic rubric

no difference between the groups other than that due to random variation

If a p-value is closer to 1, what does that mean?

Physical form of communities include

Land use patents—how the land is used, Large and small scale built and natural features—architectural details, quality of landscaping, Transportation system

Who developed a physical fitness test for older adults?

Rikli & Jones

variance

SD^2

Goal Setting is another psychological skill. What kind of goal does it use?

SMARTS goals

Multidimensional inventories measuring a variety of psychological skills for performance success

Sport specific

strength of evidence against null hypothesis

What does the p-value measure?

8. How does the ACTIVITYGRAM differ from the FITNESSGRAM?

The ACTIVITYGRAM measures typical daily activity.

The Stages of Change for Exercise and Physical Activity thing is also called...?

The Trans-Theoretical Model

standard deviation

The amount ot which all scores differ/deviate from the mean

what happens to the mean and the standard deviation of a set of scores if each score is increased by 10?

The mean changes and the standard deviation stays the same.

objective, valid, reliable (NOT SUBJECTIVE)

What are the aspects of subjective evaluation?

relevance, reliability, validity, feasibility

What are the criteria for skill tests?

not reliable, not valid

What are the disadvantages of accuracy tests?

not valid, not relevant

What are the disadvantages of wall volley tests?

location and variability

What are the two most important elements of a data set?

y axis

What axis is the dependent variable?

x axis

What axis is the independent variable?

bar and pie charts

What charts are used for qualititative data?

histograms or box and whisker plots

What charts are used for quantitative data?

ratio and interval

What classification are physical tests?

estimated variablitily of the mean

What does standard error of the mean show?

Though field tests of physical fitness may be more feasible than criterion measures, what is one major disadvantage of field tests? a) reduced validity b) increased standard error of measurement c) decreased convenience d) submaximal estimations of performance

a) *reduced validity*

adjectives that describe action (SD scales)

activity

The best reason for using T scores with three skill items on a basketball test is to

add scores together

What type of test is used to make categorical decisions

criterion-referenced

Golden Rule #6

be aware of *possible effects* on other aspects of *performance* (make sure enough time is given to rest btw tests & don't test too close to competition)

comprehension

being able to explain in own words

In the previous question, what is or are the dependent variable(s)? a. time of year b. body weight and basal metabolic rate c. body weight d. basal metabolic rate and time of year

body weight and basal metabolic rate

the mixed methods approach includes

both quantitative and qualitative measurements

should trait or state be focused on more when understanding and predicting behavior?

both!!

Physical form of communities

built environment

Performance tests should be of reasonable _________ & have a set ___________ or guidelines

difficulty; criteria (ex. for a free throw test -- the hoop should be lower for an 8 yr. old compared to an 18 yr. old)

john scored at the 45th percentile on the course midterm. Which interpretation of his score is the best? a. 45% of the people tested exceeded his score b. John got 45% of the items correct. c. John had a better score than 45% of the people tested. d. More than one of these statements is an appropriate interpretation.

c. John had a better score than 45% of the people tested.

What dimension did the Competitive State Anxiety Inventory-2 add?

cOnFiDeNcE

A researcher investigated the relationship between vitamin C (none, 500 mg, 1000 mg) and workers (office, outdoors) on the frequency of colds. Which is or are the dependent variable(s)? a. colds b. vitamin C c. colds and workers d. vitamin C and workers

colds

interval

common unit of measurement, but no true zero point

t-test

compare two mean values

The variables percent body fat (%), oxygen consumption (ml/kg/min), and weight (in pounds) would be considered _______, _______, and _______, respectively. a. nominal; nominal; nominal b. ordinal; nominal; ordinal c. continuous; nominal; continuous d. continuous; nominal; ordinal *e. continuous; continuous; continuou

continuous; continuous ; continuous

The variables time (in seconds), temperature (in degrees F), and triathlon event (swim, bike, run) would be considered _______, _______, and _______, respectively. a. nominal; nominal; nominal b. ordinal; nominal; ordinal c. continuous; continuous; nominal d. continuous; nominal; ordinal

continuous; continuous; nominal

relevance

criteria in which testing environment is similar to real world environment

In what population might balance be considered a component of health-related physical fitness?

elderly adults at risk for falling

standard error of mean

estimated variability of mean

degree of goodness you attribute to concept

evaluation

semantic differential scales measure what 3 factors?

evaluation potency activity

increased interest due to percent of population obese/overweight

exercise motivation

quantitative measurement usually includes what two things?

experimental and correlational designs

Measurement on the nominal scale is accurately described as quantitative measurement. a. true b. false

false

The first quartile corresponds to the 75th percentile value. a. true b. false

false

The larger the standard deviation, the more homogeneous the distribution. a. true b. false

false

Flexibility includes __ and ___

flexion and extension

A distribution of scores is best obtained with what PASW command? a. report b. means c. frequencies d. descriptives

frequencies

To compare two sets of data, the most appropriate technique to use is the a. frequency distribution b. frequency polygon c. histogram d. cumulative frequency polygon

frequency polygon

A researcher investigated the relationship between age (10 years, 12 years, 14 years) and gender on percent body fat. What is or are the independent variable(s)? a. percent body fat and gender b. gender and age c. age and percent body fat d. age

gender and age

how success is defined

goal setting

Which of the following does not represent a nominal scale of measurement? a. gender b. race c. religion d. height

height

central-tendency error

hesitancy of raters to assign extreme ratings to performers (scale rating of 1-5, judges are not likely to give 1s & 5s (extreme scores) bc of comparison to other performers (rater may think performance was good, but leaves the extreme rating open bc someone else's performance may be slightly better) -- creates *LESS variability*, reducing reliability

The graphing technique that is most appropriate for depicting the shape of a single set of data is a a. frequency distribution b. frequency polygon c. histogram d. cumulative frequency polygon

histogram

Inferential statistics are used to decide if differences among treatment groups are the result of the ______ or (the) ______. a. independent variable; dependent variable b. dependent variable; chance error c. confounding variable; dependent variable d. independent variable; chance error

independent variable; chance error

A goniometer is a protractor type of instrument used to measure the ___ at both extremes in the total range of movement

joint angle

A distribution that has a large number of scores that cluster close to the mean with relatively few scores falling in either tail is said to be a. platykurtic b. mesokurtic c. leptokurtic d. heterogeneous

leptokurtic

what kind of scale is this an example of? 1=strongly agree 2=agree 3=neutral 4=disagree 5=strongly disagree

likert scale

what are the two scales used in quantitative measurement?

likert scales semantic differential scales

Which measure of central tendency is most appropriate for continuous scale measurements? a. mode b. median c. mean d. either the mode or the mean

mean

Which of the following is the arithmetic average? a. mean b. median c. mode d. variance

mean

Which of the following statistics are most commonly reported in sport and exercise science research reports? a. mean: standard deviation b. median: standard deviation c. mean: range d. mode: range

mean : standard deviation

Which statistic requires the data to be ranked before its calculation? a. mean b. median c. average d. mod

median

For a set of 5,000 scores, the mean is 98, the median is 110, and the mode is 120. Which of the following describes the distribution? a. homogeneous b. heterogeneous c. positively skewed d. negatively skewed

negatively skewed

Which of the following measurement scales is related to naming or classifying?

nominal

lassifying subjects as athletes or nonathletes, male or female, from New York or Hawaii, is an example of which measurement scale? a. nominal scale b. ordinal scale c. interval scale d. ratio scale

nominal

direct observation with coding. videotaping. should be unobtrusive to the participants

observation

qualitative measurement usually includes what three things?

observations ethnography interviews

Obesity is ___, not ___

overfatness, not overweight

Given that the mean is 80 and the standard deviation is 10, which of the following does not belong? a. z score = -1.21 b. T score = 37.9 c. raw score = 67.9 d. P = 38.69

p=38.69

Characteristics of a population are called ________, whereas those of a sample are termed _________. a. statistics; measures b. statistics; parameters c. parameters; statistics d. statistics; variables

parameters ; statistics

for *performance efficiency* of total body movement test, the *valid criterion* is ___________ _________

performance *SPEED* (speed is process-related)

for *evaluation* of total body movement test, the *valid criterion* is __________ _________

performance *TIME* (time is an outcome)

effects of psychological factors on sport performance. mind affects body. not restricted to elite athletes

performance enhancement

2 main areas of sport and exercise physiology --

performance enhancement mental health

What two things does Sport and Exercise Psychology focus on?

performance enhancement mental health

first element of preparing to administer tests

planning

Systematic, purposeful and meaningful collections of student work used to document learning over time

portfolios

Self Talk is another psychological skill. It can either be ___ or ___

positive negative

increase our feelings of psychological well-being results in

positive influence on mental health

With a median of 12 and a mean of 14, we can infer that the distribution is a. positively skewed b. negatively skewed c. normal d. rectangular

positively skewed

planning to administer tests, examinee being exposed to the test and allowed to practice, and order of testing administration needs to be considered

pretest duties

3 phases of effective testing procedures --

pretest duties testing duties posttest duties

textural in nature. field observations, ethnography and interviews. provide depth and detail. focused on process not product. serve as foundation for development of quant. instruments

qualitative

Which of the following scales contain an absolute zero point?

ratio

Oxygen uptake capacity measured in liters per minute is an example of which type of measurement? a. nominal scale measurement b. ordinal scale measurement c. interval scale measurement d. ratio scale measurement

ratio scale measurement

correlation

relationship between two continuous variables

correlation

relationship between two variables

is a trait stable or unstable?

relatively stable

a goal should be pertinent and related to what you set out to accomplish

relevant

consistency of a measurement

reliability

Given the following statistics, which test would contribute most to a composite score obtained by adding each student's four scores together? a. test A: M = 80; SD = 2 b.testB:M=70;SD=4 c.testC:M=60;SD=6 d.testD:M=50;SD=8

test D:M=50;SD8

3 types of interclass reliability

test-retest, equivalence, split-halves

One of the precautions for using psychological testing in sport and exercise is that you must have an understanding of.....

testing principles and concept measurement error

repetitive-performance tests

tests that involve *continuous performance* of an activity for a *specified period of time*

primary issue associated with *accuracy tests* is what?

the *development of a scoring system* that provides a *reliable, yet valid result* (also must consider the # of repetitions necessary to produce reliable results)

In the affective domain, dimensionality refers to what?

the *number of factors* interpreted

what does performance enhancement focus on?

the effects of psychological factors on sport performance

the mixed methods approach allows the investigator to consider

the idiosyncrasies of the responses

What is exercise motivation?

the increased interest due to percent of population that is obese/overweight

What is the most important piece of information reported as the result of a chi square, t test, or ANOVA? a. the dependent variable(s) b. the independent variable(s) c. the test statistic (i.e., X2, t, F, etc.) *d. the probability level

the probability level

One of the precautions for using psychological testing in sport and exercise is that athletes should be told...

the purpose of the test what it measures how it is used

what do statistically significant findings imply? a. The results are very important. b. The results are not very important. c. The results are likely due to chance differences among groups. *d. The results are likely due to real differences among groups.

the results are likely due to real differences among groups

What is the Behavioral Regulation in Exercise Questionnaire based on?

the self-determination theory continuum

what does the phrase "mind affects body" mean in performance enhancement?

the way we think and feel has a strong impact on how we physically perform

Which of the following statements about dependent variables is true? a. They are set at one level. b. They are set at more than one level. c. They are measured. d. They are manipulated.

they are measured

Why are standard scores so important? a. They are related to the normal curve. b. Grades can be based on them. *c. They permit scores to be compared more appropriately. d. They are easier to interpret than percentiles. e. They reduce the variability in a distribution to 1 (for z scores) and 10 (for T scores).

they permit scores to be compared more appropriately

n investigation was conducted on how the time of year (winter, spring, summer, fall) affects body weight and basal metabolic rate. What is or are the independent variable(s)? a. time of year b. body weight and basal metabolic rate c. body weight d. basal metabolic rate

time of year

creating a well-defined timeline will help to track progress towards your goal and allow for feedback on how your actions towards reaching your goal

timely

what are likert scales used for?

to assess the degree of agreement or disagreement with statements

fundamental aspects of personality, relatively stable. predispositions. learned or genetic

trait

The sum of the deviation scores from the mean will always be zero. a. true b. false

true

Validity—

truthfulness of measurement

Golden Rule #5

try to provide *test results as quickly as possible*

When a person concludes that there is a difference between population means when she has actually observed an improbable difference between two sample means that come from identical populations, she has made a. a type I error b. a type II error c. no error d. either a type I or a type II error; must see the data to decide

type I error

qualitative measurement observations should be...?

unobtrusive to the participant

Golden Rule #4

use a *battery of tests* & report the results in a *profile* (helps make human performance analysis decisions & shows *strengths & weaknesses*)

application

used for VO2 Max in a group of Olympic athletes

6 sub scales of profile of mood states --

vigor confusion anxiety tension anger fatigue

6 sub scales with profile of mood:

vigor, confusion, anxiety, tension, anger, fatigue

When are the mean, median, and mode identical? a. when the sample size is very large b. when all equal zero c. when the distribution is mesokurtic *d. when the distribution is symmetrical

KIN 3502 FINAL -- ch 11, kin hhhh, KIN 3502 - Sport Skills & Motor Abilities (Chap. 11), KIN 3502 - Final - Chap. 12 Quiz (Psychological Measures), KIN 3502 - Exam 2 - Chap. 9 Quiz, KIN 3502 final -- ch 12, Kin 3502 Final- Psychological Measurements

संबंधित स्टडी सेट्स

ACCT CHP 4 Job-Order Costing

African Lions

World History: SOCIAL CHANGES: POSITIVE CONTRIBUTIONS

Conditioning and Learning Quiz

BL-Linux Chapter 12 - Shell Scripting

Chapter 4: Food Safety

MUS 110 CH 15 Listening Study Guide: LG 4 Farmer: Fair Phyllis

Econ 202 Final

Bio Study Guide Ch 8

Y2 LCRS Head, Neck and Spine

ACCOUNTING 101- Chapter 2

Deutsch 1 - Semester 1 Fragen

cell respiration (lab)

Homework 10: Ch. 31 Fungi

Ch 12 Study Questions

Ancient Greece chapter 8&9 test

Chapter 20

Interactive and Multichannel Marketing

chapter 9 brain & behavior study questions

Chap 7 Mastering Biology