Question 1

When a test developer gives the same test to the same group of test takers on two different occasions, the developer is gathering evidence of ______.

A) internal consistency
B) internal reliability
C) test-retest reliability
D) scorer reliability

Accepted Answer

When a test developer gives the same test to the same group of test takers on two different occasions, the developer is gathering evidence of test-retest reliability, which concerns the consistency of scores over time. Internal consistency and internal reliability both refer to consistency across items within a single administration of the test, and scorer reliability refers to consistency among the people who score the test, so none of these requires a second administration.
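As an illustration only, test-retest reliability is commonly estimated by correlating the scores from the two administrations. The sketch below assumes a Pearson product-moment correlation; the function name `pearson_r` and the sample scores are hypothetical and do not come from the deck.

```python
import math


def pearson_r(first, second):
    """Pearson correlation between two equal-length lists of scores."""
    if len(first) != len(second) or len(first) < 2:
        raise ValueError("need two score lists of equal length (at least 2)")
    mean_1 = sum(first) / len(first)
    mean_2 = sum(second) / len(second)
    # Sum of cross-products and sums of squared deviations from each mean.
    sxy = sum((a - mean_1) * (b - mean_2) for a, b in zip(first, second))
    sxx = sum((a - mean_1) ** 2 for a in first)
    syy = sum((b - mean_2) ** 2 for b in second)
    if sxx == 0 or syy == 0:
        raise ValueError("scores must vary for a correlation to be defined")
    return sxy / math.sqrt(sxx * syy)


# Hypothetical scores for the same five test takers on two occasions.
time_1 = [10, 12, 15, 18, 20]
time_2 = [11, 13, 14, 19, 21]
print(round(pearson_r(time_1, time_2), 3))
```

A value near 1.0 would suggest that test takers kept roughly the same rank order across the two occasions.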

Question 2

What do we call changes in test scores resulting from the sequence in which the tests were taken?

A) practice effects
B) order effects
C) fatigue effects
D) alternate effects

Accepted Answer

Order effects refer to changes in test scores resulting from the sequence in which the tests were taken. Practice effects refer to changes in test scores resulting from repeated testing. Fatigue effects refer to changes in test scores resulting from participant exhaustion or boredom. Alternate effects do not refer to any specific phenomenon related to test scores.

Question 3

What is Dr. Jonah using when she gives students a math achievement test today and a different version of the same test in 2 months?

A) supplemental forms
B) complementary forms
C) alternate forms
D) nonparallel forms

Accepted Answer

Dr. Jonah is using alternate forms, which are different versions of the same test that are designed to measure the same construct or skill. This lets her compare student performance over time while reducing practice effects, because students do not see the same items twice.

Question 4

When using the split halves method of estimating reliability/precision, which one of the following requires adjustment to compensate for splitting the test into halves?

A) assignment of the test questions
B) correlation method
C) the reliability coefficient
D) the test scores

Accepted Answer

The reliability coefficient derived from the split halves method requires adjustment, typically using the Spearman-Brown formula, because each half contains only half as many questions as the full test, and shorter tests tend to yield lower reliability estimates. The other options do not require adjustment specifically for the split halves method.
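A minimal sketch of the Spearman-Brown adjustment, assuming the standard form r_adjusted = n * r / (1 + (n - 1) * r), where r is the correlation between the two halves and n is the factor by which the test is lengthened (2 for a split-halves estimate). The function name and example value are hypothetical.

```python
def spearman_brown(r_half, length_factor=2.0):
    """Estimate full-length reliability from a shorter-test correlation.

    r_half: correlation between the two halves of the test.
    length_factor: how many times longer the full test is (2 for halves).
    """
    denominator = 1 + (length_factor - 1) * r_half
    if denominator == 0:
        raise ValueError("adjustment is undefined for this correlation")
    return length_factor * r_half / denominator


# A half-test correlation of .60 adjusts upward for the full-length test.
print(round(spearman_brown(0.60), 2))  # 0.75
```

The adjusted value is higher than the half-test correlation whenever that correlation lies between 0 and 1, which reflects the idea that the full-length test is more reliable than either half.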

Question 5

What is the term used to describe the consistency of test scores?

A) validity
B) reliability/precision
C) distribution
D) standard deviation

Accepted Answer

Reliability/precision refers to the consistency of test scores over time and across different test versions or forms. It is a crucial property of any assessment tool, because scores that are not consistent cannot be trusted. Consistency alone, however, does not show that a test measures what it is supposed to measure; that property is validity. The distribution and standard deviation describe the spread of scores within a group, not the consistency of scores.

Question 6

Which one of the following pairs of math questions would be most likely to produce an internally consistent result?

A) 8 − 10 = ? and 500 + 224 = ?
B) 22 × 48 = ? and 48 + 22 = ?
C) (−8) − (+10) = ? and 8 × 10 = ?
D) 8 × 10 = ? and 10 × 8 = ?

Accepted Answer

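The two questions in choice D are the same multiplication fact with the factors reversed. Because multiplication is commutative, both questions test the same skill at the same difficulty, so a test taker who answers one correctly will almost certainly answer the other correctly, which produces an internally consistent result. The other pairs mix different operations or difficulty levels.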

Question 7

As the interval between administrations lengthens, test-retest reliability will most likely ______.

A) increase
B) decrease
C) vary unpredictably
D) remain unchanged

Accepted Answer

As the interval between test administrations lengthens, the likelihood of changes in the test-taker's knowledge, mood, or other relevant characteristics increases, which can lead to a decrease in test-retest reliability.

Question 8

What do we mean when we say that a test is internally consistent?

A) a test taker only takes the test once
B) a test taker's scores will remain similar over time
C) the test questions are measuring a similar concept
D) the scores of a group of test takers will be very similar

Accepted Answer

Internal consistency refers to the degree to which the items on a test are measuring the same construct or concept. This is typically assessed using measures like Cronbach's alpha or split-half reliability, which are based on how strongly responses to the individual items, or to the two halves of the test, relate to one another. If a test is internally consistent, it suggests that the items are all measuring the same thing, rather than tapping into unrelated factors or sources of error.

Question 9

What is the formula that Cronbach proposed for calculating internal consistency for questions that have more than two possible responses called?

A) coefficient alpha
B) KR-20
C) product moment correlation
D) Spearman Brown

Accepted Answer

The formula that Cronbach proposed for calculating internal consistency for questions that have more than two possible responses is called coefficient alpha.
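As a hedged illustration, coefficient alpha is usually written as alpha = (k / (k - 1)) * (1 - sum of item variances / variance of total scores), where k is the number of items. The sketch below assumes that form and uses sample variances throughout; the function name and the ratings are hypothetical.

```python
from statistics import variance


def coefficient_alpha(scores):
    """Coefficient alpha for a list of rows (one row of item scores per person)."""
    k = len(scores[0])
    if k < 2 or len(scores) < 2:
        raise ValueError("need at least two items and two test takers")
    # Variance of each item (column) across test takers.
    item_variances = [variance([row[i] for row in scores]) for i in range(k)]
    # Variance of each test taker's total score.
    total_variance = variance([sum(row) for row in scores])
    if total_variance == 0:
        raise ValueError("total scores must vary")
    return (k / (k - 1)) * (1 - sum(item_variances) / total_variance)


# Hypothetical ratings from four respondents on two rating-scale items.
ratings = [[1, 2], [2, 1], [3, 4], [4, 3]]
print(round(coefficient_alpha(ratings), 2))  # 0.75
```

Items that rise and fall together across respondents push the total-score variance up relative to the summed item variances, which is what moves alpha toward 1.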

Question 10

Which one of the following is used for calculating internal consistency for tests whose questions can be scored as either right or wrong?

A) split halves method
B) test-retest method
C) coefficient alpha
D) KR-20 formula

Accepted Answer

The KR-20 formula is specifically designed for calculating internal consistency reliability for tests with dichotomous outcomes (i.e., questions that can only be scored as right or wrong). It is a special case of the coefficient alpha for dichotomous items.
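A minimal sketch of KR-20, assuming the common form KR-20 = (k / (k - 1)) * (1 - sum(p * q) / variance of total scores), where p is the proportion answering an item correctly and q = 1 - p. Because p * q is the population variance of a right/wrong item, this sketch also uses the population variance for the totals; textbooks differ on that choice, so treat it as an assumption. The function name and data are hypothetical.

```python
from statistics import pvariance


def kr20(responses):
    """KR-20 for rows of 0/1 item scores (one row per test taker)."""
    n_people = len(responses)
    k = len(responses[0])
    if k < 2 or n_people < 2:
        raise ValueError("need at least two items and two test takers")
    # p = proportion correct on each item; q = 1 - p.
    sum_pq = 0.0
    for i in range(k):
        p = sum(row[i] for row in responses) / n_people
        sum_pq += p * (1 - p)
    total_variance = pvariance([sum(row) for row in responses])
    if total_variance == 0:
        raise ValueError("total scores must vary")
    return (k / (k - 1)) * (1 - sum_pq / total_variance)


# Hypothetical right (1) / wrong (0) answers from four test takers on three items.
answers = [[1, 1, 1], [1, 1, 0], [1, 0, 0], [0, 0, 0]]
print(round(kr20(answers), 2))  # 0.75
```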

Question 11

Which one of the following methods of estimating reliability/precision requires dividing the test into halves and then correlating the set of individual test scores on the first half with the set of individual test scores on the second half?

A) test-retest method
B) coefficient alpha
C) split-half method
D) correlation method

Accepted Answer

The answer of Which one of the following methods of...

Question 12

What do we call two forms of a test that are comparable in every way?

A) identical forms
B) similar forms
C) parallel forms
D) supplementary forms

Accepted Answer

The answer of What do we call two forms of...

Question 13

Evidence of reliability/precision indicates that ______.

A) test scores will likely be consistent across repeated measurements
B) the test taker is properly administering and using the test
C) the test is measuring what it is designed to measure
D) a test possesses an important but not essential property

Accepted Answer

The answer of Evidence of reliability/precision indicates that ______. A) test...

Question 14

Since the PAI requires test takers to provide ratings on a response scale that has four options (False/Not at all true, Slightly true, Mainly true, and Very true), the appropriate formula for estimating internal consistency is ______.

A) Spearman Brown
B) KR-20
C) product moment correlation
D) coefficient alpha

Accepted Answer

The answer of Since the PAI requires test takers to...

Question 15

Estimating reliability/precision using methods of internal consistency is appropriate only for tests that are ______.

A) heterogeneous
B) homogeneous
C) unstandardized
D) standardized

Accepted Answer

The answer of Estimating reliability/precision using methods of internal consistency...

Question 16

When researchers want to measure test-retest reliability/precision, they must assume test takers' ability will ______.

A) not change between the first administration and the second administration
B) increase between the first administration and the second administration
C) decrease between the first administration and the second administration
D) be affected by the order they took the tests

Accepted Answer

The answer of When researchers want to measure test-retest reliability/precision,...

Question 17

When tests are heterogeneous, estimates of internal consistency are likely to be ______.

A) low
B) high
C) comparable
D) homogeneous

Accepted Answer

The answer of When tests are heterogeneous, estimates of internal...

Question 18

An employment test for the job of sales manager that measures knowledge of sales theory, interpersonal skills, and ability to use text messaging is ______.

A) homogeneous
B) heterogeneous
C) reliable
D) generalizable

Accepted Answer

The answer of An employment test for the job of...

Question 19

What is the best way to divide the test when using the split-half method?

A) put the first questions in one half and the last questions in the other half
B) put all multiple choice questions in one half and all essay questions in the other half
C) randomly assign each question to the first half or the second half
D) assign odd-numbered questions to one half and even-numbered questions to the other

Accepted Answer

The answer of What is the best way to divide...

Question 20

Which one of the following is required to yield an accurate estimate of reliability/precision using the split halves method?

A) the two halves must be equivalent in length and content
B) the scores on each half must be equivalent
C) the first half must contain more questions than the second half
D) the estimate must be calculated using the coefficient alpha

Accepted Answer

The answer of Which one of the following is required...

Question 21

Which one of the following formulas is used to adjust the reliability coefficient when two halves of one test are used?

A) Spearman Brown
B) coefficient alpha
C) Pearson product moment correlation
D) KR-20

Accepted Answer

The answer of Which one of the following formulas is...

Question 22

Which one of the following is used to provide an index of the strength and direction of the linear relationship between the two sets of scores?

A) coefficient alpha
B) correlation
C) KR-20
D) Spearman Brown

Accepted Answer

The answer of Which one of the following is used...

Question 23

What is the unexplained difference between the true score (T) and the obtained score (X) called?

A) systematic error
B) random error
C) internal consistency
D) coefficient alpha

Accepted Answer

The answer of What is the unexplained difference between the...

Question 24

The Spearman Brown formula is used when calculating ______.

A) only internal reliability
B) only split halves reliability
C) only scorer reliability
D) all forms of reliability

Accepted Answer

The answer of The Spearman Brown formula is used when...

Question 25

Jonathan wanted to estimate the internal consistency of a multiple choice test. Which one of the following would be most appropriate for him to use?

A) Spearman Brown
B) coefficient alpha
C) Cohen's kappa
D) KR-20

Accepted Answer

The answer of Jonathan wanted to estimate the internal consistency...

Question 26

Judy wants to estimate the internal consistency of a survey on which the respondent marks (1) Not at all, (2) Sometimes, (3) Most of the time, or (4) Always. Which one of the following is most appropriate for her to use?

A) Spearman Brown
B) coefficient alpha
C) Cohen's kappa
D) KR-20

Accepted Answer

The answer of Judy wants to estimate the internal consistency...

Question 27

Which one of the following statements is TRUE about error?

A) Systematic error increases the reliability of a test.
B) Systematic error lowers the reliability of a test.
C) Random error lowers the reliability of a test.
D) Both random and systematic error lower the reliability of a test.

Accepted Answer

The answer of Which one of the following statements is...

Question 28

Theoretically, if a test taker took a test an infinite number of times and an average score was calculated from those administrations, the average test score would ______.

A) equal the true test score
B) contain the true score and error
C) contain only error
D) have no meaning

Accepted Answer

The answer of Theoretically, if a test taker took a...

Question 29

Cohen's kappa is a statistical method for ______.

A) estimating test-retest reliability
B) estimating interrater agreement
C) estimating internal consistency
D) correlating ratings by two judges

Accepted Answer

The answer of Cohen's kappa is a statistical method for...

Question 30

Which one of the following statements regarding test reliability is TRUE?

A) Researchers must choose one method of estimating the reliability/precision of test scores.
B) Researchers can identify the causes of random error.
C) No measurement instrument is perfectly reliable or consistent.
D) A measurement instrument should be perfectly reliable or consistent.

Accepted Answer

The answer of Which one of the following statements regarding...

Question 31

What do we call the statistic that reflects the amount of inconsistency or error expected in an individual's test score?

A) reliability coefficient
B) standard deviation
C) standard error of measurement
D) correlation coefficient

Accepted Answer

The answer of What do we call the statistic that...

Question 32

Julio wants to calculate the interrater agreement for an essay exam that is scored by giving each essay a pass (2) or fail (1) mark. Which one of the following would be most appropriate for Julio to use?

A) Cohen's kappa
B) coefficient alpha
C) KR-20
D) Correlation

Accepted Answer

The answer of Julio wants to calculate the interrater agreement...

Question 33

Scorer reliability and agreement concern the consistency of what?

A) test scores
B) scorer judgments
C) different test forms
D) test takers' performance

Accepted Answer

The answer of Scorer reliability and agreement concerns the consistency...

Question 34

What do we call a single source of error that always increases or decreases the true score by the same amount?

A) the true score
B) the average score
C) random error
D) systematic error

Accepted Answer

The answer of What do we call a single source...

Question 35

Chris wants to calculate the interscorer reliability for an essay test that had two scorers. Which one of the following would be most appropriate for Chris to use?

A) Cohen's kappa
B) coefficient alpha
C) KR-20
D) Correlation

Accepted Answer

The answer of Chris wants to calculate the interscorer reliability...

Question 36

As the reliability/precision of a test decreases, which one of the following items increases?

A) standard deviation
B) standard error of measurement
C) internal consistency
D) difficulty

Accepted Answer

The answer of As the reliability/precision of a test decreases,...

Question 37

The range of scores that we feel confident includes the true score is called a ______.

A) standard deviation
B) standard error of measurement
C) confidence interval
D) normal curve

Accepted Answer

The answer of The range of scores that we feel...

Question 38

Which one of the following is most helpful to test developers who wish to increase the reliability/precision of a test's scores?

A) Spearman Brown formula
B) coefficient alpha
C) correlation
D) Cohen's kappa

Accepted Answer

The answer of Which one of the following is most...

Question 39

If a meteorologist uses a thermometer that always reads 1 degree higher than the actual temperature, then the error that results is ______. If a meteorologist is nearsighted and he reads the thermometer with a different amount and direction of inaccuracy each time, the error that results will be ______.

A) reliable; unreliable
B) unreliable; reliable
C) random; systematic
D) systematic; random

Accepted Answer

The answer of If a meteorologist uses a thermometer that...

Question 40

What is the amount of consistency among scorers' judgments called?

A) test-retest agreement
B) interscorer agreement
C) intrascorer agreement
D) internal agreement

Accepted Answer

The answer of What is the amount of consistency among...

Question 41

According to classical test theory, what would the reliability coefficient be when the variance of observed scores is equal to the variance of true scores?

A) 0
B) 0.5
C) 0.75
D) 1.0

Accepted Answer

The answer of According to classical test theory, what would...

Question 42

When a test score is used for selection or classification of individuals, it is advisable to calculate the standard error of measurement at the ______.

A) score used to make the classification decision
B) average score on the test
C) highest score on the test
D) lowest score on the test

Accepted Answer

The answer of When a test score is used for...

Question 43

Discuss Cohen's kappa. How is it used by test developers or researchers? Give an example.

Accepted Answer

The answer of Discuss Cohen's kappa. How is it used...

Question 44

Effective test administration is likely to ______.

A) increase error and lower test reliability
B) increase error and raise test reliability
C) decrease error and lower test reliability
D) decrease error and raise test reliability

Accepted Answer

The answer of Effective test administration is likely to ______. A)...

Question 45

Rita scored 96 on an employment test, and Naomi scored 98 on the same test. Naomi believes that she has the highest score, but Rita disagrees. Which of the following would be most helpful in determining whether Naomi's score is statistically higher than Rita's score?

A) the standard deviation and mean of the test scores
B) the standard error of measurement of the test scores
C) the reliability coefficient of the test scores
D) the coefficient of determination of the test scores

Accepted Answer

The answer of Rita scored 96 on an employment test,...

Question 46

Define reliability and describe three methods for estimating the reliability of a psychological test and its scores.

Accepted Answer

The answer of Define reliability and describe three methods for...

Question 47

According to classical test theory, what will the reliability coefficient be when the observed score variance is greater than true score variance?

A) 0
B) less than 1.0
C) exactly 1.0
D) over 1.0

Accepted Answer

The answer of According to classical test theory, what will...

Question 48

How do researchers and test developers identify systematic error in test scores?

A) using analysis of variance
B) using correlation
C) constructing a confidence interval
D) calculating the standard error of measurement

Accepted Answer

The answer of How do researchers and test developers identify...

Question 49

Which one of the following is most likely to decrease the reliability of a test?

A) poorly written questions
B) unprepared test takers
C) too many test questions
D) overlapping confidence intervals

Accepted Answer

The answer of Which one of the following is most...

Question 50

A nonparametric index for scorer agreement when the scores are nominal or ordinal is provided by ______.

A) coefficient alpha
B) KR-20
C) Cohen's kappa
D) Spearman Brown

Accepted Answer

The answer of A nonparametric index for scorer agreement when...

Question 51

Which one of the following proposes separating sources of systematic error from random error in order to eliminate systematic error?

A) classical test theory
B) generalizability theory
C) reliability theory
D) theory of the normal curve

Accepted Answer

The answer of Which one of the following proposes separating...

Question 52

Ivan conducted a reliability study for a test of computer skills. His results were a coefficient alpha of .95 and a test-retest r of .85 (4 weeks). Interpret these results including explaining why the reliability coefficients might have been different.

Accepted Answer

The answer of Ivan conducted a reliability study for a...

Question 53

What could a test developer do to increase internal consistency of a test?

A) accurately measure the test-retest reliability of the test
B) add well-written questions to each test form
C) add questions that measure the same concept
D) adjust the reliability coefficient using the KR-20 formula

Accepted Answer

The answer of What could a test developer do to...

Question 54

Describe how to gather evidence of reliability using the split halves method. Start your discussion at the time the test is administered and continue until the final numerical value is calculated and interpreted.

Accepted Answer

The answer of Describe how to gather evidence of reliability...

Question 55

Explain what the standard error of measurement is and how to use it to construct a confidence interval around an observed score.

Accepted Answer

The answer of Explain what the standard error of measurement...

Question 56

Identify and discuss the theory that explains why an observed test score is made up of the "true score" and "random error." Give examples.

Accepted Answer

The answer of Identify and discuss the theory that explains...

Deck 5: What Is Test Reliability/Precision