Question 1

Classical Test Theory assumes&#10;A)the length of a test has no bearing on its reliability.&#10;B)measurement errors occur systematically.&#10;C)it is not possible to estimate true scores.&#10;D)the distribution of random errors is the same for every respondent.

Accepted Answer

Classical Test Theory (CTT) assumes that any observed score is the sum of the true score and a random error component. This implies that the distribution of random errors is considered to be the same for every respondent, which is a fundamental assumption of CTT. It does not assume that measurement errors occur systematically (which would contradict the notion of errors being random), nor does it claim that it's impossible to estimate true scores (estimating true scores is a central goal of CTT). Additionally, CTT does recognize that the length of a test can impact its reliability, typically with longer tests being more reliable due to the reduction in the impact of random errors.

Question 2

Who developed methods for evaluating sources of error in behavioral research?&#10;A)Edward Thorndike&#10;B)Kuder and Richardson&#10;C)Charles Spearman&#10;D)Cronbach

Accepted Answer

Cronbach developed methods for evaluating sources of error in behavioral research, such as the Cronbach's alpha coefficient.

Question 3

If we repeatedly administered the same test to the same individual, the standard deviation of the person's score would be the&#10;A)standard error of the mean.&#10;B)variance.&#10;C)reliability of the test.&#10;D)standard error of measurement.

Accepted Answer

The standard deviation of an individual's scores over repeated administrations of the same test is referred to as the standard error of measurement. It quantifies the variability in scores due to measurement error, providing an estimate of the precision of an individual's observed test score.

Question 4

Theoretically, if Susie repeatedly took the 6^th grade achievement test, you would be able to find her true score by finding the ____ of the distribution of her scores. A)mean B)standard deviation C)variance D)standard error of measurement

Accepted Answer

The mean of the distribution of Susie's scores would represent her true score if she took the test an infinite number of times, as it averages out all the variations in her performance across the tests.

Question 5

Theoretically, reliability is&#10;A)the correlation of the observed test score with the true score.&#10;B)the square root of the ratio of true to the observed score.&#10;C)the ratio of true to the observed score squared.&#10;D)not possible to define.

Accepted Answer

Reliability in the context of testing and measurement refers to the consistency of a test score or measurement. Theoretically, it is conceptualized as the correlation between the observed test scores and the true scores, where the true score represents the actual attribute or ability being measured, free from any measurement errors.

Question 6

Assuming the &#34;rubber yardstick&#34; shrinks and expands at random, what can be said about the distribution of scores from the rubber yardstick?&#10;A)It will have a mean of zero (0).&#10;B)It will be normal.&#10;C)It will have a standard error of zero (0).&#10;D)It will be skewed.

Accepted Answer

If the rubber yardstick shrinks and expands at random, the distribution of scores will tend to be normal due to the central limit theorem, assuming a large enough sample size.

Question 7

The work of Charles Spearman combined what two measurement concepts?&#10;A)mean and variance&#10;B)sample statistics and population parameters&#10;C)sampling error and correlation&#10;D)reliability and validity

Accepted Answer

Spearman's work notably combined the concepts of sampling error and correlation, especially in the context of his development of factor analysis and the theory of general intelligence (g factor), where he explored the relationships between different cognitive abilities and how they could be explained by a single underlying factor.

Question 8

What is Spearman known for?&#10;A)Working out the basics of reliability theory&#10;B)Developing the notion of sampling error&#10;C)Creating methods for measuring error&#10;D)Developing multivariate analysis

Accepted Answer

Spearman is best known for his work in developing the basics of reliability theory, which is fundamental in the field of psychometrics and psychological testing.

Question 9

What is Cronbach known for?&#10;A)Developing measures to evaluate sources of error&#10;B)Creating the basics of multivariate analysis&#10;C)Developed the basics of contemporary measurement theory&#10;D)Distinguished between objective and subjective measures

Accepted Answer

Cronbach is known for developing measures to evaluate sources of error.

Question 10

We can get an idea of how much measurement error is present in a score through the&#10;A)true score.&#10;B)observed score.&#10;C)standard error of the mean.&#10;D)standard error of measurement.

Accepted Answer

The standard error of measurement gives an estimate of the amount of error present in a score, indicating how much scores might vary due to measurement error.

Question 11

Because classic test theory assumes a person's true score is the same over time, repeating the same test over and over gives a distribution of scores that reflect what?&#10;A)systematic error&#10;B)random error&#10;C)reliability&#10;D)internal consistency

Accepted Answer

The answer of Because classic test theory assumes a person's...

Question 12

The basic theory of reliability was first worked out by&#10;A)Karl Pearson.&#10;B)Charles Spearman.&#10;C)Julian Stanley.&#10;D)Lee Cronbach.

Accepted Answer

The answer of The basic theory of reliability was first...

Question 13

According to classical test theory, errors of measurement are&#10;A)always overestimates of true score.&#10;B)always underestimates of true score.&#10;C)random.&#10;D)constant.

Accepted Answer

The answer of According to classical test theory, errors of...

Question 14

When creating a test, one generally uses a subset of items to represent a larger construct. This is known as&#10;A)a population parameter.&#10;B)a domain sampling.&#10;C)a sampling error.&#10;D)descriptive statistics.

Accepted Answer

The answer of When creating a test, one generally uses...

Question 15

When talking about errors in terms of psychological testing, we are referring to the fact that:&#10;A)someone got an answer incorrect.&#10;B)there is always some inaccuracy in the measurement.&#10;C)the test was inappropriate for that particular group.&#10;D)the score is too subjective to be accurate.

Accepted Answer

The answer of When talking about errors in terms of...

Question 16

Repeated use of the same test typically results in different scores. How does classical test theory account for this?&#10;A)poor test validity&#10;B)systematic variability&#10;C)random error&#10;D)inattention

Accepted Answer

The answer of Repeated use of the same test typically...

Question 17

An observed score is composed of&#10;A)the residual and the true score.&#10;B)the criterion and the predictor.&#10;C)the measurement error and the predictor.&#10;D)the true score and the measurement error.

Accepted Answer

The answer of An observed score is composed of&#10;A)the residual...

Question 18

Which of the following is an important distinction between systematic errors and random errors?&#10;A)Random errors are more likely than systematic errors to cause errors in conclusions.&#10;B)Systematic errors occur only in objective measures and random errors occur only in subjective measures.&#10;C)Random errors can be eliminated by careful wording of test items.&#10;D)Systematic errors are extremely rare among psychological tests.

Accepted Answer

The answer of Which of the following is an important...

Question 19

If you have three clocks in your house, and every clock is 10 minutes fast, this is an example of&#10;A)systematic error.&#10;B)random error.&#10;C)measurement error.&#10;D)a rubber yardstick.

Accepted Answer

The answer of If you have three clocks in your...

Question 20

Classical Test Theory assumes that&#10;A)errors are systematic.&#10;B)errors are random.&#10;C)true scores cannot be estimated.&#10;D)the length of a test has no bearing on its reliability.

Accepted Answer

The answer of Classical Test Theory assumes that&#10;A)errors are systematic.&#10;B)errors...

Question 21

Sources of error associated with time sampling are measured using&#10;A)the test-retest method.&#10;B)the split half method.&#10;C)KR 20.&#10;D)the alpha method.

Accepted Answer

The answer of Sources of error associated with time sampling...

Question 22

How does the domain sampling model conceptualize reliability?&#10;A)The absolute value of the difference between the standard error of measurement and the variance&#10;B)The ratio of variance of the observed scores on the short version of a test and the variance of the long-run true scores&#10;C)The sum of squares of the difference between the observed and true scores&#10;D)The ratio of the number of sample items to the number of domain items, multiplied by the mean of the sample distribution

Accepted Answer

The answer of How does the domain sampling model conceptualize...

Question 23

Professor Pine constructed five different short history tests by randomly drawing questions from the huge pool of all possible questions about the current material. He has created&#10;A)randomly parallel tests.&#10;B)a large sample size.&#10;C)systematic errors.&#10;D)attenuation effects.

Accepted Answer

The answer of Professor Pine constructed five different short history...

Question 24

In the domain sampling model, the error that is being considered is the error caused by&#10;A)choosing the wrong domain.&#10;B)systematic error.&#10;C)using a limited sample of items.&#10;D)random error.

Accepted Answer

The answer of In the domain sampling model, the error...

Question 25

Tests designed according to item response theory&#10;A)are no longer considered useful.&#10;B)can only be used with non-objective material&#10;C)yield more reliable results with fewer items&#10;D)provide low-tech methods for field use.

Accepted Answer

The answer of Tests designed according to item response theory&#10;A)are...

Question 26

The difference between David's two typing tests, one at the beginning of the semester and one at the end, reflects the fact that he typed quite a few term papers during the semester. This reflects&#10;A)attenuation.&#10;B)random error.&#10;C)practice effects.&#10;D)domain sampling.

Accepted Answer

The answer of The difference between David's two typing tests,...

Question 27

A split-half correlation, KR 20, and coefficient alpha are all used to evaluate&#10;A)standard errors of measurement.&#10;B)internal consistency.&#10;C)variance.&#10;D)validity.

Accepted Answer

The answer of A split-half correlation, KR 20, and coefficient...

Question 28

Why might different random samples of domain items yield different estimates of the true score?&#10;A)sampling error&#10;B)poor reliability&#10;C)respondent error&#10;D)item bias

Accepted Answer

The answer of Why might different random samples of domain...

Question 29

Which of the following would tend to provide the most conservative estimate of split-half reliability?&#10;A)the Phillips method&#10;B)the Spearman-Brown formula&#10;C)coefficient alpha&#10;D)the odd-even reliability coefficient

Accepted Answer

The answer of Which of the following would tend to...

Question 30

Suppose you were trying to estimate the reliability of a whole test on the basis of the correlation between scores on the two halves of the test. In order to correct for using scores based on the halves, you might use the

A)KR 20.
B)alpha method.
C)Spearman-Brown formula.
D)split half method.

Accepted Answer

The answer of Suppose you were trying to estimate the...

Question 31

A reliability coefficient of .60 suggests that&#10;A)64% of the variance on the test is error.&#10;B)40% of the variance on the test is error.&#10;C)78% of the variance on the test is error.&#10;D)the test can be used for clinical purposes but not for research.

Accepted Answer

The answer of A reliability coefficient of .60 suggests that&#10;A)64%...

Question 32

If a researcher is attempting to assess the reliability of a measure of depression, the method of choice would be&#10;A)internal consistency.&#10;B)time sampling.&#10;C)the test-retest method.&#10;D)more than one of these.

Accepted Answer

The answer of If a researcher is attempting to assess...

Question 33

Federal government guidelines require that a test be&#10;A)standardized for use among all U.S. sub-populations.&#10;B)factor analyzed before it can be used to make employment decisions.&#10;C)reliable before it can be used to make employment decisions.&#10;D)reliable above the .90 level.

Accepted Answer

The answer of Federal government guidelines require that a test...

Question 34

The Spearman Brown formula corrects for deflated reliability due to&#10;A)half-length tests.&#10;B)small sample size.&#10;C)systematic error.&#10;D)poor test item construction.

Accepted Answer

The answer of The Spearman Brown formula corrects for deflated...

Question 35

Dr. Janine developed two equivalent forms of a test and administered them both, in counter-balanced order, to a group of people on the same day in order to assess reliability. What is this called?&#10;A)test- retest&#10;B)parallel forms&#10;C)split-half&#10;D)KR 20

Accepted Answer

The answer of Dr. Janine developed two equivalent forms of...

Question 36

Dr. Smith is trying to determine the reliability of a new personality test. Two randomly parallel tests, A and B, have a correlation of .81. What is the estimated reliability of the new personality test?

A).81
B)-.9
C).9
D).81/ t

Accepted Answer

The answer of Dr. Smith is trying to determine the...

Question 37

The problems created by using a limited number of items to represent a larger and more complicated construct are explicitly considered in the ____ model.&#10;A)multivariate&#10;B)random sampling&#10;C)domain sampling&#10;D)standard error of measurement

Accepted Answer

The answer of The problems created by using a limited...

Question 38

The method for estimating the internal consistency of a test that simultaneously considers all possible ways of splitting the items is the&#10;A)Spearman Brown formula.&#10;B)Kuder-Richardson formula.&#10;C)Cronbach's alpha.&#10;D)the odd-even method.

Accepted Answer

The answer of The method for estimating the internal consistency...

Question 39

Upon repeated applications of the same test, performance on the second application may be affected by previous experience on the test. This is known as&#10;A)attenuation.&#10;B)a carryover effect.&#10;C)shrinkage.&#10;D)selected recall.

Accepted Answer

The answer of Upon repeated applications of the same test,...

Question 40

As opposed to reliability based on the classical test theory, ____ focuses on the range of item difficulty that is useful in assessing an individual's ability.&#10;A)domain sampling&#10;B)internal consistency&#10;C)coefficient alpha&#10;D)item response theory

Accepted Answer

The answer of As opposed to reliability based on the...

Question 41

Which of the following is a problem in evaluating the agreement between observers in behavioral studies?&#10;A)The observers are usually not trained.&#10;B)The behaviors being studied are usually not directly observable.&#10;C)There will always be some agreement by chance.&#10;D)There is no method for evaluating the agreement between observers.

Accepted Answer

The answer of Which of the following is a problem...

Question 42

In order to determine the unidimensionality of a test, you can use&#10;A)factor analysis.&#10;B)split half reliability.&#10;C)parallel forms assessment.&#10;D)the Spearman-Brown prophecy formula.

Accepted Answer

The answer of In order to determine the unidimensionality of...

Question 43

Test constructors can improve test reliability by&#10;A)increasing the number of items.&#10;B)decreasing the number of items.&#10;C)retaining items that have the most face validity.&#10;D)reducing the item to total correlation.

Accepted Answer

The answer of Test constructors can improve test reliability by&#10;A)increasing...

Question 44

Correction for attenuation is used&#10;A)to estimate the validity of a test.&#10;B)to correct for tests that are short.&#10;C)to correct for tests that are long.&#10;D)to estimate the true correlation between variables that have been measured with error.

Accepted Answer

The answer of Correction for attenuation is used&#10;A)to estimate the...

Question 45

Measures of test-retest reliability are sometimes considered inappropriate for the evaluation of health status because&#10;A) health status tests should not given at multiple points in time.&#10;B)variations in health status may be related to true changes over time rather than measurement error.&#10;C)there is no domain of health status.&#10;D)health status is too complicated to measure.

Accepted Answer

The answer of Measures of test-retest reliability are sometimes considered...

Question 46

The difference between KR 20 and coefficient alpha is&#10;A)KR 20 can be used to evaluate time sampling problems while alpha cannot.&#10;B)Alpha can be used to evaluate time sampling problems while KR 20 cannot.&#10;C)KR 20 can only be used for items scored right or wrong but Alpha can be used for items in any format.&#10;D)Alpha can only be used for items scored right or wrong but KR 20 can be used for items in any format.

Accepted Answer

The answer of The difference between KR 20 and coefficient...

Question 47

The kappa statistic is used to&#10;A)assess the level of agreement among several observers.&#10;B)estimate the correlation between a continuous variable and an artificially dichotomous variable.&#10;C)estimate the percentage of disagreement between observers.&#10;D)estimate the validity of behavioral observation.

Accepted Answer

The answer of The kappa statistic is used to&#10;A)assess the...

Question 48

If the same test, given at different points in time to the same test takers, yields different scores, then the method typically used to assess this source of error is&#10;A)test-retest.&#10;B)alternate forms/parallel forms.&#10;C)split-half.&#10;D)KR 20.

Accepted Answer

The answer of If the same test, given at different...

Question 49

Jennifer read a report in which the agreement between raters of children's aggressive behavior was .50, indicating&#10;A)the raters agreed at chance levels.&#10;B)agreement was poor.&#10;C)agreement was excellent.&#10;D)agreement was moderate.

Accepted Answer

The answer of Jennifer read a report in which the...

Question 50

Which of the following is true of the parallel forms method?&#10;A)It is the most often used method for estimating reliability.&#10;B)It provides one of the most rigorous methods for estimating reliability.&#10;C)It is largely ineffective with psychological tests.&#10;D)Sophisticated computer programs have made it unnecessary.

Accepted Answer

The answer of Which of the following is true of...

Question 51

Standard errors of measurement are used to&#10;A)determine whether an observed score is the &#34;true&#34; score.&#10;B)determine the standard deviation of the scores.&#10;C)calculate the exact true score.&#10;D)create confidence intervals around specific observed test scores.

Accepted Answer

The answer of Standard errors of measurement are used to&#10;A)determine...

Question 52

The preferred method for assessing the level of agreement between observers is the&#10;A)kappa statistic&#10;B)Spearman coefficient&#10;C)coefficient alpha&#10;D)rank-order statistic

Accepted Answer

The answer of The preferred method for assessing the level...

Question 53

Approximately what value must a reliability coefficient have for most purposes in basic research?&#10;A).90&#10;B).50&#10;C).70&#10;D).30

Accepted Answer

The answer of Approximately what value must a reliability coefficient...

Question 54

What is the impact of carryover effects on test-retest reliability?&#10;A)Test-retest reliability is not influenced by carryover effects.&#10;B)Carryover effects result in an overestimation of reliability.&#10;C)Carryover effects result in an underestimation of reliability.&#10;D)Test-retest reliability increases carryover effects.

Accepted Answer

The answer of What is the impact of carryover effects...

Question 55

The reliability of a difference score is&#10;A)equal to the reliability of the most reliable of the two measures.&#10;B)equal to the reliability of the least reliable of the two measures.&#10;C)the average reliability of the two measures.&#10;D)expected to be lower than the reliability of either of the two measures.

Accepted Answer

The answer of The reliability of a difference score is&#10;A)equal...

Question 56

Difference scores are created by&#10;A)subtracting one test score from another.&#10;B)subtracting the true score from a predicted score.&#10;C)eliminating error from true scores.&#10;D)giving a test to two different individuals.

Accepted Answer

The answer of Difference scores are created by&#10;A)subtracting one test...

Question 57

The standard error of measurement allows us to&#10;A)estimate the degree to which a test provides inaccurate readings.&#10;B)have an acceptable margin of error.&#10;C)determine the source of error.&#10;D)avoid any measurement error.

Accepted Answer

The answer of The standard error of measurement allows us...

Question 58

Which of the following is used to estimate the number of items that should be added to a test to achieve a specified reliability?&#10;A)KR 20&#10;B)coefficient alpha&#10;C)Spearman-Brown prophecy formula&#10;D)split-half technique

Accepted Answer

The answer of Which of the following is used to...

Question 59

Which of the following is a source of measurement error?&#10;A)respondent sampling&#10;B)scorer sampling&#10;C)internal consistency&#10;D)external consistency

Accepted Answer

The answer of Which of the following is a source...

Question 60

Items are probably measuring the same thing when the correlation between an item and the total score&#10;A)is high.&#10;B)is low.&#10;C)approaches 0.&#10;D)is negative.

Accepted Answer

The answer of Items are probably measuring the same thing...

Question 61

What is the most useful indicator of reliability for the interpretation of individual scores?&#10;A)split-half variance&#10;B)item sampling&#10;C)test-retest&#10;D)standard error of measurement

Accepted Answer

The answer of What is the most useful indicator of...

Question 62

Explain how someone might decide how reliable is &#34;reliable enough&#34; for a measure. What settings might warrant more stringent criteria for reliability, and why?

Accepted Answer

The answer of Explain how someone might decide how reliable...

Question 63

Describe some of the advantages and disadvantages associated with behavioral observation techniques. Provide examples.

Accepted Answer

The answer of Describe some of the advantages and disadvantages...

Question 64

Briefly discuss each of the APA's standards for reliability.

Accepted Answer

The answer of Briefly discuss each of the APA's standards...

Deck 4: Reliability