Question 1

Which one of the following provides important information for increasing the test's internal consistency?&#10;A) discrimination index&#10;B) difficulty level&#10;C) interitem correlation matrix&#10;D) coefficient of multiple correlation

Accepted Answer

Interitem correlation matrix provides information about the relationships between different items in the test. A high interitem correlation indicates that the items are measuring the same construct and increases the internal consistency of the test. Therefore, analyzing the interitem correlation matrix is important for increasing a test's internal consistency.

Question 2

Which one of the following outcomes indicates that a test item should be retained in a test?&#10;A) High performers answered the item correctly and low performers answered the item incorrectly.&#10;B) High performers answered the item correctly and low performers answered the item correctly.&#10;C) High performers answered the item incorrectly and low performers answered the item correctly.&#10;D) High performers answered the item incorrectly and low performers answered the item incorrectly.

Accepted Answer

When high performers answered the item correctly and low performers answered the item incorrectly, it indicates that the item is discriminating well between high and low performers, and thus should be retained in the test.

Question 3

The percentage of test takers who respond correctly to a test item is a measure of the item's ______.&#10;A) difficulty&#10;B) bias&#10;C) ability to discriminate&#10;D) item-total correlation

Accepted Answer

The percentage of test takers who respond correctly to a test item is a measure of the item's difficulty. Difficulty is a measure of how hard or easy a test taker finds an item, and is calculated by dividing the number of test takers who answered correctly by the total number of test takers who attempted the item. It is an important aspect of item analysis as it helps test developers to decide which items to keep or discard and whether the overall test difficulty is appropriate for the intended population.

Question 4

One advantage of empirically based tests is that ______.&#10;A) they have strong validity coefficients&#10;B) their internal reliability is high&#10;C) test takers prefer them to other types of tests&#10;D) it is more difficult for test takers to fake responses

Accepted Answer

Empirically based tests are designed to measure traits or behaviors based on empirical data, making it harder for test takers to manipulate or fake their responses.

Question 5

Which one of the following statistics can be used to make decisions about retaining or discarding an item based on how well the item discriminates between high- and low-scoring test takers?&#10;A) item-total correlation&#10;B) inter-item correlation&#10;C) difficulty level&#10;D) discrimination index

Accepted Answer

The answer of Which one of the following statistics can...

Question 6

When test developers examine the discrimination indexes of each item, which one of the following outcomes do they consider being most desirable?&#10;A) low positive numbers&#10;B) high positive numbers&#10;C) average positive numbers&#10;D) low negative numbers

Accepted Answer

When test developers examine the discrimination index of each item, they consider a high positive number to be most desirable because it implies that the item effectively differentiates between high and low performers on the test. A low or negative discrimination index suggests that the item may not be working as intended and may need to be revised or removed.

Question 7

What is the discrimination index?&#10;A) a comparison of the scores of respondents by sex, race, or other personal characteristics&#10;B) an index of how difficult each test item is&#10;C) a comparison of high performer scores with low performer scores on each item&#10;D) cumulative results from an item analysis yielding an overall score for the test

Accepted Answer

The discrimination index is a measure used in educational testing to determine how well a particular question differentiates between high-performing and low-performing students. It compares the scores of the top scorers with those of the bottom scorers on each test item.

Question 8

Which one of the following tests is an example of an empirically based test?&#10;A) Mathematics Self-Efficacy Test&#10;B) Myers-Briggs type indicator&#10;C) Minnesota Multiphasic Personality Inventory&#10;D) Graduate Record Exam

Accepted Answer

The Minnesota Multiphasic Personality Inventory is an example of an empirically based test because it was developed using a systematic, scientific approach to test development and has been extensively researched and validated through the use of statistical analysis and examination of empirical data. The other options may have elements of empiricism in their development, but do not meet the full criteria of an empirically based test.

Question 9

Dividing the number of persons who answered correctly by the total number of persons who responded to the question is a measure of an item's ______.&#10;A) discrimination index&#10;B) phi coefficient&#10;C) difficulty&#10;D) bias

Accepted Answer

Dividing the number of persons who answered correctly by the total number of persons who responded to the question gives us the difficulty index of the item, which indicates how challenging the item is for the respondents. The discrimination index would require comparing the performance of high and low scorers, the phi coefficient is a measure of association between two dichotomous variables, and bias refers to systematic errors in measurement.

Question 10

How do researchers calculate the discrimination index?&#10;A) They calculate the difference between the percentage of upper performers and the percentage of lower performers who responded correctly.&#10;B) They calculate the percentage of test takers who answered the item correctly.&#10;C) They compare the percentage of test takers who answered the item correctly by demographic group.&#10;D) They develop a matrix that contains the results of the item analyses for each item and calculate a score by adding the matrix columns.

Accepted Answer

The discrimination index is calculated by finding the difference between the percentage of upper performers (those who score well on the test overall) and the percentage of lower performers (those who score poorly on the test overall) who responded correctly to a specific test item. This indicates whether the item is easier for higher performing test-takers and more difficult for lower performing test-takers, which provides information about the quality of the item.

Question 11

Tests that are designed to classify individuals into two or more categories based on their scores on a criterion measure are called ______.&#10;A) empirically based tests&#10;B) pilot tests&#10;C) diagnostic tests&#10;D) screening tests

Accepted Answer

The answer of Tests that are designed to classify individuals...

Question 12

How are phi coefficients interpreted?&#10;A) the same as discrimination coefficients&#10;B) the same as reliability coefficients&#10;C) the same as validity coefficients&#10;D) the same as Pearson product moment correlations

Accepted Answer

The answer of How are phi coefficients interpreted?&#10;A) the same...

Question 13

When items have a very low or very high p value, test developers ______.&#10;A) accept them as good items&#10;B) know that they contribute to the variability of the test scores&#10;C) rewrite or discard the items&#10;D) have evidence that the item is valid

Accepted Answer

The answer of When items have a very low or...

Question 14

The formula D = U &#8722; L is used to calculate which one of the following statistics? A. discrimination index&#10;B) difficulty level&#10;C) item-total correlation&#10;D) item-total index

Accepted Answer

The answer of The formula D = U &#8722; L...

Question 15

To increase internal consistency, items that correlate well with other items measuring the same construct should be ______.&#10;A) dropped&#10;B) retained&#10;C) rewritten&#10;D) grouped together

Accepted Answer

The answer of To increase internal consistency, items that correlate...

Question 16

Which one of the following ranges of item p values yield distribution of test scores with the most variation?&#10;A) 0-0.3&#10;B) 0.4-0.6&#10;C) 0.7-1&#10;D) 1-3

Accepted Answer

The answer of Which one of the following ranges of...

Question 17

Test items that everyone gets &#34;right&#34; or everyone gets &#34;wrong&#34; provide ______.&#10;A) evidence the test yields a wide range of scores&#10;B) proof the test is not biased against minorities&#10;C) support for the validity of the test questions&#10;D) no basis for a comparison of test takers' abilities

Accepted Answer

The answer of Test items that everyone gets &#34;right&#34; or...

Question 18

What are phi coefficients?&#10;A) the correlation between two dichotomous variables&#10;B) the correlation between two sets of test scores&#10;C) the correlation between a test item and the total test score&#10;D) they correlation of item difficulty and item bias

Accepted Answer

The answer of What are phi coefficients?&#10;A) the correlation between...

Question 19

An interitem correlation matrix displays the ______.&#10;A) reliability and validity coefficients for each item&#10;B) difficulty of each item on the test&#10;C) correlation of each item with every other item on the test&#10;D) correlation of each item with the total test score for all test takers

Accepted Answer

The answer of An interitem correlation matrix displays the ______.&#10;A)...

Question 20

What is a quantitative item analysis?&#10;A) numerical data from respondent questionnaires about the test&#10;B) analysis of data from respondent questionnaires about the test&#10;C) statistical analyses of the responses test takers gave to individual items&#10;D) statistical analyses of the test's validity

Accepted Answer

The answer of What is a quantitative item analysis?&#10;A) numerical...

Question 21

When a test yields significantly different validity coefficients for different subgroups, we say it has ______.&#10;A) single-group validity&#10;B) differential validity&#10;C) discriminant validity&#10;D) nongroup validity

Accepted Answer

The answer of When a test yields significantly different validity...

Question 22

The item characteristic curve can provide a picture of an item's ______.&#10;A) distribution of responses&#10;B) interitem and item-total correlation&#10;C) level of difficulty and discrimination&#10;D) reliability and validity

Accepted Answer

The answer of The item characteristic curve can provide a...

Question 23

Which one of the following is a characteristic of a good test item--one that should be retained in the final version of the test?&#10;A) low discrimination index&#10;B) item characteristic curves that have very little slope&#10;C) difficulty level of 0.5&#10;D) interitem correlation coefficient near 0

Accepted Answer

The answer of Which one of the following is a...

Question 24

What are cut scores?&#10;A) scores that would have been higher had it not been for test bias&#10;B) mean, median, and mode of the norm distribution&#10;C) decision points for dividing test scores into pass/fail groupings&#10;D) transformed scores, such as z and T scores

Accepted Answer

The answer of What are cut scores?&#10;A) scores that would...

Question 25

How does computerized adaptive testing (CAT) choose items for individuals taking the test?&#10;A) Questions are predetermined before the individual takes the test.&#10;B) Software chooses items based on level of ability determined from previous responses.&#10;C) Software chooses items based on level of ability determined from average of previous responses.&#10;D) Software chooses items based on predicted level of individual's ability.

Accepted Answer

The answer of How does computerized adaptive testing (CAT) choose...

Question 26

The main purpose of the validation study is to ______.&#10;A) get the reactions of test takers and test users&#10;B) gather data on the construct(s) that the test measures&#10;C) confirm the test's ability to yield meaningful and accurate results&#10;D) comply with legal requirements that tests must have evidence of validity

Accepted Answer

The answer of The main purpose of the validation study...

Question 27

Which one of the following is a part of quantitative item analysis?&#10;A) constructing an agenda for a test takers' group discussion&#10;B) constructing one-on-one interviews for test takers&#10;C) constructing item characteristic curves&#10;D) constructing numerical rating scales for test-taker questionnaires

Accepted Answer

The answer of Which one of the following is a...

Question 28

What is an item characteristic curve?&#10;A) a line that describes the probability of answering an item correctly plotted against the level of ability on the trait being measured&#10;B) a line that describes the distribution of responses to a single item on the trait being measured&#10;C) a histogram constructed for the responses to a single item on the trait being measured&#10;D) a line similar to the normal curve that results from graphing the discrimination index against difficulty level for the trait being measured ______.

Accepted Answer

The answer of What is an item characteristic curve?&#10;A) a...

Question 29

To determine the maximum likelihood estimation, computerized adaptive testing (CAT) software weights all of the following EXCEPT ______.&#10;A) pseudo-guessing parameter&#10;B) difficulty&#10;C) discrimination&#10;D) ability

Accepted Answer

The answer of To determine the maximum likelihood estimation, computerized...

Question 30

What is an advantage of computerized adaptive testing?&#10;A) It provides less data about the test taker.&#10;B) It takes less time to complete for the test taker.&#10;C) It is cheaper to administer.&#10;D) It is easier to create tests.

Accepted Answer

The answer of What is an advantage of computerized adaptive...

Question 31

Test developers can easily find the difficulty of an item by ______.&#10;A) looking at how steep the item characteristic curve is&#10;B) locating the point at which the item characteristic curve indicates a probability of .5 of answering correctly.&#10;C) subtracting the percentage of low performers who responded correctly from the percentage of high performers who responded correctly&#10;D) asking test takers to complete a qualitative item analysis survey on item difficulty

Accepted Answer

The answer of Test developers can easily find the difficulty...

Question 32

What is the purpose of test norms?&#10;A) to provide a structure that makes it easy to identify test bias&#10;B) to provide a reference point or structure for understanding one test taker's score&#10;C) to show that many people have taken the test and their scores were normally distributed&#10;D) to provide evidence of reliability and validity for the test

Accepted Answer

The answer of What is the purpose of test norms?&#10;A)...

Question 33

A test question has item bias when it ______.&#10;A) is easier for one group than for another group&#10;B) has a high discrimination index&#10;C) does not correlate with other test items&#10;D) does not correlate with the test's raw score

Accepted Answer

The answer of A test question has item bias when...

Question 34

A major difficulty in setting cut scores is ______.&#10;A) setting the score low enough that most test takers pass&#10;B) setting the score high enough that only the best test takers pass&#10;C) allowing for test error that may allow some to pass who should not pass&#10;D) calculating the standard error measurement for the test scores

Accepted Answer

The answer of A major difficulty in setting cut scores...

Question 35

What is item response theory?&#10;A) a part of classical test theory that specifies that item difficulty and discrimination are related&#10;B) a theory that relates the performance of each item to a statistical estimate of the test taker's ability on the construct being measured&#10;C) a theory that describes the cognitive steps a respondent takes before answering an item&#10;D) a theory that uses probability to estimate the test taker's honestly or motivation when answering an item

Accepted Answer

The answer of What is item response theory?&#10;A) a part...

Question 36

What is single-group validity?&#10;A) when validity coefficients for different subgroups differ&#10;B) when a test is valid for one group but not for another group&#10;C) when the target audience comprises only one type of test takers&#10;D) when the test is valid for use only one time

Accepted Answer

The answer of What is single-group validity?&#10;A) when validity coefficients...

Question 37

What is cross-validation?&#10;A) a repeat of the validation study sometimes using another sample of test takers&#10;B) a validation study whose participants represent all minority groups&#10;C) carrying out the validation in locations throughout a state or country&#10;D) an alternative form of validation that can be accomplished using a statistical formula

Accepted Answer

The answer of What is cross-validation?&#10;A) a repeat of the...

Question 38

Which one of the following is NOT a method for conducting a qualitative item analysis?&#10;A) asking test takers to fill out a questionnaire about the test&#10;B) asking test takers to attend group discussions&#10;C) using item characteristic curves to assess item bias&#10;D) using one-on-one interviews with test takers to find out how they interpreted the test questions

Accepted Answer

The answer of Which one of the following is NOT...

Question 39

The validity coefficients that result when a test is cross-validated are usually expected to be ______.&#10;A) the same as the validity coefficients found in the original validation study&#10;B) lower than the validity coefficients found in the original validation study&#10;C) higher than the validity coefficients found in the original validation study&#10;D) unrelated to the validity coefficients found in the original validation study

Accepted Answer

The answer of The validity coefficients that result when a...

Question 40

Pierre conducted a validation study, and he found that the test was only valid for French-speaking males. What kind of validity did he identify?&#10;A) between-group validity&#10;B) single-group validity&#10;C) differential validity&#10;D) among-groups validity

Accepted Answer

The answer of Pierre conducted a validation study, and he...

Question 41

When a common regression line for two groups is used to predict performance, but the individual regression lines for the groups differ where they cross the y-axis, which one of the following types of predictive bias is present?

A) method bias
B) construct bias
C) slope bias
D) intercept bias

Accepted Answer

The answer of When a common regression line for two...

Question 42

What was the problem that testing experts pointed out in the Golden Rule case settlement with the Educational Testing Service (ETS)?&#10;A) There are no laws that address test bias or discrimination among subgroups of test takers.&#10;B) The exam developed by ETS had single group validity for Whites only.&#10;C) Some items on the exam developed by ETS were easier for Whites than for Blacks.&#10;D) Comparing item difficulty levels in the form of p values failed to take into consideration the test takers' level of ability.

Accepted Answer

The answer of What was the problem that testing experts...

Question 43

Explain the purpose of a cut score and describe two methods for identifying a cut score. Give examples for each method.

Accepted Answer

The answer of Explain the purpose of a cut score...

Question 44

Explain the concepts of predictive bias, differential validity, and single-group validity. Give an example of each.

Accepted Answer

The answer of Explain the concepts of predictive bias, differential...

Question 45

Recent research on the tests of cognitive validity have found that such tests are ______.&#10;A) equally valid for minority and majority test takers&#10;B) more valid for minority test takers than they are for majority test takers&#10;C) more valid for majority test takers than they are for minority test takers&#10;D) not valid for some groups of test takers

Accepted Answer

The answer of Recent research on the tests of cognitive...

Question 46

Two first-year college students, Carlos and Carl, are used to taking different types of academic tests. Carlos has mostly taken essay tests, and Carl has mostly taken multiple choice tests. What problem may arise when they take the same tests in college?

A) Depending on the type of tests given, there may be construct bias present in the test scores.
B) Depending on the type of tests given, there may be method bias present in the test scores.
C) Depending on the type of tests given, there may be reliability bias present in the test scores.

Accepted Answer

The answer of Two first-year college students, Carlos and Carl,...

Question 47

Describe the processes and expected outcomes of validation and cross-validation studies. Explain why each is important.

Accepted Answer

The answer of Describe the processes and expected outcomes of...

Question 48

Items for which the p value falls in the range of 0.90 to 1.00 are usually considered ______.&#10;A) too difficult&#10;B) somewhat difficult&#10;C) somewhat easy&#10;D) too easy

Accepted Answer

The answer of Items for which the p value falls...

Question 49

Describe how we collect and interpret data for a qualitative item analysis. Discuss the types of information the test developer should seek and give examples of questions.

Accepted Answer

The answer of Describe how we collect and interpret data...

Question 50

What are item-total correlations? What is their purpose in an item analysis?

Accepted Answer

The answer of What are item-total correlations? What is their...

Question 51

Identify and explain the criteria for retaining and dropping items to revise a test. Make a matrix with faked data for 5 items. Explain why each item should or should not be retained.

Accepted Answer

The answer of Identify and explain the criteria for retaining...

Question 52

Describe how we collect, analyze, and interpret data for a quantitative item analysis. Give examples.

Accepted Answer

The answer of Describe how we collect, analyze, and interpret...

Question 53

What is the purpose of an item discrimination index and how is it calculated?

Accepted Answer

The answer of What is the purpose of an item...

Question 54

While test validity is a statistical concept, test fairness is a ______.&#10;A) social concept&#10;B) mathematical concept&#10;C) scientific concept&#10;D) meaningless concept

Accepted Answer

The answer of While test validity is a statistical concept,...

Question 55

Which one of the following explanations was thought to be the most likely reason why recent research on tests of cognitive ability showed that there was differential validity on the tests when test takers from majority and minority groups were compared?

A) range restriction in the minority group
B) range restriction in the majority group
C) culture bias for the majority group
D) culture bias for the minority group

Accepted Answer

The answer of Which one of the following explanations was...

Question 56

What is the importance of item difficulty and how is it calculated?

Accepted Answer

The answer of What is the importance of item difficulty...

Question 57

What is the line in this graph called?  &#10;A) normal curve&#10;B) regression line&#10;C) item characteristic curve&#10;D) probability curve

Accepted Answer

The answer of What is the line in this graph...

Question 58

What are interitem correlations? What is their purpose in an item analysis?

Accepted Answer

The answer of What are interitem correlations? What is their...

Question 59

When the regression lines that predict performance for two groups have different slopes, which one of the following types of measurement bias is likely to occur?&#10;A) single-group validity&#10;B) differential validity&#10;C) culture bias&#10;D) method bias

Accepted Answer

The answer of When the regression lines that predict performance...

Question 60

When a common regression line for two groups is used to predict performance, but the individual regressions lines for the groups differ where they cross the Y axis, which one of the following problems may occur?&#10;A) The performance of the group whose regression line crosses the y-axis at a higher point will be overpredicted.&#10;B) The performance of the group whose regression line crosses the y-axis at a lower point will be overpredicted.&#10;C) The performance of both groups will be overpredicted.&#10;D) The performance of neither group will be overpredicted.

Accepted Answer

The answer of When a common regression line for two...

Question 61

What is item bias? How do test developers and researchers identify item bias?

Accepted Answer

The answer of What is item bias? How do test...

Question 62

Define and describe types of bias in psychological testing. Discuss different types of predictive bias. Give examples of each type.

Accepted Answer

The answer of Define and describe types of bias in...

Question 63

What is cross-validation and what is its purpose and importance? Describe two methods for finding the criterion-related validity coefficient for cross-validation.

Accepted Answer

The answer of What is cross-validation and what is its...

Question 64

What are item-criterion correlations? Describe one way in which developers use them?

Accepted Answer

The answer of What are item-criterion correlations? Describe one way...

Deck 11: How Do We Assess the Psychometric Quality of a Test