Question 1

Under what circumstance is it NOT to your advantage to guess on a multiple-choice exam?&#10;A) when you are making a &#34;wild guess&#34; and a correction formula is being used&#10;B) in any test situation where you are making a &#34;wild guess&#34;&#10;C) when you can rule out one or more of the alternatives as being incorrect&#10;D) when the guessing threshold is low

Accepted Answer

When a correction formula is applied to the scoring of a multiple-choice exam, guessing blindly (making a "wild guess") can reduce your score. This is because the correction formula typically penalizes incorrect answers to discourage random guessing, thus making it disadvantageous to guess without any basis.

Question 2

The following is an item from an attitude scale: Physical punishment is essential in order to control children.
Strongly disagree
Disagree
Neither agree or disagree
Agree
Strongly agree
This item is in the

A) category format.
B) Likert format.
C) dichotomous format.
D) polytomous format.

Accepted Answer

This item is in the Likert format because it presents a statement and asks for a level of agreement or disagreement on a five-point scale.

Question 3

Which item format can best be factor analyzed to find which ones group together?&#10;A) multiple-choice&#10;B) Likert&#10;C) dichotomous&#10;D) forced-choice

Accepted Answer

Likert items are best suited for factor analysis because they measure a construct on a continuum and have more variance than dichotomous or forced-choice items. Multiple-choice items can work for factor analysis, but they may not be as effective since the response options are often limited.

Question 4

Suppose you got 75 items correct on a 100-item,six alternative,multiple-choice exam.What would your score be after we corrected for guessing?&#10;A) 50&#10;B) 57&#10;C) 63&#10;D) 70

Accepted Answer

The answer of Suppose you got 75 items correct on...

Question 5

When distractors are likely to be selected as alternative responses on multiple-choice tests,&#10;A) validity is increased.&#10;B) item reliability is increased.&#10;C) item reliability is decreased.&#10;D) guessing is reduced.

Accepted Answer

The answer of When distractors are likely to be selected...

Question 6

One problem with the use of category rating scales is that&#10;A) many respondents are confused by dichotomous formats.&#10;B) responses are sometimes influenced by the context in which objects are rated.&#10;C) rating scales must be at least 100 points in order to be meaningfully interpreted.&#10;D) category rating scale data do not have ordinal scale property.

Accepted Answer

Category rating scales are susceptible to context effects because the categories are not defined by precise numerical values. This means that people's responses can be influenced by factors outside of what is being rated, such as the order of items or the wording of questions. While dichotomous formats (where there are only two options) can be confusing, this is not a specific problem with category rating scales. The number of points on a rating scale may impact interpretability, but there is no set requirement for a minimum number of points. Category rating scales can have ordinal properties, as the categories can be ranked in order from least to most desirable.

Question 7

In order to correct for guessing&#10;A) a correction formula can be used.&#10;B) distractors should be eliminated.&#10;C) the number of items should be increased.&#10;D) distractors should be increased.

Accepted Answer

A correction formula, such as the formula for guessing on multiple-choice questions (subtracting 1/4 point for each wrong answer), can be used to adjust scores and account for guessing. Eliminating distractors or increasing the number of items may improve the validity of the test, but it will not directly address the issue of guessing. Increasing the number of distractors may actually increase the likelihood of guessing, as it provides more options for the test-taker to choose from.

Question 8

In multiple choice examinations,incorrect alternatives are called&#10;A) flags.&#10;B) non-categories.&#10;C) distractors.&#10;D) miss rates.

Accepted Answer

Incorrect alternatives in multiple choice exams are referred to as distractors because their purpose is to distract or confuse students who do not know the correct answer. A) Flags are not a term used in connection with multiple choice exams. B) Non-categories is not a commonly used term in this context. D) Miss rates are the percentage of test-takers who do not answer a particular question correctly, which is not the same as incorrect alternatives.

Question 9

The expected level of chance performance,for a 200-item multiple-choice exam with four choice alternatives,is&#10;A) 25 correct.&#10;B) 50 correct.&#10;C) 75 correct.&#10;D) 100 correct.

Accepted Answer

The expected level of chance performance for a multiple-choice exam with four choice alternatives is 25% (1 out of 4 options is correct). Therefore, the expected number of correct answers by guessing on a 200-item exam is 200 x 0.25 = 50.

Question 10

True-false examinations use&#10;A) a dichotomous format.&#10;B) a polytomous format.&#10;C) a Likert format.&#10;D) a category format.

Accepted Answer

True-false examinations use a dichotomous format, meaning there are only two possible answer choices for each question. The other formats listed (polytomous, Likert, and category) involve more than two possible answer choices.

Question 11

A test format that is typically used for attitude measurement is the&#10;A) checklist format.&#10;B) dichotomous format.&#10;C) category format.&#10;D) Likert format.

Accepted Answer

The answer of A test format that is typically used...

Question 12

What describes the chances that a low-ability test taker will obtain each score?&#10;A) acquiescence response set&#10;B) the miss rate&#10;C) guessing threshold&#10;D) the moments method

Accepted Answer

The answer of What describes the chances that a low-ability...

Question 13

Describing the chances that low-ability test takers will obtain each score is called the&#10;A) dichotomous format.&#10;B) polytomous format.&#10;C) guessing threshold.&#10;D) 50% threshold.

Accepted Answer

The answer of Describing the chances that low-ability test takers...

Question 14

Suppose that you are taking a multiple choice test where there is no correction for guessing.If you aren't sure of the answer,&#10;A) only guess if you have some confidence you are correct.&#10;B) you should always guess on a speed test.&#10;C) you should always guess.&#10;D) you should never guess.

Accepted Answer

The answer of Suppose that you are taking a multiple...

Question 15

What format do some personality tests use because it requires an absolute judgment?&#10;A) multiple-choice&#10;B) Likert&#10;C) dichotomous&#10;D) category

Accepted Answer

The answer of What format do some personality tests use...

Question 16

The tendency for test takers to agree on most of the items is called a(n)&#10;A) guessing threshold.&#10;B) acquiescence response set.&#10;C) item difficulty.&#10;D) the miss rate.

Accepted Answer

The answer of The tendency for test takers to agree...

Question 17

This test item is an example of a&#10;A) polytomous format.&#10;B) dichotomous format.&#10;C) Likert format.&#10;D) category format.

Accepted Answer

The answer of This test item is an example of...

Question 18

Distractors that are obviously incorrect&#10;A) lower the reliability of the test.&#10;B) increase the reliability of the test.&#10;C) have no impact on the reliability of the test.&#10;D) reduce the likelihood of correct guessing.

Accepted Answer

The answer of Distractors that are obviously incorrect&#10;A) lower the...

Question 19

The difference between Likert scales and category formats is that&#10;A) category formats are used only in health settings.&#10;B) category formats tends to be dichotomous while Likert scales tends to be polytomous.&#10;C) category formats tend to have a smaller number of choices.&#10;D) Likert scales tend to have a smaller number of choices.

Accepted Answer

The answer of The difference between Likert scales and category...

Question 20

One method for measuring chronic pain asks the respondent to group statements according to how accurately they describe his/her discomfort.This would be an example of the&#10;A) Q-sort format.&#10;B) checklist format.&#10;C) Likert format.&#10;D) category format.

Accepted Answer

The answer of One method for measuring chronic pain asks...

Question 21

A multiple-choice test with five options has a chance performance level of&#10;A) .50.&#10;B) .25.&#10;C) .20.&#10;D) .10.

Accepted Answer

The answer of A multiple-choice test with five options has...

Question 22

Which of the following item writing recommendations has research support?&#10;A) All answer options should be plausible.&#10;B) Items should cover important concepts and objectives.&#10;C) All parts of an item or exercise should appear on the same page.&#10;D) There should be an equal number of true and false statements.

Accepted Answer

The answer of Which of the following item writing recommendations...

Question 23

Which method involves scoring that is very time consuming?&#10;A) dichotomous format.&#10;B) visual analogue scale.&#10;C) Likert scale.&#10;D) multiple-choice format.

Accepted Answer

The answer of Which method involves scoring that is very...

Question 24

The optimum level of item difficulty for a five-alternative multiple choice item is&#10;A) .50.&#10;B) .60.&#10;C) .70.&#10;D) .80.

Accepted Answer

The answer of The optimum level of item difficulty for...

Question 25

For most tests,the maximum amount of information about differences between individuals can be obtained from items in the difficulty range of&#10;A) .30 to .70.&#10;B) .40 to .80.&#10;C) between .55 and .85.&#10;D) above .90.

Accepted Answer

The answer of For most tests,the maximum amount of information...

Question 26

In the extreme group method of item analysis,&#10;A) point-biserial correlations are used.&#10;B) data from some test-takers are not used in the analysis.&#10;C) only the performance of those who scored extremely well is studied.&#10;D) distractors are eliminated.

Accepted Answer

The answer of In the extreme group method of item...

Question 27

Which type of item tends to lose reliability and become obsolete over time?&#10;A) factual items&#10;B) skill-based items&#10;C) items based on abstract concepts&#10;D) simple items

Accepted Answer

The answer of Which type of item tends to lose...

Question 28

When Lupe argued that one of the questions on the five-alternative test was unfairly difficult,the teacher simply replied by saying that the item difficulty was optimal at&#10;A) .50.&#10;B) .60.&#10;C) .625.&#10;D) .70.

Accepted Answer

The answer of When Lupe argued that one of the...

Question 29

The optimal item difficulty of a six-alternative test is&#10;A) .50.&#10;B) .585.&#10;C) .60.&#10;D) .625.

Accepted Answer

The answer of The optimal item difficulty of a six-alternative...

Question 30

If 50% of the individuals taking a particular test get a certain item correct,the difficulty (or easiness)level of that item would be&#10;A) .05.&#10;B) .25.&#10;C) .50.&#10;D) .10.

Accepted Answer

The answer of If 50% of the individuals taking a...

Question 31

How do Likert format tests differ from tests made of dichotomous and polytomous items?&#10;A) Likert format tests require far fewer items to achieve reliability and validity.&#10;B) Likert format items quantify characteristics rather than classifying responses as correct or incorrect.&#10;C) Likert format tests cannot be validated whereas dichotomous and polytomous item tests can be validated.&#10;D) Likert format items require higher literacy levels than do dichotomous and polytomous items.

Accepted Answer

The answer of How do Likert format tests differ from...

Question 32

If the five applicants for the chief financial officer position of ABC Company are highly qualified,the company should use a test that&#10;A) has easier items.&#10;B) discriminates 20% of the time.&#10;C) contains mostly difficult items.&#10;D) contains items ranging in difficulty from .30 to .70.

Accepted Answer

The answer of If the five applicants for the chief...

Question 33

Which testing method is popular for measuring self-rated health?&#10;A) q-sort technique&#10;B) visual analogue scale&#10;C) checklists&#10;D) category formats

Accepted Answer

The answer of Which testing method is popular for measuring...

Question 34

What is the impact of adding distractors on polytomous item reliability?&#10;A) The number of distractors is inversely related to item reliability.&#10;B) Large numbers of distractors can greatly increase reliability.&#10;C) Adding distractors may not increase reliability if the distractors are implausible.&#10;D) Reliability is optimized when there are 8 to 10 distractors.

Accepted Answer

The answer of What is the impact of adding distractors...

Question 35

Which of the following increases the likelihood that students will be likely to guess when they are not sure of the correct response on a multiple choice item?&#10;A) when they expect a low grade&#10;B) when the items are easy&#10;C) when the course is a required course&#10;D) when they dislike the subject

Accepted Answer

The answer of Which of the following increases the likelihood...

Question 36

Which of the following is a disadvantage of true-false tests?&#10;A) They are typically only useful with simple information.&#10;B) They encourage memorization without understanding.&#10;C) They are difficult to administer.&#10;D) They encourage rapid responding.

Accepted Answer

The answer of Which of the following is a disadvantage...

Question 37

The method of item analysis which looks at the correlation between performance on an item (correct or incorrect)and total test score is&#10;A) the extreme group method.&#10;B) the tetrachoric method.&#10;C) the point-biserial method.&#10;D) the item characteristic curve method.

Accepted Answer

The answer of The method of item analysis which looks...

Question 38

As the proportion of people who get an item on a test correct increases,the measure of item difficulty&#10;A) decreases.&#10;B) remains the same.&#10;C) increases.&#10;D) approaches chance.

Accepted Answer

The answer of As the proportion of people who get...

Question 39

When teachers are initially told that the students they will be teaching are either not very imaginative or are very imaginative,ratings using an adjective checklist will tend to reflect this original assessment.This is an example of

A) the effect of context.
B) visual analogue.
C) low sample size.
D) forced choice effect.

Accepted Answer

The answer of When teachers are initially told that the...

Question 40

Why have checklists fallen out of favor?&#10;A) They are simplistic.&#10;B) They are prone to error.&#10;C) They are difficult to write well.&#10;D) They cannot be validated.

Accepted Answer

The answer of Why have checklists fallen out of favor?&#10;A)...

Question 41

In experimental psychology,the proportion of the top third of the class that correctly answered the last question of the final was .93 while .89 of the bottom third of the class answered correctly.The professor should decide not to include this question in the next final because the discrimination index indicates

A) negative discrimination.
B) chance level performance.
C) that students were incorrectly prepared.
D) that the item does not discriminate well.

Accepted Answer

The answer of In experimental psychology,the proportion of the top...

Question 42

Exhibit 6-1&#10;  &#10;Refer to Exhibit 6-1.Which item discriminates well at low levels of performance but not at high levels?&#10;A) item a&#10;B) item b&#10;C) item c&#10;D) item d&#10;E) item e

Accepted Answer

The answer of Exhibit 6-1&#10;  &#10;Refer to Exhibit 6-1.Which...

Question 43

In item analysis,the internal criteria against which items are evaluated refers to the&#10;A) discrimination index.&#10;B) total test score.&#10;C) criterion.&#10;D) predictor.

Accepted Answer

The answer of In item analysis,the internal criteria against which...

Question 44

The least frequent score in a frequency polygon is the&#10;A) negative discriminator.&#10;B) discrimination point.&#10;C) antimode.&#10;D) criterion.

Accepted Answer

The answer of The least frequent score in a frequency...

Question 45

When test items are evaluated against total test score,we use a(n)&#10;A) internal criterion.&#10;B) external criterion.&#10;C) multivariate analysis.&#10;D) criterion referenced test.

Accepted Answer

The answer of When test items are evaluated against total...

Question 46

The approach to test construction in which the item characteristic curve for each individual item is analyzed is called&#10;A) prophecy theory.&#10;B) classical test theory.&#10;C) item response theory.&#10;D) item analysis theory.

Accepted Answer

The answer of The approach to test construction in which...

Question 47

Exhibit 6-1&#10;  &#10;Refer to Exhibit 6-1.Which item is unrelated to total test score performance?&#10;A) item a&#10;B) item b&#10;C) item c&#10;D) item d&#10;E) item e

Accepted Answer

The answer of Exhibit 6-1&#10;  &#10;Refer to Exhibit 6-1.Which...

Question 48

Exhibit 6-1&#10;  &#10;Refer to Exhibit 6-1.Which item discriminates at various levels of performance?&#10;A) item a&#10;B) item b&#10;C) item c&#10;D) item d&#10;E) item e

Accepted Answer

The answer of Exhibit 6-1&#10;  &#10;Refer to Exhibit 6-1.Which...

Question 49

Dr.H likes to start off his tests with a few easier items in order to boost the confidence of the test takers.This is an example of&#10;A) human factors.&#10;B) the psychometric properties of the test.&#10;C) optimum item difficulty.&#10;D) item difficulty.

Accepted Answer

The answer of Dr.H likes to start off his tests...

Question 50

The extreme group method and the point biserial method are both used to estimate&#10;A) reliability.&#10;B) validity.&#10;C) discriminability.&#10;D) difficulty.

Accepted Answer

The answer of The extreme group method and the point...

Question 51

Professor Plum created class intervals from the test scores for his class.He made a line graph using these intervals on the X-axis and the proportion of students who answered a particular question correctly on the Y-axis.The result is&#10;A) a discrimination index.&#10;B) a correlation index.&#10;C) an item characteristic curve.&#10;D) a histogram.

Accepted Answer

The answer of Professor Plum created class intervals from the...

Question 52

One of the major advantages of tests developed using item response theory is that they&#10;A) can be easily adapted for computer administration.&#10;B) are longer.&#10;C) are easier to administer.&#10;D) can be developed with little effort.

Accepted Answer

The answer of One of the major advantages of tests...

Question 53

In order to evaluate a criterion referenced test,the test was administered to a group of students who had studied a learning unit and to another group who had not studied the learning unit.For each item on the test,the criterion for mastery would be&#10;A) the point-biserial correlation.&#10;B) below the antimode.&#10;C) above the antimode.&#10;D) the validity coefficient.

Accepted Answer

The answer of In order to evaluate a criterion referenced...

Question 54

Proponents of criterion-referenced tests have criticized item analysis procedures because they&#10;A) cannot be used for criterion-referenced tests.&#10;B) have statistical flaws.&#10;C) do not provide information about the type of errors that students make.&#10;D) have no relevance for educational tests.

Accepted Answer

The answer of Proponents of criterion-referenced tests have criticized item...

Question 55

The proportion of test takers that get a &#34;good&#34; item correct increases as a function of the&#10;A) item characteristic curve.&#10;B) total test score.&#10;C) validity of the test.&#10;D) item difficulty.

Accepted Answer

The answer of The proportion of test takers that get...

Question 56

When 100% of the test-takers get an item correct,the item will have a&#10;A) low difficulty index (0%).&#10;B) high discriminability index.&#10;C) discriminability index of approximately .5.&#10;D) very low discriminability index.

Accepted Answer

The answer of When 100% of the test-takers get an...

Question 57

Exhibit 6-1&#10;  &#10;Refer to Exhibit 6-1.Which item is inversely related to performance on the test?&#10;A) item a&#10;B) item b&#10;C) item c&#10;D) item d&#10;E) item e

Accepted Answer

The answer of Exhibit 6-1&#10;  &#10;Refer to Exhibit 6-1.Which...

Question 58

An employment test attempted to find out if individuals who scored high on specific items that assessed an individual's 'ability to work well in a team' related strongly to the test as a whole.The purpose of the study was to evaluate&#10;A) human factors.&#10;B) optimum item difficulty.&#10;C) item discriminability.&#10;D) categories.

Accepted Answer

The answer of An employment test attempted to find out...

Question 59

An item characteristic curve that rises gradually and then turns down for people at the highest levels of performance&#10;A) is likely to occur when students are making a wild guess.&#10;B) can happen when 'none of the above' is one of the multiple choice options.&#10;C) turns down at a point referred to as the antimode.&#10;D) indicates an item with a high level of difficulty.

Accepted Answer

The answer of An item characteristic curve that rises gradually...

Question 60

The average of a series of item characteristic curves is known as&#10;A) the average characteristic curve.&#10;B) the standard error of the item characteristic.&#10;C) a test characteristic curve.&#10;D) the variance ratio curve.

Accepted Answer

The answer of The average of a series of item...

Question 61

In order to choose questions for a final version of a test,the examiners created a graph with difficulty on one axis and discriminability on the other.The examiners should use the questions that&#10;A) fall above the .50-point on the discriminability axis.&#10;B) fall below the .50 point on the discriminability axis.&#10;C) fall between .30 and .70 on difficulty and above .30 on discriminability.&#10;D) fall above the .50 point on discriminability and difficulty.

Accepted Answer

The answer of In order to choose questions for a...

Question 62

In most situations,a good test should contain items&#10;A) from a wide range of difficulty levels.&#10;B) at optimum difficulty levels.&#10;C) at levels appropriate for the test taker.&#10;D) mostly at or near average difficulty levels.

Accepted Answer

The answer of In most situations,a good test should contain...

Question 63

Which of the following is true of the characteristic curve for a &#34;good&#34; test item?&#10;A) It is normally distributed.&#10;B) It is bimodal and positively skewed.&#10;C) It has a gradual,positive slope.&#10;D) It is negatively accelerated.

Accepted Answer

The answer of Which of the following is true of...

Question 64

Which of the following methods is used in the analysis of item discriminability?&#10;A) test-retest&#10;B) extreme group&#10;C) characteristic curves&#10;D) factor analysis

Accepted Answer

The answer of Which of the following methods is used...

Deck 6: Writing and Evaluating Test Items