Question 1

The search for patterns in your data is called:&#10;A) inferential statistical analysis.&#10;B) descriptive data analysis.&#10;C) exploratory data analysis.&#10;D) exploratory pattern making.

Accepted Answer

Exploratory data analysis (EDA) is the process of analyzing data sets to summarize their main characteristics, often with visual methods, and to discover patterns.

Question 2

Why is it a good idea to explore your data using EDA techniques before you conduct any statistical tests?&#10;A) They help spot serious defects in your data that may warrant taking corrective action before you proceed to the inferential analysis.&#10;B) They can help you determine which summary statistics would be appropriate for a given set of data.&#10;C) They may reveal unsuspected influences.&#10;D) All of the answers are correct.

Accepted Answer

Exploratory Data Analysis (EDA) is crucial as it helps identify data issues, guides the choice of appropriate summary statistics, and uncovers hidden patterns or influences, making all the options correct.

Question 3

If categories are represented by numbers (e.g., 1 = Democrat, 2 = Republican), the categories are said to be:&#10;A) dummy-coded.&#10;B) nominalized.&#10;C) pseudonumbered.&#10;D) transformed.

Accepted Answer

Dummy coding is a method of representing categorical variables as binary variables. In this case, each category is represented by a single binary variable, with a value of 1 indicating that the observation belongs to that category and a value of 0 indicating that it does not.

Question 4

Identify a true statement about an unstacked format, which is a way of organizing data from experimental or quasi-experimental designs.&#10;A) This format is appropriate when data include multiple independent or dependent variables.&#10;B) This format fails to display treatment summary statistics in a simple way.&#10;C) In this format, multiple columns of subject numbers are needed.&#10;D) In this format, a separate column is created for the scores from each treatment.

Accepted Answer

In an unstacked format, each treatment or condition has its own column for scores, allowing for easy comparison and analysis of treatment effects.

Question 5

Which of the following is a disadvantage of a stacked format?&#10;A) It makes it difficult to include additional columns to indicate the observed values of additional variables.&#10;B) It fails to accommodate complex designs involving several quasi-independent variables.&#10;C) It does not provide a simple way to display treatment summary statistics.&#10;D) It is not an acceptable format for many computer statistical analysis packages.

Accepted Answer

The answer of Which of the following is a disadvantage...

Question 6

Dummy coding involves:&#10;A) assignment of numbers to the levels of a qualitative independent variable.&#10;B) creating new variables based on the values of the old ones.&#10;C) categorizing dependent variables according to their magnitudes.&#10;D) None of the answers is correct.

Accepted Answer

Dummy coding involves assigning numbers to the levels of a qualitative independent variable to include it in a regression model.

Question 7

For quantitative data (e.g., the number of milligrams of a drug), coding your data involves:&#10;A) dummy coding.&#10;B) transferring each subject's score to a computer coding sheet.&#10;C) assigning numeric values to categorical data.&#10;D) None of the answers is correct.

Accepted Answer

Quantitative data, like the number of milligrams of a drug, is typically entered directly into a computer coding sheet without the need for dummy coding or assigning numeric values to categorical data.

Question 8

When coding dependent variables, you should not worry about creating new variables or with making special categories because:&#10;A) computers cannot read such composite variables.&#10;B) they are rarely used in data analysis.&#10;C) most statistical analysis software have commands that lets one manipulate data in a variety of ways.&#10;D) All of the answers are correct.

Accepted Answer

Most statistical analysis software includes commands and functions that allow users to manipulate and transform data, including creating new variables and categories as needed for analysis.

Question 9

After entering data, _____.&#10;A) you need not check for data errors because the computer can do it for you&#10;B) the computer recognizes errors and corrects the data automatically&#10;C) you need not check for data errors because they are rare&#10;D) you should carefully check your data file for errors because the computer cannot detect incorrectly entered data

Accepted Answer

Computers cannot automatically detect all types of data entry errors, especially if the data is syntactically correct but semantically incorrect. Therefore, it is important to manually check for errors.

Question 10

A limitation of using grouped data is that:&#10;A) the average score may not represent the performance of individual subjects in a group.&#10;B) a curve resulting from plotting averaged data may not reflect the true nature of the psychological phenomenon being studied.&#10;C) grouping scores adds error variance to the data.&#10;D) the average score may not represent the performance of individual subjects in a group, and a curve resulting from plotting averaged data may not reflect the true nature of the psychological phenomenon being studied.

Accepted Answer

Grouped data can obscure individual variations and may not accurately reflect the underlying psychological phenomenon, as both the average score and the plotted curve can misrepresent individual performances and true data patterns.

Question 11

Examining individual data shows how each subject performed in a study when:&#10;A) you have repeated measures of the same behavior.&#10;B) several subjects per treatment group provide data measured on an interval scale.&#10;C) more than five subjects are included in each treatment group.&#10;D) data are measured on a ratio scale.

Accepted Answer

The answer of Examining individual data shows how each subject...

Question 12

When organizing data, a good strategy to adopt is:&#10;A) restrict yourself to looking at grouped data.&#10;B) restrict yourself to looking at individual data.&#10;C) to look at both grouped and individual data.&#10;D) to look at neither grouped nor individual data.

Accepted Answer

The answer of When organizing data, a good strategy to...

Question 13

The horizontal axis of a graph is called the _____.&#10;A) abscissa or x-axis&#10;B) abscissa or y-axis&#10;C) ordinate or x-axis&#10;D) ordinate or y-axis

Accepted Answer

The answer of The horizontal axis of a graph is...

Question 14

When graphing data from an experiment, levels of an independent variable are normally represented along the:&#10;A) y-axis.&#10;B) x-axis.&#10;C) z-axis.&#10;D) ordinate.

Accepted Answer

The answer of When graphing data from an experiment, levels...

Question 15

In a bar graph, the length of each bar reflects the value of the:&#10;A) independent variable.&#10;B) abscissa.&#10;C) dependent variable.&#10;D) quasi-independent variable.

Accepted Answer

The answer of In a bar graph, the length of...

Question 16

A bar graph is the best method of graphing data when your independent variable is:&#10;A) scaled on an interval scale.&#10;B) scaled on a ratio scale.&#10;C) continuous.&#10;D) categorical.

Accepted Answer

The answer of A bar graph is the best method...

Question 17

Which of the following statements is true about a bar graph?&#10;A) In this graph, each pair of scores is represented as a point on the graph.&#10;B) It is most appropriate when the independent variable is continuous and quantitative.&#10;C) In this graph, the width of each bar reflects the value of the dependent variable.&#10;D) It presents data as bars extending away from the axis representing the independent variable.

Accepted Answer

The answer of Which of the following statements is true...

Question 18

_____ are more appropriate than bar graphs when your independent variable is continuous and quantitative.&#10;A) Line graphs&#10;B) Scatter plots&#10;C) Pie graphs&#10;D) All of the answers are correct.

Accepted Answer

The answer of _____ are more appropriate than bar graphs...

Question 19

Line graphs are appropriate when one wants to illustrate:&#10;A) the main effects of variables.&#10;B) categorical relationships among variables.&#10;C) dichotomized data splits.&#10;D) functional relationships between variables.

Accepted Answer

The answer of Line graphs are appropriate when one wants...

Question 20

A(n) _____ curve is relatively flat at first and becomes progressively steep as it moves along the x-axis.&#10;A) negatively accelerated&#10;B) non-accelerated&#10;C) positively accelerated&#10;D) increasing

Accepted Answer

The answer of A(n) _____ curve is relatively flat at...

Question 21

A curve that is steep at first but becomes progressively flatter as it moves along the x-axis is:&#10;A) negatively accelerated.&#10;B) positively accelerated.&#10;C) nonmonotonic.&#10;D) non-accelerated.

Accepted Answer

The answer of A curve that is steep at first...

Question 22

When a curve levels off at some maximum or minimum value, the function is said to be _____ at that value.&#10;A) nonlinear&#10;B) nonmonotonic&#10;C) positively accelerated&#10;D) asymptotic

Accepted Answer

The answer of When a curve levels off at some...

Question 23

A curve that represents a uniformly increasing or decreasing function is said to be:&#10;A) monotonic.&#10;B) nonmonotonic.&#10;C) cubic.&#10;D) asymptotic.

Accepted Answer

The answer of A curve that represents a uniformly increasing...

Question 24

Pairs of scores from a correlational study are usually represented as points on a:&#10;A) histogram.&#10;B) scatter plot.&#10;C) line graph.&#10;D) pie graph.

Accepted Answer

The answer of Pairs of scores from a correlational study...

Question 25

If your data are in the form of proportions or percentages, then a good type of graph to represent the value of each category in an analysis would be a:&#10;A) line graph.&#10;B) histogram.&#10;C) pie graph.&#10;D) bar graph.

Accepted Answer

The answer of If your data are in the form...

Question 26

Graphing data, rather than presenting them in a table, is important when you:&#10;A) want to show relationships clearly.&#10;B) are choosing appropriate statistics.&#10;C) are contemplating not using descriptive statistics.&#10;D) want to show relationships clearly and when choosing appropriate statistics.

Accepted Answer

The answer of Graphing data, rather than presenting them in...

Question 27

A set of mutually exclusive categories (classes) together with a count of the number of data values falling into each category is termed a:&#10;A) sorted list.&#10;B) cumulative distribution.&#10;C) frequency distribution.&#10;D) scatter plot.

Accepted Answer

The answer of A set of mutually exclusive categories (classes)...

Question 28

_____ resemble bar graphs, with each bar representing a class, and a given bar's length indicating the frequency of scores falling within its range.&#10;A) Histograms&#10;B) Scatter plots&#10;C) Exploded pie graphs&#10;D) Stemplots

Accepted Answer

The answer of _____ resemble bar graphs, with each bar...

Question 29

In a stemplot of scores ranging from 11 to 83, a score of 42 would be located at a stem value of _____.&#10;A) 4&#10;B) 6&#10;C) 40&#10;D) 2

Accepted Answer

The answer of In a stemplot of scores ranging from...

Question 30

Stemplots have the advantage over histograms of:&#10;A) plotting the shape of a distribution.&#10;B) preserving all the actual values present in the data.&#10;C) determining the center of a distribution.&#10;D) determining the spread of a distribution.

Accepted Answer

The answer of Stemplots have the advantage over histograms of:&#10;A)...

Question 31

When examining a histogram or stemplot of your data, you should:&#10;A) locate the center of the distribution along the scale of measurement.&#10;B) note the spread of the scores.&#10;C) note the overall shape of the distribution and look for any gaps or outliers.&#10;D) All of the answers are correct.

Accepted Answer

The answer of When examining a histogram or stemplot of...

Question 32

Extreme scores that lie far from the others in a distribution are called:&#10;A) deviants.&#10;B) distant scores.&#10;C) outlaws.&#10;D) outliers.

Accepted Answer

The answer of Extreme scores that lie far from the...

Question 33

A distribution is _____ if a long tail goes off to the right, upscale.&#10;A) normal&#10;B) positively skewed&#10;C) negatively skewed&#10;D) bimodal

Accepted Answer

The answer of A distribution is _____ if a long...

Question 34

A distribution contains the following scores: 1, 1, 2, 2, 2, 3, 3, 4, 4, 5. Its mode is:&#10;A) 5.&#10;B) 3.&#10;C) 2.&#10;D) 4.

Accepted Answer

The answer of A distribution contains the following scores: 1,...

Question 35

Identify a true statement about mode.&#10;A) It is the most frequent score in a distribution.&#10;B) It is the middle score in a distribution.&#10;C) It is the most widely used measure of center.&#10;D) It is the simplest and least informative measure of spread.

Accepted Answer

The answer of Identify a true statement about mode.&#10;A) It...

Question 36

Although mode is simple to calculate, it is limited because it:&#10;A) is insensitive to extreme scores.&#10;B) is difficult to compute.&#10;C) is inappropriate for use with interval data.&#10;D) does not take into account the values of scores outside of the most frequent score.

Accepted Answer

The answer of Although mode is simple to calculate, it...

Question 37

A distribution contains the following scores: 1, 1, 2, 2, 2, 3, 3, 4, 4, 5. Its mean is:&#10;A) 2.5.&#10;B) 2.7.&#10;C) 2.0.&#10;D) 4.0.

Accepted Answer

The answer of A distribution contains the following scores: 1,...

Question 38

In a distribution with an even number of scores, the median is determined by:&#10;A) finding the most frequent score in the top half of the distribution and averaging it with the most frequent score in the bottom half of the distribution.&#10;B) finding the arithmetic average of the entire distribution and dividing it by two.&#10;C) averaging the middle pair of scores.&#10;D) finding the most frequent score.

Accepted Answer

The answer of In a distribution with an even number...

Question 39

The median is a rather insensitive measure of center because it:&#10;A) is difficult to calculate.&#10;B) does not take into account the magnitudes of the scores above and below it.&#10;C) cannot be used with interval data.&#10;D) All of the answers are correct.

Accepted Answer

The answer of The median is a rather insensitive measure...

Question 40

The _____ is the most sensitive measure of center because it takes into account all scores in a distribution when it is calculated.&#10;A) median&#10;B) mode&#10;C) interquartile range&#10;D) arithmetic average

Accepted Answer

The answer of The _____ is the most sensitive measure...

Question 41

Which of the following is the major advantage of the mean?&#10;A) Its value is directly affected by the magnitude of each score in a distribution.&#10;B) Its insusceptibility to the influence of outliers makes it highly reliable.&#10;C) It is an appropriate measure of center when data are measured on an ordinal scale.&#10;D) It is the most preferred measure whenever a distribution is strongly skewed.

Accepted Answer

The answer of Which of the following is the major...

Question 42

The mean is derived by:&#10;A) finding the most frequent score in a distribution.&#10;B) finding the middle score in an ordered distribution.&#10;C) summing the scores in a distribution and dividing the sum by the total number of scores.&#10;D) averaging the middle pair of scores in an ordered distribution.

Accepted Answer

The answer of The mean is derived by:&#10;A) finding the...

Question 43

For data measured on a nominal scale, you are limited to using the _____ as your measure of center.&#10;A) mean&#10;B) mode&#10;C) median&#10;D) All of the answers are correct.

Accepted Answer

The answer of For data measured on a nominal scale,...

Question 44

If data are scaled on an interval or ratio scale, the mean becomes a less representative measure of center when:&#10;A) there are more than 10 scores in a distribution.&#10;B) there are less than 5 scores in a distribution.&#10;C) the mean and median are equal.&#10;D) the distribution of scores is strongly skewed.

Accepted Answer

The answer of If data are scaled on an interval...

Question 45

In a positively skewed distribution, the mean:&#10;A) underestimates the center.&#10;B) overestimates the center.&#10;C) is as accurate a measure of the center as is the median.&#10;D) accurately represents central tendency.

Accepted Answer

The answer of In a positively skewed distribution, the mean:&#10;A)...

Question 46

The simplest and least informative measure of spread is the:&#10;A) standard deviation.&#10;B) variance.&#10;C) range.&#10;D) interquartile range.

Accepted Answer

The answer of The simplest and least informative measure of...

Question 47

In the context of the measures of spread, the _____ is the average squared deviation from the mean.&#10;A) standard deviation&#10;B) variance&#10;C) range&#10;D) interquartile range

Accepted Answer

The answer of In the context of the measures of...

Question 48

The most popular measure of spread is the:&#10;A) standard deviation.&#10;B) variance.&#10;C) interquartile range.&#10;D) range.

Accepted Answer

The answer of The most popular measure of spread is...

Question 49

Which of the following measures of spread is easy to calculate and is resistant to the effects of skew and outliers?&#10;A) The standard deviation&#10;B) The variance&#10;C) The interquartile range&#10;D) The range

Accepted Answer

The answer of Which of the following measures of spread...

Question 50

Included in the five-number summary are the:&#10;A) minimum, the first quartile, the median, the third quartile, and the maximum.&#10;B) mean, the median, the mode, the standard deviation, and the interquartile range.&#10;C) minimum, the interquartile range, the standard deviation, the range, and the maximum.&#10;D) mean, the median, the interquartile range, the standard deviation, and the range.

Accepted Answer

The answer of Included in the five-number summary are the:&#10;A)...

Question 51

You can display the five-number summary graphically as a:&#10;A) histogram.&#10;B) bar graph.&#10;C) boxplot.&#10;D) scatter plot.

Accepted Answer

The answer of You can display the five-number summary graphically...

Question 52

The _____ of a correlation coefficient tells you the direction of a relationship, whereas the _____ tells you the degree of linear relationship between two variables.&#10;A) magnitude; sign&#10;B) value; magnitude&#10;C) sign; magnitude&#10;D) None of the answers is correct.

Accepted Answer

The answer of The _____ of a correlation coefficient tells...

Question 53

The presence of outliers can affect the _____ of the Pearson r.&#10;A) sign&#10;B) magnitude&#10;C) magnitude, sign, or both&#10;D) None of the answers is correct.

Accepted Answer

The answer of The presence of outliers can affect the...

Question 54

Which of the following is the most widely used measure of association and is appropriate when the dependent measures are scaled on an interval or a ratio scale?&#10;A) The point-biserial correlation&#10;B) The phi coefficient&#10;C) The Spearman rank-order correlation&#10;D) The Pearson r

Accepted Answer

The answer of Which of the following is the most...

Question 55

The measure of correlation to use when one variable is measured on an interval scale and the other is measured on a nominal scale is the:&#10;A) Spearman rank-order correlation.&#10;B) point-biserial correlation.&#10;C) phi coefficient.&#10;D) part correlation.

Accepted Answer

The answer of The measure of correlation to use when...

Question 56

The Spearman rank-order correlation is used when:&#10;A) dependent variables are scaled on a ratio scale.&#10;B) one variable being measured is on an interval scale and the other being measured is on a nominal scale.&#10;C) both of the variables being correlated are measured on a dichotomous scale.&#10;D) one wants to determine whether the relationship between variables is monotonic.

Accepted Answer

The answer of The Spearman rank-order correlation is used when:&#10;A)...

Question 57

With _____, you can estimate values of a variable based on knowledge of the values of others.&#10;A) linear regression&#10;B) the Pearson r&#10;C) the phi coefficient&#10;D) the coefficient of determination

Accepted Answer

The answer of With _____, you can estimate values of...

Question 58

The best-fitting straight line on a scatter plot that minimizes the sum of the squared deviations of each data point from the line is called the:&#10;A) standard error line.&#10;B) optimal line.&#10;C) least-squares regression line.&#10;D) None of the answers is correct.

Accepted Answer

The answer of The best-fitting straight line on a scatter...

Question 59

In the formula that describes the regression line mathematically, b is:&#10;A) a constant.&#10;B) the regression weight.&#10;C) a predicted score.&#10;D) the same as the Pearson r.

Accepted Answer

The answer of In the formula that describes the regression...

Question 60

The difference between Y and $\hat{Y}$ is a:&#10;A) residual.&#10;B) remainder.&#10;C) regression deviation.&#10;D) difference.

Accepted Answer

The answer of The difference between Y and $\hat{Y}$ is...

Question 61

The _____ provides an estimate of the amount of error in prediction.&#10;A) standard deviation&#10;B) variance&#10;C) regression weight&#10;D) standard error of estimate

Accepted Answer

The answer of The _____ provides an estimate of the...

Question 62

The standard error of estimate increases as:&#10;A) beta decreases.&#10;B) the strength of the relationship between X and Y increases.&#10;C) the strength of the relationship between X and Y decreases.&#10;D) the constant in the regression equation increases.

Accepted Answer

The answer of The standard error of estimate increases as:&#10;A)...

Question 63

By squaring the correlation coefficient, you can:&#10;A) determine how many deviant scores are there in your distributions.&#10;B) obtain the strength of the causal relationship between variables.&#10;C) obtain an index of the amount of variation in one variable that can be accounted for by variation in the other.&#10;D) determine how much unit change in X you can expect with a unit change in Y.

Accepted Answer

The answer of By squaring the correlation coefficient, you can:&#10;A)...

Question 64

A high value of the coefficient of nondetermination means that:&#10;A) there is a causal relationship between variables.&#10;B) performing linear regression is unnecessary.&#10;C) you can expect a large change in X as a function of Y.&#10;D) None of the answers is correct.

Accepted Answer

The answer of A high value of the coefficient of...

Deck 13: Describing Data