Deck 10: Regression Analysis: Estimating Relationships
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Unlock Deck
Sign up to unlock the cards in this deck!
Unlock Deck
Unlock Deck
1/92
Play
Full screen (f)
Deck 10: Regression Analysis: Estimating Relationships
1
The correlation value ranges from:
A) 0 to +1
B) -1 to +1
C) -2 to +2
D) -¥ to+ ¥
A) 0 to +1
B) -1 to +1
C) -2 to +2
D) -¥ to+ ¥
-1 to +1
2
The weakness of scatterplots is that they:
A) do not help identify linear relationships
B) can be misleading about the types of relationships they indicate
C) only help identify outliers
D) do not actually quantify the relationships between variables
A) do not help identify linear relationships
B) can be misleading about the types of relationships they indicate
C) only help identify outliers
D) do not actually quantify the relationships between variables
do not actually quantify the relationships between variables
3
Correlation is a summary measure that indicates:
A) a curved relationship among the variables
B) the rate of change in Y for a one unit change in X
C) the strength of the linear relationship between pairs of variables
D) the magnitude of difference between two variables
A) a curved relationship among the variables
B) the rate of change in Y for a one unit change in X
C) the strength of the linear relationship between pairs of variables
D) the magnitude of difference between two variables
the strength of the linear relationship between pairs of variables
4
A correlation value of zero indicates.
A) a strong linear relationship
B) a weak linear relationship
C) no linear relationship
D) a perfect linear relationship
A) a strong linear relationship
B) a weak linear relationship
C) no linear relationship
D) a perfect linear relationship
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
5
The covariance is not used as much as the correlation because:
A) it is not always a valid predictor of linear relationships
B) it is difficult to calculate
C) it is difficult to interpret
D) of all of these options
A) it is not always a valid predictor of linear relationships
B) it is difficult to calculate
C) it is difficult to interpret
D) of all of these options
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
6
In linear regression, we fit the least squares line to a set of values (or points on a scatterplot). The distance from the line to a point is called the:
A) fitted value
B) residual
C) correlation
D) covariance
E) estimated value
A) fitted value
B) residual
C) correlation
D) covariance
E) estimated value
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
7
A scatterplot that appears as a shapeless mass of data points indicates:
A) a curved relationship among the variables
B) a linear relationship among the variables
C) a nonlinear relationship among the variables
D) no relationship among the variables
A) a curved relationship among the variables
B) a linear relationship among the variables
C) a nonlinear relationship among the variables
D) no relationship among the variables
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
8
In regression analysis, the variable we are trying to explain or predict is called the:
A) independent variable
B) dependent variable
C) regression variable
D) statistical variable
E) residual variable
A) independent variable
B) dependent variable
C) regression variable
D) statistical variable
E) residual variable
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
9
A "fan" shape in a scatterplot indicates:
A) unequal variance
B) a nonlinear relationship
C) the absence of outliers
D) sampling error
A) unequal variance
B) a nonlinear relationship
C) the absence of outliers
D) sampling error
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
10
In linear regression, the fitted value is:
A) the predicted value of the dependent variable
B) the predicted value of the independent value
C) the predicted value of the slope
D) the predicted value of the intercept
E) none of these choices
A) the predicted value of the dependent variable
B) the predicted value of the independent value
C) the predicted value of the slope
D) the predicted value of the intercept
E) none of these choices
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
11
Regression analysis asks:
A) if there are differences between distinct populations
B) if the sample is representative of the population
C) how a single variable depends on other relevant variables
D) how several variables depend on each other
A) if there are differences between distinct populations
B) if the sample is representative of the population
C) how a single variable depends on other relevant variables
D) how several variables depend on each other
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
12
In regression analysis, if there are several explanatory variables, it is called:
A) simple regression
B) multiple regression
C) compound regression
D) composite regression
A) simple regression
B) multiple regression
C) compound regression
D) composite regression
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
13
Data collected from approximately the same period of time from a cross-section of a population are called:
A) time series data
B) linear data
C) cross-sectional data
D) historical data
A) time series data
B) linear data
C) cross-sectional data
D) historical data
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
14
A single variable X can explain a large percentage of the variation in some other variable Y when the two variables are:
A) mutually exclusive
B) inversely related
C) directly related
D) highly correlated
E) none of these choices
A) mutually exclusive
B) inversely related
C) directly related
D) highly correlated
E) none of these choices
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
15
_____ is/are especially helpful in identifying outliers.
A) Linear regression
B) Regression analysis
C) Normal curves
D) Scatterplots
E) Multiple regression
A) Linear regression
B) Regression analysis
C) Normal curves
D) Scatterplots
E) Multiple regression
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
16
In choosing the "best-fitting" line through a set of points in linear regression, we choose the one with the:
A) smallest sum of squared residuals
B) largest sum of squared residuals
C) smallest number of outliers
D) largest number of points on the line
A) smallest sum of squared residuals
B) largest sum of squared residuals
C) smallest number of outliers
D) largest number of points on the line
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
17
Outliers are observations that:
A) lie outside the sample
B) render the study useless
C) lie outside the typical pattern of points on a scatterplot
D) disrupt the entire linear trend
A) lie outside the sample
B) render the study useless
C) lie outside the typical pattern of points on a scatterplot
D) disrupt the entire linear trend
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
18
In regression analysis, the variables used to help explain or predict the response variable are called the:
A) independent variables
B) dependent variables
C) regression variables
D) statistical variables
A) independent variables
B) dependent variables
C) regression variables
D) statistical variables
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
19
The term autocorrelation refers to:
A) the analyzed data refers to itself
B) the sample is related too closely to the population
C) the data are in a loop (values repeat themselves)
D) time series variables are usually related to their own past values
A) the analyzed data refers to itself
B) the sample is related too closely to the population
C) the data are in a loop (values repeat themselves)
D) time series variables are usually related to their own past values
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
20
In regression analysis, which of the following causal relationships are possible?
A) X causes Y to vary.
B) Y causes X to vary.
C) Other variables cause both X and Y to vary.
D) All of these options are possible.
A) X causes Y to vary.
B) Y causes X to vary.
C) Other variables cause both X and Y to vary.
D) All of these options are possible.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
21
Cross-sectional data are usually data gathered from approximately the same period of time from a cross-sectional of a population.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
22
In multiple regression, the coefficients reflect the expected change in:
A) Y when the associated X value increases by one unit
B) X when the associated Y value increases by one unit
C) Y when the associated X value decreases by one unit
D) X when the associated Y value decreases by one unit
A) Y when the associated X value increases by one unit
B) X when the associated Y value increases by one unit
C) Y when the associated X value decreases by one unit
D) X when the associated Y value decreases by one unit
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
23
Regression analysis can be applied equally well to cross-sectional and time series data.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
24
To help explain or predict the response variable in every regression study, we use one or more explanatory variables. These variables are also called response variables or independent variables.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
25
In linear regression, a dummy variable is used:
A) to represent residual variables
B) to represent missing data in each sample
C) to include hypothetical data in the regression equation
D) to include categorical variables in the regression equation
E) when "dumb" responses are included in the data
A) to represent residual variables
B) to represent missing data in each sample
C) to include hypothetical data in the regression equation
D) to include categorical variables in the regression equation
E) when "dumb" responses are included in the data
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
26
An important condition when interpreting the coefficient for a particular independent variable X in a multiple regression equation is that:
A) the dependent variable will remain constant
B) the dependent variable will be allowed to vary
C) all of the other independent variables remain constant
D) all of the other independent variables be allowed to vary
A) the dependent variable will remain constant
B) the dependent variable will be allowed to vary
C) all of the other independent variables remain constant
D) all of the other independent variables be allowed to vary
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
27
Which of the following is an example of a nonlinear regression model?
A) a quadratic regression equation
B) a logarithmic regression equation
C) constant elasticity equation
D) the learning curve model
E) all of these choices
A) a quadratic regression equation
B) a logarithmic regression equation
C) constant elasticity equation
D) the learning curve model
E) all of these choices
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
28
The percentage of variation (
) can be interpreted as the fraction (or percent) of variation of the
A) explanatory variable explained by the independent variable
B) explanatory variable explained by the regression line
C) response variable explained by the regression line
D) error explained by the regression line

A) explanatory variable explained by the independent variable
B) explanatory variable explained by the regression line
C) response variable explained by the regression line
D) error explained by the regression line
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
29
The standard error of the estimate (
) is essentially the
A) mean of the residuals
B) standard deviation of the residuals
C) mean of the explanatory variable
D) standard deviation of the explanatory variable

A) mean of the residuals
B) standard deviation of the residuals
C) mean of the explanatory variable
D) standard deviation of the explanatory variable
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
30
In every regression study there is a single variable that we are trying to explain or predict. This is called the response variable or dependent variable.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
31
In linear regression, we can have an interaction variable. Algebraically, the interaction variable is the other variables in the regression equation.
A) sum
B) ratio
C) product
D) mean
A) sum
B) ratio
C) product
D) mean
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
32
The two primary objectives of regression analysis are to study relationships between variables and to use those relationships to make predictions.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
33
The percentage of variation (R2) ranges from:
A) 0 to +1
B) -1 to +1
C) -2 to +2
D) -1 to 0
A) 0 to +1
B) -1 to +1
C) -2 to +2
D) -1 to 0
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
34
The adjusted R2 adjusts R2 for:
A) non-linearity
B) outliers
C) low correlation
D) the number of explanatory variables in a multiple regression model
A) non-linearity
B) outliers
C) low correlation
D) the number of explanatory variables in a multiple regression model
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
35
Approximately what percentage of the observed Y values are within one standard error of the estimate (
) of the corresponding fitted Y values?
A) 67%
B) 95%
C) 99%
D) It is not possible to determine this.

A) 67%
B) 95%
C) 99%
D) It is not possible to determine this.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
36
The multiple standard error of estimate will be:
A) 0.901
B) 0.888
C) 0.800
D) 0.953
E) 0.894
A) 0.901
B) 0.888
C) 0.800
D) 0.953
E) 0.894
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
37
In a simple linear regression analysis, the following sums of squares are produced:
The proportion of the variation in Y that is explained by the variation in X is:
A) 20%
B) 80%
C) 25%
D) 50%
E) none of these choices

A) 20%
B) 80%
C) 25%
D) 50%
E) none of these choices
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
38
Given the least squares regression line,
, which statement is true?
A) The relationship between X and Y is positive.
B) The relationship between X and Y is negative.
C) As X increases, so does Y.
D) As X decreases, so does Y.
E) There is no relationship between X and Y.

A) The relationship between X and Y is positive.
B) The relationship between X and Y is negative.
C) As X increases, so does Y.
D) As X decreases, so does Y.
E) There is no relationship between X and Y.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
39
The regression line
has been fitted to the data points (28, 60), (20, 50), (10, 18), and (25, 55). The sum of the squared residuals will be:
A) 20.25
B) 16.00
C) 49.00
D) 94.25

A) 20.25
B) 16.00
C) 49.00
D) 94.25
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
40
In multiple regression, the constant
:
A) is the expected value of the dependent variable Y when all of the independent variables have the value zero
B) is necessary to fit the multiple regression line to set of points
C) must be adjusted for the number of independent variables
D) are all of these options

A) is the expected value of the dependent variable Y when all of the independent variables have the value zero
B) is necessary to fit the multiple regression line to set of points
C) must be adjusted for the number of independent variables
D) are all of these options
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
41
A regression analysis between sales (in $1000) and advertising (in $100) resulted in the following least squares line:
= 84 +7X. This implies that if there is no advertising, then the predicted amount of sales (in dollars) is $84,000.

Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
42
In regression analysis, we can often use the standard error of estimate
to judge which of several potential regression equations is the most useful.

Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
43
When the scatterplot appears as a shapeless swarm of points, this can indicate that there is no relationship between the response variable Y and the explanatory variable X, or at least none worth pursuing.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
44
Correlation is measured on a scale from 0 to 1, where 0 indicates no linear relationship between two variables, and 1 indicates a perfect linear relationship.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
45
The least squares line is the line that minimizes the sum of the residuals.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
46
In a simple linear regression problem, if the percentage of variation explained
is 0.95, this means that 95% of the variation in the explanatory variable X can be explained by regression.

Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
47
The residual is defined as the difference between the actual and predicted, or fitted values of the response variable.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
48
Correlation is used to determine the strength of the linear relationship between an explanatory variable X and response variable Y.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
49
An outlier is an observation that falls outside of the general pattern of the rest of the observations on a scatterplot.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
50
A negative relationship between an explanatory variable X and a response variable Y means that as X increases, Y decreases, and vice versa.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
51
Scatterplots are used for identifying outliers and quantifying relationships between variables.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
52
In simple linear regression, the divisor of the standard error of estimate
is n - 1, simply because there is only one explanatory variable of interest.

Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
53
A regression analysis between sales (in $1000) and advertising (in $) resulted in the following least squares line:
= 32 + 8X. This implies that an increase of $1 in advertising is expected to result in an increase of $40 in sales.

Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
54
In a simple regression analysis, if the standard error of estimate
= 15 and the number of observations n = 10, then the sum of the residuals squared must be 120.

Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
55
The multiple R for a regression is the correlation between the observed Y values and the fitted Y values.
.
.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
56
A regression analysis between weight (Y in pounds) and height (X in inches) resulted in the following least squares line:
= 140 + 5X. This implies that if the height is increased by 1 inch, the weight is expected to increase on average by 5 pounds.

Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
57
A regression analysis between sales (in $1000) and advertising (in $100) resulted in the following least squares line:
= 84 +7X. This implies that if advertising is $800, then the predicted amount of sales (in dollars) is $140,000.

Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
58
A useful graph in almost any regression analysis is a scatterplot of residuals (on the vertical axis) versus fitted values (on the horizontal axis), where a "good" fit not only has small residuals, but it has residuals scattered randomly around zero with no apparent pattern.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
59
In reference to the equation,
, the value 0.10 is the expected change in Y per unit change in
.


Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
60
The regression line
= 3 + 2X has been fitted to the data points (4, 14), (2, 7), and (1, 4). The sum of the residuals squared will be 8.0.

Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
61
The effect of a logarithmic transformation on a variable that is skewed to the right by a few large values is to "squeeze" the values together and make the distribution more symmetric.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
62
The R2 can only increase when extra explanatory variables are added to a multiple regression model.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
63
We should include an interaction variable in a regression model if we believe that the effect of one explanatory variable
on the response variable Y depends on the value of another explanatory variable
.


Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
64
The primary purpose of a nonlinear transformation is to "straighten out" the data on a scatterplot.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
65
In a multiple regression analysis with three explanatory variables, suppose that there are 60 observations and the sum of the residuals squared is 28. The standard error of estimate must be 0.7071.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
66
The adjusted R2 is adjusted for the number of explanatory variables in a regression equation, and it has he same interpretation as the standard R2.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
67
The adjusted R2 is used primarily to monitor whether extra explanatory variables really belong in a multiple regression model.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
68
If a categorical variable is to be included in a multiple regression, a dummy variable for each category of the variable should be used, but the original categorical variables should not be sued.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
69
An interaction variable is the product of an explanatory variable and the dependent variable.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
70
If the regression equation includes anything other than a constant plus the sum of products of constants and variables, the model will not be linear.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
71
In a multiple regression problem with two explanatory variables if, the fitted regression equation is
, then the estimated value of Y when
and
is 49.4.



Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
72
If a scatterplot of residuals shows a parabola shape, then a logarithmic transformation may be useful in obtaining a better fit.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
73
In a simple linear regression problem, suppose that
= 12.48 and
= 124.8. Then the percentage of variation explained
must be 0.90.



Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
74
In a nonlinear transformation of data, the Y variable or the X variables may be transformed, but not both.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
75
For the multiple regression model
, if
were to increase by 5 units, holding
and
constant, the value of Y would be expected to decrease by 50 units.




Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
76
The percentage of variation explained,
, is the square of the correlation between the observed Y values and the fitted Y values.

Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
77
The coefficients for logarithmically transformed explanatory variables should be interpreted as the percent change in the dependent variable for a 1% percent change in the explanatory variable.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
78
In the multiple regression model
we interpret X1 as follows: holding X2 constant, if X1 increases by 1 unit, then the expected value of Y will increase by 9 units.

Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
79
A logarithmic transformation of the response variable Y is often useful when the distribution of Y is symmetric.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
80
In a simple regression with a single explanatory variable, the multiple R is the same as the standard correlation between the Y variable and the explanatory X variable.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck