Deck 10: Regression Analysis: Estimating Relationships

Full screen (f)
exit full mode
Question
The correlation value ranges from:

A) 0 to +1
B) -1 to +1
C) -2 to +2
D) -¥ to+ ¥
Use Space or
up arrow
down arrow
to flip the card.
Question
The weakness of scatterplots is that they:

A) do not help identify linear relationships
B) can be misleading about the types of relationships they indicate
C) only help identify outliers
D) do not actually quantify the relationships between variables
Question
Correlation is a summary measure that indicates:

A) a curved relationship among the variables
B) the rate of change in Y for a one unit change in X
C) the strength of the linear relationship between pairs of variables
D) the magnitude of difference between two variables
Question
A correlation value of zero indicates.

A) a strong linear relationship
B) a weak linear relationship
C) no linear relationship
D) a perfect linear relationship
Question
The covariance is not used as much as the correlation because:

A) it is not always a valid predictor of linear relationships
B) it is difficult to calculate
C) it is difficult to interpret
D) of all of these options
Question
In linear regression, we fit the least squares line to a set of values (or points on a scatterplot). The distance from the line to a point is called the:

A) fitted value
B) residual
C) correlation
D) covariance
E) estimated value
Question
A scatterplot that appears as a shapeless mass of data points indicates:

A) a curved relationship among the variables
B) a linear relationship among the variables
C) a nonlinear relationship among the variables
D) no relationship among the variables
Question
In regression analysis, the variable we are trying to explain or predict is called the:

A) independent variable
B) dependent variable
C) regression variable
D) statistical variable
E) residual variable
Question
A "fan" shape in a scatterplot indicates:

A) unequal variance
B) a nonlinear relationship
C) the absence of outliers
D) sampling error
Question
In linear regression, the fitted value is:

A) the predicted value of the dependent variable
B) the predicted value of the independent value
C) the predicted value of the slope
D) the predicted value of the intercept
E) none of these choices
Question
Regression analysis asks:

A) if there are differences between distinct populations
B) if the sample is representative of the population
C) how a single variable depends on other relevant variables
D) how several variables depend on each other
Question
In regression analysis, if there are several explanatory variables, it is called:

A) simple regression
B) multiple regression
C) compound regression
D) composite regression
Question
Data collected from approximately the same period of time from a cross-section of a population are called:

A) time series data
B) linear data
C) cross-sectional data
D) historical data
Question
A single variable X can explain a large percentage of the variation in some other variable Y when the two variables are:

A) mutually exclusive
B) inversely related
C) directly related
D) highly correlated
E) none of these choices
Question
_____ is/are especially helpful in identifying outliers.

A) Linear regression
B) Regression analysis
C) Normal curves
D) Scatterplots
E) Multiple regression
Question
In choosing the "best-fitting" line through a set of points in linear regression, we choose the one with the:

A) smallest sum of squared residuals
B) largest sum of squared residuals
C) smallest number of outliers
D) largest number of points on the line
Question
Outliers are observations that:

A) lie outside the sample
B) render the study useless
C) lie outside the typical pattern of points on a scatterplot
D) disrupt the entire linear trend
Question
In regression analysis, the variables used to help explain or predict the response variable are called the:

A) independent variables
B) dependent variables
C) regression variables
D) statistical variables
Question
The term autocorrelation refers to:

A) the analyzed data refers to itself
B) the sample is related too closely to the population
C) the data are in a loop (values repeat themselves)
D) time series variables are usually related to their own past values
Question
In regression analysis, which of the following causal relationships are possible?

A) X causes Y to vary.
B) Y causes X to vary.
C) Other variables cause both X and Y to vary.
D) All of these options are possible.
Question
Cross-sectional data are usually data gathered from approximately the same period of time from a cross-sectional of a population.
Question
In multiple regression, the coefficients reflect the expected change in:

A) Y when the associated X value increases by one unit
B) X when the associated Y value increases by one unit
C) Y when the associated X value decreases by one unit
D) X when the associated Y value decreases by one unit
Question
Regression analysis can be applied equally well to cross-sectional and time series data.
Question
To help explain or predict the response variable in every regression study, we use one or more explanatory variables. These variables are also called response variables or independent variables.
Question
In linear regression, a dummy variable is used:

A) to represent residual variables
B) to represent missing data in each sample
C) to include hypothetical data in the regression equation
D) to include categorical variables in the regression equation
E) when "dumb" responses are included in the data
Question
An important condition when interpreting the coefficient for a particular independent variable X in a multiple regression equation is that:

A) the dependent variable will remain constant
B) the dependent variable will be allowed to vary
C) all of the other independent variables remain constant
D) all of the other independent variables be allowed to vary
Question
Which of the following is an example of a nonlinear regression model?

A) a quadratic regression equation
B) a logarithmic regression equation
C) constant elasticity equation
D) the learning curve model
E) all of these choices
Question
The percentage of variation ( <strong>The percentage of variation (   ) can be interpreted as the fraction (or percent) of variation of the</strong> A) explanatory variable explained by the independent variable B) explanatory variable explained by the regression line C) response variable explained by the regression line D) error explained by the regression line <div style=padding-top: 35px> ) can be interpreted as the fraction (or percent) of variation of the

A) explanatory variable explained by the independent variable
B) explanatory variable explained by the regression line
C) response variable explained by the regression line
D) error explained by the regression line
Question
The standard error of the estimate ( <strong>The standard error of the estimate (   ) is essentially the</strong> A) mean of the residuals B) standard deviation of the residuals C) mean of the explanatory variable D) standard deviation of the explanatory variable <div style=padding-top: 35px> ) is essentially the

A) mean of the residuals
B) standard deviation of the residuals
C) mean of the explanatory variable
D) standard deviation of the explanatory variable
Question
In every regression study there is a single variable that we are trying to explain or predict. This is called the response variable or dependent variable.
Question
In linear regression, we can have an interaction variable. Algebraically, the interaction variable is the other variables in the regression equation.

A) sum
B) ratio
C) product
D) mean
Question
The two primary objectives of regression analysis are to study relationships between variables and to use those relationships to make predictions.
Question
The percentage of variation (R2) ranges from:

A) 0 to +1
B) -1 to +1
C) -2 to +2
D) -1 to 0
Question
The adjusted R2 adjusts R2 for:

A) non-linearity
B) outliers
C) low correlation
D) the number of explanatory variables in a multiple regression model
Question
Approximately what percentage of the observed Y values are within one standard error of the estimate ( <strong>Approximately what percentage of the observed Y values are within one standard error of the estimate (   ) of the corresponding fitted Y values?</strong> A) 67% B) 95% C) 99% D) It is not possible to determine this. <div style=padding-top: 35px> ) of the corresponding fitted Y values?

A) 67%
B) 95%
C) 99%
D) It is not possible to determine this.
Question
The multiple standard error of estimate will be:

A) 0.901
B) 0.888
C) 0.800
D) 0.953
E) 0.894
Question
In a simple linear regression analysis, the following sums of squares are produced: <strong>In a simple linear regression analysis, the following sums of squares are produced:   The proportion of the variation in Y that is explained by the variation in X is:</strong> A) 20% B) 80% C) 25% D) 50% E) none of these choices <div style=padding-top: 35px> The proportion of the variation in Y that is explained by the variation in X is:

A) 20%
B) 80%
C) 25%
D) 50%
E) none of these choices
Question
Given the least squares regression line, <strong>Given the least squares regression line,   , which statement is true?</strong> A) The relationship between X and Y is positive. B) The relationship between X and Y is negative. C) As X increases, so does Y. D) As X decreases, so does Y. E) There is no relationship between X and Y. <div style=padding-top: 35px> , which statement is true?

A) The relationship between X and Y is positive.
B) The relationship between X and Y is negative.
C) As X increases, so does Y.
D) As X decreases, so does Y.
E) There is no relationship between X and Y.
Question
The regression line <strong>The regression line   has been fitted to the data points (28, 60), (20, 50), (10, 18), and (25, 55). The sum of the squared residuals will be:</strong> A) 20.25 B) 16.00 C) 49.00 D) 94.25 <div style=padding-top: 35px> has been fitted to the data points (28, 60), (20, 50), (10, 18), and (25, 55). The sum of the squared residuals will be:

A) 20.25
B) 16.00
C) 49.00
D) 94.25
Question
In multiple regression, the constant <strong>In multiple regression, the constant   :</strong> A) is the expected value of the dependent variable Y when all of the independent variables have the value zero B) is necessary to fit the multiple regression line to set of points C) must be adjusted for the number of independent variables D) are all of these options <div style=padding-top: 35px> :

A) is the expected value of the dependent variable Y when all of the independent variables have the value zero
B) is necessary to fit the multiple regression line to set of points
C) must be adjusted for the number of independent variables
D) are all of these options
Question
A regression analysis between sales (in $1000) and advertising (in $100) resulted in the following least squares line: A regression analysis between sales (in $1000) and advertising (in $100) resulted in the following least squares line:   = 84 +7X. This implies that if there is no advertising, then the predicted amount of sales (in dollars) is $84,000.<div style=padding-top: 35px> = 84 +7X. This implies that if there is no advertising, then the predicted amount of sales (in dollars) is $84,000.
Question
In regression analysis, we can often use the standard error of estimate In regression analysis, we can often use the standard error of estimate   to judge which of several potential regression equations is the most useful.<div style=padding-top: 35px> to judge which of several potential regression equations is the most useful.
Question
When the scatterplot appears as a shapeless swarm of points, this can indicate that there is no relationship between the response variable Y and the explanatory variable X, or at least none worth pursuing.
Question
Correlation is measured on a scale from 0 to 1, where 0 indicates no linear relationship between two variables, and 1 indicates a perfect linear relationship.
Question
The least squares line is the line that minimizes the sum of the residuals.
Question
In a simple linear regression problem, if the percentage of variation explained In a simple linear regression problem, if the percentage of variation explained   is 0.95, this means that 95% of the variation in the explanatory variable X can be explained by regression.<div style=padding-top: 35px> is 0.95, this means that 95% of the variation in the explanatory variable X can be explained by regression.
Question
The residual is defined as the difference between the actual and predicted, or fitted values of the response variable.
Question
Correlation is used to determine the strength of the linear relationship between an explanatory variable X and response variable Y.
Question
An outlier is an observation that falls outside of the general pattern of the rest of the observations on a scatterplot.
Question
A negative relationship between an explanatory variable X and a response variable Y means that as X increases, Y decreases, and vice versa.
Question
Scatterplots are used for identifying outliers and quantifying relationships between variables.
Question
In simple linear regression, the divisor of the standard error of estimate In simple linear regression, the divisor of the standard error of estimate   is n - 1, simply because there is only one explanatory variable of interest.<div style=padding-top: 35px> is n - 1, simply because there is only one explanatory variable of interest.
Question
A regression analysis between sales (in $1000) and advertising (in $) resulted in the following least squares line: A regression analysis between sales (in $1000) and advertising (in $) resulted in the following least squares line:   = 32 + 8X. This implies that an increase of $1 in advertising is expected to result in an increase of $40 in sales.<div style=padding-top: 35px> = 32 + 8X. This implies that an increase of $1 in advertising is expected to result in an increase of $40 in sales.
Question
In a simple regression analysis, if the standard error of estimate In a simple regression analysis, if the standard error of estimate   = 15 and the number of observations n = 10, then the sum of the residuals squared must be 120.<div style=padding-top: 35px> = 15 and the number of observations n = 10, then the sum of the residuals squared must be 120.
Question
The multiple R for a regression is the correlation between the observed Y values and the fitted Y values.
.
Question
A regression analysis between weight (Y in pounds) and height (X in inches) resulted in the following least squares line: A regression analysis between weight (Y in pounds) and height (X in inches) resulted in the following least squares line:   = 140 + 5X. This implies that if the height is increased by 1 inch, the weight is expected to increase on average by 5 pounds.<div style=padding-top: 35px> = 140 + 5X. This implies that if the height is increased by 1 inch, the weight is expected to increase on average by 5 pounds.
Question
A regression analysis between sales (in $1000) and advertising (in $100) resulted in the following least squares line: A regression analysis between sales (in $1000) and advertising (in $100) resulted in the following least squares line:   = 84 +7X. This implies that if advertising is $800, then the predicted amount of sales (in dollars) is $140,000.<div style=padding-top: 35px> = 84 +7X. This implies that if advertising is $800, then the predicted amount of sales (in dollars) is $140,000.
Question
A useful graph in almost any regression analysis is a scatterplot of residuals (on the vertical axis) versus fitted values (on the horizontal axis), where a "good" fit not only has small residuals, but it has residuals scattered randomly around zero with no apparent pattern.
Question
In reference to the equation, In reference to the equation,   , the value 0.10 is the expected change in Y per unit change in   .<div style=padding-top: 35px> , the value 0.10 is the expected change in Y per unit change in In reference to the equation,   , the value 0.10 is the expected change in Y per unit change in   .<div style=padding-top: 35px> .
Question
The regression line The regression line   = 3 + 2X has been fitted to the data points (4, 14), (2, 7), and (1, 4). The sum of the residuals squared will be 8.0.<div style=padding-top: 35px> = 3 + 2X has been fitted to the data points (4, 14), (2, 7), and (1, 4). The sum of the residuals squared will be 8.0.
Question
The effect of a logarithmic transformation on a variable that is skewed to the right by a few large values is to "squeeze" the values together and make the distribution more symmetric.
Question
The R2 can only increase when extra explanatory variables are added to a multiple regression model.
Question
We should include an interaction variable in a regression model if we believe that the effect of one explanatory variable We should include an interaction variable in a regression model if we believe that the effect of one explanatory variable   on the response variable Y depends on the value of another explanatory variable   .<div style=padding-top: 35px> on the response variable Y depends on the value of another explanatory variable We should include an interaction variable in a regression model if we believe that the effect of one explanatory variable   on the response variable Y depends on the value of another explanatory variable   .<div style=padding-top: 35px> .
Question
The primary purpose of a nonlinear transformation is to "straighten out" the data on a scatterplot.
Question
In a multiple regression analysis with three explanatory variables, suppose that there are 60 observations and the sum of the residuals squared is 28. The standard error of estimate must be 0.7071.
Question
The adjusted R2 is adjusted for the number of explanatory variables in a regression equation, and it has he same interpretation as the standard R2.
Question
The adjusted R2 is used primarily to monitor whether extra explanatory variables really belong in a multiple regression model.
Question
If a categorical variable is to be included in a multiple regression, a dummy variable for each category of the variable should be used, but the original categorical variables should not be sued.
Question
An interaction variable is the product of an explanatory variable and the dependent variable.
Question
If the regression equation includes anything other than a constant plus the sum of products of constants and variables, the model will not be linear.
Question
In a multiple regression problem with two explanatory variables if, the fitted regression equation is In a multiple regression problem with two explanatory variables if, the fitted regression equation is   , then the estimated value of Y when   and   is 49.4.<div style=padding-top: 35px> , then the estimated value of Y when In a multiple regression problem with two explanatory variables if, the fitted regression equation is   , then the estimated value of Y when   and   is 49.4.<div style=padding-top: 35px> and In a multiple regression problem with two explanatory variables if, the fitted regression equation is   , then the estimated value of Y when   and   is 49.4.<div style=padding-top: 35px> is 49.4.
Question
If a scatterplot of residuals shows a parabola shape, then a logarithmic transformation may be useful in obtaining a better fit.
Question
In a simple linear regression problem, suppose that In a simple linear regression problem, suppose that   = 12.48 and   = 124.8. Then the percentage of variation explained   must be 0.90.<div style=padding-top: 35px> = 12.48 and In a simple linear regression problem, suppose that   = 12.48 and   = 124.8. Then the percentage of variation explained   must be 0.90.<div style=padding-top: 35px> = 124.8. Then the percentage of variation explained In a simple linear regression problem, suppose that   = 12.48 and   = 124.8. Then the percentage of variation explained   must be 0.90.<div style=padding-top: 35px> must be 0.90.
Question
In a nonlinear transformation of data, the Y variable or the X variables may be transformed, but not both.
Question
For the multiple regression model For the multiple regression model   , if   were to increase by 5 units, holding   and   constant, the value of Y would be expected to decrease by 50 units.<div style=padding-top: 35px> , if For the multiple regression model   , if   were to increase by 5 units, holding   and   constant, the value of Y would be expected to decrease by 50 units.<div style=padding-top: 35px> were to increase by 5 units, holding For the multiple regression model   , if   were to increase by 5 units, holding   and   constant, the value of Y would be expected to decrease by 50 units.<div style=padding-top: 35px> and For the multiple regression model   , if   were to increase by 5 units, holding   and   constant, the value of Y would be expected to decrease by 50 units.<div style=padding-top: 35px> constant, the value of Y would be expected to decrease by 50 units.
Question
The percentage of variation explained, The percentage of variation explained,   , is the square of the correlation between the observed Y values and the fitted Y values.<div style=padding-top: 35px> , is the square of the correlation between the observed Y values and the fitted Y values.
Question
The coefficients for logarithmically transformed explanatory variables should be interpreted as the percent change in the dependent variable for a 1% percent change in the explanatory variable.
Question
In the multiple regression model In the multiple regression model   we interpret X<sub>1</sub> as follows: holding X<sub>2</sub> constant, if X<sub>1</sub> increases by 1 unit, then the expected value of Y will increase by 9 units.<div style=padding-top: 35px> we interpret X1 as follows: holding X2 constant, if X1 increases by 1 unit, then the expected value of Y will increase by 9 units.
Question
A logarithmic transformation of the response variable Y is often useful when the distribution of Y is symmetric.
Question
In a simple regression with a single explanatory variable, the multiple R is the same as the standard correlation between the Y variable and the explanatory X variable.
Unlock Deck
Sign up to unlock the cards in this deck!
Unlock Deck
Unlock Deck
1/92
auto play flashcards
Play
simple tutorial
Full screen (f)
exit full mode
Deck 10: Regression Analysis: Estimating Relationships
1
The correlation value ranges from:

A) 0 to +1
B) -1 to +1
C) -2 to +2
D) -¥ to+ ¥
-1 to +1
2
The weakness of scatterplots is that they:

A) do not help identify linear relationships
B) can be misleading about the types of relationships they indicate
C) only help identify outliers
D) do not actually quantify the relationships between variables
do not actually quantify the relationships between variables
3
Correlation is a summary measure that indicates:

A) a curved relationship among the variables
B) the rate of change in Y for a one unit change in X
C) the strength of the linear relationship between pairs of variables
D) the magnitude of difference between two variables
the strength of the linear relationship between pairs of variables
4
A correlation value of zero indicates.

A) a strong linear relationship
B) a weak linear relationship
C) no linear relationship
D) a perfect linear relationship
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
5
The covariance is not used as much as the correlation because:

A) it is not always a valid predictor of linear relationships
B) it is difficult to calculate
C) it is difficult to interpret
D) of all of these options
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
6
In linear regression, we fit the least squares line to a set of values (or points on a scatterplot). The distance from the line to a point is called the:

A) fitted value
B) residual
C) correlation
D) covariance
E) estimated value
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
7
A scatterplot that appears as a shapeless mass of data points indicates:

A) a curved relationship among the variables
B) a linear relationship among the variables
C) a nonlinear relationship among the variables
D) no relationship among the variables
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
8
In regression analysis, the variable we are trying to explain or predict is called the:

A) independent variable
B) dependent variable
C) regression variable
D) statistical variable
E) residual variable
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
9
A "fan" shape in a scatterplot indicates:

A) unequal variance
B) a nonlinear relationship
C) the absence of outliers
D) sampling error
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
10
In linear regression, the fitted value is:

A) the predicted value of the dependent variable
B) the predicted value of the independent value
C) the predicted value of the slope
D) the predicted value of the intercept
E) none of these choices
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
11
Regression analysis asks:

A) if there are differences between distinct populations
B) if the sample is representative of the population
C) how a single variable depends on other relevant variables
D) how several variables depend on each other
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
12
In regression analysis, if there are several explanatory variables, it is called:

A) simple regression
B) multiple regression
C) compound regression
D) composite regression
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
13
Data collected from approximately the same period of time from a cross-section of a population are called:

A) time series data
B) linear data
C) cross-sectional data
D) historical data
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
14
A single variable X can explain a large percentage of the variation in some other variable Y when the two variables are:

A) mutually exclusive
B) inversely related
C) directly related
D) highly correlated
E) none of these choices
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
15
_____ is/are especially helpful in identifying outliers.

A) Linear regression
B) Regression analysis
C) Normal curves
D) Scatterplots
E) Multiple regression
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
16
In choosing the "best-fitting" line through a set of points in linear regression, we choose the one with the:

A) smallest sum of squared residuals
B) largest sum of squared residuals
C) smallest number of outliers
D) largest number of points on the line
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
17
Outliers are observations that:

A) lie outside the sample
B) render the study useless
C) lie outside the typical pattern of points on a scatterplot
D) disrupt the entire linear trend
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
18
In regression analysis, the variables used to help explain or predict the response variable are called the:

A) independent variables
B) dependent variables
C) regression variables
D) statistical variables
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
19
The term autocorrelation refers to:

A) the analyzed data refers to itself
B) the sample is related too closely to the population
C) the data are in a loop (values repeat themselves)
D) time series variables are usually related to their own past values
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
20
In regression analysis, which of the following causal relationships are possible?

A) X causes Y to vary.
B) Y causes X to vary.
C) Other variables cause both X and Y to vary.
D) All of these options are possible.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
21
Cross-sectional data are usually data gathered from approximately the same period of time from a cross-sectional of a population.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
22
In multiple regression, the coefficients reflect the expected change in:

A) Y when the associated X value increases by one unit
B) X when the associated Y value increases by one unit
C) Y when the associated X value decreases by one unit
D) X when the associated Y value decreases by one unit
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
23
Regression analysis can be applied equally well to cross-sectional and time series data.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
24
To help explain or predict the response variable in every regression study, we use one or more explanatory variables. These variables are also called response variables or independent variables.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
25
In linear regression, a dummy variable is used:

A) to represent residual variables
B) to represent missing data in each sample
C) to include hypothetical data in the regression equation
D) to include categorical variables in the regression equation
E) when "dumb" responses are included in the data
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
26
An important condition when interpreting the coefficient for a particular independent variable X in a multiple regression equation is that:

A) the dependent variable will remain constant
B) the dependent variable will be allowed to vary
C) all of the other independent variables remain constant
D) all of the other independent variables be allowed to vary
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
27
Which of the following is an example of a nonlinear regression model?

A) a quadratic regression equation
B) a logarithmic regression equation
C) constant elasticity equation
D) the learning curve model
E) all of these choices
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
28
The percentage of variation ( <strong>The percentage of variation (   ) can be interpreted as the fraction (or percent) of variation of the</strong> A) explanatory variable explained by the independent variable B) explanatory variable explained by the regression line C) response variable explained by the regression line D) error explained by the regression line ) can be interpreted as the fraction (or percent) of variation of the

A) explanatory variable explained by the independent variable
B) explanatory variable explained by the regression line
C) response variable explained by the regression line
D) error explained by the regression line
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
29
The standard error of the estimate ( <strong>The standard error of the estimate (   ) is essentially the</strong> A) mean of the residuals B) standard deviation of the residuals C) mean of the explanatory variable D) standard deviation of the explanatory variable ) is essentially the

A) mean of the residuals
B) standard deviation of the residuals
C) mean of the explanatory variable
D) standard deviation of the explanatory variable
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
30
In every regression study there is a single variable that we are trying to explain or predict. This is called the response variable or dependent variable.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
31
In linear regression, we can have an interaction variable. Algebraically, the interaction variable is the other variables in the regression equation.

A) sum
B) ratio
C) product
D) mean
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
32
The two primary objectives of regression analysis are to study relationships between variables and to use those relationships to make predictions.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
33
The percentage of variation (R2) ranges from:

A) 0 to +1
B) -1 to +1
C) -2 to +2
D) -1 to 0
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
34
The adjusted R2 adjusts R2 for:

A) non-linearity
B) outliers
C) low correlation
D) the number of explanatory variables in a multiple regression model
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
35
Approximately what percentage of the observed Y values are within one standard error of the estimate ( <strong>Approximately what percentage of the observed Y values are within one standard error of the estimate (   ) of the corresponding fitted Y values?</strong> A) 67% B) 95% C) 99% D) It is not possible to determine this. ) of the corresponding fitted Y values?

A) 67%
B) 95%
C) 99%
D) It is not possible to determine this.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
36
The multiple standard error of estimate will be:

A) 0.901
B) 0.888
C) 0.800
D) 0.953
E) 0.894
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
37
In a simple linear regression analysis, the following sums of squares are produced: <strong>In a simple linear regression analysis, the following sums of squares are produced:   The proportion of the variation in Y that is explained by the variation in X is:</strong> A) 20% B) 80% C) 25% D) 50% E) none of these choices The proportion of the variation in Y that is explained by the variation in X is:

A) 20%
B) 80%
C) 25%
D) 50%
E) none of these choices
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
38
Given the least squares regression line, <strong>Given the least squares regression line,   , which statement is true?</strong> A) The relationship between X and Y is positive. B) The relationship between X and Y is negative. C) As X increases, so does Y. D) As X decreases, so does Y. E) There is no relationship between X and Y. , which statement is true?

A) The relationship between X and Y is positive.
B) The relationship between X and Y is negative.
C) As X increases, so does Y.
D) As X decreases, so does Y.
E) There is no relationship between X and Y.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
39
The regression line <strong>The regression line   has been fitted to the data points (28, 60), (20, 50), (10, 18), and (25, 55). The sum of the squared residuals will be:</strong> A) 20.25 B) 16.00 C) 49.00 D) 94.25 has been fitted to the data points (28, 60), (20, 50), (10, 18), and (25, 55). The sum of the squared residuals will be:

A) 20.25
B) 16.00
C) 49.00
D) 94.25
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
40
In multiple regression, the constant <strong>In multiple regression, the constant   :</strong> A) is the expected value of the dependent variable Y when all of the independent variables have the value zero B) is necessary to fit the multiple regression line to set of points C) must be adjusted for the number of independent variables D) are all of these options :

A) is the expected value of the dependent variable Y when all of the independent variables have the value zero
B) is necessary to fit the multiple regression line to set of points
C) must be adjusted for the number of independent variables
D) are all of these options
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
41
A regression analysis between sales (in $1000) and advertising (in $100) resulted in the following least squares line: A regression analysis between sales (in $1000) and advertising (in $100) resulted in the following least squares line:   = 84 +7X. This implies that if there is no advertising, then the predicted amount of sales (in dollars) is $84,000. = 84 +7X. This implies that if there is no advertising, then the predicted amount of sales (in dollars) is $84,000.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
42
In regression analysis, we can often use the standard error of estimate In regression analysis, we can often use the standard error of estimate   to judge which of several potential regression equations is the most useful. to judge which of several potential regression equations is the most useful.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
43
When the scatterplot appears as a shapeless swarm of points, this can indicate that there is no relationship between the response variable Y and the explanatory variable X, or at least none worth pursuing.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
44
Correlation is measured on a scale from 0 to 1, where 0 indicates no linear relationship between two variables, and 1 indicates a perfect linear relationship.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
45
The least squares line is the line that minimizes the sum of the residuals.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
46
In a simple linear regression problem, if the percentage of variation explained In a simple linear regression problem, if the percentage of variation explained   is 0.95, this means that 95% of the variation in the explanatory variable X can be explained by regression. is 0.95, this means that 95% of the variation in the explanatory variable X can be explained by regression.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
47
The residual is defined as the difference between the actual and predicted, or fitted values of the response variable.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
48
Correlation is used to determine the strength of the linear relationship between an explanatory variable X and response variable Y.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
49
An outlier is an observation that falls outside of the general pattern of the rest of the observations on a scatterplot.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
50
A negative relationship between an explanatory variable X and a response variable Y means that as X increases, Y decreases, and vice versa.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
51
Scatterplots are used for identifying outliers and quantifying relationships between variables.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
52
In simple linear regression, the divisor of the standard error of estimate In simple linear regression, the divisor of the standard error of estimate   is n - 1, simply because there is only one explanatory variable of interest. is n - 1, simply because there is only one explanatory variable of interest.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
53
A regression analysis between sales (in $1000) and advertising (in $) resulted in the following least squares line: A regression analysis between sales (in $1000) and advertising (in $) resulted in the following least squares line:   = 32 + 8X. This implies that an increase of $1 in advertising is expected to result in an increase of $40 in sales. = 32 + 8X. This implies that an increase of $1 in advertising is expected to result in an increase of $40 in sales.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
54
In a simple regression analysis, if the standard error of estimate In a simple regression analysis, if the standard error of estimate   = 15 and the number of observations n = 10, then the sum of the residuals squared must be 120. = 15 and the number of observations n = 10, then the sum of the residuals squared must be 120.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
55
The multiple R for a regression is the correlation between the observed Y values and the fitted Y values.
.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
56
A regression analysis between weight (Y in pounds) and height (X in inches) resulted in the following least squares line: A regression analysis between weight (Y in pounds) and height (X in inches) resulted in the following least squares line:   = 140 + 5X. This implies that if the height is increased by 1 inch, the weight is expected to increase on average by 5 pounds. = 140 + 5X. This implies that if the height is increased by 1 inch, the weight is expected to increase on average by 5 pounds.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
57
A regression analysis between sales (in $1000) and advertising (in $100) resulted in the following least squares line: A regression analysis between sales (in $1000) and advertising (in $100) resulted in the following least squares line:   = 84 +7X. This implies that if advertising is $800, then the predicted amount of sales (in dollars) is $140,000. = 84 +7X. This implies that if advertising is $800, then the predicted amount of sales (in dollars) is $140,000.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
58
A useful graph in almost any regression analysis is a scatterplot of residuals (on the vertical axis) versus fitted values (on the horizontal axis), where a "good" fit not only has small residuals, but it has residuals scattered randomly around zero with no apparent pattern.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
59
In reference to the equation, In reference to the equation,   , the value 0.10 is the expected change in Y per unit change in   . , the value 0.10 is the expected change in Y per unit change in In reference to the equation,   , the value 0.10 is the expected change in Y per unit change in   . .
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
60
The regression line The regression line   = 3 + 2X has been fitted to the data points (4, 14), (2, 7), and (1, 4). The sum of the residuals squared will be 8.0. = 3 + 2X has been fitted to the data points (4, 14), (2, 7), and (1, 4). The sum of the residuals squared will be 8.0.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
61
The effect of a logarithmic transformation on a variable that is skewed to the right by a few large values is to "squeeze" the values together and make the distribution more symmetric.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
62
The R2 can only increase when extra explanatory variables are added to a multiple regression model.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
63
We should include an interaction variable in a regression model if we believe that the effect of one explanatory variable We should include an interaction variable in a regression model if we believe that the effect of one explanatory variable   on the response variable Y depends on the value of another explanatory variable   . on the response variable Y depends on the value of another explanatory variable We should include an interaction variable in a regression model if we believe that the effect of one explanatory variable   on the response variable Y depends on the value of another explanatory variable   . .
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
64
The primary purpose of a nonlinear transformation is to "straighten out" the data on a scatterplot.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
65
In a multiple regression analysis with three explanatory variables, suppose that there are 60 observations and the sum of the residuals squared is 28. The standard error of estimate must be 0.7071.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
66
The adjusted R2 is adjusted for the number of explanatory variables in a regression equation, and it has he same interpretation as the standard R2.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
67
The adjusted R2 is used primarily to monitor whether extra explanatory variables really belong in a multiple regression model.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
68
If a categorical variable is to be included in a multiple regression, a dummy variable for each category of the variable should be used, but the original categorical variables should not be sued.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
69
An interaction variable is the product of an explanatory variable and the dependent variable.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
70
If the regression equation includes anything other than a constant plus the sum of products of constants and variables, the model will not be linear.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
71
In a multiple regression problem with two explanatory variables if, the fitted regression equation is In a multiple regression problem with two explanatory variables if, the fitted regression equation is   , then the estimated value of Y when   and   is 49.4. , then the estimated value of Y when In a multiple regression problem with two explanatory variables if, the fitted regression equation is   , then the estimated value of Y when   and   is 49.4. and In a multiple regression problem with two explanatory variables if, the fitted regression equation is   , then the estimated value of Y when   and   is 49.4. is 49.4.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
72
If a scatterplot of residuals shows a parabola shape, then a logarithmic transformation may be useful in obtaining a better fit.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
73
In a simple linear regression problem, suppose that In a simple linear regression problem, suppose that   = 12.48 and   = 124.8. Then the percentage of variation explained   must be 0.90. = 12.48 and In a simple linear regression problem, suppose that   = 12.48 and   = 124.8. Then the percentage of variation explained   must be 0.90. = 124.8. Then the percentage of variation explained In a simple linear regression problem, suppose that   = 12.48 and   = 124.8. Then the percentage of variation explained   must be 0.90. must be 0.90.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
74
In a nonlinear transformation of data, the Y variable or the X variables may be transformed, but not both.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
75
For the multiple regression model For the multiple regression model   , if   were to increase by 5 units, holding   and   constant, the value of Y would be expected to decrease by 50 units. , if For the multiple regression model   , if   were to increase by 5 units, holding   and   constant, the value of Y would be expected to decrease by 50 units. were to increase by 5 units, holding For the multiple regression model   , if   were to increase by 5 units, holding   and   constant, the value of Y would be expected to decrease by 50 units. and For the multiple regression model   , if   were to increase by 5 units, holding   and   constant, the value of Y would be expected to decrease by 50 units. constant, the value of Y would be expected to decrease by 50 units.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
76
The percentage of variation explained, The percentage of variation explained,   , is the square of the correlation between the observed Y values and the fitted Y values. , is the square of the correlation between the observed Y values and the fitted Y values.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
77
The coefficients for logarithmically transformed explanatory variables should be interpreted as the percent change in the dependent variable for a 1% percent change in the explanatory variable.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
78
In the multiple regression model In the multiple regression model   we interpret X<sub>1</sub> as follows: holding X<sub>2</sub> constant, if X<sub>1</sub> increases by 1 unit, then the expected value of Y will increase by 9 units. we interpret X1 as follows: holding X2 constant, if X1 increases by 1 unit, then the expected value of Y will increase by 9 units.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
79
A logarithmic transformation of the response variable Y is often useful when the distribution of Y is symmetric.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
80
In a simple regression with a single explanatory variable, the multiple R is the same as the standard correlation between the Y variable and the explanatory X variable.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
locked card icon
Unlock Deck
Unlock for access to all 92 flashcards in this deck.