Question 1

Which of the following assumptions appears violated based on this plot?&#10;A) The variance of the errors is constant&#10;B) The errors are independent&#10;C) The mean of the errors is zero&#10;D) The errors are normally distributed

Accepted Answer

C

Question 2

11eb4064_6151_a61f_ad6e_8516466f814d_TB2969_00

Accepted Answer

C

Question 3

Consider the second-order model

Accepted Answer

B

Question 4

What relationship between x and y is suggested by the scattergram?  &#10;A) a quadratic relationship with downward concavity&#10;B) a linear relationship with negative slope&#10;C) a linear relationship with positive slope&#10;D) a quadratic relationship with upward concavity

Accepted Answer

The answer of What relationship between x and y is...

Question 5

A study of the top MBA programs attempted to predict the average starting salary (in $1000's) of graduates of the program based on the amount of tuition (in $1000's) charged by the program and the average GMAT score of the program's students. The results of a regression analysis based on a sample of 75 MBA programs is shown below:

Least Squares Linear Regression of Salary

$\begin{array} { l c c c l }\text {Predictor}\\ \text { Variables } & \text { Coefficient } & \text { Std Error } & \text { T } & \text { P } \\ \text { Constant } & 169.910 & 26.5350 & 6.40 & 0.0000 \\ \text { Tuition } & - 3.37373 & 0.81171 & - 4.16 & 0.0001 \\ \text { TxT } & 0.03563 & 0.00590 & 6.03 & 0.0000 \end{array}$

$\begin{array} { l c c r } \text { R-Squared } & 0.7361 & \text { Resid. Mean Square (MSE) } & 358.887 \\ \text { Adjusted R-Squared } & 0.7288 & \text { Standard Deviation } & 18.9443 \end{array}$

$\begin{array} { l l c c } \text { Source } & \text { DF } & \text { SS } \\ \text { Regression } & 2 & & 72081.8 \\ \text { Residual } & & 72 & 25839.8 \\ \text { Total } & & 74 & 97921.7 \\ & & & \\ \text { Cases Included } 75 & \text { Missing Cases 0 } \end{array}$
One of the t-test test statistics is shown on the printout to be the value $t = 6.03$ . Interpret this value.

A) There is sufficient evidence, at

\alpha = 0.05

, to indicate that at least one of the variables proposed in the interaction model is useful at predicting the average starting salary of graduates of MBA programs.
B) There is sufficient evidence, at

\alpha = 0.05

, to indicate that there is a linear relationship between average starting salary of graduates of MBA programs and the tuition of the MBA program.
C) There is insufficient evidence, at

\alpha = 0.05

, to indicate that at least one of the variables proposed in the interaction model is useful at predicting the average starting salary of graduates of MBA programs.
D) There is sufficient evidence, at

\alpha = 0.05

, to indicate that there is a curvilinear relationship between average starting salary of graduates of MBA programs and the tuition of the MBA program.

Accepted Answer

The global F-test statistic is used to test the overall significance of the regression model. The null hypothesis is that all of the regression coefficients are equal to zero, which means that the model does not explain any of the variation in the dependent variable. The alternative hypothesis is that at least one of the regression coefficients is not equal to zero, which means that the model does explain some of the variation in the dependent variable.The F-test statistic is calculated by dividing the mean square for the regression by the mean square for the residual. The mean square for the regression is a measure of the variation in the dependent variable that is explained by the model, while the mean square for the residual is a measure of the variation in the dependent variable that is not explained by the model.In this case, the F-test statistic is 100.42, which is significant at the 0.05 level. This means that we can reject the null hypothesis and conclude that at least one of the regression coefficients is not equal to zero. In other words, the model does explain some of the variation in the dependent variable.

Question 6

A study of the top MBA programs attempted to predict the average starting salary (in $1000's) of graduates of the program based on the amount of tuition (in $1000's) charged by the program and The average GMAT score of the program's students. The results of a regression analysis based on a Sample of 75 MBA programs is shown below: Least Squares Linear Regression of Salary

The model was then used to create 95% confidence and prediction intervals for y and for E(Y) when The tuition charged by the MBA program was $75,000 and the GMAT score was 675. The results are Shown here:

95% confidence interval for E(Y): ($126,610, $136,640)
95% prediction interval for Y: ($90,113, $173,160)

Which of the following interpretations is correct if you want to use the model to estimate E(Y) for All MBA programs?

A) We are 95% confident that the average starting salary for graduates of a single MBA program that charges $75,000 in tuition and has an average GMAT score of 675 will fall between
$90,113 and $173,16,30.
B) We are 95% confident that the average starting salary for graduates of a single MBA program that charges $75,000 in tuition and has an average GMAT score of 675 will fall between
$126,610 and $136,640.
C) We are 95% confident that the average of all starting salaries for graduates of all MBA programs that charge $75,000 in tuition and have an average GMAT score of 675 will fall
Between $126,610 and $136,640.
D) We are 95% confident that the average of all starting salaries for graduates of all MBA programs that charge $75,000 in tuition and have an average GMAT score of 675 will fall
Between $90,113 and $173,16,30.

Accepted Answer

The p-value for the global f-test is 0.0000, which is less than 0.05. This means that there is sufficient evidence to indicate that something in the regression model is useful for predicting the average starting salary of the graduates of an MBA program.

Question 7

11eb4064_6150_e2c1_ad6e_b3e741998b04_TB2969_00

Accepted Answer

A quadratic model includes a squared term of the independent variable, which is present in option B with $\beta _ { 2 } \mathrm { x } _ { 1 } ^ { 2 }$.

Question 8

A study of the top MBA programs attempted to predict the average starting salary (in $1000's) of graduates of the program based on the amount of tuition (in $1000's) charged by the program and The average GMAT score of the program's students. The results of a regression analysis based on a Sample of 75 MBA programs is shown below: Least Squares Linear Regression of Salary

The model was then used to create 95% confidence and prediction intervals for y and for E(Y) when The tuition charged by the MBA program was $75,000 and the GMAT score was 675. The results are Shown here:

95% confidence interval for E(Y): ($126,610, $136,640)
95% prediction interval for Y: ($90,113, $173,160)

Which of the following interpretations is correct if you want to use the model to estimate E(Y) for All MBA programs?

A) We are 95% confident that the average starting salary for graduates of a single MBA program that charges $75,000 in tuition and has an average GMAT score of 675 will fall between
$90,113 and $173,16,30.
B) We are 95% confident that the average starting salary for graduates of a single MBA program that charges $75,000 in tuition and has an average GMAT score of 675 will fall between
$126,610 and $136,640.
C) We are 95% confident that the average of all starting salaries for graduates of all MBA programs that charge $75,000 in tuition and have an average GMAT score of 675 will fall
Between $126,610 and $136,640.
D) We are 95% confident that the average of all starting salaries for graduates of all MBA programs that charge $75,000 in tuition and have an average GMAT score of 675 will fall
Between $90,113 and $173,16,30.

Accepted Answer

The answer of A study of the top MBA programs...

Question 9

A public health researcher wants to use regression to predict the sun safety knowledge of pre-school children. The researcher randomly sampled 35 preschoolers, assigned them to one of Two groups, and then measured the following three variables:
SUNSCORE: $\quad \mathrm { y } =$ Score on sun-safety comprehension test
READING: $\quad \mathrm { x } _ { 1 } =$ Reading comprehension score
GROUP: $\quad\quad x _ { 2 } = 1$ if child received a Be Sun Safe demonstration, 0 if not

The following two models were hypothesized:
Model 1: $E ( y ) = \beta _ { 0 } + \beta _ { 1 } x _ { 1 } + \beta _ { 2 } x _ { 1 } ^ { 2 } + \beta _ { 3 } x _ { 2 } + \beta _ { 4 } x _ { 1 } x _ { 2 } + \beta _ { 5 } x _ { 1 } ^ { 2 } x _ { 2 }$
Model 2: $E ( y ) = \beta _ { 0 } + \beta _ { 1 } x _ { 1 } + \beta _ { 3 } x _ { 2 } + \beta _ { 4 } x _ { 1 } x _ { 2 }$

A partial f-test was conducted to compare the two models and the resulting p-value was found to be 0.0023. Fill in the blank. The results lead us to conclude that there is _____ $\text { (at } \alpha = 0.05 )$

A) insufficient evidence of quadratic relationship between sun-safety score to reading score.
B) sufficient evidence of a statistically useful model for sun-safety score.
C) sufficient evidence of interaction between sun-safety score and reading score.
D) sufficient evidence of a quadratic relationship between sun-safety score to reading score.

Accepted Answer

The p-value of 0.0023 is less than the significance level of 0.05, indicating that the quadratic terms in Model 1 significantly improve the model fit compared to Model 2, providing evidence of a quadratic relationship.

Question 10

We decide to conduct a multiple regression analysis to predict the attendance at a major league baseball game. We use the size of the stadium as a quantitative independent variable and the type Of game as a qualitative variable (with two levels - day game or night game). We hypothesize the
Following model: $\mathrm { E } ( \mathrm { y } ) = \beta _ { 0 } + \beta _ { 1 ^ { \mathrm { x } } 1 } + \beta _ { 2 \mathrm { x } _ { 2 } } + \beta _ { 3 } \mathrm { x } _ { 3 }$
Where $\quad$ $x _ { 1 } =$ size of the stadium
$\quad$ $\quad$ $\quad$ $x _ { 2 } = 1$ if a day game, 0 if a night game

A plot of the $y - x _ { 1 }$ relationship would show:

A) Two non-parallel curves
B) Two parallel lines
C) Two parallel curves
D) Two non-parallel lines

Accepted Answer

The answer of We decide to conduct a multiple regression...

Question 11

Which equation represents a complete second-order model for two quantitative independent variables? &#10;A) $E ( y ) = \beta _ { 0 } + \beta _ { 1 } x _ { 1 } ^ { 2 } + \beta _ { 2 } x _ { 2 } ^ { 2 } + \beta _ { 3 } x _ { 1 } ^ { 2 } x _ { 2 } + \beta _ { 4 } x _ { 1 } x _ { 2 } ^ { 2 } + \beta _ { 5 } x _ { 1 } ^ { 2 } x _ { 2 } ^ { 2 }$&#10;B) $E ( y ) = \beta _ { 0 } + \beta _ { 1 } x _ { 1 } + \beta _ { 2 } x _ { 2 } + \beta _ { 3 } x _ { 1 } x _ { 2 } + \beta _ { 4 } x _ { 1 } ^ { 2 } + \beta _ { 5 } x _ { 2 } ^ { 2 }$&#10;C) $E ( y ) = \beta _ { 0 } + \beta _ { 1 } x _ { 1 } + \beta _ { 2 } x _ { 2 } + \beta _ { 3 } x _ { 1 } ^ { 2 } + \beta _ { 4 } x _ { 2 } ^ { 2 }$&#10;D) $E ( y ) = \beta _ { 0 } + \beta _ { 1 } x _ { 1 } x _ { 2 } + \beta _ { 2 } x _ { 1 } ^ { 2 } + \beta _ { 3 } x _ { 2 } ^ { 2 }$

Accepted Answer

The answer of Which equation represents a complete second-order model...

Question 12

11eb4064_6150_e2c2_ad6e_b9298241101b_TB2969_00

Accepted Answer

The answer of 11eb4064_6150_e2c2_ad6e_b9298241101b_TB2969_00...

Question 13

&#10;A) 11&#10;B) .9286&#10;C) 5.5&#10;D) .9405

Accepted Answer

The answer of  &#10;A) 11&#10;B) .9286&#10;C) 5.5&#10;D) .9405...

Question 14

Which of the following is not a possible indicator of multicollinearity?&#10;A) significant correlations between pairs of independent variables&#10;B) non-significant t-tests for individual &#946; parameters when the F-test for overall model adequacy is significant&#10;C) signs opposite from what is expected in the estimated &#946; parameters&#10;D) non-random patterns in the plot of the residuals versus the fitted values

Accepted Answer

The answer of Which of the following is not a...

Question 15

A study of the top MBA programs attempted to predict the average starting salary (in $1000's) of graduates of the program based on the amount of tuition (in $1000's) charged by the program and The average GMAT score of the program's students. The results of a regression analysis based on a Sample of 75 MBA programs is shown below: Least Squares Linear Regression of Salary

The model was then used to create 95% confidence and prediction intervals for y and for E(Y) when The tuition charged by the MBA program was $75,000 and the GMAT score was 675. The results are Shown here:

95% confidence interval for E(Y): ($126,610, $136,640)
95% prediction interval for Y: ($90,113, $173,160)

Which of the following interpretations is correct if you want to use the model to estimate E(Y) for All MBA programs?

A) We are 95% confident that the average starting salary for graduates of a single MBA program that charges $75,000 in tuition and has an average GMAT score of 675 will fall between
$90,113 and $173,16,30.
B) We are 95% confident that the average starting salary for graduates of a single MBA program that charges $75,000 in tuition and has an average GMAT score of 675 will fall between
$126,610 and $136,640.
C) We are 95% confident that the average of all starting salaries for graduates of all MBA programs that charge $75,000 in tuition and have an average GMAT score of 675 will fall
Between $126,610 and $136,640.
D) We are 95% confident that the average of all starting salaries for graduates of all MBA programs that charge $75,000 in tuition and have an average GMAT score of 675 will fall
Between $90,113 and $173,16,30.

Accepted Answer

The answer of A study of the top MBA programs...

Question 16

11eb4064_6151_f347_ad6e_31b4163e1a3a_TB2969_00

Accepted Answer

The answer of 11eb4064_6151_f347_ad6e_31b4163e1a3a_TB2969_00...

Question 17

11eb4064_6151_a621_ad6e_69de16b93d30_TB2969_00

Accepted Answer

The answer of 11eb4064_6151_a621_ad6e_69de16b93d30_TB2969_00...

Question 18

A study of the top MBA programs attempted to predict the average starting salary (in $1000's) of graduates of the program based on the amount of tuition (in $1000's) charged by the program and The average GMAT score of the program's students. The results of a regression analysis based on a Sample of 75 MBA programs is shown below: Least Squares Linear Regression of Salary

The model was then used to create 95% confidence and prediction intervals for y and for E(Y) when The tuition charged by the MBA program was $75,000 and the GMAT score was 675. The results are Shown here:

95% confidence interval for E(Y): ($126,610, $136,640)
95% prediction interval for Y: ($90,113, $173,160)

Which of the following interpretations is correct if you want to use the model to estimate E(Y) for All MBA programs?

A) We are 95% confident that the average starting salary for graduates of a single MBA program that charges $75,000 in tuition and has an average GMAT score of 675 will fall between
$90,113 and $173,16,30.
B) We are 95% confident that the average starting salary for graduates of a single MBA program that charges $75,000 in tuition and has an average GMAT score of 675 will fall between
$126,610 and $136,640.
C) We are 95% confident that the average of all starting salaries for graduates of all MBA programs that charge $75,000 in tuition and have an average GMAT score of 675 will fall
Between $126,610 and $136,640.
D) We are 95% confident that the average of all starting salaries for graduates of all MBA programs that charge $75,000 in tuition and have an average GMAT score of 675 will fall
Between $90,113 and $173,16,30.

Accepted Answer

The answer of A study of the top MBA programs...

Question 19

A study of the top MBA programs attempted to predict the average starting salary (in $1000's) of graduates of the program based on the amount of tuition (in $1000's) charged by the program and The average GMAT score of the program's students. The results of a regression analysis based on a Sample of 75 MBA programs is shown below: Least Squares Linear Regression of Salary

The model was then used to create 95% confidence and prediction intervals for y and for E(Y) when The tuition charged by the MBA program was $75,000 and the GMAT score was 675. The results are Shown here:

95% confidence interval for E(Y): ($126,610, $136,640)
95% prediction interval for Y: ($90,113, $173,160)

Which of the following interpretations is correct if you want to use the model to estimate E(Y) for All MBA programs?

A) We are 95% confident that the average starting salary for graduates of a single MBA program that charges $75,000 in tuition and has an average GMAT score of 675 will fall between
$90,113 and $173,16,30.
B) We are 95% confident that the average starting salary for graduates of a single MBA program that charges $75,000 in tuition and has an average GMAT score of 675 will fall between
$126,610 and $136,640.
C) We are 95% confident that the average of all starting salaries for graduates of all MBA programs that charge $75,000 in tuition and have an average GMAT score of 675 will fall
Between $126,610 and $136,640.
D) We are 95% confident that the average of all starting salaries for graduates of all MBA programs that charge $75,000 in tuition and have an average GMAT score of 675 will fall
Between $90,113 and $173,16,30.

Accepted Answer

The answer of A study of the top MBA programs...

Question 20

11eb4064_6151_30e9_ad6e_25a932f0ff19_TB2969_00

Accepted Answer

The answer of 11eb4064_6151_30e9_ad6e_25a932f0ff19_TB2969_00...

Question 21

Consider the partial printout below. $$\begin{array}{l}&#10;\begin{array} { l c l l l l l } &#10;\hline & \text { Coefficients } & \text { Standard Error } & t \text { Stat } & \text { P-value } & \text { Lower 95\% } & \text { Upper 95\% } \&#10;\hline \text { Intercept } &- 63.14873931 & 25.09115112 & - 2.516773304 & 0.045484943 & - 124.5446192 & - 1.752859365 \&#10;\text { X1 } _ { 1 } & 14.72507864 & 8.113581741 & 1.814867849 & 0.119466699 & - 5.128155197 & 34.57831248 \&#10;\text { X2 } & 12.48784546 & 4.686063743 & 2.664890224 & 0.037279879 & 1.021452165 & 23.95423875 \&#10;\text { X1X2 } & - 1.886935135 & 1.344999834 & - 1.402925924 & 0.210210141 & - 5.178033575 & 1.404163305 \&#10;\hline&#10;\end{array}\&#10;\text { Is there evidence (at } \alpha = .05 \text { ) that } x _ { 1 } \text { and } x _ { 2 } \text { interact? Explain. }&#10;\end{array}$$

Accepted Answer

The answer of Consider the partial printout below. \[\begin{array}{l}&#10;\begin{array} {...

Question 22

&#10;A) 4.2&#10;B) 10.8&#10;C) 11.4&#10;D) 1.8

Accepted Answer

The answer of  &#10;A) 4.2&#10;B) 10.8&#10;C) 11.4&#10;D) 1.8...

Question 23

It is dangerous to predict outside the range of the data collected in a regression analysis. For instance, we shouldn't predict the price of a 5000 square foot home if all our sample homes were smaller than 4500 square feet. Which of the following multiple regression pitfalls does this example describe?

A) Estimability
B) Multicollinearity
C) Stepwise Regression
D) Extrapolation

Accepted Answer

The answer of It is dangerous to predict outside the...

Question 24

A study of the top MBA programs attempted to predict the average starting salary (in $1000's) of graduates of the program based on the amount of tuition (in $1000's) charged by the program and The average GMAT score of the program's students. The results of a regression analysis based on a Sample of 75 MBA programs is shown below: Least Squares Linear Regression of Salary

The model was then used to create 95% confidence and prediction intervals for y and for E(Y) when The tuition charged by the MBA program was $75,000 and the GMAT score was 675. The results are Shown here:

95% confidence interval for E(Y): ($126,610, $136,640)
95% prediction interval for Y: ($90,113, $173,160)

Which of the following interpretations is correct if you want to use the model to estimate E(Y) for All MBA programs?

A) We are 95% confident that the average starting salary for graduates of a single MBA program that charges $75,000 in tuition and has an average GMAT score of 675 will fall between
$90,113 and $173,16,30.
B) We are 95% confident that the average starting salary for graduates of a single MBA program that charges $75,000 in tuition and has an average GMAT score of 675 will fall between
$126,610 and $136,640.
C) We are 95% confident that the average of all starting salaries for graduates of all MBA programs that charge $75,000 in tuition and have an average GMAT score of 675 will fall
Between $126,610 and $136,640.
D) We are 95% confident that the average of all starting salaries for graduates of all MBA programs that charge $75,000 in tuition and have an average GMAT score of 675 will fall
Between $90,113 and $173,16,30.

Accepted Answer

The answer of A study of the top MBA programs...

Question 25

Retail price data for $n = 60$ hard disk drives were recently reported in a computer magazine. Three variables were recorded for each hard disk drive:
$y =$ Retail PRICE (measured in dollars)
$x _ { 1 } =$ Microprocessor SPEED (measured in megahertz)
(Values in sample range from 10 to 40 )
$x _ { 2 } = \mathrm { CHIP }$ size (measured in computer processing units)
(Values in sample range from 286 to 486 )

A first-order regression model was fit to the data. Part of the printout follows:

$\quad$ $\quad$ $\quad$ $\quad$ $\quad$ $\quad$ $\quad$ $\quad$ Parameter Estimates
$\quad$ $\quad$ $\quad$ PARAMETER STANDARD $\quad$ $\quad$ T FOR 0:
VARIABLE DF ESTIMATE ERROR PARAMETER $= 0$ PROB $> | T |$

$\begin{array} { l l l l l l } \text { INTERCEPT } &1 & - 373.526392 & 1258.1243396 & - 0.297 & 0.7676 \\\text { SPEED } & 1 & 104.838940 & 22.36298195 & 4.688 & 0.0001 \\\text { CHIP } & 1 & 3.571850 & 3.89422935 & 0.917 & 0.3629\end{array}$

Identify and interpret the estimate for the SPEED $\beta$ -coefficient, $\hat { \beta } _ { 1 }$ .

A)

\hat { \beta } _ { 1 } = 3.57

; For every 1-megahertz increase in SPEED, we estimate PRICE to increase

\$ 3,57

, holding CHIP fixed.
B)

\hat { \beta } _ { 1 } = 105

; For every 1-megahertz increase in SPEED, we estimate PRICE (y) to increase

\$ 105

, holding CHIP fixed.
C)

\hat { \beta } _ { 1 } = 105

; For every

\$ 1

increase in PRICE, we estimate SPEED to increase 105 megahertz, holding CHIP fixed.
D)

\hat { \beta } _ { 1 } = 3.57

; For every

\$ 1

increase in PRICE, we estimate SPPED to increase by about 4 megahertz, holding CHIP fixed.

Accepted Answer

The answer of Retail price data for $n = 60$...

Question 26

The first-order model below was fit to a set of data.   Explain how to determine if the constant variance assumption is satisfied.

Accepted Answer

The answer of The first-order model below was fit to...

Question 27

Twenty colleges each recommended one of its graduating seniors for a prestigious graduate fellowship. The process to determine which student will receive the fellowship includes several interviews. The gender of each student and his or her score on the first interview are shown below.

\begin{array}{clc}\hline \text { Student } & \text { Gender } & \text { Score } \\\hline 1 & \text { Male } & 18 \\2 & \text { Female } & 17 \\3 & \text { Female } & 19 \\4 & \text { Female } & 16 \\5 & \text { Male } & 12 \\6 & \text { Female } & 15 \\7 & \text { Female } & 18 \\8 & \text { Male } & 16 \\9 & \text { Male } & 18 \\10 & \text { Female } & 20\end{array}

\begin{array}{clc}\hline \text { Student } & \text { Gender } & \text { Score } \\\hline 11 & \text { Female } & 17 \\12 & \text { Male } & 16 \\13 & \text { Male } & 16 \\14 & \text { Female } & 19 \\15 & \text { Female } & 16 \\16 & \text { Male } & 15 \\17 & \text { Female } & 12 \\18 & \text { Male } & 14 \\19 & \text { Female } & 16 \\20 & \text { Female } & 18\end{array}

a. Suppose you want to use gender to model the score on the interview y. Create the
appropriate number of dummy variables for gender and write the model.
b. Fit the model to the data.
c. Give the null hypothesis for testing whether gender is a useful predictor of the score y.
d. Conduct the test and give the appropriate conclusion

\text { Use } \alpha = .05

Accepted Answer

The answer of Twenty colleges each recommended one of its...

Question 28

Retail price data for n = 60 hard disk drives were recently reported in a computer magazine. Three variables were recorded for each hard disk drive:

\begin{aligned} y = & \text { Retail PRICE (measured in dollars) } \\ x _ { 1 } = & \text { Microprocessor SPEED (measured in megahertz) } \\ & \text { (Values in sample range from } 10 \text { to } 40 \text { ) } \\ x _ { 2 } = & \text { CHIP size (measured in computer processing units) } \\ & \text { (Values in sample range from } 286 \text { to } 486 \text { ) } \end{aligned}

A first-order regression model. was fit to the data. Part of the printout follows:

\quad

\quad

\quad

\quad

\quad

\quad

\quad

\quad

\quad

Parameter Estimates

\quad

\quad

\quad

\quad

\quad

PARAMETER STANDARD

\quad

T FOR 0 :
VARIABLE DF ESTIMATE ERROR PARAMETER

= 0

PROB

> | \mathrm { T } |

\begin{array} { l r l l l l } \text { INTERCEPT } &1 & - 373.526392 & 1258.1243396 & - 0.297 & 0.7676 \\\text { SPEED } & 1 & 104.838940 & 22.36298195 & 4.688 & 0.0001 \\\text { CHIP } & 1 & 3.571850 & 3.89422935 & 0.917 & 0.3629\end{array}

\text { Identify and interpret the estimate of } \beta_{2} \text {. }

Accepted Answer

The answer of Retail price data for n = 60...

Question 29

A public health researcher wants to use regression to predict the sun safety knowledge of pre-school children. The researcher randomly sampled 35 preschoolers, assigned them to one of two groups, and then measured the following three variables:&#10; SUNSCORE: $\quad \mathrm { y } =$ Score on sun-safety comprehension test&#10;READING: $\quad \mathrm { x } _ { 1 } =$ Reading comprehension score&#10;GROUP: $\quad \quad x _ { 2 } = 1$ if child received a Be Sun Safe demonstration, 0 if not&#10;&#10;A regression model was fit and the following residual plot was observed.&#10;Predicted value of $y$&#10; &#10; Which of the following assumptions appears violated based on this plot?&#10;A) The errors are normally distributed&#10;B) The errors are independent&#10;C) The mean of the errors is zero&#10;D) The variance of the errors is constant

Accepted Answer

The answer of A public health researcher wants to use...

Question 30

Consider the partial printout for an interaction regression analysis of the relationship between a dependent variable

y

and two independent variables

x _ { 1 }

and

x _ { 2 }

.
ANOVA

\begin{array}{llllll}\hline & \text { df } & \text { SS } & \text { MS } & F & \text { Significance F } \\\hline \text { Regression } & 3 & 3393.677324 & 1131.225775 & 9391.974782 & 2.11084 \mathrm{E}-11 \\\text { Residual } & 6 & 0.722675987 & 0.120445998 & & \\\text { Total } & 9 & 3394.4 & & & \\\hline\end{array}

\begin{array}{lllllll} & \text { Coefficients } & \text { Standard Error } & t \text { Stat } & \text { P-value } & \text { Lower 95\% } & \text { Upper 95\% } \\\hline \text { Intercept } & 16.72197014 & 8.283997219 & 2.018587126 & 0.09007654 & -3.548255659 & 36.99219593 \\\text { X1 }_{1} & -3.037317759 & 2.678748705 & -1.133856921 & 0.300116382 & -9.591984506 & 3.517348987 \\\text { X2 }_{2} & -1.046522754 & 1.547132645 & -0.676427297 & 0.523973988 & -4.832222727 & 2.73917722 \\\text { X1X2 }_{1} & 4.071685147 & 0.444059933 & 9.169224345 & 9.47663 \mathrm{E}-05 & 2.98510884 & 5.158261454\end{array}

a. Write the prediction equation for the interaction model.
b. Test the overall utility of the interaction model using the global

F

-test at

\alpha = .05

.
c. Test the hypothesis (at

\alpha = .05

) that

x _ { 1 }

and

x _ { 2 }

interact positively.
d. Estimate the change in

y

for each additional 1-unit increase in

x _ { 1 }

when

x _ { 2 } = 6

.

Accepted Answer

The answer of Consider the partial printout for an interaction...

Question 31

A study of the top MBA programs attempted to predict the average starting salary (in $1000's) of graduates of the program based on the amount of tuition (in $1000's) charged by the program and The average GMAT score of the program's students. The results of a regression analysis based on a Sample of 75 MBA programs is shown below: Least Squares Linear Regression of Salary

The model was then used to create 95% confidence and prediction intervals for y and for E(Y) when The tuition charged by the MBA program was $75,000 and the GMAT score was 675. The results are Shown here:

95% confidence interval for E(Y): ($126,610, $136,640)
95% prediction interval for Y: ($90,113, $173,160)

Which of the following interpretations is correct if you want to use the model to estimate E(Y) for All MBA programs?

A) We are 95% confident that the average starting salary for graduates of a single MBA program that charges $75,000 in tuition and has an average GMAT score of 675 will fall between
$90,113 and $173,16,30.
B) We are 95% confident that the average starting salary for graduates of a single MBA program that charges $75,000 in tuition and has an average GMAT score of 675 will fall between
$126,610 and $136,640.
C) We are 95% confident that the average of all starting salaries for graduates of all MBA programs that charge $75,000 in tuition and have an average GMAT score of 675 will fall
Between $126,610 and $136,640.
D) We are 95% confident that the average of all starting salaries for graduates of all MBA programs that charge $75,000 in tuition and have an average GMAT score of 675 will fall
Between $90,113 and $173,16,30.

Accepted Answer

The answer of A study of the top MBA programs...

Question 32

During its manufacture, a product is subjected to four different tests in sequential order. An efficiency expert claims that the fourth (and last) test is unnecessary since its results can be predicted based on the first three tests. To test this claim, multiple regression will be used to model Test4 score $( y )$ , as a function of Test1 score $\left( x _ { 1 } \right)$ , Test 2 score $\left( x _ { 2 } \right)$ , and Test3 score $\left( x _ { 3 } \right)$ . [Note: All test scores range from 200 to 800 , with higher scores indicative of a higher quality product.] Consider the model:
$E ( y ) = \beta _ { 1 } + \beta _ { 1 } x _ { 1 } + \beta _ { 2 } x _ { 2 } + \beta _ { 3 } x _ { 3 }$
The first-order model was fit to the data for each of 12 units sampled from the production line. The results are summarized in the printout.
$\begin{array}{lrrrrr}\text { SOURCE } & \text { DF } & \text { SS } & \text { MS } & \text { F VALUE } & \text { PROB > F } \\\text { MODEL } & 3 & 151417 & 50472 & 18.16 & .0075 \\\text { ERROR } & 8 & 22231 & 2779 & & \\\text { TOTAL } & 12 & 173648 & & &\end{array}$

$\begin{array}{llll}\text { ROOT MSE } & 52.72 & \text { R-SQUARE } & 0.872 \\\text { DEP MEAN } & 645.8 & \text { ADJ R-SQ } & 0.824\end{array}$

$\begin{array}{lrrrr} & \text { PARAMETER } & \text { STANDARD } & \text { T FOR 0: } & \\\text { VARIABLE } & \text { ESTIMATE } & \text { ERROR } & \text { PARAMETER }=0 & \text { PROB > }|\mathrm{T}| \\\text { INTERCEPT } & 11.98 & 80.50 & 0.15 & 0.885 \\\text { X1(TEST1) } & 0.2745 & 0.1111 & 2.47 & 0.039 \\\text { X2(TEST2) } & 0.3762 & 0.0986 & 3.82 & 0.005 \\\text { X3(TEST3) } & 0.3265 & 0.0808 & 4.04 & 0.004\end{array}$

Suppose the $95 \%$ confidence interval for $\beta _ { 3 }$ is $( .15 , .47 )$ . Which of the following statements is incorrect?

A) We are

95 \%

confident that the Test 3 is a useful linear predictor of Test 4 score, holding Test1 and Test2 fixed.
B) At

\alpha = .05

, there is insufficient evidence to reject

H _ { 0 } : \beta _ { 3 } = 0

in favor of

H _ { \mathrm { a } } : \beta 3 \neq 0

.
C) We are

95 \%

confident that the increase in Test4 score for every 1-point increase in Test3 score falls between

.15

and

.47

, holding Test1 and Test 2 fixed.
D) We are

95 \%

confident that the estimated slope for the Test4-Test3 line falls between

.15

and

.47

holding Test1 and Test2 fixed.

Accepted Answer

The answer of During its manufacture, a product is subjected...

Question 33

The confidence interval for the mean E(y) is narrower that the prediction interval for y.

Accepted Answer

The answer of The confidence interval for the mean E(y)...

Question 34

11eb4064_6152_1a5a_ad6e_897bd3159677_TB2969_00

Accepted Answer

The answer of 11eb4064_6152_1a5a_ad6e_897bd3159677_TB2969_00...

Question 35

11eb4064_6153_04d3_ad6e_9f52215fc701_TB2969_00 11eb4064_6153_04d4_ad6e_61493fb156fe_TB2969_00

Accepted Answer

The answer of 11eb4064_6153_04d3_ad6e_9f52215fc701_TB2969_00 11eb4064_6153_04d4_ad6e_61493fb156fe_TB2969_00...

Question 36

11eb4064_6152_416e_ad6e_3ffa30b1d099_TB2969_00

Accepted Answer

The answer of 11eb4064_6152_416e_ad6e_3ffa30b1d099_TB2969_00...

Question 37

In regression, it is desired to predict the dependent variable based on values of other related independent variables. Occasionally, there are relationships that exist between the independent variables. Which of the following multiple regression pitfalls does this example describe?

A) Multicollinearity
B) Extrapolation
C) Stepwise Regression
D) Estimability

Accepted Answer

The answer of In regression, it is desired to predict...

Question 38

&#10;A) 1&#10;B) 10&#10;C) 16&#10;D) 13

Accepted Answer

The answer of  &#10;A) 1&#10;B) 10&#10;C) 16&#10;D) 13...

Question 39

A study of the top MBA programs attempted to predict the average starting salary (in $1000's) of graduates of the program based on the amount of tuition (in $1000's) charged by the program and the average GMAT score of the program's students. The results of a regression analysis based on a sample of 75 MBA programs is shown below:

Least Squares Linear Regression of Salary

$\begin{array} { l c c c l }\text {Predictor}\\ \text { Variables } & \text { Coefficient } & \text { Std Error } & \text { T } & \text { P } \\ \text { Constant } & 169.910 & 26.5350 & 6.40 & 0.0000 \\ \text { Tuition } & - 3.37373 & 0.81171 & - 4.16 & 0.0001 \\ \text { TxT } & 0.03563 & 0.00590 & 6.03 & 0.0000 \end{array}$

$\begin{array} { l c c r } \text { R-Squared } & 0.7361 & \text { Resid. Mean Square (MSE) } & 358.887 \\ \text { Adjusted R-Squared } & 0.7288 & \text { Standard Deviation } & 18.9443 \end{array}$

$\begin{array} { l l c c } \text { Source } & \text { DF } & \text { SS } \\ \text { Regression } & 2 & & 72081.8 \\ \text { Residual } & & 72 & 25839.8 \\ \text { Total } & & 74 & 97921.7 \\ & & & \\ \text { Cases Included } 75 & \text { Missing Cases 0 } \end{array}$
One of the t-test test statistics is shown on the printout to be the value $t = 6.03$ . Interpret this value.

A) There is sufficient evidence, at

\alpha = 0.05

, to indicate that at least one of the variables proposed in the interaction model is useful at predicting the average starting salary of graduates of MBA programs.
B) There is sufficient evidence, at

\alpha = 0.05

, to indicate that there is a linear relationship between average starting salary of graduates of MBA programs and the tuition of the MBA program.
C) There is insufficient evidence, at

\alpha = 0.05

, to indicate that at least one of the variables proposed in the interaction model is useful at predicting the average starting salary of graduates of MBA programs.
D) There is sufficient evidence, at

\alpha = 0.05

, to indicate that there is a curvilinear relationship between average starting salary of graduates of MBA programs and the tuition of the MBA program.

Accepted Answer

The answer of A study of the top MBA programs...

Question 40

A study of the top MBA programs attempted to predict the average starting salary (in $1000's) of graduates of the program based on the amount of tuition (in $1000's) charged by the program and the average GMAT score of the program's students. The results of a regression analysis based on a sample of 75 MBA programs is shown below:

Least Squares Linear Regression of Salary

$\begin{array} { l c c c l }\text {Predictor}\\ \text { Variables } & \text { Coefficient } & \text { Std Error } & \text { T } & \text { P } \\ \text { Constant } & 169.910 & 26.5350 & 6.40 & 0.0000 \\ \text { Tuition } & - 3.37373 & 0.81171 & - 4.16 & 0.0001 \\ \text { TxT } & 0.03563 & 0.00590 & 6.03 & 0.0000 \end{array}$

$\begin{array} { l c c r } \text { R-Squared } & 0.7361 & \text { Resid. Mean Square (MSE) } & 358.887 \\ \text { Adjusted R-Squared } & 0.7288 & \text { Standard Deviation } & 18.9443 \end{array}$

$\begin{array} { l l c c } \text { Source } & \text { DF } & \text { SS } \\ \text { Regression } & 2 & & 72081.8 \\ \text { Residual } & & 72 & 25839.8 \\ \text { Total } & & 74 & 97921.7 \\ & & & \\ \text { Cases Included } 75 & \text { Missing Cases 0 } \end{array}$
One of the t-test test statistics is shown on the printout to be the value $t = 6.03$ . Interpret this value.

A) There is sufficient evidence, at

\alpha = 0.05

, to indicate that at least one of the variables proposed in the interaction model is useful at predicting the average starting salary of graduates of MBA programs.
B) There is sufficient evidence, at

\alpha = 0.05

, to indicate that there is a linear relationship between average starting salary of graduates of MBA programs and the tuition of the MBA program.
C) There is insufficient evidence, at

\alpha = 0.05

, to indicate that at least one of the variables proposed in the interaction model is useful at predicting the average starting salary of graduates of MBA programs.
D) There is sufficient evidence, at

\alpha = 0.05

, to indicate that there is a curvilinear relationship between average starting salary of graduates of MBA programs and the tuition of the MBA program.

Accepted Answer

The answer of A study of the top MBA programs...

Question 41

A fast food chain test marketing a new sandwich chose 18 of its stores in one major&#10;metropolitan area. Nine of the stores were in malls and nine were free standing. The sandwich was offered at three different introductory prices. The table shows the number of new sandwiches sold at each location for each location type and price combination. &#10;&#10;Number of New Sandwiches Sold&#10; &#10;&#10;a. Write a model for the mean number of sandwiches sold, $E ( y )$, assuming that the relationship between $E ( y )$ and price, $x _ { 1 }$, is first-order.&#10;b. Fit the model to the data.&#10;c. Write the prediction equations for mall and free-standing stores.&#10;d. Do the data provide sufficient evidence that the change in number of sandwiches sold with respect to price is different for mall and free-standing stores? Use $\alpha = .01$.

Accepted Answer

The answer of A fast food chain test marketing a...

Question 42

The concessions manager at a beachside park recorded the high temperature, the number of people at the park, and the number of bottles of water sold for each of 12 consecutiveSaturdays. The data are shown below.

\begin{array}{ccc}\hline \text { Bottles Sold Temperature }\left({ }^{\circ} \mathrm{F}\right) & \text { People } \\\hline 341 & 73 & 1625 \\425 & 79 & 2100 \\457 & 80 & 2125 \\485 & 80 & 2800 \\469 & 81 & 2550 \\395 & 82 & 1975 \\511 & 83 & 2675 \\549 & 83 & 2800 \\543 & 85 & 2850 \\537 & 88 & 2775 \\621 & 89 & 2800 \\897 & 91 & 3100 \\\hline\end{array}

a. Fit the model

E ( y ) = \beta _ { 0 } + \beta _ { 1 } x _ { 1 } + \beta _ { 2 } x _ { 2 } + \beta _ { 3 } x _ { 1 } x _ { 2 }

to the data, letting

y

represent the number of bottles of water sold,

x _ { 1 }

the temperature, and

x _ { 2 }

the number of people at the park.
b. Identify at least two indicators of multicollinearity in the model.
c. Comment on the usefulness of the model to predict the number of bottles of water sold on a Saturday when the high temperature is

103 ^ { \circ } \mathrm { F }

and there are 3500 people at the park.

Accepted Answer

The answer of The concessions manager at a beachside park...

Question 43

In Hawaii, proceedings are under way to enable private citizens to own the property that their homes are built on. In prior years, only estates were permitted to own land, and homeowners leased the land from the estate. In order to comply with the new law, a large Hawaiian estate wants to use regression analysis to estimate the fair market value of the land. The following variables are proposed:

y = \text { Sale price of property (\$ thousands) }

x _ { 2 } = 1

if property near Cove, 0 if not Write a regression model relating the sale price of a property to the qualitative variable x. Interpret all the ?s in the model.

Accepted Answer

The answer of In Hawaii, proceedings are under way to...

Question 44

11eb4064_6153_c840_ad6e_e143543973be_TB2969_00

Accepted Answer

The answer of 11eb4064_6153_c840_ad6e_e143543973be_TB2969_00...

Question 45

Interpret the residual plot.

Accepted Answer

The answer of   Interpret the residual plot....

Question 46

A college admissions officer proposes to use regression to model a student's college GPA at graduation in terms of the following two variables: $$\begin{array} { l } &#10;x _ { 1 } = \text { high school GPA } \&#10;x _ { 2 } = \text { SAT score }&#10;\end{array}$$ The admissions officer believes the relationship between college GPA and high school GPA is linear and the relationship between SAT score and college GPA is linear. She also believes that the relationship between college GPA and high school GPA depends on the student's SAT score. Write the regression model she should fit.

Accepted Answer

The answer of A college admissions officer proposes to use...

Question 47

A certain type of rare gem serves as a status symbol for many of its owners. In theory, for low prices, the demand decreases as the price of the gem increases. However, experts hypothesize that when the gem is valued at very high prices, the demand increases with price due to the status the owners believe they gain by obtaining the gem. Thus, the model proposed to best explain the demand for the gem by its price is the quadratic model

E ( y ) = \beta _ { 0 } + \beta _ { 1 } x + \beta _ { 2 } x ^ { 2 }

where

y =

Demand (in thousands) and

x =

Retail price per carat (dollars). This model was fit to data collected for a sample of 12 rare gems. A portion of the printout is given below: Does the quadratic term contribute useful information for predicting the demand for the gem? Use

\alpha = .10

.

\begin{array}{lrrrrr}\text { SOURCE } & \text { DF } & \text { SS } & \text { MS } & \text { F } & \text { PR > F } \\\text { Model } & 2 & 115145 & 57573 & 373 & .0001 \\\text { Error } & 9 & 1388 & 154 & & \\\text { TOTAL } & 11 & 116533 & & &\end{array}

\begin{array}{llll}\text { Root MSE } & 12.42 & \text { R-Square } & .988\end{array}

\begin{array}{lrrrr} & \text { PARAMETER } & \text { T for HO: } \\\text { VARIABLES } & \text { ESTIMATES } & \text { STD. ERROR } & \text { PARAMETER }=0 & \text { PR > }>\mid \\\text { INTERPCEP } & 286.42 & 9.66 & 29.64 & .0001 \\\text { X } & -.31 & .06 & -5.14 & .0006 \\\text { X.X } & .000067 & .00007 & .95 & .3647\end{array}

Does the quadratic term contribute useful information for predicting the demand for the gem? Use

\alpha=10

.

Accepted Answer

The answer of A certain type of rare gem serves...

Question 48

In any production process in which one or more workers are engaged in a variety of tasks, the total time spent in production varies as a function of the size of the workpool and the level of output of the various activities. In a large metropolitan department store, it is believed that the number of man-hours worked

( y )

per day by the clerical staff depends on the number of pieces of mail processed per day

\left( x _ { 1 } \right)

and the number of checks cashed per day

\left( x _ { 2 } \right)

. Data collected for

n = 20

working days were used to fit the model:

E ( y ) = \beta _ { 0 } + \beta _ { 1 } x _ { 1 } + \beta _ { 2 } x _ { 2 }

A printout for the analysis follows:

\begin{array}{l}\text { Analysis of Variance }\\\begin{array} { l r r r r r } \text { SOURCE } & \text { DF } & \text { SS } & \text { MS } & \text { F VALUE } & \text { PROB > F } \\\\\text { MODEL } & 2 & 7089.06512 & 3544.53256 & 13.267 & 0.0003 \\\text { ERROR } & 17 & 4541.72142 & 267.16008 & & \\\text { C TOTAL } & 19 & 11630.78654 & & &\end{array}\end{array}

\begin{array}{llll}\text { ROOT MSE } & 16.34503 & \text { R-SQUARE } & 0.6095 \\\text { DEP MEAN } & 93.92682 & \text { ADJR-SQ } & 0.5636 \\\text { C.V. } & 17.40188 & &\end{array}

Parameter Estimates
PARAMETER STANDARD T FOR 0:
VARIABLE DF ESTIMATE ERROR PARAMETER

=0 \quad

PROB

>|\mathrm{T}|

\begin{array}{lrrrrr}\text { INTERCEPT } & 1 & 114.420972 & 18.68485744 & 6.124 & 0.0001 \\\text { X1 } & 1 & -0.007102 & 0.00171375 & -4.144 & 0.0007 \\\text { X2 } & 1 & 0.037290 & 0.02043937 & 1.824 & 0.0857\end{array}

\begin{array}{rrrrrrrr} & & & \text { Actual } & \text { Predict } & & \text { Lower 95\% CL } & \text { Upper 95\% CL } \\\text { OBS } & \mathrm{X} 1 & \mathrm{X} 2 & \text { Value } & \text { Value } & \text { Residual } & \text { Predict } & \text { Predict } \\1 & 7781 & 644 & 74.707 & 83.175 & -8.468 & 47.224 & 119.126 \\\hline\end{array}

Test to determine if there is a positive linear relationship between the number of man-hours worked,

y

, and the number of checks cashed per day,

x _ { 2 }

. Use

\alpha = .05

.

Accepted Answer

The answer of In any production process in which one...

Question 49

Interpret the residual plot.

Accepted Answer

The answer of   Interpret the residual plot....

Question 50

The model

E ( y ) = \beta _ { 0 } + \beta _ { 1 } x _ { 1 } + \beta _ { 2 } x _ { 2 } + \beta _ { 3 } x _ { 3 } + \beta _ { 4 } x _ { 4 }

was used to relate

E ( y )

to a single qualitative variable, where

x _ { 1 } = \left\{ \begin{array} { l l } 1 & \text { if level } 2 \\ 0 & \text { if not } \end{array} \quad x _ { 2 } = \left\{ \begin{array} { l l } 1 & \text { if level } 3 \\ 0 & \text { if not } \end{array} \right. \right.

x _ { 3 } = \left\{ \begin{array} { l l } 1 & \text { if level } 4 \\ 0 & \text { if not } \end{array} \quad x _ { 4 } = \left\{ \begin{array} { l l } 1 & \text { if level } 5 \\ 0 & \text { if not } \end{array} \right. \right.

This model was fit to

n = 40

data points and the following result was obtained:

\hat { y } = 14.5 + 3 x _ { 1 } - 4 x _ { 2 } + 10 x _ { 3 } + 8 x _ { 4 }

a. Use the least squares prediction equation to find the estimate of

E ( y )

for each level of the qualitative variable.
b. Specify the null and alternative hypothesis you would use to test whether

E ( y )

is the same for all levels of the independent variable.

Accepted Answer

The answer of The model \(E ( y ) =...

Question 51

Retail price data for $n = 60$ hard disk drives were recently reported in a computer magazine. Three variables were recorded for each hard disk drive:
$y =$ Retail PRICE (measured in dollars)
$x _ { 1 } =$ Microprocessor SPEED (measured in megahertz)
(Values in sample range from 10 to 40 )
$x _ { 2 } = \mathrm { CHIP }$ size (measured in computer processing units)
(Values in sample range from 286 to 486 )

A first-order regression model was fit to the data. Part of the printout follows:

$\quad$ $\quad$ $\quad$ $\quad$ $\quad$ $\quad$ $\quad$ $\quad$ Parameter Estimates
$\quad$ $\quad$ $\quad$ PARAMETER STANDARD $\quad$ $\quad$ T FOR 0:
VARIABLE DF ESTIMATE ERROR PARAMETER $= 0$ PROB $> | T |$

$\begin{array} { l l l l l l } \text { INTERCEPT } &1 & - 373.526392 & 1258.1243396 & - 0.297 & 0.7676 \\\text { SPEED } & 1 & 104.838940 & 22.36298195 & 4.688 & 0.0001 \\\text { CHIP } & 1 & 3.571850 & 3.89422935 & 0.917 & 0.3629\end{array}$

Identify and interpret the estimate for the SPEED $\beta$ -coefficient, $\hat { \beta } _ { 1 }$ .

A)

\hat { \beta } _ { 1 } = 3.57

; For every 1-megahertz increase in SPEED, we estimate PRICE to increase

\$ 3,57

, holding CHIP fixed.
B)

\hat { \beta } _ { 1 } = 105

; For every 1-megahertz increase in SPEED, we estimate PRICE (y) to increase

\$ 105

, holding CHIP fixed.
C)

\hat { \beta } _ { 1 } = 105

; For every

\$ 1

increase in PRICE, we estimate SPEED to increase 105 megahertz, holding CHIP fixed.
D)

\hat { \beta } _ { 1 } = 3.57

; For every

\$ 1

increase in PRICE, we estimate SPPED to increase by about 4 megahertz, holding CHIP fixed.

Accepted Answer

The answer of Retail price data for $n = 60$...

Question 52

As part of a study at a large university, data were collected on

n = 224

freshmen computer science (CS) majors in a particular year. The researchers were interested in modeling

y

, a student's grade point average (GPA) after three semesters, as a function of the following independent variables (recorded at the time the students enrolled in the university):

x _ { 1 } =

average high school grade in mathematics (HSM)

x _ { 2 } =

average high school grade in science (HSS)

x _ { 3 } =

average high school grade in English (HSE)

x _ { 4 } =

SAT mathematics score (SATM)

x _ { 5 } =

SAT verbal score (SATV)

A first-order model was fit to data with

R _ { a } ^ { 2 } = .193

.

Interpret the value of the adjusted coefficient of determination

R _ { a } ^ { 2 }

.

Accepted Answer

The answer of As part of a study at a...

Question 53

Consider the data given in the table below.

\begin{array} { c c } \hline \mathrm { X } & \mathrm { Y } \\\hline 1 & 4 \\2 & 6 \\2 & 5 \\3 & 7 \\4 & 7 \\4 & 6 \\5 & 4 \\5 & 5 \\6 & 3 \\\hline\end{array}

a. Plot the data on a scattergram. Does a quadratic model seem to be a good fit for the
data? Explain.
b. Use the method of least squares to find a quadratic prediction equation.
c. Graph the prediction equation on your scattergram.

Accepted Answer

The answer of Consider the data given in the table...

Question 54

A certain type of rare gem serves as a status symbol for many of its owners. In theory, for low prices, the demand decreases as the price of the gem increases. However, experts hypothesize that when the gem is valued at very high prices, the demand increases with price due to the status the owners believe they gain by obtaining the gem. Thus, the model proposed to best explain the demand for the gem by its price is the quadratic model

E ( y ) = \beta _ { 0 } + \beta _ { 1 } x + \beta _ { 2 } x ^ { 2 }

where

y =

Demand (in thousands) and

x =

Retail price per carat (dollars). This model was fit to data collected for a sample of 12 rare gems. A portion of the printout is given below: Does the quadratic term contribute useful information for predicting the demand for the gem? Use

\alpha = .10

.

\begin{array}{lrrrrr}\text { SOURCE } & \text { DF } & \text { SS } & \text { MS } & \text { F } & \text { PR > F } \\\text { Model } & 2 & 115145 & 57573 & 373 & .0001 \\\text { Error } & 9 & 1388 & 154 & & \\\text { TOTAL } & 11 & 116533 & & &\end{array}

\begin{array}{llll}\text { Root MSE } & 12.42 & \text { R-Square } & .988\end{array}

\begin{array}{lrrrr} & \text { PARAMETER } & \text { T for HO: } \\\text { VARIABLES } & \text { ESTIMATES } & \text { STD. ERROR } & \text { PARAMETER }=0 & \text { PR > }>\mid \\\text { INTERPCEP } & 286.42 & 9.66 & 29.64 & .0001 \\\text { X } & -.31 & .06 & -5.14 & .0006 \\\text { X.X } & .000067 & .00007 & .95 & .3647\end{array}

Does the quadratic term contribute useful information for predicting the demand for the gem? Use

\alpha=10

.

Accepted Answer

The answer of A certain type of rare gem serves...

Question 55

Is there evidence of multicollinearity in the printout? Explain.

Accepted Answer

The answer of   Is there evidence of multicollinearity...

Question 56

11eb4064_6154_1667_ad6e_8d4a32bcd26b_TB2969_00

Accepted Answer

The answer of 11eb4064_6154_1667_ad6e_8d4a32bcd26b_TB2969_00...

Question 57

11eb4064_6153_2be9_ad6e_4de0aadd64da_TB2969_00

Accepted Answer

The answer of 11eb4064_6153_2be9_ad6e_4de0aadd64da_TB2969_00...

Question 58

Why is the random error term ? added to a multiple regression model?

Accepted Answer

The answer of Why is the random error term ?...

Question 59

A college admissions officer proposes to use regression to model a student's college GPA at graduation in terms of the following two variables: $$\begin{array} { l } &#10;x _ { 1 } = \text { high school GPA } \&#10;x _ { 2 } = \text { SAT score }&#10;\end{array}$$ The admissions officer believes the relationship between college GPA and high school GPA is linear and the relationship between SAT score and college GPA is linear. She also believes that the relationship between college GPA and high school GPA depends on the student's SAT score. She proposes the regression model: $$E ( y ) = \beta _ { 0 } + \beta _ { 1 } x _ { 1 } + \beta _ { 2 } x _ { 2 } + \beta _ { 3 } x _ { 1 } x _ { 2 }$$ Explain how to determine if the relationship between college GPA and SAT score depends on the high school GPA.

Accepted Answer

The answer of A college admissions officer proposes to use...

Question 60

11eb4064_6153_5300_ad6e_cd85166f7bd3_TB2969_00

Accepted Answer

The answer of 11eb4064_6153_5300_ad6e_cd85166f7bd3_TB2969_00...

Question 61

Consider the data given in the table below.

\begin{array} { c c } \hline \mathrm { X } & \mathrm { Y } \\\hline 1 & 4 \\2 & 6 \\2 & 5 \\3 & 7 \\4 & 7 \\4 & 6 \\5 & 4 \\5 & 5 \\6 & 3 \\\hline\end{array}

a. Plot the data on a scattergram. Does a quadratic model seem to be a good fit for the
data? Explain.
b. Use the method of least squares to find a quadratic prediction equation.
c. Graph the prediction equation on your scattergram.

Accepted Answer

The answer of Consider the data given in the table...

Question 62

The table shows the profit y (in thousands of dollars) that a company made during a month when the price of its product was x dollars per unit.

\begin{array}{cc}\hline \text { Profit, } y & \text { Price, } x \\\hline 12 & 1.20 \\17 & 1.25 \\20 & 1.29 \\21 & 1.30 \\24 & 1.35 \\26 & 1.39 \\27 & 1.40 \\23 & 1.45 \\21 & 1.49 \\20 & 1.50 \\15 & 1.55 \\11 & 1.59 \\10 & 1.60 \\5 & 1.65 \\\hline\end{array}

a. Fit the model

y = \beta _ { 0 } + \beta _ { 1 } x + \beta _ { 2 } x 2 + \varepsilon

to the data and give the least squares prediction equation.
b. Plot the fitted equation on a scattergram of the data.
c. Is there sufficient evidence of downward curvature in the relationship between profit and price? Use

\alpha = .05

.

Accepted Answer

The answer of The table shows the profit y (in...

Question 63

The model $E ( y ) = \beta _ { 0 } + \beta _ { 1 } x _ { 1 } + \beta _ { 2 } x _ { 2 } + \beta _ { 3 } x _ { 3 }$ was used to relate $E ( y )$ to a single qualitative variable. How many levels does the qualitative variable have?

Accepted Answer

The answer of The model \(E ( y ) =...

Question 64

The table below shows data for

n = 20

observations.

\begin{array}{ccc}\hline \mathrm{y} & \mathrm{x} 1 & \mathrm{x} 2 \\\hline 18 & 3 & 8 \\23 & 5 & 10 \\15 & 2 & 7 \\31 & 6 & 12 \\24 & 4 & 9 \\28 & 5 & 11 \\17 & 2 & 7 \\19 & 3 & 8 \\30 & 7 & 10 \\28 & 5 & 8 \\14 & 3 & 6 \\32 & 7 & 11 \\17 & 2 & 8 \\24 & 5 & 10 \\26 & 6 & 11 \\27 & 6 & 11 \\21 & 3 & 6 \\31 & 7 & 13 \\19 & 2 & 8 \\25 & 5 & 10 \\\hline\end{array}

a. Use a first-order regression model to find a least squares prediction equation for the model.
b. Find a

95 \%

confidence interval for the coefficient of

x _ { 1 }

in your model. Interpret the result.
c. Find a

95 \%

confidence interval for the coefficient of

x _ { 2 }

in your model. Interpret the result.
d. Find

R ^ { 2 }

and

R _ { a } 2

and interpret these values.
e. Test the null hypothesis

H _ { 0 } : \beta _ { 1 } = \beta _ { 2 } = 0

against the alternative hypothesis

H _ { \mathrm { a } } :

at least one

\beta _ { i } \neq 0

. Use

\alpha = .05

. Interpret the result.

Accepted Answer

The answer of The table below shows data for \(n...

Deck 12: Multiple Regression and Model Building