Services
Discover
Homeschooling
Ask a Question
Log in
Sign up
Filters
Done
Question type:
Essay
Multiple Choice
Short Answer
True False
Matching
Topic
Business
Study Set
Introduction to Econometrics Update
Quiz 11: Regression With a Binary Dependent Variable
Path 4
Access For Free
Share
All types
Filters
Study Flashcards
Practice Exam
Learn
Question 21
Multiple Choice
Your textbook plots the estimated regression function produced by the probit regression of deny on P/I ratio. The estimated probit regression function has a stretched "S" shape given that the coefficient on the P/I ratio is positive. Consider a probit regression function with a negative coefficient. The shape would
Question 22
Essay
A study analyzed the probability of Major League Baseball (MLB)players to "survive" for another season, or, in other words, to play one more season. The researchers had a sample of 4,728 hitters and 3,803 pitchers for the years 1901-1999. All explanatory variables are standardized. The probit estimation yielded the results as shown in the table:
Regression
(1) Hitters
(2) Pitchers
Regression model
probit
probit
constant
2.010
1.625
(
0.030
)
(
0.031
)
number of seasons
−
0.058
−
0.031
played
(
0.004
)
(
0.005
)
performance
0.794
0.677
(
0.025
)
(
0.026
)
average performance
0.022
0.100
(
0.033
)
(
0.036
)
\begin{array} { | c | c | c | } \hline \text { Regression } & \text { (1) Hitters } & \text { (2) Pitchers } \\\hline \text { Regression model } & \text { probit } & \text { probit } \\\hline \text { constant } & 2.010 & 1.625 \\& ( 0.030 ) & ( 0.031 ) \\\hline \text { number of seasons } & - 0.058 & - 0.031 \\\text { played } & ( 0.004 ) & ( 0.005 ) \\\hline \text { performance } & 0.794 & 0.677 \\& ( 0.025 ) & ( 0.026 ) \\\hline \text { average performance } & 0.022 & 0.100 \\& ( 0.033 ) & ( 0.036 ) \\\hline\end{array}
Regression
Regression model
constant
number of seasons
played
performance
average performance
(1) Hitters
probit
2.010
(
0.030
)
−
0.058
(
0.004
)
0.794
(
0.025
)
0.022
(
0.033
)
(2) Pitchers
probit
1.625
(
0.031
)
−
0.031
(
0.005
)
0.677
(
0.026
)
0.100
(
0.036
)
where the limited dependent variable takes on a value of one if the player had one more season (a minimum of 50 at bats or 25 innings pitched), number of seasons played is measured in years, performance is the batting average for hitters and the earned run average for pitchers, and average performance refers to performance over the career. (a)Interpret the two probit equations and calculate survival probabilities for hitters and pitchers at the sample mean. Why are these so high? (b)Calculate the change in the survival probability for a player who has a very bad year by performing two standard deviations below the average (assume also that this player has been in the majors for many years so that his average performance is hardly affected). How does this change the survival probability when compared to the answer in (a)? (c)Since the results seem similar, the researcher could consider combining the two samples. Explain in some detail how this could be done and how you could test the hypothesis that the coefficients are the same.
Question 23
Multiple Choice
(Requires Advanced material) Nonlinear least squares estimators in general are not
Question 24
Essay
Equation (11.3)in your textbook presents the regression results for the linear probability model. a. Using a spreadsheet program such as Excel, plot the fitted values for whites and blacks in the same graph, for P/I ratios ranging from 0 to 1 (use 0.05 increments). b. Explain some of the strengths and shortcomings of the linear probability model using this graph.
Question 25
Essay
Your task is to model students' choice for taking an additional economics course after the first principles course. Describe how to formulate a model based on data for a large sample of students. Outline several estimation methods and their relative advantage over other methods in tackling this problem. How would you go about interpreting the resulting output? What summary statistics should be included?
Question 26
Multiple Choice
The following problems could be analyzed using probit and logit estimation with the exception of whether or not
Question 27
Essay
Equation (11.3)in your textbook presents the regression results for the linear probability model, and equation (11.10)the results for the logit model. a. Using a spreadsheet program such as Excel, plot the predicted probabilities for being denied a loan for both the linear probability model and the logit model if you are black. (Use a range from 0 to 1 for the P/I Ratio and allow for it to increase by increments of 0.05.) b. Given the shortcomings of the linear probability model, do you think that it is a reasonable approximation to the logit model? c. Repeat the exercise using predicted probabilities for whites.
Question 28
Multiple Choice
When testing joint hypothesis, you can use
Question 29
Essay
A study tried to find the determinants of the increase in the number of households headed by a female. Using 1940 and 1960 historical census data, a logit model was estimated to predict whether a woman is the head of a household (living on her own)or whether she is living within another's household. The limited dependent variable takes on a value of one if the female lives on her own and is zero if she shares housing. The results for 1960 using 6,051 observations on prime-age whites and 1,294 on nonwhites were as shown in the table:
Regression
(1) White
(2) Nonwhite
Regression model
Logit
Logit
Constant
1.459
−
2.874
(
0.685
)
(
1.423
)
Age
−
0.275
0.084
(
0.037
)
(
0.068
)
age squared
0.00463
0.00021
(
0.00044
)
(
0.00081
)
education
−
0.171
−
0.127
(
0.026
)
(
0.038
)
farm status
−
0.687
−
0.498
(
0.173
)
(
0.346
)
South
0.376
−
0.520
(
0.098
)
(
0.180
)
expected family
0.0018
0.0011
eamings
(
0.00019
)
(
0.00024
)
fanily composition
4.123
2.751
(
0.294
)
(
0.345
)
Pseudo-R
2
0.266
0.189
Percent Correctly
82.0
83.4
Predicted
\begin{array} { | c | c | c | } \hline \text { Regression } & \text { (1) White } & \text { (2) Nonwhite } \\\hline \text { Regression model } & \text { Logit } & \text { Logit } \\\hline \text { Constant } & 1.459 & - 2.874 \\& ( 0.685 ) & ( 1.423 ) \\\hline \text { Age } & - 0.275 & 0.084 \\& ( 0.037 ) & ( 0.068 ) \\\hline \text { age squared } & 0.00463 & 0.00021 \\& ( 0.00044 ) & ( 0.00081 ) \\\hline \text { education } & - 0.171 & - 0.127 \\& ( 0.026 ) & ( 0.038 ) \\\hline \text { farm status } & - 0.687 & - 0.498 \\& ( 0.173 ) & ( 0.346 ) \\\hline \text { South } & 0.376 & - 0.520 \\& ( 0.098 ) & ( 0.180 ) \\\hline \text { expected family } & 0.0018 & 0.0011 \\\text { eamings } & ( 0.00019 ) & ( 0.00024 ) \\\hline \text { fanily composition } & 4.123 & 2.751 \\& ( 0.294 ) & ( 0.345 ) \\\hline \text { Pseudo-R } 2 & 0.266 & 0.189 \\& & \\\hline \text { Percent Correctly } & 82.0 & 83.4 \\\text { Predicted } & & \\\hline\end{array}
Regression
Regression model
Constant
Age
age squared
education
farm status
South
expected family
eamings
fanily composition
Pseudo-R
2
Percent Correctly
Predicted
(1) White
Logit
1.459
(
0.685
)
−
0.275
(
0.037
)
0.00463
(
0.00044
)
−
0.171
(
0.026
)
−
0.687
(
0.173
)
0.376
(
0.098
)
0.0018
(
0.00019
)
4.123
(
0.294
)
0.266
82.0
(2) Nonwhite
Logit
−
2.874
(
1.423
)
0.084
(
0.068
)
0.00021
(
0.00081
)
−
0.127
(
0.038
)
−
0.498
(
0.346
)
−
0.520
(
0.180
)
0.0011
(
0.00024
)
2.751
(
0.345
)
0.189
83.4
where age is measured in years, education is years of schooling of the family head, farm status is a binary variable taking the value of one if the family head lived on a farm, south is a binary variable for living in a certain region of the country, expected family earnings was generated from a separate OLS regression to predict earnings from a set of regressors, and family composition refers to the number of family members under the age of 18 divided by the total number in the family. The mean values for the variables were as shown in the table.
Variable
(1) White mean
(2) Nonwhite mean
age
46.1
42.9
age squared
2
,
263.5
1
,
965.6
education
12.6
10.4
farm status
0.03
0.02
south
0.3
0.5
expected family earnings
2
,
336.4
1
,
507.3
family composition
0.2
0.3
\begin{array} { | c | c | c | } \hline \text { Variable } & \text { (1) White mean } & \text { (2) Nonwhite mean } \\\hline \text { age } & 46.1 & 42.9 \\\hline \text { age squared } & 2,263.5 & 1,965.6 \\\hline \text { education } & 12.6 & 10.4 \\\hline \text { farm status } & 0.03 & 0.02 \\\hline \text { south } & 0.3 & 0.5 \\\hline \text { expected family earnings } & 2,336.4 & 1,507.3 \\\hline \text { family composition } & 0.2 & 0.3 \\\hline\end{array}
Variable
age
age squared
education
farm status
south
expected family earnings
family composition
(1) White mean
46.1
2
,
263.5
12.6
0.03
0.3
2
,
336.4
0.2
(2) Nonwhite mean
42.9
1
,
965.6
10.4
0.02
0.5
1
,
507.3
0.3
(a)Interpret the results. Do the coefficients have the expected signs? Why do you think age was entered both in levels and in squares? (b)Calculate the difference in the predicted probability between whites and nonwhites at the sample mean values of the explanatory variables. Why do you think the study did not combine the observations and allowed for a nonwhite binary variable to enter? (c)What would be the effect on the probability of a nonwhite woman living on her own, if education and family composition were changed from their current mean to the mean of whites, while all other variables were left unchanged at the nonwhite mean values?
Question 30
Multiple Choice
Probit coefficients are typically estimated using
Question 31
Multiple Choice
In the probit regression, the coefficient β
1
indicates
Question 32
Essay
The logit regression (11.10)on page 393 of your textbook reads:
Pr
(
deny
=
1
∣
P/Iratio,black
)
^
\widehat{\operatorname { Pr } ( \text { deny } = 1 \mid \text { P/Iratio,black } )}
Pr
(
deny
=
1
∣
P/Iratio,black
)
= F(-4.13 + 5.37 P/Iratio + 1.27 black) (a)Using a spreadsheet program such as Excel, plot the following logistic regression function with a single X,
Y
^
\hat { Y }
Y
^
i
=
1
1
+
e
−
(
β
^
0
+
β
^
1
X
1
i
+
β
^
2
X
2
i
)
\frac{1}{1+\mathrm{e}^{-\left(\hat{\beta}_{0}+\hat{\beta}_{1} X_{1 \mathrm{i}}+\hat{\beta}_{2} X_{2 i}\right)}}
1
+
e
−
(
β
^
0
+
β
^
1
X
1
i
+
β
^
2
X
2
i
)
1
where
β
^
\hat { \beta }
β
^
0
= -4.13,
β
^
\hat { \beta }
β
^
1
= 5.37,
β
^
\hat { \beta }
β
^
2
= 1.27. Enter values for X
1
in the first column starting from 0 and then increment these by 0.1 until you reach 2.0. Let X
2
be 0 at first. Then enter the logistic function formula in the next column. Next allow X
2
to be 1 and calculate the new values for the logistic function in the third column. Finally produce the predicted probabilities for both blacks and whites, connecting the predicted values with a line. (b)Using the same spreadsheet calculations, list how the probability increases for blacks and for whites as the P/I ratio increases from 0.5 to 0.6. (c)What is the difference in the rejection probability between blacks and whites for a P/I ratio of 0.5 and for 0.9? Why is the difference smaller for the higher value here? (d)Table 11.2 on page 401 of your textbook lists logit regressions (column 2)with further explanatory variables. Given that you can only produce simple plots in two dimensions, how would you proceed in (a)above if there were more than a single explanatory variable?
Question 33
Essay
Consider the following logit regression: Pr(Y = 1 | X)= F (15.3 - 0.24 × X) Calculate the change in probability for X increasing by 10 for X = 40 and X = 60. Why is there such a large difference in the change in probabilities?