Question 1

You can control for sample-selection bias by performing
A) two-stage least squares.
B) weighted least squares.
C) a Heckman selection correction.
D) difference-in-difference estimation.

Accepted Answer

Sample-selection bias arises when the sample used for analysis is not random but is determined by some selection criterion, which can bias the estimates. The Heckman selection correction controls for this by modeling the selection process directly: it estimates the probability of selection and then adds a correction term to the original equation. Two-stage least squares addresses endogeneity of an independent variable, difference-in-difference estimation is a quasi-experimental method for evaluating a policy shock, and weighted least squares addresses heteroskedasticity. Hence, C is the best choice to control for sample-selection bias.

Question 2

Sample-selection bias occurs when
A) the researcher selects a bad sample.
B) the sample contains an independent variable that is correlated with the error term.
C) individuals randomly select the sample to which they belong.
D) individuals non-randomly select themselves into a given outcome of the dependent variable.

Accepted Answer

Sample-selection bias occurs when individuals non-randomly select themselves into a given outcome of the dependent variable, so D is correct. Because the observed sample is then not representative of the population, estimates of the relationship between variables can be skewed and the conclusions drawn from them incorrect.

Question 3

Sample-selection bias presents a problem because it
A) results in OLS coefficient estimates that are biased and inconsistent.
B) does not account for the correlation between the independent variable and the error term.
C) does not account for the time-invariant component of the error term.
D) does not account for the autoregressive structure of the error term.

Accepted Answer

Sample-selection bias arises when a subset of observations is systematically excluded from the sample, which makes the OLS coefficient estimates biased and inconsistent, so A is correct. The problem is that the unobserved factors driving selection into the sample are correlated with the error term of the outcome equation, so the error no longer has a zero conditional mean in the selected sample.

Question 4

Non-negative count data occur when the dependent variable takes on
A) only positive values.
B) the values of 0 or 1.
C) integers that are greater than or equal to 0 and arise from counting.
D) integers that are strictly greater than 0 and arise from counting rather than ranking.

Accepted Answer

Non-negative count data are integers greater than or equal to 0 that arise from counting. The data therefore cannot be negative or fractional. Option C is the best choice because it describes exactly this.

Question 5

When performing difference-in-difference estimation, the control group is the group
A) for which the policy shock occurred.
B) for which the policy shock did not occur.
C) of observation in the "before" sample.
D) of observation in the "after" sample.

Accepted Answer

The control group is the group for which the policy shock did not occur, so B is correct. It serves as a comparison group: its change over time stands in for what would have happened to the treatment group without the policy shock.
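
The comparison between treatment and control groups can be sketched as simple arithmetic on four group means. This is a minimal illustration, not the textbook's own code: the function name `did_estimate` and all of the numbers are hypothetical, chosen only to show how the control group's change is subtracted from the treatment group's change.

```python
def did_estimate(treat_before, treat_after, control_before, control_after):
    """Difference-in-difference estimate computed from four group means."""
    # Change over time in the group that experienced the policy shock.
    treat_change = treat_after - treat_before
    # Change over time in the group that did not experience the shock.
    control_change = control_after - control_before
    # The control group's change is netted out of the treatment group's change.
    return treat_change - control_change


# Hypothetical group means, made up for illustration only.
effect = did_estimate(
    treat_before=100.0, treat_after=90.0,
    control_before=80.0, control_after=85.0,
)
print(effect)
```

In a regression setting the same quantity is usually obtained as the coefficient on the interaction between a treatment-group dummy and an "after" dummy.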

Question 6

Quantile regression
A) estimates marginal effects at the mean values of the independent variables.
B) results in biased estimates for skewed distributions.
C) can be estimated in Excel.
D) results in estimates approximating either the median or other percentiles of the dependent variable.

Accepted Answer

Quantile regression estimates the conditional median or other percentiles of the dependent variable, so D is correct. Unlike OLS, it is not limited to estimating marginal effects at the mean, which rules out A. It is also robust to skewed distributions and outliers rather than biased by them, which rules out B. Excel has no built-in quantile regression routine, so C is not the intended answer, although third-party add-ins or custom VBA code can be used.

Question 7

Quantile regression is different from OLS in that it
A) does not estimate marginal effects at the mean values of the dependent and independent variables.
B) only uses the data below the quantile where the quantile regression is being estimated.
C) estimates marginal effects at the mean values of the dependent and independent variables.
D) minimizes the sum of squared residuals to obtain the coefficient estimates.

Accepted Answer

Quantile regression estimates the effects of the independent variables at different percentiles of the dependent variable, not only at the mean. Unlike OLS, it therefore does not estimate marginal effects at the mean values of the dependent and independent variables, so A is correct.
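
To make the contrast with OLS concrete, here is a small sketch of the "check" (pinball) loss that quantile regression minimizes in place of the sum of squared residuals. It covers only the intercept-only case, where the minimizer is a sample quantile. The function names and the data are hypothetical, and this is not a full quantile regression routine.

```python
def check_loss(residual, tau):
    """Check (pinball) loss for one residual at quantile tau (0 < tau < 1)."""
    # Positive residuals are weighted by tau, negative ones by (1 - tau).
    return residual * (tau - (1.0 if residual < 0 else 0.0))


def best_constant(values, tau):
    """Constant that minimizes total check loss, searched over the data points."""
    return min(values, key=lambda c: sum(check_loss(y - c, tau) for y in values))


# Hypothetical, strongly skewed sample: one large outlier.
data = [1.0, 2.0, 3.0, 4.0, 100.0]

median_fit = best_constant(data, 0.5)   # minimizer of the check loss at tau = 0.5
mean_fit = sum(data) / len(data)        # minimizer of the sum of squared residuals

print(median_fit, mean_fit)
```

With this sample the median-type fit stays near the bulk of the data while the mean is pulled toward the outlier, which is one reason quantile regression is attractive for skewed distributions.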

Question 8

The first stage in the Heckman selection correction is estimating
A) the individual's self-selection decision and using those estimates to calculate predicted values of the self-selection decision.
B) the regression model and calculating the residuals.
C) the individual's self-selection decision and using those estimates to calculate inverse Mills ratios.
D) the regression model and calculating the residuals and using those estimates to calculate predicted values of the dependent variable.

Accepted Answer

The first stage of the Heckman selection correction is estimating the individual's self-selection decision and using those estimates to calculate inverse Mills ratios, so C is correct. In practice this means estimating a probit model of the probability of being selected into the sample and then calculating the inverse Mills ratio for each observation.
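
The inverse Mills ratio step can be sketched as follows. The sketch assumes the probit model has already been estimated and that each observation's fitted probit index is available. The index values below are hypothetical, and the function name `inverse_mills_ratio` is an illustrative choice rather than part of any particular package.

```python
from statistics import NormalDist

_STD_NORMAL = NormalDist()  # standard normal: mean 0, standard deviation 1


def inverse_mills_ratio(index):
    """Inverse Mills ratio for a selected observation: pdf(index) / cdf(index)."""
    return _STD_NORMAL.pdf(index) / _STD_NORMAL.cdf(index)


# Hypothetical fitted probit index values (x'gamma) for three selected observations.
fitted_index = [-1.0, 0.0, 1.0]
ratios = [inverse_mills_ratio(z) for z in fitted_index]
print(ratios)
```

The ratio is larger for observations with a low predicted probability of selection. These values are then included as an additional regressor in the second-stage equation.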

Question 9

Suppose you wish to determine factors affecting the number of surfers observed surfing at a given surf spot. An appropriate model to estimate would be
A) OLS.
B) the logit.
C) the ordered probit.
D) the Poisson model.

Accepted Answer

Since the dependent variable is a count of surfers, the appropriate model is the Poisson model, so D is correct. The Poisson model is designed specifically for count data: it models the number of occurrences of an event in a fixed time interval or a given area.
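
As a rough sketch of what the Poisson model implies, the code below uses the standard log-link form, in which the expected count is the exponential of a linear index, together with the Poisson probability mass function. The coefficient values and the single regressor are hypothetical and are not estimates from any data set.

```python
import math


def expected_count(intercept, slope, x):
    """Conditional mean of a Poisson model with a log link: exp(b0 + b1 * x)."""
    return math.exp(intercept + slope * x)


def poisson_pmf(k, mean):
    """Probability of observing exactly k events when the expected count is mean."""
    return math.exp(-mean) * mean ** k / math.factorial(k)


# Hypothetical coefficients and regressor value (e.g. wave height in feet).
mu = expected_count(intercept=1.0, slope=0.5, x=2.0)

# Probability of seeing 0, 1, ..., 4 surfers under this hypothetical fit.
probabilities = [poisson_pmf(k, mu) for k in range(5)]
print(mu, probabilities)
```

Because the mean is an exponential, it can never be negative, which is one way the Poisson model respects the non-negative nature of count data where a linear OLS fit does not.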

Question 10

In which of the following cases would you want to estimate a Negative Binomial model?
A) When individuals non-randomly select different outcomes of the dependent variable.
B) When you are attempting to replicate a randomized clinical trial.
C) When you are dealing with non-negative count data.
D) When you suspect that the marginal effects are different for different values of the dependent variable.

Accepted Answer

Negative Binomial models are used for non-negative count data, so C is correct. They are preferred to the Poisson model when the variance is greater than the mean (overdispersion), because the Poisson model assumes that the mean equals the variance.
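
An informal first look at overdispersion is to compare the sample variance of the counts with their sample mean. The sketch below is only a descriptive check on raw counts, not a formal test of overdispersion, and both the function name `dispersion_ratio` and the data are hypothetical.

```python
from statistics import mean, variance


def dispersion_ratio(counts):
    """Sample variance divided by sample mean; about 1 under the Poisson assumption."""
    return variance(counts) / mean(counts)


# Hypothetical daily counts, made up for illustration only.
counts = [0, 1, 2, 3, 4]
ratio = dispersion_ratio(counts)
print(ratio)
```

A ratio well above 1 suggests the variance exceeds the mean, which is the situation in which the Negative Binomial model is usually considered in place of the Poisson model.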

Question 11

Quasi-experimental methods attempt to
A) control for sample-selection bias.
B) estimate marginal effects at different points in the distribution of the dependent variable.
C) account for endogeneity of an independent variable.
D) replicate randomized clinical trials.

Accepted Answer

The answer of Quasi-experimental methods attempt to A) control for sample-selection bias. B) estimate...

Question 12

You can choose between the Poisson and the Negative Binomial models by performing a
A) test of over-dispersion.
B) Poisson choice test.
C) Negative Binomial test.
D) test of overall significance of the Poisson model.

Accepted Answer

The answer of You can choose between the Poisson and...

Question 13

In which of the following cases would you want to estimate a Poisson model?
A) When individuals non-randomly select different outcomes of the dependent variable.
B) When you are attempting to replicate a randomized clinical trial.
C) When you are dealing with non-negative count data.
D) When you suspect that the marginal effects are different for different values of the dependent variable.

Accepted Answer

The answer of In which of the following cases would...

Question 14

Non-negative count data presents a challenge because OLS estimates
A) are biased.
B) cannot be calculated.
C) are heteroskedastic.
D) are the BLUE.

Accepted Answer

The answer of Non-negative count data presents a challenge because...

Question 15

The second stage in the Heckman selection correction is including the _____ in the second-stage regression to control for the potential sample-selection bias.
A) estimated residuals
B) calculated inverse Mills ratios
C) predicted value of the dependent variable
D) predicted value of the self-selection variable

Accepted Answer

The answer of The second-stage in the Heckman selection correction...

Question 16

Difference-in-difference estimators attempt to
A) replicate randomized clinical trials by comparing treatment and control groups before and after a treatment is imposed to estimate the impact of a given policy intervention.
B) take differences of both the dependent and independent variables.
C) only difference the dependent variables and regress the differences on the independent variables.
D) obtain estimates at points in the distribution of the dependent variable aside from the mean.

Accepted Answer

The answer of Difference-in-difference estimators attempt to A) attempt to replicate randomized...

Question 17

Suppose you wish to explain the number of nights per week that individuals eat dinner at a restaurant. An appropriate model to estimate would be
A) Weighted Least Squares.
B) the negative binomial model.
C) OLS.
D) the probit.

Accepted Answer

The answer of Suppose you wish to explain the number...

Question 18

In which of the following cases would you want to use a Heckman selection correction model?
A) When individuals non-randomly select different outcomes of the dependent variable.
B) When you are attempting to replicate a randomized clinical trial.
C) When you are dealing with non-negative count data.
D) When you suspect that the marginal effects are different for different values of the dependent variable.

Accepted Answer

The answer of In which of the following cases would...

Question 19

When performing difference-in-difference estimation, the treatment group is the group
A) for which the policy shock occurred.
B) for which the policy shock did not occur.
C) of observation in the "before" sample.
D) of observation in the "after" sample.

Accepted Answer

The answer of When performing difference-in-difference estimation,the treatment group is...

Question 20

In which of the following cases would you want to use quantile regression?
A) When individuals non-randomly select different outcomes of the dependent variable.
B) When you are attempting to replicate a randomized clinical trial.
C) When you are dealing with non-negative count data.
D) When you suspect that the marginal effects are different for different values of the dependent variable.

Accepted Answer

The answer of In which of the following cases would...

Question 21

Suppose you are interested in testing the claim that students who participate in band perform better on standardized tests, but you are worried that the results might be biased because individuals are likely to self-select into joining the band. You only have test scores for those students who participate in band. Suppose that for a sample of 14,111 sixth-grade students you estimate the Heckman selection model (marginal effects listed, standard errors in parentheses)

a) Explain why OLS is inappropriate in this circumstance and how this model improves on OLS.
b) Which variable are you using to identify the model? Does this choice seem correct? Explain.
c) Discuss the results above.

Accepted Answer

The answer of Suppose you are interested in testing the...

Question 22

What is quantile regression? When might it be preferred to OLS? Explain.

Accepted Answer

The answer of What is quantile regression? When might it...

Question 23

What is non-negative count data? Why does it present a concern for OLS? How might you control for non-negative count data in the estimation process? Explain.

Accepted Answer

The answer of What is non-negative count data? Why does...

Question 24

In which of the following cases would you want to use difference-in-difference estimation?
A) When individuals non-randomly select different outcomes of the dependent variable.
B) When you are attempting to replicate a randomized clinical trial.
C) When you are dealing with non-negative count data.
D) When you suspect that the marginal effects are different for different values of the dependent variable.

Accepted Answer

The answer of In which of the following cases would...

Question 25

Suppose you are interested in explaining the number of surfers surfing at your favorite spot on a given day. After collecting data on the number of surfers, the height of the waves (in feet), the water temperature, and whether the day was a weekend on a sample of 92 days, you estimate the following marginal effects for the Poisson model

a) Why is OLS not appropriate in this circumstance? How does a Poisson model improve on OLS?
b) Discuss the results.
c) What assumption is necessary for Poisson to be the appropriate model? How would you test this assumption? If this assumption fails, what alternative model could you estimate?

Accepted Answer

The answer of Suppose you are interested in explaining the...

Question 26

Suppose you are interested in explaining the effect that family income (thousands) has on child birth weight and you are concerned that the true marginal effects differ at different points in the birth weight distribution. In a sample of 22,365 live births, you estimate the following

a) What is quantile regression? Which variable is the quantile of?
b) In what circumstances would quantile regression be preferable to OLS?
c) Discuss the results presented above.

Accepted Answer

The answer of Suppose you are interested in explaining the...

Question 27

What is sample-selection bias? Why does it present a problem for OLS? How can you control for its presence? Explain.

Accepted Answer

The answer of What is sample-selection bias? Why does it...

Question 28

Suppose you observe that several different communities in a large metropolitan area raised their sales tax by one percentage point in 2012 while several others did not. In an effort to determine how the increase affected car sales in the affected communities, you estimate a difference-in-difference estimator for 2011 and 2013 car sales and get

a) Why would you want to estimate a difference-in-difference model?
b) Draw a graph with the 4 means on it and explain where the difference-in-difference estimator is on the graph.

Accepted Answer

The answer of Suppose you observe that several different communities...

Question 29

What is a difference-in-difference estimator? When is it appropriate to use one? How do you do so? Explain.

Accepted Answer

The answer of What is a difference-in-difference estimator? When is...

Deck 15: Quantile Regression, Count Data, Sample Selection Bias, and Quasi-Experimental Methods