Professional Documents
Culture Documents
ECMT-7302
Econometrics II
MA Eco. 2022, Fall 2023
Instructor: Sunaina Dhingra
Lectures: Wednesday, (11.20-12.50pM) & Thursday (9-40-11.10 am)
Lecture Meeting Mode: In person (Classroom: T4-F99)
Office Hours: Wednesday 1-2.30 pm & by appointment in FOB, Office No.1B in south on 7th Floor)
Email-id: sunaina@jgu.edu.in
Lecture Material: Slides and textbooks
Credits: 4.5
Types of dependent variables and their estimation method
1-2
Limited Dependent Variable Models
• Limited dependent variables (LDV)
• LDV are is substantively restricted
• Binary vavariables whose range riables, e.g. employed/not employed
• Nonnegative variables, e.g. wages, prices, interest rates
• Nonnegative variables with excess zeros, e.g. labor supply
• Count variables, e.g. the number of arrests in a year
• Censored variables, e.g. unemployment durations
3
Regression with a Binary Dependent Variable
What if Y is binary?
• What is the effect of a tuition subsidy on an individual’s decision to go to college?
• Y = get into college, or not; X = Tuition Subsidy, high school grades, SAT scores,
demographic variables
• What determines whether a teenager takes up smoking?
• Y = person smokes, or not; X = cigarette tax rate, income, demographic variables
• What determines whether a country receives foreign aid?
• What determines whether a mortgage applicant is successful?
• Y = mortgage application is accepted, or not; X = race, income, house characteristics, marital status
• In all these examples, the outcome of interest is binary: The student does or does not
go to college, the teenager does or does not take up smoking, a country does or does
not receive foreign aid, the applicant does or does not get an approval on mortgage
application.
• The binary dependent variable considered in this chapter is an example of a dependent
variable with a limited range; in other words, it is a limited dependent variable.
Example: Mortgage Denial and Race
The Boston Fed HMDA Dataset
• Individual applications for single-family mortgages made in 1990 in the greater
Boston area
• 2380 observations, collected under Home Mortgage Disclosure Act (HMDA)
Variables
• Dependent variable:
• Is the mortgage denied or accepted?
• Independent variables:
• income, wealth, employment status (One important piece of information is the size of the
required loan payments relative to the applicant’s income)
• other loan, property characteristics
• race of applicant
• Research Question: whether race is a factor in denying a mortgage application;
the binary dependent variable is whether a mortgage application is denied
• What does it mean to fit a line to a dependent variable that can take on only two values, 0
and 1?
• The answer to this question is to interpret the regression function as a conditional
probability. It allows us to apply the multiple regression models to binary dependent
variable. But the predicted probability interpretation also suggests that alternative,
nonlinear regression models can do a better job modeling these probabilities
Example: linear probability model, HMDA data Mortgage denial v. ratio of debt
payments to income (P/I ratio) in a subset of the HMDA data set (n = 127)
• Is β1 =
Y ?
X
• What does the line β0 + β1X mean when Y is binary?
• 1. the population regression function is the expected value of Y given the regressors,
• E(Y| X1, X2,…Xk).
• 2. if Y is a 0–1 binary variable, its expected value (or mean) is the probability that Y = 1;
that is,
• E(Y) = 0 * Pr(Y = 0) + 1 * Pr(Y = 1) =Pr(Y = 1).
• In the regression context the expected value is conditional on the value of the regressors,
so, the probability is conditional on X. Thus for a binary variable,
• E(Y| X1, X2,…Xk). = Pr(Y = 1| X1, X2,…Xk)
The linear probability model, ctd.