Introduction
Simultaneity bias and identification are two fundamental concepts in econometrics, particularly when dealing with simultaneous equation models (SEMs). SEMs occur when more than one endogenous variable is determined within a system of equations, causing problems in estimation using ordinary least squares (OLS).
a) What is Simultaneity Bias?
Simultaneity bias occurs when an explanatory variable in a regression equation is simultaneously determined with the dependent variable. In such cases, the regressor is endogenous, meaning it is correlated with the error term. This violates a key OLS assumption, leading to biased and inconsistent estimates.
Example: Consider the following demand and supply equations:
- Demand: Q = a โ bP + u1
- Supply: Q = c + dP + u2
Both Q and P are determined simultaneously in the market. If we try to estimate the demand equation using OLS, the price P is endogenous (correlated with u1), leading to simultaneity bias.
Implications:
- OLS becomes invalid due to endogeneity.
- Estimates are not consistent or reliable.
Conditions Required for Identification
Before applying techniques like Two-Stage Least Squares (2SLS), we must check whether the structural equations are identified. This involves two key conditions:
1. Order Condition
This is a necessary condition. For an equation to be identified, the number of excluded exogenous variables from that equation must be at least equal to the number of endogenous variables minus one (K โ 1).
Mathematically: M โฅ K โ 1
Where:
- K = number of endogenous variables in the system
- M = number of excluded exogenous variables from the equation
2. Rank Condition
This is a sufficient condition. It requires that the matrix formed by the excluded variables must have full rank. In simpler terms, the excluded variables must influence the endogenous variable sufficiently to help identify the equation.
b) Check the Identification Status of the Given Two-Equation System
We are given the following system:
- Equation 1: Y1 = ฮฑ1 + ฮฑ2Y2 + u1
- Equation 2: Y2 = ฮฒ1 + ฮฒ2Y1 + ฮฒ3Z1 + ฮฒ4Z2 + u2
Endogenous variables: Y1, Y2 (K = 2)
Exogenous variables: Z1, Z2
Equation 1:
- Includes Y2 (endogenous)
- Excludes Z1, Z2 (2 exogenous variables)
- So, M = 2
Order condition: M โฅ K โ 1 โ 2 โฅ 1 โ Satisfied
Conclusion: Equation 1 is over-identified.
Equation 2:
- Includes Y1 (endogenous), Z1, Z2 (exogenous)
- No exogenous variable is excluded from Equation 2
- M = 0
Order condition: M โฅ K โ 1 โ 0 โฅ 1 โ Not satisfied
Conclusion: Equation 2 is under-identified
Conclusion
Simultaneity bias arises when explanatory variables are endogenous, leading to biased and inconsistent OLS estimators. To tackle this, structural equations must be identified, using the order and rank conditions. In the given system, the first equation is over-identified and can be estimated using instrumental variables methods. However, the second equation is under-identified, so it cannot be estimated uniquely with available data.