Misplaced Pages

Testing in binary response index models

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
This article has multiple issues. Please help improve it or discuss these issues on the talk page. (Learn how and when to remove these messages)
This article is an orphan, as no other articles link to it. Please introduce links to this page from related articles; try the Find link tool for suggestions. (December 2015)
This article may be too technical for most readers to understand. Please help improve it to make it understandable to non-experts, without removing the technical details. (April 2018) (Learn how and when to remove this message)
(Learn how and when to remove this message)

Denote a binary response index model as: P [ Y i = 1 X i ] = G ( X i β ) {\displaystyle P=G(X_{i}\beta )} , [ Y i = 0 X i ] = 1 G ( X i β ) {\displaystyle =1-G(X_{i}'\beta )} where X i R N {\displaystyle X_{i}\in R^{N}} .

Description

This type of model is applied in many economic contexts, especially in modelling the choice-making behavior. For instance, Y i {\displaystyle Y_{i}} here denotes whether consumer i {\displaystyle i} chooses to purchase a certain kind of chocolate, and X i {\displaystyle X_{i}} includes many variables characterizing the features of consumer i {\displaystyle i} . Through function G ( ) {\displaystyle G(\cdot )} , the probability of choosing to purchase is determined.

Now, suppose its maximum likelihood estimator (MLE) β ^ u {\displaystyle {\hat {\beta }}_{u}} has an asymptotic distribution as n ( β ^ u β ) d N ( 0 , V ) {\displaystyle {\sqrt {n}}({\hat {\beta }}_{u}-\beta ){\xrightarrow {d}}N(0,V)} and there is a feasible consistent estimator for the asymptotic variance V {\displaystyle V} denoted as V ^ {\displaystyle {\hat {V}}} . Usually, there are two different types of hypothesis needed to be tested in binary response index model.

The first type is testing the multiple exclusion restrictions, namely, testing β 2 = 0 w i t h = [ β 1 ; β 2 ] w h e r e β 2 R Q {\displaystyle \beta _{2}=0with=where\beta _{2}\in R^{Q}} . If the unrestricted MLE can be easily computed, it is convenient to use the Wald test whose test statistic is constructed as:

( D β ^ u ) T ( D V ^ D T / n ) 1 ( D β ^ u ) d X Q 2 {\displaystyle (D{\hat {\beta }}_{u})^{T}(D{\hat {V}}D^{T}/n)^{-1}(D{\hat {\beta }}_{u}){\xrightarrow {d}}X_{Q}^{2}}

Where D is a diagonal matrix with the last Q diagonal entries as 0 and others as 1. If the restricted MLE can be easily computed, it is more convenient to use the Score test (LM test). Denote the maximum likelihood estimator under the restricted model as ( β ^ r ) {\displaystyle ({\hat {\beta }}_{r})} and define u ^ i Y i G ( X i β ^ r ) , G ^ i G ( X i β ^ r ) {\displaystyle {\hat {u}}_{i}\equiv Y_{i}-G(X_{i}'{\hat {\beta }}_{r}),{\hat {G}}_{i}\equiv G(X_{i}'{\hat {\beta }}_{r})} and g ^ i g ( X i β ^ r {\displaystyle {\hat {g}}_{i}\equiv g(X_{i}'{\hat {\beta }}_{r}} , where g ( ) = G ( ) {\displaystyle g(\cdot )=G'(\cdot )} . Then run the OLS regression u ^ i ( G ^ i ( 1 G ^ i ) {\displaystyle {\frac {{\hat {u}}_{i}}{\sqrt {({\hat {G}}_{i}(1-{\hat {G}}_{i})}}}} on g ^ i ( G ^ i ( 1 G ^ i ) X 1 i , g ^ i ( G ^ i ( 1 G ^ i ) X 2 i {\displaystyle {\frac {{\hat {g}}_{i}}{\sqrt {({\hat {G}}_{i}(1-{\hat {G}}_{i})}}}X_{1i}',{\frac {{\hat {g}}_{i}}{\sqrt {({\hat {G}}_{i}(1-{\hat {G}}_{i})}}}X_{2i}'} , where X i = [ X 1 i ; X 2 i ] {\displaystyle X_{i}=} and X 2 i ε R Q {\displaystyle X_{2i}\varepsilon R^{Q}} . The LM statistic is equal to the explained sum of squares from this regression and it is asymptotically distributed as X Q 2 {\displaystyle X_{Q}^{2}} . If the MLE can be computed easily under both of the restricted and unrestricted models, Likelihood-ratio test is also a choice: let L u {\displaystyle L_{u}} denote the value of the log-likelihood function under the unrestricted model and let L r {\displaystyle L_{r}} denote the value under the restricted model, then 2 ( L u L r ) {\displaystyle 2(L_{u}-L_{r})} has an asymptotic X Q 2 {\displaystyle X_{Q}^{2}} distribution.

The second type is testing a nonlinear hypothesis about β {\displaystyle \beta } , which can be represented as H 0 : c ( β ) = 0 {\displaystyle H_{0}:c(\beta )=0} where c ( β ) {\displaystyle c(\beta )} is a Q×1 vector of possibly nonlinear functions satisfying the differentiability and rank requirements. In most of the cases, it is not easy or even feasible to compute the MLE under the restricted model when c ( β ) {\displaystyle c(\beta )} include some complicated nonlinear functions. Hence, Wald test is usually used to deal with this problem. The test statistic is constructed as:

c ( β ^ u ) [ β c ( β ^ u ) V ^ β c ( β ^ u ) ] 1 c ( β ^ u ) d X Q 2 {\displaystyle c({\hat {\beta }}_{u}')^{-1}c({\hat {\beta }}_{u}){\xrightarrow {d}}X_{Q}^{2}}

where β c ( β ^ u ) {\displaystyle \nabla _{\beta }c({\hat {\beta }}_{u})} is the Q×N Jacobian of c ( β ) {\displaystyle c(\beta )} evaluated at β ^ u {\displaystyle {\hat {\beta }}_{u}} .

For the tests with very general and complicated alternatives, the formula of the test statistics might not have the exactly same representation as above. But we can still derive the formulas as well as its asymptotic distribution by Delta method and implement Wald test, Score test or Likelihood-ratio test. Which test should be used is determined by the relative computation difficulty of the MLE under restricted and unrestricted models.

References

  1. For an application example, refer to: Rayton, B. A. (2006): “Examining the Interconnection of Job Satisfaction and Organizational Commitment: an Application of the Bivariate Probit Model”,The International Journal of Human Resource Management, Vol. 17, Iss. 1.
  2. Greene, W. H. (2003), Econometric Analysis, Prentice Hall, Upper Saddle River, NJ .
  3. Wooldridge, J. (2002): Econometric Analysis of Cross Section and Panel Data, MIT Press, Cambridge, Mass.
  4. Casella, G., and Berger, R. L. (2002). Statistical inference. Duxbury Press.
  5. Engle, Robert F. (1983). "Wald, Likelihood Ratio, and Lagrange Multiplier Tests in Econometrics". In Intriligator, M. D.; and Griliches, Z. Handbook of Econometrics II. Elsevier. pp. 796–801. ISBN 978-0-444-86185-6.
Statistics
Descriptive statistics
Continuous data
Center
Dispersion
Shape
Count data
Summary tables
Dependence
Graphics
Data collection
Study design
Survey methodology
Controlled experiments
Adaptive designs
Observational studies
Statistical inference
Statistical theory
Frequentist inference
Point estimation
Interval estimation
Testing hypotheses
Parametric tests
Specific tests
Goodness of fit
Rank statistics
Bayesian inference
Correlation
Regression analysis
Linear regression
Non-standard predictors
Generalized linear model
Partition of variance
Categorical / Multivariate / Time-series / Survival analysis
Categorical
Multivariate
Time-series
General
Specific tests
Time domain
Frequency domain
Survival
Survival function
Hazard function
Test
Applications
Biostatistics
Engineering statistics
Social statistics
Spatial statistics
Categories: