Statistics - Binary logistic regression

1 - About

3 - Formula

<MATH> \href{odds#logit}{Logit}(\hat{Y}) = ln(\frac{\hat{Y}}{1-\hat{Y}}) = B_0 + \sum_{i=1}^{k}{(B_iX_i)} </MATH>


  • <math>\hat{Y}</math>: predicted value on the outcome variable Y
  • <math>Y</math>: the outcome variable
  • <math>B_0</math>: predicted value on Y when all X = 0
  • <math>X_i</math>: predictor variables
  • <math>B_i</math>: unstandardized regression coefficients
  • <math>Y-\hat{Y}</math>: residual (prediction error)
  • k = the number of predictor variables

4 - Logit Transformation


  • we're trying to predict an outcome variable that can only take on one of two values, 0 or 1.
  • then we have to come up with a predicted score that falls between 0 and 1.

The general lineal model will not guarantee that the linear combination of the predictors will come up with a score that falls between 0 and 1. This is why the logit is used in the left hand site of the formula.

<MATH> \href{odds#logit}{Logit}(\hat{Y}) = ln(\frac{\hat{Y}}{1-\hat{Y}}) </MATH>

The general lineal model assumes:

  • a linear relationship between x and y,
  • and a normal distribution in the outcome variable.

As the general lineal model will not work because we have a binary outcome variable, a logit transformation must be applied.

The logit transformation is a feature of an even more “general” mathematical framework in regression that is called the “Generalized Linear Model”. It sounds almost exactly the same as the general linear model but it's very different.

5 - Interpretation

For example with a coefficient regression B_1 of 0.39.

For every 1 unit increased in X, I'm going to predict:

  • a 0,39 increase in the logit. (not directly in Y, therefore the scale is hard to interpret
  • a 1,48 increase in the odds ratio.

Example for 1 unit increase:

  • if the odds of X - 1, is 1.27, then the odds of X becomes: <math>1,27 * 1,48 = 1.88</math>
  • in terms of probability, P(X)

<MATH> \begin{array}{rrlrrl} P(X) & = & \frac{Odds}{1+Odds} \\ P(X) & = & \frac{1.88}{2.88} & = & .65 \\ P(X-1) & = & \frac{1.27}{2.27} & = & .56 \end{array} </MATH>

6 - Hypothesis tests

We can look at:

  • the individual predictors in the regression equation. Are they statistically significant?
  • the overall model. Is the model statistically significant?
  • model comparisons. Is model A significantly beter than model B?

6.1 - Individual Predictors

To test each predictor variable, we're going to look at:

  • the regression coefficient
  • the odds ratio
  • the Wald test

6.1.1 - Regression coefficient

Are they significant ?

6.1.2 - Odds ratio

Odds ratio are more meaningful.

For one unit increase in X, the predicted changes in Odds.

It's also possible to report confidence interval for odds.

6.1.3 - Wald test

Wald test test the model with the predictor versus a model without the predictor. The Wald test is very common in logistic regression, and in more advanced statistics. We can see how well does the model fit with the predictor in, and then with the predictor taken out.

The Wald test is a function of the regression coefficient. A wall test is calculated for each predictor variable and compares the fit of the model without the predictor.

6.2 - Overall Model

How to assess the overall fit of the model ?

6.2.1 - Chi-square

Comparison of:

  • a model with all the predictors
  • to a chi-square that assumes no relationship between all the predictors and the outcome, otherwise known as the null model.

compare the fit of the model to the fit of the Null model.

6.2.2 - Wald test

multiple models (Wald test)

6.2.3 - Efficiency /Classification Success

Percentages of cases classified correctly.

See how well it classifies cases.

6.3 - Output Component

Main output component are:

  • Regression Coefficient
  • Odds Ratios
  • Wald Tests
  • Model Chi-Square
  • Classification Success
data_mining/binary_logistic_regression.txt · Last modified: 2015/07/19 14:27 by gerardnico