About

Anscombe Regression

An (outcome|dependent) variable is a measure that we want to predict.

There is two type of outcome variables:

A dependent variable (DV) is measured during the experimentation in contrast with independent variable (IV) that describes:

  • the characteristics of the group (such as name)
  • and/of the experimentation (such as the steps: before, after).

The target Attribute in your build data of a supervised model is the attribute that you try to predict.

The target attribute is also called:

  • Outcome measurement (Y)
  • Outcome variable (Y)
  • Output variable
  • Label
  • Dependent variable (DV),
  • Response variable. (It's a statistic term when groups respond to a treatment)
  • Responding variable
  • Regressand
  • Explained variable
  • Experimental variable
  • Output variable
  • Criterion Variable
  • Unobserved variable
  • Predicted variable
  • Measured variable

When the target attribute is:

In the test data, it will contain values with known outcomes in order to measure the performance of the model.

In the scoring data, it will contains the results when the model is applied.

Clustering, feature extraction, association, and anomaly detection models do not use a target because they are unsupervised function.

When the target is predicted, we get an predicted target and you can find the following notations in formulas.

  • <math>Y</math> is the original target score from the training data (ie collected)
  • <math>\hat{Y}</math> is the predicted score from the model.

Terminology

Tools

Weka

Weka Target Class

Documentation / Reference