Data Mining - Nested (Transactional|Historical) Data

Thomas Bayes

About

shallow, yet wide, and nested data problems

nested transactional data = all the claims for a person for example.

Time

Considering a database of retail purchases that includes the item bought, the purchaser, and the date and time of purchase, it's easy to construct a model that will fit the training set perfectly by using the date and time of purchase to predict the other attributes; but this model will not generalize at all to new data, because those past times will never occur again.

Information from all past experience can be divided into two groups:

  • information that is relevant for the future
  • information that is irrelevant (noise).





Discover More
Support Vector Geometry
Data Mining - Support Vector Machines (SVM) algorithm

A support vector machine is a Classification method. supervised algorithm used for: Classification and Regression (binary and multi-class problem) anomalie detection (one class problem) Supports:...
Thomas Bayes
Statistics - Generalized Linear Models (GLM) - Extensions of the Linear Model

The Generalized Linear Model is an extension of the linear model that allows for lots of different,non-linear models to be tested in the context of regression. GLM is the mathematical framework used in...



Share this page:
Follow us:
Task Runner