Data Mining - (Golf|Weather) Data Set

Thomas Bayes

About

The weather data is a small open data set with only 14 examples.

In RapidMiner it is named Golf Dataset, whereas Weka has two data set: weather.nominal.arff and weather.numeric.arff

The dataset contains data about weather conditions are suitable for playing a game of golf.

Data Set

Outlook Temperature
Numeric
Temperature
Nominal
Humidity
Numeric
Humidity
Nominal
Windy Play
overcast 83 hot 86 high FALSE yes
overcast 64 cool 65 normal TRUE yes
overcast 72 mild 90 high TRUE yes
overcast 81 hot 75 normal FALSE yes
rainy 70 mild 96 high FALSE yes
rainy 68 cool 80 normal FALSE yes
rainy 65 cool 70 normal TRUE no
rainy 75 mild 80 normal FALSE yes
rainy 71 mild 91 high TRUE no
sunny 85 hot 85 high FALSE no
sunny 80 hot 90 high TRUE no
sunny 72 mild 95 high FALSE no
sunny 69 cool 70 normal FALSE yes
sunny 75 mild 70 normal TRUE yes





Discover More
Claude Shannon
Data Mining - Information Gain

Information theory was find by Claude_ShannonClaude Shannon. It has quantified entropy. This is key measure of information which is usually expressed by the average number of bits needed to store or communicate...
Data Minig Naives Bayes
Data Mining - Naive Bayes (NB)

Naive Bayes (NB) is a simple supervised function and is special form of discriminant analysis. It's a generative model and therefore returns probabilities. It's the opposite classification strategy...
One R Graph
Machine Learning - (One|Simple) Rule - (One Level Decision Tree)

One Rule is an simple method based on a 1‐level decision tree described in 1993 by Rob Holte, Alberta, Canada. really simple so small/noisy/complex that nothing can be learned from them ...
Bed Overfitting
Machine Learning - (Overfitting|Overtraining|Robust|Generalization) (Underfitting)

A learning algorithm is said to overfit if it is: more accurate in fitting known data (ie training data) (hindsight) but less accurate in predicting new data (ie test data) (foresight) Ie the model...



Share this page:
Follow us:
Task Runner