Data Mining - Content Analysis and Acquisition
1 - List
1.1 - Software
Apache Tika (content analysis toolkit) - The Apache Tika™ toolkit detects and extracts metadata and structured text content from various document formats (PDF, OpenOffice, Word, …) using existing parser libraries.
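To make the "detect" step concrete, here is a minimal sketch of content-type detection from a file's leading bytes, using only the Python standard library. It is not Tika's API or its implementation: the function name `detect_mime` is hypothetical, and the checks cover only the three formats named above (PDF by its `%PDF-` signature, OpenDocument by the `mimetype` entry in its ZIP container, Word .docx by its `word/document.xml` entry).

```python
import io
import zipfile


def detect_mime(data: bytes) -> str:
    """Guess a MIME type from raw bytes (hypothetical helper, not Tika's API)."""
    # PDF files start with the signature "%PDF-".
    if data.startswith(b"%PDF-"):
        return "application/pdf"

    # OpenDocument and .docx files are ZIP containers (local header "PK\x03\x04").
    if data.startswith(b"PK\x03\x04"):
        with zipfile.ZipFile(io.BytesIO(data)) as archive:
            names = archive.namelist()
            # OpenDocument packages store their MIME type in a "mimetype" entry.
            if "mimetype" in names:
                return archive.read("mimetype").decode("ascii").strip()
            # Assumption: a "word/document.xml" entry marks a Word .docx file.
            if "word/document.xml" in names:
                return ("application/vnd.openxmlformats-officedocument"
                        ".wordprocessingml.document")
        return "application/zip"

    # Unknown content: fall back to the generic binary type.
    return "application/octet-stream"
```

A full toolkit would then hand the bytes to the parser library registered for the detected type. This sketch stops at detection.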
1.2 - Text mining
1.3 - Crawler