> Database

1 - About

This section groups together all databases that manage data, including SQL engines.

2 - Db Design Space

to continue

| Db | Select | Insert/Update | Notes |
|---|---|---|---|
| Amazon Athena | Yes | No | Query data against S3 |

| Engine | MapReduce Dependency / Batch Mode | Description |
|---|---|---|
| Impala | No | Impala is well-suited to executing SQL queries for interactive exploratory analytics on large datasets. |
| Apache Hive | Yes | Hive and MapReduce are better tools for very long running, batch-oriented tasks such as ETL. |
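As an illustration of the "Query data against S3" note for Amazon Athena, here is a minimal Python sketch. It assumes a client object that exposes the same `start_query_execution`, `get_query_execution` and `get_query_results` calls as the boto3 Athena client. The function name `run_athena_select`, the SELECT-only guard and all parameter values are hypothetical and chosen only for this example.

```python
import time


def run_athena_select(client, sql, database, output_location, poll_seconds=1.0):
    """Run a SELECT through an Athena-style client and return rows as lists of strings.

    `client` is assumed to behave like boto3.client("athena").
    `output_location` is the S3 URI where Athena writes its result files.
    """
    # Hypothetical guard for this sketch: it only accepts SELECT statements.
    if not sql.lstrip().lower().startswith("select"):
        raise ValueError("this sketch only runs SELECT statements")

    # Submit the query; Athena runs it asynchronously and returns an id.
    started = client.start_query_execution(
        QueryString=sql,
        QueryExecutionContext={"Database": database},
        ResultConfiguration={"OutputLocation": output_location},
    )
    query_id = started["QueryExecutionId"]

    # Poll until the query reaches a terminal state.
    while True:
        execution = client.get_query_execution(QueryExecutionId=query_id)
        state = execution["QueryExecution"]["Status"]["State"]
        if state in ("SUCCEEDED", "FAILED", "CANCELLED"):
            break
        time.sleep(poll_seconds)

    if state != "SUCCEEDED":
        raise RuntimeError(f"query {query_id} ended in state {state}")

    # Fetch the first page of results only; for a SELECT, the first row
    # holds the column names.
    result = client.get_query_results(QueryExecutionId=query_id)
    rows = result["ResultSet"]["Rows"]
    return [[cell.get("VarCharValue") for cell in row["Data"]] for row in rows]
```

With boto3 installed and AWS credentials configured, a call could look like `run_athena_select(boto3.client("athena"), "SELECT * FROM logs LIMIT 10", "my_db", "s3://my-bucket/athena-results/")`, where the table, database and bucket names are placeholders.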

2.1 - Real Time

Engines such as Impala provide interactive SQL support for data stored in Hadoop.

They are based on Dremel, Google’s ad-hoc query system.

2.2 - Batch

See Apache Hive (HS, Hive Server) with MapReduce.