Amazon Athena

About

Amazon Athena is a serverless query service that runs SQL directly against data stored in Amazon S3.

Athena's DDL statements (such as CREATE EXTERNAL TABLE) are based on Hive, while queries run on a Presto-based engine. See the language reference.

Example

An external table defined over JSON files stored in S3:

CREATE EXTERNAL TABLE IF NOT EXISTS wildrydes (
    Name string,
    StatusTime timestamp,
    Latitude float,
    Longitude float,
    Distance float,
    HealthPoints int,
    MagicPoints int
)
ROW FORMAT SERDE 'org.apache.hive.hcatalog.data.JsonSerDe'
LOCATION 's3://YOUR_BUCKET_NAME_HERE/';
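
The Hive JSON SerDe reads one JSON object per line, so the files under the table location should be newline-delimited JSON rather than a single pretty-printed array. The following is a minimal Python sketch of producing such a file for the table above. The helper names (to_json_line, to_json_lines) and the sample values are hypothetical, and the timestamp layout 'yyyy-MM-dd HH:mm:ss.SSS' is an assumption about what the SerDe accepts for the timestamp column, so check it against your own data before relying on it.

```python
import json
from datetime import datetime

# Column names mirror the wildrydes table definition.
COLUMNS = ["Name", "StatusTime", "Latitude", "Longitude",
           "Distance", "HealthPoints", "MagicPoints"]


def to_json_line(record: dict) -> str:
    """Serialize one record as a single-line JSON object (hypothetical helper)."""
    missing = [c for c in COLUMNS if c not in record]
    if missing:
        raise ValueError(f"missing columns: {missing}")
    # Keep only the table columns, in table order.
    row = {c: record[c] for c in COLUMNS}
    # Assumption: timestamps are written as 'yyyy-MM-dd HH:mm:ss.SSS'.
    if isinstance(row["StatusTime"], datetime):
        row["StatusTime"] = row["StatusTime"].strftime("%Y-%m-%d %H:%M:%S.%f")[:-3]
    # Compact separators; json.dumps never emits raw newlines inside a value.
    return json.dumps(row, separators=(",", ":"))


def to_json_lines(records) -> str:
    """Join records into newline-delimited JSON, one object per line."""
    return "".join(to_json_line(r) + "\n" for r in records)


# Made-up sample values, for illustration only.
sample = [
    {"Name": "Unicorn-1", "StatusTime": datetime(2017, 6, 5, 9, 17, 8, 189000),
     "Latitude": 42.25, "Longitude": -71.5, "Distance": 175.5,
     "HealthPoints": 150, "MagicPoints": 150},
    {"Name": "Unicorn-2", "StatusTime": datetime(2017, 6, 5, 9, 17, 9, 0),
     "Latitude": 42.5, "Longitude": -71.25, "Distance": 180.0,
     "HealthPoints": 140, "MagicPoints": 160},
]

if __name__ == "__main__":
    print(to_json_lines(sample), end="")
```

The resulting text can be uploaded as an object under the S3 prefix given in LOCATION; Athena then reads every object under that prefix as rows of the table.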
