Hive - Table-Level Statistics (Table/Partition/Column)

Card Puncher Data Processing

About

Statistics on the data of a table.

Statistics are stored in the Hive Metastore

Management

Conf

set hive.stats.autogather=true;
ANALYZE TABLE [db_name.]tablename [PARTITION(partcol1[=val1], partcol2[=val2], ...)]  -- (Note: Fully support qualified table name since Hive 1.2.0, see HIVE-10007.)
  COMPUTE STATISTICS 
  [FOR COLUMNS]          -- (Note: Hive 0.10.0 and later.)
  [CACHE METADATA]       -- (Note: Hive 2.1.0 and later.)
  [NOSCAN];

Documentation / Reference





Discover More
Card Puncher Data Processing
Hive - Column

in Hive Context Column Statistics in Hive (HIVE-1362) See ...
Card Puncher Data Processing
Hive - Partition

in Hive Each Table can have one or more partition. Data in each partition may be furthermore divided into Buckets. The partition columns determine how the data is stored. A separate data directory...
Card Puncher Data Processing
Hive - Table

Table implementation in Hive. serializer/deserializers (SerDe) The fully qualified name in Hive for a table is: where: db_name is the database name By default, tables are assumed to be of:...



Share this page:
Follow us:
Task Runner