Jump to content


From Wikitech

Technical documentation for Wikimedia implementation of Hive, a system that projects structure onto flat data (text or binary) in HDFS and allows this data to be queried using an SQL-like syntax. Hive provides two main components: the Hive Metastore, which catalogs all the datasets available in HDFS, and the Hive Query Engine, which translates SQL to executable code.