This page will help us decide on a centralized data store to handle analytics at WMF.
- Spark SQL
  - This is not a datastore? Hm, I guess Hive isn't really either.
- Hive on Spark
  - Not yet released.
- Hive on Tez
- OpenTSDB (HBase)
- Apache Drill
  - This is like Impala, but Impala would probably be easier (and maybe better).
- Mondrian + PostgreSQL
- Crate (Elasticsearch)
This page is helpful: http://blog.matthewrathbone.com/2014/06/08/sql-engines-for-hadoop.html