Analytics/Archive/Hue

From Wikitech
(Redirected from Analytics/Cluster/Hue)

Hue is a web interface for the Hadoop cluster, most notably for running Hive queries. Our Hue instance is at https://hue.wikimedia.org.

General documentation for Hue can be found on our instance at https://hue.wikimedia.org/help/.

Access

To access Hue, you will just need wmf or nda LDAP access. For more details, see Analytics/Data access#LDAP access.[1]

If you want to use Hue to query private data, you will also need analytics-privatedata-users shell access.

Hive query errors with Kerberos

In T242306 it was reported the following error while using the Hive query editor:

Error while compiling statement: FAILED: SemanticException java.lang.RuntimeException: Unable to instantiate org.apache.hadoop.hive.ql.metadata.SessionHiveMetaStoreClient

The error is related to Kerberos authentication, and can be fixed forcing the renewal of the Hive session via the following steps:

  • In the Hue Hive editor, locate the 3-dots button at the top-right corner of the screen and hit it.
  • Then hit "Session" and the "Recreate" button.


Package and release a new version of Hue

We package Hue from the cloudera upstream repository, https://github.com/cloudera/hue/

https://gerrit.wikimedia.org/r/plugins/gitiles/operations/debs/hue/+/refs/heads/master/debian/README.debian

Notes

  1. In the past, you also needed someone from Analytics Engineering to manually sync your LDAP account to Hue. This is no longer necessary.