Analytics/Archive/Hue

This page contains historical information. It may be outdated or unreliable.

Hue is a web interface for the Hadoop cluster, most notably for running Hive queries. Our Hue instance is at https://hue.wikimedia.org.

General documentation for Hue can be found on our instance at https://hue.wikimedia.org/help/.

Access

To access Hue, you will just need wmf or nda LDAP access. For more details, see Analytics/Data access#LDAP access.^[1]

If you want to use Hue to query private data, you will also need analytics-privatedata-users shell access.

Hive query errors with Kerberos

In T242306 it was reported the following error while using the Hive query editor:

Error while compiling statement: FAILED: SemanticException java.lang.RuntimeException: Unable to instantiate org.apache.hadoop.hive.ql.metadata.SessionHiveMetaStoreClient

The error is related to Kerberos authentication, and can be fixed forcing the renewal of the Hive session via the following steps:

In the Hue Hive editor, locate the 3-dots button at the top-right corner of the screen and hit it.

Then hit "Session" and the "Recreate" button.

Package and release a new version of Hue

We package Hue from the cloudera upstream repository, https://github.com/cloudera/hue/

https://gerrit.wikimedia.org/r/plugins/gitiles/operations/debs/hue/+/refs/heads/master/debian/README.debian

Notes

↑ In the past, you also needed someone from Analytics Engineering to manually sync your LDAP account to Hue. This is no longer necessary.

[1] In the past, you also needed someone from Analytics Engineering to manually sync your LDAP account to Hue. This is no longer necessary.

[1]