Analytics/Archive/Hue
Hue is a web interface for the Hadoop cluster, most notably for running Hive queries. Our Hue instance is at https://hue.wikimedia.org.
General documentation for Hue can be found on our instance at https://hue.wikimedia.org/help/.
Access
To access Hue, you will just need wmf
or nda
LDAP access. For more details, see Analytics/Data access#LDAP access.[1]
If you want to use Hue to query private data, you will also need analytics-privatedata-users
shell access.
Hive query errors with Kerberos
In T242306 it was reported the following error while using the Hive query editor:
Error while compiling statement: FAILED: SemanticException java.lang.RuntimeException: Unable to instantiate org.apache.hadoop.hive.ql.metadata.SessionHiveMetaStoreClient
The error is related to Kerberos authentication, and can be fixed forcing the renewal of the Hive session via the following steps:
- In the Hue Hive editor, locate the 3-dots button at the top-right corner of the screen and hit it.
- Then hit "Session" and the "Recreate" button.
Package and release a new version of Hue
We package Hue from the cloudera upstream repository, https://github.com/cloudera/hue/
Notes
- â In the past, you also needed someone from Analytics Engineering to manually sync your LDAP account to Hue. This is no longer necessary.