Superset

From Wikitech

Apache Superset is a platform for data exploration, collaboration, and visualization.

Instances

Wikimedia Cloud Services

A public instance of Superset is currently being tested. Source code can be found here: https://github.com/toolforge/superset-deploy. Patches are welcome!

Data Engineering

The data engineering team runs a restricted access instance. For access, see Analytics/Data_access#LDAP_access.

We are currently in the process of migrating these instances to Kubernetes under task T347710.

How do I ...

Get access to Superset?

Please read Superset_Access

See a list of recent queries from all users?

Look at the Query History under SQL Lab.

Run a SQL query?

Use SQL Lab, it can interactively build your query. See upstream documentation.

Database configuration has a map of which schema is in which database.

Collaborate with others?

Create a dashboard, and add additional owners. Publish the same dashboard and it will also be publicly visible. To publish, click the "Draft" text adjacent to the star icon.

Click "Draft" to publish a dashboard
Click "Draft" to publish a dashboard

Create a visualization?

A chart can be created from any query. See the upstream documentation.

Use the API?

See https://superset.wmcloud.org/swagger/v1 and the upstream documentation.

Add a feature

For new upstream features, submit them to the Apache superset project: https://github.com/apache/superset. For specific configuration related to WMF deployments, see https://github.com/toolforge/superset-deploy.