Superset

Apache Superset is a platform for data exploration, collaboration, and visualization.

Instances

Wikimedia Cloud Services

Tracked in Phabricator
Task T169452

A public instance of Superset is currently being tested. It has access to Wiki Replicas and to ToolsDB (public schemas only). Source code can be found here: https://github.com/toolforge/superset-deploy. Patches are welcome!

Data Engineering

The data engineering team runs a restricted access instance. For access, see Analytics/Data access#LDAP access.

How do I ...

Get access to Superset?

Please read Superset Access.

See a list of recent queries from all users?

Look at the Query History under SQL Lab.

Run a SQL query?

Use SQL Lab, it can interactively build your query. See upstream documentation.

Database configuration has a map of which schema is in which database.

Collaborate with others?

Create a dashboard, and add additional owners. Publish the same dashboard and it will also be publicly visible. To publish, click the "Draft" text adjacent to the star icon.

Create a visualization?

A chart can be created from any query. See the upstream documentation.

Use the API?

See https://superset.wmcloud.org/swagger/v1 and the upstream documentation.

Add a feature

For new upstream features, submit them to the Apache superset project: https://github.com/apache/superset. For specific configuration related to WMF deployments, see https://github.com/toolforge/superset-deploy.