User:Triciaburmeister/Sandbox/Data platform
This page is currently a draft. More information and discussion about changes to this draft can be found on the talk page.
This page and its subpages are an expanded version of the WIP draft at Data_Platform_Engineering. The contents of the boxes on that page have been moved to subpages in this framework to provide better navigation and more space for the large amount of content.
Wikimedia's data platform is a collection of systems and services that enable data producers and consumers to collect, discover, and use trustworthy data to derive insights, conduct research, and build new data products. The data platform is maintained by the Data Platform Engineering team. To contact us, please use the following intake process.
Get started
Find datasets and documentation for WMF private data sources.
Use SQL query engines, Jupyter notebooks, libraries, and compute resources to explore and analyze data.
Generate reports and use dashboards to find and share analytics, while following guidelines for publishing and sharing data.
Gather event data, generate metrics, run experiments and transform data for specific uses.
Data platform infrastructure
Lists of data platform systems and links to their docs are currently at:
Information about data pipelines is currently at: