User:Triciaburmeister/Sandbox/Data platform
This page is currently a draft. More information and discussion about changes to this draft on the talk page. |
This page and its subpages are an expanded version of the WIP draft at Data_Platform_Engineering. The contents of the boxes on that page are moved to subpages in this framework, to provide better navigation and space for the large amount of content. |
Wikimedia's data platform is a collection of systems and services that enable data producers and consumers to collect, discover, and use trustworthy data to derive data insights, conduct research and build new data products. The data platform is maintained by the Data Platform Engineering team. To contact us please use the following intake process.
Get started
Find datasets and documentation for WMF private data sources.
Use SQL query engines, Jupyter notebooks, libraries, and compute resources to explore and analyze data.
Define and schedule jobs to transform existing data. Share data artifacts, reports and dashboards.
Add new instrumentation and analytics data sources to the Data Platform.
Data platform infrastructure
Lists of data platform systems and links to their docs are currently at:
Information about data pipelines is currently at: