User:Triciaburmeister/Sandbox/Data platform/Collect data
This page is currently a draft. More information and discussion about changes to this draft on the talk page. |
This page describes systems for collecting and generating private Wikimedia data. For public data, see meta:Research:Data.
- Data Collection
- Metrics Platform
- Event Instrumentation
- Very brief and roughly drafted guide to measurement plan & instrumentation specification
- Data Collection Guidelines [draft, internal only, succeeds instrumentation DACI; will eventually be posted to Foundation wiki just like Data Publication Guidelines]
- related: Data Retention Guidelines
- Data Ingestion?
- Web Analytics with Matomo
- Batch Transforms
- Airflow User Guide
- Report Updater (Deprecated) / Refine / SystemD
- Streaming Transforms
- Event Platform
- Google Search Console access
- Lifecycle of an Event
- Event* Disambiguates between the many event services, and links to their more extensive documentation.
- Event Schemas and Guidelines
- Hadoop Event Ingestion Lifecycle
- Event Sanitization
- https://wikitech.wikimedia.org/wiki/Analytics/Hive_to_Druid_Ingestion_Pipeline
- Banner impressions data for fundraising campaigns https://wikitech.wikimedia.org/wiki/Analytics/Fundraising