Data Engineering/Systems
These subpages explain in technical detail the systems that process data for Analytics at the Wikimedia Foundation. They cover setup, maintenance, architecture, and related topics.
Child Pages of Data Engineering/Systems
AQS · Airflow · Archiva · Ceph · Conda · DB Replica · Dashiki · DataHub · Dealing with data loss alarms · Druid · EventLogging · EventStreams · Event Data retention · Hadoop Event Ingestion Lifecycle · Java · Jupyter · Kerberos · Maintenance Schedule · Managing systemd timers · Matomo · Presto · Refine · Reportupdater · Superset · Turnilo · Varnishkafka · Wikistats · Wikistats 2
All Subpages of Data Engineering/Systems
- AQS
- AQS/Scaling
- AQS/Scaling/2016/Hardware Refresh
- AQS/Scaling/2017/Cluster Expansion
- AQS/Scaling/2020/Cluster Expansion
- AQS/Scaling/LoadTesting
- Airflow
- Airflow/Airflow testing instance tutorial
- Airflow/Developer guide
- Airflow/Developer guide/Python Job Repos
- Airflow/Instances
- Airflow/Upgrading
- Archiva
- Ceph
- Cluster/Deploy/Refinery
- Cluster/Deploy/Refinery-source
- Cluster/Geotagging
- Cluster/Hadoop/Administration
- Cluster/Hadoop/Load
- Cluster/Oozie
- Cluster/Oozie/Administration
- Cluster/Spark
- Conda
- DB Replica
- Dashiki
- Dashiki/Configuration
- DataHub
- DataHub/Data Catalog Documentation Guide
- DataHub/Upgrading
- Dealing with data loss alarms
- Druid
- Druid/Alerts
- Druid/Load test
- EventLogging
- EventLogging/Administration
- EventLogging/Architecture
- EventLogging/Backfilling
- EventLogging/Data representations
- EventLogging/EventCapsule
- EventLogging/Monitoring
- EventLogging/NotErrorLogging
- EventLogging/Outages
- EventLogging/Performance
- EventLogging/Publishing
- EventLogging/Sanitization vs Aggregation
- EventLogging/Schema Guidelines
- EventLogging/Sensitive Fields
- EventLogging/TestingOnBetaCluster
- EventLogging/User agent sanitization
- EventStreams
- Event Data retention
- Event Data retention/AppInstallId
- Hadoop Event Ingestion Lifecycle
- Java
- Jupyter
- Jupyter/Administration
- Jupyter/Tips
- Kerberos
- Kerberos/Administration
- Maintenance Schedule
- Managing systemd timers
- Matomo
- Presto
- Presto/Administration
- Presto/Query Logger
- Refine
- Reportupdater
- Superset
- Superset/Administration
- Superset/Date functions
- Turnilo
- Varnishkafka
- Wikistats
- Wikistats/Traffic
- Wikistats 2