Analytics/Systems
Jump to navigation
Jump to search
These subpages explain in technical detail the systems that process data for Analytics at Wikimedia Foundation. They include information about setup, maintenance, architecture, and more.
Subpages of Analytics/Systems
- AQS
- AQS/Scaling
- AQS/Scaling/2016/Hardware Refresh
- AQS/Scaling/2017/Cluster Expansion
- AQS/Scaling/2020/Cluster Expansion
- AQS/Scaling/LoadTesting
- Airflow/Airflow testing instance tutorial
- Anaconda
- Archiva
- Clients
- Cluster
- Cluster/AMD GPU
- Cluster/Access
- Cluster/Beeline
- Cluster/BrowserReports
- Cluster/Camus
- Cluster/Coordinator
- Cluster/Data Format Experiments
- Cluster/Data deletion and sanitization
- Cluster/Deploy/Refinery
- Cluster/Deploy/Refinery-source
- Cluster/Deploy a fix to incorrect camus partitionning
- Cluster/Edit data loading
- Cluster/Edit history administration
- Cluster/Edit serving layer
- Cluster/Geolocation
- Cluster/Geotagging
- Cluster/Hadoop
- Cluster/Hadoop/Administration
- Cluster/Hadoop/Alerts
- Cluster/Hadoop/Load
- Cluster/Hadoop/Test
- Cluster/Hive
- Cluster/Hive/Avro
- Cluster/Hive/Compression
- Cluster/Hive/Counting uniques
- Cluster/Hive/Queries
- Cluster/Hive/Queries/Wikidata
- Cluster/Hive/QueryUsingUDF
- Cluster/Ingestion Pipeline
- Cluster/Kafka/Capacity
- Cluster/MediaWiki Avro Logging
- Cluster/Mediawiki History Snapshot Check
- Cluster/Mediawiki history reduced algorithm
- Cluster/Mysql Meta
- Cluster/Oozie
- Cluster/Oozie/Administration
- Cluster/Page and user history reconstruction
- Cluster/Page and user history reconstruction algorithm
- Cluster/Revision augmentation and denormalization
- Cluster/Spark
- Cluster/Spark/Administration
- Cluster/Workflow management tools study
- DB Replica
- Dashiki
- Dashiki/Configuration
- Datastores/Evaluation
- Dealing with data loss alarms
- Druid
- Druid/Load test
- EventLogging
- EventLogging/Administration
- EventLogging/Architecture
- EventLogging/Backfilling
- EventLogging/Data representations
- EventLogging/Data retention
- EventLogging/Data retention/AppInstallId
- EventLogging/EventCapsule
- EventLogging/Monitoring
- EventLogging/NotErrorLogging
- EventLogging/Outages
- EventLogging/Performance
- EventLogging/Publishing
- EventLogging/Sanitization vs Aggregation
- EventLogging/Schema Guidelines
- EventLogging/Sensitive Fields
- EventLogging/TestingOnBetaCluster
- EventLogging/User agent sanitization
- EventStreams
- Exporting from HDFS to Swift
- Geoeditors
- Hive to Druid Ingestion Pipeline
- Jupyter
- Kerberos
- Kerberos/UserGuide
- Maintenance Schedule
- Managing systemd timers
- MariaDB
- Matomo
- Presto
- Presto/Administration
- Refine
- Reportupdater
- Siege
- Superset
- Tier2
- Turnilo
- Varnishkafka
- Wikimetrics
- Wikimetrics/Adding New Features
- Wikimetrics/Adding New Features/CentralAuth Cohorts
- Wikimetrics/Adding New Features/Tag Cohorts
- Wikimetrics/Global metrics
- Wikistats
- Wikistats/Traffic
- Wikistats2/Metrics/FAQ
- Wikistats 2
- ua-parser
- ua-parser/2019-09-18 Update