This page links to detailed information about traffic datasets in the Data Lake.
Most of the datasets below are updated at hourly granularity, meaning that you'll get an hour of new data every hour, with between 2 and 3 hours delay (for the hour to be finished, and the data to be computed).
- webrequest hive table (raw and refined)
- pageview_hourly hive table
- projectview_hourly hive table
- Pageviews and Projectviews dumps [To be updated]
- Compressed pageviews dumps [To be updated]
- Uniques Devices
- Browser General
- pagecounts-raw (deprecated)
- pagecounts-all-sites (deprecated)
The evolution of publishing analytics data at WMF is recorded here in a timeline.