Talk:Analytics/Data Lake/Edits/MediaWiki history dumps

Rendered with Parsoid
From Wikitech

Suggestion: code examples in subpages

I'd move the code examples to subpages to clear up the main article and make browsing easier. That'd be cool if we could just link to notebooks sometime in the newpyter future.

Code moved to subpages and links updated. While I agree it would be nice to link to notebooks, I also wish this code to be helpful for external people :) --Joal (talk) 09:10, 24 January 2020 (UTC)Reply

Access in Toolforge and PAWS

I'm trying to use this dataset in some new tools, but I cannot seem to find it in the /public/dumps path. Is it there already? If not could it be made available there?

Due to the size of the files it'd be more useful to not have to copy them.

Cheers, chicocvenancio (talk) 21:39, 12 February 2020 (UTC)Reply

Thank you for your message! The dataset is available on paws at that path: /public/dumps/public/other/mediawiki_history. Enjoy :) --Joal (talk) 08:16, 13 February 2020 (UTC)Reply