Talk:Analytics/Data Lake/Traffic/Webrequest/RawIPUsage


Remaining questions

  1. Is the ability to rerun metrics more important than protecting reader privacy?
  2. On what basis was the previous question decided? If it has not yet been decided, who has the authority to establish that basis?
  3. Can Ops use access logs in which the article names have never been stored on permanent, non-RAM media?
  4. Can the users who require logs of article names use logs in which the IP address, proxy information, and geolocation have never been stored on permanent media?

JSalsman (talk) 03:36, 29 January 2017 (UTC)

Update

More questions per https://lists.wikimedia.org/pipermail/analytics/2017-January/005696.html

5. Is it the official policy of the Wikimedia Foundation to keep reader logs with article names associated with IP addresses, proxy information, or geolocation for 60 days?

6. How can interested parties appeal the decision to do so?

7. If GET requests from readers were split into two files, an hourly tally of article names and a separate log of IP address, proxy information, and geolocation with exact timestamps but no article names, would that meet all of the Foundation's and the research community's needs?

JSalsman (talk) 04:27, 30 January 2017 (UTC)
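
For illustration only, the split proposed in question 7 might be sketched as follows. This is a minimal sketch, not the actual webrequest pipeline: the record fields (`timestamp`, `article`, `ip`, `proxy`, `geo`) and the function name `split_requests` are hypothetical, and the sample addresses come from the reserved documentation ranges.

```python
from collections import Counter
from datetime import datetime, timezone


def split_requests(requests):
    """Split request records into two outputs that share no common key.

    Returns (tallies, client_log):
      tallies    -- dict mapping an hour label to a Counter of article names
      client_log -- list of (exact timestamp, ip, proxy, geo) tuples,
                    with no article name attached
    The field names are hypothetical, chosen only for this sketch.
    """
    tallies = {}
    client_log = []
    for req in requests:
        ts = req["timestamp"]
        # Article names keep only hour-level resolution.
        hour = ts.strftime("%Y-%m-%dT%H:00Z")
        tallies.setdefault(hour, Counter())[req["article"]] += 1
        # Client details keep the exact timestamp but drop the article name.
        client_log.append((ts.isoformat(), req["ip"], req["proxy"], req["geo"]))
    return tallies, client_log


# Hypothetical sample records.
sample = [
    {"timestamp": datetime(2017, 1, 30, 4, 27, 5, tzinfo=timezone.utc),
     "article": "Main_Page", "ip": "192.0.2.1", "proxy": "-", "geo": "US"},
    {"timestamp": datetime(2017, 1, 30, 4, 41, 9, tzinfo=timezone.utc),
     "article": "Main_Page", "ip": "198.51.100.7", "proxy": "-", "geo": "DE"},
    {"timestamp": datetime(2017, 1, 30, 5, 2, 0, tzinfo=timezone.utc),
     "article": "Privacy", "ip": "192.0.2.1", "proxy": "-", "geo": "US"},
]

tallies, client_log = split_requests(sample)
```

Here `tallies` would feed the hourly article-name file and `client_log` the separate file; in this sketch neither output carries a field that joins a client entry back to an article name.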