Jump to content
Server Admin Log/Archive 33
2017-12-30
18:22 ejegg: disabled failing silverpop data fetch jobs
2017-12-29
18:31 mutante: gerrit: restart service to apply change from Dec 20th, avoid logspam due to unapplied config change
15:38 ariel@tin: Finished deploy [dumps/dumps@51d4fd6]: enable pageslogging dump in parallel jobs (duration: 00m 02s)
15:38 ariel@tin: Started deploy [dumps/dumps@51d4fd6]: enable pageslogging dump in parallel jobs
14:46 apergos: restarted puppetdb on nitrogen, puppetdb refusing to hand out facts again
08:34 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1110 (duration: 00m 52s)
08:19 marostegui: Stop replication in sync on dbstore1002.s5 and db1110
08:03 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1110 to reimport dewiki.imagelinks on dbstore1002 (duration: 00m 52s)
07:43 marostegui: Fixing dbstore1002 dewiki.imagelinks
2017-12-28
14:18 marostegui: Power cycle pc2005 as it is down
2017-12-27
19:54 godog: power reset ms-be1033
19:47 godog: reboot ms-be1033 - T183724
17:00 twentyafterfour: restarting apache on phab1001
15:56 mobrovac@tin: Started restart [electron-render/deploy@94d27d7]: Bounce Electron, stuck - T174916
2017-12-26
16:33 mobrovac@tin: Finished deploy [mathoid/deploy@63b2ddc]: Bring back Mathoid in codfw to v0.6.5 in sync with eqiad - T179419 T172767 (duration: 01m 44s)
16:31 mobrovac@tin: Started deploy [mathoid/deploy@63b2ddc]: Bring back Mathoid in codfw to v0.6.5 in sync with eqiad - T179419 T172767
16:27 mutante: re-generated DNS zones to add new language lfn (Lingua Franca Nova) (T183561 )
12:00 Amir1: ladsgroup@terbium:~$ mwscript extensions/Cognate/maintenance/populateCognatePages.php --wiki hifwiktionary (T180785 )
11:59 Amir1: ladsgroup@terbium:~$ mwscript extensions/Cognate/maintenance/populateCognateSites.php --wiki aawiktionary --site-group wiktionary (T180785 )
11:05 ariel@tin: Finished deploy [dumps/dumps@8c44d0a]: re-enable file size reporting for dump files still being written (duration: 00m 02s)
11:05 ariel@tin: Started deploy [dumps/dumps@8c44d0a]: re-enable file size reporting for dump files still being written
08:34 akosiaris: migrate logstash1007 off ganeti1006 T181121
08:21 akosiaris: migrate logstash1007 off ganeti1006
03:52 madhuvishy: restart pdfrender on scb1002
03:50 madhuvishy: restart pdfrender on scb1004
03:32 madhuvishy: restart pdfrender on scb1003 following icinga page
03:28 madhuvishy: restart pdfrender on scb1001 following icinga page
2017-12-24
16:10 _joe_: restarted pdfrenderer on scb1002
2017-12-23
2017-12-22
18:31 mobrovac@tin: Finished deploy [mathoid/deploy@7c5f8e2]: Better handling of chem rules (codfw only) - T183557 (duration: 02m 45s)
18:28 mobrovac@tin: Started deploy [mathoid/deploy@7c5f8e2]: Better handling of chem rules (codfw only) - T183557
16:42 demon@tin: Pruned MediaWiki: 1.31.0-wmf.11 [keeping static files] (duration: 01m 16s)
16:05 demon@tin: Synchronized README: forcing co-master sync (duration: 00m 51s)
15:56 demon@tin: Synchronized scap/plugins/clean.py: no-op (duration: 00m 51s)
15:52 demon@tin: Pruned MediaWiki: 1.31.0-wmf.10 [keeping static files] (duration: 01m 18s)
15:50 demon@tin: Pruned MediaWiki: 1.31.0-wmf.10 [keeping static files] (duration: 01m 47s)
14:48 bblack: esams TLS cert switch from digicert-2016 to digicert-2017
14:18 mobrovac@tin: Finished deploy [mathoid/deploy@6c29c09]: Update Mathoid to v0.7.0 in CODFW only to prefill storage, take 2 - T179419 T172767 (duration: 04m 23s)
14:13 mobrovac@tin: Started deploy [mathoid/deploy@6c29c09]: Update Mathoid to v0.7.0 in CODFW only to prefill storage, take 2 - T179419 T172767
14:05 mobrovac@tin: Finished deploy [mathoid/deploy@7f39b71]: Update Mathoid to v0.7.0 in CODFW only to prefill storage - T179419 T172767 (duration: 02m 57s)
14:02 mobrovac@tin: Started deploy [mathoid/deploy@7f39b71]: Update Mathoid to v0.7.0 in CODFW only to prefill storage - T179419 T172767
13:37 mobrovac: restbase depool restbase2008 for T179419
13:17 bblack: repooling cp4032 - T183176
06:44 jynus: start manual backup of db1029 onto db1056
2017-12-21
19:25 robh: cp4032 going offline for memory swap
17:24 jynus: starting manual backup of x1-master onto db1055
17:01 volans: debugging Icinga notes_url (no side effect expected but logging it in case there will be) T170353
16:56 volans: restarted ircecho to pick up gerrit/399658
15:43 joal@tin: Finished deploy [analytics/refinery@92f9318]: Deploying for new pageview_top_bycountry job (duration: 04m 40s)
15:39 joal@tin: Started deploy [analytics/refinery@92f9318]: Deploying for new pageview_top_bycountry job
12:39 elukey: restart druid historical/broker to apply new jvm settings to druid public workers (druid100[456] - https://gerrit.wikimedia.org/r/399617 )
12:07 elukey@puppetmaster1001: conftool action : set/pooled=no; selector: name=mw1333.*.eqiad.wmnet
11:43 _joe_: refreshing puppet facts on the puppet compilers
10:54 jynus@tin: Synchronized wmf-config/db-eqiad.php: Remove db1055 & db1056 (duration: 00m 51s)
10:52 jynus@tin: Synchronized wmf-config/db-codfw.php: Remove db1055 & db1056 (duration: 00m 51s)
10:30 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Restore db1082 and db1100 original weight - T161294 (duration: 00m 51s)
10:15 elukey: rolling restart of eventbus on kafka* for openssl security updates
10:03 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase traffic for db1100 and start restoring original weight for db1082 - T161294 (duration: 00m 48s)
09:50 elukey: restart eventbus on kafka2001 for openssl updates
09:39 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase traffic for db1100 - T161294 (duration: 00m 51s)
09:30 elukey: restart zookeeper on conf100[2,3] for jvm updates - T179943
09:22 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1100 - T161294 (duration: 00m 51s)
09:20 elukey: restart zookeeper on conf1001 for jvm updates - T179943
08:51 elukey: run kafka preferred-replica-election after maintenance of kafka1023 (fully bootstrapped now)
08:35 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1100 - T161294 (duration: 00m 51s)
08:34 marostegui: Stop replication in sync on db1100 and dbstore1002 - T161294
08:23 elukey: repool mw1277 after investigation
07:37 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Slowly repool db1100- T161294 (duration: 00m 52s)
07:21 marostegui: Upgrade MariaDB on db1100
07:16 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1109 and db1096:3315 - T161294 (duration: 00m 51s)
06:55 _joe_: restarting hhvm on mw1231 and mw1208
06:50 marostegui: Stop replication in sync on dbstore1002 - db1100 - T161294
06:42 marostegui: Stop replication in sync on db1100 - db2052 - T161294
06:36 marostegui: Remove some old files in dbstore1001:/srv/tmp to address the WARNING alert
06:35 marostegui: Stop replication in sync db1100 and db1071 - T161294
06:33 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1109 - T161294 (duration: 00m 51s)
03:59 mutante: mw1315 kill and restart hhvm | mw1312 stop and start hhvm
03:55 mutante: mw1290 kill and restart hhvm | mw1230 stop and start hhvm
02:20 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.12) (duration: 05m 31s)
01:16 eileen: update CiviCRM civicrm revision changed from 9d6ba74f57 to ffa9d7fc7a , config revision is da6fa4cce9
2017-12-20
23:26 twentyafterfour: restarting apache on phab1001 to deploy a hotfix for T144184
21:42 mepps: updated dash from 114131713e to b01458b260
21:20 otto@tin: Finished deploy [analytics/refinery@548dad7]: deploying refinery v0.0.56 with JsonRefine fixes to allow Popups schema to be refined. This is a no-op for everything else (duration: 04m 51s)
21:15 otto@tin: Started deploy [analytics/refinery@548dad7]: deploying refinery v0.0.56 with JsonRefine fixes to allow Popups schema to be refined. This is a no-op for everything else
20:31 ebernhardson: T183053 update elasticsearch settings for wikidatawiki_content on codfw to use: index.refresh_interval=5s
20:18 XioNoX: remove local-as from cr2-esams IX4
19:46 XioNoX: remove local-as from cr2-esams IX6
17:57 _joe_: depooling mw1277 for further investigation
17:51 demon@tin: Synchronized wmf-config/InitialiseSettings.php: T172875 (second try, forgot to pull first (duration: 00m 51s)
17:48 elukey: new mw jobrunner in production (mw1334) - T165519
17:47 demon@tin: Synchronized wmf-config/InitialiseSettings.php: T172875 (duration: 00m 51s)
17:46 demon@tin: Synchronized wmf-config/Wikibase-production.php: T180614 (duration: 00m 51s)
17:43 elukey: restart zookeeper on conf2003 for jvm updates - T179943
17:42 elukey: restart hhvm on mw1316
17:38 demon@tin: Synchronized wmf-config/InitialiseSettings.php: sandbox link on Atikamekw 'pedia (duration: 00m 52s)
17:38 elukey: restart hhvm on mw1234
17:32 elukey: restart zookeeper on conf2002 for jvm updates - T179943
17:20 demon@tin: Synchronized wmf-config/CommonSettings.php: more n0-0ps (duration: 00m 52s)
17:16 demon@tin: Synchronized multiversion/submodules.json: no-op (duration: 00m 51s)
17:09 demon@tin: Synchronized README: noop (duration: 00m 51s)
16:27 ema: lvs1003: stop pybal, clean ipvs services, start pybal
16:25 ema: lvs1006: stop pybal, clean ipvs services, start pybal
15:54 moritzm: rolling restart of scb* hosts in eqiad to pick up openssl update
15:47 elukey@puppetmaster1001: conftool action : set/pooled=yes; selector: name=mw1334.eqiad.wmnet
15:38 moritzm: rolling restart of remaining scb* hosts in codfw to pick up openssl update
14:26 ema: start pybal on lvs2003
14:24 ema: stop pybal on lvs2003, clean IPVS table after traffic failover to get rid of trendingedits `TCP 10.2.1.9:6699 wrr`
14:14 moritzm: upgrading openssl on wdqs (along with system service restarts)
14:13 ema: bounce pybal on lvs2006 and clean IPVS table
14:06 elukey: temporarily shutdown kafka on kafka1023 to move some topic partitions on different disk partition (disk space usage alerts)
13:49 elukey@puppetmaster1001: conftool action : set/pooled=yes; selector: name=mw1276.eqiad.wmnet
13:38 elukey@puppetmaster1001: conftool action : set/pooled=no; selector: name=mw1276.eqiad.wmnet
13:38 moritzm: upgrading openssl on wdqs (along with system service restarts)
13:35 moritzm: rolling restart of aqs to pick up openssl security update
13:20 moritzm: upgrading openssl on aqs/druid clusters (along with system service restarts)
13:11 jynus: restart dbproxy1008 to test workaround is working on cold restart
13:01 moritzm: upgrading openssl on hadoop cluster (along with system service restarts)
12:34 marostegui: Enable notifications for db1100 - T161294
12:27 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1101:3318 and db1109 - T161294 (duration: 00m 51s)
12:18 moritzm: installing iproute2 bugfix update from stretch point relesae
11:50 godog: create k8s-staging LVs in prometheus/eqiad - T163692
11:37 moritzm: upgrading openssl on elastic* (along with system service restarts)
11:27 akosiaris: repool ganeti1006, rebalance row_A ganeti nodegroup. T181121
11:18 jynus: restart dbproxy1005 to test cold service start
11:11 marostegui: Stop replication in sync on db1100 and db1071 - T161294
10:57 arturo: remove old kernel packages from silver.wikimedia.org to free space
09:56 jynus: restart dbproxy1001 to test cold service start
09:39 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1105:3311 - T174569 (duration: 00m 51s)
09:24 jynus: disable puppet on dbproxies for gerrit:399359 deployment
08:44 moritzm: powercycling ganeti1005
08:30 moritzm: installing rsync security updates
08:24 jynus: starting reimage of dbproxy1008
08:11 marostegui: Stop replication in sync on db1100 and dbstore1002 - T161294
07:56 jynus: starting reimage of dbproxy1007
07:35 jynus: starting reimage of dbproxy1005
07:24 jynus: upgrading and restarting dbproxy1004
07:09 marostegui: Stop replication in sync on db1096:3315 and db1100 - T161294
07:08 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1096:3315 - T161294 (duration: 00m 51s)
06:53 marostegui: Stop replication in sync on db1101:3318 and db1109 - T161294
06:53 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1099:3318 depool db1101:3318 - T161294 (duration: 00m 51s)
06:19 marostegui: Deploy schema change on db1105:3311 - T174569
06:18 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1105:3311 - T174569 (duration: 00m 52s)
06:06 Jamesofur: reset mailman password for tawikisource T183329
02:20 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.12) (duration: 05m 45s)
2017-12-19
21:51 XioNoX: removing local-as AS43821 from ams transits - T167840
19:49 mutante: deleted ganglia.wikimedia.org from DNS - webserver was already down since yesterday - not used anymore (T177225 )
19:23 mutante: webperf1001/webperf2001 - rebooting for kernel upgrades (not used yet)
19:18 mutante: gerrit2001 - reboot for kernel upgrade
18:14 moritzm: installing zsh update from stretch point release
17:11 jynus: purging ferm from dbproxy1002, 3, 6, 9, 10 and 11
17:00 moritzm: installing libxv security updates on jessie
16:54 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1097:3315 - T161294 (duration: 00m 51s)
16:45 marostegui: Defragment s7 databases on db1102 -Â https://phabricator.wikimedia.org/T172169
16:38 awight@tin: Synchronized wmf-config/CirrusSearch-labs.php: wmf-config/CommonSettings-labs.php wmf-config/db-labs.php wmf-config/InitialiseSettings-labs.php wmf-config/interwiki-labs.php wmf-config/jobqueue-labs.php wmf-config/mc-labs.php wmf-config/mobile-labs.php wmf-config/Wikibase-labs.php Sync out labs config changes (duration: 00m 51s)
16:34 elukey: manually started eventlogging cleaner on db1107 to purge/sanitize data up to 90 days ago (tmux is running for user eventlogcleaner) - T108850
16:22 moritzm: installing ncurses updates from jessie point release
15:26 jgleeson: switched back on donations queue consumer and thank you mailer service
15:25 chasemp: labvirt10[19|20] aptitude install linux-image-4.4.0-81-generic linux-image-extra-4.4.0-81-generic; sudo update-grub; /sbin/reboot T172538
15:10 marostegui: Stop replication in sync on db1109 and db1099:3318 - https://phabricator.wikimedia.org/T161294
15:10 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1099:3318 and db1109 - T161294 (duration: 00m 51s)
15:09 jgleeson: updated civicrm from e0ee2d189c to 9d6ba74f57
15:04 jgleeson: switched off donations queue consumer and thank you mailer service in preparation for new release
15:01 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1106 - T161294 (duration: 00m 52s)
14:49 moritzm: restarting hhvm on canary app servers to pick up security updates for openssl, icu and libx11
14:44 moritzm: installing request-tracker4 update from jessie point release on ununpentium
14:28 jynus: disabling puppet on dbproxies for 399164 deploy
13:57 moritzm: upgrading pdns-recursor on achernar/acamar to 4.0.4+deb9u3~bpo8+1 (security fix)
13:13 jmm@puppetmaster1001: conftool action : set/pooled=yes; selector: mw1317.eqiad.wmnet
13:05 jmm@puppetmaster1001: conftool action : set/pooled=yes; selector: mw1318.eqiad.wmnet
13:05 elukey@puppetmaster1001: conftool action : set/pooled=no; selector: name=mw133[0-1].eqiad.wmnet
12:12 hashar: CI: switching mwgate-composer-php70 job from Nodepool to Docker | https://gerrit.wikimedia.org/r/#/c/398921/
11:59 hashar: CI: switching composer-php55 / composer-package-php55 jobs from Nodepool to Docker | https://gerrit.wikimedia.org/r/#/c/398920/
11:29 moritzm: upgrading pdns-recursor on maerlant to 4.0.4+deb9u3~bpo8+1 (security fix)
11:28 hashar@tin: Finished deploy [docker-pkg/deploy@09087ad]: Bumping Jinja2 2.9.6..2.10 (duration: 00m 30s)
11:28 hashar@tin: Started deploy [docker-pkg/deploy@09087ad]: Bumping Jinja2 2.9.6..2.10
11:16 moritzm: uploaded pdns-recursor 4.0.4+deb9u3~bpo8+1 to apt.wikimedia.org
10:54 ariel@tin: Finished deploy [dumps/dumps@2bafffe]: allow dump runs in specified wiki list order, rather than by longest to wait (duration: 00m 02s)
10:54 ariel@tin: Started deploy [dumps/dumps@2bafffe]: allow dump runs in specified wiki list order, rather than by longest to wait
10:52 moritzm: upgrading pdns-recursor on nescio to 4.0.4+deb9u3~bpo8+1 (security fix)
10:47 elukey: restart zookeeper on conf2001 for jvm updates - T179943
10:45 jynus: disabling puppet on dbproxies for 398450 deploy
10:38 godog: rollout updated version of prometheus-nutcracker-exporter
09:12 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1097:3315 - T161294 (duration: 00m 51s)
08:56 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1106 - T161294 (duration: 00m 51s)
08:35 moritzm: reimaging mw1317 (video scaler) to stretch
08:28 marostegui: Stop replication in sync on db2045 and db1109 - T161294
08:21 moritzm: installing openssl security updates
08:05 jmm@puppetmaster1001: conftool action : set/pooled=yes; selector: mw2246.codfw.wmnet
08:05 jmm@puppetmaster1001: conftool action : set/pooled=yes; selector: mw2119.codfw.wmnet
07:00 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1106 - T161294 (duration: 00m 51s)
06:53 mobrovac@tin: Finished deploy [restbase/deploy@2b75a64]: Bug fix: Add the time_to_live config option to the Parsoid module (duration: 04m 26s)
06:51 marostegui: Stop replication in sync on db1106 and db2052 - T161294
06:49 mobrovac@tin: Started deploy [restbase/deploy@2b75a64]: Bug fix: Add the time_to_live config option to the Parsoid module
06:40 marostegui: Stop replication in sync on db1106 and dbstore1002 s5 - T161294
06:29 marostegui: Stop replication in sync on db1100 and db1106 - T161294
06:26 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1106 - T161294 (duration: 00m 53s)
06:09 marostegui: Deploy schema change on db1065 (s1 sanitarium master) with replication, so some lag will be generated on labs - T174569
05:18 andrewbogott: restarting slapd on seaborgium (in response to ldap complaints on the grid master)
02:24 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.12) (duration: 05m 22s)
00:44 mutante: einsteinium: sudo systemctl restrart ircecho (alias kick-icinga-wm)
2017-12-18
22:34 ejegg: updated payments-wiki from f594dfa763 to e91db27108
21:00 mutante: uranium - apt-get remove ganglia-webfrontend, apache2
20:53 mutante: ganglia.wikimedia.org shut down just now after a deprecation period - service is out of commission - T177225
20:53 chasemp: reboot labtestvirt2003
20:49 mutante: install1002/2002 - killing all ganglia processes, decoming aggregators
20:48 bawolff@tin: Synchronized php-1.31.0-wmf.12/extensions/TemplateData/TemplateDataBlob.php: T118682 (duration: 00m 52s)
19:49 robh@puppetmaster1001: conftool action : set/pooled=no; selector: name=cp4032.ulsfo.wmnet
18:42 moritzm: installing xml2 updates from stretch point release
18:28 moritzm: installing libxkbcommon updates from stretch point release
18:11 moritzm: installing python updates from stretch point release
17:51 elukey: run kafka preferred-replica-election on the analytics cluster to allow kafka1023 (new node) to become a partition leader
16:23 demon@tin: Pruned MediaWiki: 1.31.0-wmf.11 [keeping static files] (duration: 01m 18s)
16:17 demon@tin: Pruned MediaWiki: 1.31.0-wmf.8 (duration: 04m 58s)
16:15 elukey@puppetmaster1001: conftool action : set/pooled=inactive; selector: name=mw133[0-7].eqiad.wmnet
16:09 thcipriani@tin: Synchronized README: noop sync to test scap 3.7.4-3 (duration: 03m 02s)
16:01 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1099:3311 - T174569 (duration: 03m 03s)
15:47 marostegui: Stop MySQL on db1111 to copy its content to db1112 - T180788
15:45 jynus: stop and upgrade db1107 T183123
15:37 marostegui: Stop db1100 and dbstore1002 in sync - T161294
15:28 elukey@puppetmaster1001: conftool action : set/pooled=no; selector: name=mw1329.eqiad.wmnet
15:23 moritzm: uploaded prometheus-blazegraph-exporter, prometheus-wdqs-updater-exporter and prometheus-pdns-exporter to apt.wikimedia.org
15:16 chasemp: reboot labtestvirt2003
14:41 ema: upgrade pinkunicorn to latest jessie point release (8.10) T182656
14:13 elukey: temporarily stopped mysql consumers on eventlog1001 to ease a mysql backup on db1107 - T183123
13:58 jynus: starting one-time backup of eventlogging database on db1107:/srv/backups T183123
13:29 marostegui: Stop replicaiton in sync on db1109 and db2045 - T161294
13:25 jmm@puppetmaster1001: conftool action : set/pooled=yes; selector: mw1307.eqiad.wmnet
13:25 jmm@puppetmaster1001: conftool action : set/pooled=yes; selector: mw1313.eqiad.wmnet
13:21 jmm@puppetmaster1001: conftool action : set/pooled=no; selector: mw1313.eqiad.wmnet
13:20 jmm@puppetmaster1001: conftool action : set/pooled=yes; selector: mw1260.eqiad.wmnet
13:20 jmm@puppetmaster1001: conftool action : set/pooled=no; selector: mw1260.eqiad.wmnet
13:18 marostegui: Stop replication in sync on db1100 and db2052 - T161294
13:10 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1106 - T161294 (duration: 03m 06s)
13:00 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1106 - T161294 (duration: 03m 03s)
12:58 marostegui: Stop replication in sync on db1106 and db1100 - T161294
12:35 marostegui: Deploy schema change on db1099:331 and db1067 - T174569
12:34 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1099:3311 - T174569 (duration: 03m 05s)
12:25 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1066 - T174569 (duration: 03m 06s)
12:20 akosiaris: build scap 3.7.4-3 and upload to jessie-wikimedia, stretch-wikimedia, trusty-wikimedia. T183046 , T182347
11:37 hashar: restarting Jenkins CI to upgrade the monitoring plugin
11:24 _joe_: ran cleanup script on scb* T180384
11:20 mobrovac: stopping the trending edits service - T180384
11:18 moritzm: reimaging mw1307 (video scaler) to stretch
11:08 _joe_: rolling restart of pybal on the low-traffic balancers
11:04 _joe_: disabled notifications for trendingedits.svc T180384
10:46 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1109 - T161294 (duration: 00m 56s)
10:30 marostegui: Stop replication on db1109 and db2045 in sync - T161294
10:30 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1109 - T161294 (duration: 00m 56s)
09:47 marostegui: Deploy schema change on db1066 - T174569
09:47 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1066 - T174569 (duration: 00m 56s)
09:43 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1073 - T174569 (duration: 00m 57s)
09:40 moritzm: installing openssl security updates
09:39 marostegui: Deploy schema change on db1055 (already depooled) - T174569
09:28 gehel: removing initial import datafiles from maps[12]001
08:58 marostegui: Stop replication in sync on db1100 and db2052 - T161294
08:57 elukey: rolling restart of the Yarn nodemanagers (hadoop) on analytics10[456]* to pick up new settings - T182276
08:30 Jamesofur: insert decryption key for 2017 Arb elections
08:08 volans: powercycling ganeti1005
06:57 _joe_: reeanbling puppet across servers with scap
06:44 marostegui: Defragment s2 databases on db1102 - T172169
06:43 _joe_: restarted hhvm on mw1283, still the same kind of lockups
06:33 marostegui: Deploy schema change on db1073 - T174569
06:32 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1073 - T174569 (duration: 00m 57s)
06:28 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1106 - T161294 (duration: 00m 56s)
06:17 marostegui: Stop replication in sync on db1106 and db1100 - T161294
06:17 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1106 - T161294 (duration: 00m 57s)
03:02 eileen: re-enable multiqueue_consumer process-control config revision is 1ae3778278
03:00 eileen: civicrm revision changed from 43d3f4d739 to e0ee2d189c
02:32 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.12) (duration: 05m 57s)
01:17 cwd: disabled fredge multiqueue consumer
00:49 eileen: civicrm revision changed from 798e24671b to 43d3f4d739
2017-12-16
17:27 no_justification: gerrit: Back, might see a few transient puppet failures if git pulls happened during the d/t, but should all recover
17:21 no_justification: gerrit: halting service momentarily for account reindexing
15:00 jynus@tin: Synchronized wmf-config/db-eqiad.php: Depool db1100 (duration: 00m 58s)
12:29 ariel@tin: Finished deploy [dumps/dumps@95dbfe6]: revert previous deploy (duration: 00m 02s)
12:29 ariel@tin: Started deploy [dumps/dumps@95dbfe6]: revert previous deploy
12:16 ariel@tin: Finished deploy [dumps/dumps@faf7de8]: use cat to recombine gzipped files (duration: 00m 02s)
12:16 ariel@tin: Started deploy [dumps/dumps@faf7de8]: use cat to recombine gzipped files
02:04 mutante: labweb1002 - manually downgrade to scap 3.7.4-1 (disabled puppet)
01:49 mutante: reimported scap 3.7.4-1 into APT (jessie-wikimedia) after fixing md5/sha sums in .dsc and .changes files to match orig.tar.gz | copied it from jessie-wikimedia to trusty and stretch-wikimedia. all distributions downgraded to 3.7.4-1 (T183046 )
01:42 mutante: reimported scap 3.7.4-1 into APT (jessie-wikimedia) after fixing md5/sha sums in .dsc and .changes files to match orig.tar.gz
01:15 mutante: reprepro removing scap 3.7.4-2 package, attempting to reimport 3.7.4-1 package
01:11 mutante: reprepro remove trusty-wikimedia scap
00:44 demon@tin: Synchronized php-1.31.0-wmf.12/extensions/LoginNotify/includes/LoginNotify.php: T182867 (duration: 00m 57s)
00:40 demon@tin: Synchronized README: Testing again, this time with feeling (duration: 00m 56s)
00:28 mutante: re-disabled puppet on labweb1001/labweb1002 (as it was before)
00:26 mutante: re-enabling puppet on scap hosts
00:25 demon@tin: Synchronized README: Testing (duration: 00m 57s)
00:10 mutante: no more scap 3.7.4-2 found across 'R:Package = scap' (T183046 )
2017-12-15
23:55 mutante: downgrading scap from 3.7.4-2 to 3.7.4-1 where it is installed - cumin -b 10 -s 5 'R:Package = scap' 'if dpkg -l scap | grep "3.7.4.2" && file /var/cache/apt/archives/scap_3.7.4-1_all.deb; then puppet agent --disable; apt-get remove --yes -q scap ; dpkg -i /var/cache/apt/archives/scap_3.7.4-1_all.deb ; fi' targeting 478 hosts (T183046 )
23:37 mutante: aqs1004, analytics1003, downgraded scap to 3.7.4-1
22:55 mutante: tin - apt-get remove scap ; dpkg -i /var/cache/apt/archives/scap_3.7.4-1_all.deb
19:39 chasemp: reboot labtestvirt2003
18:15 jgleeson: rolled back to civicrm to 798e2467 to investigate prometheus bug
17:32 jynus: stop, upgrade and reboot labsdb1009
17:18 jynus: reloading dbproxy1010
16:44 chasemp: labtestvirt2003:~# /sbin/reboot to pickup new kernel
16:23 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Restore db1084 original weight (duration: 00m 57s)
16:10 elukey: re-enable piwik on bohrium after mysql backup restore
15:11 chasemp: reimage labtestvirt2003.codfw.wmnet
13:47 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase traffic for db1084 and restore original weight for db1081 (duration: 00m 56s)
13:33 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase traffic for db1084 and db1081 (duration: 00m 56s)
13:20 chasemp: disable puppet across eqiad lab* things to land a bit of code gracefully
13:10 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase traffic for db1084 (duration: 00m 56s)
12:47 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1084 with low weight - T180788 (duration: 00m 57s)
10:59 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1080 - T174569 (duration: 00m 56s)
10:59 jynus: reloading dbproxy1011
10:50 marostegui: Stop MySQL on db1084 to clone db1111 - T180788
10:49 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1084 to clone db1111 - T180788 (duration: 00m 56s)
10:31 elukey: rolling restart of yarn nodemanagers on an103* to apply new config - T182276
10:14 moritzm: uploaded prometheus-dns-rec-exporter 0.3 to apt.wikimedia.org
09:50 elukey: restore piwik database on bohrium after mysql corruption - piwik disabled
09:26 marostegui: Deploy schema change on db1080 - T174569
09:26 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1080 - T174569 (duration: 00m 56s)
09:10 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1083 - T174569 (duration: 00m 56s)
08:50 moritzm: powercycling ganeti1005
08:38 marostegui: Stop MySQL on db1034 to decommission it - T182556
08:26 marostegui: Remove db1034 from tendril - T182556
06:50 marostegui: Deploy schema change on db1083 - T174569
06:49 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1083 - T174569 (duration: 00m 56s)
06:43 marostegui: Deploy schema change on dbstore1001 (s1) - T174569
06:29 marostegui: Fix dbstore1001 s5 replication
06:26 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1101:3318 and db1109 - T161294 (duration: 01m 16s)
03:01 mutante: db2016 thru db2019 - had to manually kill gmond process to decom ganglia, other db codfw hosts: didnt need it | running puppet on db205* and others in codfw to remove all ganglia (T177225 )
01:23 catrope@tin: Synchronized php-1.31.0-wmf.12/extensions/ORES: Fix broken join conditions (T182936 ) (duration: 00m 57s)
01:22 hoo: Updated the Wikidata property suggester with data from Monday's JSON dump and applied the T132839 workarounds
01:19 catrope@tin: Synchronized wmf-config/InitialiseSettings.php: Don't discount file searches on commonswiki (duration: 00m 57s)
01:11 catrope@tin: Synchronized php-1.31.0-wmf.11: Pulling in today's cherry-picks into wmf.11 too (duration: 10m 49s)
00:44 catrope@tin: Synchronized php-1.31.0-wmf.12/resources/src/mediawiki.rcfilters/ui/mw.rcfilters.ui.MenuSelectWidget.js: T182711 (duration: 00m 56s)
00:33 catrope@tin: Synchronized php-1.31.0-wmf.12/extensions/VisualEditor/modules/ve-mw/init/ve.init.mw.ArticleTargetEvents.js: Track editor mode on save events (T182610 ) (duration: 00m 56s)
00:25 catrope@tin: Synchronized php-1.31.0-wmf.12/resources/lib/oojs-ui/oojs-ui-core.js: Backport OOjs UI fix for T182359 , T182395 (duration: 00m 57s)
00:14 RoanKattouw: updateCollation.php finished on sewiki (T181503 )
00:13 RoanKattouw: Running updateCollation.php on sewiki
00:13 catrope@tin: Synchronized wmf-config/InitialiseSettings.php: Set category collation for sewiki to uppercase-se (duration: 00m 57s)
2017-12-14
22:58 demon@tin: rebuilt and synchronized wikiversions files: group2 to wmf.12 (#2)
22:47 demon@tin: Synchronized php-1.31.0-wmf.12/extensions/ORES/includes/Hooks/ApiHooksHandler.php: fix undefined property (duration: 01m 08s)
22:17 demon@tin: rebuilt and synchronized wikiversions files: rollback, ORES breaking stuff on enwiki
22:08 demon@tin: rebuilt and synchronized wikiversions files: group2 to wmf.12
21:53 ejegg: updated CiviCRM from 1086291bb3 to 41a23f9fc3
20:33 awight: stress testing ores*
20:30 demon@tin: Pruned MediaWiki: 1.31.0-wmf.10 [keeping static files] (duration: 03m 32s)
20:28 jgleeson: turned back on donations queue consume service and thank you mailer service
20:21 awight@tin: Started restart [ores/deploy@b67bba7]: (non-production) Restart ORES services on ores*
20:19 thcipriani@tin: Finished scap: SWAT: Update en/i18n message for multiple-unclosed-formatting-tags (duration: 30m 55s)
20:02 jgleeson: updated civicrm from 798e24671b to 1086291bb3
{{safesubst:SAL entry|1=20:02 urandom: lowering cassandra compaction throughput to 5MB/s, restbase101{2,4}-{a,b,c}}}
19:58 jgleeson: Temporarily disabled donations queue consume service and thank you mailer service
19:48 thcipriani@tin: Started scap: SWAT: Update en/i18n message for multiple-unclosed-formatting-tags
19:43 thcipriani@tin: Synchronized wmf-config/throttle.php: SWAT: Define new throttle rule and cleaning expired rules T182889 (duration: 01m 08s)
19:29 thcipriani@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Set $wgNamespaceRobotPolicies for wikidata T181525 (duration: 01m 04s)
19:20 thcipriani@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Remove single editor tab for plwiki T181045 (duration: 01m 09s)
19:02 awight@tin: Finished deploy [ores/deploy@b67bba7]: Redeploy ORES to scb1001 (duration: 01m 04s)
19:01 awight@tin: Started deploy [ores/deploy@b67bba7]: Redeploy ORES to scb1001
19:01 bsitzmann@tin: Finished deploy [mobileapps/deploy@bf85a55]: Update mobileapps to ff74bb1 (T182868 T182774 ) (duration: 12m 26s)
18:54 awight@tin: Started restart [ores/deploy@b67bba7]: Restart ORES services
18:51 arlolra: Updated Parsoid to ca20680 (T182793 , T182774 )
18:48 bsitzmann@tin: Started deploy [mobileapps/deploy@bf85a55]: Update mobileapps to ff74bb1 (T182868 T182774 )
18:41 arlolra@tin: Finished deploy [parsoid/deploy@13b5cb5]: (no justification provided) (duration: 09m 19s)
18:32 arlolra@tin: Started deploy [parsoid/deploy@13b5cb5]: (no justification provided)
18:24 elukey: replace kafka1018 with kafka1023 (Analytics Kafka cluster)
18:11 awight@tin: Started deploy [ores/deploy@b67bba7]: Update ORES service to b67bba77acb
16:34 bd808: Scholarships: Set application start and close dates via web UI (T181072 )
16:25 niharika29@tin: Finished deploy [scholarships/scholarships@872381d]: Deploy wikimania scholarships app for 2018 T181072 (duration: 00m 02s)
16:25 niharika29@tin: Started deploy [scholarships/scholarships@872381d]: Deploy wikimania scholarships app for 2018 T181072
16:07 bd808: Scholarships: updated database schema with 20171212-add-scholarship-orgs-field.sql (T181072 )
15:48 marostegui: Deploy schema change on dbstore1002 (s1) - T174569
15:25 jynus: stop, upgrade and restart labsdb1011
15:07 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1089 - T174569 (duration: 01m 08s)
14:21 hashar@tin: Synchronized php-1.31.0-wmf.12/resources/src/mediawiki.rcfilters: Swat for RCFilters https://gerrit.wikimedia.org/r/#/c/398242/ https://gerrit.wikimedia.org/r/#/c/398239/ (duration: 01m 09s)
13:47 chasemp: I'm reverting https://gerrit.wikimedia.org/r/#/c/394541/ as it broke the database puppet for labservices (used for powerdns backend)
13:42 godog: upgrade grafana to 4.6.3 - T182294
13:41 elukey: update facts for puppet compiler to pick up new hosts
13:32 mobrovac@tin: Finished deploy [restbase/deploy@187d8ba]: Remove Trending Edits end point and stop storing feed results in Cassandra - T180384 T179412 (duration: 05m 37s)
13:26 mobrovac@tin: Started deploy [restbase/deploy@187d8ba]: Remove Trending Edits end point and stop storing feed results in Cassandra - T180384 T179412
13:10 marostegui: Deploy schema change on db1089 (s1) - T174569
13:01 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1089 - T174569 (duration: 01m 07s)
12:56 awight: stress testing on ores1*.eqiad.wmnet cluster, T182249
12:42 awight@tin: Finished deploy [ores/deploy@b67bba7]: (non-production) Update ORES on new cluster (duration: 00m 59s)
12:41 awight@tin: Started deploy [ores/deploy@b67bba7]: (non-production) Update ORES on new cluster
12:40 awight@tin: Finished deploy [ores/deploy@b67bba7]: (non-production) Update ORES on new cluster (duration: 00m 45s)
12:39 awight@tin: Started deploy [ores/deploy@b67bba7]: (non-production) Update ORES on new cluster
12:18 gehel: re-initialize cassandra on maps-test2001 - T182583
12:11 jynus: disable puppet on all databases to deploy safely https://gerrit.wikimedia.org/r/398246
12:00 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1091 - T174569 (duration: 01m 08s)
11:27 awight@tin: Finished deploy [ores/deploy@b67bba7]: (non-production) Update ORES on new cluster (duration: 00m 20s)
11:27 awight@tin: Started deploy [ores/deploy@b67bba7]: (non-production) Update ORES on new cluster
11:24 awight@tin: Finished deploy [ores/deploy@b67bba7]: (non-production) Update ORES on new cluster (duration: 00m 05s)
11:23 awight@tin: Started deploy [ores/deploy@b67bba7]: (non-production) Update ORES on new cluster
11:23 awight@tin: Finished deploy [ores/deploy@b67bba7]: (non-production) Update ORES on new cluster (duration: 00m 11s)
11:23 awight@tin: Started deploy [ores/deploy@b67bba7]: (non-production) Update ORES on new cluster
11:16 akosiaris@tin: Finished deploy [ores/deploy@b67bba7]: T181661 (duration: 00m 03s)
11:16 akosiaris@tin: Started deploy [ores/deploy@b67bba7]: T181661
11:14 akosiaris@tin: Finished deploy [ores/deploy@b67bba7]: T181661 (duration: 00m 03s)
11:14 akosiaris@tin: Started deploy [ores/deploy@b67bba7]: T181661
11:13 akosiaris@tin: Finished deploy [ores/deploy@b67bba7]: T181661 (duration: 00m 55s)
11:12 akosiaris@tin: Started deploy [ores/deploy@b67bba7]: T181661
09:27 jynus@tin: Synchronized wmf-config/db-eqiad.php: Repool db1067 (duration: 01m 08s)
08:39 marostegui: Reload dbproxy1011 config
07:18 marostegui: Stop replication and set read-only on labsdb1003 - T142807
07:08 marostegui: Stop replication in sync on db1109 and db1101:3318 - T161294
07:08 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1101:3318 and db1109 - T161294 (duration: 01m 07s)
07:02 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1101:3318 and db1109 - T161294 (duration: 01m 08s)
06:55 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1101:3318 and db1109 - T161294 (duration: 01m 09s)
06:35 marostegui: Deploy schema change on db1091 (s4) - T174569
06:35 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1091 - T174569 (duration: 01m 08s)
02:26 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.11) (duration: 08m 05s)
01:46 twentyafterfour: no phabricator deployment tonight, not enough time to prepare and test the update due to a short outage earlier this evening.
00:12 catrope@tin: Synchronized wmf-config/InitialiseSettings.php: T182534 (duration: 01m 08s)
2017-12-13
23:54 twentyafterfour: restarted phd on phab1001 (for good measure)
23:23 reedy@tin: Synchronized php-1.31.0-wmf.12/extensions/Flow/Hooks.php: unbreak CheckUser (duration: 01m 08s)
23:22 foks: deleted 22 illegal images from server
22:51 volans: restarting apache2 on phab1001, phabricator timing out
22:23 ppchelko@tin: Finished deploy [cpjobqueue/deploy@044cd23]: Fix sha1-based deduplication (duration: 00m 34s)
22:23 ppchelko@tin: Started deploy [cpjobqueue/deploy@044cd23]: Fix sha1-based deduplication
22:03 mholloway-shell@tin: Finished deploy [mobileapps/deploy@e62d8e3]: Update mobileapps to ddddebb (duration: 05m 24s)
21:58 mholloway-shell@tin: Started deploy [mobileapps/deploy@e62d8e3]: Update mobileapps to ddddebb
21:54 ejegg: updated civicrm from 85a8526eb8 to 798e24671b
21:37 mholloway-shell@tin: Finished deploy [mobileapps/deploy@b8082da]: Update mobileapps to bf67c97 (duration: 04m 19s)
21:33 mholloway-shell@tin: Started deploy [mobileapps/deploy@b8082da]: Update mobileapps to bf67c97
21:24 ppchelko@tin: Finished deploy [restbase/deploy@a993556]: Do not fallback if the revision is not specified T182770 (duration: 04m 04s)
21:20 ppchelko@tin: Started deploy [restbase/deploy@a993556]: Do not fallback if the revision is not specified T182770
20:27 demon@tin: Synchronized php-1.31.0-wmf.12/extensions/WikimediaEvents/extension.json: James_F made me do it (duration: 01m 08s)
20:14 demon@tin: rebuilt and synchronized wikiversions files: group1 to wmf.12
20:10 demon@tin: Synchronized php: symlink bump for wmf.12 (duration: 01m 07s)
19:37 ebernhardson@tin: Synchronized php-1.31.0-wmf.12/extensions/VisualEditor/modules/ve-mw/init/ve.init.mw.trackSubscriber.js: SWAT: VE trackSubscriber: Add timing data for 'loaded' state (duration: 01m 07s)
19:35 ppchelko@tin: Finished deploy [restbase/deploy@3f4bedc]: Remove references to Cassandra 2 from Parsoid storage T179417 (duration: 04m 43s)
19:34 ebernhardson@tin: Synchronized php-1.31.0-wmf.12/resources/src/mediawiki.rcfilters/mw.rcfilters.UriProcessor.js: SWAT: T182734 : RCLFilters: support target page with a subpage (duration: 01m 07s)
19:30 ppchelko@tin: Started deploy [restbase/deploy@3f4bedc]: Remove references to Cassandra 2 from Parsoid storage T179417
19:25 ebernhardson@tin: Synchronized php-1.31.0-wmf.12/extensions/VisualEditor/modules/ve-mw/init/ve.init.mw.trackSubscriber.js: SWAT: VE trackSubscriber: data isn't required (duration: 01m 08s)
19:14 ebernhardson@tin: Synchronized php-1.31.0-wmf.12/resources/src/mediawiki.rcfilters/: SWAT: T182788 : RCFilters: Fix live update (duration: 01m 08s)
19:07 ebernhardson@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Turn on cirrus MLR for 4 more wikis (duration: 01m 09s)
19:05 mutante: phab[12]001,mx[12]001,mendelevium,fermium: rm /usr/local/bin/exim-to-gmetric and remove root's crontab lines to follow-up gerrit:382916
18:31 mutante: releases2001: /srv/mediawiki# rm -rf extensions/ skins/ vendor/ | clean up removed repos, let puppet clone, to match releases1001 and fix puppet run
17:33 awight@tin: Finished deploy [ores/deploy@b67bba7]: (non-production) Update ORES on new cluster (duration: 00m 59s)
17:32 awight@tin: Started deploy [ores/deploy@b67bba7]: (non-production) Update ORES on new cluster
17:30 moritzm: installing wireshark security updates
16:30 marostegui: Deploy schema change on s4 on dbstore1001 - T174569
16:19 godog: try again deleting obsolete cassandra metrics from graphite2002 - T181964
15:50 akosiaris@tin: Finished deploy [ores/deploy@b4f2b02]: (no justification provided) (duration: 02m 22s)
15:48 akosiaris@tin: Started deploy [ores/deploy@b4f2b02]: (no justification provided)
15:47 akosiaris@tin: Finished deploy [ores/deploy@b4f2b02]: (no justification provided) (duration: 01m 30s)
15:46 akosiaris@tin: Started deploy [ores/deploy@b4f2b02]: (no justification provided)
15:43 marostegui@tin: Synchronized wmf-config/db-codfw.php: Remove db1034 from config - T182556 (duration: 01m 07s)
15:42 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Remove db1034 from config - T182556 (duration: 01m 09s)
15:39 akosiaris@tin: Finished deploy [ores/deploy@b4f2b02]: (no justification provided) (duration: 04m 46s)
15:34 akosiaris@tin: Started deploy [ores/deploy@b4f2b02]: (no justification provided)
15:21 chasemp: remove and purge vblade-persist and runit from labstore1004 T182781
15:02 jynus: upgrade and restart db1067
15:02 jynus@tin: Synchronized wmf-config/db-eqiad.php: Depool db1067 (duration: 01m 07s)
14:43 zeljkof: EU SWAT finished
14:38 zfilipin@tin: Synchronized portals: SWAT: Bumping portals to master (T128546) (duration: 01m 09s)
14:37 zfilipin@tin: Synchronized portals/prod/wikipedia.org/assets: SWAT: Bumping portals to master (T128546) (duration: 01m 08s)
14:29 dcausse@tin: Synchronized php-1.31.0-wmf.12/extensions/Wikibase: T182293 Extract names of search fields as constants (duration: 02m 05s)
14:22 dcausse@tin: Synchronized wmf-config/InitialiseSettings.php: T182293 [cirrus] tune wikidata similarity configuration 2/2 (duration: 01m 07s)
14:20 dcausse@tin: Synchronized wmf-config/Wikibase.php: T182293 [cirrus] tune wikidata similarity configuration 1/2 (duration: 01m 12s)
14:03 akosiaris: gnt-node remove ganeti1006 T181121
14:01 elukey: restart Yarn nodemanagers on analytics102[8,9] to apply new settings - T182276
13:21 moritzm: uploaded prometheus-pdns-rec-exporter 0.2-1 to apt.wikimedia.org
13:10 marostegui: Deploy schema change on s4 - dbstore1002 - T174569
12:53 marostegui: Deploy alter table on s4 db1064 (sanitarium master) with replication, this will generate lag on labs replicas - T174569
12:53 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1103:3314 - T174569 (duration: 01m 36s)
12:19 moritzm: uploaded prometheus-ircd-exporter and prometheus-pdns-rec-exporter to apt.wikimedia.org
11:59 elukey: forced remount of /mnt/hdfs after OOM event on stat1005
11:41 akosiaris: empty ganeti1006 for reimage after ltpstress and bios upgrades T181121
11:16 mobrovac@tin: Finished deploy [graphoid/deploy@7979a40]: Update to service-template-node v0.5.4 (duration: 03m 56s)
11:12 mobrovac@tin: Started deploy [graphoid/deploy@7979a40]: Update to service-template-node v0.5.4
10:19 mobrovac@tin: Finished deploy [recommendation-api/deploy@ac66089]: Update to service-template-node v0.5.4 (duration: 02m 20s)
10:17 mobrovac@tin: Started deploy [recommendation-api/deploy@ac66089]: Update to service-template-node v0.5.4
09:41 godog: upload prometheus-elasticsearch-exporter to jessie-wikimedia - T181627
08:38 jynus: migrate away from ganeti1008
06:26 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1103:3314 - T174569 (duration: 01m 11s)
06:25 marostegui: Deploy schema change on db1103:3314 - T174569
06:12 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1097:3314 - T174569 (duration: 01m 08s)
02:26 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.11) (duration: 07m 02s)
00:49 ebernhardson@tin: Synchronized php-1.31.0-wmf.11/extensions/WikimediaEvents/: SWAT: T182616 : turn on second mlr ab test for hewiki (duration: 01m 08s)
00:44 ebernhardson@tin: Synchronized php-1.31.0-wmf.12/extensions/WikimediaEvents/: SWAT: T182616 : turn on second mlr ab test for hewiki (duration: 01m 08s)
00:40 ebernhardson@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Turn on cirrus MLR for most wikis with >1% of search traffic (duration: 01m 08s)
00:35 mholloway-shell@tin: Finished deploy [mobileapps/deploy@5832a8c]: Update mobileapps to bfc3588 (duration: 04m 48s)
00:32 brion: restarted requeueTranscodes.php on terbium for mp3 audio generation backfill (had dropped DB connection)
00:31 mholloway-shell@tin: Started deploy [mobileapps/deploy@5832a8c]: Update mobileapps to bfc3588
00:05 ebernhardson@tin: Synchronized wmf-config/: SWAT: T182616 : Setup MLR AB test for hewiki (duration: 01m 10s)
2017-12-12
22:30 mholloway-shell@tin: Finished deploy [mobileapps/deploy@ea8f05d]: Update mobileapps to 94f267b (duration: 05m 10s)
22:24 mholloway-shell@tin: Started deploy [mobileapps/deploy@ea8f05d]: Update mobileapps to 94f267b
22:02 urandom: setting compaction throughput to 5 MB/s, restbase1010
21:31 mutante: releases1001 - rm mediawiki core repo and let puppet try to recreate it (follow-up issue after gerrit:397891)
21:28 mobrovac@tin: Finished deploy [restbase/deploy@dceab2e]: Switch to Parsoid content v1.6.0 and switch to Cassandra 3 storage - T179417 (duration: 04m 16s)
21:27 demon@tin: Synchronized php-1.31.0-wmf.12/extensions/GlobalBlocking/: (no justification provided) (duration: 01m 07s)
21:26 mholloway-shell@tin: Finished deploy [mobileapps/deploy@28bfda3]: Update mobileapps to d0ee651 (duration: 01m 45s)
21:24 demon@tin: Synchronized php-1.31.0-wmf.11/extensions/GlobalBlocking/: (no justification provided) (duration: 01m 08s)
21:24 mholloway-shell@tin: Started deploy [mobileapps/deploy@28bfda3]: Update mobileapps to d0ee651
21:24 mholloway-shell@tin: Finished deploy [mobileapps/deploy@b2d5b8e]: Update mobileapps to 172abc7 (duration: 00m 30s)
21:24 mholloway-shell@tin: Started deploy [mobileapps/deploy@b2d5b8e]: Update mobileapps to 172abc7
21:23 mobrovac@tin: Started deploy [restbase/deploy@dceab2e]: Switch to Parsoid content v1.6.0 and switch to Cassandra 3 storage - T179417
21:21 ppchelko@tin: Finished deploy [restbase/deploy@dceab2e]: Bump expected parsoid version, but do not switch summaries, take 4 (duration: 02m 45s)
21:18 ppchelko@tin: Started deploy [restbase/deploy@dceab2e]: Bump expected parsoid version, but do not switch summaries, take 4
21:15 ppchelko@tin: Finished deploy [restbase/deploy@dceab2e]: Bump expected parsoid version, but do not switch summaries, take 3 (duration: 02m 04s)
21:13 ppchelko@tin: Started deploy [restbase/deploy@dceab2e]: Bump expected parsoid version, but do not switch summaries, take 3
21:13 ppchelko@tin: Finished deploy [restbase/deploy@dceab2e]: Bump expected parsoid version, but do not switch summaries, take 2 after failed content rerender (duration: 05m 05s)
21:12 mobrovac@tin: Started restart [electron-render/deploy@94d27d7]: Electron hanging - T174916
21:08 ppchelko@tin: Started deploy [restbase/deploy@dceab2e]: Bump expected parsoid version, but do not switch summaries, take 2 after failed content rerender
21:08 ppchelko@tin: Finished deploy [restbase/deploy@dceab2e]: Bump expected parsoid version, but do not switch summaries (duration: 05m 51s)
21:02 ppchelko@tin: Started deploy [restbase/deploy@dceab2e]: Bump expected parsoid version, but do not switch summaries
20:55 ppchelko@tin: Finished deploy [restbase/deploy@d3ca789]: Revert deployment for using MCS for summaries (duration: 01m 36s)
20:53 ppchelko@tin: Started deploy [restbase/deploy@d3ca789]: Revert deployment for using MCS for summaries
20:53 demon@tin: rebuilt and synchronized wikiversions files: group0 to wmf.12
20:51 ppchelko@tin: Finished deploy [restbase/deploy@506047c]: Update expected Parsoid version, switched summary to MCS (duration: 03m 30s)
20:48 ppchelko@tin: Started deploy [restbase/deploy@506047c]: Update expected Parsoid version, switched summary to MCS
20:44 mholloway-shell@tin: Finished deploy [mobileapps/deploy@b2d5b8e]: Update mobileapps to 172abc7 (duration: 04m 48s)
20:39 mholloway-shell@tin: Started deploy [mobileapps/deploy@b2d5b8e]: Update mobileapps to 172abc7
20:35 mholloway-shell@tin: Finished deploy [mobileapps/deploy@0a9d635]: Update mobileapps to 035608d (duration: 02m 32s)
20:32 mholloway-shell@tin: Started deploy [mobileapps/deploy@0a9d635]: Update mobileapps to 035608d
20:31 mholloway-shell@tin: Finished deploy [mobileapps/deploy@2690678]: Update mobileapps to 5b8796d (duration: 40m 04s)
19:57 arlolra: Updated Parsoid to 741fc5d (T114072 , T181226 , T21910 , T152540 , T103714 , T97093 , T118520 , T181229 , T182338 , T182170 , T169006 )
19:51 mholloway-shell@tin: Started deploy [mobileapps/deploy@2690678]: Update mobileapps to 5b8796d
19:43 arlolra@tin: Finished deploy [parsoid/deploy@98139cb]: (no justification provided) (duration: 14m 57s)
19:35 anomie: Running cleanupUsersWithNoId.php on all wikis (this will take a while), see T181731
19:12 arlolra: Parsoid deploy aborted and rolled back to 01c1fc3 while RESTBase fixes an issue
19:06 arlolra@tin: Finished deploy [parsoid/deploy@98139cb]: Updating Parsoid to 741fc5d (duration: 05m 33s)
19:00 arlolra@tin: Started deploy [parsoid/deploy@98139cb]: Updating Parsoid to 741fc5d
18:48 aaron@tin: Synchronized php-1.31.0-wmf.12/includes/Setup.php: 058c17e702eb0 (duration: 01m 09s)
18:21 moritzm: uploaded prometheus-ircd-exporter to apt.wikimedia.org
17:52 demon@tin: Finished scap: bootstrap wmf.12 (duration: 29m 35s)
17:22 demon@tin: Started scap: bootstrap wmf.12
17:15 demon@tin: Pruned MediaWiki: 1.31.0-wmf.7 (duration: 08m 45s)
17:12 akosiaris@tin: Finished deploy [ores/deploy@b4f2b02]: T181661 (duration: 00m 03s)
17:12 akosiaris@tin: Started deploy [ores/deploy@b4f2b02]: T181661
17:11 akosiaris@tin: Finished deploy [ores/deploy@b4f2b02]: T181661 (duration: 00m 36s)
17:11 akosiaris@tin: Started deploy [ores/deploy@b4f2b02]: T181661
17:03 akosiaris@tin: Started deploy [ores/deploy@b4f2b02]: T181661
16:47 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Restore db1084 and db1081 original weight (duration: 00m 56s)
16:40 jynus: restart and upgrade db1059 (phabricator passive db)
16:35 gehel@tin: Finished deploy [kartotherian/deploy@6e223df]: new kartotherian packaging on maps-test2001 (duration: 00m 19s)
16:35 gehel@tin: Started deploy [kartotherian/deploy@6e223df]: new kartotherian packaging on maps-test2001
16:34 gehel@tin: Finished deploy [tilerator/deploy@29d633e]: new tilerator packaging on maps-test2001 (duration: 00m 20s)
16:34 gehel@tin: Started deploy [tilerator/deploy@29d633e]: new tilerator packaging on maps-test2001
16:30 gehel@tin: Finished deploy [tilerator/deploy@29d633e]: new tilerator packaging on maps-test2002 (duration: 00m 20s)
16:30 gehel@tin: Started deploy [tilerator/deploy@29d633e]: new tilerator packaging on maps-test2002
16:28 gehel@tin: Finished deploy [kartotherian/deploy@6e223df]: new kartotherian packaging on maps-test2002 (duration: 00m 18s)
16:28 gehel@tin: Started deploy [kartotherian/deploy@6e223df]: new kartotherian packaging on maps-test2002
16:28 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase weight for db1084 (duration: 00m 53s)
16:24 jynus@tin: Synchronized wmf-config/db-codfw.php: Repool db2072 (duration: 00m 55s)
16:22 akosiaris@tin: Started deploy [ores/deploy@b4f2b02]: T181661
16:21 akosiaris: failover boron to ganeti1008
16:03 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase weight for db1084 (duration: 00m 56s)
15:59 akosiaris@tin: Finished deploy [ores/deploy@b4f2b02]: T181661 (duration: 00m 04s)
15:59 akosiaris@tin: Started deploy [ores/deploy@b4f2b02]: T181661
15:59 akosiaris@tin: Finished deploy [ores/deploy@b4f2b02]: T181661 (duration: 00m 09s)
15:58 akosiaris@tin: Started deploy [ores/deploy@b4f2b02]: T181661
15:46 jynus: stop, upgrade and reboot db2072
15:34 jynus@tin: Synchronized wmf-config/db-codfw.php: Depool db2072 (duration: 00m 56s)
15:31 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1097:3314 - T174569 (duration: 01m 01s)
15:31 marostegui: Deploy schema change on s4 db1097:3314 - T174569
15:25 gehel: powercycling maps-test2003
15:24 elukey: rename notebook1002 -> kafka1023 - step 3, replace notebook1002 with kafka1023 in the puppet config
15:20 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1084 with low weight (duration: 01m 18s)
15:06 marostegui: Upgrade MySQl on db1084
15:02 elukey: clear recdns records related to notebook1002/kafka1023 (rec_control wipe-cache kafka1023.eqiad.wmnet kafka1023.mgmt.eqiad.wmnet notebook1002.eqiad.wmnet 14.5.64.10.in-addr.arpa 104.3.65.10.in-addr.arpa) - T181518
14:58 jmm@puppetmaster1001: conftool action : set/pooled=yes; selector: mw1260.eqiad.wmnet
14:46 elukey: start rename notebook1002 -> kafka1023 - step 2, dns config (host already shutdown) - T181518
14:39 hashar: Rebuild operations-puppet-tests-docker image based on c76d892 | T178620 and /cache being owned by root
14:37 jynus@tin: Synchronized wmf-config/db-codfw.php: Repool db2071 (duration: 00m 56s)
14:28 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Restore original weight for db1086 (duration: 00m 56s)
14:20 zeljkof: EU SWAT finished
14:18 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Add NS aliases for zh_wikiquote (T181374) (duration: 00m 56s)
14:09 zfilipin@tin: Synchronized wmf-config/throttle.php: SWAT: Lift account registration on en.wiki (T182665) (duration: 00m 56s)
14:02 herron: upgrading codfw puppet agents
13:59 jynus: stop, upgrade and reboot db2071
13:58 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase weight for db1086 (duration: 00m 56s)
13:56 jynus@tin: Synchronized wmf-config/db-codfw.php: Depool db2071 (duration: 00m 57s)
13:25 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase weight for db1086 (duration: 00m 57s)
12:44 akosiaris@puppetmaster1001: conftool action : set/weight=10; selector: scb1004.eqiad.wmnet (tags: ['dc=eqiad', 'cluster=scb', 'service=ores'])
12:10 akosiaris@puppetmaster1001: conftool action : set/weight=10; selector: scb1004.eqiad.wmnet (tags: ['dc=eqiad', 'cluster=scb', 'service=ores'])
12:10 akosiaris@puppetmaster1001: conftool action : set/weight=10; selector: scb1003.eqiad.wmnet (tags: ['dc=eqiad', 'cluster=scb', 'service=ores'])
12:06 jynus@tin: Synchronized wmf-config/db-eqiad.php: Repool db1092 (duration: 00m 55s)
11:56 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1086 with low weight - T163190 (duration: 00m 55s)
11:27 jynus: stop, upgrade and reboot db1092
11:26 marostegui: Upgrade MySQL and kernel on db1086
11:26 jynus@tin: Synchronized wmf-config/db-eqiad.php: Depool db1092 (duration: 00m 56s)
11:12 jynus@tin: Synchronized wmf-config/db-codfw.php: Repool db2066 (duration: 00m 57s)
11:11 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1084 - T174569 (duration: 00m 57s)
11:11 marostegui: Deploy schema change on db1084 (s4) - T174569
11:05 marostegui: Deploy schema change on db1056 (s4) already depooled - T174569
11:01 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1081 - T174569 (duration: 00m 56s)
10:46 gehel@tin: Finished deploy [kartotherian/deploy@6e223df]: new kartotherian packaging on maps-test2003 (duration: 00m 10s)
10:46 gehel@tin: Started deploy [kartotherian/deploy@6e223df]: new kartotherian packaging on maps-test2003
10:26 jynus: stop, upgrade and reboot db2066
10:24 gehel@tin: Finished deploy [kartotherian/deploy@6e223df]: new kartotherian packaging on maps-test2004 (duration: 00m 18s)
10:24 gehel@tin: Started deploy [kartotherian/deploy@6e223df]: new kartotherian packaging on maps-test2004
10:23 jynus@tin: Synchronized wmf-config/db-codfw.php: Depool db2066 (duration: 00m 56s)
10:14 moritzm: reimaging mw1260 (video scaler) to stretch
10:04 jynus@tin: Synchronized wmf-config/db-codfw.php: Repool db2059 (duration: 00m 56s)
09:51 moritzm: updated stretch installer netboot image after stretch 9.3 point release
09:51 moritzm: updated stretch installer netboot image after jessie 8.10 point release
09:44 marostegui: Stop db1039 and db1086 in sync - T163190
09:43 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1086 - T163190 (duration: 00m 56s)
09:38 gehel: reduce replication factor for cassandra on maps-test cluster and reset cassandra on maps-test2001 to work around limited disk space - T182583
09:34 marostegui: Stop replication in sync on db1034 and db1039 for data consistency check - T163190
09:23 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1096:3316 after InnoDB there - T178359 (duration: 00m 56s)
09:00 jynus: stop, upgrade and reboot db2059
08:57 jynus@tin: Synchronized wmf-config/db-codfw.php: Depool db2059 (duration: 00m 56s)
08:47 moritzm: updated jessie installer netboot image after jessie 8.10 point release
07:15 marostegui: Deploy schema change on db1081 - https://phabricator.wikimedia.org/T174569
07:14 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1081 - T174569 (duration: 00m 56s)
06:56 marostegui: stop MySQL on db1055 - T182653
06:48 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1055 - T178359 T182653 (duration: 00m 56s)
02:27 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.11) (duration: 06m 21s)
00:53 legoktm@tin: Synchronized wmf-config/CommonSettings.php: Remove manual firejailing of Score binaries (T181535 ) (duration: 00m 56s)
00:47 legoktm@tin: Synchronized wmf-config/CommonSettings.php: Have ExtensionDistributor treat REL1_30 as stable (duration: 00m 56s)
00:39 catrope@tin: Synchronized wmf-config/InitialiseSettings.php: Re-enable ORES on fawiki (T182354 ) (duration: 00m 56s)
00:25 ejegg: updated CiviCRM from 2db9c76 to 85a8526
00:23 catrope@tin: Synchronized wmf-config/CommonSettings.php: Fix default for rcOresDamagingPref (duration: 00m 56s)
00:22 ejegg: disabled nightly Ingenico WX audit parsing job
2017-12-11
23:37 smalyshev@tin: Finished deploy [wdqs/wdqs@f6b110f]: updater fix and GUI update (duration: 06m 10s)
23:31 smalyshev@tin: Started deploy [wdqs/wdqs@f6b110f]: updater fix and GUI update
23:28 mholloway-shell@tin: Finished deploy [mobileapps/deploy@07293bc]: Update mobileapps to e290b17 (duration: 06m 16s)
23:22 mholloway-shell@tin: Started deploy [mobileapps/deploy@07293bc]: Update mobileapps to e290b17
21:11 mholloway-shell@tin: Finished deploy [mobileapps/deploy@6347d62]: Update mobileapps to 61ca333 (duration: 07m 56s)
21:04 mholloway-shell@tin: Started deploy [mobileapps/deploy@6347d62]: Update mobileapps to 61ca333
19:47 ebernhardson@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: T181493 : Enable Page Previews EventLogging instrumentation (duration: 00m 56s)
19:47 mobrovac@tin: Finished deploy [restbase/deploy@be7d72f]: Expose the Reading Lists end points, take #3 - T181107 (duration: 06m 19s)
19:46 ppchelko@tin: Finished deploy [cpjobqueue/deploy@b1beaf1]: Revert dedupe based on sha1 as well as on event ID (duration: 00m 29s)
19:46 ppchelko@tin: Started deploy [cpjobqueue/deploy@b1beaf1]: Revert dedupe based on sha1 as well as on event ID
{{safesubst:SAL entry|1=19:43 urandom: lower compaction throughput to 2 MB/s, restbase1010-{a,b,c} - T178177}}
19:40 mobrovac@tin: Started deploy [restbase/deploy@be7d72f]: Expose the Reading Lists end points, take #3 - T181107
19:34 mobrovac@tin: Finished deploy [restbase/deploy@bce2885]: Expose the Reading Lists end points, take #2 - T181107 (duration: 05m 33s)
19:28 mobrovac@tin: Started deploy [restbase/deploy@bce2885]: Expose the Reading Lists end points, take #2 - T181107
19:28 mobrovac@tin: Finished deploy [restbase/deploy@bce2885]: Expose the Reading Lists end points - T181107 (duration: 01m 26s)
19:26 mobrovac@tin: Started deploy [restbase/deploy@bce2885]: Expose the Reading Lists end points - T181107
19:24 ebernhardson@tin: Synchronized wmf-config/throttle.php: SWAT: T182613 Update throttle rule for McGill University Library (duration: 00m 56s)
19:21 ebernhardson@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: T131132 : Switch submit button from save to publish on enwiki (duration: 02m 43s)
19:19 awight@tin: Finished deploy [ores/deploy@1c0ede0]: (non-production) Testing parallel ORES deployment, T181661 (duration: 01m 12s)
19:17 awight@tin: Started deploy [ores/deploy@1c0ede0]: (non-production) Testing parallel ORES deployment, T181661
19:16 ebernhardson@tin: Synchronized php-1.31.0-wmf.11/extensions/CirrusSearch/includes/Search/RescoreBuilders.php: SWAT: Add query string for running Cirrus MLR pre-deploy checks (duration: 00m 57s)
18:56 mobrovac@tin: Synchronized wmf-config/InitialiseSettings.php: Switch cebwiki, ruwiki and small projects to Kafka for htmlCacheUpdate - T182023 (duration: 00m 57s)
18:55 ppchelko@tin: Finished deploy [cpjobqueue/deploy@e1075af]: Enable htmlCacheUpdate for ceb and ru wiki and small projects T182023 (duration: 00m 34s)
18:54 ppchelko@tin: Started deploy [cpjobqueue/deploy@e1075af]: Enable htmlCacheUpdate for ceb and ru wiki and small projects T182023
18:03 gehel@tin: Finished deploy [wdqs/wdqs@353b3cb]: wdqs: GUI and updater updates (duration: 01m 14s)
18:02 gehel@tin: Started deploy [wdqs/wdqs@353b3cb]: wdqs: GUI and updater updates
16:42 hashar: Restarting Jenkins
15:36 marostegui: Deploy schema change on s2 master (db1054) - T174569
14:51 niharika29@tin: Synchronized wmf-config/InitialiseSettings.php: Disable the wgTranslateNumerals at hiwikiversity T182584 (duration: 00m 56s)
14:46 niharika29@tin: Synchronized php-1.31.0-wmf.11/includes/specials/SpecialUndelete.php: Revert replacing textarea in Special:Undelete with OOUI T182398 (duration: 00m 57s)
14:34 niharika29@tin: Synchronized wmf-config/InitialiseSettings.php: Bureaucrats to grant and remove translationadmin rights; sysops to add and remove the same from themselves - T182492 (duration: 00m 56s)
14:32 niharika29@tin: Synchronized wmf-config/interwiki.php: Update the Interwiki map - T182506 (duration: 00m 56s)
14:25 niharika29@tin: Synchronized wmf-config/InitialiseSettings.php: Create NS_PROJECT and NS_PROJECT_TALK alias for kowikisource (T182487 ) (duration: 00m 56s)
13:54 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1060 - T174569 (duration: 04m 42s)
12:39 _joe_: trying to get a full core dump from hhvm on mw1200
11:12 _joe_: depooling mw1200 for investigation instead
11:11 _joe_: restarting hhvm on mw1200, stuck in a kernel task
11:08 jdrewniak@tin: Synchronized portals: Wikimedia Portals Update: Bumping portals to master (T128546) (duration: 00m 45s)
11:07 jdrewniak@tin: Synchronized portals/prod/wikipedia.org/assets: Wikimedia Portals Update: Bumping portals to master (T128546) (duration: 00m 44s)
10:49 ema: cp4021: restart varnish-be due to mbox lag
10:04 godog: upgrade grafana to 4.6.2 on labmon1001 - T182294
10:00 jynus: stopping dbstore2001:s5 and dbstore1002 (s5) mysql replication in sync
09:28 akosiaris: upload scap_3.7.4-1 to apt.wikimedia.org/jessie-wikimedia/main
09:16 gehel: cleaning old cassandra dumps on maps-test2001 servers
09:15 gehel: cleaning up old postgres logs on maps-test2001
09:05 elukey: set notebook1002 as role::spare as prep step to reimage it to kafka1023
09:03 jynus: dropping multiple leftover files from db1102
08:52 marostegui: Stop replication in sync on db1034 and db1039 - T163190
08:12 elukey: powercycle ganeti1008 - all vms stuck, console com2 showed a ton of printks without a clear indicator of the root cause
07:49 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1034 - T182556 (duration: 00m 45s)
07:44 _joe_: restarting hhvm on mw1189,mw1229,mw1235,mw1282,mw1285,mw1315,mw1316, all stuck with a kernel hang
06:59 _joe_: restarted hhvm, nginx on mw1280, hanging kernel operations
06:45 marostegui: Deploy schema change on s2 db1060 with replication enabled, this will generate some lag on s2 on labs - T174569
06:45 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1060 - T174569 (duration: 00m 44s)
06:22 marostegui: Compress s6 on db1096 - T178359
06:21 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1096:3316 to compress InnoDB there - T178359 (duration: 00m 45s)
02:43 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.11) (duration: 09m 21s)
2017-12-10
20:33 elukey: execute restart-hhvm on mw1312 - hhvm stuck multiple times queueing requests
20:01 elukey: ran kafka preferred-replica-election for the kafka analytics cluster (1012->1022) to re-add kafka1012 to the kafka brokers acting as partition leaders (will spread the load in a better way)
2017-12-09
17:00 apergos: restarted hhvm on mw1276, the same old hang with the same old symptoms
16:10 awight@tin: Finished deploy [ores/deploy@1c0ede0]: Reducing ORES Celery log verbosity (take 4\!) (duration: 03m 01s)
16:07 awight@tin: Started deploy [ores/deploy@1c0ede0]: Reducing ORES Celery log verbosity (take 4\!)
16:02 awight@tin: Finished deploy [ores/deploy@1c0ede0]: Reducing ORES Celery log verbosity (duration: 05m 58s)
15:56 awight@tin: Started deploy [ores/deploy@1c0ede0]: Reducing ORES Celery log verbosity
15:55 awight@tin: Finished deploy [ores/deploy@1c0ede0]: Reducing ORES Celery log verbosity (duration: 00m 17s)
15:55 awight@tin: Started deploy [ores/deploy@1c0ede0]: Reducing ORES Celery log verbosity
15:53 awight@tin: Finished deploy [ores/deploy@1c0ede0]: Reducing ORES Celery log verbosity (duration: 00m 31s)
15:53 awight@tin: Started deploy [ores/deploy@1c0ede0]: Reducing ORES Celery log verbosity
15:53 apergos: did same on scb1002,3,4
15:48 awight: Making an emergency deployment to ORES logging config to reduce verbosity.
15:45 apergos: on scb1001 moved daemon.log out of the way, did "service rsyslog rotate", saved the last 5000 entries for use by ores team, removed the log
11:44 apergos: that server list: mw1278, 1277, 1226, 1234, 1230
11:42 apergos: restarted hhvm on api servers after lockup
11:19 legoktm@tin: Synchronized wmf-config/InitialiseSettings.php: Disable ORES in fawiki - T182354 (duration: 00m 45s)
00:11 Jamesofur: removed 2FA from EVinente after verification T182373
2017-12-08
23:23 hashar: force ran puppet on contint2001
22:15 madhuvishy: Kicked off rsync of /data/xmldatadumps/public to labstore1006 & 7
22:05 smalyshev@tin: Finished deploy [wdqs/wdqs@353b3cb]: temporary fix for T182464 , better fix coming soon (duration: 05m 55s)
21:59 smalyshev@tin: Started deploy [wdqs/wdqs@353b3cb]: temporary fix for T182464 , better fix coming soon
20:22 aaron@tin: Synchronized php-1.31.0-wmf.11/includes/Setup.php: a319c3e7ab61 - disable cpPosTime injection (duration: 00m 45s)
18:00 reedy@tin: Synchronized wmf-config/InitialiseSettings.php: Disable GlobalBlocking on fishbowl wikis (duration: 00m 45s)
16:23 urandom: starting cassandra, restbase1010 - T178177
16:22 urandom: disabling smart path, restbase1010, arrays 'b'...'e' - T178177
16:20 urandom: disabling smart path, restbase1010, array 'a' (canary) - T178177
16:15 urandom: shutting down cassandra, restbase1010 - T178177
15:35 marostegui: Fix dbstore1002 s5 replication
15:28 gehel@tin: Finished deploy [tilerator/deploy@29d633e]: testing new tilerator packaging on maps-test2003 (duration: 00m 03s)
15:28 gehel@tin: Started deploy [tilerator/deploy@29d633e]: testing new tilerator packaging on maps-test2003
15:08 gehel@tin: Finished deploy [tilerator/deploy@29d633e]: testing new tilerator packaging on maps-test2003 (duration: 02m 08s)
15:06 gehel@tin: Started deploy [tilerator/deploy@29d633e]: testing new tilerator packaging on maps-test2003
15:05 gehel@tin: Finished deploy [tilerator/deploy@29d633e]: testing new tilerator packaging on maps-test2003 (duration: 00m 42s)
15:05 gehel@tin: Started deploy [tilerator/deploy@29d633e]: testing new tilerator packaging on maps-test2003
14:39 gehel@tin: Finished deploy [tilerator/deploy@e52ea1d]: testing new tilerator packaging on maps-test2003 (duration: 02m 34s)
14:36 gehel@tin: Started deploy [tilerator/deploy@e52ea1d]: testing new tilerator packaging on maps-test2003
11:45 elukey: updated prometheus-druid-exporter on druid* to 0.6
11:39 elukey: upload prometheus-druid-exporter 0.6 to stretch/jessie wikimedia
06:52 marostegui: Fix labsdb1004 replication broken
06:43 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Fully pool db1099:3311 - T178359 (duration: 00m 55s)
04:56 bd808: scholarships: updated db schema for 2018 cycle (T181072 )
00:04 bblack: cp4026 - restart varnish backend, mailbox lag
2017-12-07
23:59 tgr@tin: Synchronized wmf-config/InitialiseSettings.php: T181107 enable ReadingLists on all wikis (duration: 00m 46s)
23:54 tgr: ran mwscript extensions/ReadingLists/maintenance/populateProjectsFromSiteMatrix.php --wiki=testwiki
23:47 tgr@tin: Finished scap: T181107 deploy ReadingLists to testwiki (duration: 24m 44s)
23:22 tgr@tin: Started scap: T181107 deploy ReadingLists to testwiki
23:04 tgr: ran mwscript ../../../home/tgr/sql.php --wiki=mediawikiwiki --cluster extension1 --wikidb wikishared /srv/mediawiki-staging/php-1.31.0-wmf.11/extensions/ReadingLists/sql/readinglists.sql
22:33 tgr@tin: Synchronized php-1.31.0-wmf.11/extensions/ReadingLists/: ReadingLists/wmf.11: catching up with master (duration: 00m 45s)
22:29 tgr@tin: Synchronized php-1.31.0-wmf.10/extensions/ReadingLists/: ReadingLists/wmf.10: catching up with master (duration: 00m 46s)
20:39 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group2 to wmf.11
20:35 elukey: restart hhvm on mw1235 - hhvm-dump-debug hanging out, not stacktrace available
20:31 elukey: restart hhvm on mw1281 - hhvm stuck (hhvm-dump-debug timing out)
20:22 joal@tin: Finished deploy [analytics/refinery@53bd630]: Regular analytics deploy - Long time no see, deployment :) - post-patch-2 (hopefully last for tonight) (duration: 05m 21s)
20:18 herron: re-pooling eqiad puppet 4 masters via dns puppet.eqiad.wmnet puppet.wikimedia.org
20:17 joal@tin: Started deploy [analytics/refinery@53bd630]: Regular analytics deploy - Long time no see, deployment :) - post-patch-2 (hopefully last for tonight)
20:14 joal@tin: Finished deploy [analytics/refinery@3e52903]: Regular analytics deploy - Long time no see, deployment :) - post-patch (duration: 01m 20s)
20:12 joal@tin: Started deploy [analytics/refinery@3e52903]: Regular analytics deploy - Long time no see, deployment :) - post-patch
19:52 joal@tin: Finished deploy [analytics/refinery@bd9c6cc]: Regualr analytics deploy - Long time no see, deployment :) (duration: 12m 15s)
19:40 joal@tin: Started deploy [analytics/refinery@bd9c6cc]: Regualr analytics deploy - Long time no see, deployment :)
19:35 thcipriani@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Start description usage tracking for commonswiki T106287 (duration: 00m 48s)
19:29 thcipriani@tin: Synchronized php-1.31.0-wmf.11/includes/specialpage/ChangesListSpecialPage.php: SWAT: WLFilters: Correctly check if RCFilters should be enabled on WL T182318 (duration: 00m 48s)
19:24 thcipriani@tin: Synchronized wmf-config/Wikibase-production.php: SWAT: Remove obsolete WikibaseQualityConstraints settings (duration: 00m 48s)
19:14 ppchelko@tin: Finished deploy [changeprop/deploy@3c4f51d]: Long awaited deploy: generic optimizations, gc metric, delay reporting (duration: 01m 15s)
19:13 ppchelko@tin: Started deploy [changeprop/deploy@3c4f51d]: Long awaited deploy: generic optimizations, gc metric, delay reporting
19:12 thcipriani@tin: Synchronized wmf-config/throttle.php: SWAT: Rm all past throttle overrides in throttle.php (duration: 00m 48s)
18:06 mholloway-shell@tin: Finished deploy [mobileapps/deploy@2fa32ed]: Update mobileapps to 71f581c (duration: 05m 36s)
18:00 mholloway-shell@tin: Started deploy [mobileapps/deploy@2fa32ed]: Update mobileapps to 71f581c
18:00 herron: re-pooling eqiad puppet 4 masters as puppet.ulsfo.wnet
17:33 demon@tin: Synchronized php-1.31.0-wmf.11/extensions/VisualEditor/lib/ve: Ief480487 , Deskana made me do it (duration: 00m 49s)
17:25 milimetric@tin: Finished deploy [analytics/aqs/deploy@4ec13b4]: (no justification provided) (duration: 07m 28s)
17:25 elukey@puppetmaster1001: conftool action : set/pooled=yes; selector: name=mw1314.eqiad.wmnet
17:25 herron: puppetmaster1001 upgraded to puppet 4. re-enabling puppet agents across the fleet
17:18 milimetric@tin: Started deploy [analytics/aqs/deploy@4ec13b4]: (no justification provided)
17:13 herron: upgrading puppetmaster1001 to puppet 4
17:08 herron: temporarily disabling all puppet agents during puppetmaster1001 (puppet ca) upgrade to puppet 4
16:14 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1099:3311 weight - T178359 (duration: 00m 48s)
16:13 herron: upgrading puppetmaster1002 to puppet 4
16:03 moritzm: uploaded openssl 1.0.2n for jessie-wikimedia to apt.wikimedia.org
16:01 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1093 back as main traffic in s6 - T178359 (duration: 00m 48s)
15:44 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1099:3311 weight - T178359 (duration: 00m 48s)
15:42 elukey: hhvm-dump-debug for mw1314 saved to /tmp/hhvm.17991.bt.
15:30 elukey@puppetmaster1001: conftool action : set/pooled=no; selector: name=mw1314.eqiad.wmnet
15:24 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1099:3311 weight - T178359 (duration: 00m 48s)
15:19 otto@tin: Finished deploy [analytics/superset/deploy@f0f5adf]: initial deployment (duration: 00m 02s)
15:18 otto@tin: Started deploy [analytics/superset/deploy@f0f5adf]: initial deployment
14:55 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Pool db1099:3311 with low weight - T178359 (duration: 00m 47s)
14:29 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Fully repool db1074 (duration: 00m 47s)
14:09 zeljkof: EU SWAT finished
14:08 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: alswiki: Set wgRestrictDisplayTitle = false (T182154) (duration: 00m 49s)
13:39 moritzm: reimaging mw2246 to stretch
12:54 jmm@puppetmaster1001: conftool action : set/pooled=yes; selector: mw2152.codfw.wmnet
12:32 addshore@tin: Synchronized wmf-config/Wikibase.php: Move Wikibase dispatchingLockManager to InitialiseSettings PT 2/2 (duration: 00m 48s)
12:31 addshore@tin: Synchronized wmf-config/InitialiseSettings.php: Move Wikibase dispatchingLockManager to InitialiseSettings PT 1/2 (duration: 00m 48s)
11:42 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase API traffic for db1074 (duration: 00m 48s)
11:41 moritzm: reimaging mw2152 to stretch
11:35 marostegui: Compress s8 on db1099 - T178359
11:21 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase API traffic for db1074 (duration: 00m 48s)
11:11 mobrovac@tin: Finished deploy [restbase/deploy@097ba7d]: Add CORS headers to erroneous responses as well - T182103 (duration: 05m 24s)
11:05 mobrovac@tin: Started deploy [restbase/deploy@097ba7d]: Add CORS headers to erroneous responses as well - T182103
10:57 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1074 with low weight (duration: 00m 48s)
10:50 elukey: powercycle analytics1003 - no serial console, ssh stuck in System is booting up. See pam_nologin(8)
10:28 _joe_: depooling mw1283 for further investigation
10:14 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Fully pool db1098:3316 db1098:3317 - T178359 (duration: 00m 51s)
10:12 elukey: reboot analytics1003 for kernel+jvm updates - T179943
10:01 gehel: upgrade of ELK stack on logstash100* completed - Kibana was unavailable for longer than expected - T178412
09:48 hashar: CI: removed Wikidata from configuration, replaced by Wikibase. wmf/* and REL branches are going to be broken though | https://gerrit.wikimedia.org/r/395704 | T181838
09:46 akosiaris: silence ganeti1006 on icinga T181121
09:36 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase traffic for db1098:3316 db1098:3317 - T178359 (duration: 00m 52s)
09:28 marostegui: Upgrade MySQL and kernel on db1074
09:22 gehel@puppetmaster1001: conftool action : set/pooled=yes; selector: name=logstash1009.eqiad.wmnet
09:19 gehel@puppetmaster1001: conftool action : set/pooled=no; selector: name=logstash1009.eqiad.wmnet
09:18 gehel@puppetmaster1001: conftool action : set/pooled=yes; selector: name=logstash1008.eqiad.wmnet
09:14 gehel@puppetmaster1001: conftool action : set/pooled=no; selector: name=logstash1008.eqiad.wmnet
09:13 gehel@puppetmaster1001: conftool action : set/pooled=yes; selector: name=logstash1007.eqiad.wmnet
09:08 gehel@puppetmaster1001: conftool action : set/pooled=no; selector: name=logstash1007.eqiad.wmnet
09:05 gehel@tin: Finished deploy [logstash/plugins@b13d2fa]: (no justification provided) (duration: 00m 02s)
09:05 gehel@tin: Started deploy [logstash/plugins@b13d2fa]: (no justification provided)
09:00 gehel: upgrading ELK stack on logstash100* - some log messages might be lost during the upgrade - T178412
08:51 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase traffic for db1098:3316 db1098:3317 - T178359 (duration: 00m 48s)
08:32 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase traffic for db1098:3316 db1098:3317 - T178359 (duration: 00m 48s)
08:28 elukey: install prometheus-druid-exporter 0.5 on druid*
08:26 elukey: upload prometheus-druid-exporter 0.5-1 to jessie/stretch-wikimedia
07:19 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Slowly pool db1098:3316 db1098:3317 - T178359 (duration: 00m 47s)
07:18 marostegui@tin: Synchronized wmf-config/db-codfw.php: Slowly pool db1098:3316 db1098:3317 - T178359 (duration: 00m 48s)
06:45 marostegui: Stop replication on db1099:3311 to reimport: change_tag, tag_summary, user and watchlist tables and recompress again - T178359
06:40 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1074 - T174569 (duration: 00m 48s)
06:40 marostegui: Deploy schema change on db1074 (s2) - T174569
03:00 catrope@tin: Synchronized php-1.31.0-wmf.11/resources/src/mediawiki.rcfilters/: T182268 (duration: 00m 57s)
02:29 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.10) (duration: 08m 57s)
2017-12-06
23:45 eileen: update CiviCRM from 85263f6 to 2db9c76 (improve org merges)
23:12 ppchelko@tin: Started restart [changeprop/deploy@065a06e]: (no justification provided)
23:04 eileen: update process-control to b1cd515 (reenable bounce processing, enable dedupe on orgs)
22:58 ppchelko@tin: Finished deploy [cpjobqueue/deploy@761524f]: Separate delay and totaldelay metrics T182216 (duration: 00m 31s)
22:57 ppchelko@tin: Started deploy [cpjobqueue/deploy@761524f]: Separate delay and totaldelay metrics T182216
22:55 mutante: stat1003 - re-enabled puppet after putting role::spare on it (T175150 )
22:48 mutante: stat1003 - this host was kind of invisible (not in site, not in icinga) but still up, re-enabling puppet after re-adding it to site
22:03 demon@tin: Synchronized php-1.31.0-wmf.11/extensions/WikidataPageBanner/includes/WikidataPageBanner.hooks.php: unbreak (duration: 00m 48s)
22:00 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group1 to wmf.11, again
21:51 hoo@tin: Synchronized php-1.31.0-wmf.10/extensions/Wikibase/lib/includes/Changes/EntityChange.php: Make EntityChange truly forward compatible with compact diffs (T182243 ) (duration: 00m 48s)
21:43 arlolra: Updated Parsoid to 01c1fc3 (T178253 , T61840 , T180930 , T179259 )
21:34 arlolra@tin: Finished deploy [parsoid/deploy@dfcc622]: Updating Parsoid to 01c1fc3 (duration: 09m 45s)
21:24 arlolra@tin: Started deploy [parsoid/deploy@dfcc622]: Updating Parsoid to 01c1fc3
20:45 marostegui: Add 400G to labsdb1003 /srv partition
20:15 hoo: Ran "scap pull" on snapshot1001 after T177486 related tests
20:10 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: partial rollback -- wikidata errors
20:07 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group1 to wmf.11
20:05 demon@tin: Synchronized php: symlink bump (duration: 00m 47s)
19:32 hoo@tin: Synchronized php-1.31.0-wmf.11/extensions/Wikibase/repo/maintenance/dispatchChanges.php: dispatch: track how long client selecting takes (duration: 00m 48s)
19:24 ppchelko@tin: Finished deploy [cpjobqueue/deploy@8c66189]: Deduplicate based on event sha1 as well as on id, take 2 (duration: 00m 38s)
19:24 ppchelko@tin: Started deploy [cpjobqueue/deploy@8c66189]: Deduplicate based on event sha1 as well as on id, take 2
19:23 hoo@tin: Synchronized php-1.31.0-wmf.10/extensions/Wikibase/repo/maintenance/dispatchChanges.php: dispatch: track how long client selecting takes (duration: 00m 48s)
19:22 ppchelko@tin: Started deploy [cpjobqueue/deploy@8c66189]: Deduplicate based on event sha1 as well as on id, take 2
19:22 ppchelko@tin: Finished deploy [cpjobqueue/deploy@8c66189]: Deduplicate based on event sha1 as well as on id (duration: 00m 11s)
19:22 ppchelko@tin: Started deploy [cpjobqueue/deploy@8c66189]: Deduplicate based on event sha1 as well as on id
19:20 ppchelko@tin: Finished deploy [cpjobqueue/deploy@df72b34]: Deduplicate based on event sha1 as well as on id (duration: 00m 09s)
19:19 ppchelko@tin: Started deploy [cpjobqueue/deploy@df72b34]: Deduplicate based on event sha1 as well as on id
18:44 volans: stopped irc echo on tegmen, re-enabled puppet and run it (it was disabled by a run-no-puppet sync_icinga_state)
18:32 mobrovac@tin: Finished deploy [zotero/translators@3044b3a]: Update translators to 092c7bc - T178596 (duration: 00m 07s)
18:32 mobrovac@tin: Started deploy [zotero/translators@3044b3a]: Update translators to 092c7bc - T178596
18:16 herron: upgrading rhodium to puppet 4
18:13 mobrovac@tin: Synchronized wmf-config/jobqueue.php: Switch htmlCacheUpdate jobs for wiktionaries to EventBus, file 2/2 - T182023 (duration: 00m 48s)
18:12 mobrovac@tin: Synchronized wmf-config/InitialiseSettings.php: Switch htmlCacheUpdate jobs for wiktionaries to EventBus, file 1/2 - T182023 (duration: 00m 48s)
18:05 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Add https://studiezaal.nijmegen.nl to $wgCopyUploadsDomains (T181713) (duration: 00m 49s)
17:51 ppchelko@tin: Finished deploy [cpjobqueue/deploy@df72b34]: Switch htmlCacheUpdate for wiktionaries, attempt 2 T182023 (duration: 00m 32s)
17:50 ppchelko@tin: Started deploy [cpjobqueue/deploy@df72b34]: Switch htmlCacheUpdate for wiktionaries, attempt 2 T182023
17:44 ppchelko@tin: Finished deploy [cpjobqueue/deploy@3281df1]: Switch htmlCacheUpdate for wiktionaries T182023 (duration: 02m 57s)
17:42 ppchelko@tin: Started deploy [cpjobqueue/deploy@3281df1]: Switch htmlCacheUpdate for wiktionaries T182023
17:36 jmm@puppetmaster1001: conftool action : set/pooled=yes; selector: mw1260.eqiad.wmnet
17:05 hoo: Started Wikidata RDF dumps (sudo -b -u datasets bash -c 'dumpwikidatardf.sh all ttl; dumpwikidatardf.sh truthy nt') on snapshot1007
16:33 anomie@terbium: Finished cleanupUsersWithNoId.php on mediawikiwiki
16:29 anomie@terbium: Running cleanupUsersWithNoId.php for mediawikiwiki, see T181731
16:28 anomie@terbium: Finished cleanupUsersWithNoId.php on testwikidatawiki
16:27 anomie@terbium: Running cleanupUsersWithNoId.php for testwikidatawiki, see T181731
16:26 anomie@terbium: Finished cleanupUsersWithNoId.php on test2wiki
16:14 anomie@terbium: Running cleanupUsersWithNoId.php for test2wiki, see T181731
16:11 anomie@terbium: Finished cleanupUsersWithNoId.php on testwiki
16:03 anomie@terbium: Running cleanupUsersWithNoId.php for testwiki, see T181731
15:13 akosiaris: upload kubernetes_1.7.10-1_amd64 on apt.wikimedia.org/stretch-wikimedia/main T181489
14:59 addshore: swat done
14:57 addshore@tin: Synchronized php-1.31.0-wmf.11/extensions/Wikibase/repo/includes/ChangeDispatcher.php: SWAT Tracking within ChangeDispatcher::getPendingChanges (duration: 00m 49s)
14:53 addshore@tin: Synchronized php-1.31.0-wmf.10/extensions/Wikibase/repo/includes/ChangeDispatcher.php: SWAT Tracking within ChangeDispatcher::getPendingChanges (duration: 00m 49s)
14:33 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Add https://studiezaal.nijmegen.nl to $wgCopyUploadsDomains (T181713) (duration: 00m 47s)
14:22 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Add Portal namespace for mwl.wikipedia (T180052) (duration: 00m 48s)
14:13 addshore@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT T180476 wmgUseNewWikiDiff2Extension true for dewiki (duration: 00m 48s)
14:08 addshore@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT T181498 T181329 Enable AdvancedSearch on fawiki and huwiki (duration: 00m 50s)
13:40 moritzm: reimaging mw1260 to stretch
10:55 jmm@puppetmaster1001: conftool action : set/pooled=yes; selector: mw2118.codfw.wmnet
10:55 moritzm: reimaging mw2119 to stretch
10:45 mobrovac@tin: Finished deploy [restbase/deploy@b1d7c82]: Use Cass3 for revisions, deprecate trending-edits, fix CX end point - T179421 T180384 T173801 (duration: 06m 02s)
10:39 mobrovac@tin: Started deploy [restbase/deploy@b1d7c82]: Use Cass3 for revisions, deprecate trending-edits, fix CX end point - T179421 T180384 T173801
09:34 gehel: shuttting down logstash / elasticsearch on logstash100[123] in preparation for decommission -T175830
08:05 moritzm: reimaging mw2118 to stretch (now for real, yesterday's reimage logged to SAL was interrupted)
07:33 bblack: cp4024 - backend restart, mailbox lag
02:39 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.10) (duration: 08m 53s)
01:01 eileen: update process-control to 55529ea - threshold for $500+ dedupe was too low
00:36 thcipriani@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable description usage tracking for all wikis except commons T106287 (duration: 00m 48s)
00:27 thcipriani@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable RemexHTML on wikis with zero high priority linter errors T182042 (duration: 00m 51s)
2017-12-05
23:07 eileen: update civicrm from d2c70d2 to 85263f6 (Major gifts address - choose latest)
22:29 eileen: update process_control to 3522186 renable jobs to fetch recipient data
22:13 mutante: restbase1010 failed at reboot with P6431 , after a cold start (power off, power on) it came back though :) (T178177 T141756 )
22:00 mutante: restbase1010 - rebooting for firmware upgrade
21:58 mutante: restbase1010 - upgraded HP firmware (Flashing Smart Array P440ar in Slot 0 [ 3.56 -> 6.06 ]) T141756 T178177
21:56 urandom: draining cassandra instances, restbase1010 - T178177
21:52 mutante: restbase1010 - upgrading firmware - Flashing Smart Array P440ar in Slot 0 [ 3.56 -> 6.06 ]
21:31 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group0 shenanigans / wmf.11
21:12 demon@tin: Finished scap: bootstrap wmf.11 (duration: 51m 01s)
20:50 twentyafterfour: phabricator: running `sudo bin/garbage collect --collector search.ferret.ngram`
20:29 twentyafterfour: phabricator: running `sudo bin/search ngrams --threshold 0.2`
20:21 demon@tin: Started scap: bootstrap wmf.11
19:50 mutante: forcing puppet ron on ores eqiad to reduce number of celery workers used for stress test
19:37 ppchelko@tin: Finished deploy [eventlogging/eventbus@6ca0372]: Make the kafka async deliver callback thread-safe. T180017 (duration: 01m 29s)
19:36 ppchelko@tin: Started deploy [eventlogging/eventbus@6ca0372]: Make the kafka async deliver callback thread-safe. T180017
19:20 gehel: moving eventlogging collection by logstash from logstash1003 to logstash1007, no messages **should** be lost - T175830
18:24 ppchelko@tin: Finished deploy [eventlogging/eventbus@6ca0372]: Make the kafka async deliver callback thread-safe. Limited to kafka1001. T180017 (duration: 00m 14s)
18:24 ppchelko@tin: Started deploy [eventlogging/eventbus@6ca0372]: Make the kafka async deliver callback thread-safe. Limited to kafka1001. T180017
17:27 jmm@puppetmaster1001: conftool action : set/pooled=yes; selector: mw2118.codfw.wmnet
15:56 awight: beginning stress test on ores* (non-production)
15:54 awight@tin: Finished deploy [ores/deploy@6baed71]: (non-production) Test ORES deployment to ores100[1-2] (duration: 01m 01s)
15:53 awight@tin: Started deploy [ores/deploy@6baed71]: (non-production) Test ORES deployment to ores100[1-2]
15:15 ottomata: restarrting kafka-jumbo brokers, applying SSL (downtime scheduled)
15:03 urandom: bootstrapping cassandra, restbase1014-c - T179422
14:54 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Fully repool db1076 (duration: 00m 43s)
14:41 zeljkof: EU SWAT finished
14:40 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Revert "Add category collation for sewiki" (T181503) (duration: 00m 44s)
14:15 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable RemexHTML on itwiki and dewiki (T181188 T181190) (duration: 00m 43s)
13:39 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1076 weight (duration: 00m 44s)
13:14 moritzm: reimaging mw2118/mw2119 (video scalers) to stretch
12:21 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Fully repool db1034 (duration: 00m 43s)
12:13 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1076 with low weight (duration: 00m 44s)
11:58 marostegui: Upgrade MariaDB and kernel on db1076
11:53 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase traffic for db1034 - T178359 ! (duration: 00m 43s)
10:53 moritzm: rebooting meitnerium/archiva.wikimedia.org for update to 4.9.51
10:46 moritzm: rebooting einsteinium (icinga.wikimedia.org) for update to 4.9,51
10:45 elukey: reboot druid1003 for kernel+jvm updates - T179943
10:33 addshore@tin: Synchronized wmf-config/extension-list: extension-list extension.json entrypoint for ArticlePlaceholder (duration: 00m 43s)
10:33 moritzm: rebooting tegmen for update to 4.9,51
10:07 jynus: stopping dbstore1002 (s5) and dbstore2001 (s5) for maintenance
10:05 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase traffic for db1034 - T178359 ! (duration: 00m 43s)
10:00 ema: cp4021: restart varnish-be due to mbox lag/fetch failures
09:49 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1034 with low weight - T178359 ! (duration: 00m 43s)
09:42 elukey: reboot analytics100[12] for kernel+jvm updates (Hadoop Master nodes) - T179943
09:39 godog: bootstrap restbase1014-b - T179422
09:20 marostegui: Optimize s7 on db1098 - T178359
09:03 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1076 - T174569 (duration: 00m 43s)
09:01 marostegui: Deploy schema change on db1076 (s2) - T174569
08:57 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1090 - T174569 (duration: 00m 44s)
08:54 moritzm: enabling test production traffic for mw1259 (stretch-based video scaler)
08:24 jmm@puppetmaster1001: conftool action : set/pooled=yes; selector: mw1259.eqiad.wmnet
08:08 moritzm: installing python updates on trusty
07:27 kartik@tin: Finished deploy [cxserver/deploy@4b74f03]: Update cxserver to 1693bcf (duration: 03m 22s)
07:23 kartik@tin: Started deploy [cxserver/deploy@4b74f03]: Update cxserver to 1693bcf
06:58 marostegui: Stop MySQL on db1034 to clone db1098:3317 - T178359
06:55 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1034 - T178359 ! (duration: 00m 43s)
06:48 marostegui: Fix dbstore1002 replication
06:20 marostegui: Deploy schema change on db1090 (s2) - T174569
06:20 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1090 - T174569 (duration: 00m 43s)
06:15 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1103:3312 - T174569 (duration: 00m 44s)
02:29 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.10) (duration: 06m 36s)
00:29 mutante: bast4001 - removing ganglia aggregators, package, config...
2017-12-04
23:44 mutante: bast3002 - killall -u ganglia to kill all aggregator procs, apt-get remove --purge ganglia-monitor, rm -rf /etc/ganglia, rm -rf /usr/lib/ganglia, apt-get autoremove
23:28 anomie@tin: Synchronized wmf-config/InitialiseSettings.php: Revert wgCommentTableSchemaMigrationStage change, breaks too much stuff (duration: 00m 44s)
22:08 brion: brion running requeueTranscodes.php on terbium to batch-run .mp3 output on Commons (T181749 )
21:27 ppchelko@tin: Finished deploy [cpjobqueue/deploy@c8bea2e]: (no justification provided) (duration: 00m 42s)
21:26 ppchelko@tin: Started deploy [cpjobqueue/deploy@c8bea2e]: (no justification provided)
21:26 awight@tin: Finished deploy [ores/deploy@6baed71]: Update ORES to 6baed71 (duration: 12m 54s)
21:25 eileen: re-enable dedupe job process control commit 461f482
21:13 awight@tin: Started deploy [ores/deploy@6baed71]: Update ORES to 6baed71
21:07 gehel@tin: Finished deploy [kartotherian/deploy@6e223df]: testing new kartotherian packaging on maps-test2003 (duration: 00m 20s)
21:07 gehel@tin: Started deploy [kartotherian/deploy@6e223df]: testing new kartotherian packaging on maps-test2003
20:07 anomie@tin: Synchronized wmf-config/InitialiseSettings.php: Revert wgCommentTableSchemaMigrationStage change, breaks too much stuff (duration: 00m 43s)
19:31 legoktm@tin: Synchronized wmf-config/InitialiseSettings.php: Set wgCommentTableSchemaMigrationStage = MIGRATION_WRITE_BOTH on test wikis (T166733 ) (duration: 00m 43s)
19:18 gehel@tin: Finished deploy [kartotherian/deploy@e166d87]: testing new kartotherian packaging on maps-test2003 (duration: 00m 03s)
19:18 gehel@tin: Started deploy [kartotherian/deploy@e166d87]: testing new kartotherian packaging on maps-test2003
19:17 legoktm@tin: Synchronized wmf-config/InitialiseSettings.php: Set $wgRestrictionMethod = 'firejail'; everywhere (T173370 ) (duration: 00m 43s)
19:11 legoktm@tin: Synchronized static/images/mobile/copyright/: Add converted copyright svg images as png files - https://gerrit.wikimedia.org/r/#/c/394820/ (duration: 00m 43s)
19:06 legoktm@tin: Synchronized wmf-config/InitialiseSettings-labs.php: beta: Add ORES filter thresholds for simplewiki (duration: 00m 43s)
18:46 hoo: Started dumpwikidatajson.sh on snapshot1007 (T181385 )
18:40 hoo: Ran scap pull on snapshot1001 (T181385 )
18:35 hoo@tin: Synchronized php-1.31.0-wmf.10/includes/objectcache/ObjectCache.php: Only send statsd data for WAN cache in non-CLI mode (T181385 ) (duration: 00m 44s)
18:28 godog: bootstrap restbase1014-a - T179422
18:04 gehel@tin: Finished deploy [wdqs/wdqs@2873745]: wdqs GUI update (duration: 02m 10s)
18:02 gehel@tin: Started deploy [wdqs/wdqs@2873745]: wdqs GUI update
17:41 demon@tin: Synchronized php-1.31.0-wmf.10/extensions/CirrusSearch/maintenance/: fix some deprecated spam (duration: 00m 44s)
17:40 ejegg: disabled CiviCRM bounce processing again
17:12 demon@tin: Pruned MediaWiki: 1.31.0-wmf.6 (duration: 02m 29s)
17:08 demon@tin: Pruned MediaWiki: 1.31.0-wmf.5 (duration: 02m 58s)
16:54 demon@tin: Synchronized docroot/noc/conf/dblists: double checking symlink move (duration: 00m 44s)
16:44 ejegg: re-enabled CiviCRM bounced mail processing
16:41 marostegui: Deploy schema change on db1053 (s2) - T174569
16:40 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1103:3312 - T174569 (duration: 00m 45s)
16:39 demon@tin: Finished scap: docroot/noc/conf/ Adding new dblist symlink (duration: 02m 36s)
16:39 marostegui: Deploy schema change on db1103:3312 - T174569
16:37 demon@tin: Started scap: docroot/noc/conf/ Adding new dblist symlink
16:37 demon@tin: scap aborted: docroot/noc/conf/dblists (duration: 00m 02s)
16:37 demon@tin: Started scap: docroot/noc/conf/dblists
16:34 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1105:3312 - T174569 (duration: 00m 45s)
16:32 demon@tin: Finished scap: docroot/noc/conf/ drop some dangling symlinks (duration: 03m 53s)
16:28 demon@tin: Started scap: docroot/noc/conf/ drop some dangling symlinks
16:11 herron: re-enabling puppet agents in eqiad
16:07 demon@tin: Finished scap: Revert "Special:Preferences: Use OOjs UI" and follow-ups (duration: 21m 19s)
16:05 herron: re-enabling puppet agents in codfw
15:46 demon@tin: Started scap: Revert "Special:Preferences: Use OOjs UI" and follow-ups
15:31 herron: temporarily disabling puppet agents in eqiad and codfw while puppetdb catches up with command queue
15:29 herron: disabling ircecho temporarily
15:24 _joe_: restarting puppetdb on nihal, will cause puppet failures
15:23 jmm@puppetmaster1001: conftool action : set/pooled=yes; selector: mw1259.eqiad.wmnet
15:18 demon@tin: Synchronized wmf-config/CommonSettings-labs.php: no-op (duration: 00m 45s)
15:01 godog: reimage restbase1014 - T179422
14:51 herron: cutting over all production puppet agents to codfw puppet 4 masters via dns
14:50 Amir1: deployed backward compatibility of entity compact diff transmit T113468
14:49 ladsgroup@tin: Synchronized php-1.31.0-wmf.10/extensions/Wikibase: (no justification provided) (duration: 01m 41s)
14:37 gehel@tin: Finished deploy [kartotherian/deploy@e166d87]: dummy kartotherian deployment to test udp2log config change - T175242 (duration: 00m 03s)
14:37 gehel@tin: Started deploy [kartotherian/deploy@e166d87]: dummy kartotherian deployment to test udp2log config change - T175242
14:30 elukey: reboot druid100[23] for kernel updates
14:28 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Localize sitename and meta NS for wawiktionary (T181782) (duration: 00m 46s)
14:01 elukey: reboot analytics106* (hadoop worker nodes) for kernel+jvm updates - T179943
13:24 paravoid: upgrading bast2001 to stretch
13:22 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1105:3312 - T174569 (duration: 00m 45s)
13:20 marostegui: Deploy schema change on db1105:3312 (s2) - T174569
13:05 marostegui: Compress s6 on db1098 - T178359
12:44 gehel@tin: Finished deploy [kartotherian/deploy@e166d87]: testing new kartotherian packaging on maps-test2003 (duration: 00m 22s)
12:43 gehel@tin: Started deploy [kartotherian/deploy@e166d87]: testing new kartotherian packaging on maps-test2003
11:06 jdrewniak@tin: Synchronized portals: Wikimedia Portals Update: Bumping portals to master (T128546) (duration: 00m 45s)
11:05 jdrewniak@tin: Synchronized portals/prod/wikipedia.org/assets: Wikimedia Portals Update: Bumping portals to master (T128546) (duration: 00m 45s)
10:37 jynus@tin: Synchronized wmf-config/db-codfw.php: Repool db2085 (duration: 00m 43s)
10:06 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Fully pool db1096:3316 - T178359wq! (duration: 00m 45s)
09:55 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Remove db1099:3318 from s5 (duration: 00m 44s)
09:51 godog: bootstrap restbase1012-c - T179422
09:32 godog: clear erroneous table metrics from graphite1003 / graphite2002 - T181689
09:24 elukey: reboot analytics105* (hadoop worker nodes) for kernel+jvm updates - T179943
09:19 jynus: rebooting mariadb at labsdb1005
09:12 moritzm: reimaging mw1259 (video scaler) to stretch, will be kept disabled initially (some controlled live tests following)
08:57 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase traffic for db1096:3315 and 3316 - T178359 (duration: 00m 45s)
08:45 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase traffic for db1096:3316 - T178359 (duration: 00m 45s)
08:44 moritzm: updating tor on radium to 0.3.1.9
08:41 moritzm: updating tor packages to 0.3.1.9
08:30 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase traffic for db1096:3315 and pool db1096:3316 - T178359 (duration: 00m 45s)
08:12 marostegui@tin: Synchronized wmf-config/db-codfw.php: Pool db1096:3315 - T178359 (duration: 00m 44s)
08:11 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Pool db1096:3315 - T178359 (duration: 00m 45s)
07:53 moritzm: installing curl security updates
07:17 marostegui: Compress s1 on db1099 - T178359
07:08 marostegui: Stop MySQL on db1044 as it will be decommissioned - T181696
07:05 _joe_: playing with puppetdb status for ores2003 (deactivating/reactivating node)
06:40 marostegui: Stop MySQL on db1098 to clone db1096.s6 - T178359
06:39 marostegui@tin: Synchronized wmf-config/db-codfw.php: Remove db1044 from config as it will be decommissioned - T181696 (duration: 00m 45s)
06:38 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Remove db1044 from config as it will be decommissioned - T181696 (duration: 00m 45s)
06:34 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1098 - T178359 (duration: 00m 46s)
06:21 marostegui: Deploy alter table on s3 master (db1075) without replication - T174569
02:32 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.10) (duration: 06m 28s)
2017-12-03
15:33 ejegg: disabled CiviCRM bounce processing job
12:17 akosiaris: empty ganeti1006, it had issues this morning per T181121
12:06 marostegui: Fix dbstore1002 replication
07:44 akosiaris: ran puppet on conf2002, etcdmirror-conftool-eqiad-wmnet got started again
05:11 andrewbogott: deleting files on labsdb1003 /srv/tmp older than 30 days
03:57 no_justification: gerrit2001: icinga is flapping on the gerrit process/systemd check, but this is kind of known (not sure why it's doing this all of a sudden). It's not letting me acknowledge it, but it's fine/harmless. Cf T176532
2017-12-02
17:55 marostegui: Reboot db1096.s5 to pick up the correct innodb_buffer_pool size after finishing compressing s5 - T178359
03:51 hoo: Ran "scap pull" on snapshot1001, after final T181385 tests
00:03 mutante: tried one more time on db2028,db2029, both trusty. on db2028: gmond was running as user ganglia-monitor, failed, had to manually kill the process, run puppet again then ok. on db2029, gmond was running as "499" but puppet just ran and removed it without manual intervention. (T177225 )
2017-12-01
23:15 urandom: starting cassandra bootstrap, restbase1012-b - T179422
21:49 mutante: db2029 - removing ganglia-monitor, testing to kill gmond, running puppet to figure out how to cleanly remove it on trusty
21:12 mutante: db2023 killed gmond (ganglia-monitor) process manually which was still running even though ganglia-monitor package was removed and caused puppet breakage (it seems only on trusty). after that puppet run is clean again and ganglia removed. (T177225 ) (https://gerrit.wikimedia.org/r/#/c/394647/1 )
20:18 awight@tin: Started deploy [ores/deploy@9afbf14]: (non-production) Test ORES deployment to ores100*
20:17 awight@tin: Finished deploy [ores/deploy@9afbf14]: (non-production) Test ORES deployment to ores1001 (duration: 02m 31s)
20:15 awight@tin: Started deploy [ores/deploy@9afbf14]: (non-production) Test ORES deployment to ores1001
20:03 aaron@tin: Synchronized php-1.31.0-wmf.10/includes/libs/objectcache/WANObjectCache.php: f096d0b465b75d - temp logging for statsd spam (duration: 00m 45s)
18:59 demon@tin: Synchronized wmf-config/CommonSettings-labs.php: no-op (duration: 00m 46s)
18:22 mutante: Phabricator: restarting Apache for php-curl update
18:21 _joe_: restarting apache2 on the codfw puppetmasters
18:06 marktraceur@tin: Synchronized php-1.31.0-wmf.10/extensions/UploadWizard/resources/controller/uw.controller.Deed.js: (no justification provided) (duration: 00m 46s)
17:49 mutante: phab2001 - restarted apache
17:33 herron: stopped ircecho on einsteinium
17:00 awight@tin: Unlocked for deployment [ores/deploy]: Don't deploy while we're messing with git-lfs (duration: 00m 14s)
17:00 awight@tin: Locking from deployment [ores/deploy]: Don't deploy while we're messing with git-lfs (planned duration: 16666666666m 39s)
17:00 awight@tin: Locking from deployment [ores/deploy]: Don't deploy while we're messing with git-lfs (planned duration: -1m 59s)
16:59 awight@tin: Unlocked for deployment [ores/deploy]: Don't deploy while we're messing with git-lfs (duration: 00m 07s)
16:59 awight@tin: Locking from deployment [ores/deploy]: Don't deploy while we're messing with git-lfs (planned duration: 60m 00s)
16:34 jynus: stopping db2092 to clone s1 to db2085
16:24 urandom: starting cassandra bootstrap, restbase1012-a -- T179422
15:27 godog: bounce uwsgi on labmon1001 - stuck
15:21 moritzm: installing nspr security updates on trusty
15:17 moritzm: installing ffmpeg security updates
15:14 gehel@tin: Finished deploy [kartotherian/deploy@df7ebff]: testing new kartotherian packaging on maps-test2003 (duration: 00m 20s)
15:14 jynus@tin: Synchronized wmf-config/db-codfw.php: Undeploy db2092, use db2085 for s1 (duration: 00m 45s)
15:14 gehel@tin: Started deploy [kartotherian/deploy@df7ebff]: testing new kartotherian packaging on maps-test2003
15:14 moritzm: installing libxcursor security updates on trusty
15:09 jynus@tin: Synchronized wmf-config/db-eqiad.php: Undeploy db2092, use db2085 for s1 (duration: 00m 45s)
14:24 awight@tin: Finished deploy [ores/deploy@532bd0b]: (non-production) Update ORES on new cluster (duration: 02m 06s)
14:22 awight@tin: Started deploy [ores/deploy@532bd0b]: (non-production) Update ORES on new cluster
14:22 akosiaris: upload apertium-crh-tur_0.3.0~r83159-1+wmf1 to apt.wikimedia.org/jessie-wikimedia component main. T181465
14:10 herron: cutting all puppet service records over to codfw puppet 4 masters
12:44 elukey: reboot druid1001 for kernel+jvm updates - T179943
12:11 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2055 (duration: 00m 45s)
11:59 jynus: restarting and upgrading mysql on labsdb1004
11:28 jynus: upgrading and restarting dbstore2001
11:14 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db2055 (duration: 00m 46s)
11:08 marostegui: Stop MySQL on db2055 for testing
11:04 marostegui: Update MySQL on db1039 for testing
10:57 elukey: reboot analytics1028 for kernel + jvm updates (Hadoop HDFS journalnode) - T179943
10:56 godog: delete docker diskspace metrics from labs - T181476
10:21 godog: initial purge of old table metrics from graphite2002 - T181689
10:18 moritzm: reimaging mw1259 (video scaler) to stretch, will be kept disabled initially (with some live tests starting next week)
09:55 akosiaris: run memtester 61G on ganeti1008 T181121
09:23 elukey: reboot analytics104* for kernel+jvm updates - T179943
09:23 akosiaris: powercycle ganeti1008 T181121 , it's largely unresponsive
08:41 marostegui: Remove db1046 and db1047 from tendril - T156844
08:40 elukey: reboot the remaining analytics103* hadoop workers to pick up kernel+jvm updates - T179943
08:28 akosiaris: empty ganeti1008, move VMs to ganeti1006 T181121
08:20 akosiaris: repool ganeti1005, ganeti1006 to empty ganeti1008 T181121
07:51 akosiaris: upload apertium-tur_0.2.0~r83161-1+wmf1, apertium-crh_0.2.0~r83161-1+wmf1 to apt.wikimedia.org/jessie-wikimedia component main. T181465
07:14 marostegui: Logging retroactively for the record, restarting MySQL on db1039
05:14 legoktm@tin: Synchronized wmf-config/InitialiseSettings.php: touch (duration: 00m 44s)
01:46 hoo: Ran scap pull on mwdebug1001 after T181385 testing
00:13 maxsem@tin: Synchronized wmf-config/InitialiseSettings.php: HTML5 sections be upon us! (duration: 00m 45s)
00:04 hoo: Killed all remaining Wikidata JSON/RDF dumpers, due to T181385 . This means no dumps this week!
2017-11-30
23:54 demon@tin: Synchronized wmf-config/extension-list-labs: no-op (duration: 00m 45s)
23:39 addshore@tin: Synchronized wmf-config: wdbuild: T173818 T177060 Add wikidata extensions to extension-list (duration: 00m 46s)
23:35 addshore@tin: Synchronized wmf-config: wdbuild: T173818 Remove Wikibase-buildentry.php config file (empty) (duration: 00m 46s)
23:34 addshore@tin: Synchronized wmf-config/Wikibase.php: wdbuild: T173818 Remove Wikibase-buildentry.php config file (empty) (duration: 00m 45s)
23:30 addshore@tin: Synchronized wmf-config: wdbuild: T173818 Remove wmgUseWikidataBuild (duration: 00m 46s)
23:29 addshore@tin: Synchronized wmf-config/Wikibase-buildentry.php: wdbuild: T173818 Remove wmgUseWikidataBuild (duration: 00m 45s)
23:21 addshore@tin: Synchronized wmf-config: wdbuild: T173818 wdbuild: Stop loading from build on ALL WIKIS The build is dead! Mwahahaaa (duration: 00m 47s)
23:18 bsitzmann@tin: Finished deploy [mobileapps/deploy@4305d96]: Update mobileapps to 4317ea5 (T181743 ) (duration: 04m 50s)
23:15 addshore@tin: Synchronized wmf-config: wdbuild: wdbuild: Stop loading from build on all wikis (except the one i really dont want to break) (duration: 00m 46s)
23:14 bsitzmann@tin: Started deploy [mobileapps/deploy@4305d96]: Update mobileapps to 4317ea5 (T181743 )
23:06 addshore@tin: Synchronized wmf-config: wdbuild: wdbuild: Stop loading from build on group1 (duration: 00m 46s)
22:55 addshore@tin: Synchronized wmf-config: wdbuild: wdbuild: Stop loading from build on group0 (duration: 00m 46s)
22:42 addshore: BETA ONLY was a lie on that last one ...
22:41 addshore@tin: Synchronized wmf-config: wdbuild: BETA ONLY wdbuild: Stop loading from build on test and testwikidata (duration: 00m 47s)
22:40 demon@tin: Pruned MediaWiki: 1.31.0-wmf.8 [keeping static files] (duration: 02m 33s)
22:27 addshore@tin: Synchronized wmf-config: wdbuild: BETA ONLY gerrit:394411 (duration: 00m 47s)
21:35 addshore@tin: Synchronized wmf-config: wdbuild: BETA ONLY (duration: 00m 47s)
21:31 jynus: upgrading labsdb1010 and restarting mariadb
21:23 mutante: powercycling kafka1018 (was down in Icinga and saw in SAL: reboot kafka10[12-22] for kernel + jvm updates - T179943 )
21:18 jynus: upgrade and restart db2078
21:14 addshore@tin: Synchronized wmf-config: wdbuild: BETA ONLY (duration: 00m 47s)
21:01 addshore@tin: Synchronized wmf-config: wdbuild: T173818 : add switch to ease killing (again) (duration: 00m 46s)
20:59 addshore@tin: Synchronized wmf-config/InitialiseSettings.php: wdbuild: T173818 : add switch to ease killing (again) (duration: 00m 45s)
20:49 XenoRyet: Updated tools from 6e604fd to 626fe02
20:40 bd808: Sending Toolforge survey final reminder emails from silver for T177126
20:32 addshore@tin: Synchronized wmf-config: REVERT wdbuild: T173818 : add switch to ease killing (duration: 00m 47s)
20:30 addshore@tin: Synchronized wmf-config: wdbuild: T173818 : add switch to ease killing (duration: 00m 47s)
20:13 ejegg: re-adjusted Civi job timings. QC every odd min for 105 sec, TY every even min for 70 sec after 45 sec delay
20:12 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group2 to wmf.10
19:55 ejegg: 105 seconds for each job
19:55 ejegg: adjusted timings of Civi jobs to let TY and QC run concurrently
19:37 thcipriani@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Revert "Temporary disable remex html" T178632 (duration: 00m 49s)
19:20 thcipriani@tin: Synchronized php-1.31.0-wmf.10/includes/htmlform/fields/HTMLMultiSelectField.php: SWAT: HTMLMultiSelectField: Allow formatting in section headings in OOUI mode T181698 (duration: 00m 49s)
19:12 thcipriani@tin: Synchronized wmf-config: SWAT: Enable MP3 uploads on Commons T120288 (duration: 00m 51s)
17:57 andrewbogott: upgrading labtestpuppetmaster2001 to puppet 4.8
17:47 moritzm: uploaded prometheus-openldap-exporter 0+git20171128-1 for jessie-wikimedia (T181511 )
17:06 herron: beginning cut over of esams to codfw puppet 4 masters
16:44 urandom: drop (erroneous) legacy tables from -ng cassandra cluster - T181689
16:12 elukey: drain and reboot analytics1031->39 to pick up jvm+kernel updates - T179943
15:59 jynus: setup prometheus with unix_socket on new server db1107 and db1108
15:19 gehel: restart blazegraph on wdqs1004
15:16 jynus@tin: Synchronized wmf-config/db-eqiad.php: Increase db1110 load (duration: 00m 48s)
15:11 herron: beginning cut over of ulsfo to codfw puppet 4 masters
14:50 zeljkof: EU SWAT finished
14:46 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable WikiLove Extension on pa.wiki (T178919) (duration: 00m 49s)
14:28 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Add import sources to de.wiki (T181695) (duration: 00m 49s)
14:20 zfilipin@tin: Synchronized php-1.31.0-wmf.10/extensions/ContentTranslation/modules/dashboard/ext.cx.dashboard.js: SWAT: Set default languages after fetching valid languages (duration: 00m 49s)
14:19 marostegui: Deploy schema change on s3 - db1078 - T174569
13:23 addshore@tin: Synchronized php-1.31.0-wmf.10/extensions/AdvancedSearch/modules/ext.advancedSearch.init.js: pre-swat: T181644 Force search profile advanced in AdvancedSearch gerrit:394297 (duration: 00m 48s)
13:22 addshore@tin: Synchronized php-1.31.0-wmf.8/extensions/AdvancedSearch/modules/ext.advancedSearch.init.js: pre-swat: T181644 Force search profile advanced in AdvancedSearch gerrit:394299 (duration: 00m 50s)
13:18 herron: cutting codfw puppet agents over to puppetmaster2001.codfw.wmnet
12:07 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Tackle s4 DB weights to make them more equal (duration: 00m 48s)
11:58 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase traffic for db1081 and reduce traffic for db1091 (duration: 00m 50s)
10:33 jynus: stopping replication on db1044 and db1095 (s3)
09:16 moritzm: installing exim security updates on stretch (jessie/trusty not affected)
09:14 elukey: drain and reboot analytics1029/1030 for jvm+kernel updates (Hadoop worker canaries)
09:05 godog: add 200G of space to graphite2002 carbon lv
08:25 moritzm: rolling restart of mw canaries to pick up curl security update
08:21 marostegui: Enable GTID on s8 eqiad hosts that do not have it enabled (db1109, db1104, db1101, db1092, db1087, db1063) - T177208
07:48 moritzm: installing curl security updates
07:01 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1056 (duration: 01m 18s)
06:48 marostegui: Deploy schema change on dbstore1001 s3 - T174569
06:26 marostegui: Deploy schema change on s3 db1077 - T174569
02:48 ebernhardson@tin: Finished deploy [search/mjolnir/deploy@7aa39b7]: (no justification provided) (duration: 03m 15s)
02:45 ebernhardson@tin: Started deploy [search/mjolnir/deploy@7aa39b7]: (no justification provided)
02:29 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.8) (duration: 08m 43s)
01:17 ejegg: updated fundraising tools from 6d4b6f3 to 6e604fd
01:06 twentyafterfour: finished phabricator upgrade, service is online and appears to be functioning normally
01:03 twentyafterfour: Starting phabricator upgrade & maintenance. Service will be offline for less than 5 minutes.
01:02 bsitzmann@tin: Finished deploy [mobileapps/deploy@fa2a877]: Update mobileapps to dcea7d3 (T181004 ) (duration: 06m 08s)
00:56 bsitzmann@tin: Started deploy [mobileapps/deploy@fa2a877]: Update mobileapps to dcea7d3 (T181004 )
00:53 maxsem@tin: Finished scap: Message updates for https://gerrit.wikimedia.org/r/#/c/394155/ (duration: 26m 38s)
00:48 ejegg: re-enabled CiviCRM jobs
00:42 ejegg: updated CiviCRM from 0f95f3e to e81228f
00:40 ejegg: disabled CiviCRM jobs
00:31 ejegg: updated fundraising dashboard from 6ee6567 to 1141317
00:27 maxsem@tin: Started scap: Message updates for https://gerrit.wikimedia.org/r/#/c/394155/
00:24 maxsem@tin: Synchronized php-1.31.0-wmf.10/extensions/ContentTranslation/: https://gerrit.wikimedia.org/r/#/c/394206/ (duration: 00m 52s)
00:21 ejegg: added weekly Ingenico audit processing job in makemissing mode
2017-11-29
23:20 eileen: update civicrm from a1022cf to 0f95f3e (Benevity import fix)
22:23 hashar: Nodepool had some troubles spawning new instances from 21:09 to 21:36, and took a while to recover. Issue similar to T170492 #3581822
21:51 SMalyshev: starting wikidata reindex (T181426 )
21:23 no_justification: docroot sync was for If1afa59a
21:21 demon@tin: Synchronized docroot/: (no justification provided) (duration: 00m 49s)
20:29 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group1 to wmf.10
20:26 demon@tin: Synchronized php: symlink bump (duration: 00m 48s)
20:05 urandom: restarting cassandra bootstrap of restbase1012-a (T179422 )
19:55 mutante: restarting gerrit to apply config change and set gitBasicAuth to true to unblock T171758 (gerrit:391865)
19:50 herron: beginning rolling cut over of eqiad scb hosts to codfw puppet 4 masters
19:15 maxsem@tin: Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/394059/3 (duration: 00m 49s)
18:57 herron: beginning cutover of codfw lvs2* systems to codfw puppet 4 masters. first standby nodes, then active nodes
18:12 marostegui: Compress s5 on db1096 - T178359
18:10 jynus@tin: Synchronized wmf-config/db-eqiad.php: Increase db1110 load (duration: 00m 53s)
18:05 awight@tin: Finished deploy [ores/deploy@532bd0b]: (non-production) Update ORES on new cluster (duration: 01m 11s)
18:04 awight@tin: Started deploy [ores/deploy@532bd0b]: (non-production) Update ORES on new cluster
18:03 herron: beginning cut over of codfw cp2* servers to codfw puppet 4 masters
17:40 jynus@tin: Synchronized wmf-config/db-eqiad.php: Pool db1110 with low load (duration: 00m 48s)
17:40 godog: bootstrapping restbase1012-a - T179422
17:19 herron: beginning cut over of cp200[12] to codfw puppet 4 masters
17:16 ejegg: increased time limit for donation queue consumer from 60 to 70 seconds, reduced thank you job from 55 to 45 seconds
17:03 ejegg: re-enabled Civi jobs
16:58 ejegg: disabled Civi jobs for Deadlock-retry update
16:46 jynus@tin: Synchronized wmf-config/db-codfw.php: Update db1110 ip (duration: 00m 50s)
16:43 jynus@tin: Synchronized wmf-config/db-eqiad.php: Update db1110 ip (duration: 00m 49s)
15:02 chasemp: purge old qcow2 images from under /home on labtestcontrol2001
15:00 awight: Restarting ORES celery workers manually, T181538
14:39 _joe_: reloading apache on puppetmasters to pick up the configuration changes
14:37 awight@tin: Started restart [ores/deploy@e58bfbf]: Restart ORES services (take 2), T181538
14:36 elukey: reboot druid100[456] for jvm+kernel updates - T179943
14:35 awight@tin: Finished deploy [ores/deploy@e58bfbf]: Restart ORES services, T181538 (duration: 00m 16s)
14:35 awight@tin: Started deploy [ores/deploy@e58bfbf]: Restart ORES services, T181538
14:25 zeljkof: EU SWAT finished
14:18 zfilipin@tin: Synchronized wmf-config/throttle.php: SWAT: Define throttle rule (T181367) (duration: 00m 49s)
14:14 herron: beginning cut over of cache::misc and codfw cache::canary cp servers to codfw puppet4 masters
14:13 moritzm: rebooting neodymium for kernel update to 4.9.51
14:12 godog: reimage restbase1012 - T179422
14:10 addshore@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT T180291 T180128 AdvancedSearch for arwiki (duration: 00m 49s)
14:07 addshore@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT T180128 AdvancedSearch for dewiki (duration: 00m 50s)
13:34 awight@tin: Finished deploy [ores/deploy@e58bfbf]: (non-production) Update ORES on new cluster (take 3) (duration: 16m 49s)
13:18 elukey: reboot kafka100[23] for jvm+kernel updates - T179943
13:17 awight@tin: Started deploy [ores/deploy@e58bfbf]: (non-production) Update ORES on new cluster (take 3)
12:34 jynus@tin: Synchronized wmf-config/db-eqiad.php: Setup s8 on eqiad, with no wikis (duration: 00m 48s)
12:24 jynus: scap pull on mwdebug1001
12:23 jynus@tin: Synchronized wmf-config/db-codfw.php: Pool db1096:3315 (duration: 00m 48s)
12:01 marostegui: Deploy schema change on db1072 (sanitarium master) on s3 with replication enabled to replicate to labs - T174569
11:30 elukey: reboot kafka1001 for kernel + jvm updates - T179943
10:15 akosiaris: disable puppet on oresrdb* for merging https://gerrit.wikimedia.org/r/#/c/394022/ . T181563
09:50 jynus@tin: Synchronized wmf-config/db-eqiad.php: Depool db1110 (duration: 00m 48s)
09:46 jynus@tin: Synchronized wmf-config/db-eqiad.php: Repool db1110 (duration: 00m 49s)
09:39 godog: upload cassandra-tools-wmf 1.0.2-1 - T181438
09:28 mobrovac@tin: Finished deploy [electron-render/deploy@94d27d7]: Update to electron v1.7.9 and start using the Charter font - T181200 (duration: 04m 03s)
09:24 mobrovac@tin: Started deploy [electron-render/deploy@94d27d7]: Update to electron v1.7.9 and start using the Charter font - T181200
09:04 godog: bootstrap restbase1007-c - T179422
08:41 moritzm: uploaded prometheus-openldap-exporter 0+git20171128-1 for jessie-wikimedia (T181511 )
08:17 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1096 - T178359 (duration: 00m 48s)
07:54 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Fully pool db1099:3318 and db1055 - T178359 (duration: 00m 48s)
07:33 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1099:3318 traffic - T178359 (duration: 00m 45s)
07:17 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Pool db1099:3318 with low weight - T178359 (duration: 00m 45s)
06:51 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase traffic for db1055 - T178359 (duration: 00m 48s)
06:42 marostegui@tin: Synchronized wmf-config/db-codfw.php: Add db1099:3311 and db1099:3318 to the config (depooled) T178359 (duration: 00m 48s)
06:42 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Add db1099:3311 and db1099:3318 to the config (depooled) T178359 (duration: 00m 49s)
06:29 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1055 with lower weight - T178359 (duration: 00m 50s)
06:17 ebernhardson@tin: Finished deploy [search/mjolnir/deploy@7aa39b7]: (no justification provided) (duration: 04m 27s)
06:12 ebernhardson@tin: Started deploy [search/mjolnir/deploy@7aa39b7]: (no justification provided)
03:39 ejegg: reduced CiviMail sample rate to 0.35
03:09 ejegg: reduced donation queue consumer time limit from 70 to 60 seconds, increased ty mail batch time limit from 45 to 55 seconds
02:50 ejegg: reduced donation queue consumer time limit from 75 to 70 seconds, increased ty mail batch time limit from 40 to 45 seconds
02:44 ejegg: reduced CiviMail record creation rate from 100% to 50%
02:24 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.8) (duration: 06m 08s)
01:25 mutante: snapshot1001 - closed idle screen session
01:22 mutante: analytics1003 - closed idle screen session
01:21 mutante: mw1276 - run "scap pull" to get in sync after hardware issue, then pooled again (T181397 )
01:09 mutante: restarting ircecho - it stopped talking
01:07 mutante: forcing puppet run on all labvirt* machines to clean out Icinga alerts
00:44 awight@tin: Synchronized php-1.31.0-wmf.10/extensions/ORES: Hotfix to mitigate cache stampeding, T181567 (duration: 00m 49s)
00:33 reedy@tin: Synchronized php-1.31.0-wmf.10/includes/logging/LogPager.php: Fix fatal on Special:Log T181565 (duration: 00m 48s)
00:32 awight@tin: Synchronized php-1.31.0-wmf.8/extensions/ORES: Hotfix to mitigate cache stampeding, T181567 (duration: 00m 50s)
00:18 Jamesofur: deleted 6 archived files from servers for legal compliance
00:08 ejegg: updated fundrasing dashboard from df94248 to 6ee6567
2017-11-28
23:58 hoo: Ran scap pull on mwdebug1001 after T181385 related testing
23:07 akosiaris@tin: Synchronized wmf-config/CommonSettings.php: T181538 (duration: 00m 49s)
22:31 akosiaris@tin: Synchronized wmf-config/CommonSettings.php: (no justification provided) (duration: 00m 49s)
22:31 akosiaris: deploy wmf-config/CommonSettings.php for ORES internal discovery URL, https://gerrit.wikimedia.org/r/#/c/393924/ T181538
21:41 akosiaris: disable ORES queue redis persistency by config set appendonly no on oresrdb1001
21:24 hoo: Manually killed all remaining Wikidata TTL (RDF) dumpers on snapshot1007. Some shards failed due to the db1110 depool.
21:23 hoo: Manually killed all remaining Wikidata JSON dumpers on snapshot1007. Some shards failed due to the db1110 depool.
20:56 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: removing aawiki from group0
20:42 gehel: repooling elastic2004 after RAID controller maintenance - T181412
20:42 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group0 to wmf.10
20:28 mutante: forcing puppet run on cache misc to revert "failover ORES to codfw"
20:19 demon@tin: Synchronized scap/plugins/prep.py: no-op (duration: 00m 48s)
20:16 demon@tin: Synchronized dblists/group0.dblist: adding some new wikis (duration: 00m 48s)
18:57 demon@tin: Finished scap: bootstrap wmf.10 (duration: 35m 20s)
18:51 demon@tin: Finished deploy [gerrit/gerrit@571cf4c]: deploying 2.15+ polygerrit style changes (duration: 00m 09s)
18:51 demon@tin: Started deploy [gerrit/gerrit@571cf4c]: deploying 2.15+ polygerrit style changes
18:40 akosiaris: revert weight changes for scb1001, scb1002 T181835
18:39 akosiaris@puppetmaster1001: conftool action : set/weight=10; selector: scb1002.eqiad.wmnet (tags: ['dc=eqiad', 'cluster=scb', 'service=ores'])
18:39 akosiaris@puppetmaster1001: conftool action : set/weight=10; selector: scb1001.eqiad.wmnet (tags: ['dc=eqiad', 'cluster=scb', 'service=ores'])
18:35 awight@tin: Finished deploy [ores/deploy@e58bfbf]: (non-production) Update ORES on new cluster (take 2) (duration: 04m 30s)
18:32 akosiaris: force puppet run on cache::misc boxes T181538
18:30 awight@tin: Started deploy [ores/deploy@e58bfbf]: (non-production) Update ORES on new cluster (take 2)
18:28 urandom: (re)bootstrapping cassandra, restbase1007-b - T179422
18:22 demon@tin: Started scap: bootstrap wmf.10
18:18 awight@tin: Finished deploy [ores/deploy@e58bfbf]: (non-production) Update ORES on new cluster (duration: 01m 41s)
18:17 awight@tin: Started deploy [ores/deploy@e58bfbf]: (non-production) Update ORES on new cluster
18:00 akosiaris: force stop celery-ores-worker on scb1001
17:58 akosiaris@puppetmaster1001: conftool action : set/weight=5; selector: scb1002.eqiad.wmnet (tags: ['dc=eqiad', 'cluster=scb', 'service=ores'])
17:58 akosiaris@puppetmaster1001: conftool action : set/weight=5; selector: scb1001.eqiad.wmnet (tags: ['dc=eqiad', 'cluster=scb', 'service=ores'])
17:46 urandom: decommissioning cassandra, restbase1007-b - T179422
17:42 urandom: restart cassandra, restbase1007, to pickup logstash java deps - T179422
16:45 ejegg: updated fundraising dashboard from d8c86e7 to df94248
16:25 bblack: mw1329 boot to PXE (should come up with new .66 IP)
15:39 jynus@tin: Synchronized wmf-config/db-eqiad.php: depool db1110 (duration: 00m 44s)
15:38 bblack: powered off mw1329
15:06 marostegui: Compress s5 on db1099
15:04 ladsgroup@tin: Synchronized wmf-config/InitialiseSettings.php: Revert "Comply wikidata with new ores thresholds" (duration: 00m 45s)
14:56 marostegui: Deploy schema change on s3 on dbstore1002 - T174569
14:45 gehel@tin: Finished deploy [kartotherian/deploy@cb9b1ef]: testing new kartotherian packaging on maps-test2003 (duration: 00m 02s)
14:45 gehel@tin: Started deploy [kartotherian/deploy@cb9b1ef]: testing new kartotherian packaging on maps-test2003
14:41 otto@tin: Finished deploy [eventlogging/analytics@c464b8c]: Fixing bug where userAgent set by client producer was not used T178440 (duration: 00m 02s)
14:40 otto@tin: Started deploy [eventlogging/analytics@c464b8c]: Fixing bug where userAgent set by client producer was not used T178440
14:40 otto@tin: Started deploy [eventlogging/analytics@c464b8c]: Fixing bug where userAgent set by client producer was not used
14:39 gehel@tin: Finished deploy [kartotherian/deploy@cb9b1ef]: testing new kartotherian packaging on maps-test2003 (duration: 00m 18s)
14:39 gehel@tin: Started deploy [kartotherian/deploy@cb9b1ef]: testing new kartotherian packaging on maps-test2003
14:37 marostegui: Stop MySQL on db1055 to clone db1099:3311 - T178359
14:34 gehel@tin: Finished deploy [kartotherian/deploy@55c5da4]: (no justification provided) (duration: 00m 11s)
14:34 gehel@tin: Started deploy [kartotherian/deploy@55c5da4]: (no justification provided)
14:21 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1055 - T178359 (duration: 00m 44s)
14:17 elukey: reboot kafka10[12-22] for kernel + jvm updates - T179943
14:03 elukey: reboot kafka200[123] for kernel + jvm updates - T179943
12:25 godog: cleanup wanobjectcache metrics with hashes - T178531
12:14 godog: bounce carbon-frontend-relay after https://gerrit.wikimedia.org/r/393749
11:25 hoo: Manually re-started Wikidata JSON dumps on snapshot1007, got stuck after db1082 went down.
10:55 jynus: stop db1044 replication for db1095 master switchover
10:49 jynus@tin: Synchronized wmf-config/db-eqiad.php: Repool db1082. Depool db1044 (duration: 00m 44s)
10:44 mobrovac@tin: Synchronized wmf-config/jobqueue.php: Process wikibase-addUsagesForPage only via EventBus - T175212 (duration: 00m 44s)
10:08 ppchelko@tin: Finished deploy [cpjobqueue/deploy@c4b9e16]: Enable wikibase-addUsagesForPage with low concurrency (duration: 00m 29s)
10:07 ppchelko@tin: Started deploy [cpjobqueue/deploy@c4b9e16]: Enable wikibase-addUsagesForPage with low concurrency
09:35 jynus: restart db1087 for maintenance
09:22 jynus: restart db1082
09:19 godog: unmask and restart restbase1007-b - T179422
09:15 moritzm: installibg libxml-libxml-perl security updates on trusty (Debian already fixed)
09:11 ppchelko@tin: Finished deploy [cpjobqueue/deploy@b0b1793]: Remove double-processing created for consumer group renaming (duration: 00m 28s)
09:10 ppchelko@tin: Started deploy [cpjobqueue/deploy@b0b1793]: Remove double-processing created for consumer group renaming
09:04 marostegui: Drop database log from dbstore1002 - T156844
08:57 ppchelko@tin: Finished deploy [cpjobqueue/deploy@2212086]: Move enabled jobs config to vars.yaml (duration: 00m 39s)
08:56 ppchelko@tin: Started deploy [cpjobqueue/deploy@2212086]: Move enabled jobs config to vars.yaml
08:38 jynus@tin: Synchronized wmf-config/db-eqiad.php: depool db1082 (duration: 00m 44s)
08:02 mobrovac: bootstrap restbase1007-b - T179422
06:39 marostegui: Stop MySQL on db1099 to copy its content to dbstore1001 and reimage it as multi-instance - T178359
05:23 mutante: labtestpuppetmaster2001 - manually purging ganglia things since puppet is disabled
05:05 mutante: labweb1001/labweb1002: manually purging ganglia package/config/service/unit files because puppet is disabled there (T177225 )
04:08 eileen: update process-control to b40eaec , disabling 3 jobs to reduce load for Big english starting: dedupe_civicrm_contacts, omnimail_recipient_load, omnimail_recipient_load_backfill
03:58 demon@tin: Pruned MediaWiki: 1.31.0-wmf.4 (duration: 03m 13s)
03:31 mutante: labservices1001/1002,labtestservices2001 - remove pdns_gmetric cronjobs causing cron spam after ganglia decom from lab*
02:20 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.8) (duration: 05m 23s)
2017-11-27
22:29 mutante: labtestnet2001 - re-enabled puppet to decom ganglia, no errors
22:22 mutante: decom'ing Ganglia from all labtest* hosts - packages removed by puppet etc, might cause some false positives in Icinga but it's ok (T177225 )
22:21 otto@tin: Finished deploy [eventlogging/eventbus@e024af3]: deploy revert checked out at 3df06ab (we caught one). T180017 (duration: 00m 10s)
22:21 otto@tin: Started deploy [eventlogging/eventbus@e024af3]: deploy revert checked out at 3df06ab (we caught one). T180017
21:59 awight: clearing ORES threshold caches for ruwiki, frwiki, wikidatawiki.
21:59 awight@tin: Synchronized wmf-config/InitialiseSettings.php: Reenable ORES on frwiki, ruwiki, and wikidata; T181006 (duration: 00m 45s)
21:53 awight@tin: Finished deploy [ores/deploy@e58bfbf]: (codfw) Update ORES for impossible threshold handling and new wikidata editquality model, T179711 T180686 T180450 (duration: 05m 06s)
21:47 awight@tin: Started deploy [ores/deploy@e58bfbf]: (codfw) Update ORES for impossible threshold handling and new wikidata editquality model, T179711 T180686 T180450
21:46 awight@tin: Finished deploy [ores/deploy@e58bfbf]: Update ORES for impossible threshold handling and new wikidata editquality model, T179711 T180686 T180450 (duration: 12m 55s)
21:33 awight@tin: Started deploy [ores/deploy@e58bfbf]: Update ORES for impossible threshold handling and new wikidata editquality model, T179711 T180686 T180450
21:23 awight@tin: Synchronized php-1.31.0-wmf.8/extensions/ORES: ORES error handling for bad thresholds, T181191 (duration: 00m 46s)
21:11 awight@tin: Synchronized wmf-config/InitialiseSettings.php: Temporarily disable ORES on wikidata (duration: 00m 45s)
21:10 awight: Previous ârollback ORESâ was from a stale screen session, no actions were taken today.
21:08 awight@tin: Finished deploy [ores/deploy@82a13ae]: Rollback ORES (take 3); 181006 (duration: 9942m 42s)
20:21 demon@tin: i lied, that was for wgStyleVersion removal
20:20 demon@tin: Synchronized wmf-config/CommonSettings.php: cli sapi fixes (duration: 00m 45s)
20:17 ejegg: updated payments-wiki from fea6b37 to f594dfa
20:03 kaldari@tin: Synchronized wmf-config/InitialiseSettings-labs.php: (no justification provided) (duration: 00m 45s)
20:02 kaldari@tin: Synchronized wmf-config/CommonSettings.php: (no justification provided) (duration: 00m 45s)
20:01 kaldari@tin: Synchronized wmf-config/InitialiseSettings.php: (no justification provided) (duration: 00m 46s)
19:54 bblack: crN-ulsfo: remove lvs400[1-4] from PyBal BGP neighbors list
19:42 catrope@tin: Synchronized php-1.31.0-wmf.8/resources/src/mediawiki.rcfilters/mw.rcfilters.UriProcessor.js: T181100 (duration: 00m 45s)
19:41 demon@tin: Synchronized wmf-config/InitialiseSettings.php: babel officewiki thing (duration: 00m 45s)
19:39 catrope@tin: Synchronized php-1.31.0-wmf.8/includes/specials/SpecialRecentchangeslinked.php: T181100 (duration: 00m 45s)
19:34 otto@tin: Finished deploy [eventlogging/eventbus@3df06ab]: Temp deploy to kafka1001 only to catch bug: T180017 (duration: 00m 12s)
19:34 otto@tin: Started deploy [eventlogging/eventbus@3df06ab]: Temp deploy to kafka1001 only to catch bug: T180017
19:21 catrope@tin: Synchronized wmf-config/Wikibase.php: T176903 (duration: 00m 45s)
19:17 AaronSchulz: Removed more bogus md5 wanobjecache/ metric from graphite[12]001
19:11 catrope@tin: Synchronized wmf-config/InitialiseSettings.php: T179241 (duration: 00m 45s)
19:02 volans: restarted ircecho
18:51 ejegg: updated payments-wiki from 6b3019b to fea6b37
18:50 dcausse: elastic/cirrus: reindexing english group2 wikis: T179945
18:45 volans: stopped ircecho temporary to avoid spam
18:14 XioNoX: pushing v6 gre firewall filter
18:12 gehel@tin: Finished deploy [wdqs/wdqs@7ad20f3]: (no justification provided) (duration: 02m 10s)
18:11 gehel: deploying blazegraph + GUI updates
18:10 gehel@tin: Started deploy [wdqs/wdqs@7ad20f3]: (no justification provided)
18:08 hoo: Killed truthy nt dumpers stuck at 100% CPU (from last week) on snapshot1007 (T181385 )
18:06 akosiaris: poweroff ganeti1005, ganeti1006 T181121
18:05 bd808: Sending Toolforge survey 1 week reminder emails from silver for T177126
16:07 bblack: lvs400x - puppet disabled for https://gerrit.wikimedia.org/r/#/c/393610
15:52 herron: beginning canary cutover/deployment of codfw prometheus servers to codfw puppet 4 puppetmasters
15:19 chasemp: disable puppet across cloud things for cleanup
15:11 herron: beginning canary cutover/deployment of codfw elasticsearch servers to codfw puppet 4 puppetmasters
14:22 zeljkof: EU SWAT finished
14:19 zfilipin@tin: Synchronized wmf-config/throttle.php: SWAT: IP cap lift for Semaine contributive 2017-2018 (T181360) (duration: 00m 45s)
14:10 addshore@tin: Synchronized php-1.31.0-wmf.8/extensions/AdvancedSearch: SWAT AdvancedSearch T181175 T181222 , adjust links in beta section & fix usability of mobile search (duration: 00m 46s)
13:31 moritzm: installing nspr security updates on trusty
13:22 elukey: remove eventlogging replication support (log database) from dbstore1002 - T156844
13:17 moritzm: updating nginx on francium
13:01 Pchelolo: stop cpjobqueue in eqiad for backlog accumulation
12:44 Pchelolo: started parsoid linter script to generate load on cpjobqueue (python3 parsoid_reparse.py http://parsoid.discovery.wmnet:8000 --sitematrix --linter-only --skip-closed https://commons.wikimedia.org/w/api.php )
12:44 addshore@tin: Synchronized wmf-config/InitialiseSettings-labs.php: gerrit:393587 BETA wmgMonologChannels FileImporter => debug (duration: 01m 01s)
12:43 jmm@puppetmaster1001: conftool action : set/pooled=inactive; selector: mw1276.eqiad.wmnet
12:35 moritzm: powercycling mw1276
12:21 addshore@tin: Synchronized wmf-config/CommonSettings-labs.php: gerrit:393582 LABS ONLY Actually call wfLoadExtension for FileExporter & Importer on beta BETA (T181383 ) (duration: 00m 44s)
12:14 marostegui: Stop replication on db1109 to test table rename for s5/s8 failover
12:09 ppchelko@tin: Finished deploy [cpjobqueue/deploy@47d27dc]: Enable keep-alive T181007 (duration: 01m 03s)
12:08 ppchelko@tin: Started deploy [cpjobqueue/deploy@47d27dc]: Enable keep-alive T181007
11:47 addshore@tin: Synchronized wmf-config/CommonSettings-labs.php: gerrit:393577 LABS ONLY Enable FileImporter & FileExporter on BETA PT2/2 (T181383 ) (duration: 00m 45s)
11:46 addshore@tin: Synchronized wmf-config/InitialiseSettings-labs.php: gerrit:393577 LABS ONLY Enable FileImporter & FileExporter on BETA PT1/2 (T181383 ) (duration: 00m 45s)
11:44 addshore@tin: Synchronized wmf-config/extension-list-labs: gerrit:393576 BETA ONLY Add FileExporter & FileImporter to extension-list-labs (duration: 00m 45s)
11:27 hashar: contint1001 enable Icinga "Disks space" notification again. It is no more complaing about Docker partitions | ping mutante | T178454
11:17 godog: bootstrap restbase1007-a - T179422
11:05 jdrewniak@tin: Synchronized portals: Wikimedia Portals Update: Bumping portals to master (T128546) (duration: 00m 45s)
11:05 jdrewniak@tin: Synchronized portals/prod/wikipedia.org/assets: Wikimedia Portals Update: Bumping portals to master (T128546) (duration: 00m 46s)
10:53 moritzm: installing postgresql-common security updates
10:08 ppchelko@tin: Finished deploy [cpjobqueue/deploy@e35aa05]: Revert using keep-alive (duration: 00m 22s)
10:08 ppchelko@tin: Started deploy [cpjobqueue/deploy@e35aa05]: Revert using keep-alive
09:42 ppchelko@tin: Finished deploy [cpjobqueue/deploy@b570d4e]: Make http agent use keep-alive (duration: 00m 48s)
09:41 ppchelko@tin: Started deploy [cpjobqueue/deploy@b570d4e]: Make http agent use keep-alive
09:14 godog: reimage restbase1007 - T179422
08:55 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Fully pool db1097:3314 - T178359 (duration: 00m 43s)
08:40 moritzm: installing openjdk security updates on hadoop, druid and kafka clusters
08:27 marostegui: Deploy schema change on dbstore1002 and dbstore1001 - T174569
08:22 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1097:3314 weight - T178359 (duration: 00m 45s)
07:20 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1097:3314 weight - T178359 (duration: 00m 45s)
07:10 marostegui: Stop MySQL on db1021 as it will be decommissioned - T181378
06:53 marostegui@tin: Synchronized wmf-config/db-codfw.php: Remove db1021 from the config as it will be decommissioned - T181378 (duration: 00m 44s)
06:52 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Remove db1021 from the config as it will be decommissioned - T181378 (duration: 00m 45s)
06:27 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Pool db1097:3314 with low weight - T178359 (duration: 00m 46s)
02:22 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.8) (duration: 05m 44s)
2017-11-25
19:31 marostegui: Set 32:3 disk to offline on db1051
16:09 kartik@tin: Finished deploy [cxserver/deploy@11aecc9]: Update cxserver to 0c242c0 , Pin service-runner to 2.4.2 (duration: 03m 29s)
16:05 kartik@tin: Started deploy [cxserver/deploy@11aecc9]: Update cxserver to 0c242c0 , Pin service-runner to 2.4.2
16:05 godog: unban statsd traffic from scb on graphite1001 - T181333
15:45 ppchelko@tin: Finished deploy [cpjobqueue/deploy@e35aa05]: Rollback. Disable GC metric reporting T181333 (duration: 00m 31s)
15:45 ppchelko@tin: Started deploy [cpjobqueue/deploy@e35aa05]: Rollback. Disable GC metric reporting T181333
15:37 volans: restarted statsd-proxy on graphite1001 (died during investigation) T181333
14:34 godog: rolling restart of cxserver to alleviate metrics leak - T181333
14:26 godog: restart cxserver on scb100[34] - T181333
14:10 godog: roll-restart cpjobqueue to alleviate metrics leak - T181333
13:40 godog: drop incoming statsd from scb to graphite1001 temporarily - T181333
08:32 ariel@tin: Finished deploy [dumps/dumps@ec21673]: fix abstracts recombine job (duration: 00m 02s)
08:32 ariel@tin: Started deploy [dumps/dumps@ec21673]: fix abstracts recombine job
2017-11-24
13:52 moritzm: removing git packages from jessie-wikimedia/experimental (replaced by component/git)
13:24 moritzm: installing openjpeg2 updates (original security already got installed after initial release, but there was a binNMU for amd64)
13:17 marostegui: Stop replication on db1097 to reimport and recompress commonswiki.watchlist
12:54 jynus: reenabling puppet on db1071
12:50 jynus: resetting replication on es1011 for consistency with other replica sets
12:40 jynus: setting up s8 topology on eqiad
12:38 jynus: disable puppet on db1071 and stop local s5 heartbeat there
12:32 reedy@tin: Synchronized docroot/mediawiki/keys/: Fixup keys (duration: 00m 45s)
12:13 marostegui: Enable GTID on es2018 - T181293
11:57 marostegui: Disable puppet on es2018 - T181293
11:50 jynus@tin: Synchronized wmf-config/db-codfw.php: depool es2018 T181293 (duration: 00m 45s)
11:48 marostegui: Reboot es2018 after full-upgrade - T181293
11:25 marostegui: Restart mysql on es2018
11:25 jynus@tin: Synchronized wmf-config/db-eqiad.php: db2085:3318, db2086:3318 (duration: 00m 43s)
11:24 jynus@tin: Synchronized wmf-config/db-codfw.php: Pool db2038, db2085:3318, db2086:3318 (duration: 00m 45s)
10:55 marostegui: Restart MySQL on db2086 to move s5 to s8
10:37 jynus: cancelling db2085 restart, only doing mysql:s5
10:35 jynus: restarting db2085 (including both s5 and s3 instances)
10:33 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool all future s8 slaves for a topology change - T177208 (duration: 00m 45s)
10:22 moritzm: installing ca-cerfificates updates on trusty hosts
09:57 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase traffic for db1097:3315 and db1092 (duration: 00m 45s)
09:50 jynus: restarting db2045
09:32 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase traffic for db1101:3318 db1097:3315 abd db1092 (duration: 00m 45s)
08:49 marostegui: Stop MySQL on db1092
08:48 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase traffic for db1101:3318 in s5 to warm it up and depool db1092 - T178359 T177208 (duration: 00m 45s)
08:40 moritzm: installing java security updates on notebook* hosts
08:38 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Slowly pool db1097:3315 - T178359 (duration: 00m 45s)
08:37 marostegui@tin: Synchronized wmf-config/db-codfw.php: Slowly pool db1097:3315 - T178359 (duration: 00m 45s)
08:20 moritzm: installing java security updates on meitnerium
08:15 moritzm: installing java security updates on stat1004
08:14 hashar: restarting jenkins on contint1001 for a java update
08:07 elukey: re-enabling piwik on bohrium (only VM running on ganeti1006 atm) after mysql tables restore completed
06:47 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Slowly pool db1101:3318 in s5 to warm it up - T178359 (duration: 00m 45s)
06:46 marostegui@tin: Synchronized wmf-config/db-codfw.php: Slowly pool db1101:3318 in s5 to warm it up - T178359 (duration: 00m 49s)
00:12 reedy@tin: Synchronized docroot/mediawiki/keys/: Add Brian Wolff's key (duration: 00m 45s)
2017-11-23
23:42 reedy@tin: Synchronized wmf-config/: Simplify variable assignment (duration: 00m 47s)
23:25 reedy@tin: Synchronized composer.lock: (no justification provided) (duration: 00m 44s)
23:24 reedy@tin: Synchronized composer.json: (no justification provided) (duration: 00m 45s)
23:23 reedy@tin: Synchronized phpcs.xml: (no justification provided) (duration: 00m 45s)
22:39 demon@tin: Synchronized wmf-config/: style fixes, no-op (duration: 00m 47s)
22:19 demon@tin: Synchronized wmf-config/CommonSettings.php: no-op, moving stuff around (duration: 00m 47s)
21:34 ariel@puppetmaster1001: conftool action : set/pooled=no; selector: name=mw2251.codfw.wmnet
21:15 apergos: rebooted mw2251 after unresponsive on mgmt console and no ping
20:49 demon@tin: Synchronized static/images/project-logos/nowikimedia.png: logo update (duration: 02m 51s)
17:29 akosiaris: set ganeti1006 as drained. ganeti1005 was already set. That will prevent scheduling VMs on those. T181121
17:01 akosiaris: force powercycle on ganeti1006 T181121
16:30 akosiaris: gnt-node failover ganeti1006
15:50 jynus: disable puppet on db2023, db2038 for deployment whle transfer is ongoing
15:23 moritzm: rebooting mwdebug* servers for update to 4.9.51
14:10 marostegui: Stop Mysql on db1101.s5 to clone db1097 - T178359
14:09 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Pool db1053 in s2 as vslow to replace db1021 - T134476 (duration: 00m 45s)
13:13 jynus: sutting down db2023 and db2038 for cloning
12:47 reedy@tin: Synchronized wmf-config/CommonSettings.php: GWToolset migratory config T87928 (duration: 00m 46s)
12:40 moritzm: migrating instances off ganeti2001 / kernel reboot to 4.9.51
12:10 moritzm: migrating instances off ganeti2002 / kernel reboot to 4.9.51
11:52 moritzm: migrating instances off ganeti2003 / kernel reboot to 4.9.51
11:37 kartik@tin: Finished deploy [cxserver/deploy@51b78ce]: T181209 Update cxserver to e8fe3f0 to fix Youdao MT (duration: 03m 01s)
11:35 moritzm: migrating instances off ganeti2004 / kernel reboot to 4.9.51
11:35 moritzm: migrating instances off ganeti200 / kernel reboot to 4.9.51
11:34 kartik@tin: Started deploy [cxserver/deploy@51b78ce]: T181209 Update cxserver to e8fe3f0 to fix Youdao MT
11:23 moritzm: migrating instances off ganeti2005 / kernel reboot to 4.9.51
11:15 moritzm: migrating instances off ganeti2006 / kernel reboot to 4.9.51
11:06 moritzm: migrating instances off ganeti2007 / kernel reboot to 4.9.51
10:57 moritzm: migrating instances off ganeti2008 / kernel reboot to 4.9.51
10:07 moritzm: rebooting sarin for update to 4.9.51
09:16 jynus@tin: Synchronized wmf-config/db-codfw.php: mariadb: Switchover s5 codfw master (duration: 00m 45s)
08:56 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1072 (duration: 00m 44s)
08:44 jynus: stopping and restarting db2052
08:31 moritzm: installing bdb security updates on trusty
08:29 jynus: starting switchover of db2023 to db2052
08:21 marostegui: Stop MySQL on db1072 to MySQL upgrade and kernel upgrade
08:21 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1072 (duration: 00m 45s)
07:28 marostegui: Stop MySQL on db1021 and db1053 to clone db1053
07:28 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1021 - T134476 (duration: 00m 45s)
07:16 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Move db1053 to s2 to replace db1021 as vslow, dump slave - T134476 (duration: 00m 45s)
06:58 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1089 - T180045 (duration: 00m 45s)
06:55 marostegui: Drop index on ores_classification on s1 - T180045
06:53 marostegui: Drop index on ores_classification on s2 - T180045
06:53 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1089 - T180045 (duration: 00m 45s)
06:44 legoktm: legoktm@tin:~$ echo "https://www.mediawiki.org/keys/keys.html " | mwscript purgeList.php
06:32 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Fully pool db1051 and db1063 in vslow service for s5 to warm them up for the s8 split - T177208 (duration: 00m 46s)
05:58 kartik@tin: Finished deploy [cxserver/deploy@5b35ed5]: Update cxserver to e8fe3f0 (duration: 04m 13s)
05:54 kartik@tin: Started deploy [cxserver/deploy@5b35ed5]: Update cxserver to e8fe3f0
02:23 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.8) (duration: 05m 27s)
01:36 demon@tin: Synchronized wmf-config/InitialiseSettings.php: dropping education program from cswiki (duration: 00m 45s)
01:29 demon@tin: Synchronized wmf-config/CommonSettings.php: Set $wgCentralAuthGlobalBlockInterwikiPrefix (duration: 00m 45s)
01:26 demon@tin: Synchronized wmf-config/InitialiseSettings-labs.php: no-op (duration: 00m 44s)
01:24 demon@tin: Synchronized wmf-config/CommonSettings.php: abusefilter stuff (duration: 00m 45s)
01:21 demon@tin: Synchronized wmf-config/InitialiseSettings.php: abusefilter stuff (duration: 00m 45s)
01:19 demon@tin: Synchronized wmf-config/CommonSettings.php: abusefilter stuff (duration: 00m 45s)
01:05 demon@tin: Synchronized docroot/mediawiki/keys/: update formatting, docs, etc (duration: 00m 46s)
2017-11-22
21:15 mutante: running puppet on logstash hosts to apply config change 392897 (add log4j filter)
21:11 demon@tin: Synchronized docroot/noc/conf/index.php: favicon fix (duration: 00m 46s)
20:35 demon@tin: Synchronized dblists/: now with more alphabeticalizedness (duration: 00m 45s)
20:25 demon@tin: Synchronized dblists/Makefile: no-op (duration: 00m 45s)
20:23 demon@tin: Synchronized tests/noc-conf/NOCDblistTest.php: no-op (duration: 00m 45s)
20:19 demon@tin: Synchronized wmf-config/LabsServices.php: no-op (duration: 00m 45s)
20:14 demon@tin: Synchronized wmf-config/InitialiseSettings.php: adding national library of Israel to copy domains (duration: 00m 45s)
20:13 otto@tin: Finished deploy [eventlogging/analytics@57234e7]: no-op: removing now unneeded code that might accidentally serialize userAgent to json string: T179625 (duration: 00m 04s)
20:13 otto@tin: Started deploy [eventlogging/analytics@57234e7]: no-op: removing now unneeded code that might accidentally serialize userAgent to json string: T179625
20:09 demon@tin: Synchronized robots.txt: block a nasty bot đ (duration: 00m 44s)
20:02 demon@tin: Synchronized multiversion/bin/expanddblist: fix param warning (duration: 00m 45s)
20:00 mepps: updated civicrm from a16e566 to a1022cf
19:58 demon@tin: Synchronized wmf-config/InitialiseSettings.php: comments (duration: 00m 45s)
19:56 demon@tin: Synchronized multiversion/MWScript.php: assume aawiki for purgeUrls (duration: 00m 45s)
19:51 demon@tin: Synchronized docroot/: removing old foundation docroot (duration: 00m 46s)
19:49 urandom: starting cassandra cleanups, restbase-200{1,3,5}-a - T179422
19:41 demon@tin: Synchronized w/extract2.php: removing old portal support (duration: 00m 45s)
18:47 demon@tin: Synchronized dblists/closed.dblist: closed transitionteamwiki (duration: 00m 45s)
18:34 demon@tin: Synchronized wmf-config/InitialiseSettings-labs.php: no-op (duration: 00m 45s)
18:32 demon@tin: Synchronized scap/plugins/updatewikiversions.py: minor fix (duration: 00m 45s)
18:30 demon@tin: Pruned MediaWiki: 1.31.0-wmf.7 [keeping static files] (duration: 01m 46s)
18:12 herron: re-enabling puppet agents after puppetdb postgres security updates
18:09 moritzm: installing postgres security updates on nitrogen/puppetdb
18:05 moritzm: installing postgres security updates on nihal/puppetdb
18:02 herron: disabling puppet agents for puppetdb postgres security update
17:30 demon@tin: Synchronized php-1.31.0-wmf.8/extensions/AdvancedSearch/: fixing layout issues in timeless (duration: 00m 46s)
16:54 mepps: updated payments-wiki from 1ca91b1 to 6b3019b
16:45 chasemp: disable puppet accross labtest things
16:34 marostegui: Compress s4 on db1097 - T178359
16:28 herron: starting canary deploy/cutover of codfw scb hosts to codfw puppet 4 masters
16:16 elukey: restart druid broker,coordinator,historical daemons on druid100[123] to pick up new logging settings
15:41 jynus: starting manually pt-heartbeat for s8 on db1071
15:22 herron: beginning cut over of codfw db servers (^db2.*) to codfw puppet 4 masters
14:49 jynus@tin: Synchronized wmf-config/db-codfw.php: mariadb: Setup s8 replica set on codfw (duration: 00m 45s)
14:27 moritzm: installing libxml-libxml-perl security updates
14:21 jynus: starting database topology changes for s8 on codfw T177208
14:11 urandom: bootstrapping cassandra, restbase2004-c.codfw.wmnet - T179422
13:43 apergos: one more round of labstore1006 <-- ms1001 rsync catchup
13:37 moritzm: installing imagemagick security updates
12:39 marostegui: Stop MySQL on db1053 to clone db1097.s4 - T178359
12:38 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1053 - T178359 (duration: 00m 45s)
12:17 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Start adapting the config to move db1097 to s4 and s5 as multi-instance rc slave T178359 (duration: 00m 45s)
12:04 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Fully pool db1101.s7 - T178359 (duration: 00m 45s)
11:51 jynus: starting dropping incorrectly created database on s7 amwikimedia (not to be confused with production wiki s3 amwikimedia)
11:41 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase traffic for db1101.s7 - T178359 (duration: 00m 45s)
11:19 akosiaris: gnt-node evacuate -s -f ganeti1005. T181121
11:02 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase traffic for db1101.s7 - T178359 (duration: 00m 45s)
10:54 akosiaris: gnt-node migrate -f ganeti1005. T181121
10:51 marostegui: Drop index from ores_classification on s5 - T180045
10:44 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Pool db1051 and db1063 in vslow service for s5 to warm them up for the s8 split - T177208 (duration: 00m 45s)
10:23 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Add db1101 to s5 and s7 as recentchanges multi-instance slave - T178359 (duration: 00m 45s)
10:04 moritzm: running "scap pull" on mw1191, it's depooled and marked as "inactive", but health checks are triggering db errors
09:35 bblack: cr[12]-ulsfo - switch static fallback LVS routes from lvs400[12] to lvs400[56]
09:27 bblack: lvs@ulsfo - done switching primaries (host MED config) - lvs400[56] now primary for text/upload traffic
09:11 akosiaris@tin: Finished deploy [parsoid/deploy@b150764]: T180211 (duration: 05m 05s)
09:08 bblack: puppet disabled on lvs400[1256] for switching primaries
09:06 akosiaris@tin: Started deploy [parsoid/deploy@b150764]: T180211
09:04 akosiaris@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wtp2017.codfw.wmnet
09:00 bblack: lvs4005 - reboot to clear experimental stuff
08:16 bblack: backend restart on cp4024 (upload@ulsfo) - mailbox lag
07:56 marostegui: Drop index from ores_classification on s3 - T180045
07:50 marostegui: Drop index from ores_classification on s6 - T180045
07:48 marostegui: Drop index from ores_classification on s7 - T180045
07:29 _joe_: stopping the additional workers for htmlCacheUpdate (commons and ruwiki), adding one additional runner for refreshLinks on ruwiki
06:43 marostegui: Stop MySQL on db1063 and db1051 (which is going to be recloned) - T177208
06:42 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Move db1051 from s1 to s5 - T177208 (duration: 00m 53s)
06:28 ejegg: updated payments-wiki from f871160 to 1ca91b1
03:36 mutante: powercycled mw2251 which had gone down without further comment
02:35 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.8) (duration: 11m 34s)
01:09 hoo: Cleaned out remaining T180934 related log blow up on snapshot1007 (dumpwikidatajson-wikidata-20171120-all-0.log)
00:43 mutante: gerrit restarting service to apply config change
00:40 mutante: gerrit - re-enabling puppet to apply logstash change on cobalt, gerrit restart incoming (T141324 )
00:39 maxsem@tin: Synchronized php-1.31.0-wmf.8/extensions/CentralNotice/: https://gerrit.wikimedia.org/r/#/c/392754/ (duration: 00m 47s)
00:11 maxsem@tin: Synchronized php-1.31.0-wmf.8/extensions/InputBox/: https://gerrit.wikimedia.org/r/#/c/392745/ (duration: 00m 45s)
00:10 maxsem@tin: Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/392576/ (duration: 00m 46s)
00:07 bblack@neodymium: conftool action : set/pooled=yes; selector: name=nescio.wikimedia.org
00:04 bblack@neodymium: conftool action : set/pooled=no; selector: name=nescio.wikimedia.org
00:02 bblack@neodymium: conftool action : set/pooled=yes; selector: name=maerlant.wikimedia.org
00:01 bblack@neodymium: conftool action : set/pooled=no; selector: name=maerlant.wikimedia.org
00:01 bblack@neodymium: conftool action : set/pooled=yes; selector: name=maerlant.wikimedia.org
00:00 bblack@neodymium: conftool action : set/pooled=yes; selector: name=chromium.wikimedia.org
2017-11-21
23:55 bblack@neodymium: conftool action : set/pooled=no; selector: name=chromium.wikimedia.org
23:30 mholloway-shell@tin: Finished deploy [mobileapps/deploy@fc01242]: Update mobileapps to 52d6a83 (duration: 04m 36s)
23:26 mholloway-shell@tin: Started deploy [mobileapps/deploy@fc01242]: Update mobileapps to 52d6a83
22:45 bblack@neodymium: conftool action : set/pooled=yes; selector: name=achernar.wikimedia.org
22:38 bblack@neodymium: conftool action : set/pooled=no; selector: name=achernar.wikimedia.org
22:36 bblack@neodymium: conftool action : set/pooled=yes; selector: name=hydrogen.wikimedia.org
22:29 urandom: Bootstrapping Cassandra, restbase2004-b.codfw.wmnet (T179422 )
22:12 mutante: gerrit - temp disable puppet on cobalt (prod gerrit), test switching gerrit logging to logstash on gerrit2001 - gerrit:392079 gerrit:392083 T141324
22:05 bblack@neodymium: conftool action : set/pooled=no; selector: name=hydrogen.wikimedia.org
22:03 bblack@neodymium: conftool action : set/pooled=yes; selector: name=acamar.wikimedia.org
22:02 bblack: repooling acamar for recdns
21:26 ariel@tin: Finished deploy [dumps/dumps@16f92d6]: take 2: gzip namespace and abstract dumps; remove last configfile existence checks (duration: 00m 02s)
21:26 ariel@tin: Started deploy [dumps/dumps@16f92d6]: take 2: gzip namespace and abstract dumps; remove last configfile existence checks
20:45 bblack: recdns: puppet disabled on all, acamar depooled, careful deploys going on for anycast+recdns stuff
20:42 ayounsi@neodymium: conftool action : set/pooled=no; selector: name=acamar.wikimedia.org
20:26 ariel@tin: Finished deploy [dumps/dumps@16f92d6]: gzip namespace and abstract dumps; remove last configfile existence checks (duration: 00m 16s)
20:26 ariel@tin: Started deploy [dumps/dumps@16f92d6]: gzip namespace and abstract dumps; remove last configfile existence checks
20:22 thcipriani@tin: rebuilt wikiversions.php and synchronized wikiversions files: group2 to 1.31.0-wmf.8
20:00 thcipriani: finishing wmf.8 rollout, starting group2 to wmf.8
19:21 addshore: SWAT done!
19:20 addshore@tin: Synchronized wmf-config/CommonSettings.php: SWAT: Add images.collection.cooperhewitt.org & *.dimu.org to wgCopyUploadsDomains. T180791 T180241 (duration: 00m 48s)
19:11 addshore@tin: Synchronized wmf-config/CommonSettings.php: SWAT: Enable AdvancedSearch on group0 PT2/2 (duration: 00m 49s)
19:10 addshore@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable AdvancedSearch on group0 PT1/2 (duration: 00m 50s)
18:40 mholloway-shell@tin: Finished deploy [mobileapps/deploy@dd41387]: Update mobileapps to 9d1602d (duration: 05m 09s)
18:35 mholloway-shell@tin: Started deploy [mobileapps/deploy@dd41387]: Update mobileapps to 9d1602d
18:28 smalyshev@tin: Finished deploy [wdqs/wdqs@c69c739]: Restore categories vocabulary to V003 (duration: 01m 55s)
18:26 smalyshev@tin: Started deploy [wdqs/wdqs@c69c739]: Restore categories vocabulary to V003
18:25 smalyshev@tin: Finished deploy [wdqs/wdqs@7d951d2]: Restore categories vocabulary to V003 (duration: 00m 11s)
18:25 smalyshev@tin: Started deploy [wdqs/wdqs@7d951d2]: Restore categories vocabulary to V003
16:45 marostegui: Compress s3 on db2085 - T178359
16:35 papaul: powering down wtp2017 for disk replacement
16:19 papaul: updating firmware on db2068
15:31 _joe_: rolling restart of pybal on low-traffic in codfw, eqiad for the new depool thresholds for MW
15:00 ppchelko@tin: Finished deploy [cpjobqueue/deploy@3e948de]: Revert: Temporarily disable deduplication (duration: 00m 26s)
15:00 ppchelko@tin: Started deploy [cpjobqueue/deploy@3e948de]: Revert: Temporarily disable deduplication
14:42 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Restore original weight for db1096 (duration: 00m 48s)
14:39 ppchelko@tin: Finished deploy [cpjobqueue/deploy@aac3201]: Temporarily disable deduplication (duration: 00m 29s)
14:39 ppchelko@tin: Started deploy [cpjobqueue/deploy@aac3201]: Temporarily disable deduplication
14:29 zeljkof: EU SWAT finished
14:25 bblack: cr[12]-ulsfo: allow lvs400[567] as PyBal neighbors for BGP
14:18 zfilipin@tin: Synchronized php-1.31.0-wmf.8/extensions/TimedMediaHandler/MwEmbedModules/EmbedPlayer/resources/mw.EmbedPlayerOgvJs.js: SWAT: Disable wasm, use asm.js codec modules for Safari/Edge (T181022) (duration: 00m 49s)
14:12 jdrewniak@tin: Synchronized portals: SWAT: Bumping portals to master (T128546) (duration: 00m 49s)
14:11 jdrewniak@tin: Synchronized portals/prod/wikipedia.org/assets: SWAT: Bumping portals to master (T128546) (duration: 00m 50s)
14:11 ppchelko@tin: Started restart [cpjobqueue/deploy@5341d94]: Restart to try increased maxSockets
14:11 Pchelolo: restart cpjobqueue to try increasing maxSockets
14:08 bblack: not-abnormally-quick reboot on lvs4005
14:08 mobrovac@tin: Started restart [electron-render/deploy@8dd5f13]: electron stuck - T174916
13:45 ppchelko@tin: Finished deploy [cpjobqueue/deploy@5341d94]: Enable GC metrics reporting (duration: 00m 36s)
13:44 ppchelko@tin: Started deploy [cpjobqueue/deploy@5341d94]: Enable GC metrics reporting
13:39 _joe_: starting 2 manual runners for htmlcacheupdate on commons, 1 for htmlcacheupdate and 1 for refreshlinks on ruwiki, on terbium
13:39 bblack: quick reboot on lvs4005
12:02 kartik@tin: Finished deploy [cxserver/deploy@b87a27a]: Update cxserver to 4301987 (duration: 03m 24s)
11:58 kartik@tin: Started deploy [cxserver/deploy@b87a27a]: Update cxserver to 4301987
11:04 akosiaris: reboot ununpentium for serial tty change
11:03 akosiaris: reboot oresrdb2001 for serial tty change
11:02 akosiaris: reboot netmon1003 for serial tty change
11:01 akosiaris: reboot install2002 for serial tty change
10:57 akosiaris: reboot install1002 for serial tty change
10:35 godog: bootstrap cassandra restbase2004-a - T179422
10:11 ppchelko@tin: Finished deploy [cpjobqueue/deploy@e35aa05]: Set consumer_batch_size to 10 T181007 (duration: 00m 31s)
10:10 ppchelko@tin: Started deploy [cpjobqueue/deploy@e35aa05]: Set consumer_batch_size to 10 T181007
09:39 elukey: upload prometheus-druid-exporter 0.4 to jessie/stretch-wikimedia
09:28 marostegui: Shutdown db2068 for maintenance - T180927
09:19 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase weight for db1096 (duration: 00m 49s)
09:02 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase weight for db1096 (duration: 00m 49s)
08:39 marostegui: Drop index on db1089 enwiki.ores_classification - T180045
08:35 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1096 with low weight - T178359 (duration: 00m 48s)
08:27 marostegui: Reboot db1096 for kernel and MariaDB upgrade
08:19 marostegui: Compress s5 on db1101 - T178359
08:11 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Fully pool db1109 and db1110 - T180700 (duration: 00m 48s)
07:36 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase weight for db1109 and db1110 - T180700 (duration: 00m 49s)
07:03 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Pool db1109 and db1110 in s5 with small weight - T180700 (duration: 00m 48s)
06:50 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1087 - T179106 (duration: 00m 48s)
06:47 marostegui: Remove index wb_terms_language from db1087 - https://phabricator.wikimedia.org/T179106
06:47 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1087 - T179106 (duration: 00m 48s)
06:27 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1096 - T178359 (duration: 00m 48s)
06:27 marostegui: Stop MySQL on db1096 to clone db1101.s5 - T178359
06:19 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Restore db1082 original weight - T177208 (duration: 00m 49s)
02:58 mutante: phabricator back up
02:55 mutante: phab1001 (phabricator prod) reboot for kernel upgrade
02:48 krinkle@tin: Synchronized docroot/noc: clean up (duration: 00m 49s)
02:23 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.7) (duration: 05m 42s)
02:03 smalyshev@tin: Started deploy [wdqs/wdqs@7d951d2]: Restore categories vocabulary to V003
00:19 catrope@tin: Synchronized php-1.31.0-wmf.8/resources/src/mediawiki.rcfilters/ui/mw.rcfilters.ui.ItemMenuOptionWidget.js: T180863 (duration: 00m 49s)
2017-11-20
23:46 mutante: phab2001 - reboot for kernel upgrade
23:35 legoktm@tin: Synchronized wmf-config/InitialiseSettings.php: emergency disable ORES on frwp/ruwp T181006 (duration: 00m 49s)
23:35 awight: purge cache keys for ORES thresholds on frwiki and ruwiki
23:25 awight@tin: Started deploy [ores/deploy@82a13ae]: Rollback ORES (take 3); 181006
23:19 awight: aborted ORES rollback
23:18 awight@tin: Started deploy [ores/deploy@95cd523]: Rollback ORES (take 2); 181006
23:14 smalyshev@tin: Finished deploy [wdqs/wdqs@7d951d2]: Rollback categories vocabulary version due to a bug (duration: 08m 11s)
23:11 awight: purged memcache key 'ruwiki:ORES:threshold_statistics:goodfaith:1â, T181006
23:06 smalyshev@tin: Started deploy [wdqs/wdqs@7d951d2]: Rollback categories vocabulary version due to a bug
22:55 awight@tin: Finished deploy [ores/deploy@5084251]: Rollback ORES; T179711 (duration: 01m 05s)
22:55 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: no wmf.8 for group2. i hate my life
22:55 awight: rolling back ORES to fix T181006
22:54 eileen: update civicrm from 272580a to a16e566
22:54 awight@tin: Started deploy [ores/deploy@5084251]: Rollback ORES; T179711
22:50 Krinkle: MW HTTP 500 spike tracked as https://phabricator.wikimedia.org/T181006
22:49 Krinkle: Sharp rise in HTTP 500 errors as of 22:05 (45 minutes ago)
22:29 aaron@tin: Synchronized php-1.31.0-wmf.7/includes/libs/objectcache/WANObjectCache.php: fix cache key namespace (duration: 00m 49s)
22:27 awight@tin: Finished deploy [ores/deploy@5084251]: Updating ORES to revscoring 2.0.10, T179711 (duration: 49m 54s)
22:26 aaron@tin: Synchronized php-1.31.0-wmf.7/extensions/Collection: cache key name fix (duration: 00m 49s)
22:11 aaron@tin: Synchronized php-1.31.0-wmf.8/includes/libs/objectcache/WANObjectCache.php: 7e74b49 : namespace WAN cache variant keys (duration: 00m 48s)
22:04 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group2 to wmf.8
21:52 bblack: ns2 back on eeden
21:52 aaron@tin: Synchronized php-1.31.0-wmf.8/extensions/Collection: 3baebf4a : cache key name fix (duration: 00m 51s)
21:44 bblack: rebooting eeden
21:37 awight@tin: Started deploy [ores/deploy@5084251]: Updating ORES to revscoring 2.0.10, T179711
21:33 bblack: routing ns2.wikimedia.org to radon for eeden reboot
21:19 bblack: routing ns0.wikimedia.org back to radon post-reboot
21:13 bblack: rebooting radon
21:07 bblack: re-routing ns0.wikimedia.org traffic to baham for radon reboot
20:17 ladsgroup@tin: Synchronized wmf-config/InitialiseSettings.php: Remove overlapping userrights (T101983 ) (duration: 19m 41s)
20:16 bblack: ns1 dns traffic back to normal on baham
20:16 ladsgroup@tin: Synchronized wmf-config/InitialiseSettings.php: Revert "Enable Translate extension in amwikimedia (T180879 )" (duration: 00m 48s)
20:11 ladsgroup@tin: Synchronized wmf-config/InitialiseSettings.php: Enable Translate extension in amwikimedia (T180879 ) (duration: 00m 49s)
20:11 hashar: CI docker jobs were all broken due to a mistake. Should be back now. T177684
20:07 ladsgroup@tin: Synchronized php-1.31.0-wmf.8/resources/src/mediawiki.rcfilters/ui/mw.rcfilters.ui.ItemMenuOptionWidget.js: RCFilters: Only apply excluded label to namespace items (T180863 ) (duration: 00m 49s)
20:06 bblack: rebooting baham
19:58 bblack: re-routing ns1.wikimedia.org traffic to radon for baham reboot
19:52 chasemp: updating phab mail handler
19:51 ladsgroup@tin: Synchronized wmf-config/InitialiseSettings.php: Remove overlapping userrights (T101983 ) (duration: 00m 49s)
19:47 ottomata: restarted coal with fixes for eventcapsule changes in T179625
19:42 ladsgroup@tin: Synchronized wmf-config/throttle.php: Adjust throttle.php for dewiki workshop (T180046 ) (duration: 00m 49s)
19:40 ladsgroup@tin: Synchronized wmf-config/throttle.php: Adjust throttle.php for dewiki workshop (T180046 ) (duration: 00m 48s)
19:36 ladsgroup@tin: Synchronized wmf-config/InitialiseSettings.php: Allow admins to remove users from MP3 uploaders user group (T180002 ) (duration: 00m 49s)
19:31 ladsgroup@tin: Synchronized portals: SWAT: Bumping portals to master (T128546) (duration: 00m 51s)
19:30 ladsgroup@tin: Synchronized portals/prod/wikipedia.org/assets: SWAT: Bumping portals to master (T128546) (duration: 00m 49s)
19:22 ladsgroup@tin: Synchronized wmf-config/InitialiseSettings.php: Switch submit button from 'save' to 'publish' on dewiki (duration: 00m 50s)
18:51 chasemp: disable puppet across cloud things to rollout https://gerrit.wikimedia.org/r/#/c/392168/ slowly
18:25 smalyshev@tin: Finished deploy [wdqs/wdqs@2e39b69]: Blazegraph and GUI update (duration: 01m 42s)
18:23 smalyshev@tin: Started deploy [wdqs/wdqs@2e39b69]: Blazegraph and GUI update
17:58 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1082 with low weight - T177208 (duration: 00m 49s)
17:19 bd808: Sending Toolforge survey emails from silver for T177126
17:00 moritzm: uploaded git 2.11-3+deb9u2+bpo8+wmf1 for component/git to apt.wikimedia.org/jessie-wikimedia
16:43 marostegui: Reboot db1082 for kernel upgrade and MariaDB upgrade to 10.0.33
16:39 ppchelko@tin: Finished deploy [cpjobqueue/deploy@174420f]: Revert: Temporary set consumer_batch_size to 50, forgot -f, checkout prev rev (duration: 03m 33s)
16:39 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Add db1109 and db1110 to the config depooled - T180700 (duration: 00m 48s)
16:38 marostegui@tin: Synchronized wmf-config/db-codfw.php: Add db1109 and db1110 to the config depooled - T180700 (duration: 00m 48s)
16:36 ppchelko@tin: Started deploy [cpjobqueue/deploy@174420f]: Revert: Temporary set consumer_batch_size to 50, forgot -f, checkout prev rev
16:32 ppchelko@tin: Finished deploy [cpjobqueue/deploy@f0610d3]: Revert: Temporary set consumer_batch_size to 50, forgot -f (duration: 00m 28s)
16:32 ppchelko@tin: Started deploy [cpjobqueue/deploy@f0610d3]: Revert: Temporary set consumer_batch_size to 50, forgot -f
16:24 marostegui: Add db1109 and db1110 to tendril - T180700
16:06 ppchelko@tin: Finished deploy [cpjobqueue/deploy@f0610d3]: Revert: Temporary set consumer_batch_size to 50 (duration: 00m 17s)
16:06 ppchelko@tin: Started deploy [cpjobqueue/deploy@f0610d3]: Revert: Temporary set consumer_batch_size to 50
15:59 urandom: Use lz4 compression instead of deflate (T180804 )
15:45 jynus: shutting down db2068 for maintenance after depool T180927
15:42 jynus@tin: Synchronized wmf-config/db-codfw.php: Depool db2068 (duration: 00m 49s)
15:34 chasemp: disable puppet for a merge across cloud things
15:31 ppchelko@tin: Finished deploy [cpjobqueue/deploy@f0610d3]: Temporary set consumer_batch_size to 50 (duration: 00m 30s)
15:30 ppchelko@tin: Started deploy [cpjobqueue/deploy@f0610d3]: Temporary set consumer_batch_size to 50
15:03 herron: pointing codfw mw servers at codfw puppet 4 masters via puppetmaster2001
14:39 zeljkof: EU SWAT finished
14:34 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable Single edit tab in Catalan Wikipedia (T180660) (duration: 00m 49s)
14:25 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable wgNamespacesWithSubpages for hiwikiversity (T180913) (duration: 00m 48s)
14:22 jynus: enable semi-sync replication on s5
14:14 dcausse@tin: Synchronized wmf-config/CirrusSearch-common.php: T180805 : Revert [cirrus] disable token count router (duration: 00m 49s)
14:05 elukey: upload prometheus-druid-exporter 0.3 to jessie-wikimedia
13:53 Pchelolo: disable puppet on scb100x and stop cpjobqueue to accumulate some backlog
13:39 ppchelko@tin: Finished deploy [cpjobqueue/deploy@174420f]: Optimise committed offset calculation (duration: 00m 28s)
13:39 ppchelko@tin: Started deploy [cpjobqueue/deploy@174420f]: Optimise committed offset calculation
13:30 elukey: upload prometheus-druid-exporter 0.3 to stretch-wikimedia
12:40 marostegui: Optimize db1063 wikidatawiki.wb_terms
12:17 moritzm: uploaded hhvm 3.18.5+dfsg-1+wmf1+icu57 to apt.wikimedia.org (jessie-wikimedia/component/icu57) (HHVM build linked against a co-installable backport of icu57)
11:46 moritzm: rebooting tungsten for update to 4.9.51
11:41 moritzm: rebooting etherpad1001 (etherpad.wikimedia.org) for update to 4.9.51
11:36 moritzm: rebooting pybal-test for update to 4.9.51
11:27 moritzm: rebooting restbase-test cluster for update to 4.9.51
11:23 ppchelko@tin: Finished deploy [cpjobqueue/deploy@bdcef23]: Various performance improvements in committing (duration: 00m 30s)
11:22 ppchelko@tin: Started deploy [cpjobqueue/deploy@bdcef23]: Various performance improvements in committing
11:13 moritzm: rebooting ruthenium for update to 4.9.51
10:57 godog: reimage restbase2004 - T179422
10:51 moritzm: rebooting cerium/praseodymium/xenon for update to 4.9.51
10:42 moritzm: rebooting mwlog1001 for update to 4.9.51
10:39 moritzm: rebooting mwlog2001 for update to 4.9.51
10:23 hoo: Manually re-started the Wikidata entity JSON dump on snapshot1007 (T180934 )
10:19 dcausse: elastic/cirrus: reindexing english group0 and group1 wikis: T179945
10:08 hashar: contint1001: sudo systemctl start jenkins
10:05 moritzm: rebooting contint1001 for update to 4.9.51
09:22 moritzm: rebooting hafnium for update to 4.9.51
08:46 marostegui: Run mydumper for db1047.staging - T156844
07:58 moritzm: installing procmail security updates
07:39 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1097 - T178359 (duration: 00m 49s)
06:41 marostegui: Stop MySQL on db1082 to clone db1109 and db1110 - T180700
06:36 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1082 - T177208 (duration: 00m 48s)
06:25 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Restore original weights for db1100 and db1071 - T180917 (duration: 00m 49s)
06:15 marostegui: Reboot db2068 - T180927
02:28 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.7) (duration: 06m 26s)
2017-11-19
23:20 Jamesofur: removed 2FA for Ask21 T180889
20:28 krinkle@tin: Synchronized docroot/noc/index.html: noc: Link to Grafana (duration: 00m 49s)
19:50 jynus@tin: Synchronized wmf-config/db-eqiad.php: Depool db1100 (duration: 00m 49s)
07:09 mobrovac@tin: Started restart [zotero/translators@a0c41c3]: Zotero eating up memory
07:01 mobrovac@tin: Started restart [electron-render/deploy@8dd5f13]: electron stuck - T174916
2017-11-18
20:06 gehel: upgrade of elasticsearch eqiad complete - T178411
18:03 reedy@tin: Synchronized multiversion/vendor/: bump (duration: 01m 14s)
15:53 marostegui: Compress s5 on db2085 - T178359
14:48 marostegui: Compress s3 on db2092 - T178359
00:30 urandom: Bootstrapping restbase2006-b (T179422 )
2017-11-17
20:35 mutante: netmon1002 - rsync smokeping data back from local backup to show measurements made from eqiad as requested on T180812
20:05 mutante: running puppet on all cache misc to switch smokeping web to eqiad
20:02 mutante: T180812 copying smokeping data from 2001 to 1002 - netmon1002: /usr/bin/rsync -avp rsync://netmon2001.wikimedia.org/var-lib-smokeping /var/lib/smokeping/ | switching backend from codfw to eqiad
19:37 mutante: T180812 - @netmon1002:# rsync -avp /var/lib/smokeping/ /root/backup/netmon1002/201711717/var/lib/smokeping/@netmon2001:/# rsync -avp /var/lib/smokeping/ /root/backup/netmon2001/201711717/var/lib/smokeping/
19:36 mutante: @netmon1002:/var/lib/smokeping# rsync -avp /var/lib/smokeping/ /root/backup/netmon1002/201711717/var/lib/smokeping/
18:37 anomie@tin: Synchronized wmf-config/InitialiseSettings.php: Setting wgCommentTableSchemaMigrationStage = MIGRATION_WRITE_BOTH on Beta Cluster, no prod change (duration: 00m 48s)
18:36 anomie@tin: Synchronized wmf-config/InitialiseSettings-labs.php: Setting wgCommentTableSchemaMigrationStage = MIGRATION_WRITE_BOTH on Beta Cluster, no prod change (duration: 00m 49s)
17:43 demon@tin: Synchronized php-1.31.0-wmf.8/extensions/Wikidata/extensions/Wikibase/client/WikibaseClient.php: fix client dependencies (duration: 00m 49s)
17:41 demon@tin: Synchronized php-1.31.0-wmf.8/extensions/Wikibase/client/WikibaseClient.php: fix client dependencies (duration: 00m 50s)
17:21 bblack: enabling new normalization code for all upload@esams (done)
17:19 bblack: enabling new normalization code for all upload@eqiad
17:14 marostegui: Revert schema change on dbstore1001 - T180714
17:12 marostegui: Move dbstore1001 under db1070 - T180714
17:12 bblack: enabling new normalization code for all upload@codfw
16:52 bblack: enabling new normalization code for all upload@ulsfo
16:24 bblack: disabling puppet on all cp* (testing encoding patch)
16:19 moritzm: uploaded boost 1.55.0+dfsg-3+wmf2+icu57 to apt.wikimedia.org for jessie-wikimedia/component/icu57 (needed for HHVM build linked against ICU 57)
16:04 dcausse@tin: Synchronized wmf-config/CirrusSearch-common.php: T180795 [cirrus] disable token count router (duration: 00m 49s)
15:43 chasemp: labservices1001:~# mv /var/zones/tools.eqiad.wmflabs /home/rush T180797
15:26 urandom: Starting restbase2006-c w/ -Dcassandra.replace_address=10.192.48.51 (T179422 )
14:06 jynus: deploy master events to db1070
14:04 chasemp: labstore1003 service nfs-kernel-server restart && service rsync start
13:55 moritzm: uploaded boost 1.55.0+dfsg-3+wmf1+icu57 to apt.wikimedia.org for jessie-wikimedia/component/icu57 (needed for HHVM build linked against ICU 57)
12:15 moritzm: installing openssl updates on poolcounters
12:07 moritzm: installing openssl updates on graphite* hosts
11:46 moritzm: installing openssl updates on etcd* hosts
11:43 moritzm: installing openssl updates on dbproxy hosts
11:33 moritzm: installing openssl updates on puppetmasters
11:01 akosiaris: create webperf1001, webperf2001 in ganeti T179036
10:50 akosiaris@tin: Synchronized wmf-config/db-eqiad.php: (no justification provided) (duration: 00m 49s)
10:49 akosiaris: sync wmf-config/db-eqiad.php for T180724
10:48 akosiaris@tin: scap aborted: (no justification provided) (duration: 00m 02s)
10:48 akosiaris@tin: Started scap: (no justification provided)
10:47 akosiaris: pool mw2251 T180724
10:47 akosiaris@puppetmaster1001: conftool action : set/pooled=yes; selector: name=mw2251.codfw.wmnet
10:01 moritzm: rebooting darmstadtium (docker registry) for update to 4.9.51
09:59 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Restore original traffic for the s5 eqiad hosts that were out for hours due to schema change revert after s5 master crash (duration: 00m 48s)
09:54 moritzm: rebooting labnodepool* for update to 4.9.51
09:38 ppchelko@tin: Finished deploy [cpjobqueue/deploy@2141162]: Bump concurrency even more (duration: 00m 29s)
09:38 ppchelko@tin: Started deploy [cpjobqueue/deploy@2141162]: Bump concurrency even more
09:20 godog: start restbase2006-c instead, restbase2006-b failed and -c shows as "down" - T179422
09:13 jynus: changing master of db1071 (63 -> 70)
09:10 gehel: cleanup leftover titlesuggest indices on elasticsearch eqiad (jawiki, frwiki, ptwiki)
09:05 ppchelko@tin: Finished deploy [cpjobqueue/deploy@cbd25d3]: Bump overall concurrency to get rid of RecordLintJob backlog (duration: 00m 35s)
09:04 ppchelko@tin: Started deploy [cpjobqueue/deploy@cbd25d3]: Bump overall concurrency to get rid of RecordLintJob backlog
08:54 godog: bootstrap restbase2006-b - T179422
08:30 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase traffic for the s5 eqiad hosts that were out for hours due to schema change revert after s5 master crash (duration: 00m 50s)
08:04 elukey: reboot stat100[456] for kernel updates
07:51 marostegui: Revert schema change on dbstore1002 - T180714
07:50 marostegui: Move dbstore1002 under db1070 - T180714
07:33 marostegui: Enable GTID on all the eqiad up-to-date hosts, only pending db1071 - T180714
07:17 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool s5 eqiad hosts after reverting schema change - T180714 (duration: 00m 50s)
06:34 marostegui: Revert schema changes on s5 codfw master with replication enabled, lag will be generated on codfw s5 - T180714
04:43 krinkle@tin: Synchronized wmf-config/CommonSettings-labs.php: no-op for labs (duration: 00m 51s)
01:32 ejegg: re-enabled donation queue consumer
01:28 ejegg: updated CiviCRM from 8454e06 to 272580a
01:07 ejegg: updated CiviCRM from 0b8ceea to 8454e06
food: disabled donations queue consumer for thank you subject update
00:41 krinkle@tin: Synchronized wmf-config/PhpAutoPrepend.php: I60cce0 (duration: 00m 48s)
00:40 krinkle@tin: Synchronized wmf-config/StartProfiler.php: I60cce0 (duration: 00m 49s)
00:37 krinkle@tin: Synchronized wmf-config/profiler.php: I60cce0 (duration: 00m 48s)
00:23 urandom: Decommissioning Cassandra, restbase1014-c.eqiad.wmnet (T179422 )
00:22 maxsem@tin: Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/390131/3 (duration: 00m 49s)
2017-11-16
23:50 mutante: wezen - systemctl restart rsyslog
21:52 mepps: updated civicrm from 22f6532 to 0b8ceea
21:05 mepps: updated payments-wiki from 210cb37 to f871160
20:51 apergos: one more catchup rsync from ms1001 to labstore1006 kicking off
20:15 jynus@tin: Synchronized wmf-config/db-eqiad.php: Pool only 'good' servers - third try (duration: 00m 49s)
20:11 jynus@tin: Synchronized wmf-config/db-eqiad.php: Pool only 'good' servers - second try (duration: 00m 49s)
20:10 demon@tin: Unlocked for deployment [operations/mediawiki-config]: No deploys, recovering from downtime (more narrow locking) (duration: 06m 49s)
20:03 demon@tin: Locking from deployment [operations/mediawiki-config]: No deploys, recovering from downtime (more narrow locking) (planned duration: 360m 00s)
20:02 twentyafterfour: deploy a9613b4 to hotfix T180706
19:56 demon@tin: Unlocked for deployment [ALL REPOSITORIES]: No deploys, recovering from downtime (duration: 50m 28s)
19:54 ayounsi@tin: Finished deploy [netbox/deploy@19f4f65]: (no justification provided) (duration: 00m 04s)
19:54 ayounsi@tin: Started deploy [netbox/deploy@19f4f65]: (no justification provided)
19:49 akosiaris: schedule extra downtime for s5 slaves
19:46 urandom: Bootstapping Cassandra restbase2006-a (T179422 )
19:13 jynus: reset slave all on db1070 @ db1063-bin.001382:234464548
19:05 demon@tin: Locking from deployment [ALL REPOSITORIES]: No deploys, recovering from downtime (planned duration: 360m 00s)
19:05 demon@tin: Unlocked for deployment [ALL REPOSITORIES]: Dealing with outage, no deploys for now (duration: 01m 22s)
19:03 demon@tin: Locking from deployment [ALL REPOSITORIES]: Dealing with outage, no deploys for now (planned duration: 60m 00s)
18:48 marostegui: dewikipedia and wikidata currently back to writable
18:44 jynus@tin: Synchronized wmf-config/db-eqiad.php: Pool only 'good' servers (duration: 00m 48s)
18:41 mutante: dewikipedia and wikidata currently back to readonly-mode while wikidata is being worked on
18:41 marostegui: Set s5 master read_only
18:25 akosiaris: set mw2251 to inactive. T180724
18:24 akosiaris@puppetmaster1001: conftool action : set/pooled=inactive; selector: name=mw2251.codfw.wmnet
17:57 gehel: silencing wdqs
17:55 jynus@tin: Synchronized wmf-config/db-eqiad.php: Failover db1063 to db1070 (duration: 00m 46s)
17:44 XioNoX: merging netbox CR
17:41 bblack: disable port ge-5/0/39 on asw-c-eqiad (db1063)
17:40 urandom: Decommissioning Cassandra, restbase1014-b.eqiad.wmnet (T179422 )
17:36 jynus: stopping slave on db1070
17:28 urandom: Converting 'enwiki parsoid' to size-tiered compaction (T179422 )
17:19 demon@tin: Finished scap: consistency (duration: 25m 12s)
17:18 bblack: Temporary GeoDNS routing changes (eqsin traffic simulation using ulsfo) - https://gerrit.wikimedia.org/r/#/c/391357/ - expecting ~24h, West Asia latencies will probably increase, spike in cache misses, etc...
17:18 urandom: Converting 'wikipedia parsoid' to size-tiered compaction (T179422 )
17:16 godog: reimage restbase2006 - T179422
17:15 urandom: Converting 'commons mobile' to size-tiered compaction (T179422 )
17:05 urandom: Converting 'others mobile' to size-tiered compaction (T179422 )
16:54 demon@tin: Started scap: consistency
16:54 godog: reimage restbase2006 - T179422
16:48 jynus: stop and restart db1071 for upgrade and reconfiguration
16:48 herron: beginning gradual cutover of codfw mw systems to puppet 4 master puppetmaster2001
16:48 godog: upgrade hpsa firmware to 6.06 on restbase2006 - T141756
16:43 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Restore db1034 and db1079 original weight (duration: 00m 48s)
16:29 jynus@tin: Synchronized wmf-config/db-eqiad.php: mariadb: Depool db1071, pool db1104 as api (duration: 00m 49s)
16:21 moritzm: restarting apache on labs puppet masters to pick up openssl updates
16:06 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1034 weight (duration: 00m 48s)
15:47 moritzm: rebooting ores1* for update to 4.9.51
15:41 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1034 weight (duration: 00m 49s)
15:33 moritzm: rebooting seaborgium (slapd) for update to 4.9.51
15:22 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2038 - T178359 (duration: 00m 49s)
15:16 milimetric@tin: Finished deploy [analytics/refinery@4ef15d3]: Mainly deploying the interlanguage navigation dataset (duration: 14m 29s)
15:13 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1034 weight (duration: 00m 48s)
15:10 godog: upgrade prometheus-redis-exporter to 0.13-1 - T148637
15:06 moritzm: installing postgres security updates on labsdb1006/1007
15:02 milimetric@tin: Started deploy [analytics/refinery@4ef15d3]: Mainly deploying the interlanguage navigation dataset
15:01 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1051 - T177208 (duration: 00m 48s)
15:00 herron: beginning cut over of codfw canary appservers to puppet 4 master puppetmaster2001
14:55 moritzm: rebooting serpens (slapd) for update to 4.9.51
14:50 elukey: updating puppet compiler's facts (following https://wikitech.wikimedia.org/w/index.php?title=Nova_Resource:Puppet3-diffs#FAQ )
14:41 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1034 weight - T178359 (duration: 00m 49s)
14:26 dcausse: EU swat done
14:23 moritzm: rebooting kubetcd* to 4.9.51
14:18 dcausse@tin: Synchronized wmf-config/CirrusSearch-common.php: T177913 : [cirrus] Add overridden iw prefix for svwiki (2/2) (duration: 00m 48s)
14:16 dcausse@tin: Synchronized wmf-config/InitialiseSettings.php: T177913 : [cirrus] Add overridden iw prefix for svwiki (1/2) (duration: 00m 50s)
14:02 gehel: starting upgrade of elasticsearch eqiad - T178411
13:46 moritzm: rebooting bohrium (piwik host) for update to 4.9.51
13:33 moritzm: installing openssl updates on es* hosts
13:24 moritzm: rebooting dubnium/pollux (openldap corp mirror) for update to 4.9.51
13:12 moritzm: installing openssl updates on pc* hosts
13:07 elukey: restart aqs on aqs100[5-9] to apply localQuorum (https://gerrit.wikimedia.org/r/391765 ) - T164348
13:01 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1034 weight - T178359 (duration: 00m 49s)
12:54 moritzm: rebooting radium (tor relay) for update to 4.9.51
12:34 moritzm: installing openssl updates on memcached/redis clusters
12:27 hoo: Updated operations/dumps/dcat (ea4e75..7734e04) on snapshot1007
12:09 moritzm: rebooting dbmonitor* hosts for update to 4.9.51
12:03 addshore@tin: Synchronized php-1.31.0-wmf.8/extensions/Wikidata/extensions/Constraints: T180665 Manually applied: Fix SparqlHelper::getCacheMaxAge() (duration: 00m 52s)
11:58 addshore@tin: Synchronized php-1.31.0-wmf.8/extensions/WikibaseQualityConstraints: T180665 Fix SparqlHelper::getCacheMaxAge() (duration: 00m 53s)
11:42 moritzm: installing openssl updates on conf* clusters
11:37 addshore@tin: Synchronized wmf-config/Wikibase-production.php: testwikidata only, T180665 , WBQC configuration for testwikidatawiki (duration: 00m 49s)
11:32 addshore: addshore@terbium:~$ mwscript extensions/WikibaseQualityConstraints/maintenance/ImportConstraintStatements.php --wiki=testwikidatawiki > importConstraintStatements.log
11:21 godog: upgrade prometheus to 1.8.1+ds+k8s-1 in ulsfo/esams/eqiad - T177395
11:09 moritzm: rebooting debug proxies (hassium/hassaleh) for update to 4.9.51
11:04 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1034 weight - T178359 (duration: 00m 48s)
11:01 jynus: shutting down labsdb1010 to clone to labsdb1009
10:30 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1034 with low weight - T178359 (duration: 00m 48s)
10:26 addshore: Kill the build deploy slot done!
10:24 addshore@tin: Synchronized php-1.31.0-wmf.8/extensions/Wikibase: T180634 Bring Wikibase up to date with .8 branch of Wikidata build extension (duration: 01m 46s)
10:15 addshore@tin: Synchronized php-1.31.0-wmf.8/extensions/WikibaseQualityConstraints: T180634 Bring WikibaseQualityConstraints up to date with .8 branch of Wikidata build extension (duration: 00m 53s)
09:59 moritzm: rebooting prometheus servers in eqiad for update to 4.9.51
09:44 elukey: restart aqs on aqs1004 to apply localQuorum (https://gerrit.wikimedia.org/r/391765 ) - T164348
09:38 moritzm: rebooting prometheus servers in codfw for update to 4.9.51
09:30 moritzm: uploaded icu57.1-6+wmf2 to jessie-wikimedia/component/icu57
09:23 godog: bootstrap restbase2002-c - T179422
09:19 godog: upgrade grafana to 4.6.1 on https://grafana.wikimedia.org/ - T180428
08:19 marostegui: Deploy schema change on db1092 - T174569
08:17 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1092 - T174569 (duration: 00m 49s)
07:54 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1106 (duration: 00m 49s)
07:40 marostegui: Stop MySQL on db1034 to copy its content to db1101.s7 - T178359
07:39 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1034 - T178359 (duration: 00m 49s)
07:17 ppchelko@tin: Finished deploy [eventlogging/eventbus@872cfb3]: Revert gerrit 302372 due to AssertionError T180017 (duration: 00m 14s)
07:17 ppchelko@tin: Started deploy [eventlogging/eventbus@872cfb3]: Revert gerrit 302372 due to AssertionError T180017
06:54 marostegui: Deploy alter table on db1096 - T174569
06:51 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1096 - T174569 (duration: 00m 48s)
06:30 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1099 and db1071 - T174569 (duration: 00m 49s)
06:15 smalyshev@tin: Finished deploy [wdqs/wdqs@b44cf27]: data reload/T176593 (duration: 00m 28s)
06:14 smalyshev@tin: Started deploy [wdqs/wdqs@b44cf27]: data reload/T176593
02:26 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.7) (duration: 07m 54s)
02:09 twentyafterfour: phabricator database migrations complete, service is back online
01:28 twentyafterfour: Phabricator will be offline for a couple of minutes while I apply database migrations.
01:22 twentyafterfour: updating phabricator (belatedly)
01:05 demon@tin: Synchronized scap/plugins/prep.py: no-op, co-master sync (duration: 00m 55s)
00:59 tzatziki: Removing 2FA from AuburnPilot (T180654 )
00:59 tzatziki: Removing 2FA from Twotwo2019 (T180438 )
00:42 eileen: update civicrm from b99a9cf to 22f6532
2017-11-15
22:48 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: wikidatawiki back to wmf.7
22:21 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group1 to wmf.8
22:11 urandom: Setting keyspaces erroneously configured for leveled compaction, to use size-tiered (T180568 )
22:05 madhuvishy: Kicking off re-enabling puppet and puppet runs across Cloud VPS instances
21:33 demon@tin: Synchronized php: symlink swap (duration: 00m 49s)
21:21 madhuvishy: Disabling puppet across cloud VPS through cumin on labpuppetmaster1001
21:01 ottomata: restarting kafka-jumbo brokers to update to Kafka 0.11.0.1
20:52 urandom: Restarting restbase2002-a.codfw.wmnet (T180568 )
20:40 mutante: restarting gerrit to enable 'large file support'-plugin gerrit:391635
20:38 addshore@tin: Finished scap: Update extensions/Wikidata to new wmf/1.31.0-wmf.8 branch (again) T180539 (duration: 21m 48s)
20:17 addshore@tin: Started scap: Update extensions/Wikidata to new wmf/1.31.0-wmf.8 branch (again) T180539
20:11 otto@tin: Finished deploy [eventlogging/eventbus@872cfb3]: deploying kafka-futures change to kafka1001 only, will apply async with https://gerrit.wikimedia.org/r/#/c/391634/ (duration: 00m 20s)
20:11 otto@tin: Started deploy [eventlogging/eventbus@872cfb3]: deploying kafka-futures change to kafka1001 only, will apply async with https://gerrit.wikimedia.org/r/#/c/391634/
19:53 catrope@tin: Synchronized wmf-config/InitialiseSettings.php: Add BP and WP aliases for project namespace on mwlwiki (T180052 ) (duration: 00m 48s)
19:48 catrope@tin: Synchronized wmf-config/InitialiseSettings.php: Enable SandboxLink on mwlwiki (T180052 ) (duration: 00m 48s)
19:41 demon@tin: Finished deploy [gerrit/gerrit@bce982f]: adding lfs plugin @ 2.13.9 (duration: 00m 08s)
19:41 demon@tin: Started deploy [gerrit/gerrit@bce982f]: adding lfs plugin @ 2.13.9
19:40 catrope@tin: Synchronized wmf-config/InitialiseSettings.php: Disable EventLogging for Popups (T178500 ) (duration: 00m 49s)
19:37 gehel: upgrade of elasticsearch codfw completed, cluster still recovering - T178411
19:34 catrope@tin: Synchronized wmf-config/InitialiseSettings.php: Enable Minerva download icon on all wikis (T179914 ) (duration: 00m 48s)
19:30 chasemp: labstore1004:~# service nfs-kernel-server restart
19:19 catrope@tin: Synchronized php-1.31.0-wmf.8/resources/src/mediawiki.rcfilters/mw.rcfilters.UriProcessor.js: T180577 (duration: 00m 49s)
19:16 catrope@tin: Synchronized php-1.31.0-wmf.7/skins/MinervaNeue/resources/skins.minerva.scripts/init.js: Limit download button to Google Chrome (T179529 , T179914 ) (duration: 00m 49s)
19:14 catrope@tin: Synchronized wmf-config/flaggedrevs.php: Enable RCFilters on all remaining wikis (T177445 ) (duration: 00m 49s)
19:01 urandom: Restarting Cassandra instances on restbase2001.codfw.wmnet (T180568 )
18:38 Jamesofur: removed 2FA from User:Einsbor after verification for votewiki, stewardwiki and SUL
18:20 moritzm: rebooting auth* servers for update to 4.9.51
18:20 moritzm: rebooting auth
17:57 demon@tin: Synchronized docroot/search.wikimedia.org/index.php: removing support for non-wikipedias (duration: 00m 49s)
16:03 addshore@tin: Finished scap: Revert partial scap: Update extensions/Wikidata to new wmf/1.31.0-wmf.8 branch, second try (T180539 ) (duration: 23m 48s)
15:56 oblivian@tin: Finished deploy [docker-pkg/deploy@9b319d2]: Adding do extension to jinja2, needed for contint images (duration: 00m 18s)
15:56 oblivian@tin: Started deploy [docker-pkg/deploy@9b319d2]: Adding do extension to jinja2, needed for contint images
15:55 ema: restart pybal on lvs[12]00[36] for config change https://gerrit.wikimedia.org/r/#/c/389964/
15:43 jynus: shutting down labsdb1009 for maintenance T179244
15:40 addshore@tin: Started scap: Revert partial scap: Update extensions/Wikidata to new wmf/1.31.0-wmf.8 branch, second try (T180539 )
15:29 moritzm: installing perl security updates on trusty (Debian hosts fixed two months ago)
15:14 ladsgroup@tin: scap aborted: Update extensions/Wikidata to new wmf/1.31.0-wmf.8 branch, second try (T180539 ) (duration: 03m 48s)
15:13 moritzm: removing unused kernels from prometheus*
15:11 ladsgroup@tin: Started scap: Update extensions/Wikidata to new wmf/1.31.0-wmf.8 branch, second try (T180539 )
15:05 moritzm: rebooting kubestagetcd* for update to 4.9.51
14:57 ladsgroup@tin: scap aborted: Update extensions/Wikidata to new wmf/1.31.0-wmf.8 branch (T180539 ) (duration: 00m 31s)
14:57 ladsgroup@tin: Started scap: Update extensions/Wikidata to new wmf/1.31.0-wmf.8 branch (T180539 )
14:42 Amir1: deployed ORES change for wikidata (gerrit:391197, phab:T180450 )
14:41 ladsgroup@tin: Synchronized wmf-config/InitialiseSettings.php: (no justification provided) (duration: 00m 49s)
14:37 mobrovac: restbase creating Cassandra 3 revision tables on restbase1009 - T179421
14:30 zfilipin@tin: Synchronized wmf-config/: SWAT: Remove wgContentTranslationEnableSuggestions (duration: 00m 51s)
14:27 ema: cache_upload: upgrade varnish to 4.1.8-1wm2
14:22 zfilipin@tin: Synchronized wmf-config/CommonSettings-labs.php: SWAT: Beta: Explicitly set cookieDomain for ContentTranslationSiteTemplates (T149879) (duration: 00m 49s)
14:18 ema: repool cp4024 T174891
14:17 mutante: wtp2017 - systemctl start ferm (ferm wasnt running due to failed DNS lookup for prometheus2003 sometime in the past)
14:12 zfilipin@tin: Synchronized php-1.31.0-wmf.8/extensions/EventBus/EventBus.php: SWAT: Increase request timeout to match kafka produce timeout (T180017) (duration: 00m 50s)
14:11 godog: upgrade hpsa firmware to 6.06 on restbase2004 - T180562 T141756
14:10 ema: upgrade varnish to 4.1.8-1wm2 on cp4024 (cache_upload, depooled)
14:04 jynus: reload haproxy on dbproxy1010
14:00 moritzm: installing openssl updates on kafka and hadoop clusters
13:34 ema: cache_text: upgrade varnish to 4.1.8-1wm2
13:31 ppchelko@tin: Started restart [changeprop/deploy@065a06e]: Restart to rebalance all rules T179684
13:14 moritzm: rebooting video scalers in eqiad for update to 4.9.51 (and to pick up openssl update)
13:07 ema: upgrade varnish to 4.1.8-1wm2 on cp3030 (cache_text)
12:41 elukey: re-enable eventlogging after maintenance
12:24 jynus: deploying dns change for m4-master
12:19 ema: cache_misc: upgrade varnish to 5.1.3-1wm3
12:09 elukey: executed sysctl -w net.netfilter.nf_conntrack_tcp_timeout_time_wait=65 on all jobrunners
12:01 ema: upgrade varnish to 5.1.3-1wm3 on cp3007 (cache_misc)
11:58 ema: varnish 4.1.8-1wm2 uploaded to apt.w.o (main)
11:50 ema: varnish 5.1.3-1wm3 uploaded to apt.w.o (experimental)
11:45 jynus: restart haproxy on dbproxy1009
11:32 marostegui: Stop MySQL on db1046
10:58 moritzm: rebooting job runners in eqiad for update to 4.9.51 (and to pick up openssl update)
10:29 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1106, optimizing wikidatawiki.wb_terms (duration: 00m 49s)
10:09 ppchelko@tin: Finished deploy [trending-edits/deploy@a0e1fe3]: Update node-rdkafka to v1.x attempt 4 T179786 (duration: 04m 51s)
10:04 ppchelko@tin: Started deploy [trending-edits/deploy@a0e1fe3]: Update node-rdkafka to v1.x attempt 4 T179786
10:04 ppchelko@tin: Finished deploy [trending-edits/deploy@a0e1fe3]: Update node-rdkafka to v1.x attempt 3, force T179786 (duration: 04m 44s)
09:59 ppchelko@tin: Started deploy [trending-edits/deploy@a0e1fe3]: Update node-rdkafka to v1.x attempt 3, force T179786
09:59 ppchelko@tin: Finished deploy [trending-edits/deploy@a0e1fe3]: Update node-rdkafka to v1.x attempt 2 T179786 (duration: 03m 32s)
09:58 ppchelko@tin: (no justification provided)
09:55 ppchelko@tin: Started deploy [trending-edits/deploy@a0e1fe3]: Update node-rdkafka to v1.x attempt 2 T179786
09:55 ppchelko@tin: Finished deploy [trending-edits/deploy@a0e1fe3]: Update node-rdkafka to v1.x T179786 (duration: 02m 44s)
09:53 godog: reboot restbase2004 - T180562
09:52 ppchelko@tin: Started deploy [trending-edits/deploy@a0e1fe3]: Update node-rdkafka to v1.x T179786
09:52 mobrovac@tin: Started restart [restbase/deploy@c76a665]: Pick up the new seeds definition - T179422
09:52 ppchelko@tin: Started deploy [trending-edits/deploy@a0e1fe3]: Update node-rdkafka to v1.x T179786
09:32 godog: restart cassandra on restbase2002-b - T180568
09:30 moritzm: updating openssl on database hosts
09:19 godog: restart cassandra on restbase2001-c - T180568
09:15 marostegui: Stop mysql on db1046 to transfer its content to db1107 - T177405
09:08 elukey: stop eventlogging on eventlog1001, eventlogging replication on db1108/db1047/dbstore1002 as preparation steps to migrate the log db from db1046 to db1107
09:02 jynus: rebooting labsdb1010 for kernel upgrade
08:51 elukey: reboot thorium (hosting all analytics websites) for kernel updates
08:34 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1101 - going to convert it to mult-instance T178359 (duration: 00m 48s)
07:49 marostegui: Deploy schema change on db1071 and db1099 (s5) - T174569
07:44 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1099 and db1071 - T174569 (duration: 00m 49s)
07:37 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1106 and db1100 - T174569 (duration: 00m 49s)
05:33 subbu: subbu@terbium (followup to the linter-reparse.py script log entry) it is safe to kill -9 the script on terbium anytime if there are any problems because of it
05:27 subbu: subbu@terbium running linter-reparse.py script to initialize baseline linter categories for all wikis (12 hours in so far .. expected to run for ~2 weeks). hits parsoid eqiad cluster.
04:06 demon@tin: Synchronized docroot/search.wikimedia.org/robots.txt: go away robots / kill some 404s (duration: 00m 50s)
02:29 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.7) (duration: 05m 57s)
00:59 ejegg: updated payments-wiki from d150287 to 210cb37
00:53 Reedy: going to restart zuul as it's got backed up
00:33 maxsem@tin: Synchronized wmf-config/throttle.php: https://gerrit.wikimedia.org/r/#/c/391220/ (duration: 00m 49s)
00:18 ejegg: updated SmashPig payments listener from 5262d53 to 45aa626
00:12 ejegg: updated CiviCRM from f571c67 to b99a9cf
2017-11-14
23:29 mutante: releases1001 - chmod g+w /srv/org/wikimedia/releases/mediawiki/1.*
23:01 urandom: Decommissioning Cassandra, restbase1014-a.eqiad.wmnet (T179422 )
22:57 demon@tin: Synchronized w/: Removing mobilelanding from global w/, few sites actually need it (duration: 00m 49s)
22:55 demon@tin: Synchronized docroot/m.wikipedia.org/w/mobilelanding.php: symlink -> real file (duration: 00m 50s)
21:42 no_justification: gerrit: restarted services on master/cobalt, things will flap for a second
21:38 mutante: gerrit2001 - restarted gerrit
21:17 ebernhardson@tin: Finished deploy [search/mjolnir/deploy@494e0c6]: redeploy mjolnir submodule bump to master (duration: 02m 11s)
21:15 ebernhardson@tin: Started deploy [search/mjolnir/deploy@494e0c6]: redeploy mjolnir submodule bump to master
21:03 ebernhardson@tin: Finished deploy [search/mjolnir/deploy@77310bc]: update mjolnir to master (duration: 02m 05s)
21:01 ebernhardson@tin: Started deploy [search/mjolnir/deploy@77310bc]: update mjolnir to master
20:37 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group0 to wmf.8
20:15 bblack: reboot cp4024
19:58 ebernhardson@tin: Finished deploy [search/mjolnir/deploy@b20f0da]: test mjolnir deployment (duration: 02m 12s)
19:57 urandom: Bootstrapping restbase2002-b.codfw.wmnet (T179422 )
19:56 ebernhardson@tin: Started deploy [search/mjolnir/deploy@b20f0da]: test mjolnir deployment
19:47 demon@tin: Finished scap: bootstrap wmf.8 (duration: 44m 37s)
19:32 chasemp: clean up instances in error state in testlabs project
19:10 jynus: restart db2034 for mariadb upgrade
19:09 chasemp: for i in `OS_TENANT_NAME=testlabs openstack server list | grep stress | awk '{print $2}'`; do echo $i; OS_TENANT_NAME=testlabs openstack server delete $i; sleep 30; done T171473
19:02 demon@tin: Started scap: bootstrap wmf.8
18:32 arlolra: Updated Parsoid to e71937d0 (T178253 )
18:31 mholloway-shell@tin: Finished deploy [mobileapps/deploy@9b10959]: Redeploying: Update mobileapps to c002862 (duration: 29m 12s)
18:28 moritzm: upgraded nginx on notebook* to 1.13.6
18:26 madhuvishy: Upgraded notebook1001 and 1002 to kernel version 4.9.51-1~bpo8+1
18:22 arlolra@tin: Finished deploy [parsoid/deploy@b150764]: Updating Parsoid to e71937d0 (duration: 09m 13s)
18:21 ebernhardson@tin: Finished deploy [search/mjolnir/deploy@e6905f4]: test mjolnir deployment (duration: 01m 13s)
18:19 ebernhardson@tin: Started deploy [search/mjolnir/deploy@e6905f4]: test mjolnir deployment
18:13 arlolra@tin: Started deploy [parsoid/deploy@b150764]: Updating Parsoid to e71937d0
18:08 ebernhardson@tin: Finished deploy [search/mjolnir/deploy@cd6ddda]: (no justification provided) (duration: 04m 28s)
18:07 moritzm: rebooting notebook* hosts for update to 4.9.51
18:04 ebernhardson@tin: Started deploy [search/mjolnir/deploy@cd6ddda]: (no justification provided)
18:02 mholloway-shell@tin: Started deploy [mobileapps/deploy@9b10959]: Redeploying: Update mobileapps to c002862
17:58 ebernhardson@tin: Finished deploy [search/mjolnir/deploy@ceb5c2f]: (no justification provided) (duration: 00m 20s)
17:57 ebernhardson@tin: Started deploy [search/mjolnir/deploy@ceb5c2f]: (no justification provided)
17:56 ebernhardson@tin: Finished deploy [search/MjoLniR/deploy@607adfb]: (no justification provided) (duration: 00m 14s)
17:56 ebernhardson@tin: Started deploy [search/MjoLniR/deploy@607adfb]: (no justification provided)
17:44 herron: restarted puppetdb on nitrogen and nihal to pick up jre updates
17:42 smalyshev@tin: Finished deploy [wdqs/wdqs@b44cf27]: data reload/T176593 (duration: 00m 16s)
17:42 smalyshev@tin: Started deploy [wdqs/wdqs@b44cf27]: data reload/T176593
17:27 demon@tin: Pruned MediaWiki: 1.31.0-wmf.3 (duration: 07m 58s)
17:12 demon@tin: Synchronized wmf-config/CommonSettings.php: Beta-only, no-op (duration: 00m 43s)
17:11 demon@tin: Synchronized wmf-config/extension-list-labs: No-op (duration: 00m 44s)
17:10 demon@tin: Pruned MediaWiki: 1.31.0-wmf.6 [keeping static files] (duration: 02m 48s)
16:32 twentyafterfour: Restarting apache2 on phab1001 (deploy phabricator hotfix: D876 )
16:23 jynus: stop labsdb1010 mariadb to clone it later to labsdb1009 T179244
15:04 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Fully pool db1105 in s1 and s2 - T178359 (duration: 00m 45s)
15:00 thcipriani@tin: testing IRC logging
15:00 urandom: Decommissioning Cassandra, restbase1012-c.eqiad.wmnet (T179422 )
14:37 moritzm: installing postgres security updates on maps*
14:29 zeljkof: EU SWAT finished
14:07 moritzm: installing postgres security updates on labsdb1004
13:11 moritzm: installing openssl updates
13:10 akosiaris: shutdown install1002 for disk resize
12:57 godog: upgrade grafana to 4.6.1 on https://grafana-labs.wikimedia.org/ - T180428
12:03 ema: restart varnish-be on cp3007, requests failing with 'no backend connection'
11:49 moritzm: rebooting remaining app servers in eqiad for update to Linux 4.9.51 (and to pick up OpenSSL updates)
11:22 moritzm: rebooting remaining API servers in eqiad for update to Linux 4.9.51 (and to pick up OpenSSL updates)
11:04 godog: reimage restbase2002 - T179422
10:50 marostegui: Deploy alter table on s5: db1104 db1100 db1106 - T174569
10:32 elukey: removed old target configs from /srv/prometheus/analytics/targets on prometheus100[34] after https://gerrit.wikimedia.org/r/391179
10:23 moritzm: rebooting image scalers in eqiad for update to Linux 4.9.51 (and to pick up OpenSSL updates)
10:07 godog: upload scap 3.7.2-1 - T127762
09:59 mobrovac@tin: Synchronized wmf-config/jobqueue.php: JobQueue: Migrate RecordLintJob to EventBus - T175212 (duration: 00m 46s)
09:56 marostegui: Deploy alter table on s5 - dbstore1001 - T174569
09:49 foks: Disabled 2FA for Jean-FrĂŠdĂŠric
09:46 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Pool db1105 as multi-instance host for s1 and s2 - T178359 (duration: 00m 46s)
09:45 marostegui@tin: Synchronized wmf-config/db-codfw.php: Pool db1105 as multi-instance host for s1 and s2 - T178359 (duration: 00m 47s)
09:43 godog: upgrade prometheus to 1.8.1 with k8s on prometheus2004 - T177395
09:27 ppchelko@tin: Finished deploy [cpjobqueue/deploy@df5fca9]: Enable RecordLintJob processing (duration: 00m 40s)
09:27 ppchelko@tin: Started deploy [cpjobqueue/deploy@df5fca9]: Enable RecordLintJob processing
09:07 moritzm: rebooting remaining app servers in codfw for update to Linux 4.9.51 (and to pick up OpenSSL updates)
09:05 jmm@puppetmaster1001: conftool action : set/pooled=yes; selector: mw2108.codfw.wmnet
09:05 jmm@puppetmaster1001: conftool action : set/pooled=yes; selector: wtp2018.codfw.wmnet
08:31 marostegui: Deploy alter table on s5 - dbstore1002 - T174569
08:22 addshore@tin: Synchronized wmf-config/extension-list-labs: Add AdvancedSearch to extension-list T180147 PT 2/2 LABS ONLY (duration: 00m 46s)
08:21 addshore@tin: Synchronized wmf-config/extension-list: Add AdvancedSearch to extension-list T180147 PT 1/2 (duration: 00m 47s)
08:13 moritzm: rebooting job runners in codfw for update to Linux 4.9.51 (and to pick up OpenSSL updates)
06:28 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase weight for db1103 on s2 and s4 - T178359 (duration: 00m 47s)
06:18 marostegui: Deploy alter table on s6 primary master (db1061) - T174569
02:30 l10nupdate@tin: ResourceLoader cache refresh completed at Tue Nov 14 02:30:59 UTC 2017 (duration 6m 38s)
02:24 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.7) (duration: 07m 28s)
01:35 mdholloway: mobileapps finished rolling back to 11/8 deployment
01:34 mholloway-shell@tin: Finished deploy [mobileapps/deploy@00e60b2]: (no justification provided) (duration: 02m 27s)
01:31 mholloway-shell@tin: Started deploy [mobileapps/deploy@00e60b2]: (no justification provided)
01:30 mdholloway: mobileapps rolling back today's deployment due to a significant increase in errors.
2017-11-13
22:49 bawolff: deployed patch T119158 (Will affect language converter. -{}- no longer allowed in link urls)
22:42 bawolff: deployed patch T124404
22:02 mholloway-shell@tin: Finished deploy [mobileapps/deploy@9b10959]: Update mobileapps to c002862 (duration: 05m 31s)
21:57 mholloway-shell@tin: Started deploy [mobileapps/deploy@9b10959]: Update mobileapps to c002862
21:40 urandom: Decommissioning Cassandra, restbase1012-b.eqiad.wmnet (T179422 )
20:20 ejegg: updated CiviCRM from ddc3881 to f571c67
19:20 smalyshev@tin: Finished deploy [wdqs/wdqs@b44cf27]: data reload/T176593 (duration: 00m 20s)
19:20 smalyshev@tin: Started deploy [wdqs/wdqs@b44cf27]: data reload/T176593
19:18 thcipriani@tin: Synchronized wmf-config/abusefilter.php: SWAT: Enable per-filter profiling on enwiki T179323 (duration: 00m 45s)
19:16 thcipriani@tin: Synchronized docroot/search.wikimedia.org/index.php: search.wikimedia.org: Clean up result returning logic (duration: 00m 47s)
19:15 madhuvishy: Kick of second dumps rsync from ms1001 to labstore1006
18:28 gehel: deploy latest blazegraph + GUI on wdqs200[23] to switch vocabulary - T176593
18:20 elukey: drain + shutdown analytics1029 as prep step to replace the BBU - T178742
18:09 hoo: Ran "scap pull" on mwdebug1001/snapshot1001 after (further) tests re T177486
18:02 urandom: Restarting Cassandra, restbase-dev1004.eqiad.wmnet (testing new `c-foreach-restart`)
18:00 demon@tin: Synchronized docroot/search.wikimedia.org/index.php: minor cleanup (duration: 00m 47s)
17:28 hoo: Ran "scap pull" on mwdebug1001 after tests re T177486
17:12 mobrovac: restbase depooling restbase1007, restbase1012, restbase1014, restbase2002, restbase2004, restbase2006 for T179422
16:54 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase weight for db1103 on s2 and s4 - T178359 (duration: 00m 47s)
16:45 mobrovac@tin: Finished deploy [mathoid/deploy@63b2ddc]: Update to service-template-node v0.5.3 - T151396 (duration: 03m 45s)
16:42 mobrovac@tin: Started deploy [mathoid/deploy@63b2ddc]: Update to service-template-node v0.5.3 - T151396
16:30 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1085 - T174569 (duration: 00m 46s)
15:58 akosiaris@puppetmaster1001: conftool action : set/pooled=inactive; selector: name=wtp2017.codfw.wmnet
15:58 akosiaris@puppetmaster1001: conftool action : set/pooled=no; selector: name=wtp2017.codfw.wmnet
15:22 otto@tin: Finished deploy [eventlogging/analytics@e024af3]: T179625 (duration: 00m 02s)
15:22 otto@tin: Started deploy [eventlogging/analytics@e024af3]: T179625
15:09 otto@tin: Finished deploy [eventlogging/analytics@03285e4]: Reverting, got an error: userAgent is a <type unicode>. (duration: 00m 02s)
15:09 otto@tin: Started deploy [eventlogging/analytics@03285e4]: Reverting, got an error: userAgent is a <type unicode>.
15:08 urandom: Decommissioning Cassandra, restbase1012-a.eqiad.wmnet (T179422 )
15:06 otto@tin: Finished deploy [eventlogging/analytics@5796c27]: T179625 (duration: 00m 04s)
15:06 otto@tin: Started deploy [eventlogging/analytics@5796c27]: T179625
14:58 herron: upgrading puppetmaster2002 to puppet 4
14:15 zeljkof: EU SWAT finished
14:10 ladsgroup@tin: Synchronized wmf-config/InitialiseSettings.php: Whitelist jenkins in test wiki (T167432 ) (duration: 00m 47s)
14:08 marostegui: Deploy alter table on db1102.s6 (with replication - sanitarium master) - T174569
13:49 marostegui: Stop replication on labsdb1010 to copy cebwiki.geo_tags table
13:37 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase weight for db1103 on s2 and s4 - T178359 (duration: 00m 46s)
13:36 marostegui@tin: Synchronized wmf-config/db-codfw.php: Increase weight for db1103 on s2 and s4 - T178359 (duration: 00m 46s)
13:10 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1083 - T174569 (duration: 00m 46s)
13:09 marostegui: Deploy schema change on db1083 - T174569
12:58 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1088 - T174569 (duration: 00m 47s)
12:14 ema: cache_misc: upgrade varnish to 5.1.3-1wm2
12:08 ema: cp3008: upgrade varnish to 5.1.3-1wm2
12:02 ema: cp4021: restart varnish-be (mbox lag)
11:56 moritzm: installing imagemagick security updates
11:48 moritzm: installing irssi security updates
11:18 elukey: restart of all the druid daemons on druid100[1-6] to apply the new prometheus jmx jvm exporters - T177459
10:57 gehel: upgrade elasticsearch on cirrus / codfw - T178411
10:49 marostegui: Deploy schema change on db1088 - T174569
10:49 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1088 - T174569 (duration: 00m 54s)
10:44 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1093 - T174569 (duration: 00m 46s)
10:28 marostegui@tin: Synchronized wmf-config/db-codfw.php: Pool db1103 as multi-instance host for s2 and s4 - T178359 (duration: 00m 46s)
10:27 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Pool db1103 as multi-instance host for s2 and s4 - T178359 (duration: 00m 47s)
10:12 moritzm: rebooting remaining Parsoid hosts in eqiad for update to Linux 4.9.51 (and to pick up OpenSSL update)
09:44 godog: test upgrade of prometheus 1.8.1 with k8s on prometheus2003 - T177395
09:29 moritzm: rebooting mw1221-mw1235 for update to Linux 4.9.51 (and to pick up OpenSSL update)
09:02 elukey: restart of druid brokers on druid100[1-6] to apply https://gerrit.wikimedia.org/r/390419 - T177459
08:41 marostegui: Deploy alter table on db1093 - T174569
08:40 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1093 - T174569 (duration: 00m 46s)
08:35 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1098 after alter table - T174569 (duration: 00m 47s)
08:33 moritzm: rebooting mw1238-mw1258 for update to Linux 4.9.51 (and to pick up OpenSSL update)
07:48 moritzm: installing ruby2.3 security updates
07:38 marostegui: Deploy alter table to db1104 - T179106
07:33 marostegui: Optimize wb_terms table on db2052 - T179106
06:44 marostegui: Deploy alter table directly on codfw s5 master (db2023), this will generate lag on codfw - T179793
06:27 marostegui: Deploy alter table on db1098 - T174569
06:27 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1098 - T174569 (duration: 00m 49s)
06:18 marostegui: Deploy alter table db2086 - T179106
04:01 urandom: Decommissioning Cassandra, restbase1007-c.eqiad.wmnet (T179422 )
02:38 l10nupdate@tin: ResourceLoader cache refresh completed at Mon Nov 13 02:38:38 UTC 2017 (duration 6m 42s)
02:31 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.7) (duration: 08m 18s)
2017-11-12
19:39 urandom: Decommissioning Cassandra, restbase1007-b.eqiad.wmnet (T179422 )
16:19 bblack: cp4026 - restart backend (mailbox lag)
01:29 urandom: Decommissioning Cassandra, restbase1007-a.eqiad.wmnet (T179422 )
2017-11-11
13:52 urandom: Decommissioning Cassandra, restbase2006-c.codfw.wmnet (T179422 )
00:48 urandom: Decommissioning Cassandra, restbase2006-b.codfw.wmnet (T179422 )
2017-11-10
18:08 smalyshev@tin: Finished deploy [wdqs/wdqs@ccab8ce]: data reload/T176593 (duration: 00m 34s)
18:08 smalyshev@tin: Started deploy [wdqs/wdqs@ccab8ce]: data reload/T176593
17:20 moritzm: uploaded icu 57.1-6+wmf1 for jessie-wikimedia/component/icu57 (co-installable build for ICU migration)
17:11 moritzm: freed some disk space on install1002
16:47 marostegui: Deploy alter table on s6, dbstore1001, dbstore1002 abd db1030 - T174569
15:03 moritzm: rebooting remaing API servers in codfw to 4.9.51 (and to pick up OpenSSL updates)
14:57 urandom: Decommissioning Cassandra, restbase2006-a.codfw.wmnet (T179422 )
14:17 moritzm: rebooting image scalers in codfw to 4.9.51 (and to pick up OpenSSL updates)
13:10 marostegui: truncate /var/log/nginx/error.log.1 on install1002 as it is filling up
13:09 moritzm: powercycling mw2118, stuck after reboot
13:05 ema: cp4021: restart varnish-be due to mbox lag
12:48 moritzm: rebooting video scalers in codfw to 4.9.51 (and to pick up OpenSSL updates)
11:46 _joe_: restarted all services and repooled scb1001
11:38 moritzm: rebooting mw2163-2199 to 4.9.51 (and to pick up OpenSSL updates)
11:32 _joe_: stopping mobileapps as well on scb1001
11:27 moritzm: rebooting wtp1025 to 4.9.51
11:22 _joe_: stopping changeprop, celery-ores, cpjobqueue on scb1001
11:21 addshore@tin: Synchronized wmf-config/InitialiseSettings-labs.php: Disable AdvancedSearch on deployment.beta BETA ONLY T180201 (duration: 00m 46s)
11:15 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Remove old comment about db1080 (duration: 00m 46s)
11:06 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Restore db1055 original weight - T178359 (duration: 00m 46s)
10:59 jmm@puppetmaster1001: conftool action : set/pooled=inactive; selector: wtp2017.codfw.wmnet
10:55 _joe_: depooling scb1001 from all services while it becomes healthy again
10:52 _joe_: restarting ores on scb1001, causing memory exhaustion
10:51 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1055 weight - T178359 (duration: 00m 47s)
10:49 moritzm: powercycling wtp2017, stuck after reboot
10:24 marostegui: Deploy schema change on db2089 - T179106
10:11 moritzm: rebooting Parsoid servers in codfw to 4.9.51 (and to pick up OpenSSL updates)
10:03 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1055 weight - T178359 (duration: 00m 58s)
09:54 marostegui: Compress enwiki on db1105.s1 - T178359
09:45 moritzm: powercycling mw2108, stuck after reboot
09:30 hashar: Upgrading operations-puppet-tests-docker jenkins job to stop passing docker --tty and thus have signals forwarded from 'docker run' - T176747
09:23 moritzm: rebooting mw2097-mw2117 to 4.9.51 (and to pick up OpenSSL updates)
09:17 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1055 with low weight - T178359 (duration: 00m 47s)
09:13 moritzm: powercycling mw2213, stuck after reboot
08:43 moritzm: rebooting mw2200-mw2223 to 4.9.51 (and to pick up OpenSSL updates)
07:50 smalyshev@tin: Finished deploy [wdqs/wdqs@213f864]: (no justification provided) (duration: 00m 33s)
07:49 smalyshev@tin: Started deploy [wdqs/wdqs@213f864]: (no justification provided)
07:27 _joe_: restarting apache on phab1001
07:20 marostegui: Deploy alter table on s3.codfw master (db2018) with replication, this will generate lag on codfw - T174569
06:50 marostegui: Deploy alter table on s5 eqiad master (db1063) - T172207
06:41 marostegui: Stop MySQL on db1055 to copy its content to db1105 - T178359
06:40 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1055 - T178359 (duration: 00m 49s)
06:39 marostegui: Force a BBU relearn on db1046 - T166141
01:21 aaron@tin: Synchronized php-1.31.0-wmf.7/includes/db: Use the main stash for LBFactory "memStash" parameter (duration: 00m 47s)
01:18 demon@tin: Synchronized docroot/search.wikimedia.org/index.php: minor cleanups, less 500s (duration: 00m 47s)
2017-11-09
22:28 urandom: Decommissioning Cassandra, restbase2004-c.codfw.wmnet (T179422 )
20:28 demon@tin: Synchronized php-1.31.0-wmf.7/includes/libs/objectcache/WANObjectCache.php: less spammy error logs (duration: 00m 47s)
20:09 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group2 to wmf.7
19:06 ebernhardson@tin: Synchronized php-1.31.0-wmf.7/extensions/WikimediaEvents/modules/all/ext.wikimediaEvents.searchSatisfaction.js: SWAT: Turn off DBN sizing AB test (duration: 00m 51s)
19:00 bblack: cp3030 - end experimentation, puppetizing back to normal config
18:46 bblack: cp3030 - round 2 of ssl_do_wait_shutdown test
18:41 arlolra: Updated Parsoid to 2887b5ad (T178253 , T173643 , T176728 , T180010 , T171381 , T179757 )
18:28 arlolra@tin: Finished deploy [parsoid/deploy@d1c7386]: Updating Parsoid to 2887b5ad (duration: 12m 20s)
18:22 bblack: cp3030: puppet-disabled + manual nginx ssl_do_wait_shutdown config
18:16 arlolra@tin: Started deploy [parsoid/deploy@d1c7386]: Updating Parsoid to 2887b5ad
18:14 moritzm: not rebooting parsoid hosts due to Services deployment window, instead rolling restart of mw2120-mw2139 for kernel update to 4.9.51
18:04 moritzm: rolling restart of parsoid servers in codfw for 4.9.51 kernel update
17:14 urandom: Restarting Cassandra, restbase2005-b.codfw.wmnet (T179419 )
16:33 urandom: Restarting Cassandra, restbase2005-a.codfw.wmnet (T179419 )
15:12 urandom: Creating mathoid schema (T179419 )
15:05 addshore@tin: Synchronized wmf-config/CommonSettings-labs.php: Enable AdvancedSearch on beta LABS / BETA ONLY PT2/2 (duration: 00m 49s)
15:04 addshore: last sync was actually "Enable AdvancedSearch on beta"
15:04 addshore@tin: Synchronized wmf-config/InitialiseSettings-labs.php: Add AdvancedSearch to extension-list-labs LABS / BETA ONLY PT1/2 (duration: 00m 50s)
14:57 addshore@tin: Synchronized wmf-config/extension-list-labs: Add AdvancedSearch to extension-list-labs LABS / BETA ONLY (duration: 00m 50s)
14:42 zeljkof: EU SWAT finished
14:40 zfilipin@tin: Synchronized wmf-config/StartProfiler.php: SWAT: xenon: encode the request method as a virtual stack frame (duration: 00m 50s)
14:35 ladsgroup@tin: Synchronized wmf-config/InitialiseSettings.php: Use a threshold that ores in frwiki can stand (T180115 ) (duration: 00m 50s)
14:30 zfilipin@tin: Synchronized php-1.31.0-wmf.7/extensions/ContentTranslation/modules/: SWAT: Bring back the overlay support for a specific screen region (T179997) (duration: 00m 50s)
14:25 zfilipin@tin: Synchronized php-1.31.0-wmf.7/extensions/EventBus/EventBus.php: SWAT: Logging improvements Rename logged field to fix logstash mapping (duration: 00m 54s)
14:09 urandom: Decommissioning Cassandra, restbase2004-b.codfw.wmnet (T179422 )
13:04 moritzm: rebooting mw1209-mw1220 (app servers) to 4.9.51 (also to pick up new OpenSSL)
12:38 moritzm: rebooting mw1189-mw1208 (API servers) to 4.9.5 (also to pick up new OpenSSL)
12:37 Amir1: ladsgroup@terbium:/srv/mediawiki-staging/php-1.31.0-wmf.6$ mwscript extensions/ORES/maintenance/CheckModelVersions.php --wiki=frwiki (T180115 )
11:51 moritzm: rebooting mw1180-mw1188 (app servers) to 4.9.5 (also to pick up new OpenSSL)
11:45 _joe_: removed all local hacks from puppetmaster1001, now it uses rhodium again
11:37 _joe_: cleaning up spurious directories /var/lib/puppet/server/ssl/ca from eqiad's puppetmaster backends, generated due to some error on 8/11/2017
11:10 ema: powercycle cp2008, stuck rebooting
11:03 moritzm: rebooting mw1276-mw1279 (API canaries) to 4.9.5 (also to pick up new OpenSSL)
09:50 moritzm: rolling reboot of scb in eqiad for kernel update (also to pick up openssl updates)
07:56 ema: cp1074 failed rebooting, power-cycled
07:46 _joe_: restarting apache on rhodium after setting --profile --trace in the puppet settings
07:04 legoktm@tin: Synchronized php-1.31.0-wmf.7/resources/: Restore jquery.badge and jquery.placeholder modules (duration: 00m 53s)
02:58 l10nupdate@tin: ResourceLoader cache refresh completed at Thu Nov 9 02:58:30 UTC 2017 (duration 6m 59s)
02:51 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.7) (duration: 09m 09s)
02:29 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.6) (duration: 09m 18s)
01:02 twentyafterfour: Not deploying any phabricator updates this week.
2017-11-08
23:45 urandom: Decommissioning Cassandra, restbase2004.codfw.wmnet (T179422 )
22:50 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group1 to wmf.7 try #2
22:43 ebernhardson@tin: Synchronized php-1.31.0-wmf.6/extensions/CirrusSearch/: Backport cirrus rescore profile refactor to wmf.6 (duration: 01m 02s)
22:21 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group1 back to wmf.6
22:09 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group1 to wmf.7
21:44 bsitzmann@tin: Finished deploy [mobileapps/deploy@00e60b2]: Update mobileapps to 8e82983 (T178706 T178708 T178333 T170692 ) (duration: 07m 12s)
21:37 bsitzmann@tin: Started deploy [mobileapps/deploy@00e60b2]: Update mobileapps to 8e82983 (T178706 T178708 T178333 T170692 )
21:34 ejegg: re-started donations queue consumer with new thank you letter
21:09 ejegg: updated CiviCRM from b11c591 to ddc3881
20:52 ejegg: turned off donations consumer for ty letter update
20:36 ejegg: updated payments-wiki from a539d27 to d150287
20:33 aaron@tin: Synchronized php-1.31.0-wmf.6/extensions/CentralAuth: Use the proper cache key method in loadFromCache() (duration: 00m 54s)
20:22 apergos: rsync from ms1001 to labstore1006 of dumps, 17T so expect it to take several days
20:10 herron: depooled rhodium via puppetmaster1001 apache config
20:09 demon@tin: Synchronized php: symlink (duration: 00m 49s)
19:57 demon@tin: Synchronized php-1.31.0-wmf.7/extensions/CentralAuth/includes/CentralAuthUser.php: (no justification provided) (duration: 00m 51s)
19:55 urandom: Creating mathoid schema (T179419 )
19:31 niharika29@tin: Synchronized php-1.31.0-wmf.6/extensions/ORES/: Store stats of accessing ores service for getting thresholds T179862 (duration: 00m 51s)
19:31 urandom: Creating page restrictions schema (T179421 )
19:30 niharika29@tin: Synchronized php-1.31.0-wmf.7/extensions/ORES/: Store stats of accessing ores service for getting thresholds T179862 (duration: 00m 51s)
19:24 niharika29@tin: Synchronized php-1.31.0-wmf.7/extensions/CirrusSearch/: Revert Improve handling of 5xx responses to elasticsearch requests (duration: 01m 02s)
19:14 smalyshev@tin: Finished deploy [wdqs/wdqs@b330bc8]: Update service whitelist (duration: 02m 58s)
19:11 smalyshev@tin: Started deploy [wdqs/wdqs@b330bc8]: Update service whitelist
18:27 demon@tin: Synchronized wmf-config/: Dropping old PrivateSettings symlink (ducks and covers) (duration: 00m 52s)
18:19 demon@tin: Synchronized phpcs.xml: no-op (duration: 00m 50s)
17:58 urandom: Restarting Cassandra, restbase2005-[abc]
17:51 urandom: Clearing snapshots in RESTBase legacy Cassandra cluster (T179417 )
17:47 marostegui: Deploy alter table on s1 codfw primary master (db2048) with replication, this will generate lag on codfw - T174569
17:43 mobrovac: restbase truncate the default parsoid storage group's tables for T179417
17:41 urandom: Restarting Cassandra, restbase2001-[abc]
17:19 urandom: Restarting Cassandra, restbase2003-[abc]
17:09 herron: regenerated rhodium puppet certificate
17:08 urandom: Restarting Cassandra, restbase1009-[abc]
16:58 urandom: Restarting Cassandra, restbase1008-[abc]
16:49 awight@tin: Finished deploy [ores/deploy@82a13ae]: Roll back scb1002 (duration: 02m 37s)
16:47 awight@tin: Started deploy [ores/deploy@82a13ae]: Roll back scb1002
16:45 awight@tin: Finished deploy [ores/deploy@1b0e59f]: Try to purge specter of revscoring 1 (duration: 05m 45s)
16:40 awight@tin: Started deploy [ores/deploy@1b0e59f]: Try to purge specter of revscoring 1
16:37 urandom: Restarting Cassandra, restbase1010-[abc]
16:37 godog: disregard message about thumbor rolling-restart, upgrade already done and only thumbor1001 rebooted now
16:34 awight@tin: Finished deploy [ores/deploy@82a13ae]: Fix ORES on scb1002 (duration: 00m 03s)
16:34 awight@tin: Started deploy [ores/deploy@82a13ae]: Fix ORES on scb1002
16:29 godog: roll-restart thumbor in eqiad for kernel upgrade
16:11 mobrovac@tin: Finished deploy [restbase/deploy@c5dd1e2]: Switch wiktionary definitions to use the next-gen storage, take 2b - T179420 (duration: 07m 22s)
16:04 mobrovac@tin: Started deploy [restbase/deploy@c5dd1e2]: Switch wiktionary definitions to use the next-gen storage, take 2b - T179420
16:02 mobrovac@tin: Finished deploy [restbase/deploy@c5dd1e2]: Switch wiktionary definitions to use the next-gen storage, take 2 - T179420 (duration: 00m 13s)
16:02 mobrovac@tin: Started deploy [restbase/deploy@c5dd1e2]: Switch wiktionary definitions to use the next-gen storage, take 2 - T179420
16:01 otto@tin: Finished deploy [eventlogging/analytics@03285e4]: Reverting EvenCapsule update and fixes, processes got restarted too early (duration: 00m 02s)
16:01 otto@tin: Started deploy [eventlogging/analytics@03285e4]: Reverting EvenCapsule update and fixes, processes got restarted too early
15:33 urandom: Decommissioning restbase2001-c.codfw.wmnet (T179422 )
15:23 _joe_: testing changes on rhodium regarding hostprivkey,hostcert
15:21 ema: eqiad lvs reboots: upgrading kernel to 4.9.51, libssl to 1.0.2m
15:17 otto@tin: Finished deploy [eventlogging/analytics@02c5a6b]: EventCapsule update and fixes, this is no-op as is. T179625 (duration: 00m 04s)
15:17 otto@tin: Started deploy [eventlogging/analytics@02c5a6b]: EventCapsule update and fixes, this is no-op as is. T179625
15:01 otto@tin: Started restart [eventlogging/eventbus@41e3418]: Bumping worker processes to 16 on all targets: T180017
14:55 otto@tin: Started restart [eventlogging/eventbus@41e3418]: Bumping worker processes to 16: T180017
14:54 ema: esams lvs reboots: upgrading kernel to 4.9.51, libssl to 1.0.2m
14:54 otto@tin: Finished deploy [eventlogging/eventbus@41e3418]: (no justification provided) (duration: 00m 12s)
14:54 otto@tin: Started deploy [eventlogging/eventbus@41e3418]: (no justification provided)
14:41 ema: powercycle cp1050 (failed reboot)
14:29 addshore@tin: Synchronized php-1.31.0-wmf.7/docs/uidesign/mediawiki.diff.html: SWAT Add render moved paragraphs marker in diff view PT 2/2 DOCS ONLY (duration: 00m 50s)
14:28 addshore@tin: Synchronized php-1.31.0-wmf.7/resources/src/mediawiki/mediawiki.diff.styles.css: SWAT Add render moved paragraphs marker in diff view PT 1/2 (duration: 00m 51s)
14:16 volans: upgrading cumin to v1.3.0 on prod and WMCS cumin masters
14:10 zfilipin@tin: Synchronized php-1.31.0-wmf.6/extensions/EventBus/EventBus.php: SWAT: Logging improvements. (duration: 00m 52s)
14:08 ema: codfw lvs reboots: upgrading kernel to 4.9.51, libssl to 1.0.2m
13:03 hasharAway: Upgrading jenkins on contint1001/contint2001
12:54 bblack@puppetmaster1001: conftool action : set/pooled=yes; selector: name=phab2001-vcs.codfw.wmnet
12:53 bblack: restart pybal on lvs2002 for git-ssh.codfw deploy
12:51 bblack: restart pybal on lvs2005 for git-ssh.codfw deploy
12:46 mutante: osmium - re-enabling puppet - temp test is over and will be decom'ed
12:41 mutante: krypton (misc PHP apps, scholarships.wm, iegreview.wm, grafana, racktables, burrow) rebooting for kernel upgrade
12:35 mutante: bromine (misc static tistes, annual/transparency/static-bz) - rebooting for kernel upgrade
12:07 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Restore db1051 original weight after maintenance - T178359 (duration: 00m 50s)
12:03 mutante: alcyone (url-downloader) rebooting for kernel upgrade
11:53 mutante: netmon1003 (servermon) - rebooting for kernel upgrade
11:48 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1051 weight after maintenance - T178359 (duration: 00m 50s)
11:48 moritzm: installed openjdk-8/openssl updates and new kernels on restbase*
11:15 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1051 weight after maintenance - T178359 (duration: 00m 50s)
11:12 moritzm: installing jenkins security update on releases*
11:10 moritzm: imported jenkins 2.73.3 to apt.wikimedia.org
10:58 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1051 with low weight after maintenance - T178359 (duration: 01m 01s)
10:57 ema: ulsfo lvs reboots: upgrading kernel to 4.9.51, libssl to 1.0.2m
10:42 mutante: actinium - reboot for kernel upgrade (url-downloader)
10:40 ema: varnish 5.1.3-1wm2 built and uploaded to apt.w.o (experimental)
10:39 mutante: aluminium - reboot for kernel upgrade (url-downloader)
10:28 elukey@puppetmaster1001: conftool action : set/pooled=no; selector: name=aqs1005.eqiad.wmnet
10:18 elukey@puppetmaster1001: conftool action : set/pooled=no; selector: name=aqs1004.eqiad.wmnet
10:07 elukey: reboot aqs100[4-9] for jvm and kernel updates
09:37 mutante: planet1001, alsafi (url-downloader) - apt autoremove; reboot for kernel upgrade
09:32 mutante: restarting ircecho (icinga-wm)
09:26 mutante: ununpentium (rt.wikimedia.org): apt-get autoremove; reboot for kernel upgrade
09:23 mutante: rutherforidum (people.wikimedia.org) : apt-get autoremove ; reboot for kernel upgrade
09:22 Pchelolo: restart cassandra-a on restbase1010
09:09 Pchelolo: restart restbase on 1013 and 1015
08:58 mutante: planet2001 - apt autoremove; reboot for kernel upgrade
08:40 ema: resume cache_text/upload rolling reboots: upgrading kernel to 4.9.51, libssl to 1.0.2m and 1.1.0g
07:55 marostegui: Deploy alter table on s1 - on codfw master (db2048) with replication enabled - T172207
07:46 marostegui: Deploy alter table on s2 - on codfw master (db2017) with replication enabled - T172207
07:21 marostegui: Deploy alter table on s3 - on codfw master (db2018) with replication enabled - T172207
07:13 marostegui: Deploy alter table on s7 - on codfw master (db2029) with replication enabled - T172207
07:13 krinkle@tin: Synchronized docroot/noc/conf/: I2e51e783a (duration: 01m 06s)
06:57 marostegui: Deploy alter table on s6 - on codfw master (db2028) with replication enabled - T172207
06:26 marostegui: Add 330G to db2023 partition to make sure the alter over logging table runs fine - T174569
06:16 Krinkle: Restarted uwsgi-graphite-web service on graphite2001
06:16 Krinkle: Restarted uwsgi-graphite-web service on graphite1001
06:15 marostegui: Stop MySQL on db1051 to copy its content to db1105.s1 - T178359
06:14 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1051 - T178359 (duration: 00m 51s)
05:45 Krinkle: rm -rf /var/lib/carbon/whisper/MediaWiki/wanobjectcache/centralauth_user_* on graphite1001 and graphite2001 for T179999
03:17 l10nupdate@tin: ResourceLoader cache refresh completed at Wed Nov 8 03:17:53 UTC 2017 (duration 7m 13s)
03:10 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.7) (duration: 15m 16s)
02:44 aaron@tin: Synchronized php-1.31.0-wmf.7/tests: Deploy 087f2d579a9f which reverts 4432e898be0 due to statsd spam (duration: 01m 14s)
02:42 aaron@tin: Synchronized php-1.31.0-wmf.7/includes: Deploy 087f2d579a9f which reverts 4432e898be0 due to statsd spam (duration: 01m 40s)
02:33 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.6) (duration: 07m 38s)
2017-11-07
23:57 urandom: Decommissioning restbase2001-b.codfw.wmnet (T179422 )
22:50 ejegg: re-enabled thank you mailer
22:49 ejegg: updated CiviCRM from 85e89c6 to dc1b279
22:39 hoo: Reset a global account's email, per T179950
22:28 ejegg: updated CiviCRM from 122ba65 to 85e89c6
21:56 ejegg: updated CiviCRM from dc9d054 to 122ba65
21:20 addshore@tin: Synchronized wmf-config/Wikibase-buildentry.php: T176948 Remove Shared Cache settings from Wikibase-buildentry (duration: 00m 50s)
21:16 addshore@tin: Synchronized wmf-config/Wikibase.php: T176948 Stop using wgWikibaseSharedCacheKeyPrefix from Wikidata build (duration: 00m 49s)
21:09 addshore@tin: Synchronized wmf-config/InitialiseSettings-labs.php: T176948 Remove wmgWikibaseUseConfigFromWikidataBuild PT 3/3 (LABS) (duration: 00m 50s)
21:07 addshore@tin: Synchronized wmf-config/InitialiseSettings.php: T176948 Remove wmgWikibaseUseConfigFromWikidataBuild PT 2/3 (duration: 00m 52s)
21:06 addshore@tin: Synchronized wmf-config/Wikibase.php: T176948 Remove wmgWikibaseUseConfigFromWikidataBuild PT 1/3 (duration: 00m 50s)
21:02 addshore@tin: Synchronized wmf-config/InitialiseSettings.php: T176948 wmgWikibaseUseConfigFromWikidataBuild flase for all of PROD (duration: 00m 49s)
21:01 addshore@tin: Synchronized wmf-config/InitialiseSettings.php: T176948 Load wikibase build from mediawiki-config for wikidataclient (duration: 00m 50s)
20:59 addshore@tin: Synchronized wmf-config/InitialiseSettings.php: T176948 Load wikibase build from mediawiki-config for group0 & group1 (duration: 00m 50s)
20:54 addshore@tin: Synchronized wmf-config/InitialiseSettings.php: T176948 Load wikibase build from mediawiki-config for hewiki (duration: 00m 49s)
20:51 addshore@tin: Synchronized wmf-config/InitialiseSettings.php: T176948 Load wikibase build from mediawiki-config for wikidatawiki (duration: 00m 50s)
20:48 herron: puppet issue cleared after reverting 386666. restarting ircecho on einsteinium
20:45 addshore@tin: Synchronized wmf-config/InitialiseSettings-labs.php: T176948 #1 #2 #3 Load wikibase build from mediawiki-config for BETA ONLY (duration: 00m 50s)
20:28 addshore@tin: Synchronized wmf-config/Wikibase.php: Add loading of wikibase extensions from build PT 3/3 (duration: 00m 50s)
20:27 addshore@tin: Synchronized wmf-config/InitialiseSettings.php: Add loading of wikibase extensions from build PT 2/3 (duration: 00m 50s)
20:25 addshore@tin: Synchronized wmf-config/Wikibase-buildentry.php: Add loading of wikibase extensions from build PT 1/3 (duration: 00m 49s)
20:20 addshore@tin: Synchronized wmf-config/InitialiseSettings.php: Remove unused wmgUseWikibasePropertySuggester PT 2/2 (duration: 00m 50s)
20:19 addshore@tin: Synchronized wmf-config/CommonSettings.php: Remove unused wmgUseWikibasePropertySuggester PT 1/2 (duration: 00m 50s)
20:06 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group0 to wmf.7
19:44 volans: restarted apache2 on rhodium (puppet master failing)
19:31 herron: restart apache2 service on rhodium
19:12 demon@tin: Finished scap: wmf.7 bootstrap (duration: 48m 15s)
19:03 awight: begin stress test on ores*
18:58 awight@tin: Finished deploy [ores/deploy@29905e5]: test deployment to repair ores1009 (non-production) (duration: 00m 31s)
18:57 awight@tin: Started deploy [ores/deploy@29905e5]: test deployment to repair ores1009 (non-production)
18:56 awight@tin: Finished deploy [ores/deploy@29905e5]: test deployment to repair ores1008 (non-production) (duration: 00m 32s)
18:56 awight@tin: Started deploy [ores/deploy@29905e5]: test deployment to repair ores1008 (non-production)
18:50 awight@tin: Finished deploy [ores/deploy@29905e5]: test deployment to repair ores1002 (non-production) (duration: 00m 33s)
18:49 awight@tin: Started deploy [ores/deploy@29905e5]: test deployment to repair ores1002 (non-production)
18:39 volans: slowly running puppet on failed hosts with cumin (concurrency=5)
18:38 awight@tin: Finished deploy [ores/deploy@29905e5]: test deployment to ores* (non-production) (duration: 00m 20s)
18:38 awight@tin: Started deploy [ores/deploy@29905e5]: test deployment to ores* (non-production)
18:36 _joe_: restarting apache2 on rhodium
18:24 demon@tin: Started scap: wmf.7 bootstrap
18:16 awight@tin: Finished deploy [ores/deploy@29905e5]: test deployment to ores* (non-production) (duration: 01m 04s)
18:15 awight@tin: Started deploy [ores/deploy@29905e5]: test deployment to ores* (non-production)
18:06 herron: restarting puppetdb service on nitrogen
17:56 urandom: Decommissioning restbase2001-a.codfw.wmnet (T179422 )
17:55 elukey: stop ircecho on einstenium (puppet shower from nitrogen)
17:40 ema: stop cache_text/upload rolling reboots, resuming tomorrow
{{safesubst:SAL entry|1=17:38 urandom: Restart Cassandra, restbase1010-{a,b,c}.eqiad.wmnet (T178177)}}
17:33 urandom: Clearing Cassandra snapshots (T179422 )
17:32 moritzm: rolling reboot of scb in codfw to pick up new kernel (and openssl updates)
{{safesubst:SAL entry|1=17:02 urandom: Restarting Cassandra, restbase2001-{a,b,c} to apply OpenJDK upgrade}}
16:47 demon@tin: Synchronized multiversion/submodules.json: no op (duration: 00m 47s)
16:12 ema: start cache_text/upload rolling reboots: upgrading kernel to 4.9.51, libssl to 1.0.2m and 1.1.0g
15:46 gehel: rolling restart of cassandra maps-test for logging change
15:43 godog: roll-restart thumbor in eqiad for kernel upgrade
15:37 urandom: T179420 : recreating wiktionary definition schemas
15:29 mobrovac@tin: Finished deploy [restbase/deploy@eab2948]: revert definition switch, wrong schema - T179420 (duration: 06m 46s)
15:22 mobrovac@tin: Started deploy [restbase/deploy@eab2948]: revert definition switch, wrong schema - T179420
15:12 _joe_: added a runner for htmlCacheUpdate on cewiki too
15:10 mobrovac@tin: Finished deploy [restbase/deploy@c5dd1e2]: Switch wiktionary definitions to use the next-gen storage - T179420 (duration: 07m 52s)
15:02 mobrovac@tin: Started deploy [restbase/deploy@c5dd1e2]: Switch wiktionary definitions to use the next-gen storage - T179420
14:52 godog: roll-restart thumbor for kernel upgrades
14:38 mobrovac: restbase creating wiktionary definition schemas for T179420
14:36 godog: reboot wezen for kernel upgrade
14:33 godog: reboot lithium for kernel upgrade
14:26 elukey: rolling restart of kafka on kafka-jumbo* for jvm security updates
14:20 moritzm: rebooting mw canaries to 4.9.51 kernel (also picking up openssl/openssl1.1 updates)
14:19 moritzm: rearming keyholder on naos
14:14 hashar@tin: Synchronized wmf-config/InitialiseSettings.php: Enable ShortUrl on pa.wiki - T178919 (duration: 00m 46s)
14:11 hashar: mwscript extensions/WikimediaMaintenance/createExtensionTables.php --wiki=pawiki ShortUrl - T178919
14:08 moritzm: rebooting tureis for kernel update
14:07 hashar@tin: Synchronized wmf-config/InitialiseSettings.php: add _ in appendix talk namespace at mywiktionary - T179907 (duration: 00m 45s)
14:02 moritzm: actually holding mw canary reboots until SWAT is over
14:01 moritzm: rebooting mw canaries to 4.9.51 kernel (also picking up openssl/openssl1.1 updates)
13:50 herron: rebooted fermium (lists) for kernel update
13:46 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Add comment about db1105 current status (duration: 00m 47s)
13:43 herron: starting rolling mx reboots for kernel update
13:31 marostegui: Deploy schema change on s5 codfw master (db2023) with replication, this will generate lag on codfw - T174569
13:19 moritzm: rebooting naos for kernel update
13:17 gehel: reboot maps eqiad cluster for upgrades
13:10 moritzm: rebooting wasat for kernel update
12:50 mutante: hafnium, tungsten: groupdel perf-roots to go with gerrit:389663 (T179728 )
12:00 mobrovac@tin: Finished deploy [restbase/deploy@eab2948]: Use the new storage for wikidata.org - T179417 (duration: 08m 14s)
11:52 mobrovac@tin: Started deploy [restbase/deploy@eab2948]: Use the new storage for wikidata.org - T179417
11:43 mobrovac: restbase truncating cassandra 2 non-WP tables for T179417
11:42 mobrovac: restbase truncating cassandra 2 non-WP tables for T179420
11:29 moritzm: installing java security updates/restarting cassandra on restbase2001 (cassandra3 node)
11:07 ema: cache_misc rolling reboots: upgrading kernel to 4.9.51, libssl to 1.0.2m and 1.1.0g
10:59 ema: reboot cp3007: upgrading kernel to 4.9.51, libssl to 1.0.2m and 1.1.0g
10:59 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Pool db2092 as recentchanges multi-instance host on s1 and s3 T178359 (duration: 00m 45s)
10:58 marostegui@tin: Synchronized wmf-config/db-codfw.php: Pool db2092 as recentchanges multi-instance host on s1 and s3 T178359 (duration: 00m 45s)
10:41 moritzm: restarting ntpd on dns recursors to pick up openssl update
10:28 marostegui@tin: Synchronized wmf-config/db-codfw.php: Pool db2091 as recentchanges multi-instance host on s2 and s4 T178359 (duration: 00m 45s)
10:27 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Pool db2091 as recentchanges multi-instance host on s2 and s4 T178359 (duration: 00m 46s)
10:24 elukey: create staging database on db1108 (researchers scratch pad) - T177405
10:19 marostegui@tin: Synchronized wmf-config/db-codfw.php: Pool db2086 as recentchanges multi-instance host on s5 and s7 T178359 (duration: 00m 45s)
10:09 gehel: reboot of maps codfw cluster for upgrades
09:59 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Pool db2086 as recentchanges multi-instance host on s5 and s7 T178359 (duration: 00m 45s)
09:58 paravoid: updating certspotter to 0.5 in apt and tegmen/einsteinium
09:40 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Pool db2089 as recentchanges multi-instance host on s6 and s5(s8) T178359 (duration: 00m 46s)
09:39 marostegui@tin: Synchronized wmf-config/db-codfw.php: Pool db2089 as recentchanges multi-instance host on s6 and s5(s8) T178359 (duration: 00m 45s)
09:13 marostegui: Stop MySQL on db1038 - host to be decommissioned - T177911
09:11 moritzm: installing java security updates/restarting cassandra on restbase2002
09:02 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Remove db1038 from config - T177911 (duration: 00m 45s)
09:02 ppchelko@tin: Finished deploy [cpjobqueue/deploy@3db0cc4]: Do not set Host header for requests to jobrunner (duration: 00m 49s)
09:02 marostegui@tin: Synchronized wmf-config/db-codfw.php: Remove db1038 from config - T177911 (duration: 00m 47s)
09:01 ppchelko@tin: Started deploy [cpjobqueue/deploy@3db0cc4]: Do not set Host header for requests to jobrunner
08:49 marostegui: Run redact_sanitarium for hifwiktionary on db1095 (sanitarium) - T173647
08:48 marostegui: Optimize pagelinks and templatelinks on s7 master - db1062 - T174509
07:18 mobrovac@tin: Started restart [electron-render/deploy@8dd5f13]: Electron stuck, restarting - T174916
07:11 ema: reboot pinkunicorn for kernel (4.9.51) and openssl (1.0.2m) upgrades
06:19 marostegui: Deploy alter table on s7 codfw master (db2029) with replication, this will cause lag in codfw - T174569
02:29 l10nupdate@tin: ResourceLoader cache refresh completed at Tue Nov 7 02:29:54 UTC 2017 (duration 6m 39s)
02:23 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.6) (duration: 07m 19s)
2017-11-06
22:34 bawolff: Deploy patch for T178451
21:48 arlolra: Updated Parsoid to 6cb77104 (T179579 , T175792 )
21:42 arlolra@tin: Finished deploy [parsoid/deploy@d1df3c3]: (no justification provided) (duration: 08m 46s)
21:35 awight: Begin stress testing ores* (non-production)
21:33 arlolra@tin: Started deploy [parsoid/deploy@d1df3c3]: (no justification provided)
21:20 XioNoX: re-activating BGP session to telia in ulsfo
21:16 ladsgroup@tin: Finished deploy [ores/deploy@97a1d80]: Deploying early November (T179837 ) (duration: 10m 47s)
21:05 ladsgroup@tin: Started deploy [ores/deploy@97a1d80]: Deploying early November (T179837 )
20:56 reedy@tin: Synchronized wmf-config/interwiki.php: Update IW Map again (duration: 00m 46s)
20:50 marostegui: Deploy alter table on s2 codfw master (db2017) with replication, this will generate lag in codfw - T174569
20:36 reedy@tin: Synchronized wmf-config/InitialiseSettings.php: hifwiktionary T173643 (duration: 00m 45s)
20:34 reedy@tin: rebuilt wikiversions.php and synchronized wikiversions files: hifwiktionary T173643
20:33 reedy@tin: Synchronized dblists/: hifwiktionary T173643 (duration: 00m 47s)
20:04 reedy@tin: Synchronized wmf-config/interwiki.php: Update IW map (duration: 00m 45s)
19:57 reedy@tin: Synchronized wmf-config/InitialiseSettings.php: electcomwiki T174370 (duration: 00m 45s)
19:56 reedy@tin: rebuilt wikiversions.php and synchronized wikiversions files: electcomwiki T174370
19:55 reedy@tin: Synchronized dblists/: electcomwiki T174370 (duration: 00m 44s)
19:52 demon@tin: Pruned MediaWiki: 1.31.0-wmf.2 (duration: 02m 49s)
19:47 demon@tin: Pruned MediaWiki: 1.31.0-wmf.5 [keeping static files] (duration: 01m 38s)
19:22 Amir1: ladsgroup@terbium:/srv/mediawiki-staging/php-1.31.0-wmf.4$ mwscript extensions/ORES/maintenance/CheckModelVersions.php --wiki=enwiki (T179596 )
19:21 kaldari@tin: Synchronized wmf-config/InitialiseSettings.php: Updating InitialiseSettings for draftquality data (duration: 00m 48s)
18:15 XioNoX: deactivating BGP session to telia in ulsfo
18:09 gehel@tin: Finished deploy [wdqs/wdqs@fa8bb12]: WDQS GUI and Blazegraph update (duration: 01m 53s)
18:07 gehel@tin: Started deploy [wdqs/wdqs@fa8bb12]: WDQS GUI and Blazegraph update
17:50 awight@tin: Finished deploy [ores/deploy@29905e5]: restart services on ores* (non-production) (duration: 17m 18s)
17:48 urandom: T179083 : Restarting Cassandra instances on restbase2001.codfw.wmnet
17:32 awight@tin: Started deploy [ores/deploy@29905e5]: restart services on ores* (non-production)
16:51 marostegui: Optimize pagelinks and templatelinks on s2 master - db1054 - T174509
16:26 bblack: Switching unified TLS certificate in the US
16:25 awight@tin: Finished deploy [ores/deploy@82a13ae]: Redeploy ORES master to scb* (duration: 16m 10s)
16:09 awight@tin: Started deploy [ores/deploy@82a13ae]: Redeploy ORES master to scb*
15:50 akosiaris@tin: Started deploy [ores/deploy@29905e5]: testing deploy
15:49 akosiaris@tin: Started deploy [ores/deploy@29905e5]: testing deploy
15:43 awight@tin: Finished deploy [ores/deploy@29905e5]: restart services on ores* (non-production) (duration: 12m 45s)
15:31 awight@tin: Started deploy [ores/deploy@29905e5]: restart services on ores* (non-production)
15:26 gehel: reboot of maps-test cluster for jvm and kernel upgrades
15:13 ppchelko@tin: Finished deploy [cpjobqueue/deploy@96f55c6]: USe correct regex for job selection (duration: 00m 29s)
15:12 ppchelko@tin: Started deploy [cpjobqueue/deploy@96f55c6]: USe correct regex for job selection
15:03 addshore@tin: Synchronized wmf-config/CommonSettings-labs.php: SWAT Set wgWikiDiff2MovedParagraphDetectionCutoff for group0 PT 3/3 (duration: 00m 46s)
15:01 addshore@tin: Synchronized wmf-config/CommonSettings.php: SWAT Set wgWikiDiff2MovedParagraphDetectionCutoff for group0 PT 2/3 (duration: 00m 46s)
15:00 addshore@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT Set wgWikiDiff2MovedParagraphDetectionCutoff for group0 PT 1/3 (duration: 00m 47s)
14:57 gehel: reboot of relforge for kernel + jvm upgrade
14:50 mobrovac@tin: Synchronized wmf-config/jobqueue.php: Switch MessageIndexRebuildJob, flaggedrevs_CacheUpdate and deleteLinks jobs to the EventBus infrastructure - T175210 (duration: 00m 46s)
14:50 ppchelko@tin: Finished deploy [cpjobqueue/deploy@e93feba]: Start processing all 'hearted' jobs T175210 (duration: 00m 44s)
14:49 ppchelko@tin: Started deploy [cpjobqueue/deploy@e93feba]: Start processing all 'hearted' jobs T175210
14:48 ppchelko@tin: Started deploy [cpjobqueue/deploy@e93feba]: (no justification provided)
14:45 awight@tin: Finished deploy [ores/deploy@29905e5]: Force deployment on ores* (non-production) (duration: 05m 43s)
14:45 gehel@puppetmaster1001: conftool action : set/pooled=no; selector: name=wdqs2003.codfw.wmnet
14:45 gehel@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wdqs2002.codfw.wmnet
14:40 gehel@puppetmaster1001: conftool action : set/pooled=no; selector: name=wdqs2002.codfw.wmnet
14:40 gehel@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wdqs2001.codfw.wmnet
14:39 awight@tin: Started deploy [ores/deploy@29905e5]: Force deployment on ores* (non-production)
14:35 awight@tin: Finished deploy [ores/deploy@29905e5]: restart services on ores* (non-production) (duration: 04m 48s)
14:34 gehel@puppetmaster1001: conftool action : set/pooled=no; selector: name=wdqs2001.codfw.wmnet
14:33 gehel@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wdqs1005.eqiad.wmnet
14:30 awight@tin: Started deploy [ores/deploy@29905e5]: restart services on ores* (non-production)
14:27 gehel@puppetmaster1001: conftool action : set/pooled=no; selector: name=wdqs1005.eqiad.wmnet
14:27 gehel@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wdqs1004.eqiad.wmnet
14:21 gehel@puppetmaster1001: conftool action : set/pooled=no; selector: name=wdqs1004.eqiad.wmnet
14:21 gehel@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wdqs1003.eqiad.wmnet
14:20 awight@tin: Finished deploy [ores/deploy@29905e5]: celery 4 -> ores* (non-production) (duration: 19m 22s)
14:18 mobrovac@tin: Synchronized wmf-config/InitialiseSettings.php: EventBus: Enable logging - T150106 (duration: 00m 47s)
14:14 gehel@puppetmaster1001: conftool action : set/pooled=no; selector: name=wdqs1003.eqiad.wmnet
14:14 gehel@puppetmaster1001: conftool action : set/pooled=inactive; selector: name=wdqs1003.eqiad.wmnet
14:13 mobrovac@tin: Synchronized php-1.31.0-wmf.6/extensions/EventBus/EventBus.php: EventBus - Improve logging for T150106 (duration: 00m 48s)
14:01 gehel: full restart of wdqs eqiad / codfw for multiple upgrades
14:01 awight@tin: Started deploy [ores/deploy@29905e5]: celery 4 -> ores* (non-production)
13:03 elukey: rolling restart of druid historical daemons on druid100[1-6] to apply https://gerrit.wikimedia.org/r/#/c/389429
12:58 moritzm: installing openssl security updates
12:55 moritzm: restarting apache on netmon to pick up openssl update
12:43 mobrovac@tin: Started restart [electron-render/deploy@8dd5f13]: Electron stuck, restrting - T174916
12:29 moritzm: installing imagemagick security updates
11:45 oblivian@tin: Started deploy [jobrunner/jobrunner@a20d043]: (no justification provided)
11:44 marostegui: Deploy alter table on s6 codfw master (with replication, so this will generate lag on codfw) - T174569
11:19 marostegui: Compress InnoDB on db1103.s2 - T178359
11:16 marostegui@tin: Synchronized wmf-config/db-codfw.php: Pool db2087 as recentchanges multi-instance host on s6 and s7 T178359 (duration: 00m 47s)
11:15 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Pool db2087 as recentchanges multi-instance host on s6 and s7 T178359 (duration: 00m 48s)
10:58 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Restore db1101 original weight after maintenance - T178359 (duration: 00m 46s)
10:06 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Give db1101 more weight after maintenance - T178359 (duration: 00m 46s)
10:05 moritzm: uploading linux 4.9.51-1~bpo8+1 for jessie-wikimedia to apt.wikimedia.org
09:57 godog: roll-upgrade swift to 2.10.2 in codfw and roll-restart for kernel upgrade - T177739
09:56 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Pool db2088 as recentchanges multi-instance host on s1 and s2 T178359 (duration: 00m 46s)
09:55 marostegui@tin: Synchronized wmf-config/db-codfw.php: Pool db2088 as recentchanges multi-instance host on s1 and s2 T178359 (duration: 00m 46s)
09:37 _joe_: manually running htmlCacheUpdate for commonswiki and ruwiki on terbium, T173710
09:26 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Pool db2085 as recentchanges multi-instance host on s3 and s5 T178553 T178359 (duration: 00m 46s)
09:25 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Add db2085 as multi-instance core host on eqiad file T178553 T178359 (duration: 00m 46s)
09:18 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Give db1101 more weight after maintenance - T178359 (duration: 00m 46s)
09:07 moritzm: removed monitoring/RAID packages from stretch-wikimedia/thirdparty, now in thirdparty/hwraid
08:48 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1101 with low weight after maintenance - T178359 (duration: 00m 47s)
06:46 marostegui@tin: Synchronized wmf-config/db-codfw.php: Add db2084 as multi-instance core host on eqiad file T178553 T178359 (duration: 00m 46s)
06:38 marostegui: Stop MySQL on db1101 to copy its content to db1103.s4 - T178359
06:31 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1101 - T178359 (duration: 00m 48s)
06:19 marostegui: Optimize pagelinks and templatelinks on s6 master (db1061) - T174509
06:16 marostegui: Deploy alter table on s4 codfw master (with no replication) - T174569
02:44 l10nupdate@tin: ResourceLoader cache refresh completed at Mon Nov 6 02:44:41 UTC 2017 (duration 6m 55s)
02:37 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.6) (duration: 07m 17s)
2017-11-04
13:34 elukey@puppetmaster1001: conftool action : set/pooled=yes; selector: name=scb1002.eqiad.wmnet
13:19 elukey@puppetmaster1001: conftool action : set/pooled=no; selector: name=scb1002.eqiad.wmnet
12:36 mobrovac: trending-edits depool it from scb1002 to investigate
00:18 Krinkle: Finished whisper-mass-resize for frontend.navtiming on graphite2001 (T179622 )
2017-11-03
20:28 marostegui: Deploy alter table db2058 - T174569
18:52 Krinkle: Starting whisper-mass-resize for frontend.navtiming on graphite2001 (T179622 )
18:51 Krinkle: whisper-mass-resize completed for graphite1001.eqiad.wmnet:/var/lib/carbon/whisper/frontend/navtiming (at 01:34 UTC actually)
17:56 demon@tin: Synchronized php-1.31.0-wmf.6/includes/libs/rdbms/database/DatabaseMysqlBase.php: logging (duration: 00m 47s)
16:17 marostegui: Deploy alter table on db2044 - T174569
16:03 demon@tin: Synchronized php-1.31.0-wmf.6/extensions/ContentTranslation/api/ApiQueryTranslatorStats.php: fix warnings about bad min() calls (duration: 00m 47s)
15:50 cmjohnson1: powering off labvirt1015 to swap CPU (again)
15:49 akosiaris: T177393 enable RBAC for kubernetes in production and staging
15:35 volans: uploaded cumin_1.3.0-1_amd64.deb to apt.wikimedia.org jessie-wikimedia
14:19 moritzm: installing java sec updates / cassandra restarts on xenon/praseodymium (cerium already done earlier)
14:16 hashar: restarting Jenkins on contint1001
14:12 hashar: Upgrading packages on contint1001 / contint2001
13:50 moritzm: installing heimdal security updates on trusty
13:35 ppchelko@tin: Started restart [electron-render/deploy@8dd5f13]: Restarting, service is stuck
13:33 Pchelolo: restart electron render service
12:41 moritzm: uploaded openjdk 8u151-b12 to apt.wikimedia.org/jessie-wikimedia
11:45 moritzm: restarting jenkins on releases* to pick up openjdk security update
11:41 mutante: gerrit - restart Apache - apply gerrit:354078 - only listen on service IP
11:12 moritzm: uploaded openssl 1.0.2m-1 for to apt.wikimedia.org
10:46 marostegui: Compress InnoDB on db1103.s4 - T178359
10:44 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Add comment to db1103 status (duration: 00m 47s)
10:42 mutante: cobalt: (gerrit) temp disable puppet - gerrit2001: restart apache, apply gerrit:354078 stop listening on all interfaces
10:39 oblivian@tin: Synchronized wmf-config/CommonSettings.php: Increase concurrency of htmlCacheUpdate jobs T173710 (duration: 00m 48s)
09:09 mobrovac@tin: Finished deploy [restbase/deploy@a67b4a7]: Use only the new storage for summaries - T179418 (duration: 10m 40s)
08:58 mobrovac@tin: Started deploy [restbase/deploy@a67b4a7]: Use only the new storage for summaries - T179418
08:46 ema: pool cp4021
08:16 elukey: drop CommandInvocation_15243810 and CommandInvocation_15243810_15423246 from analytics dbs (db1046/db1047/db1108/dbstore1002) - data archived on HDFS - T166712
08:12 moritzm: installing openjdk-8 security updates on stretch
07:57 marostegui: Deploy alter table on s4.db2037 - T174569
07:29 elukey: depooled mw1191 and powercycle - host down with 'CPU 1 check errors' in racadm getsel
06:39 marostegui: Stop MySQL on db2089.s5 to copy it to db2085 - T178359
06:25 marostegui: Stop MySQL on db1103 to copy it to dbstore1001 and reimage it - T178359
2017-11-02
23:36 krinkle@tin: Synchronized php-1.31.0-wmf.6/extensions/NavigationTiming/modules/ext.navigationTiming.js: Fix zero values - T178479 (duration: 00m 47s)
23:10 herron: disabled puppet master debug logging on puppetmaster2001 via /usr/share/puppet/rack/puppet-master/config.ru
22:48 volans: cleaned daemon.log from puppet spam logs on puppetmaster2001 (15GB) - cc herron
22:39 halfak@tin: Started deploy [ores/deploy@82a13ae]: T179621
22:02 Krinkle: Mass-resizing Graphite/Whisper files on graphite1001 and graphite2001 for T179622 (frontend.* namespace)
21:53 ayounsi@tin: Finished deploy [librenms/librenms@ad48b4d]: (no justification provided) (duration: 00m 05s)
21:52 ayounsi@tin: Started deploy [librenms/librenms@ad48b4d]: (no justification provided)
21:50 XioNoX: upgrading librenms
21:38 hoo: Updated the Wikidata property suggester with data from Monday's JSON dump and applied the T132839 workarounds
20:56 madhuvishy: Powercycling labvirt1015
20:18 chasemp: simulate load for labvirt1015 'sudo cumin "name:labvirt1015stresstest*" "stress-ng --cpu 1 --io 2 --vm 1 --vm-bytes 1G &"'
19:45 mepps: updated payments-wiki from b88b805 to a539d27
19:28 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group2 to wmf.6
18:20 gehel: restarting blazegraph on wdqs1003
18:13 gehel: restarting stuck updater on wdqs1003
17:55 demon@tin: Synchronized php-1.31.0-wmf.6/includes/Message.php: logging fixes (duration: 00m 48s)
17:53 demon@tin: Synchronized php-1.31.0-wmf.6/includes/libs/rdbms: logging improvements (duration: 00m 54s)
17:47 moritzm: uploaded openssl 1.1.0g-1+wmf1 to jessie-wikimedia
17:42 awight@tin: Finished deploy [ores/deploy@0e54a3c]: revscoring 2, T175180 (duration: 33m 50s)
17:08 awight@tin: Started deploy [ores/deploy@0e54a3c]: revscoring 2, T175180
16:14 ema: restart varnish-be on cp4026 due to mbox lag
16:04 mobrovac@tin: Synchronized wmf-config/jobqueue.php: Use only EventBus for processing updateBetaFeatureUserCount - T175210 (duration: 00m 51s)
13:46 chasemp: ugrade dnsmasq-base on labcontrol*
13:36 zeljkof: EU SWAT finished
13:35 zfilipin@tin: Synchronized php-1.31.0-wmf.6/extensions/ContentTranslation/modules/publish/ext.cx.publish.js: SWAT: CX1: Check for template adaptation failures before publishing (T154116) (duration: 00m 50s)
13:11 zfilipin@tin: Synchronized wmf-config/ProductionServices.php: SWAT: use the logstash LVS endpoint (T175242) (duration: 00m 51s)
12:09 moritzm: removed obsolete packages from jessie-wikimedia/experimental
12:06 mobrovac@tin: Finished deploy [restbase/deploy@9314cf6]: Parsoid: Switch all but WPs to use the next-generation storage - T179417 (duration: 08m 43s)
11:57 mobrovac@tin: Started deploy [restbase/deploy@9314cf6]: Parsoid: Switch all but WPs to use the next-generation storage - T179417
11:43 elukey: drop table log.PageContentSaveComplete_5588433 from db1046,db1047,db1108,dbstore1002, archived on hdfs - T177101
11:31 moritzm: remove obsolete packages from stretch-wikimedia/experimental, now empty
11:29 mobrovac@tin: Finished deploy [restbase/deploy@f6c4e2d]: Parsoid module: use the Cassandra 2 tables as fallback when needed - T179417 (duration: 08m 15s)
11:26 elukey: drop log.MediaViewer_10867062_15423246 from db1047,db1108 since already archived in hdfs - T168303
11:21 mobrovac@tin: Started deploy [restbase/deploy@f6c4e2d]: Parsoid module: use the Cassandra 2 tables as fallback when needed - T179417
10:54 gehel@puppetmaster1001: conftool action : set/pooled=inactive; selector: name=logstash1003.eqiad.wmnet
10:53 gehel@puppetmaster1001: conftool action : set/pooled=inactive; selector: name=logstash1002.eqiad.wmnet
10:53 gehel@puppetmaster1001: conftool action : set/pooled=inactive; selector: name=logstash1001.eqiad.wmnet
10:53 gehel: depooling logstash100[123] in preparation for decommission - T175830
10:38 gehel: rolling restart of wdqs nodes for GC tuning - T175919
10:13 marostegui: Deploy alter table on db2019 - T174569
09:41 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Restore db1091 original weight - T161088 (duration: 00m 50s)
09:06 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1091 weight after maintenance - T161088 (duration: 00m 50s)
08:46 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1091 weight after maintenance - T161088 (duration: 00m 50s)
08:28 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1091 with low weight after maintenance - T161088 (duration: 00m 50s)
08:15 moritzm: restarting app server canaries to pick up curl security update
08:04 moritzm: installing curl security updates
07:59 marostegui: Stop mysql event scheduler on labsdb1001 - T179464
06:54 marostegui: Stop s3 instance on db2092 to copy it to db2085 - T178359
06:33 legoktm: done on mwdebug1002
06:31 marostegui: Stop MySQL on db1091 and db1103 to migrate db1091 to file per table
06:30 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1091 - T161088 (duration: 01m 07s)
06:27 marostegui: Deploy optimize table wikidatawiki.pagelinks on s5 master (db1063) - T174509
06:02 legoktm: live-hacking mwdebug1002
05:43 TimStarling: on puppetmaster2001: disk full due to logs, so I'm truncating /var/log/debug which is apparently copied to daemon.log anyway
03:29 l10nupdate@tin: ResourceLoader cache refresh completed at Thu Nov 2 03:29:35 UTC 2017 (duration 7m 12s)
03:22 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.6) (duration: 15m 37s)
02:43 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.5) (duration: 16m 06s)
00:52 ebernhardson@tin: Synchronized php-1.31.0-wmf.5/extensions/WikimediaEvents/extension.json: SWAT followup: Update SearchSatisfaction eventlogging schema revision id (duration: 00m 50s)
00:51 ebernhardson@tin: Synchronized php-1.31.0-wmf.6/extensions/WikimediaEvents/extension.json: (no justification provided) (duration: 00m 50s)
00:32 bblack: cp4022 - varnish backend restart, mailbox
00:28 bblack: lvs1001-3: re-enable ethernet flowcontrol (short ethernet blip), with pybal stopped to failover to backups during...
2017-11-01
23:37 thcipriani@tin: Synchronized php-1.31.0-wmf.5/extensions/WikimediaEvents: SWAT: Turn on Cirrus AB test for DBN group sizing (duration: 00m 50s)
23:35 thcipriani@tin: Synchronized php-1.31.0-wmf.6/extensions/WikimediaEvents: SWAT: Turn on Cirrus AB test for DBN group sizing (duration: 00m 51s)
23:29 thcipriani@tin: Synchronized php-1.31.0-wmf.6/extensions/ParserMigration/includes/ApiParserMigration.php: SWAT: API: Fix WikiPage namespace (duration: 00m 52s)
23:25 thcipriani@tin: Synchronized php-1.31.0-wmf.6/extensions/Wikibase/repo/Wikibase.hooks.php: SWAT: Allow turning Cirrus usage off from query T179428 (duration: 00m 49s)
23:23 thcipriani@tin: Synchronized php-1.31.0-wmf.6/extensions/Wikidata/extensions/Wikibase/repo/Wikibase.hooks.php: SWAT: Allow turning Cirrus usage off from query T179428 (duration: 00m 51s)
23:09 thcipriani@tin: Synchronized wmf-config/Wikibase-production.php: SWAT: Revert "Revert "Add negative weight to disambig entities"" T148411 (duration: 00m 51s)
22:49 ejegg: disabled fundraising unsubscribe import processing
22:33 Krinkle: group2(all-wikidata) wikis to wmf.5 from 24 hours ago seems to have caused a 60% drop in navigation timing metric report count (100/min => 40/min)
22:19 eileen1: process-control updated to 9c8b4bb
22:17 demon@tin: Synchronized wmf-config/CommonSettings.php: rel1_28 is dead (duration: 00m 50s)
22:06 ejegg: re-enabled fundraising jobs
21:48 ejegg: fundraising jobs disabled
20:10 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group1 to wmf.6
20:05 demon@tin: Synchronized php: symlink swap (duration: 00m 49s)
19:57 krinkle@tin: Synchronized php-1.31.0-wmf.6/resources/src/startup.js: T178943 (duration: 00m 51s)
19:53 demon@tin: Synchronized php-1.31.0-wmf.5/extensions/CentralNotice: fix weird git rebasing issue (duration: 00m 53s)
19:41 demon@tin: Pruned MediaWiki: 1.31.0-wmf.4 [keeping static files] (duration: 01m 52s)
19:31 awight@tin: Synchronized php-1.31.0-wmf.5/extensions/ORES: Fix for API=oresscores, T179430 (duration: 00m 50s)
19:28 awight@tin: Synchronized php-1.31.0-wmf.6/extensions/ORES: Fix for API=oresscores, T179430 (duration: 00m 52s)
19:14 bblack: all hosts: manual cumin+sed removal of ethernet autoneg params from /e/n/i to match https://gerrit.wikimedia.org/r/#/c/387849/
19:08 otto@tin: Finished deploy [analytics/refinery@6d11d67]: Deploying refinery-source artifacts for 0.0.54 for JsonRefine job, T162610 (duration: 03m 45s)
19:05 otto@tin: Started deploy [analytics/refinery@6d11d67]: Deploying refinery-source artifacts for 0.0.54 for JsonRefine job, T162610
18:39 demon@tin: Synchronized wmf-config/: Dropping squid.php (hang on to your pants folks, this could be fun) (duration: 00m 51s)
18:36 demon@tin: Synchronized wmf-config/CommonSettings.php: use reverse-proxy.php no more squid.php (duration: 00m 50s)
18:35 demon@tin: Synchronized docroot/: dropping squid.php (duration: 00m 52s)
18:13 demon@tin: Synchronized static/images/project-logos/sewikimedia.png: new logo (duration: 00m 50s)
18:02 moritzm: installing openjpeg2 security updates
17:39 demon@tin: Synchronized php-1.31.0-wmf.6/includes/specials/pagers/ContribsPager.php: fix missing page_is_new error (duration: 00m 51s)
17:10 demon@tin: Synchronized wmf-config/: dropped squid-labs, no-op in prod (duration: 00m 52s)
17:09 demon@tin: Synchronized docroot/noc/: dropped squid-labs.php (duration: 00m 51s)
16:01 bblack: lvs1003 - puppet disabled, testing experimental ethtool ringbuffer change
15:15 moritzm: repo reorg: moved ftpsync from thirdparty to main and docker-engine from thirdparty to thirdparty/k8s
15:12 awight@tin: Finished deploy [ores/deploy@9f361d2]: revscoring 2 -> ores1002 (non-production) (duration: 02m 25s)
15:10 awight@tin: Started deploy [ores/deploy@9f361d2]: revscoring 2 -> ores1002 (non-production)
15:09 awight@tin: Finished deploy [ores/deploy@9f361d2]: revscoring 2 -> ores* (non-production) (duration: 01m 57s)
15:07 awight@tin: Started deploy [ores/deploy@9f361d2]: revscoring 2 -> ores* (non-production)
15:07 ebernhardson@tin: Synchronized wmf-config/InitialiseSettings.php: Update CirrusSearch MLR model on enwiki (duration: 00m 51s)
15:01 mobrovac: restbase: removing wikimedia-lvs-realserver from staging hosts T179494
14:55 bblack: strongswan experiment done, cp* back to puppet-agent-enabled
14:41 moritzm: installing poppler security updates
14:09 bblack: cp*: disabling puppet to test strongswan change...
13:49 hashar@tin: Synchronized wmf-config/InitialiseSettings.php: Enable NewUserMessage on fawikiquote - T179442 (duration: 00m 50s)
13:46 hashar@tin: Synchronized wmf-config/abusefilter.php: Enable blocking feature of abuse filter in fawikiquote - T178227 (duration: 00m 50s)
13:45 moritzm: installing quagga security updates
13:43 hashar@tin: Synchronized wmf-config/CommonSettings.php: Properly check for cluster existence prior setting TTM mirrors - T179270 (duration: 01m 05s)
13:41 ppchelko@tin: Finished deploy [restbase/deploy@2321c4c]: Update hyperswitch dependency. Take 2 (duration: 05m 50s)
13:35 ppchelko@tin: Started deploy [restbase/deploy@2321c4c]: Update hyperswitch dependency. Take 2
13:33 ppchelko@tin: Finished deploy [restbase/deploy@2321c4c]: Update hyperswitch dependency (duration: 03m 50s)
13:29 ppchelko@tin: Started deploy [restbase/deploy@2321c4c]: Update hyperswitch dependency
12:48 moritzm: installing libav security updates
12:44 moritzm: installinng libdatetime-timezone-perl stable updates on Debian
12:10 kartik@tin: Finished deploy [cxserver/deploy@10651e2]: Update cxserver to 0227acb (duration: 03m 07s)
12:07 kartik@tin: Started deploy [cxserver/deploy@10651e2]: Update cxserver to 0227acb
08:35 elukey: forced umount/mount for /mnt/hdfs on stat1005 (not working after repeated oom kill actions)
03:22 l10nupdate@tin: ResourceLoader cache refresh completed at Wed Nov 1 03:22:02 UTC 2017 (duration 7m 17s)
03:14 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.6) (duration: 15m 21s)
02:36 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.5) (duration: 09m 46s)
2017-10-31
23:56 ebernhardson@tin: Synchronized wmf-config/InitialiseSettings.php: Adjust cirrussearch interleave configuration for AB test (duration: 00m 50s)
23:48 demon@tin: Synchronized scap/plugins/prep.py: no-op (duration: 00m 50s)
23:21 ebernhardson@tin: Synchronized wmf-config/CirrusSearch-common.php: Setup CirrusSearch AB test on dbn group sizing (duration: 00m 50s)
23:19 ebernhardson@tin: Synchronized wmf-config/InitialiseSettings.php: Setup CirrusSearch AB test on dbn group sizing (duration: 00m 53s)
21:38 awight@tin: Started deploy [ores/deploy@9f361d2]: revscoring 2 -> ores* (non-production)
21:37 awight@tin: Started deploy [ores/deploy@9f361d2]: revscoring 2 -> ores* (non-production)
21:16 maxsem@tin: Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/386553 https://gerrit.wikimedia.org/r/386710 (duration: 00m 52s)
21:00 XioNoX: removing old AMS-IX IPv6 - T167840
20:57 mepps: rollback payments-wiki from 5303a8e to b88b805
20:54 mepps: updated payments-wiki from b88b805 to 5303a8e
20:45 mepps: rolledback payments-wiki from 5303a8e to b88b805
20:43 mepps: updated payments-wiki from b88b805 to 5303a8e
19:18 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group0 to wmf.6
18:53 awight@tin: Finished deploy [ores/deploy@437fcf5]: revscoring 2 -> ores* (non-production) (duration: 10m 29s)
18:42 awight@tin: Started deploy [ores/deploy@437fcf5]: revscoring 2 -> ores* (non-production)
18:38 awight@tin: Finished deploy [ores/deploy@437fcf5]: revscoring 2 -> ores1002 (non-production) (duration: 02m 24s)
18:36 awight@tin: Started deploy [ores/deploy@437fcf5]: revscoring 2 -> ores1002 (non-production)
18:32 demon@tin: Synchronized php-1.31.0-wmf.6/extensions/CentralNotice: deployyyyy (duration: 00m 52s)
18:27 ejegg: updated CiviCRM from a34a9c6 to 415fde1
17:23 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: wikidata back to wmf.5
16:50 demon@tin: Finished scap: bootstrap wmf.6 (duration: 47m 39s)
16:41 ppchelko@tin: Finished deploy [restbase/deploy@7631ea7]: Enable new parsoid storage on test wikipedias (duration: 09m 07s)
16:32 ppchelko@tin: Started deploy [restbase/deploy@7631ea7]: Enable new parsoid storage on test wikipedias
16:09 ariel@tin: Finished deploy [dumps/dumps@fa0583b]: adapt dumpadmin script to use override section in config (duration: 00m 02s)
16:09 ariel@tin: Started deploy [dumps/dumps@fa0583b]: adapt dumpadmin script to use override section in config
16:02 demon@tin: Started scap: bootstrap wmf.6
15:43 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: all but wikidata.org -> wmf.5
15:42 demon@tin: Synchronized php: Symlink bump (duration: 00m 48s)
15:41 demon@tin: Pruned MediaWiki: 1.31.0-wmf.1 (duration: 03m 15s)
14:18 bblack: authdns (baham, radon, eeden): upgrade base packages (curl, git, libc-dev, etc)
14:13 bblack: lvsNNNN: upgrade base packages (curl, git, libc-dev, etc)
14:11 bblack: cpNNNN: upgrade base packages (curl, git, libc-dev, etc)
13:57 bblack: caches@eqiad - upgrade nginx to 1.13.6-2+wmf1~jessie1
13:54 bblack: caches@esams - upgrade nginx to 1.13.6-2+wmf1~jessie1
13:22 bblack: caches@codfw - upgrade nginx to 1.13.6-2+wmf1~jessie1
13:18 zeljkof: EU SWAT finished
13:17 zfilipin@tin: Synchronized php-1.31.0-wmf.4/extensions/EventBus/EventBus.php: SWAT: Dont log full request to service in case of delivery error (T179280) (duration: 00m 51s)
13:14 bblack: caches@ulsfo - upgrade nginx to 1.13.6-2+wmf1~jessie1
13:05 mobrovac@tin: Finished deploy [restbase/deploy@c0f9dc4]: Cassandra 3 Parsoid stash tables (duration: 02m 58s)
13:02 mobrovac@tin: Started deploy [restbase/deploy@c0f9dc4]: Cassandra 3 Parsoid stash tables
13:01 mobrovac@tin: Finished deploy [restbase/deploy@c0f9dc4]: Cassandra 3 Parsoid stash tables (duration: 15m 19s)
12:45 mobrovac@tin: Started deploy [restbase/deploy@c0f9dc4]: Cassandra 3 Parsoid stash tables
12:39 bblack: eqiad primary lvses (1001-3): disable pause [LRO not possible] on all ethernet interfaces (under pybal stopped briefly)
12:36 bblack: codfw primary lvses (2003-6): disable LRO,pause on all ethernet interfaces (under pybal stopped briefly)
12:24 bblack: cp4025: restart varnish-be for mailbox lag
12:23 bblack: cp4023: restart varnish-be for mailbox lag
12:21 bblack: esams primary lvses (3001-2): disable LRO,pause on eth0 (under pybal stopped briefly)
12:19 bblack: ulsfo primary lvses (4001-2): disable LRO,pause on eth0 (under pybal stopped briefly)
12:12 bblack: esams+ulsfo backup lvses (3003-4,4003-4): disable LRO,pause on eth0
12:10 bblack: core DC backup lvses (1004-6,2004-6): disable LRO,pause on all ethernet interfaces
11:05 oblivian@tin: Finished deploy [docker-pkg/deploy@20908e8]: Actually deploy the fix (duration: 00m 04s)
11:05 oblivian@tin: Started deploy [docker-pkg/deploy@20908e8]: Actually deploy the fix
11:02 oblivian@tin: Finished deploy [docker-pkg/deploy@576c80f]: Fix image_tag filter (duration: 00m 02s)
11:02 oblivian@tin: Started deploy [docker-pkg/deploy@576c80f]: Fix image_tag filter
10:26 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Restore db1084 original weight - T161088 (duration: 00m 48s)
10:05 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1084 weight - T161088 (duration: 00m 50s)
09:12 marostegui@tin: Synchronized wmf-config/db-codfw.php: Pool db2084 as multi-instance core host T178553 T178359 (duration: 00m 49s)
08:46 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1084 weight - T161088 (duration: 00m 50s)
08:22 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1084 with low weight - T161088 (duration: 00m 50s)
08:15 marostegui: Stop MySQL on db2086 to copy s5 to db2089 - T178359
07:09 marostegui: Stop MySQL on db2087.s6 to copy its content to db2089 - T178359
06:37 marostegui: Stop MySQL on db1103 and db1084 - T161088
06:28 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1084 - T161088 (duration: 00m 49s)
06:17 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2039 (duration: 01m 12s)
03:08 l10nupdate@tin: ResourceLoader cache refresh completed at Tue Oct 31 03:08:48 UTC 2017 (duration 7m 18s)
03:01 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.5) (duration: 10m 05s)
02:34 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.4) (duration: 10m 33s)
2017-10-30
23:28 ebernhardson@tin: Synchronized wmf-config/InitialiseSettings.php: Update cirrussearch MLR model for enwiki (duration: 00m 50s)
23:22 ebernhardson@tin: Synchronized php-1.31.0-wmf.4/extensions/CirrusSearch/: Fix highlight tags inside section anchors of search results (duration: 01m 01s)
23:19 ebernhardson@tin: Synchronized php-1.31.0-wmf.5/extensions/CirrusSearch/: Fix highlight tags inside section anchors of search results (duration: 01m 03s)
22:03 eileen1: update process-control to revision is 2392ddf
21:55 awight@tin: Finished deploy [ores/deploy@a0f7d5c]: Fun with deployments (non-production) T179336 (duration: 02m 03s)
21:53 awight@tin: Started deploy [ores/deploy@a0f7d5c]: Fun with deployments (non-production) T179336
21:48 awight@tin: Finished deploy [ores/deploy@a0f7d5c]: Smoke test for timeout bug on ores1002 (non-production) (duration: 01m 42s)
21:47 chasemp: restart nova-fullstack to force a run
21:46 awight@tin: Started deploy [ores/deploy@a0f7d5c]: Smoke test for timeout bug on ores1002 (non-production)
21:36 eileen1: update CiviCRM from 25c7a33 to a34a9c6
21:02 urandom: T179083 : Creating commons_T_parsoid_stash_wikitextf0PBY8UXqY8UuiDv.{meta,data} (30 sec delay)
20:56 urandom: T179083 : Creating commons_T_parsoid_stash_htmlmXxc_uDhgnFQAdM8PPlH.{data,meta} (25 sec delay)
20:46 urandom: T179083 : Creating enwiki_T_parsoid_stash_section2ACMDK1DRzacK9nUB3.{data,meta} (15sec delay)
20:36 urandom: T179083 : Creating enwiki_T_parsoid_stash_dataWH8IDUS9SGI6LPpsJsLOQ.data
20:35 arlolra: Updated Parsoid to 292633c4 (T133334 )
20:34 urandom: T179083 : Creating enwiki_T_parsoid_stash_dataWH8IDUS9SGI6LPpsJsLOQ.meta
20:33 urandom: T179083 : Creating enwiki_T_parsoid_stash_dataWH8IDUS9SGI6LPpsJsLOQ
20:28 urandom: T179083 : Creating table enwiki_T_parsoid_stash_wikitextf0PBY8UXqY8UuiDv1.data
20:27 urandom: T179083 : Creating table enwiki_T_parsoid_stash_wikitextf0PBY8UXqY8UuiDv1.meta
20:26 arlolra@tin: Finished deploy [parsoid/deploy@8a2c0ce]: Updating Parsoid to 292633c4 (duration: 07m 33s)
20:25 urandom: T179083 : Creating keyspace enwiki_T_parsoid_stash_wikitextf0PBY8UXqY8UuiDv1
20:20 urandom: T179083 : Creating table enwiki_T_parsoid_stash_htmlmXxc_uDhgnFQAdM8PPlH5.data
20:18 arlolra@tin: Started deploy [parsoid/deploy@8a2c0ce]: Updating Parsoid to 292633c4
20:14 urandom: T179083 : Creating table enwiki_T_parsoid_stash_htmlmXxc_uDhgnFQAdM8PPlH5.meta
20:09 awight@tin: Finished deploy [ores/deploy@a0f7d5c]: Smoke test for timeout bug on ores1002 (non-production) (duration: 14m 11s)
20:08 urandom: T179083 : Creating keyspace enwiki_T_parsoid_stash_htmlmXxc_uDhgnFQAdM8PPlH5
19:55 thcipriani@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Create Appendix NS on Burmese Wiktionary T178545 (duration: 00m 55s)
19:55 awight@tin: Started deploy [ores/deploy@a0f7d5c]: Smoke test for timeout bug on ores1002 (non-production)
19:07 awight@tin: Finished deploy [ores/deploy@a0f7d5c]: Smoke test for timeout bug on ores1002 (non-production) (duration: 06m 16s)
19:01 awight@tin: Started deploy [ores/deploy@a0f7d5c]: Smoke test for timeout bug on ores1002 (non-production)
19:01 awight@tin: Finished deploy [ores/deploy@a0f7d5c]: Smoke test for timeout bug on ores1002 (non-production) (duration: 19m 53s)
18:41 awight@tin: Started deploy [ores/deploy@a0f7d5c]: Smoke test for timeout bug on ores1002 (non-production)
18:40 awight@tin: Started deploy [ores/deploy@a0f7d5c]: Smoke test for timeout bug on ores1002 (non-production)
18:22 awight@tin: Started deploy [ores/deploy@a0f7d5c]: Smoke test for timeout bug on ores1002 (non-production)
18:16 thcipriani@tin: Synchronized portals: SWAT: Bumping portals to master T128546 (duration: 00m 50s)
18:15 thcipriani@tin: Synchronized portals/prod/wikipedia.org/assets: SWAT: Bumping portals to master T128546 (duration: 00m 50s)
18:08 thcipriani@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Add Timeless skin to wikitech T154371 (duration: 00m 51s)
17:08 madhuvishy: Revert dns switchover for c1 shards to c3 post labsdb1001 reboot T168584
17:07 gehel@tin: Finished deploy [wdqs/wdqs@fdb9100]: GUI update (duration: 02m 17s)
17:05 gehel@tin: Started deploy [wdqs/wdqs@fdb9100]: GUI update
16:45 oblivian@tin: Finished deploy [docker-pkg/deploy@576c80f]: New version with small fixes (duration: 00m 04s)
16:44 oblivian@tin: Started deploy [docker-pkg/deploy@576c80f]: New version with small fixes
15:20 XioNoX: re-enabling Zayo transit in eqiad
14:36 marostegui: Reboot labsdb1001 - T168584
14:30 marostegui: Stop replication and MySQL on labsdb1001 - T168584
14:09 ladsgroup@tin: Synchronized wmf-config/InitialiseSettings.php: Revert "UBN! disbale ores for wikidata" (T179107 ) (duration: 00m 50s)
14:07 ladsgroup@tin: Synchronized wmf-config/InitialiseSettings.php: Revert "UBN! disbale ores for wikidata" (T179107 ) (duration: 00m 49s)
14:00 ladsgroup@tin: Synchronized wmf-config/InitialiseSettings.php: Change the af.wiktionary logo, part II (T178824 ) (duration: 00m 50s)
13:59 ladsgroup@tin: Synchronized static/images/project-logos: Change the af.wiktionary logo, part I (T178824 ) (duration: 00m 50s)
13:40 madhuvishy: Switched dns over for c1 shards to c3 in preparation of labsdb1001 reboot (Merged https://gerrit.wikimedia.org/r/#/c/386660/ , Ran puppet on labservice1001|2)
13:21 marostegui: Set innodb_max_dirty_pages_pct = 10 on labsdb1001 so it powers off a bit faster - T168584
13:19 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable the Autopatrolled User Rights & Ext:SandboxLink on hiwikiversity (T179251 T179252) (duration: 00m 50s)
12:58 reedy@tin: Synchronized wmf-config/db-labs.php: beta labs stuffs (duration: 00m 50s)
12:49 oblivian@tin: Finished deploy [docker-pkg/deploy@e9b9146]: Proper deploy of first version (duration: 00m 07s)
12:49 oblivian@tin: Started deploy [docker-pkg/deploy@e9b9146]: Proper deploy of first version
12:02 hoo: Manually started dumpwikidatajson on snapshot1007 (cron run failed tonight)
12:02 oblivian@tin: Finished deploy [docker-pkg/deploy@549c157]: (no justification provided) (duration: 00m 11s)
12:01 oblivian@tin: Started deploy [docker-pkg/deploy@549c157]: (no justification provided)
12:00 oblivian@tin: Finished deploy [docker-pkg/deploy@549c157]: (no justification provided) (duration: 00m 19s)
11:59 oblivian@tin: Started deploy [docker-pkg/deploy@549c157]: (no justification provided)
11:57 oblivian@tin: Finished deploy [docker-pkg/deploy@549c157]: (no justification provided) (duration: 00m 11s)
11:57 oblivian@tin: Started deploy [docker-pkg/deploy@549c157]: (no justification provided)
11:55 oblivian@tin: Started deploy [docker-pkg/deploy@549c157]: Test deploy for virtualenv creation
11:48 hoo: Ran rebuildPropertyInfo for wikidatawiki and manually cleared the property cache (mc)
11:44 hoo@tin: Synchronized wmf-config/Wikibase.php: Re-add property for RDF mapping of external identifiers for Wikidata (T179156 , T178180 ) (duration: 00m 49s)
11:33 hoo@tin: Synchronized wmf-config/Wikibase-production.php: Re-enable constraints check with SPARQL (T179156 ) (duration: 00m 50s)
11:24 marostegui: Compress table cebwiki.externallinks on db2092 - T178359
11:15 hoo@tin: rebuilt wikiversions.php and synchronized wikiversions files: Wikidatawiki back to wmf.5 (T179156 )
10:58 mobrovac@tin: Finished deploy [restbase/deploy@2b5889b]: Double-process all summaries and include the parsoid no-op switch, take #gazillion (duration: 08m 33s)
10:52 akosiaris: poweroff copper T176957
10:50 mobrovac@tin: Started deploy [restbase/deploy@2b5889b]: Double-process all summaries and include the parsoid no-op switch, take #gazillion
10:31 marostegui: Stop MySQL on db2039 to copy its data to db2087.s6 - T178359
10:27 mobrovac@tin: Finished deploy [restbase/deploy@2b5889b]: Double-process all summaries and include the parsoid no-op switch (duration: 04m 41s)
10:22 mobrovac@tin: Started deploy [restbase/deploy@2b5889b]: Double-process all summaries and include the parsoid no-op switch
10:03 mobrovac@tin: Finished deploy [restbase/deploy@2b5889b]: Double-process all summaries and include the parsoid no-op switch (duration: 12m 12s)
09:57 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Restore db1097 original weight - T161088 (duration: 00m 52s)
09:50 mobrovac@tin: Started deploy [restbase/deploy@2b5889b]: Double-process all summaries and include the parsoid no-op switch
09:22 marostegui: Stop replication in sync on db2039 and db2046 to reimport tables
09:08 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1097 api traffic - T161088 (duration: 00m 50s)
09:08 marostegui: Drop wb_entity_per_page page from s3 and s5 - T177601
08:51 ema: cp4022: restart varnish-be for mbox lag
08:42 elukey: raised priority of refreshlink and htmlcacheupdate job execution on jobrunners (https://gerrit.wikimedia.org/r/#/c/386636/ ) - T173710
08:39 marostegui: Stop replication on db2039 to reimport and compress tables
08:38 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1097 with low weight - T161088 (duration: 00m 50s)
08:33 moritzm: installing wget security updates
08:32 gehel: rolling restart of wdqs for config reload - T175919
08:31 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db2039 (duration: 00m 50s)
08:28 ariel@tin: Finished deploy [dumps/dumps@c204c72]: fix args initialization issue in getconfigvals script (duration: 00m 02s)
08:28 ariel@tin: Started deploy [dumps/dumps@c204c72]: fix args initialization issue in getconfigvals script
08:23 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2040 - T178359 (duration: 00m 50s)
08:11 marostegui: Stop MySQL on db2086 to transfer s7 to db2087 - T178359
06:51 marostegui: Stop MySQL on db1097 to clone db1103 - T161088
06:47 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1097 - T161088 (duration: 00m 50s)
06:41 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Move db1103 from s3 to s4 - T161088 (duration: 00m 49s)
06:40 marostegui: Stop MySQL on db1103 to reclone it - T161088
06:24 marostegui: Optimize dewiki.pagelinks and dewiki.templatelinks on db1063 (s5 master) - T174509
06:12 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db2040 - T178359 (duration: 00m 51s)
06:08 marostegui: Stop MySQL on db2040 to populate s7 on db2086 - T178359
05:58 marostegui: Deploy alter table on db1075 (s3 primary master) - T174509
03:11 l10nupdate@tin: ResourceLoader cache refresh completed at Mon Oct 30 03:11:37 UTC 2017 (duration 7m 11s)
03:04 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.5) (duration: 11m 38s)
02:35 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.4) (duration: 11m 18s)
2017-10-29
23:49 ema: powercycle cp4024
22:31 ariel@tin: Finished deploy [dumps/dumps@2aa2275]: fix keep setting to work with overrides (duration: 00m 02s)
22:31 ariel@tin: Started deploy [dumps/dumps@2aa2275]: fix keep setting to work with overrides
17:55 ariel@tin: Finished deploy [dumps/dumps@d8978ce]: add overrides section processing to config file (duration: 00m 04s)
17:55 ariel@tin: Started deploy [dumps/dumps@d8978ce]: add overrides section processing to config file
17:23 ariel@tin: Finished deploy [dumps/dumps@d426cf7]: batch 7z jobs, multistream job fixup (duration: 00m 02s)
17:23 ariel@tin: Started deploy [dumps/dumps@d426cf7]: batch 7z jobs, multistream job fixup
12:54 ema: cp4026: restart varnish-be for mbox lag
2017-10-28
21:03 bblack: cp1067 (current target cache): disabling the relatively-new VCL that sets do_stream=false if !CL on applayer fetches...
19:39 hoo@tin: Synchronized wmf-config/CommonSettings.php: Half the Flow -> Parsoid timeout (100s -> 50s) (T179156 ) (duration: 00m 51s)
19:39 bblack: backend restart on cp1065
18:39 bblack: restarting varnish backend on cp1053 to move the lag/503 issues to another box and buy more time to debug
18:28 bblack: cp4025 - restart backend for mailbox lag (upload@ulsfo, unrelated to text-cluster issues)
18:21 bblack: cp1053 - manual VCL change, backends appservers+api_appservers, reduce connect/firstbyte/betweenbytes timeoues from 5/180/60 to 3/20/10
16:51 elukey: restart varnish backend on cp1055 - mailbox lag + T179156
12:14 elukey@puppetmaster1001: conftool action : set/pooled=yes; selector: name=mw1313.eqiad.wmnet
12:10 elukey: manually killed (SIGTERM) hhvm on mw1313 - high load, hhvm-dump-debug not responsive
12:01 elukey@puppetmaster1001: conftool action : set/pooled=no; selector: name=mw1313.eqiad.wmnet
11:53 elukey: restart hhvm on mw1285 - hhvm-dump-debug in /tmp/hhvm.17700.bt
11:24 hoo@tin: Synchronized wmf-config/Wikibase-labs.php: Consistency sync (duration: 00m 50s)
10:52 volans: restarted pdfrender on scb1001, was stuck since 2d with AssertionError: display is not set!
2017-10-27
20:54 MaxSem: running migratePreferences.php on group2 wikis
19:09 hoo: Ran scap pull on mwdebug1001
18:20 awight@tin: Finished deploy [ores/deploy@185170f]: Test pip-9 scap trick on ores1002 (non-production) (duration: 02m 17s)
18:18 awight@tin: Started deploy [ores/deploy@185170f]: Test pip-9 scap trick on ores1002 (non-production)
17:54 hoo: Taking mwdebug1001 to do tests regarding T179156
16:38 gehel: re-enabling wdqs-updater
16:16 bblack: cp1054 varnish backend restarted (was 503s / bad-conns target of ongoing issues)
16:16 gehel: wdqs updater is now stopped for real
16:10 XioNoX: deactivating BGP sessions to Zayo in eqiad (flapping)
15:58 gehel: disabling wdqs updater on all nodes
15:50 hoo@tin: Synchronized wmf-config/Wikibase-production.php: Disable constraints check with SPARQL for now (T179156 ) (duration: 00m 50s)
15:48 marostegui: Compress InnoDB on db2039 (s6) - T178359
15:46 bblack: restart varnish-backend on cp4022 (upload@ulsfo) - mailbox
14:49 bblack: turn on cp4024 port on asw-ulsfo
13:52 bblack: reboot cp4021 to clean up oom messes
13:49 bblack: restarting nginx on cp4021, without NUMA memory constraints
12:10 marostegui: Optimize commonswiki.templatelinks on dbstore1001 - T162789
12:03 elukey: execute systemctl reset-failed kafka-mirror-main-eqiad_to_jumbo-eqiad.service on kafka-jumbo hosts (old unit not deployed anymore)
11:41 mutante: gerrit back - maintenance over
11:39 mutante: gerrit restart to apply gerrit:386793 is imminent
11:36 ema: cp4023: varnish-backend-restart for lag
11:06 mobrovac@tin: Finished deploy [citoid/deploy@ff63420]: Update dependencies (duration: 03m 18s)
11:04 marostegui: Optimize cebwiki.templatelinks on db1103 - T174509
11:03 mobrovac@tin: Started deploy [citoid/deploy@ff63420]: Update dependencies
08:56 marostegui: Optimize table commonswiki.recentchanges on dbstore1001 - T162789
08:11 marostegui: Drop redundant indexes from pagelinks and templatelinks on s3 wikis only for dbstore2001 and dbstore2002 - T174509
05:59 marostegui: Stop MySQL on db2084 to copy s5 to db2086 - T178359
05:55 marostegui: dbstore1001: convert itwiki.pagelinks to TokuDB - T162789
05:53 mutante: uploaded parsoid_0.8.0 to releases.wikimedia.org (for Subbu - T179134 )
05:46 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1060 - T174509 (duration: 00m 50s)
03:02 bblack: cp1067, cp4026 - backend restarts, mailbox lag
2017-10-26
22:47 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: wikidata to wmf.4
22:39 bblack: restarting varnish-backend on cp1053 (mailbox lag from ongoing issues elsewhere?)
21:40 bblack: raising backend max_connections for api.svc.eqiad.wmnet + appservers.svc.eqiad.wmnet from 1K to 10K on cp1053.eqiad.wmnet (current funnel for the bulk of the 503s)
21:32 hoo@tin: Synchronized wmf-config/InitialiseSettings.php: Temporary disable remex html (T178632 ) (duration: 00m 50s)
21:30 hoo@tin: Synchronized wmf-config/InitialiseSettings.php: Temporary disable remex html (T178632 ) (duration: 00m 50s)
21:00 hoo: Fully revert all changes related to T178180
20:59 hoo@tin: Synchronized wmf-config/Wikibase.php: Revert "Add property for RDF mapping of external identifiers for Wikidata" (T178180 ) (duration: 00m 50s)
20:02 ladsgroup@tin: Synchronized wmf-config/InitialiseSettings.php: UBN! disbale ores for wikidata (T179107 ) (duration: 00m 50s)
20:00 ladsgroup@tin: Synchronized wmf-config/InitialiseSettings.php: UBN! disbale ores for wikidata (T179107 ) (duration: 00m 50s)
19:41 awight@tin: Finished deploy [ores/deploy@0adae70]: Increase extractor wikidata API timeout to 15s, T179107 (duration: 07m 25s)
19:36 aaron@tin: Started restart [jobrunner/jobrunner@a20d043]: (no justification provided)
19:34 awight@tin: Started deploy [ores/deploy@0adae70]: Increase extractor wikidata API timeout to 15s, T179107
19:33 awight@tin: Finished deploy [ores/deploy@0adae70]: Increase extractor wikidata API timeout to 15s, T179107 (duration: 00m 10s)
19:33 awight@tin: Started deploy [ores/deploy@0adae70]: Increase extractor wikidata API timeout to 15s, T179107
19:27 demon@tin: Synchronized php-1.31.0-wmf.5/includes/libs/filebackend/SwiftFileBackend.php: Better error grouping (duration: 00m 50s)
19:14 ladsgroup@tin: Synchronized php-1.31.0-wmf.5/extensions/WikibaseQualityConstraints/includes/ConstraintCheck/DelegatingConstraintChecker.php: Fix sorting of NullResults (T179038 ) (duration: 00m 49s)
19:12 ladsgroup@tin: Synchronized php-1.31.0-wmf.5/extensions/WikibaseQualityConstraints/tests/phpunit/DelegatingConstraintCheckerTest.php: Fix sorting of NullResults (T179038 ) (duration: 00m 50s)
18:52 ladsgroup@tin: Synchronized php-1.31.0-wmf.5/extensions/Wikidata/extensions/Constraints/tests/phpunit/DelegatingConstraintCheckerTest.php: Fix sorting of NullResults (T179038 ) (duration: 00m 49s)
18:51 ladsgroup@tin: Synchronized php-1.31.0-wmf.5/extensions/Wikidata/extensions/Constraints/includes/ConstraintCheck/DelegatingConstraintChecker.php: Fix sorting of NullResults (T179038 ) (duration: 01m 04s)
18:41 bblack: eqiad cp servers: rolling quick depool -> repool around ethtool parameter changes for -lro,-pause
18:30 urandom: Dropping and recreating keyspace enwiki_T_page__summary (restbase-ng cluster)
18:27 bblack: esams cp servers: rolling quick depool -> repool around ethtool parameter changes for -lro,-pause
18:26 urandom: T179105 : Altering "enwiki_T_page__summary".data to use LZ4Compressor
17:59 bblack: codfw cp servers: rolling quick depool -> repool around ethtool parameter changes for -lro,-pause
17:57 bblack@neodymium: conftool action : set/pooled=no; selector: name=cp4024.ulsfo.wmnet
17:49 bblack: ulsfo cp servers: rolling quick depool -> repool around ethtool parameter changes for -lro,-pause
17:47 awight@tin: Finished deploy [ores/deploy@f5deb7f]: Revscoring 2 on ores1002 (non-production) (duration: 01m 45s)
17:45 awight@tin: Started deploy [ores/deploy@f5deb7f]: Revscoring 2 on ores1002 (non-production)
17:36 arlolra: Updated Parsoid to 8e99708a (T176728 )
17:30 arlolra@tin: Finished deploy [parsoid/deploy@4882c59]: Updating Parsoid to 8e99708a (duration: 08m 40s)
17:21 arlolra@tin: Started deploy [parsoid/deploy@4882c59]: Updating Parsoid to 8e99708a
17:17 awight@tin: Finished deploy [ores/deploy@971be22]: Rolling back scb1002, T175180 T178441 (duration: 00m 32s)
17:16 awight@tin: Started deploy [ores/deploy@971be22]: Rolling back scb1002, T175180 T178441
17:14 awight@tin: Finished deploy [ores/deploy@971be22]: ORES w/ revscoring 2 and Celery 4, T175180 T178441 (duration: 03m 20s)
17:11 awight@tin: Started deploy [ores/deploy@971be22]: ORES w/ revscoring 2 and Celery 4, T175180 T178441
17:09 marostegui: Optimize ruwiki.recentchanges on dbstore1001 - T162789
17:08 dcausse: [logstash] reimporting today logs
17:08 dcausse: [logstash] deleting today logs to reapply mapping template
16:58 ottomata: now mirroring main Kafka cluster topics to jumbo Kafka cluster, with MirrorMaker instances running on analytics-eqiad broker nodes. https://phabricator.wikimedia.org/T177216
16:44 awight@tin: Finished deploy [ores/deploy@971be22]: Push ORES w/ Celery 4 support to new cluster (try to rebuild venv), T178441 (duration: 01m 02s)
16:43 awight@tin: Started deploy [ores/deploy@971be22]: Push ORES w/ Celery 4 support to new cluster (try to rebuild venv), T178441
16:33 awight@tin: Finished deploy [ores/deploy@971be22]: Push ORES w/ Celery 4 support to new cluster (take 3), T178441 (duration: 15m 33s)
16:21 urandom: restarted cassandra, restbase2001-b
16:18 awight@tin: Started deploy [ores/deploy@971be22]: Push ORES w/ Celery 4 support to new cluster (take 3), T178441
16:15 urandom: truncating cassandra hints in restbase-ng cluster
16:12 mobrovac@tin: Finished deploy [restbase/deploy@860cbfe]: Revert all summaries, back to all but WP (duration: 07m 57s)
16:04 mobrovac@tin: Started deploy [restbase/deploy@860cbfe]: Revert all summaries, back to all but WP
16:04 mobrovac@tin: Started deploy [restbase/deploy@860cbfe]: Revert all summaries, back to all but WP
15:50 mobrovac@tin: Finished deploy [restbase/deploy@522321a]: Double-process all summaries (duration: 09m 09s)
15:45 urandom: restarting cassandra, restbase2001-b
15:41 andrewbogott: upgrading dnsmasq on labnet1001 and labnet1002
15:41 mobrovac@tin: Started deploy [restbase/deploy@522321a]: Double-process all summaries
15:36 dcausse: reindexing logs in logstash
15:34 dcausse: deleting and reimporting logs in logstash (today data, might lose some logs in the process)
14:10 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1101 - T178383 (duration: 00m 49s)
14:04 zeljkof: EU SWAT finished
14:03 zfilipin@tin: Synchronized php-1.31.0-wmf.5/extensions/Wikibase: SWAT: Make search for titles be always uppercase (T179045) (duration: 01m 36s)
13:52 zfilipin@tin: Synchronized php-1.31.0-wmf.5/extensions/Wikidata: SWAT: Make search for titles be always uppercase (T179045) (duration: 02m 10s)
13:46 Amir1: ladsgroup@terbium:~$ mwscript extensions/Wikidata/extensions/Wikibase/repo/maintenance/rebuildPropertyInfo.php --wiki=wikidatawiki --rebuild-all --force (T178180 )
13:41 godog: delete old CF cassandra metrics - T173436
13:32 godog: force delete old graphite cassandra metrics - T179057
13:28 moritzm: installing icu security update on trusty
13:28 zfilipin@tin: Synchronized wmf-config/Wikibase.php: SWAT: Add property for RDF mapping of external identifiers for Wikidata (T178180) (duration: 00m 50s)
13:21 marostegui: Compress InnoDB on db2040 - T178359
13:21 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable signature button in Projekt namespace on sewikimedia (T175363) (duration: 00m 50s)
13:13 zfilipin@tin: Synchronized wmf-config/InterwikiSortOrders.php: SWAT: Fix interwiki sort order for Northern Sami (T178965) (duration: 00m 50s)
12:57 marostegui@tin: Synchronized wmf-config/db-codfw.php: Reorganize future rc multi-instance hosts - T178359 (duration: 00m 50s)
12:30 bblack: powercycle cp4024 (depooled)
12:29 bblack@neodymium: conftool action : set/pooled=no; selector: name=cp4024.ulsfo.wmnet
10:36 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Clean up some comments in s3 (duration: 00m 50s)
09:01 moritzm: upgrade script runners to wikidiff2 1.5.1
08:58 ema: cache_text/upload varnish-be: set rutime parameter timeout_idle to 120s
08:34 moritzm: uploaded php-wikidiff2/hhvm-wikidiff2 1.5.1 for stretch-wikimedia
08:22 godog: add 100G to graphite1003's /var/lib/carbon
08:14 moritzm: upgrade deployment servers to wikidiff2 1.5.1
07:57 marostegui: Drop databases in s1 and s2 from db1047 and unconfigure replication - T177405
07:43 marostegui: Stop MySQL on db1047 to copy data over db1108 - T177405 T156844
07:28 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1077 - T164488 (duration: 00m 50s)
07:13 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1077 - T164488 (duration: 00m 50s)
07:12 marostegui: Stop replication in sync on db1103 and db1077 to fix data drifts - T164488
07:10 moritzm: upgrade mw1319-mw1328 to wikidiff2 1.5.1
06:57 marostegui: Stop MySQL on db2088 and db2084 to copy s2 and s4 to db2091 - T178359
06:57 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db2038 (duration: 00m 50s)
06:55 marostegui: Optimize pagelinks and templatelinks on db1095 s1 - T174509
06:51 marostegui: Optimize recentchanges on s4 and s6 on labsdb1010 - T177772
06:15 moritzm: upgrade mw1299-mw1311 to wikidiff2 1.5.1
05:15 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1060 - T174509 (duration: 00m 50s)
05:14 marostegui: Optimize recentchanges on s4 and s6 on labsdb1009 - T177772
05:12 marostegui: Optimize pagelinks and templatelinks on db1060 - T174509
03:01 l10nupdate@tin: ResourceLoader cache refresh completed at Thu Oct 26 03:01:19 UTC 2017 (duration 6m 48s)
02:54 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.5) (duration: 09m 30s)
02:31 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.4) (duration: 10m 34s)
2017-10-25
23:48 legoktm@tin: Synchronized php-1.31.0-wmf.5/extensions/VisualEditor/modules/ve-mw/ui/styles/contextitems/ve.ui.MWInternalLinkContextItem.css: MWInternalLinkContextItem: increase specificity to override OOUI changes - T178933 (duration: 00m 50s)
23:39 legoktm@tin: Synchronized wmf-config/abusefilter.php: Change threshold for slow AbuseFilter logging to 800ms - T179039 (duration: 00m 50s)
22:42 MaxSem: running migratePreferences.php on group1 wikis
22:01 mobrovac@tin: Finished deploy [restbase/deploy@860cbfe]: Revert double-processing of all summaries to all except WPs, take #2 (duration: 09m 09s)
21:52 mobrovac@tin: Started deploy [restbase/deploy@860cbfe]: Revert double-processing of all summaries to all except WPs, take #2
21:48 mobrovac@tin: Started deploy [restbase/deploy@60ce036]: Revert double-processing of all summaries to all except WPs
21:33 bblack: experimental - disabled LRO on lvs4001+lvs4002
20:34 hasharAway: restarted zuul to flush a long queue in experimental pipeline (no other changes enqueued)
20:21 demon@tin: Synchronized php-1.31.0-wmf.5/extensions/ProofreadPage/: unbreak multiline text input (duration: 00m 53s)
19:28 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group1 to wmf.5
18:45 bblack: s/1\.13\.16/1.13.6/
18:45 niharika29@tin: Synchronized wmf-config/InitialiseSettings.php: Enable fine grained usage tracking on statement usage wikis T172914 (duration: 00m 50s)
18:44 bblack: reprepro: uploaded nginx-1.13.16-2+wmf1 packages to stretch and jessie
18:36 niharika29@tin: Synchronized wmf-config/InitialiseSettings.php: Make disabled usage aspect use the new config T172914 (duration: 00m 50s)
18:33 niharika29@tin: Synchronized php-1.31.0-wmf.4/extensions/Wikidata: Allow specifying a replacement usage type in disabledUsageAspects T178153 (duration: 02m 10s)
18:25 niharika29@tin: Synchronized wmf-config/Wikibase.php: Make using CirrusSearch engine default for wbsearchentities T175741 (duration: 00m 48s)
18:23 niharika29@tin: Synchronized wmf-config/InitialiseSettings.php: Add AbuseFilterSlow channel to monolog T178853 (duration: 00m 50s)
18:20 niharika29@tin: Synchronized php-1.31.0-wmf.5/extensions/ORES/: Update ApiHooks to conform with v3 responses T178962 (duration: 00m 50s)
18:19 niharika29@tin: Synchronized php-1.31.0-wmf.4/extensions/ORES/: Update ApiHooks to conform with v3 responses T178962 (duration: 00m 51s)
18:11 niharika29@tin: Synchronized wmf-config/InitialiseSettings.php: Enable Special:EmailUser User Prohibit on All Wikis [mediawiki-config] - https://gerrit.wikimedia.org/r/386231 (duration: 00m 51s)
17:17 moritzm: upgrade mw2224-mw2242 to wikidiff2 1.5.1
16:59 moritzm: upgrade mw2190-mw2199 to wikidiff2 1.5.1
16:38 no_justification: gerrit: purging last remnants of debian package and running puppet. service may briefly restart
16:05 moritzm: upgrade mw2180-mw2189 to wikidiff2 1.5.1
16:03 moritzm: copied arcconf, hp-health, hpacucli, hpssa, hpssacli, hpssaducli, lsiutil, megacli, megaclisas-status to stretch-wikimedia/thirdparty/hwraid (thirdparty will be dropped at a later point)
15:39 marostegui: Optimize pagelinks and templatelinks on labsdb1011 - T174509
15:31 marostegui: Power off db1101 for HW maintenance - T178383
15:21 godog: bounce rsyslog on wezen
15:09 mattflaschen@tin: Synchronized wmf-config/InitialiseSettings.php: T177836 : Deploy Compact Language Links on the German Wikipedia (duration: 00m 49s)
14:52 gehel: applying new template to elasticsearch / logstash - T178530
14:51 ebernhardson: update logstash template on logstash elsaticsearch cluster for T178530
14:47 mattflaschen@tin: Synchronized wmf-config/InitialiseSettings.php: T166759 : Set $wgAutoloadAttemptLowercase to false (duration: 00m 50s)
14:37 mattflaschen@tin: Synchronized php-1.31.0-wmf.5/extensions/WikimediaEvents/modules/ext.wikimediaEvents.searchSatisfaction.js: T177956 : [cirrus] Turn off recall A/B test on enwiki (duration: 00m 49s)
14:36 mattflaschen@tin: Synchronized php-1.31.0-wmf.4/extensions/WikimediaEvents/modules/ext.wikimediaEvents.searchSatisfaction.js: T177956 : [cirrus] Turn off recall A/B test on enwiki (duration: 00m 49s)
14:25 moritzm: moved jmxtrans and prometheus-jmx-exporter from thirdparty to main as part of repo reorg for stretch-wikimedia
14:11 matt_flaschen: 'Deployed T178933 : MWInternalLinkContextItem: increase specificity to override OOUI changes'
14:10 mattflaschen@tin: Synchronized php-1.31.0-wmf.4/extensions/VisualEditor/modules/ve-mw/ui/styles/contextitems/ve.ui.MWInternalLinkContextItem.css: (no justification provided) (duration: 00m 50s)
13:36 mattflaschen@tin: Synchronized wmf-config/InitialiseSettings.php: T168886 : Updating wikis with consolidate editing feedback (duration: 00m 51s)
13:30 godog: deploy thumbor 1.7 - T178974
13:30 elukey: restart yarn nodemanager and hdfs datanode on analytics1030 to apply new JVM settings - T178876
13:15 chasemp: merge arturo's key as part of ops
13:04 matt_flaschen: About to run ULSCompactLinksDisablePref.php: Setting user preferences for Compact Language Links on dewiki, before removing Beta Feature and changing to regular preference. T177836
12:36 moritzm: upgrade mw2163-mw2179 to wikidiff2 1.5.1
12:17 mobrovac@tin: Synchronized wmf-config/InitialiseSettings-labs.php: Activate warning logging for JobExecutor (duration: 00m 50s)
12:16 mobrovac@tin: Synchronized wmf-config/InitialiseSettings.php: Activate warning logging for JobExecutor (duration: 00m 50s)
12:12 moritzm: upgrade mw1266-mw1275 to wikidiff2 1.5.1
11:30 moritzm: upgrade mw1280-mw1290, mw1312-mw1318 to wikidiff2 1.5.1
11:23 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2038 - T178359 (duration: 00m 49s)
10:49 mobrovac@tin: Finished scap: wmf-config/jobqueue-labs.php beta-only change for CP4JQ (duration: 04m 43s)
10:45 mobrovac@tin: Started scap: wmf-config/jobqueue-labs.php beta-only change for CP4JQ
10:41 mobrovac@tin: Finished deploy [restbase/deploy@2ae5b0c]: Remove /title/{title}/ and double-process all summaries - T158100 (duration: 09m 41s)
10:33 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1077 - T164488 (duration: 00m 50s)
10:32 moritzm: upgrade remaining video scalers in eqiad to wikidiff2 1.5.1
10:31 mobrovac@tin: Started deploy [restbase/deploy@2ae5b0c]: Remove /title/{title}/ and double-process all summaries - T158100
09:31 moritzm: upgrade mw1221-mw1235 to wikidiff2 (API servers)
09:07 gehel: rolling restart of all wdqs nodes for GC config change - T175919
08:12 moritzm: upgrade mw1238-mw1258 to wikidiff2
08:02 _joe_: decommissioning mw1168/69 as videoscalers, stopped jobrunner/jobchron, will stop hhvm/nginx/apache once current processing is done
07:42 gehel@tin: Finished deploy [wdqs/wdqs@0bb2b5c]: wdqs-updater upgrade for jolokia - T167842 (duration: 01m 44s)
07:40 gehel@tin: Started deploy [wdqs/wdqs@0bb2b5c]: wdqs-updater upgrade for jolokia - T167842
07:05 marostegui: Optimize pagelinks and templatelinks on db1021 - T174509
06:15 marostegui: Stop MySQL on db2038 to copy its data to db2084 - T178359
06:08 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2035 - T178359 (duration: 00m 50s)
06:02 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1065 - T174509 (duration: 00m 50s)
05:44 marostegui: Stop replication in sync on db1103 and db1077 to checksum data - T164488
05:42 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1077 - T164488 (duration: 01m 00s)
03:22 l10nupdate@tin: ResourceLoader cache refresh completed at Wed Oct 25 03:22:51 UTC 2017 (duration 7m 20s)
03:15 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.5) (duration: 16m 46s)
02:35 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.4) (duration: 08m 59s)
2017-10-24
22:54 ejegg: updated civicrm from from 0546446 to 25c7a33
21:24 ejegg: updated civicrm from 18edce1 to 0546446
21:21 mepps: updated payments-wiki from 1011434 to b88b805
20:47 arlolra@tin: Finished deploy [parsoid/deploy@UNKNOWN]: (no justification provided) (duration: 00m 13s)
20:47 arlolra@tin: Started deploy [parsoid/deploy@UNKNOWN]: (no justification provided)
20:29 mutante: restarting gerrit to make gerrit-ssh listen on all IPs again
19:49 arlolra@tin: Finished deploy [parsoid/deploy@UNKNOWN]: Enabling logging for skwiki posts (duration: 00m 35s)
19:48 arlolra@tin: Started deploy [parsoid/deploy@UNKNOWN]: Enabling logging for skwiki posts
19:46 mutante: restarting gerrit on cobalt to apply change 354074
19:13 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group0 to wmf.5
18:57 awight@tin: Finished deploy [ores/deploy@fb55ab8]: Rollback ORES to Revscoring 1.0, T175180 (duration: 10m 13s)
18:47 awight@tin: Started deploy [ores/deploy@fb55ab8]: Rollback ORES to Revscoring 1.0, T175180
18:47 ejegg: updated civicrm from b9d2067 to 18edce1
18:26 awight@tin: Finished deploy [ores/deploy@fb55ab8]: Update ORES to revscoring 2.x, T175180 (take 2) (duration: 10m 03s)
18:16 awight@tin: Started deploy [ores/deploy@fb55ab8]: Update ORES to revscoring 2.x, T175180 (take 2)
18:05 arlolra: Updated Parsoid to a3be9cfc (T96923 , T177784 , T176568 , T25467 )
17:57 arlolra@tin: Finished deploy [parsoid/deploy@af55c4b]: Updating Parsoid to a3be9cfc (duration: 09m 16s)
17:48 arlolra@tin: Started deploy [parsoid/deploy@af55c4b]: Updating Parsoid to a3be9cfc
17:28 demon@tin: Finished scap: bootstrap wmf.5 (duration: 48m 34s)
17:18 urandom: T178845 : Rolling Cassandra restart, eqiad
17:07 awight@tin: Started deploy [ores/deploy@fb55ab8]: Update ORES to revscoring 2.x, T175180
16:50 herron: upgrading puppet packages on codfw puppetmaster T177254
16:39 demon@tin: Started scap: bootstrap wmf.5
16:10 demon@tin: Pruned MediaWiki: 1.31.0-wmf.3 [keeping static files] (duration: 01m 20s)
15:58 elukey: drop MediaViewer_10867062_15423246 and MobileWebUIClickTracking_10742159_15423246 from the log database on db1046 (archived on hadoop) - T168303
15:53 urandom: T178845 : Rolling Cassandra restart, codfw, row b
15:48 elukey: drop table Edit_13457736_15423246 from the log database (Eventlogging) on db104[6,7], dbstore1002
15:39 elukey: set net.netfilter.nf_conntrack_tcp_timeout_time_wait=65 to mw[1308-1311] - T136094
15:09 gehel: restarting blazegraph on all wdqs nodes for GC config change - T175919
14:48 _joe_: also stopping HHVM, nginx, apache2 , and disabling all the services
14:46 _joe_: stopping the jobrunner service on mw1161-67
14:39 marostegui: Optimize pagelinks templatelinks and recentchanges on db1030 - T174509 https://phabricator.wikimedia.org/T177772
14:36 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1065 - T174509 (duration: 00m 45s)
14:33 marostegui: Optimize ores_classification on enwiki db1065 - T159753
14:32 marostegui: Optimize pagelinks and templatelinks on db1065 - T174509
14:24 gehel: restarting blazegraph on wdqs2001 for GC config change - T175919
14:20 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1106 weight (duration: 00m 45s)
13:42 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db2035 - T178359 (duration: 00m 45s)
13:41 marostegui: Stop MySQL on db2035 to copy its data to db2088 - T178359
13:39 zeljkof: EU SWAT finished
13:39 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Add $wgNamespaceRobotPolicies Config for hewikisource (T178775) (duration: 00m 46s)
10:50 dcausse: cirrus/elasticsearch: reindexing of 167 small wikis done (T177871 )
09:22 marostegui: Stop s1 on db2092 to copy its data to db2088 - T178359
08:26 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Restore db1082 original weight - T178460 (duration: 00m 45s)
07:48 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1082 weight - T178460 (duration: 00m 45s)
07:46 marostegui: Stop replication in sync on db1103 and db2018 to fix data drifts - T164488
07:40 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1078 - T164488 (duration: 00m 46s)
07:28 marostegui: Stop replication in sync on db1078 and db1103 to fix data drifts -Â https://phabricator.wikimedia.org/T164488
06:22 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1078 - T164488 (duration: 00m 45s)
06:16 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1082 with low weight - T178460 (duration: 00m 47s)
02:32 l10nupdate@tin: ResourceLoader cache refresh completed at Tue Oct 24 02:32:25 UTC 2017 (duration 6m 33s)
02:25 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.4) (duration: 08m 13s)
2017-10-23
23:09 demon@tin: Synchronized wmf-config/throttle.php: Working Class Movement Library (Salford) throttle rule (duration: 00m 46s)
22:59 bblack: cp4024: varnish-backend-restart for lag
21:40 XioNoX: increasing MTU on Zayo transit interfaces
20:34 arlolra: Updated Parsoid to 1cefef12 (T178217 )
20:25 arlolra@tin: Finished deploy [parsoid/deploy@0759bd1]: Updating Parsoid to 1cefef12 (duration: 11m 14s)
20:14 arlolra@tin: Started deploy [parsoid/deploy@0759bd1]: Updating Parsoid to 1cefef12
19:37 bsitzmann@tin: Finished deploy [mobileapps/deploy@a654ce1]: Update mobileapps to 3628105 (T178828 ) (duration: 06m 59s)
19:30 bsitzmann@tin: Started deploy [mobileapps/deploy@a654ce1]: Update mobileapps to 3628105 (T178828 )
19:00 joal@tin: Finished deploy [analytics/aqs/deploy@ad22e0c]: Upgrade AQS node modules to restbase-latests - Redeploy after config fix (duration: 02m 34s)
18:58 joal@tin: Started deploy [analytics/aqs/deploy@ad22e0c]: Upgrade AQS node modules to restbase-latests - Redeploy after config fix
18:58 joal@tin: Finished deploy [analytics/aqs/deploy@ad22e0c]: Upgrade AQS node modules to restbase-latests - Redeploy after config fix (duration: 05m 07s)
18:52 joal@tin: Started deploy [analytics/aqs/deploy@ad22e0c]: Upgrade AQS node modules to restbase-latests - Redeploy after config fix
18:52 joal@tin: Finished deploy [analytics/aqs/deploy@ad22e0c]: Upgrade AQS node modules to restbase-latests (duration: 321m 51s)
17:56 gehel@tin: Finished deploy [wdqs/wdqs@b84592f]: WDQS GUI update (duration: 01m 58s)
17:54 gehel@tin: Started deploy [wdqs/wdqs@b84592f]: WDQS GUI update
17:48 XenoRyet: updated smashpig from d056a13 to 5262d53
17:32 demon@tin: Synchronized scap/plugins/prep.py: Fixes and improvements of various sorts (duration: 00m 45s)
17:26 gehel@tin: Finished deploy [wdqs/wdqs@4626acb]: deploying latest WDQS version, blazegraph + GUI updates (duration: 02m 07s)
17:24 gehel@tin: Started deploy [wdqs/wdqs@4626acb]: deploying latest WDQS version, blazegraph + GUI updates
16:53 demon@tin: Pruned MediaWiki: 1.30.0-wmf.19 (duration: 03m 09s)
15:59 elukey: forced BBU learn cycle on db1046 - T166141
15:00 herron: wiped resolver cache of records puppet.cofdw.wmnet and puppet.ulsfo.wmnet (T177254 )
14:46 herron: depooling codfw puppetmaster (via dns)
14:31 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1078 - T164488 (duration: 00m 46s)
14:05 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2037 - T178359 (duration: 00m 46s)
13:48 marostegui: Stop mysql and poweroff db1082 for HW maintenance - T178460
13:30 joal@tin: Started deploy [analytics/aqs/deploy@ad22e0c]: Upgrade AQS node modules to restbase-latests
13:19 hashar@tin: Synchronized wmf-config/InitialiseSettings.php: Add aleph500.biblacad.ro to $wgCopyUploadsDomains - T178753 (duration: 00m 47s)
13:19 gehel: rolling restart of wdqs for GC tuning - T175919
13:18 hashar@tin: Synchronized wmf-config/InitialiseSettings.php: Remove hooserv.net from wgCopyUploadsDomains (duration: 00m 47s)
13:08 zeljkof: EU SWAT finished, no deploys
13:00 bblack@neodymium: conftool action : set/pooled=no; selector: dc=ulsfo,cluster=cache_text,name=cp4018.ulsfo.wmnet
12:59 bblack@neodymium: conftool action : set/pooled=yes; selector: dc=ulsfo,cluster=cache_text,name=cp4032.ulsfo.wmnet
12:53 bblack@neodymium: conftool action : set/pooled=no; selector: dc=ulsfo,cluster=cache_text,name=cp4017.ulsfo.wmnet
12:47 bblack@neodymium: conftool action : set/pooled=yes; selector: dc=ulsfo,cluster=cache_text,name=cp4031.ulsfo.wmnet
12:46 moritzm: upgrading hhvm-wikdiff2 on remaining job runners in codfw
12:39 moritzm: upgrading hhvm-wikdiff2 on remaining video scalers in codfw
12:32 bblack@neodymium: conftool action : set/pooled=no; selector: dc=ulsfo,cluster=cache_text,name=cp4010.ulsfo.wmnet
12:29 hoo@tin: Synchronized wmf-config/InitialiseSettings.php: Remove hooserv.net from wgCopyUploadsDomains (duration: 00m 46s)
12:26 moritzm: upgrading hhvm-wikdiff2 on mw2153-mw2162 (job runners)
12:21 moritzm: upgrading hhvm-wikdiff2 on mw2254-mw2258 (app servers)
12:20 bblack@neodymium: conftool action : set/pooled=yes; selector: dc=ulsfo,cluster=cache_text,name=cp4030.ulsfo.wmnet
12:15 bblack@neodymium: conftool action : set/pooled=no; selector: dc=ulsfo,cluster=cache_text,name=cp4009.ulsfo.wmnet
12:12 bblack@neodymium: conftool action : set/pooled=yes; selector: dc=ulsfo,cluster=cache_text,name=cp4029.ulsfo.wmnet
12:03 hoo@tin: Synchronized wmf-config/InitialiseSettings.php: Enable description usage tracking on further wikis (T178515 ) (duration: 00m 47s)
11:11 moritzm: upgrading hhvm-wikdiff2 on mw2097-mw2114
11:10 marostegui: Compress innodb on db2038 and db2084 - T178359
10:56 moritzm: upgrading hhvm-wikdiff2 on remaining API servers in codfw
10:46 moritzm: upgrading hhvm-wikdiff2 on mw1161-mw1167 (job runners)
10:28 moritzm: upgrading hhvm-wikdiff2 on mw1180-mw1188 (app servers)
10:28 marostegui: Remove /srv from db1015 as it has been stopped for weeks now and will be decommissioned (and it is alerting low on disk space) - T173570
10:09 dcausse: elasticsearch/cirrus reindexing 167 wikis from terbium (T177871 )
10:08 gehel: restart tilerator / karthotherian for config change on all maps clusters - T160215
10:01 ema: restart varnish-be on cp4021 (mbox lag)
09:58 moritzm: upgrading hhvm-wikidiff2 on mw1293-mw1298 (scalers)
09:09 moritzm: upgrading hhvm-wikidiff2 on mw1189-mw1208 (API servers)
08:09 marostegui: Rename wb_entity_per_page table on s5 db1092 and s3 db1077 - T177601
08:07 moritzm: upgrading hhvm-wikdiff2 on mw1209-mw1220 (app servers)
07:23 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db2037 and db2038 - T178359 (duration: 00m 46s)
06:56 moritzm: installing expat security updates on trusty
06:51 marostegui: Stop replication in sync on db1078 and db1103 to checksum data - T164488
06:50 moritzm: installing poppler security updates on trusty
06:47 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1078 - T164488 (duration: 00m 47s)
06:30 marostegui: Stop replication in sync on db1103 and db2018 to fix data drifts - T164488
06:09 marostegui: Stop replication on db2092 to re-import and compress linter and watchlist table
02:41 l10nupdate@tin: ResourceLoader cache refresh completed at Mon Oct 23 02:41:04 UTC 2017 (duration 6m 55s)
02:34 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.4) (duration: 08m 45s)
2017-10-21
08:16 hashar: Mass code-review+2 changes made by LibraryUpdater that already had Code-Review:+2 and NOT verified=-1 ( ping legoktm )
08:08 hashar: Stopping Zuul to flush its queue
2017-10-20
23:46 demon@tin: Finished deploy [gerrit/gerrit@21b7332]: pushing deleteproject plugin (duration: 00m 09s)
23:46 demon@tin: Started deploy [gerrit/gerrit@21b7332]: pushing deleteproject plugin
19:19 hasharAway: restarting Jenkins - T178608
18:52 bblack: vhtcpd upgrade + queue delay puppetization deploy ( https://gerrit.wikimedia.org/r/385415 ) done on cp* fleet - T133821
18:28 bblack: upgrading vhtcpd to 0.1.1-1 on the cp* fleet
17:38 mutante: removing Apache and HHVM Ganglia stats from all appservers, part of retiring Ganglia (T177225 ) - some transient puppet issues while purging package and remnants - use grafana dashboard at https://grafana.wikimedia.org/dashboard/db/prometheus-apache-hhvm-dc-stats
16:39 aaron@tin: Synchronized php-1.31.0-wmf.4/extensions/TwoColConflict: 5c3224980fe (duration: 00m 46s)
16:37 aaron@tin: Synchronized php-1.31.0-wmf.4/extensions/OATHAuth: c16a94e17b9981 (duration: 00m 47s)
16:05 krinkle@tin: Synchronized php-1.31.0-wmf.4/extensions/WikimediaMaintenance/getJobQueueLengths.php: I8737795e6 (duration: 00m 47s)
16:05 XioNoX: removing "policy-options policy-statement LVS_import term secondary" on cr routers
14:39 cmjohnson1: updating power firmware analytics1037
10:14 moritzm: upgrading hhvm-wikidiff2 to 1.5.1 on mw2150/mw2151/mw2244/mw2245
09:17 moritzm: upgrading hhvm-wikidiff2 to 1.5.1 on mw2200-mw2223
08:05 Amir1: mwscript createAndPromote.php --wiki=techconductwiki --sysop --bureaucrat 'Ladsgroup' --force
07:39 jynus: stopping db2033, too
07:36 jynus: stopping dbstore1002 and dbstore2002 x1 replication in sync
07:18 _joe_: uploading updated versions of docker base images (wikimedia-jessie, wikimedia-stretch) to docker-registry.wikimedia.org
06:26 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1078 - T164488 (duration: 00m 46s)
06:17 marostegui: Stop replication in sync on db1103 and db1078 - T164488
06:15 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1078 - T164488 (duration: 00m 45s)
06:06 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1072 - T164488 (duration: 00m 46s)
05:52 marostegui: Stop replication in sync on db1103 and db1072 - T164488
05:41 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1051 - T174509 (duration: 00m 47s)
02:19 bblack: upgrading vhtcpd to 0.1.1-1 on: cp1008, cp3030, cp3034, cp4029-32
02:17 bblack: uploaded vhtcpd-0.1.1-1 to reprepro
2017-10-19
22:21 addshore: stop running extra change dispatchers on terbium for wikidatawiki
21:58 addshore: running extra change dispatchers on terbium for wikidatawiki for a short while
21:24 andrewbogott: restarting nginx on nitrogen
21:22 andrewbogott: restarting puppetdb on nitrogen
20:00 mforns@tin: Finished deploy [analytics/refinery@0c9ba04]: deploying refinery to fix banner activity false alarms (duration: 00m 14s)
19:59 mforns@tin: Started deploy [analytics/refinery@0c9ba04]: deploying refinery to fix banner activity false alarms
19:53 bblack: reboot cp4029-32 (initial reboot post-puppetization)
19:12 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group2 to wmf.4
18:36 thcipriani@tin: Synchronized portals: SWAT: Bumping portals to master T128546 (duration: 00m 51s)
18:35 thcipriani@tin: Synchronized portals/prod/wikipedia.org/assets: SWAT: Bumping portals to master T128546 (duration: 00m 50s)
18:28 thcipriani@tin: Synchronized wmf-config/abusefilter.php: SWAT: Enable $wgAbuseFilterProfile on ptwiki T177641 (duration: 00m 50s)
18:20 thcipriani@tin: Synchronized php-1.31.0-wmf.4/extensions/Echo/modules/styles/mw.echo.ui.PageNotificationsOptionWidget.less: SWAT: mw.echo.ui.PageNotificationsOptionWidget: Fix CSS after changes in OOjs UI T178439 (duration: 00m 51s)
16:50 hashar: contint1001: installed python3 (pending https://gerrit.wikimedia.org/r/385210 )
15:14 marostegui: Stop replication in sync on db1103 and db1072 -Â https://phabricator.wikimedia.org/T164488
15:02 mforns@tin: Finished deploy [analytics/refinery@0c9ba04]: (no justification provided) (duration: 00m 07s)
15:02 mforns@tin: Started deploy [analytics/refinery@0c9ba04]: (no justification provided)
14:36 mforns@tin: Finished deploy [analytics/refinery@0c9ba04]: (no justification provided) (duration: 03m 29s)
14:33 mforns@tin: Started deploy [analytics/refinery@0c9ba04]: (no justification provided)
13:20 moritzm: upgrading mw2120-mw2147 to wikidiff2 1.5.1
12:35 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2034 and db2036 - T178359 (duration: 00m 50s)
11:40 akosiaris: T177196 upload prometheus-postgres-exporter_0.2.0+ds-2 to apt.wikimedia.org/stretch-wikimedia/main and copied over to apt.wikimedia.org/jessie-wikimedia/main
11:10 moritzm: upgrading mw1276-mw1279 (canary servers) to wikidiff2 1.5.1
10:14 godog: upgrade prometheus-hhvm-exporter to 0.4-1
10:07 mobrovac@tin: Started restart [electron-render/deploy@8dd5f13]: Electron hanging - T174916
09:54 ppchelko@tin: Finished deploy [restbase/deploy@2001a66]: Enable summary on Cass 3 for all but wikipedia (duration: 07m 32s)
09:50 ariel@tin: Finished deploy [dumps/dumps@2b9326b]: improved job batches for rev content 7z recompression (duration: 00m 02s)
09:50 ariel@tin: Started deploy [dumps/dumps@2b9326b]: improved job batches for rev content 7z recompression
09:48 moritzm: installing pyjwt security updates
09:47 ppchelko@tin: Started deploy [restbase/deploy@2001a66]: Enable summary on Cass 3 for all but wikipedia
08:13 marostegui: Stop replication in sync on db1103 and db1072 - T164488
08:13 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1072 - T164488 (duration: 00m 50s)
07:31 marostegui: Stop MySQL on db2034 and db2036 to clone db2092
06:36 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db2034 and db2036 - T178359 (duration: 01m 08s)
06:02 marostegui: Compress revision table across db1044 databases (s3)
05:46 marostegui: Optimize recentchanges, pagelinks and templatelinks on db1102 for s6 - T174509 T177772
05:45 marostegui: Optimize recentchanges, pagelinks and templatelinks on db1064 - T174509 T177772
05:44 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1051 - T174509 (duration: 00m 50s)
05:41 marostegui: Optimize pagelinks and templatelinks on db1051 - T174509
05:34 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1055 - T174509 (duration: 00m 50s)
05:31 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1056 - T174509 (duration: 00m 50s)
05:29 awight@tin: Started deploy [ores/deploy@19b1851]: Push ORES w/ Celery 4 support (take 2), T178441
05:08 awight@tin: Finished deploy [ores/deploy@19b1851]: Push ORES w/ Celery 4 support, T178441 (duration: 00m 18s)
05:08 awight@tin: Started deploy [ores/deploy@19b1851]: Push ORES w/ Celery 4 support, T178441
03:38 bblack: cp3034 - upgraded vhtcpd to 0.1.0 for testing
03:38 bblack: cp3030 - upgraded vhtcpd to 0.1.0 for testing
03:01 l10nupdate@tin: ResourceLoader cache refresh completed at Thu Oct 19 03:01:54 UTC 2017 (duration 7m 2s)
02:54 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.4) (duration: 09m 28s)
02:29 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.3) (duration: 09m 40s)
00:01 maxsem@tin: Synchronized wmf-config/CommonSettings.php: https://gerrit.wikimedia.org/r/#/c/385106/2 (duration: 00m 50s)
2017-10-18
23:51 maxsem@tin: Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/385094/2 (duration: 00m 50s)
23:49 maxsem@tin: Synchronized static/images/project-logos/: https://gerrit.wikimedia.org/r/#/c/385094/2 (duration: 00m 50s)
23:45 maxsem@tin: Synchronized php-1.31.0-wmf.4/extensions/Collection/: https://gerrit.wikimedia.org/r/#/c/384903/ (duration: 00m 50s)
23:44 maxsem@tin: Synchronized php-1.31.0-wmf.3/extensions/Collection/: https://gerrit.wikimedia.org/r/#/c/385005/ (duration: 00m 51s)
23:34 maxsem@tin: Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/385007/ (duration: 00m 51s)
23:05 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group1 to wmf.4
23:04 demon@tin: Synchronized php: symlink swap (duration: 00m 49s)
22:55 tgr@tin: Synchronized private/: Remove the llama (T150554 ) (duration: 00m 50s)
22:27 tgr@tin: Synchronized wmf-config/: T174651 / gerrit 384908 - deploy ReadingLists to beta - noop for production (duration: 00m 53s)
20:49 ejegg: updated CiviCRM from 7c26791 to b9d2067
20:16 XioNoX: Changing MTUs on interfaces to NTT
19:19 bblack: uploaded vhtcpd-0.1.0-1 to jessie-wikimedia, testing on cp1008 only for now
19:17 andrewbogott: restarting nodepool
19:15 andrewbogott: stopping nodepool temporarily to give rabbit a chance to catch up
19:01 MaxSem: ran LoginNotify/maintenance/migratePreferences.php on test and test2
18:57 andrewbogott: restarting nova-network on labnet1001
18:42 andrewbogott: restarting rabbitmq-server on labcontrol1001; too many timeouts
16:29 elukey: stop ircecho on einstenium for puppet shower
16:27 herron: restarted apache on puppetmasters
14:23 moritzm: uploaded hhvm 3.18.5+dfsg-1+wmf1+deb9u1 to apt.wikimedia.org/stretch-wikimedia
14:03 chasemp: add static ip to cloud mwstake project per T178012
14:03 chasemp: bump up quota on cyberpower cloud project per T178332
14:03 chasemp: create reading-lists cloud project
13:28 zeljkof: EU SWAT finished!
13:22 zfilipin@tin: Synchronized php-1.31.0-wmf.3/extensions/WikimediaEvents/modules/ext.wikimediaEvents.searchSatisfaction.js: SWAT: [cirrus] Turn on recall A/B test on enwiki (T177502) (duration: 00m 50s)
13:21 gehel: repooling wdqs1003 now that it has catched up on updates
13:15 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: pagePreviews: Restart A/B test on enwiki and dewiki (T176469) (duration: 00m 51s)
13:05 moritzm: uploaded wikidiff2 1.5.1 to apt.wikimedia.org
12:11 mobrovac@tin: Finished deploy [restbase/deploy@3c7abf6]: Bug fix: Remove the space in the summary table name (duration: 07m 13s)
12:04 mobrovac@tin: Started deploy [restbase/deploy@3c7abf6]: Bug fix: Remove the space in the summary table name
11:54 mobrovac@tin: Finished deploy [restbase/deploy@99052c1]: Use Cass3 storage for ruwiki summaries (duration: 08m 59s)
11:47 jynus@tin: Synchronized wmf-config/db-eqiad.php: Add db1105 to help db1071 because db1082 has crashed (duration: 00m 50s)
11:45 mobrovac@tin: Started deploy [restbase/deploy@99052c1]: Use Cass3 storage for ruwiki summaries
11:31 moritzm: upgrading mw1262-mw1265 (canaries) to wikidiff2 1.5.1
11:26 marostegui: Optimize pagelinks and templatelinks on db1102 s7 - T174509
11:21 moritzm: upgrading mw1261 to wikidiff2 1.5.1
11:11 mobrovac@tin: Finished deploy [restbase/deploy@15619e0]: Introduce the page/summary proxy (no-op for now), part #2 (duration: 08m 53s)
11:03 mobrovac@tin: Started deploy [restbase/deploy@15619e0]: Introduce the page/summary proxy (no-op for now), part #2
11:02 mobrovac@tin: Finished deploy [restbase/deploy@15619e0]: Introduce the page/summary proxy (no-op for now) (duration: 05m 23s)
10:57 mobrovac@tin: Started deploy [restbase/deploy@15619e0]: Introduce the page/summary proxy (no-op for now)
10:50 moritzm: rebooting multatuli for kernel update
10:21 moritzm: imported linux 4.9.30-2+deb9u5~bpo8+1 for jessie-wikimedia
09:51 jynus@tin: Synchronized wmf-config/db-eqiad.php: Repool db1098, depool db1082 (crashed) (duration: 00m 50s)
09:39 moritzm: installing xserver/xvfb security updates
09:36 jynus: reloading dbproxy1010's haproxy configuration
09:30 elukey: drop MobileWikiAppToCInteraction_10375484_15423246 from the log database on dbstore1002,db1047,db1046 - T177960
08:41 hoo@tin: Synchronized wmf-config/InitialiseSettings.php: Re-enable Statement usage tracking on cawiki (T151717 ) (duration: 00m 50s)
08:25 marostegui: Stop MySQL on db2019 to copy its data to db2084 - T178359
08:23 mobrovac@tin: Finished deploy [changeprop/deploy@065a06e]: Bug fix: Apply ignore_errors on a per-request basis (duration: 01m 17s)
08:22 mobrovac@tin: Started deploy [changeprop/deploy@065a06e]: Bug fix: Apply ignore_errors on a per-request basis
08:09 hoo@tin: Synchronized wmf-config/InitialiseSettings.php: Enable description usage tracking on a few test wikis (T177155 ) (duration: 00m 50s)
07:49 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1056 - T174509 (duration: 00m 49s)
07:46 marostegui: Optimize pagelinks and templatelinks on db1056 - T174509
07:44 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1034 - T174509 (duration: 00m 59s)
07:14 gehel: depooling wdqs1003 until it catches up on updates
07:02 marostegui: Stop MySQL on db2079 to copy its data to db2084 - T178359
06:46 _joe_: powercycling wdqs1003
05:48 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1072 - T164488 (duration: 00m 49s)
05:43 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1055 - T174509 (duration: 00m 50s)
05:40 marostegui: Optimize pagelinks and templatelinks on db1055 - T174509
05:37 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1098 - T174509 (duration: 00m 49s)
05:36 marostegui: Optimize pagelinks and templatelinks on db1098 - T174509
05:27 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1067 - T174509 (duration: 00m 50s)
05:25 marostegui: Optimize pagelinks and templatelinks on db1102 for s2 - T174509
03:17 l10nupdate@tin: ResourceLoader cache refresh completed at Wed Oct 18 03:17:01 UTC 2017 (duration 7m 3s)
03:09 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.4) (duration: 14m 58s)
02:33 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.3) (duration: 08m 21s)
2017-10-17
23:31 thcipriani@tin: Synchronized php-1.31.0-wmf.4/extensions/Citoid/modules/ve.ui.CiteFromIdInspector.css: SWAT: ve.ui.CiteFromIdInspector: Fix CSS for context menus after changes in OOjs UI T178324 (duration: 00m 50s)
23:29 thcipriani@tin: Synchronized php-1.31.0-wmf.4/extensions/VisualEditor/modules/ve-mw/ui/styles/widgets/ve.ui.MWMediaInfoFieldWidget.css: SWAT: ve.ui.MWMediaInfoFieldWidget: Fix positioning of icons T178415 (duration: 00m 50s)
23:22 thcipriani@tin: Synchronized php-1.31.0-wmf.3/includes/shell/Command.php: SWAT: Shell\Command: Better walltime fallback T178314 (duration: 00m 51s)
23:15 thcipriani@tin: Synchronized wmf-config/CommonSettings.php: SWAT: Remove setting no longer in MediaWiki (duration: 00m 50s)
23:06 mutante: disabling Icinga notifications for service recommendation_api on scb hosts - please remember to re-enable once ticket is resolved (T178445 ) (https://icinga.wikimedia.org/cgi-bin/icinga/status.cgi?search_string=recommendation_api%20endpoints )
22:28 ejegg: updated CiviCRM from fb513cc to 7c26791
21:55 foks: Removed 2FA from User:Amakuru
21:39 robh: cp4026 being returned to service post memory replacement. puppet runs fine and all icinga checks are green. repooled.
21:23 robh: cp4026 coming down for memory swap, no point in maint moding the system since other cp checks will alert anyhow
20:14 mutante: gerrit2001 - re-enabled puppet, attempting restart with --slave after gerrit:384758
19:54 mutante: cobalt - re-enabled puppet, ran puppet, systemd says gerrit is active(running)
19:51 mutante: cobalt - stopping gerrit service, confirming pid file gets removed, move /etc/init.d/gerrit to /root
19:50 mutante: short gerrit downtime for systemd change coming up
19:23 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group0 to wmf.4
19:06 mutante: gerrit2001 - rebooting to verify systemd change and service comes back properly
18:53 mutante: gerrit2001: stopping the gerrit service
18:52 mutante: cobalt: temp disable puppet | gerrit2001: stop puppet
18:22 paravoid: rebooting sodium for kernel/distro upgrade
18:14 paravoid: dist-upgrading sodium to stretch
18:12 demon@tin: Finished scap: bootstrap wmf.4 (duration: 48m 41s)
17:23 demon@tin: Started scap: bootstrap wmf.4
16:49 demon@tin: Pruned MediaWiki: 1.31.0-wmf.2 [keeping static files] (duration: 01m 24s)
16:35 gehel: restart tilerator / tileratorui on all maps cluster after deployment of latest fix - T177389
16:34 gehel@tin: Finished deploy [tilerator/deploy@f3b26f3]: fix tileshell not exiting - T177389 (duration: 01m 42s)
16:32 gehel@tin: Started deploy [tilerator/deploy@f3b26f3]: fix tileshell not exiting - T177389
15:46 _joe_: killed a stuck gerrit process on cobalt
14:47 bd808: wikitech: Cleaned up 'importers' user_group entries (T171682 )
14:25 Amir1: end of cleaning up ores_classification table in plwiki, start of ruwiki (T159753 )
14:09 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1081 - T174509 T177772 (duration: 00m 45s)
14:05 Amir1: start of cleaning up ores_classification table in plwiki (T159753 )
14:00 zeljkof: EU SWAT finished
13:59 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: [cirrus] Fix typo in UserTesting config var (take 2) (T177502) (duration: 00m 45s)
13:47 zeljkof: EU SWAT continues
13:31 zeljkof: stopping EU SWAT
13:29 jynus: killing commonswiki script on tin
13:17 zfilipin@tin: Synchronized php-1.31.0-wmf.3/extensions/UniversalLanguageSelector/maintenance/ULSCompactLinksDisablePref.php: SWAT: Remove the 20 edits threshold from ULSCompactLinksDisablePref.php (duration: 00m 46s)
12:53 volans: upgrading cumin masters to cumin 1.2.2-1
12:40 moritzm: upgrading mwdebug* to hhvm-wikidiff 1.5.1
11:47 marostegui: Optimize recentchanges on db1102 for s4 - T174509
11:47 marostegui: Optimize templatelinks and pagelinks on db1102 for s4 - T174509
11:45 Amir1: end of cleaning up ores_classification table in hewiki, start of nlwiki (T159753 )
11:39 Amir1: end of cleaning up ores_classification table in fiwiki, start of hewiki (T159753 )
11:32 jynus: test-installing proxysql on wasat T175672
11:24 Amir1: end of cleaning up ores_classification table in fawiki, start of fiwiki (T159753 )
11:12 gehel: restarting wdqs-updater on wdqs1004 - T175919
10:40 Amir1: end of cleaning up ores_classification table in etwiki, start of fawiki (T159753 )
10:36 Amir1: end of cleaning up ores_classification table in cswiki, start of etwiki (T159753 )
10:26 Amir1: start of cleaning up ores_classification in cswiki (T159753 )
10:16 akosiaris: removing akosiaris's yubikey from network devices
09:46 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1085 - T174509 T177772 (duration: 00m 46s)
09:23 marostegui: Stop MySQL on db2084 for multi-instance testing
09:16 mobrovac@tin: Finished deploy [restbase/deploy@7f228d4]: Remove /page/revision and add history metrics end points, part #3 (duration: 07m 34s)
09:08 mobrovac@tin: Started deploy [restbase/deploy@7f228d4]: Remove /page/revision and add history metrics end points, part #3
09:08 mobrovac@tin: Finished deploy [restbase/deploy@3bcd1b7]: Remove /page/revision and add history metrics end points, part #2 (duration: 25m 57s)
08:45 gehel: restarting blazegraph on wdqs1004 for GC tuning (adding -XX:+G1PrintRegionLivenessInfo) - T175919
08:42 mobrovac@tin: Started deploy [restbase/deploy@3bcd1b7]: Remove /page/revision and add history metrics end points, part #2
08:40 mobrovac@tin: Finished deploy [restbase/deploy@3bcd1b7]: Remove /page/revision and add history metrics end points - T158100 T175805 (duration: 01m 05s)
08:39 mobrovac@tin: Started deploy [restbase/deploy@3bcd1b7]: Remove /page/revision and add history metrics end points - T158100 T175805
07:54 marostegui: Stop MySQL and reboot db2081 to see if it works fine - T178140
07:53 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Add db2086 to the config - T170662 (duration: 00m 45s)
07:52 marostegui@tin: Synchronized wmf-config/db-codfw.php: Add db2086 to the config - T170662 (duration: 00m 45s)
07:51 marostegui@tin: Synchronized wmf-config/db-codfw.php: Add db2086 to the config - T170662 (duration: 00m 45s)
07:42 marostegui: Stop MySQL on db2079 to clone db2086 - T170662
07:36 moritzm: rebooting boron for kernel update
07:31 ema: upgrade misc_eqiad to varnish 5 T177233
07:13 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1067 - T174509 (duration: 00m 45s)
07:11 marostegui: Optimize templatelinks and pagelinks on db1067 - T174509
07:10 marostegui: Optimize enwiki.ores_classification on db1067 - T159753
06:55 moritzm: installing libffi security updates on trusty (Debian already fixed)
06:07 marostegui: Stop replication in sync on db1072 and db1103 for data drift fixing - T164488
05:57 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1034 - T174509 (duration: 00m 45s)
05:54 marostegui: Optimize pagelinks and templatelinks on db1034 - T174509
05:50 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1085 - T174509 T177772 (duration: 00m 45s)
05:46 marostegui: Optimize pagelinks, templatelinks and recentchanges on db1085 T177772 T174509
05:42 marostegui: Optimize pagelinks, templatelinks and ores_classification on db1095 - T174509 T159753
05:40 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1081 - T174509 (duration: 00m 45s)
05:38 marostegui: Optimize recentchanges on db1081 - T177772
05:37 marostegui: Optimize pagelinks and templatelinks on db1081 - T174509
05:34 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1090 - T174509 (duration: 00m 45s)
05:32 marostegui: Optimize pagelinks and templatelinks on db1090 - T174509
05:29 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1073 - T174509 (duration: 00m 46s)
02:32 l10nupdate@tin: ResourceLoader cache refresh completed at Tue Oct 17 02:32:30 UTC 2017 (duration 6m 57s)
02:25 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.3) (duration: 08m 23s)
00:34 ejegg: updated CiviCRM from 824c2d8 to fb513cc
2017-10-16
23:49 thcipriani@tin: Synchronized php-1.31.0-wmf.3/extensions/CheckUser/specials/SpecialCheckUser.php: SWAT: Revert "Revert "Restore checkuser-userlinks when exist"" T170507 (duration: 00m 46s)
23:46 ebernhardson: increase elasticsearch log levels for transport to debug to help track down T167410
23:39 ejegg: updated civicrm from b953064 to 824c2d8
23:25 thcipriani@tin: Synchronized php-1.31.0-wmf.3/extensions/Echo/includes/ContainmentSet.php: SWAT: ContainmentSet: Use strict comparison for array_search() T177825 (duration: 01m 45s)
20:35 chasemp: disable puppet around cloud things for a refactor merge
20:20 demon@tin: Pruned MediaWiki: 1.30.0-wmf.18 (duration: 02m 59s)
19:59 demon@tin: Synchronized wmf-config/CommonSettings.php: Ief0ea01f (duration: 00m 46s)
19:58 jynus: sudo apt install wmf-mariadb101-client on terbium for debugging an issue
19:57 demon@tin: Synchronized wmf-config/InitialiseSettings.php: Ief0ea01f (duration: 00m 46s)
19:56 demon@tin: Synchronized tests/cirrusTest.php: Ief0ea01f (duration: 00m 47s)
18:18 thcipriani@tin: Synchronized wmf-config/Wikibase.php: SWAT: Add configuration for statement indexing for Wikidata T175199 (duration: 00m 47s)
17:35 ariel@tin: Finished deploy [dumps/dumps@2b9326b]: do the minimum for latestlinks job, no updating other files (duration: 00m 03s)
17:35 ariel@tin: Started deploy [dumps/dumps@2b9326b]: do the minimum for latestlinks job, no updating other files
17:08 gehel@tin: Finished deploy [wdqs/wdqs@b42eb0d]: WDQS GUI uipdates (duration: 02m 20s)
17:06 gehel@tin: Started deploy [wdqs/wdqs@b42eb0d]: WDQS GUI uipdates
16:56 ottomata: deploying LVS for druid-public-broker https://phabricator.wikimedia.org/T176223
16:39 joal@tin: Finished deploy [analytics/aqs/deploy@339674c]: Deploy new mediawiki-history endpoints (alpha version) after monitorin fix (duration: 04m 42s)
16:35 joal@tin: Started deploy [analytics/aqs/deploy@339674c]: Deploy new mediawiki-history endpoints (alpha version) after monitorin fix
16:23 joal@tin: Finished deploy [analytics/aqs/deploy@aa7ef0d]: Deploy new mediawiki-history endpoints (alpha version) after node-modules fix (duration: 06m 56s)
16:16 joal@tin: Started deploy [analytics/aqs/deploy@aa7ef0d]: Deploy new mediawiki-history endpoints (alpha version) after node-modules fix
16:09 ema: upgrade misc_codfw to varnish 5 T177233
16:05 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1084 - T174509 T177772 (duration: 00m 46s)
15:51 gehel: killing stuck tileshell process on maps-test2001 - T177389
15:51 gehel@tin: Finished deploy [tilerator/deploy@f3b26f3]: testing tileshell fix - T177389 (duration: 00m 20s)
15:50 gehel@tin: Started deploy [tilerator/deploy@f3b26f3]: testing tileshell fix - T177389
15:45 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1094 - T174509 (duration: 00m 46s)
15:38 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1088 - T174509 T177772 (duration: 00m 47s)
14:34 twentyafterfour: deploying hotfix for T178068 and T178107
14:30 moritzm: uploaded wikidiff 1.5.1 to apt.wikimedia.org for jessie/experimental
14:08 hashar: European swat is complete
14:08 hashar@tin: Synchronized wmf-config/abusefilter.php: (T177760 ) Enable AbuseFilter profiler | (T177761 ) Allow AbuseFilter to block & grant relevant permissions to permit administrators to manage the restricted option. (duration: 00m 47s)
13:50 moritzm: upgrading deployment-mediawiki0[56] to wikidiff2 1.5.1
13:45 hashar@tin: Synchronized wmf-config/: [cirrus] Prepare profiles for the recall A/B test on enwiki - T177502 (duration: 00m 48s)
13:44 moritzm: upgrading deployment-mediawiki04 to wikidiff2 1.5.1
13:42 hashar@tin: Synchronized wmf-config: [cirrus] Disable A/B test of MLR on testing on 18 wikis with > 1% of search traffic - T177490 (duration: 00m 48s)
13:38 hashar@tin: Synchronized wmf-config/InitialiseSettings.php: Add additional namespaces to search results for bnwikisource - T178041 (duration: 00m 49s)
12:59 gehel: restart cassandra on maps1001
12:38 ema: upgrading eqiad LVSs to pybal 1.14.2 (T178149 , T177815 )
11:28 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1074 - T174509 (duration: 00m 46s)
11:22 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1070 - T174509 (duration: 00m 46s)
11:10 mobrovac@tin: Started restart [electron-render/deploy@8dd5f13]: Electron hanging - T174916
10:48 Amir1: cleaning up ores_classification table in wikidatawiki (T159753 )
10:03 ema: upgrading codfw LVSs to pybal 1.14.2 (T178149 , T177815 )
09:54 ema: upgrading esams LVSs to pybal 1.14.2 (T178149 , T177815 )
09:44 ema: upgrading ulsfo LVSs to pybal 1.14.2 (T178149 , T177815 )
09:43 ema: pybal 1.14.2 uploaded to apt.w.o
09:40 moritzm: restarting cassandra on restbase1007 to pick up NSS security update
09:39 moritzm: restarting cassandra on cerium/xenon/praseodymium to pick up NSS security update
09:37 gehel: restarting elasticsearch on elastic1017
09:10 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1088 - T174509 T177772 (duration: 00m 46s)
09:07 marostegui: Optimize recentchanges, pagelinks and templatelinks on db1088 -Â https://phabricator.wikimedia.org/T174509 Â https://phabricator.wikimedia.org/T177772
09:02 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1093 - T174509 T177772 (duration: 00m 46s)
08:53 marostegui: Stop MySQL on db1050 as it will be decommissioned - T178162
08:46 marostegui: Remove db1050 from tendril - T178162
08:31 marostegui@tin: Synchronized wmf-config/db-codfw.php: Remove db1050 from config - T178162 (duration: 00m 46s)
08:30 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Remove db1050 from config - T178162 (duration: 00m 46s)
07:56 marostegui: Stop db1103 and db1038 at the same position to fix data drifts - T164488
07:42 moritzm: installing NSS security updates
07:30 marostegui: Stop db1103 and db1072 at the same position to fix data drifts - T164488
07:22 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1072 - T164488 (duration: 00m 46s)
07:19 marostegui: Optimize table ores_classification on db1073 - T159753
07:14 marostegui@tin: Synchronized wmf-config/db-codfw.php: Add db2084 to the config - T170662 (duration: 00m 46s)
07:13 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Add db2084 to the config - T170662 (duration: 00m 47s)
06:25 marostegui: Stop MySQL on db2079 to clone db2084 from it - T170662
06:12 marostegui: Optimize pagelinks and templatelinks on db1094 - T174509
06:04 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1093 - T174509 T177772 (duration: 00m 46s)
06:04 marostegui: Optimize recentchanges, pagelinks and templatelinks on db1093 - T174509 T177772
06:00 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1070 - T174509 (duration: 00m 46s)
05:59 marostegui: Optimize recentchanges, pagelinks and templatelinks on db1070 - T174509
05:54 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1084 - T174509 T177772 (duration: 00m 46s)
05:53 marostegui: Optimize recentchanges, pagelinks and templatelinks on db1084 - T174509 T177772
05:47 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1074 - T174509 (duration: 00m 46s)
05:47 marostegui: Optimize pagelinks and templatelinks on db1074 - T174509
05:37 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1073 - T174509 (duration: 00m 47s)
05:37 marostegui: Optimize templatelinks and pagelinks on db1073 - T174509
05:36 marostegui: Optimize templatelinks and pagelinks on db1073 - 174509
02:35 l10nupdate@tin: ResourceLoader cache refresh completed at Mon Oct 16 02:35:34 UTC 2017 (duration 6m 36s)
02:28 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.3) (duration: 07m 44s)
2017-10-15
11:15 mobrovac@tin: Started restart [electron-render/deploy@8dd5f13]: Electron hanging - T174916
2017-10-14
03:52 paravoid: manually cleaning up pdns_metric_cron from all dnsrecursors (hydrogen/chromium/acamar/achernar/maerlant/nescio)
2017-10-13
23:04 mutante: scb1001 - systemctl restart pdfrender (Icinga said: pdfrender Connection refused)
20:36 mutante: gerrit2001 - removing hhvm package (which is now possible without also removing scap, thanks thcipriani), apt-get autoremove to remove more unused packages (T178039 )
18:52 joal@tin: Finished deploy [analytics/aqs/deploy@0375c34]: Deploy new mediawiki-history endpoints-alpha version (duration: 15m 02s)
18:40 joal@tin: (no justification provided)
18:37 joal@tin: Started deploy [analytics/aqs/deploy@0375c34]: Deploy new mediawiki-history endpoints-alpha version
16:54 otto@tin: Finished deploy [analytics/refinery@28db253]: (no justification provided) (duration: 05m 50s)
16:48 otto@tin: Started deploy [analytics/refinery@28db253]: (no justification provided)
15:31 jynus@tin: Synchronized wmf-config/db-eqiad.php: Repool db1056 (duration: 00m 46s)
14:59 mutante: added cparle to wmf LDAP group
14:38 jynus: stopping slave on db1056 and optimize recentchanges
14:25 jynus@tin: Synchronized wmf-config/db-eqiad.php: Repool db1100, depool db1056, fix db1098 weight (duration: 00m 46s)
14:06 jynus@tin: Synchronized wmf-config/db-eqiad.php: Repool db1053 (duration: 00m 46s)
13:07 jynus@tin: Synchronized wmf-config/db-eqiad.php: Depool db1053 (duration: 00m 47s)
12:57 godog: upload scap 3.7.1-1 - T127762
11:27 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Add db2082 to the config - T170662 (duration: 00m 46s)
11:26 marostegui@tin: Synchronized wmf-config/db-codfw.php: Add db2082 to the config - T170662 (duration: 00m 46s)
10:53 jynus@tin: Synchronized wmf-config/db-eqiad.php: Repool db1098 (duration: 00m 47s)
10:22 godog: roll-restart pybal to apply https://gerrit.wikimedia.org/r/#/c/383997/ - T178149
10:04 jynus: stopping slave on db1098 and optimize recentchanges
09:54 jynus@tin: Synchronized wmf-config/db-eqiad.php: Depool db1098 (duration: 00m 46s)
09:22 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Add db2082 to the config - T170662 (duration: 00m 46s)
09:22 Amir1: a small clean up of ores_classification table in wikidatawiki (T159753 )
09:21 marostegui@tin: Synchronized wmf-config/db-codfw.php: Add db2082 to the config - T170662 (duration: 00m 47s)
09:04 godog: restart pybal on lvs1003 to test logstash depool - T178078
08:57 marostegui: Stop db1103 and db1038 in sync for more checksumming - T164488
08:34 ema: upgrade pybal on lvs1003 to 1.14.1 T177815
08:18 ema: upgrade pybal on lvs1006 to 1.14.1 T177815
08:15 jynus: restarting commons wiki recentchanges purge of wb entries T177772
08:11 ema@neodymium: conftool action : set/pooled=yes; selector: name=logstash1001.eqiad.wmnet,service=logstash-gelf,dc=eqiad
08:11 ema@neodymium: conftool action : set/pooled=no; selector: name=logstash1001.eqiad.wmnet,service=logstash-gelf,dc=eqiad
08:10 ema: depool/repool logstash1001 (service=logstash-gelf) T178078
07:35 marostegui@tin: Synchronized wmf-config/db-codfw.php: Add db2081 to the config - T170662 (duration: 00m 46s)
07:34 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Add db2081 to the config - T170662 (duration: 00m 47s)
06:23 marostegui: Stop MySQL on db2080 to copy its data to db2081 - T170662
00:22 mutante: authdns servers: deleted /etc/ganglia/conf.d/gndsd.pyconf & /usr/lib/ganglia/python_modules_gdnsd.py - removing ganglia stats
2017-10-12
23:58 dereckson@tin: Synchronized wmf-config/InitialiseSettings.php: Restore thanks limit on pl.wikipedia (T169268 ) (duration: 00m 47s)
23:50 cwd: re-enabled omnimail jobs
23:35 eileen1: update from 2d6668b to b953064
23:25 dereckson@tin: Synchronized wmf-config/CommonSettings.php: Add logging for email blocks (T175419 ) (duration: 00m 46s)
23:13 dereckson@tin: Synchronized wmf-config/InitialiseSettings.php: Switch test wikis to HTML5 fragment mode in links (T152540 ) (duration: 00m 47s)
22:44 demon@tin: Synchronized php-1.31.0-wmf.3/includes/specialpage/ChangesListSpecialPage.php: silence notices (duration: 00m 47s)
21:36 cwd: disabled omnimail jobs
19:47 ejegg: updated CiviCRM from 47de976 to 2d6668b
19:46 chasemp: disable puppet for lab*services* for merge of 383896
19:18 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group2 to wmf.3
18:28 mutante: added Pablo (pgrass) to LDAP group 'wmde' (T177599 )
18:15 demon@tin: Synchronized php-1.31.0-wmf.3/extensions/ORES/includes/Hooks.php: Make watchlist queries suck less (duration: 00m 50s)
18:01 demon@tin: Synchronized php-1.31.0-wmf.3/extensions/JsonConfig/includes/JCCache.php: Silence junk warnings (duration: 00m 50s)
15:17 mobrovac@tin: Started restart [electron-render/deploy@8dd5f13]: Electron hanging - T174916
15:11 mutante: gerrit2001 - rebooting for kernel upgrade
14:57 mutante: netmon1002 - arming keyholder for rancid deploy
14:52 mutante: netmon1002 (librenms, rancid) - rebooting for kernel upgrade
14:47 moritzm: rebooting kubernetes workers to new 4.9.51 kernel
14:41 moritzm: rebooting kubernetes staging servers to new 4.9.51 kernel
14:29 moritzm: rebooting kubernetes masters to new 4.9.51 kernel
13:54 moritzm: updating image scalers to librsvg 2.40.18
13:48 elukey: deployed the new Analytics Public Druid cluster - T176223
13:45 moritzm: updating remaining thumbor servers to librsvg 2.40.18
13:40 ladsgroup@tin: Synchronized wmf-config/Wikibase.php: SWAT: Remove deprecated config variable for Wikibase (T129475 ) (duration: 01m 02s)
13:33 moritzm: updating thumbor1001 to librsvg 2.40.18
13:13 moritzm: rolling restart of logstash to pick up thumbor logs (T150734 )
13:07 zeljkof: EU SWAT finished
12:56 moritzm: uploaded librsvg 2.40.18 for jessie-wikimedia to apt.wikimedia.org
11:46 ema: upgrading cp3010 to Varnish 5 T177233
11:27 Amir1: finished cleanup of ores_classification in wikidatawiki, still needs more work (T159753 )
10:03 ema: lvs1005: upgrade pybal to 1.14.1 T177815
10:02 ema: pybal 1.14.1 uploaded to apt.w.o
09:40 ema: upgrading cp3008 to Varnish 5 T177233
09:06 moritzm: installing nettle update from stretch 9.2 point release
09:01 moritzm: installing ncurses security updates
08:25 moritzm: installing gnutls28 update from stretch 9.2 point release
08:02 Amir1: starting cleanup of ores_classification in wikidatawiki (T159753 )
07:51 gehel: reinitialize cassandra on maps-test2004 to test vector tiles - T153282
07:40 moritzm: installing at-spi2-core update from stretch 9.2 point release
07:24 moritzm: installing gnupg2 update from stretch 9.2 point release
07:18 legoktm: purging html5-misnesting linter entries from database on terbium (T178040 )
03:25 l10nupdate@tin: ResourceLoader cache refresh completed at Thu Oct 12 03:25:17 UTC 2017 (duration 7m 17s)
03:18 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.3) (duration: 15m 57s)
02:41 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.2) (duration: 15m 39s)
00:30 twentyafterfour: phabricator update finished (a while ago, I forgot to log it)
00:08 twentyafterfour: upgrading phabricator to #phab-2017-10-11
00:05 mutante: netmon2001 arm keyholder for rancid deploy
00:01 twentyafterfour: Preparing to take phabricator offline for a hopefully momentary upgrade.
2017-10-11
23:55 mutante: netmon2001 - smokeping back up - the reboot, as a side-effect, also resolved the systemd-icinga-alert that was pending in Icinga for a while with comments
23:52 thcipriani@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable Vector print logo and print styles on all wikis T169732 PART II (duration: 00m 50s)
23:50 thcipriani@tin: Synchronized wmf-config/CommonSettings.php: SWAT: Enable Vector print logo and print styles on all wikis T169732 PART I (duration: 00m 50s)
23:47 mutante: netmon2001 (smokeping) - kernel upgrade, schedule downtime, reboot
23:40 thcipriani@tin: Synchronized php-1.31.0-wmf.2/skins/Vector/ResourceLoaderLessModule.php: SWAT: Print logo should use an absolute URI T177800 (duration: 00m 49s)
23:38 mutante: netmon1003 - servermon back up - apt-get autoremove to drop unrequired python packages
23:37 thcipriani@tin: Synchronized php-1.31.0-wmf.3/skins/Vector/ResourceLoaderLessModule.php: SWAT: Print logo should use an absolute URI T177800 (duration: 00m 50s)
23:35 mutante: netmon1003 - cant seem to reboot normally, attempting via gnt-instance command
23:31 mutante: netmon1003 (servermon) - upgrading kernel (jessie), schedule icinga downtime, rebooting, short downtime for servermon itself
23:31 thcipriani@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Parser: Switch from Tidy to Remex on nowiki T177989 (duration: 00m 50s)
23:27 thcipriani@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Parser: Switch from Tidy to Remex on fawiki T176150 (duration: 00m 50s)
23:23 thcipriani@tin: Synchronized php-1.31.0-wmf.3/extensions/VisualEditor/modules/ve-mw/init/styles/ve.init.MWVESwitchConfirmDialog.css: SWAT: Fix WikiEditor mode switcher widget (duration: 00m 50s)
23:19 thcipriani@tin: Synchronized php-1.31.0-wmf.2/includes/specials/SpecialUnblock.php: SWAT: Fix unblocking autoblocks T177952 (duration: 00m 50s)
23:18 mutante: releases1001 upgrading kernel, installed linux-image-amd64, took out of cache::misc, upgrade libnss3, scheduled downtime, rebooting ...
23:17 thcipriani@tin: Synchronized php-1.31.0-wmf.3/includes/specials/SpecialUnblock.php: SWAT: Fix unblocking autoblocks T177952 (duration: 00m 51s)
22:01 volans: uploaded cumin_1.2.2-1_amd64.deb to apt.wikimedia.org jessie-wikimedia
21:44 robh: cp4009 shutdown accidental, will boot back up immediately
21:13 mutante: releases2001 - scheduled downtime, rebooting
21:07 mutante: running puppet on cache misc hosts to temp remove codfw backend of releases.wm.org, then rebooting releases2001 for kernel upgrade
20:22 subbu: updated Parsoid to ddf7b293 (T138492 , T135667 , T177612 )
20:18 ssastry@tin: Finished deploy [parsoid/deploy@938e305]: Updating Parsoid to ddf7b293 (duration: 08m 42s)
20:17 hashar@tin: Finished deploy [jobrunner/jobrunner@a20d043]: Services now exit(0) when catching a signal - T168044 (duration: 02m 54s)
20:14 hashar@tin: Started deploy [jobrunner/jobrunner@a20d043]: Services now exit(0) when catching a signal - T168044
20:13 hashar@tin: Finished deploy [jobrunner/jobrunner@a20d043]: Services now exit(0) when catching a signal - T168044 (duration: 00m 02s)
20:13 hashar@tin: Started deploy [jobrunner/jobrunner@a20d043]: Services now exit(0) when catching a signal - T168044
20:11 hashar@tin: Started restart [jobrunner/jobrunner@a20d043]: Services now exit(0) when catching a signal - T168044
20:10 bsitzmann@tin: Finished deploy [mobileapps/deploy@15a52ba]: Update mobileapps to c045afc (T177301 ) (duration: 05m 25s)
20:09 ssastry@tin: Started deploy [parsoid/deploy@938e305]: Updating Parsoid to ddf7b293
20:07 hashar@tin: Started restart [jobrunner/jobrunner@a20d043]: Services now exit(0) when catching a signal - T168044
20:05 bsitzmann@tin: Started deploy [mobileapps/deploy@15a52ba]: Update mobileapps to c045afc (T177301 )
20:04 hashar@tin: Started restart [jobrunner/jobrunner@a20d043]: Services now exit(0) when catching a signal - T168044
20:01 chasemp: disable puppet around cloud things to roll refactor out slowly
19:29 mutante: releases1001 - same as 2001, upgraded jenkins to 2.73.2, kept existing config (T177962 )
19:27 mutante: releases2001 - upgraded jenkins to 2.73.2, kept existing config (vs overwriting with package config)
19:20 mutante: apt: reprepro copy stretch-wikimedia jessie-wikimedia jenkins
19:16 mutante: releases1001/releases2001 - apt-get autoremove to drop non-required python packages
19:03 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group1 to wmf.3
19:02 demon@tin: Synchronized php: symlink swap (duration: 00m 49s)
18:47 gehel: killing stuck osm replication on maps-test2001
18:23 demon@tin: Synchronized wmf-config/Wikibase-labs.php: no-op, labs only (duration: 00m 59s)
18:19 demon@tin: Finished scap: wmf.3 bootstrap (duration: 44m 56s)
17:58 herron: ran /home/faidon/update-netboot-stretch.sh on puppetmaster1001
17:49 moritzm: uploaded jenkins 2.73.2 for jessie-wikimedia to apt.wikimedia.org
17:34 demon@tin: Started scap: wmf.3 bootstrap
17:32 demon@tin: Synchronized php: symlink swap to wmf.2, somehow got missed (duration: 00m 45s)
17:29 demon@tin: Synchronized scap/plugins/prep.py: No-op (duration: 01m 49s)
17:09 demon@tin: Pruned MediaWiki: 1.30.0-wmf.17 (duration: 02m 20s)
17:06 demon@tin: Pruned MediaWiki: 1.30.0-wmf.16 (duration: 02m 40s)
16:54 demon@tin: Pruned MediaWiki: 1.31.0-wmf.1 [keeping static files] (duration: 01m 44s)
16:24 hasharAway: Upgrade jenkins Maven integration plugin to 3.0 - T177962
16:15 godog: roll-restart thumbor to apply https://gerrit.wikimedia.org/r/380942 - T150734
16:12 hasharAway: Upgrading Jenkins CI T177962
16:09 gehel: decommission maps-test2004 from its cassandra cluster (free a node to test vector tiles) - T153282
16:09 XioNoX: resuming PDU upgrade
15:39 elukey: drop tables listed in https://phabricator.wikimedia.org/T171629#3674250 from db1046, db1047, dbstore1002
15:35 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1086 - T174509 (duration: 00m 47s)
15:21 _joe_: stopped redis on ocg* T177931
15:08 _joe_: ocg*: stop ocg; mv /srv/deployment /srv/stale, update-rc.d ocg disable, rm /etc/init/ocg.conf - T177931
14:55 _joe_: manually removing IPVS entries for ocg on eqiad LBs, T177931
14:47 _joe_: rolling restart of low-traffic pybals in eqiad for T177931
14:36 moritzm: installing dbus update from stretch 9.2 point release
13:40 hashar: European SWAT completed
13:39 hashar@tin: Synchronized wmf-config/InitialiseSettings.php: Remove compact language links dblist for simplicity (duration: 00m 46s)
13:38 hashar@tin: Synchronized docroot: Remove compact language links dblist for simplicity (duration: 00m 48s)
13:37 hashar@tin: Synchronized dblists: Remove compact language links dblist for simplicity (duration: 00m 47s)
13:36 hashar@tin: Synchronized wmf-config/CommonSettings.php: Remove compact language links dblist for simplicity (duration: 00m 47s)
13:29 hashar@tin: Synchronized wmf-config/InitialiseSettings.php: Configure wmgBabelMainCategory for the Dinka Wikipedia (duration: 00m 47s)
13:27 hashar@tin: Synchronized wmf-config/InitialiseSettings.php: Enable mapframe on eswiki - T177695 (duration: 00m 47s)
13:24 hashar@tin: Synchronized wmf-config/Wikibase-production.php: Enable constraint checks on qualifiers and references - T176863 (duration: 00m 47s)
13:21 moritzm: installing db5.3 security updates
13:03 godog: test graphite-web patch on labmon1001 - T177747
12:54 hoo@tin: Synchronized wmf-config/InitialiseSettings.php: Disable Statement usage tracking on cawiki (T151717 ) (duration: 00m 47s)
12:53 marostegui: Start recentchanges purge on s4 primary master T177772
12:48 marostegui: Kill recentchanges purge on s4 primary master - https://phabricator.wikimedia.org/T177772
12:44 hoo@tin: Synchronized wmf-config/InitialiseSettings.php: (temp) Disable Statement usage tracking on cawiki (T151717 ) (duration: 00m 48s)
12:38 hoo@tin: Synchronized wmf-config/InitialiseSettings.php: Enable Statement usage tracking on cawiki and cewiki (T151717 ) (duration: 00m 47s)
12:32 hoo@tin: Synchronized wmf-config/: Move WB client "disabledUsageAspects" setting into $wmgWikibaseDisabledUsageAspects (duration: 00m 48s)
12:30 hoo@tin: Synchronized wmf-config/InitialiseSettings.php: Introduce $wmgWikibaseDisabledUsageAspects (duration: 00m 47s)
12:27 hoo@tin: scap aborted: wmf-config/InitialiseSettings.php Introduce $wmgWikibaseDisabledUsageAspects (duration: 02m 34s)
12:25 hoo@tin: Started scap: wmf-config/InitialiseSettings.php Introduce $wmgWikibaseDisabledUsageAspects
11:58 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1087 - T174509 (duration: 00m 50s)
11:49 ema: upgrading cp3007 to Varnish 5 T177233
11:34 moritzm: installing ruby1.9 security updates on trusty hosts
11:19 akosiaris: re-enable OTRS stats group T176221
11:11 jynus: starting purge of ruwiki.recentchanges T177772
10:55 akosiaris: OTRS upgrade done T176221
10:44 akosiaris: starting OTRS upgrade T176221
10:24 jynus: stopping dbstore2001:s3 for maintenance T177908
10:19 moritzm: installing curl security updates
09:32 moritzm: installing libxfont security updates
09:28 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1087 - T174509 (duration: 00m 46s)
09:27 marostegui: Optimize pagelinks and templatelinks on db1087 - T174509
09:18 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1082 - T174509 (duration: 00m 47s)
09:04 moritzm: installing curl security updates on app server canaries (along with HHVM restart)
08:50 moritzm: installing ffmpeg security updates
08:38 elukey: reboot kafka-jumbo hosts for kernel updates
08:37 marostegui: Stop replication in sync on db1103 and db1038 to checksum their data - T164488
08:37 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1103 - T164488 (duration: 00m 47s)
07:46 jynus: restart commonswiki recentchanges purging T177772
07:10 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Pool db1072 into the vslow and dump service for s3 - T172679 (duration: 00m 46s)
07:03 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1086 - T174509 (duration: 00m 47s)
07:02 marostegui: Optimize templatelinks and pagelinks on db1086 - T174509
06:58 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1082 - T174509 (duration: 00m 47s)
06:57 marostegui: Optimize templatelinks and pagelinks on db1082 - T174509
06:14 marostegui: Set db1068 s4 primary master to read-only OFF - T168661
06:14 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Set s4 to writable mode after maintenance - T168661 (duration: 00m 47s)
06:07 marostegui: Deploy alter table on s4 primary master - T168661
06:01 marostegui: Set read-only on db1068 s4 primary master - T168661
06:00 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Set s4 in read-only for maintenance - T168661 (duration: 00m 47s)
05:48 marostegui: Package update on db1068 s4 primary master - T168661
05:35 marostegui: Disable puppet on db1068 (s4 primary master) - T168661
05:35 marostegui: Disable puppet on db1068 (s4 primary master)
05:13 marostegui: Disable alerts on s4 for 2 hours - T168661
05:06 marostegui: Dump buffer pool on s4 primary master db1068 - T168661
05:03 kartik@tin: Finished deploy [cxserver/deploy@a79b38c]: Update cxserver to 273b515 (duration: 02m 57s)
05:00 kartik@tin: Started deploy [cxserver/deploy@a79b38c]: Update cxserver to 273b515
02:32 l10nupdate@tin: ResourceLoader cache refresh completed at Wed Oct 11 02:32:46 UTC 2017 (duration 6m 39s)
02:26 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.2) (duration: 08m 52s)
2017-10-10
23:53 thcipriani@tin: Synchronized wmf-config: SWAT: Disable OCG services PART II T177795 (duration: 00m 48s)
23:52 thcipriani@tin: Synchronized wmf-config/CommonSettings.php: SWAT: Disable OCG services PART I T177795 (duration: 00m 47s)
23:41 thcipriani@tin: Synchronized php-1.31.0-wmf.2/extensions/Collection/RenderingAPI.php: SWAT: Do not request render if renderer not configured T177795 (duration: 00m 47s)
23:39 XioNoX: upgrading ps1-a2-eqiad - T175341
23:35 thcipriani@tin: Synchronized wmf-config/CommonSettings.php: SWAT: revert Enable Vector print logo on test wiki (duration: 00m 47s)
23:21 thcipriani@tin: scap failed: average error rate on 3/11 canaries increased by 10x (rerun with --force to override this check, see https://logstash.wikimedia.org/goto/2cc7028226a539553178454fc2f14459 for details)
23:12 thcipriani@tin: Synchronized wmf-config/CommonSettings.php: SWAT: wikitech: Align "contentadmin" and "sysop" permissions T171208 (duration: 00m 48s)
23:08 eileen1: ran the group clean up script (reduced smart groups)
22:28 eileen1: changed civicrm smart group cache timeout to 5 civicrm/admin/setting/search?reset=1
21:15 bblack: esams repooling - T167840
19:30 krinkle@tin: Synchronized docroot/noc/: Clean up noc base.css (duration: 00m 48s)
19:07 krinkle@tin: Synchronized docroot/noc/: Clean up noc/index CSS (duration: 00m 47s)
19:00 XioNoX: starting the work for T167840
18:41 mutante: logstash*: delete /usr/lib/ganglia/python_modules/elasticsearch_monitoring.py and /etc/ganglia/conf.d/elasticsearch.pyconf via cumin (T177225 )
17:32 XioNoX: depooling esams from DNS for T167840
17:26 gehel: shutting down and restarting elasticsearch on relforge1001 for testing - T170378
17:25 krinkle@tin: Synchronized php-1.31.0-wmf.2/resources/lib/jquery.ui/: I717f2580e3aae (duration: 00m 47s)
16:19 milimetric@tin: Finished deploy [analytics/refinery@f4a0a33]: Deploying mostly for the new script in the bin folder (duration: 16m 01s)
16:03 milimetric@tin: Started deploy [analytics/refinery@f4a0a33]: Deploying mostly for the new script in the bin folder
15:48 jynus: starting purge of commonswiki.recentchanges T177772
15:39 thcipriani@tin: Synchronized README: noop deployment to test new logstash-checker.py host (duration: 00m 47s)
15:27 jynus: upgrading neodymium and sarin to mariadb-client 10.1.28
15:23 ema: cp1008 upgraded to varnish 5 T168529
15:00 godog: test stretch 9.2 upgrade on ms-fe2005 - T177739
14:59 cmjohnson1: lvs1007 swapping NIC card
14:53 godog: test stretch 9.2 upgrade on ms-be2013 - T177739
14:45 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1079 - T174509 (duration: 00m 47s)
14:35 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1080 - T174509 (duration: 00m 47s)
14:31 marostegui: Optimize table ores_classifications on db1080 - T159753
14:29 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1072 with low weight - T172679 (duration: 00m 47s)
14:24 aude@tin: Synchronized wmf-config/throttle.php: Add throttle rule T177835 (duration: 00m 47s)
14:22 elukey: add druid public cluster's IPs to analytics-in4 on cr1/cr2 - T177511
13:56 aude@tin: Synchronized wmf-config/throttle.php: Add throttle rule T177835 (duration: 02m 16s)
13:37 aude@tin: Synchronized wmf-config/Wikibase-production.php: Remove test wiki echo configs from Wikibase config (duration: 01m 31s)
13:20 aude@tin: Synchronized wmf-config/InitialiseSettings.php: Enable SandboxLink on gawiki (duration: 01m 50s)
13:11 gehel: restarting rsyslog on mw1180 - T177833
11:32 moritzm: downgrading HHVM on deployment-mediawiki04 to HHVM 3.18.2 temporarily (to further narrow down a problem with the new wikidiff package)
11:31 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1076 - T174509 (duration: 05m 02s)
11:31 dcausse: upgrading relforge to elastic 5.5.2
11:23 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1071 - T174509 (duration: 01m 21s)
10:33 akosiaris: T177511 switch druid100[456] to private1-x-eqiad VLANs
10:26 Amir1: start of cleaning up ores_classification in wikidatawiki (T159753 )
08:07 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Add db2080 to the config - T170662 (duration: 00m 46s)
08:06 marostegui@tin: Synchronized wmf-config/db-codfw.php: Add db2080 to the config - T170662 (duration: 00m 47s)
07:04 marostegui: Stop MySQL on db2079 to clone db2080 - https://phabricator.wikimedia.org/T170662
06:35 marostegui: Drop moodbar_feedback and moodbar_feedback_response from s3 - T153033
06:27 marostegui: Drop moodbar_feedback and moodbar_feedback_response from s1 - T153033
06:22 marostegui: Stop MySQL on db1038 to transfer its data to db1072 - T172679
06:19 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1038 - T172679 (duration: 00m 47s)
06:05 marostegui: Stop MySQL on db1072 to move it to s3 - T172679
05:54 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1079 - T174509 (duration: 00m 47s)
05:53 marostegui: Optimize templatelinks and pagelinks on db1079 - T174509
05:46 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1071 - T174509 (duration: 00m 48s)
05:45 marostegui: Optimize templatelinks and pagelinks on db1071 - T174509
05:39 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1076 - T174509 (duration: 00m 47s)
05:38 marostegui: Optimize templatelinks and pagelinks on db1076 - T174509
05:32 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1080 - T174509 (duration: 00m 48s)
05:32 marostegui: Optimize templatelinks and pagelinks on db1080 - T174509
02:35 eileen2: update civicrm from 00da4b0 to 47de976
02:32 l10nupdate@tin: ResourceLoader cache refresh completed at Tue Oct 10 02:32:57 UTC 2017 (duration 6m 41s)
02:26 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.2) (duration: 08m 49s)
01:21 eileen2: update civicrm from 276cbfd to 00da4b0
2017-10-09
22:18 hoo: Removed empty left-over xmldatadumps/public/other/wikibase/wikidatawiki/20171007 (https://dumps.wikimedia.org/wikidatawiki/entities/20171007/ )
19:53 hashar@tin: Synchronized wmf-config/InitialiseSettings.php: Enable new print styles on Vector in test wiki - T169732 (duration: 00m 47s)
19:51 hashar: morning swat completed
19:50 hashar@tin: Synchronized php-1.31.0-wmf.2/extensions/Echo/maintenance/updatePerUserBlacklist.php: Sync a live hack in Echo updatePerUserBlacklist https://gerrit.wikimedia.org/r/#/c/383182/ - T173475 (duration: 00m 47s)
19:48 hashar@tin: Synchronized wmf-config/InitialiseSettings.php: Enable new print styles on Vector in test wiki - T169732 (duration: 00m 47s)
19:22 hashar@tin: Synchronized wmf-config/CommonSettings.php: REVERT: Enable new print styles on Vector - T169732 (duration: 00m 49s)
19:19 hashar@tin: scap failed: average error rate on 8/11 canaries increased by 10x (rerun with --force to override this check, see https://logstash.wikimedia.org/goto/2cc7028226a539553178454fc2f14459 for details)
19:17 hashar@tin: Synchronized php-1.31.0-wmf.2/extensions/Echo: Reapply "Use User Ids instead of User Names for Echo Mute"" - T173475 (duration: 00m 52s)
18:57 hashar@tin: Synchronized wmf-config/InitialiseSettings.php: Enable NewUserMessage for SUL accounts too on hi.wikiversity - T177690 (duration: 00m 47s)
18:52 hashar@tin: Synchronized wmf-config/InitialiseSettings.php: Enable NewUserMessage for SUL accounts too on dty.wiki - T177688 (duration: 00m 47s)
18:45 hashar@tin: Synchronized wmf-config/InitialiseSettings.php: Add autopatrolled user group to sd.wikipedia - T177141 (duration: 00m 47s)
18:00 ottomata: stopping druid services on druid1004 (not 1006)
17:59 ottomata: stopping druid services on druid1006
17:28 ottomata: stopping druid services on druid1005
16:34 ottomata: stopping druid services on druid1006
16:34 ottomata: beginning decom and reinstall process for druid1004-1006 -- T176223
15:53 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1091 - T174509 (duration: 00m 47s)
13:53 zeljkof: EU SWAT finished
13:53 zfilipin@tin: Synchronized php-1.31.0-wmf.2/extensions/Translate/: SWAT: Revert "[tech-debt] Remove usage of FuzzyLikeThis in favor of simple fuzzy match" (T177727) (duration: 00m 57s)
13:27 zfilipin@tin: Synchronized wmf-config/CommonSettings.php: SWAT: Disallow most file types from upload to enwikivoyage (T176647) (duration: 00m 47s)
13:26 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Disallow most file types from upload to enwikivoyage (T176647) (duration: 00m 47s)
13:13 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Fix amwikimedia site name (T176042) (duration: 00m 47s)
13:10 zfilipin@tin: Synchronized wmf-config/throttle.php: SWAT: New throttle rule (T177737) (duration: 00m 47s)
12:47 ema: restart pybal on eqiad load balancers to pick up bgp-med config change T165584
12:38 ema: restart pybal on codfw load balancers to pick up bgp-med config change T165584
12:31 ema: restart pybal on ulsfo load balancers to pick up bgp-med config change T165584
12:26 ema: restart pybal on esams load balancers to pick up bgp-med config change T165584
11:59 joal@tin: Finished deploy [analytics/refinery@c3812c2]: Analytics regular weekly deploy (duration: 10m 24s)
11:49 joal@tin: Started deploy [analytics/refinery@c3812c2]: Analytics regular weekly deploy
11:42 hashar: Upgrading Jenkins - T168644
11:26 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1083 - T174509 (duration: 00m 46s)
11:22 moritzm: installing apt updates from stretch 9.2 point release
11:22 marostegui: Optimize ores_classification table on db1083 - T159753
11:19 reedy@tin: Synchronized wmf-config/Wikibase-production.php: Actually disable injecting RC everywhere but commonswiki and ruwiki (duration: 00m 46s)
11:11 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1092 - T174509 (duration: 00m 47s)
11:08 moritzm: installing whois updates from stretch 9.2 point release
11:04 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1090 - T174509 (duration: 00m 46s)
10:54 moritzm: installing expect updates from stretch 9.2 point release
10:50 ladsgroup@tin: Synchronized wmf-config/Wikibase-production.php: Disable injecing RC records for commonswiki and ruwiki (T171027 ) (duration: 00m 47s)
10:38 ladsgroup@tin: Synchronized php-1.31.0-wmf.2/extensions/Wikidata/extensions/Wikibase/client: Re-instate $wgWBClientSettings[injectRecentChanges] (T171027 ) (duration: 00m 56s)
10:35 hoo: Started dumpwikidatajson.sh on snapshot1007 after T169680 disruptions
10:30 volans: rebooting snapshot1007 for NFS stuck - T169680
10:21 marostegui: Drop moodbar_feedback and moodbar_feedback_response from s7 - T153033
10:21 volans: rebooting snapshot1006 for NFS stuck - T169680
10:15 elukey: rebooting snapshot1001 for NFS stuck - T169680
10:00 volans: rebooting snapshot1005 for NFS stuck - T169680
09:51 volans: rebooting dataset1001 for NFS stuck - T169680
09:51 ema: test setting bgp med on lvs3001/3003 T165584
09:50 marostegui: Drop moodbar_feedback and moodbar_feedback_response from s5 - T153033
09:43 hashar: restarting Jenkins. Deadlock in SSHSlave plugin that causes memory to leak quite rapidly - T177749
09:34 moritzm: installing vim security updates
09:02 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2075 - T170662 (duration: 00m 47s)
08:56 volans: disabled puppet on 'stat100[5-6]*,snapshot100[1,5-7]*,dataset1001*' to manually recover from nfsd stuck - T169680
08:52 marostegui: Drop moodbar_feedback and moodbar_feedback_response from s4 - T153033
08:51 marostegui: Optimize pagelinks and templatelinks on db1092 - T174509
08:51 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1092 - T174509 (duration: 00m 47s)
08:41 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1096 - T174509 (duration: 00m 47s)
08:26 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Restore db1066 on s1 to and move db1072 to s3 instead - T172679 (duration: 00m 47s)
08:12 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Move db1066 from s1 to s3 - T172679 (duration: 01m 25s)
07:33 marostegui: Drop moodbar_feedback and moodbar_feedback_response from s2 - T153033
07:16 marostegui@tin: Synchronized wmf-config/db-codfw.php: Add db2079 to the config - T170662 (duration: 00m 47s)
07:15 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Add db2079 to the config - T170662 (duration: 00m 47s)
07:01 ema: cp4022 - backend restart, mailbox lag, cache_upload
06:56 marostegui: Stop MySQL on db2075 to use it to clone db2079 - T170662
06:56 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db2075 - T170662 (duration: 00m 46s)
06:25 marostegui: Drop moodbar_feedback and moodbar_feedback_response from s6 - T153033
06:14 marostegui: Optimize pagelinks and templatelinks on db1039 - T174509
06:11 marostegui: Optimize pagelinks and templatelinks on db1050 - T174509
06:08 marostegui: Optimize pagelinks and templatelinks on db1096 - T174509
06:06 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1096 - T174509 (duration: 00m 47s)
05:59 marostegui: Optimize pagelinks and templatelinks on db1091 - T174509
05:59 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1091 - T174509 (duration: 00m 47s)
05:53 marostegui: Optimize pagelinks and templatelinks on db1090 - T174509
05:53 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1090 - T174509 (duration: 00m 47s)
05:44 marostegui: Optimize pagelinks and templatelinks on db1083 - T174509
05:38 marostegui: Stop MySQL on db1083 to upgrade it
05:37 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1083 - T174509 (duration: 00m 47s)
02:48 l10nupdate@tin: ResourceLoader cache refresh completed at Mon Oct 9 02:48:21 UTC 2017 (duration 6m 54s)
02:41 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.2) (duration: 13m 49s)
2017-10-08
21:40 hoo: Killed "dumpwikidatardf.sh truthy nt" (wikidata truthy dump) on snapshot1007, got stuck after T169680 .
11:28 volans: ack-ed puppet not running on stat100[5-6],snapshot100[1,5-7] due to NFSD stuck on dataset1001 - T169680
08:19 elukey: restart varnish backend on cp4026 to stop 503s
2017-10-07
22:46 reedy@tin: Synchronized php-1.31.0-wmf.2/resources/: T177705 (duration: 00m 48s)
22:44 reedy@tin: Synchronized php-1.31.0-wmf.2/includes/specials/SpecialBlock.php: T177705 (duration: 00m 49s)
2017-10-06
18:58 ejegg: updated CiviCRM from 197b22d to 276cbfd
18:38 catrope@tin: Synchronized php-1.31.0-wmf.2/extensions/TwoColConflict/: Unbreak preference changes (T177524 ) (duration: 00m 47s)
18:23 arlolra@tin: Finished deploy [parsoid/deploy@e437c93]: Updating Parsoid to 772e11bf (duration: 12m 41s)
18:10 arlolra@tin: Started deploy [parsoid/deploy@e437c93]: Updating Parsoid to 772e11bf
15:30 moritzm: repooling mw2145 (API server). it was previously depooled, but there's no tickets or SAL entries indicating hardware maintenance, so it was probably simply forgotten to repool it
15:25 moritzm: repooling mw2203 (API server). it was previously depooled, but there's no tickets or SAL entries indicating hardware maintenance, so it was probably simply forgotten to repool it
12:54 moritzm: repooling mw2248-mw2250 (job runners). they were previosly depooled, but there's no tickets or SAL entries indicating hardware maintenance, so they were probably simply forgotten to repool
11:46 bblack: cp1* (eqiad caches) - upgrade to nginx-1.13.5-1+wmf1~jessie1 to match all the other sites
11:34 moritzm: repooling mw2244/mw2245 (image scalers). they were previosly depooled, but there's no tickets or SAL entries indicating hardware maintenance, so they were probably simply forgotten to repool
11:21 moritzm: installing gdk-pixbuf security updates
11:10 elukey: rolling restart of all the druid daemons on druid100[1-6] to pick up new logging changes
11:03 moritzm: installing git security updates on trusty (Debian already fixed)
10:51 moritzm: upgrading HHVM on script runners to HHVM 3.18.5
10:47 moritzm: upgrade remaining video scalers in codfw to HHVM 3.18.5
10:43 elukey: restart replication (mysql+ eventlogging_sync) on dbstore1002 after mysql restart
10:17 moritzm: upgrade remaining job runners in codfw to HHVM 3.18.5
09:53 elukey: stop replication (eventlogging_sync + mysql) on dbstore1002 as prep step for mysql restart
09:17 hashar: Restarted the CI Jenkins for plugin upgrades
09:01 moritzm: upgrade remaining app servers in codfw to HHVM 3.18.5
08:40 moritzm: upgrade remaining image scalers in codfw to HHVM 3.18.5
08:09 ema: upgrade pybal to 1.14.0 on eqiad primary LVSs
07:48 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Restore db1076 original weight - T174054 (duration: 00m 48s)
07:23 ema: upgrade pybal to 1.14.0 on eqiad secondary LVSs
07:18 moritzm: upgrade remaining API servers in codfw to HHVM 3.18.5
07:11 legoktm: live hacking on mwdebug1002
07:05 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1076 weight - T174054 (duration: 00m 47s)
06:37 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1076 weight - T174054 (duration: 00m 47s)
06:35 ema: set max_connections=0 on cp3032 varnish-be
06:23 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Add db1106 to the config - T172679 (duration: 00m 47s)
06:21 marostegui@tin: Synchronized wmf-config/db-codfw.php: Add db1106 to the config - T172679 (duration: 00m 47s)
06:00 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1076 with low weight - T174054 (duration: 00m 46s)
05:36 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1053 - T174509 (duration: 00m 48s)
05:35 marostegui: Stop MySQL on db1076 for an upgrade
2017-10-05
23:40 ejegg: updated CiviCRM from cca1a9d to 197b22d
23:37 twentyafterfour@tin: Synchronized php-1.31.0-wmf.2/resources/lib/jquery/jquery.migrate.js: sync https://gerrit.wikimedia.org/r/#/c/382530 and https://gerrit.wikimedia.org/r/#/c/382529 (duration: 00m 47s)
23:34 XioNoX: upgrading ps1-d2-eqiad.mgmt.eqiad.wmnet (unused)
23:23 twentyafterfour@tin: Synchronized php-1.31.0-wmf.2/extensions/CirrusSearch/includes/BaseInterwikiResolver.php: sync https://gerrit.wikimedia.org/r/#/c/382625/ (duration: 00m 47s)
23:21 twentyafterfour@tin: Synchronized php-1.31.0-wmf.2/extensions/WikimediaEvents/modules/ext.wikimediaEvents.searchSatisfaction.js: SWAT https://gerrit.wikimedia.org/r/#/c/382611/ (duration: 00m 47s)
22:47 twentyafterfour@tin: Synchronized php-1.31.0-wmf.2/extensions/CirrusSearch/includes/: Sync CirrusSearch extension again for good measure (duration: 00m 51s)
22:43 twentyafterfour@tin: Synchronized php-1.31.0-wmf.2/extensions/CirrusSearch: Sync CirrusSearch extension again for good measure (duration: 00m 57s)
22:39 twentyafterfour@tin: Synchronized wmf-config/CirrusSearch-common.php: sync https://gerrit.wikimedia.org/r/#/c/382474/ (duration: 00m 47s)
22:37 twentyafterfour@tin: Synchronized php-1.31.0-wmf.2/extensions/CirrusSearch/: deploy https://gerrit.wikimedia.org/r/#/c/382618/ (duration: 00m 58s)
22:13 awight@tin: Finished deploy [ores/deploy@42c5663]: Cause ORES service restart (duration: 00m 19s)
22:13 awight@tin: Started deploy [ores/deploy@42c5663]: Cause ORES service restart
22:04 bd808@tin: Finished deploy [striker/deploy@fbb9019]: Prevent tools from being named with invalid Kubernetes namespace labels (T176681 ) (duration: 00m 33s)
22:04 bd808@tin: Started deploy [striker/deploy@fbb9019]: Prevent tools from being named with invalid Kubernetes namespace labels (T176681 )
21:35 twentyafterfour@tin: rebuilt wikiversions.php and synchronized wikiversions files: all wikis to 1.31.0-wmf.2 refs T174358
21:32 twentyafterfour@tin: Synchronized php-1.31.0-wmf.2/extensions/CirrusSearch: sync CirrusSearch to deploy https://gerrit.wikimedia.org/r/#/c/382605/ refs T177535 (duration: 01m 06s)
21:22 ejegg: updated CiviCRM from 59e3e00 to cca1a9d
21:13 twentyafterfour: Getting the train back on track: moving to wmf.2 as soon as https://gerrit.wikimedia.org/r/#/c/382605/1 merges
20:47 twentyafterfour: 1.31.0-wmf.2 is now blocked by T177535
20:28 twentyafterfour@tin: rebuilt wikiversions.php and synchronized wikiversions files: all wikis to 1.31.0-wmf.1 refs T174358
20:26 twentyafterfour: rolling back to 1.31.0-wmf.1
20:11 twentyafterfour@tin: rebuilt wikiversions.php and synchronized wikiversions files: all wikis to 1.31.0-wmf.2 refs T174358
20:10 twentyafterfour: Promote all wikis to 1.31.0-wmf.2 refs T174358 (currently there are no blockers and no significant logspam)
19:59 thcipriani@tin: Synchronized php-1.31.0-wmf.1/extensions/Echo: SWAT: revert "Use User Ids instead of User Names for Echo Mute" (duration: 00m 55s)
19:58 andrewbogott: upgrading wikitech-static to REL1_30
19:55 thcipriani@tin: Synchronized php-1.31.0-wmf.2/extensions/Echo: SWAT: revert "Use User Ids instead of User Names for Echo Mute" (duration: 00m 52s)
19:49 thcipriani@tin: Synchronized php-1.31.0-wmf.2/extensions/VisualEditor/extension.json: SWAT: Move 'parentheses' message to MWSaveDialog's modules T177446 (duration: 00m 50s)
19:45 thcipriani@tin: Synchronized php-1.31.0-wmf.2/extensions/TwoColConflict: SWAT: Dont register simulate page when not enabled as a BF (duration: 00m 52s)
19:05 thcipriani@tin: Synchronized php-1.31.0-wmf.1/extensions/Echo: SWAT: Use User Ids instead of User Names for Echo Mute T173475 (duration: 00m 55s)
18:58 thcipriani@tin: Synchronized php-1.31.0-wmf.2/extensions/Echo: SWAT: Use User Ids instead of User Names for Echo Mute T173475 (duration: 00m 56s)
18:53 bblack: cp3* upgrade nginx to 1.13.5-1+wmf1~jessie1
18:38 urandom: restbase-ng: lowering compactionthroughput to 4 (current/5 jbod devices)
18:26 thcipriani@tin: Synchronized wmf-config: SWAT: Enable structured change filters by default on all wikis except FlaggedRevs wikis using non-protection modes T177445 (duration: 00m 54s)
18:13 thcipriani@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable jQuery 3 on all wikis T124742 (duration: 00m 52s)
17:27 cmjohnson1: disable puppet db1024
16:53 moritzm: upgrade remaining app servers in eqiad to HHVM 3.18.5
16:16 XioNoX: failure test for pfw3-codfw
15:39 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1078 - T174054 (duration: 00m 50s)
15:25 marostegui: Pulling out one disk from db1078 - T174054
15:23 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1078 - T174054 (duration: 00m 51s)
15:16 marostegui: Pulling out one disk from db1076 - T174054
14:53 hashar: All Jenkins infra is now on Java 8 - T162828
14:52 marostegui: Sanitize db1095 for amwikimedia - T176043
14:48 hashar: Restarting Jenkins
14:27 moritzm: upgrade remaining eqiad video scalers to HHVM 3.18.5
13:57 ladsgroup@tin: Synchronized wmf-config/Wikibase-production.php: Enable Statement usage tracking on kowiki and trwiki (T151717 ) (duration: 00m 50s)
13:51 ladsgroup@tin: Synchronized wmf-config/Wikibase-production.php: (no justification provided) (duration: 00m 50s)
13:49 dereckson@tin: Synchronized wmf-config/InitialiseSettings.php: Document task reference for amwikimedia logo (T176042 ) (duration: 00m 54s)
13:43 Amir1: ladsgroup@terbium:~$ mwscript createAndPromote.php --wiki=amwikimedia --createAndPromote=sysop,bureaucrat 'Ladsgroup' "password removed obviously" --force (T176042 )
13:35 ladsgroup@tin: Synchronized wmf-config/InitialiseSettings.php: Fix amwikimedia logo (T176042 ) (duration: 00m 50s)
13:26 ladsgroup@tin: Synchronized php-1.31.0-wmf.2/extensions/WikimediaMessages/i18n/wikimediaprojectnames: (no justification provided) (duration: 00m 49s)
13:15 godog: test upgrade node-exporter in thumbor/swift/services/puppet3-diffs/monitoring projects - T166561
13:11 ladsgroup@tin: Synchronized wmf-config/interwiki.php: Create amwikimedia (T176042 ) (duration: 00m 51s)
13:03 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1089 - T174509 (duration: 00m 50s)
12:58 ladsgroup@tin: Synchronized multiversion/MWMultiVersion.php: Create amwikimedia (T176042 ) (duration: 00m 50s)
12:53 ladsgroup@tin: Synchronized static/images/project-logos/: Create amwikimedia (T176042 ) (duration: 00m 50s)
12:52 ladsgroup@tin: Synchronized wmf-config/InitialiseSettings.php: Create amwikimedia (T176042 ) (duration: 00m 50s)
12:50 ladsgroup@tin: rebuilt wikiversions.php and synchronized wikiversions files: Create amwikimedia (T176042 )
12:49 ladsgroup@tin: Synchronized dblists: Create amwikimedia (T176042 ) (duration: 00m 50s)
12:36 ladsgroup@tin: Synchronized dblists: (no justification provided) (duration: 00m 51s)
12:22 moritzm: installing poppler security updates on trusty
11:57 moritzm: upgrade remaining eqiad video scalers to HHVM 3.18.5
11:29 jynus: rebuilding globalimagelinks table on dbstore1001:s4
10:35 hashar: contint2001: switched java to version 8 as well as all the jre/jdk utilities managed by alternatives - T162828
10:26 moritzm: upgrade remaining eqiad API servers to HHVM 3.18.5
10:23 addshore@tin: Synchronized php-1.31.0-wmf.2/extensions/TwoColConflict/includes/TwoColConflictHooks.php: REVERT: Dont register simulate page when not enabled as a BF PT1/3 (duration: 00m 50s)
10:22 addshore@tin: Synchronized php-1.31.0-wmf.2/extensions/TwoColConflict/extension.json: REVERT: Dont register simulate page when not enabled as a BF PT2/3 (duration: 00m 50s)
10:17 addshore@tin: Synchronized php-1.31.0-wmf.2/extensions/TwoColConflict/extension.json: Dont register simulate page when not enabled as a BF PT2/3 (duration: 00m 50s)
10:16 addshore@tin: Synchronized php-1.31.0-wmf.2/extensions/TwoColConflict/includes/TwoColConflictHooks.php: Dont register simulate page when not enabled as a BF PT1/3 (duration: 00m 51s)
10:14 addshore@tin: Synchronized php-1.31.0-wmf.2/extensions/TwoColConflict/includes/SpecialConflictTestPage/SpecialConflictTestPage.php: Fix SimulateTwoColEditConflict when using a page with ns (duration: 00m 50s)
10:09 addshore@tin: Synchronized php-1.31.0-wmf.2/extensions/WikimediaEvents/WikimediaEventsHooks.php: 31-wmf.2 WikimediaEvents: WMDE: update $campaignStartTimestamp (duration: 00m 50s)
10:07 addshore@tin: Synchronized php-1.31.0-wmf.1/extensions/WikimediaEvents/WikimediaEventsHooks.php: 31-wmf.1 WikimediaEvents: WMDE: update $campaignStartTimestamp (duration: 00m 52s)
10:04 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1076 - T174054 (duration: 00m 50s)
09:52 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1099 - T174509 (duration: 00m 50s)
08:54 marostegui: Stop MySQL on db1104 to clone db1106 - T172679
08:53 gehel: rolling restart of wdqs to pick up new GC options - T175919
08:41 moritzm: upgrade remaining eqiad job runners to HHVM 3.18.5
08:40 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Add db1105 to the config - T172679 (duration: 00m 50s)
08:39 marostegui@tin: Synchronized wmf-config/db-codfw.php: Add db1105 to the config - T172679 (duration: 00m 52s)
08:29 marostegui: Optimize pagelinks and templatelinks on s5 - db1105 - T174509
08:21 moritzm: upgrade remaining eqiad image scalers to HHVM 3.18.5
08:12 marostegui: Optimize templatelinks and pagelinks on s1 s2 s4 s5 s6 and s7 on labsdb1009 - T174509
08:03 jynus: putting labsdb1009 under maintenance
07:54 moritzm: upgrade mw1221-mw1235 (API servers) to HHVM 3.18.5
07:26 legoktm: running batched query of "DELETE FROM linter WHERE linter_cat=12" on all wikis to clear out mostly bogus html5-misnesting category
07:11 ema: update pybal to 1.14.0 on esams primary LVSs
07:06 ema: update pybal to 1.14.0 on esams secondary LVSs
07:01 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1099 - T174509 (duration: 00m 52s)
07:01 marostegui: Optimize templatelinks and pagelinks on db1099 - T174509
06:56 moritzm: upgrade mw1238-mw1258 (app servers) to HHVM 3.18.5
06:40 marostegui: Stop MySQL on db1104 to clone db1105 from it - T172679
06:32 marostegui: Optimize templatelinks and pagelinks on db1069 - T174509
06:31 marostegui: Optimize templatelinks and pagelinks on db1053 - T174509
06:30 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1053 - T174509 (duration: 00m 50s)
06:25 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1089 - T174509 (duration: 00m 49s)
06:24 marostegui: Optimize templatelinks and pagelinks on db1089 - T174509
06:17 marostegui: Optimize s2.templatelinks and enwiki.pagelinks on codfw master db2017 (without replication) - T174509
06:13 marostegui: Optimize enwiki.templatelinks and enwiki.pagelinks on codfw master db2048 (without replication) - T174509
06:01 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2035 and db2056 - T174509 (duration: 00m 52s)
05:57 elukey: restart varnish backend on cp3040
05:51 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1097 - T174509 (duration: 01m 09s)
05:42 elukey: restart varnish backend on cp3030
02:59 l10nupdate@tin: ResourceLoader cache refresh completed at Thu Oct 5 02:59:18 UTC 2017 (duration 7m 23s)
02:51 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.2) (duration: 09m 01s)
02:29 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.1) (duration: 09m 51s)
00:06 catrope@tin: Synchronized php-1.31.0-wmf.2/resources/src/mediawiki.rcfilters/mw.rcfilters.init.js: T176652 (duration: 00m 49s)
00:04 catrope@tin: Synchronized php-1.31.0-wmf.2/extensions/WikimediaEvents/modules/ext.wikimediaEvents.recentChangesClicks.js: T176652 (duration: 00m 50s)
2017-10-04
23:55 eileen: update process_control to 10161f8
23:53 catrope@tin: Synchronized php-1.31.0-wmf.2/extensions/TimedMediaHandler/MwEmbedModules/EmbedPlayer/resources/jquery.embedPlayer.js: Fix jqmigrate warning (T169385 ) (duration: 00m 50s)
23:51 catrope@tin: Synchronized php-1.31.0-wmf.2/extensions/ORES/includes/Hooks.php: ORES highlights for RCFilters (T172757 ) (duration: 00m 50s)
23:49 catrope@tin: Synchronized php-1.31.0-wmf.2/resources/src/: ORES highlights for RCFilters (T172757 ) (duration: 00m 50s)
23:36 catrope@tin: Synchronized php-1.31.0-wmf.2/includes/changes/ChangesListFilter.php: ORES highlights for RCFilters (T172757 ) (duration: 00m 50s)
23:32 catrope@tin: Synchronized php-1.31.0-wmf.2/includes/changes/ChangesListFilterGroup.php: Fix PHP fatal when transcluding Special:Recentchanges (T176236 ) (duration: 00m 50s)
23:32 Krinkle: Upgrading xhgui on tungsten (operations/software/xhgui.git) from 0.5.2 to 0.8.1 (matching version used in operations/mediawiki-config vendor) - new feature: flamegraphs
23:29 catrope@tin: Synchronized wmf-config/throttle.php: Remove expired rules (duration: 00m 50s)
23:27 catrope@tin: Synchronized wmf-config/InitialiseSettings.php: Enable AbuseFilter runtime profile on ptwiki (T177336 ) (duration: 00m 51s)
23:19 catrope@tin: Synchronized static/images/mobile/copyright/wikipedia-wordmark-en.svg: Remove unnecessary id attributes (T175670 ) (duration: 00m 50s)
23:18 catrope@tin: Synchronized php-1.31.0-wmf.2/skins/MinervaNeue/: Correct feature phone threshold detection (T176286 ) (duration: 00m 53s)
22:35 ejegg: changed DonationInterface cURL timeout for all processors to 12 sec
21:09 arlolra: Updated Parsoid to 477942c3 (T177115 )
21:07 hoo: Updated the Wikidata property suggester with data from Monday's JSON dump and applied the T132839 workarounds
21:03 arlolra@tin: Finished deploy [parsoid/deploy@617cdb3]: Updating Parsoid to 477942c3 (duration: 10m 01s)
20:53 arlolra@tin: Started deploy [parsoid/deploy@617cdb3]: Updating Parsoid to 477942c3
20:33 arlolra@tin: Finished deploy [parsoid/deploy@5b80254]: Updating Parsoid to 477942c3 (duration: 00m 58s)
20:32 arlolra@tin: Started deploy [parsoid/deploy@5b80254]: Updating Parsoid to 477942c3
20:18 pnorman: Start regenerating map tiles on codfw for z13-z14 - T176252
19:55 twentyafterfour@tin: Synchronized php: group1 wikis to 1.31.0-wmf.2 refs T174358 (duration: 00m 48s)
19:55 twentyafterfour@tin: rebuilt wikiversions.php and synchronized wikiversions files: group1 wikis to 1.31.0-wmf.2 refs T174358
19:38 pnorman: Start regenerating map tiles on eqiad for z13-z14 - T176252
19:25 twentyafterfour: Deploying MediaWiki 1.31.0-wmf.2 to Group 1 wikis refs T174358
19:06 bblack: cp2* (all codfw caches): upgrade nginx to 1.13.5-1+wmf1~jessie1
18:39 ottomata: reverting roll out of LVS service druid-analytics.svc.eqiad.wmnet - this wont' work with hosts inside of Analytics VLAN
18:23 niharika29@tin: Synchronized php-1.31.0-wmf.2/skins/Vector/: Do not special-case ULS and Not logged in in RTL in personal bar T48947 , T177312 (duration: 00m 50s)
18:22 niharika29@tin: Synchronized php-1.31.0-wmf.1/skins/Vector/: Do not special-case ULS and Not logged in in RTL in personal bar T48947 , T177312 (duration: 00m 51s)
18:09 niharika29@tin: Synchronized wmf-config/InitialiseSettings.php: Enable jQuery 3 on most group1 wikis (T124742 ) (duration: 00m 51s)
17:37 elukey: enabled basic ACLs on the Kafka Jumbo cluster - T173493
17:32 bblack: text@ulsfo - upgrade nginx to 1.13.5-1+wmf1~jessie1
17:32 ottomata: deploying new LVS service for druid-analytics-broker
17:16 moritzm: uploaded icu52 to stretch-wikimedia (co-installable forward port of jessie's ICU for stretch to be used for HHVM)
17:06 bblack: cp4022-26 (now all of upload@ulsfo) - upgrade nginx to 1.13.5-1+wmf1~jessie1
16:58 krinkle@tin: Synchronized php-1.31.0-wmf.2/extensions/TimedMediaHandler/resources/mw.MediaWikiPlayer.loader.js: T175506 (duration: 00m 52s)
16:29 bblack: lvs1001-3: base package upgrades, autoremove, +python-requests
16:24 bblack: recdns: upgrade base packages
16:23 volans: deleted 'R:File = /usr/local/bin/wmf-reimage' on matching hosts T176955
16:20 bblack: maelant: upgrade base packages
16:11 bblack: lvs1004-6: base package upgrades, autoremove, +python-requests
16:06 jynus: performing schema change on labsdb1009/10/11
16:01 ejegg: disabled CiviCRM dedupe jobs
15:40 bblack: acamar (recdns@codfw) - base package upgrades
15:39 bblack: lvs1007-12: base package upgrades, autoremove, +python-requests
15:37 bblack: hydrogen (recdns@eqiad) - base package upgrades
15:35 marostegui: Power off db2010 to decommission it - T175685
15:35 bblack: recdns: apt autoremove -> install python-requests
15:32 bblack: caches: apt autoremove -> install python-requests
15:28 marostegui: Disable puppet on db2010 - it will be decommissioned - T175685
15:24 andrewbogott: rebuilding labvirt1016
14:57 ema: cp4025: restart varnish backend
14:47 XioNoX: replacing faulty VC cable on fasw-c-eqiad
14:45 bblack: base package upgrades (+autoremove) on lvs3001-2
14:43 bblack: base package upgrades (+autoremove) on lvs3004
14:41 bblack: base package upgrades (+autoremove) on lvs3003
14:24 bblack: upgrading nginx to 1.13.5-1+wmf1~jessie1 on cp4021 (first live cache, upload@ulsfo)
14:18 bblack: base package upgrades (+autoremove) on lvs2*
14:10 bblack: base package upgrades (+autoremove) on lvs4*
13:47 zeljkof: EU SWAT finished
13:46 zfilipin@tin: Synchronized php-1.31.0-wmf.2/extensions/Translate/tag/PageTranslationHooks.php: SWAT: Revert "Use Language type hint in hooks" (T177352) (duration: 00m 50s)
13:45 zfilipin@tin: Synchronized php-1.31.0-wmf.2/extensions/Translate/TranslateHooks.php: SWAT: Revert "Use Language type hint in hooks" (T177352) (duration: 00m 50s)
13:41 zfilipin@tin: Synchronized php-1.31.0-wmf.2/extensions/Translate/TranslateHooks.php: php-1.31.0-wmf.2/extensions/Translate/tag/PageTranslationHooks.php SWAT: Revert "Use Language type hint in hooks" (T177352) (duration: 00m 52s)
13:26 zeljkof: one more commit for EU SWAT
13:26 marostegui: Optimize table ores_classification on enwiki codfw master db2048 with replication - might generate lag - T159753
13:22 marostegui: Optimize enwiki.ores_classification on db2069
13:06 zeljkof: EU SWAT finished
13:05 zfilipin@tin: Synchronized wmf-config/throttle.php: SWAT: New throttle rule (T177370) (duration: 00m 53s)
12:59 jynus: upgrade and reboot labsdb1009
12:56 marostegui: Optimize templatelinks and pagelinks tables on s2 and s7 - labsdb1010 - T174509
12:38 moritzm: upgrading app servers in codfw to HHVM 3.18.5
12:21 moritzm: upgrading image scalers mw1293-mw1295 to HHVM 3.18.5
12:12 moritzm: upgrading HHVM on deployment servers to 3.18.5
12:10 elukey: added two new mediawiki videoscalers - mw1307/1318
12:09 elukey@puppetmaster1001: conftool action : set/pooled=yes; selector: name=mw1318.eqiad.wmnet
12:09 elukey@puppetmaster1001: conftool action : set/pooled=yes; selector: name=mw1307.eqiad.wmnet
12:00 gehel: mediawiki now uses the LVS endpoint for logstash - T175242
11:49 moritzm: upgrading job runners mw1161-mw1167 to HHVM 3.18.5
11:34 moritzm: installing ghostscript security updates
11:13 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Restore db1087 original weight (duration: 00m 47s)
10:33 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1087 weight (duration: 00m 51s)
10:29 jynus: upgrade and reboot labsdb1011
10:22 jynus: draining labsdb1011 connections
10:18 akosiaris: T177374 , fully depool wtp1001-wtp1024
10:18 akosiaris@puppetmaster1001: conftool action : set/pooled=inactive; selector: name=wtp10([01][0-9]|2[0-4]).eqiad.wmnet
10:03 moritzm: install libidn security updates
10:01 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1087 with low weight (duration: 00m 50s)
09:54 _joe_: restarting hhvm on canaries (both appservers and canaries) T107128
09:47 moritzm: upgrading API app servers mw1189-mw1208 to HHVM 3.18.5
09:45 elukey: rolling restart of aqs nodes to pick up the new logstash lvs config
09:39 hoo: Killed remaining dumpRdf runners on snapshot1007 after two crashed due to the db1087 depool. Auto-restart will do the rest.
09:33 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Add db1104 to the config - T172679 (duration: 00m 50s)
09:32 marostegui@tin: Synchronized wmf-config/db-codfw.php: Add db1104 to the config - T172679 (duration: 00m 51s)
09:25 moritzm: installing ocaml security updates on trusty
09:16 marostegui: Reboot db1087 to pick up new kernel
09:09 _joe_: restarting hhvm on mwdebug*, T107128
09:07 marostegui: Optimize templatelinks and pagelinks tables on db1097 (s4) - T174509
09:06 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1097 - T174509 (duration: 00m 50s)
08:41 akosiaris@puppetmaster1001: conftool action : set/pooled=no; selector: name=wtp10([01][0-9]|2[0-4]).eqiad.wmnet
08:40 mobrovac@tin: Finished deploy [restbase/deploy@8eb758a]: Switch the mobile feeds to the new storage schema and Cassandra 3 (duration: 10m 05s)
08:36 moritzm: upgrading app servers mw1180-mw1188, mw1209-mw1220 to HHVM 3.18.5
08:30 mobrovac@tin: Started deploy [restbase/deploy@8eb758a]: Switch the mobile feeds to the new storage schema and Cassandra 3
08:28 marostegui: Stop MySQL on db2010 as it will be decommissioned - T175685
07:30 jynus: stopping s5 replication on dbstore1002 and converting wikidatawiki.wb_terms into TokuDB
07:11 marostegui@tin: Synchronized wmf-config/db-codfw.php: Remove db2010 from config as it will be decommissioned - T175685 (duration: 00m 48s)
07:10 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Remove db2010 from config as it will be decommissioned - T175685 (duration: 00m 48s)
06:53 marostegui: Stop MySQL on db1087 to clone db1104 from it - T172679
06:39 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1087 - T172679 (duration: 00m 50s)
06:30 marostegui: Optimize templatelinks and pagelinks tables on db1066 - T174509
06:27 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db2035 and db2056 - T174509 (duration: 00m 50s)
06:21 elukey: restart varnish backend on cp3043
06:19 marostegui: Optimize pagelinks and templatelinks on dbstore2002 s1 - T174509
06:16 elukey: restart varnish backend on cp3041
06:13 marostegui: Optimize pagelinks and templatelinks tables on s7 codfw master (db2029) this will generate lag - T174509
05:58 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2034, db2055 db2062 - T174509 (duration: 00m 51s)
03:09 l10nupdate@tin: ResourceLoader cache refresh completed at Wed Oct 4 03:09:34 UTC 2017 (duration 7m 15s)
03:02 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.2) (duration: 15m 20s)
02:25 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.1) (duration: 07m 44s)
00:33 ejegg: re-enabled fundraising queue consumers
00:29 ejegg: re-enabled CiviCRM de-dupe jobs
00:24 ejegg: re-enabled civicrm cron and ingenico jobs
00:20 ejegg: re-enabled fundraising stats and export jobs
00:16 ejegg: re-enabled omnimail import jobs
00:10 ejegg: re-enabled fundraising audit jobs
00:07 ejegg: re-enabled pending queue consumer
2017-10-03
23:52 ejegg: stopped all jobs for process-control upgrade
23:46 pnorman: Start regenerating map tiles on eqiad for z0-z12 - T176252
23:20 ebernhardson@tin: Synchronized wmf-config/InitialiseSettings.php: T175971 T176088 Enable RemexHTML on wikitech and eswikiversity (duration: 00m 51s)
23:11 ebernhardson@tin: Synchronized wmf-config/CirrusSearch-common.php: Pre-configure cirrussearch for 5.5.x upgrade (duration: 00m 51s)
22:26 bsitzmann@tin: Finished deploy [mobileapps/deploy@82aa7d6]: Update mobileapps to 5dc0c02 (T175762 T177001 T176525 T176517 T176519 ) (duration: 06m 14s)
22:20 bsitzmann@tin: Started deploy [mobileapps/deploy@82aa7d6]: Update mobileapps to 5dc0c02 (T175762 T177001 T176525 T176517 T176519 )
20:29 pnorman: Start regenerating map tiles on codfw for z0-z12 - T176252
20:17 demon@tin: Synchronized php-1.31.0-wmf.2/extensions/GlobalUserPage/includes/Hooks.php: fix broken thing (duration: 00m 51s)
19:58 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group0 to wmf.2
19:33 demon@tin: Synchronized scap/plugins/prep.py: no-op (duration: 00m 51s)
19:17 demon@tin: Finished scap: bootstrap wmf.2 (duration: 33m 50s)
19:01 twentyafterfour: Starting MediaWiki train for 1.30.1-wmf.2
18:43 demon@tin: Started scap: bootstrap wmf.2
18:25 chasemp: change to private vlan for labweb1001 on asw-b-eqiad
18:24 chasemp: change to private vlan for labweb1002 on asw2-d-eqiad
18:24 chasemp: auth-dnsupdate on bahan for 382027
16:51 ejegg: updated payments-wiki from c2a3d43 to b9859f6
15:57 bblack: upgrading basic packages on role::cache::text
15:50 bblack: upgrading basic packages on role::cache::upload
15:42 bblack: upgrading basic packages on role::cache::misc
15:29 mobrovac@tin: Finished deploy [restbase/deploy@02acc45]: (no justification provided) (duration: 06m 27s)
15:24 _joe_: uploading blubber to jessie-wikimedia T175296
15:23 mobrovac@tin: Started deploy [restbase/deploy@02acc45]: (no justification provided)
15:04 mobrovac@tin: Finished deploy [restbase/deploy@02acc45]: (no justification provided) (duration: 05m 14s)
14:59 mobrovac@tin: Started deploy [restbase/deploy@02acc45]: (no justification provided)
14:56 mobrovac@tin: Finished deploy [restbase/deploy@02acc45]: (no justification provided) (duration: 04m 25s)
14:56 bblack: upgrading basic packages on cp1065
14:52 mobrovac@tin: Started deploy [restbase/deploy@02acc45]: (no justification provided)
13:56 zeljkof: EU SWAT finished
13:55 zfilipin@tin: Synchronized php-1.31.0-wmf.1/extensions/Translate/webservices/CxserverWebService.php: SWAT: Make CxserverWebService forward compatible (duration: 00m 45s)
13:46 godog: upgrade grafana to 4.5.2 on krypton - T175980
13:41 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Adjust wgNamespacesToBeSearchedDefault for enwikibooks, fawikibooks and hewikisource (T176906 T176908 T176907) (duration: 00m 46s)
13:34 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Revert "Limit thanks for new users at pl.wikipedia to 3 per day" (T176174 T169268) (duration: 00m 46s)
13:26 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable ShortUrl Extension on hiwikiversity (T177187) (duration: 00m 46s)
13:25 marostegui: Optimize templatelinks and pagelinks tables on s1, s4 and s6 on labsdb1010 - T174509
13:13 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable NewUserMessage Extension on hiwikiversity (T177188) (duration: 00m 46s)
13:04 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2044 - T174764 (duration: 00m 47s)
10:50 mobrovac@tin: Finished deploy [restbase/deploy@1cc530b]: (no justification provided) (duration: 08m 38s)
10:41 mobrovac@tin: Started deploy [restbase/deploy@1cc530b]: (no justification provided)
08:10 Amir1: cleaning up ores_classification tables (T159753 )
08:05 marostegui: Deplo alter table on s3 codfw master (db2018) to add PK to the main tables - T163912
08:02 mobrovac@tin: Finished deploy [restbase/deploy@65f519b]: Create new buckets for feed end points (duration: 09m 57s)
07:52 mobrovac@tin: Started deploy [restbase/deploy@65f519b]: Create new buckets for feed end points
07:20 gehel: killing stuck tileshell and cleaning up /srv/osm_expire on maps-test2001 - T175123
07:05 elukey: restart varnish backend on cp3031 (503s)
07:03 marostegui: Optimize pagelinks and templatelinks tables on labsdb1010 for s5 - T174509
07:01 marostegui: Optimize templatelinks and pagelinks tables on s6 codfw master db2028 (this might generate lag) - T174509
06:46 marostegui: Drop now redundant indexes from pagelinks and templatelinks on s7 - T174509
06:28 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1066 - T172679 (duration: 00m 47s)
05:54 marostegui: Optimize templatelinks and pagelinks tables on s5 master codfw (db2023): this will generate lag - T174509
05:51 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2049 and db2041 - T174509 (duration: 00m 47s)
03:23 demon@tin: Pruned MediaWiki: 1.30.0-wmf.19 [keeping static files] (duration: 01m 30s)
02:30 l10nupdate@tin: ResourceLoader cache refresh completed at Tue Oct 3 02:30:30 UTC 2017 (duration 6m 39s)
02:23 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.1) (duration: 07m 22s)
2017-10-02
23:38 catrope@tin: Synchronized php-1.31.0-wmf.1/extensions/VisualEditor/: T177250 (duration: 00m 47s)
23:36 catrope@tin: Synchronized php-1.31.0-wmf.1/resources/src/mediawiki.rcfilters/dm/mw.rcfilters.dm.FiltersViewModel.js: T177107 (duration: 00m 47s)
23:33 mutante: releases2001 - kill and restart jenkins with systemctl
23:19 catrope@tin: Synchronized wmf-config/InitialiseSettings.php: Update WMF mailing address (T173684 ) and enable jQuery 3 on Wiktionaries (T124742 ) (duration: 00m 48s)
23:00 mutante: releases1001 - remove and reinstall jenkins
22:56 Reedy: created newsletter tables on officewiki T176199
19:36 ejegg: turned on CiviMail record creation
18:48 niharika29@tin: Synchronized wmf-config/InitialiseSettings.php: Change the Turkish Wiktionary logo (T176008 ) (duration: 00m 46s)
18:46 niharika29@tin: Synchronized static/images/project-logos/: Change the Turkish Wiktionary logo (T176008 ) (duration: 00m 48s)
17:46 gehel@tin: Finished deploy [wdqs/wdqs@f20b726]: latest wdqs GUI + minor fixes (duration: 01m 48s)
17:44 gehel@tin: Started deploy [wdqs/wdqs@f20b726]: latest wdqs GUI + minor fixes
14:08 zeljkof: EU SWAT finished
14:07 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable Sandbox menu to Javanese Wikipedia (T176308) (duration: 00m 46s)
14:01 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable Extension:Newsletter on officewiki (T176199) (duration: 00m 46s)
13:59 zeljkof: extending EU SWAT for 5-10 minutes
13:51 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Change the zh-classicalwiki logo (T177165) (duration: 00m 45s)
13:50 zfilipin@tin: Synchronized static/images/project-logos/: SWAT: Change the zh-classicalwiki logo (T177165) (duration: 00m 46s)
13:44 zfilipin@tin: Synchronized static/images/project-logos/zh_classicalwiki-1.5x.png: static/images/project-logos/zh_classicalwiki-2x.png static/images/project-logos/zh_classicalwiki.png wmf-config/InitialiseSettings.php SWAT: Change the zh-classicalwiki logo (T177165) (duration: 00m 46s)
13:42 godog: roll restart thumbor to apply https://gerrit.wikimedia.org/r/#/c/381045/ - T166699
13:38 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable the Extension:SandboxLink on the nlwikinews (T177170) (duration: 00m 46s)
13:32 zfilipin@tin: Synchronized wmf-config/Wikibase-production.php: SWAT: Wikidata: Dont persist description usages (yet) (T177153) (duration: 00m 45s)
13:27 zfilipin@tin: Synchronized wmf-config/Wikibase-production.php: SWAT: Wikidata: Dont persist description usages (yet) (T177153) (duration: 00m 46s)
13:20 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Add eliminator as a priviliged account (T176554) (duration: 00m 46s)
13:17 zfilipin@tin: Synchronized wmf-config/Wikibase-labs.php: wmf-config/Wikibase.php wmf-config/Wikibase-production.php SWAT: labs: Use redis lock manager for dispatching changes of Wikibase (T175109) (duration: 00m 46s)
13:09 zfilipin@tin: Synchronized wmf-config/Wikibase-labs.php: wmf-config/Wikibase.php wmf-config/Wikibase-production.php SWAT: labs: Use redis lock manager for dispatching changes of Wikibase (T175109) (duration: 00m 47s)
12:56 elukey: added two new mediawiki jobrunners - mw131[01]
12:50 marostegui: Optimize tables pagelinks and templatelinks on s5: dbstore2001
11:51 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2039 - T174509 (duration: 00m 46s)
10:06 elukey@puppetmaster1001: conftool action : set/pooled=yes; selector: name=mw1311.eqiad.wmnet
10:06 elukey@puppetmaster1001: conftool action : set/pooled=yes; selector: name=mw1310.eqiad.wmnet
10:04 mobrovac: restbase restarting on restbase1007 to start logging error bodies for T176728
09:55 jynus: stopping and upgrading labsdb1010
09:27 Amir1: finished the cleanup of ores_classification table in enwiki (T159753 )
09:12 godog: roll-restart thumbor to upgrade to 1.6 - T172556
09:06 elukey: forced remount of /mnt/hdfs on notebook1002
08:47 gehel: deploying latest tilerator on maps-test only, fix for T175123
08:46 gehel@tin: Finished deploy [tilerator/deploy@b02bd9a]: (no justification provided) (duration: 00m 22s)
08:46 gehel@tin: Started deploy [tilerator/deploy@b02bd9a]: (no justification provided)
08:46 marostegui: Poweroff db2044 for HW maintenance - T174764
08:40 marostegui: Stop MySQL on db2044 to get it ready to replace its mainboard - T174764
08:29 Amir1: finished the cleanup of ores_classification table in wikidatawiki and starting the enwiki one (T159753 )
08:24 jynus: stopping dbstore1001 for maintenance
08:12 marostegui: Optimize tables pagelinks and templatelinks on s5: db1100 (already depooled) - T174509
07:59 marostegui: Drop now redundant indexes from pagelinks and templatelinks from s5 - T174509
07:57 Amir1: starting a round of cleanup in ores_classification table in wikidatawiki
07:41 marostegui: Stop MySQL on db1036 as it is going to be decommissioned - https://phabricator.wikimedia.org/T176311
07:32 marostegui: Optimize table pagelinks and templatelinks on s6: dbstore2001 - T174509
07:21 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db2039 - T174509 (duration: 00m 46s)
07:20 marostegui: Optimize table pagelinks and templatelinks on s6: db2039 - T174509
07:10 marostegui: Optimize table pagelinks and templatelinks on s1: db2034, db2055 db2062 - T174509
07:06 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db2034, db2055 db2062 - T174509 (duration: 00m 46s)
06:55 _joe_: restarting a few API appservers with high cpu load
06:19 elukey: restart varnish backend on cp3040
06:10 elukey: restart varnish backend on cp3033
06:08 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Remove db1036 from config files as it will be decommissioned - T176311 (duration: 00m 46s)
06:07 marostegui@tin: Synchronized wmf-config/db-codfw.php: Remove db1036 from config files as it will be decommissioned - T176311 (duration: 00m 48s)
02:43 l10nupdate@tin: ResourceLoader cache refresh completed at Mon Oct 2 02:43:33 UTC 2017 (duration 6m 39s)
02:36 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.1) (duration: 08m 42s)
2017-10-01
21:30 madhuvishy: powercycling labvirt1015
18:58 hashar: restarted Jenkins. Was stuck somehow
13:50 elukey: restart hhvm on mw1167 (jobrunner) - hhvm stuck, dump-debug in /tmp/hhvm.9624.bt.
2017-09-30
20:54 catrope@tin: Synchronized wmf-config/CommonSettings.php: Bump $wgResourceLoaderStorageVersion (T176884 ) (duration: 00m 47s)
00:37 ejegg: increased PayPal Express Checkout cURL timeout to 12 seconds
2017-09-29
22:12 ejegg: updated CiviCRM from 784756b to 59e3e00
21:19 mutante: labstore2003/2004: closed idle screen sessions
20:27 mutante: mwlog1001, oxygen - closing unused screen session
20:20 mutante: install1002/install2002 - closing unused screen sessions (except a tail -f)
20:11 mutante: restbase2001 - closing screen sessions that weren't running anything anymore
19:23 ebernhardson: completed running cirrussearch category moves on commonswiki
18:19 catrope@tin: Synchronized php-1.31.0-wmf.1/includes/libs/CSSMin.php: T176884 (duration: 00m 47s)
18:08 chasemp: remove 'ToolLabs webservice - lighttpd on precise' catchpoint check for Catchpoint
18:07 chasemp: remove labsdb1002 checks on catchpoint (Toolforge)
18:06 chasemp: remove Labs puppetmaster eqiad catchpoint check (for Toolforge)
18:06 chasemp: remove grid-start-precise catchpoint check
18:04 chasemp: update catchpoing test for stream.wikimedia.org to watch https://stream.wikimedia.org/?doc
17:00 ebernhardson: kicking off extra job runners to process jobs moving commonswiki Category pages from general to content serach index
16:37 andrewbogott: re-imaging labvirt1017, 1018 in order to get it on the standard 4.4.0-81-generic kernel
15:52 andrewbogott: re-imaging labvirt1015 in order to get it on the standard 4.4.0-81-generic kernel
13:21 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2070, db2069 and db2042 - T174509 (duration: 00m 47s)
13:18 Amir1: starting a round of cleanup in ores_classification table in enwiki (T159753 )
13:09 elukey: depool mw1265 (hhvm 3.18.5) - disk filled up
12:43 bblack: removed numa=isolate sys entries manually (runtime + future boots) from cp4021
11:53 bblack: cp4021: reboot for numa mode switch
11:44 elukey: re-enable job runner daemons on mw130[8,9]
11:15 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2053 db2046 - T174509 (duration: 00m 48s)
11:05 elukey: precautionary stop of jobrunner/jobchron on the new mw130[8,9] - job queue size rapid increase investigation
09:55 elukey@puppetmaster1001: conftool action : set/pooled=yes; selector: name=mw1309.eqiad.wmnet
09:33 ema: upgrade pybal to 1.14.0 on codfw primaries
08:20 ema: upgrade pybal to 1.14.0 on codfw secondaries
05:53 ema: cp4024 repooled T174891
05:21 ema: powerup cp4024 T174891
00:26 ejegg: updated fundraising tools from 69a9627 to 6d4b6f3
00:16 ejegg: updated CiviCRM 2138869 to 784756b
00:07 mutante: releases1001 - starting Jenkins setup wizard with generated admin pass
00:05 mutante: releases1001 - stopped puppet, manually fixing --prefix=/ci setting for jenkins process, killing it, removing init.d file, starting with systemd, jenkins now up T164030
2017-09-28
23:46 awight@tin: Finished deploy [ores/deploy@42c5663]: Cause ORES service restart (duration: 00m 20s)
23:45 awight@tin: Started deploy [ores/deploy@42c5663]: Cause ORES service restart
22:18 MaxSem: Deployed fix for T176247
21:30 ejegg: updated CiviCRM from 3f48cb0 to 2138869
21:23 XenoRyet: Updated SmashPig from dc62203 to d056a13
19:36 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group2 to wmf.1
19:34 legoktm: mysql:wikiadmin@db1052 [enwiki]> delete from user_groups where ug_user =17629530 limit 1; (T176798 )
19:31 bblack: cp4026 - depool->reboot for bios updates, isolcpus fix
19:28 legoktm: mysql:wikiadmin@db1062 [centralauth]> UPDATE localuser SET lu_attached_method="new" WHERE lu_attached_method is NULL AND lu_name="Rua";
19:14 ejegg: updated fundraising tools 5708cf1 Add 'first_donation_date' column to Silverpop export
19:14 robh: cp4024 initial tests done (takes about 2 hours or so) then prompted for more in depth testing (hit yes, no failures so far) T174891
19:00 bblack: cp4025 - depool->reboot for bios updates, isolcpus fix
18:54 thcipriani@tin: Synchronized php-1.31.0-wmf.1/skins/MinervaNeue/includes/skins/SkinMinerva.php: SWAT: Revision::newFromTitle may return null T176882 (duration: 00m 50s)
18:42 bblack: cp4023 - depool->reboot for bios updates, isolcpus fix
18:37 thcipriani@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Create rollbacker group at viwiktionary T176979 (duration: 00m 50s)
18:26 thcipriani@tin: Synchronized docroot/noc: SWAT: copy squid.php->reverse-proxy.php, squid-labs->reverse-proxy-staging PART II T104148 (duration: 00m 49s)
18:25 thcipriani@tin: Synchronized wmf-config: SWAT: copy squid.php->reverse-proxy.php, squid-labs->reverse-proxy-staging T104148 (duration: 00m 51s)
17:58 arlolra: Updated Parsoid to 2f4b9a8c (T176363 , T176151 , T170832 )
17:52 mutante: librenms - renewed LE TLS cert ..via puppet, after Icinga alerted and accidentally still had "do_acme: false" in Hiera
17:46 arlolra@tin: Finished deploy [parsoid/deploy@84342fe]: Updating Parsoid to 2f4b9a8c (duration: 15m 25s)
17:43 bblack: cp4028 - depool->reboot for bios updates, isolcpus fix
17:40 moritzm: installing apache updates on cobalt
17:32 bblack: cp4022 - depool->reboot for bios updates, isolcpus fix
17:30 arlolra@tin: Started deploy [parsoid/deploy@84342fe]: Updating Parsoid to 2f4b9a8c
17:25 marostegui: Run fixStuckGlobalRename.php --wiki=enwiktionary --logwiki=metawiki "CodeCat" "Rua" on terbium - T176985
17:20 bblack: cp4027 - depool->reboot for bios updates, isolcpus fix
17:08 bblack: cp4021 - depool->reboot for bios update
17:05 robh: bios firmware uploaded and awaiting reboot for application on cp402[1235678 ]
16:48 bd808: Created centralauth.{analytics,web}.db.svc.eqiad.wmflabs Designate CNAME (T176978 )
16:26 robh: cp4024 has had ilom and bios firmware updaed per epsa error checking, now re-running epsa utility T174891
16:09 robh: cp4024 bios update in progress (has been for last 5 minutes), letting it alone for a solid 30 to let it finish
16:01 marostegui: Global rename of CodeCat â Rua - T176985
15:22 hoo: Ran scap pull on mwdebug1001 after experiments with T176844
14:54 dcausse: restarting elastic on relforge servers to reinstall prod ltr plugin
14:51 marostegui: Stop MySQL on db1035 as server is going to be decommissioned - T176931
14:45 moritzm: installing apache updates on puppet mastes (will trigger a few failing puppet runs, but ircecho disabled temporarily)
14:42 marostegui: Remove db1035 from tendril - T176931
14:31 moritzm: upgrading mw1276 (API canary) to HHVM 3.18.5
14:28 moritzm: upgrading mwdebug* to HHVM 3.18.5
13:56 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Remove db1035 from config as it will be decommissioned - T176931 (duration: 00m 49s)
13:55 marostegui@tin: Synchronized wmf-config/db-codfw.php: Remove db1035 from config as it will be decommissioned - T176931 (duration: 00m 50s)
13:40 zeljkof: EU SWAT finished
13:38 zfilipin@tin: Synchronized php-1.31.0-wmf.1/extensions/VisualEditor/modules/ve-mw/init/targets/ve.init.mw.DesktopArticleTarget.init.js: SWAT: ve.init.mw.DesktopArticleTarget: Remove hack for reversed tabs in RTL in Vector (T50017) (duration: 00m 50s)
13:37 moritzm: upgrading mw1262 to mw1265 (canary app servers) to HHVM 3.18.5
13:35 moritzm: uploaded HHVM 3.18.5 for jessie-wikimedia to apt.wikimedia.org
13:26 marostegui: Optimize table on s4 codfw master (db2051), with replication (lag might be generated) - T174509
13:13 ema: bounce pybal on ulsfo primaries to pick up bgp-local-ips config change T103882
13:12 marostegui: Drop  redundant indexes from pagelinks and templatelinks on s4 - T174509
13:10 moritzm: upgrading mw1261 to HHVM 3.18.5
13:08 ema: bounce pybal on ulsfo secondaries to pick up bgp-local-ips config change T103882
13:02 marostegui: Optimize template links and pagelinks on db2053 and db2046 - T174509
13:00 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db2053 db2046 - T174509 (duration: 00m 49s)
12:48 dcausse: restarting elastic on relforge servers to test a new version of the ltr plugin
12:27 ema: reboot cp402[2-6] to disable numa isolation
10:30 gehel: restart elasticsearch on relforge to validate new logging config - T175242
10:09 elukey: incrementally add mw1312-17 api-appservers to serve live traffic (weights will be raised incrementally)
10:00 elukey@puppetmaster1001: conftool action : set/pooled=yes; selector: name=mw1308.eqiad.wmnet
09:59 elukey: added new mediawiki jobrunner - mw1308
09:52 ema: upgrade pybal to 1.14.0 on lvs1009-10
09:49 aaron@tin: Synchronized wmf-config/CommonSettings.php: Enable logging of post-send DB updates (duration: 00m 55s)
09:42 ema: upgrade pybal to 1.14.0 on lvs400[13]
08:00 elukey@puppetmaster1001: conftool action : set/pooled=yes; selector: name=mw1288.eqiad.wmnet
07:27 moritzm: upgrading app servers in deployment-prep to HHVM 3.18.5
07:14 moritzm: installing apache updates on silver/wikitech.wikimedia.org
06:59 moritzm: installing apache updates on graphite* hosts
06:28 elukey: restart varnish backend on cp3032
05:54 marostegui: Optimize tables pagelinks and templatelinks on dbstore2001 (s2) - T174509
05:53 marostegui: Optimize tables pagelinks and templatelinks on dbstore2002 (s2) - T174509
05:43 marostegui: Run maintain-views on labsdb1001, labsdb1003, labsdb1009, labsdb1010, labsdb1011 for hiwikivoyage - T173027
05:38 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db2049 and db2041 to optimize templatelinks and pagelinks tables - T174509 (duration: 00m 47s)
05:29 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2064 db2063 - T174509 (duration: 00m 48s)
05:21 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2076 db2067 db2060 - T174509 (duration: 00m 48s)
05:19 tstarling@tin: Finished scap: updating Vector skin for gerrit 381153 (duration: 20m 39s)
04:59 tstarling@tin: Started scap: updating Vector skin for gerrit 381153
03:57 bblack: ulsfo dns-depooled at ~03:51
03:26 l10nupdate@tin: ResourceLoader cache refresh completed at Thu Sep 28 03:26:18 UTC 2017 (duration 6m 10s)
03:20 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.1) (duration: 15m 44s)
02:42 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.19) (duration: 15m 24s)
00:36 twentyafterfour: upgrading phabricator, expect minimal downtime
2017-09-27
23:39 robh: cp4024 is running hardware testing, leave it alone T174891
23:17 catrope@tin: Synchronized wmf-config/Wikibase.php: Make CirrusSearch default for wbsearchentities on testwikidatawiki T175741 (duration: 00m 48s)
22:34 maxsem@tin: Synchronized php-1.31.0-wmf.1/includes/EditPage.php: https://gerrit.wikimedia.org/r/#/c/381139/1 (duration: 00m 49s)
19:14 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group1 -> wmf.1
19:13 demon@tin: Synchronized php: symlink bump (duration: 00m 47s)
19:10 demon@tin: Synchronized php-1.31.0-wmf.1/maintenance/backup.inc: unbreak dumps (duration: 00m 49s)
18:46 catrope@tin: Finished scap: SWAT: T176762 , T175358 , T172023 , T174719 , and add html5-misnesting to Linter (duration: 22m 55s)
18:39 bblack: re-pooling upload@ulsfo (text@ulsfo remains pooled)
18:23 catrope@tin: Started scap: SWAT: T176762 , T175358 , T172023 , T174719 , and add html5-misnesting to Linter
18:21 ejegg: updated payments-wiki from 6104e3d to c2a3d43
17:40 ejegg: updated SmashPig Adyen settings
17:29 bblack: uploaded nginx-1.13.5-1+wmf1 to stretch-wikimedia, and matching nginx-1.13.5-1+wmf1~jessie1 to jessie-wikimedia
16:27 XioNoX: setting local-as to selected transit BGP sessions - T167840
16:00 ema@neodymium: conftool action : set/pooled=yes; selector: name=cp4026.ulsfo.wmnet
16:00 ema@neodymium: conftool action : set/pooled=yes; selector: name=cp4025.ulsfo.wmnet
15:59 ema@neodymium: conftool action : set/pooled=yes; selector: name=cp4023.ulsfo.wmnet
15:58 ema@neodymium: conftool action : set/pooled=yes; selector: name=cp4022.ulsfo.wmnet
15:54 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Pool db1103 with low weight - T172679 (duration: 00m 47s)
15:51 ema: lvs4002: upgrade pybal to 1.14.0
15:49 ema: lvs4004: upgrade pybal to 1.14.0
15:28 _joe_: uploaded prometheus-statsd-exporter to stretch-wikimedia T175539
15:24 bblack: rebooting cp402[2356] (bnx2x oops)
15:15 moritzm: revoking salt certs for WMCS
15:12 mlitn@tin: Synchronized wmf-config/CommonSettings-labs.php: Remove 3D beta config (duration: 00m 47s)
15:11 mlitn@tin: Synchronized wmf-config/InitialiseSettings-labs.php: Remove 3D beta config (duration: 00m 48s)
15:10 mlitn@tin: Synchronized wmf-config/CommonSettings.php: Load 3D extension (duration: 00m 49s)
15:09 mlitn@tin: Synchronized wmf-config/InitialiseSettings.php: Config 3D to be loaded on test, test2 (duration: 00m 48s)
15:00 mlitn@tin: Finished scap: Enable 3D extension (duration: 37m 09s)
14:58 ema: lvs1007: upgrade pybal to 1.14.0
14:45 elukey: rolling restart of all the Yarn nodemanager daemons on analytics1028-1068
14:43 bblack: cp1008/pinkunicorn upgraded to test build of nginx-1.13.5-1+wmf1
14:28 volans: uploaded cumin_1.2.1-1_amd64.deb to apt.wikimedia.org jessie-wikimedia
14:24 akosiaris: change wtp1001 to wtp1024 weights to 5 from 15 in preparation for deprecation
14:24 akosiaris@puppetmaster1001: conftool action : set/weight=5; selector: name=wtp1024.eqiad.wmnet
14:24 mlitn@tin: Started scap: Enable 3D extension
14:23 akosiaris@puppetmaster1001: conftool action : set/weight=5; selector: name=wtp1023.eqiad.wmnet
14:23 akosiaris@puppetmaster1001: conftool action : set/weight=5; selector: name=wtp1022.eqiad.wmnet
14:23 akosiaris@puppetmaster1001: conftool action : set/weight=5; selector: name=wtp1021.eqiad.wmnet
14:23 akosiaris@puppetmaster1001: conftool action : set/weight=5; selector: name=wtp1020.eqiad.wmnet
14:23 akosiaris@puppetmaster1001: conftool action : set/weight=5; selector: name=wtp1019.eqiad.wmnet
14:23 akosiaris@puppetmaster1001: conftool action : set/weight=5; selector: name=wtp1018.eqiad.wmnet
14:23 akosiaris@puppetmaster1001: conftool action : set/weight=5; selector: name=wtp1017.eqiad.wmnet
14:23 akosiaris@puppetmaster1001: conftool action : set/weight=5; selector: name=wtp1016.eqiad.wmnet
14:23 akosiaris@puppetmaster1001: conftool action : set/weight=5; selector: name=wtp1015.eqiad.wmnet
14:23 akosiaris@puppetmaster1001: conftool action : set/weight=5; selector: name=wtp1014.eqiad.wmnet
14:23 akosiaris@puppetmaster1001: conftool action : set/weight=5; selector: name=wtp1013.eqiad.wmnet
14:23 akosiaris@puppetmaster1001: conftool action : set/weight=5; selector: name=wtp1012.eqiad.wmnet
14:23 akosiaris@puppetmaster1001: conftool action : set/weight=5; selector: name=wtp1011.eqiad.wmnet
14:23 akosiaris@puppetmaster1001: conftool action : set/weight=5; selector: name=wtp1010.eqiad.wmnet
14:23 akosiaris@puppetmaster1001: conftool action : set/weight=5; selector: name=wtp1009.eqiad.wmnet
14:23 akosiaris@puppetmaster1001: conftool action : set/weight=5; selector: name=wtp1008.eqiad.wmnet
14:23 akosiaris@puppetmaster1001: conftool action : set/weight=5; selector: name=wtp1007.eqiad.wmnet
14:22 akosiaris@puppetmaster1001: conftool action : set/weight=5; selector: name=wtp1006.eqiad.wmnet
14:22 akosiaris@puppetmaster1001: conftool action : set/weight=5; selector: name=wtp1005.eqiad.wmnet
14:22 akosiaris@puppetmaster1001: conftool action : set/weight=5; selector: name=wtp1004.eqiad.wmnet
14:22 akosiaris@puppetmaster1001: conftool action : set/weight=5; selector: name=wtp1003.eqiad.wmnet
14:22 akosiaris@puppetmaster1001: conftool action : set/weight=5; selector: name=wtp1002.eqiad.wmnet
14:22 akosiaris@puppetmaster1001: conftool action : set/weight=5; selector: name=wtp1001.eqiad.wmnet
14:17 akosiaris: T175242 restart eventstreams across the fleet to pick up the change in a rolling restart manner with a batch size of 2
14:16 akosiaris: T175242 restart parsoid across the fleet to pick up the change in a rolling restart manner with a batch size of 5
14:03 akosiaris: T175242 restart tilerator, tileratorui, restbase across the fleet to pick up the change in a rolling restart manner with a batch size of 2
13:43 akosiaris: T175242 re-enable puppet across aqs kafka maps maps-test ores restbase restbase-dev sca scb wtp clusters for merging https://gerrit.wikimedia.org/r/#/c/376500/ . Run puppet as well in a batched execution
13:42 anomie@tin: Synchronized php-1.31.0-wmf.1/includes/specials/SpecialWatchlist.php: SWAT: {{gerrit|380968}} Fix watchlist "in the last X hours" display (duration: 00m 48s)
13:42 elukey: raised Hadoop HDFS namenode master daemon max heap size to 6G (prev 4G) on analytics100[12]
13:31 akosiaris: T175242 eventstreams requires manual restart
13:25 akosiaris: T175242 parsoid requires manual restart
13:15 akosiaris: T175242 restbase requires manual restart
13:11 addshore: SWAT done
13:11 addshore@tin: Synchronized php-1.31.0-wmf.1/extensions/TwoColConflict/modules/ext.TwoColConflict.BaseVersionSelector.css: SWAT: Address changes in the label style in OOUI (duration: 00m 46s)
13:09 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Add autopatrolled user group to dty.wikipedia (T176709) (duration: 00m 47s)
13:01 akosiaris: T175242 tilerator and tileratorui need manually restart
12:54 akosiaris: T175242 enabled puppet in aqs kafka maps maps-test selected hosts and ran puppet manually.
12:47 akosiaris: T175242 disable puppet across aqs kafka maps maps-test ores restbase restbase-dev sca scb wtp clusters for merging https://gerrit.wikimedia.org/r/#/c/376500/
12:20 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Add db1103 to the array of hosts (duration: 00m 48s)
12:19 marostegui@tin: Synchronized wmf-config/db-codfw.php: Add db1103 to the array of hosts (duration: 00m 49s)
12:14 moritzm: installing apache updates on einsteinium/tegmen
12:06 hoo@tin: Synchronized wmf-config/InitialiseSettings.php: Enable the ArticlePlaceholder on bnwiki (T176771 ) (duration: 00m 49s)
12:05 hoo@tin: Synchronized wmf-config/InitialiseSettings.php: Enable the ArticlePlaceholder on bnwiki (T176771 ) (duration: 01m 00s)
09:45 marostegui: Stop mysql on db1035 to copy its data to db1103 - T172679
09:42 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1035 to transfer its data to db1103 - T172679 (duration: 00m 48s)
08:55 mobrovac@tin: Finished deploy [changeprop/deploy@7a3dc66]: Use the /page/title/{title}/{revision} end point for revision-visibility-change - T158100 (duration: 01m 26s)
08:53 mobrovac@tin: Started deploy [changeprop/deploy@7a3dc66]: Use the /page/title/{title}/{revision} end point for revision-visibility-change - T158100
08:43 mobrovac@tin: Finished deploy [restbase/deploy@1cc530b]: Introduce the /page/title/{title}/{revision} end point - T158100 (duration: 10m 01s)
08:34 elukey: raise traffic weights to 30 for mw13[19-28] incrementally - T165519
08:33 mobrovac@tin: Started deploy [restbase/deploy@1cc530b]: Introduce the /page/title/{title}/{revision} end point - T158100
08:16 marostegui: Optimize tables pagelinks templatelinks on db2064 and db2063 - T174509
08:16 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db2064 db2063 to optimize templatelinks and pagelinks tables - T174509 (duration: 00m 48s)
07:55 marostegui: Optimize pagelinks and template links tables on  db2076 db2067 db2060 - T174509
07:52 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db2076 db2067 db2060 to optimize templatelinks and pagelinks tables - T174509 (duration: 00m 48s)
07:37 marostegui: Optimize pagelinks and template links tables on db2070 db2069 db2042 db2016 - T174509
07:27 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db2070, db2069 and db2042 to optimize templatelinks and pagelinks tables - T174509 (duration: 00m 48s)
07:17 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2071 and db2072 - T174509 (duration: 00m 47s)
06:57 marostegui: Deploy alter table on s6 codfw - T174509
06:55 marostegui: Deploy alter table on db1052 (enwiki master) - T174509
06:37 marostegui@tin: Synchronized wmf-config/db-codfw.php: Add task number to db2044 line (duration: 00m 48s)
06:37 ema: restart varnish-be on cp3040
06:36 moritzm: installing git security updates (our config is not affected, but still fixing the underlying issue)
06:33 ema: restart varnish-be on cp3041
06:29 marosteg1i: Reboot db2044 after storage failure - T174764
06:15 elukey: restart varnish backend on cp3043
05:41 madhuvishy: service nova-fullstack restart on labnet1001
03:17 l10nupdate@tin: ResourceLoader cache refresh completed at Wed Sep 27 03:17:22 UTC 2017 (duration 6m 22s)
03:11 l10nupdate@tin: scap sync-l10n completed (1.31.0-wmf.1) (duration: 15m 55s)
02:32 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.19) (duration: 08m 42s)
2017-09-26
23:25 thcipriani@tin: Synchronized php-1.31.0-wmf.1/extensions/VisualEditor/modules/ve-mw/ui/tools/ve.ui.MWSignatureTool.js: SWAT: Revert MWSignatureTool case (duration: 00m 48s)
23:20 thcipriani@tin: Synchronized php-1.30.0-wmf.19/resources/src/mediawiki.rcfilters/mw.rcfilters.init.js: SWAT: RCFilters: Log performance data T176652 (duration: 00m 48s)
23:19 thcipriani@tin: Synchronized php-1.31.0-wmf.1/resources/src/mediawiki.rcfilters/mw.rcfilters.init.js: SWAT: RCFilters: Log performance data T176652 (duration: 00m 51s)
22:03 no_justification: gerrit master restart, back momentarily
20:32 mutante: phab1001 - restarted apache after upgrade
20:30 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db2044 - T174764 (duration: 00m 50s)
20:20 mutante: phab1001 (prod phab) - upgrade Apache package
20:15 mutante: phab2001 - install Apache and MariaDB package upgrades
19:16 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group0 to 1.31.0-wmf.1
18:25 mepps: Updated crm from 89b8007 to 3f48cb0
18:00 demon@tin: Finished scap: 1.31.0-wmf.1 bootstrap + testwiki (duration: 47m 56s)
17:12 demon@tin: Started scap: 1.31.0-wmf.1 bootstrap + testwiki
15:50 volans: powecycled cp2012, no ping or ssh console stuck
15:18 XioNoX: starting eqiad frack switch to new infra - T174218
15:06 gehel: restarting blazegraph on all wdqs nodes for heap resize - T175919
14:47 mobrovac@tin: Finished deploy [restbase/deploy@f989dd9]: Remove the old mobileapps module completely - T169940 (duration: 20m 35s)
14:36 marostegui: Optimize tables pagelinks and templatelinks on db2071 and db2072 from s1 - T174509
14:27 mobrovac@tin: Started deploy [restbase/deploy@f989dd9]: Remove the old mobileapps module completely - T169940
14:26 hoo: mwscript refreshLinks.php --wiki elwiki --namespace 0 on terbium has finished (T151717 )
14:12 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db2071 and db2072 to optimize templatelinks and pagelinks tables - T174509 (duration: 00m 44s)
13:39 marostegui: Deploy alter table on s6 master (with replication enable, so lag will be generated) db2028 - T163979
13:25 hashar@tin: Synchronized wmf-config/filebackend.php: Expose Thumbor swift username - T144479 (duration: 00m 44s)
13:24 hashar@tin: Synchronized wmf-config/CommonSettings.php: Enable asia-specific Navigation Timing metric - T169522 (duration: 00m 47s)
13:22 godog: upgrade grafana to 4.5.2 on labmon1001 - T175980
13:15 hashar@tin: Synchronized wmf-config/CommonSettings.php: Disable RelatedArticles instrumentation on all wikis - T174944 (duration: 00m 45s)
13:02 marostegui: Drop redundant indexes from s6 - T174509
13:00 elukey: add mw132[7,8] to live traffic (new appservers) - weights will be increased incrementally from 5 to 20
12:44 gehel: restarting blazegraph on wdqs1004 / wdqs2001 for heap resive - T175919
12:27 marostegui: Drop redundant indexes from s2 - T174509
12:12 marostegui: Drop redundant indexes from enwiki on eqiad - T174509
12:03 kartik@tin: Finished deploy [cxserver/deploy@aa232ee]: Update cxserver, registry reconstruction (duration: 03m 03s)
12:00 kartik@tin: Started deploy [cxserver/deploy@aa232ee]: Update cxserver, registry reconstruction
11:58 moritzm: uploaded prometheus-apache-exporter 0.3-1+deb9u1 to apt.wikimedia.org/stretch-wikimedia
11:57 marostegui: Drop redundant indexes from enwiki on codfw - T174509
11:49 moritzm: uploaded prometheus-hhvm-exporter 0.3.1+deb9u1 to apt.wikimedia.org/stretch-wikimedia
11:30 kartik@tin: (no justification provided)
11:29 kartik@tin: Finished deploy [cxserver/deploy@f1d4851]: Update cxserver, registry reconstruction (duration: 00m 52s)
11:28 kartik@tin: Started deploy [cxserver/deploy@f1d4851]: Update cxserver, registry reconstruction
10:52 volans: uploaded cumin_1.1.1-1_amd64.deb to apt.wikimedia.org jessie-wikimedia
10:34 joal@tin: Finished deploy [analytics/refinery@5d806c6]: Regular analytics deploy (duration: 08m 00s)
10:26 joal@tin: Started deploy [analytics/refinery@5d806c6]: Regular analytics deploy
10:20 _joe_: rebooting copper (again)
09:12 elukey: add mw132[4,5,6] to live traffic (new appservers) - weights will be increased incrementally from 5 to 20
09:10 _joe_: rebooting copper to verify docker setup is correct
08:46 moritzm: installing apache updates on mw* app servers
08:44 godog: run puppet on ms-fe* to reduce conntrack generated by swift internal clients towards backends - T173731
08:24 godog: roll-restart thumbor to upgrade to 1.5 - T173804
08:09 godog: roll-restart swift-proxy to apply https://gerrit.wikimedia.org/r/#/c/380483/ - T161535
08:08 marostegui: Drop old temporary tar.gz files from dbstore1001 to get back some disk space
08:07 marostegui: Drop table trackbacks from s3 - T175051
07:48 mattflaschen@tin: Synchronized wmf-config/CommonSettings-labs.php: RCFilters: Beta Cluster only (duration: 00m 46s)
07:46 marostegui: Drop trackbacks table from s7 - T175051
07:28 marostegui: Drop trackbacks table from s6 - T175051
07:16 moritzm: installing perl security updates
06:45 _joe_: restarting varnish backend on cp3033, throwing 503s
06:29 mobrovac@tin: Finished deploy [cpjobqueue/deploy@17c3833]: (no justification provided) (duration: 00m 28s)
06:28 mobrovac@tin: Started deploy [cpjobqueue/deploy@17c3833]: (no justification provided)
06:18 marostegui: Drop table trackbacks from s4 - T175051
06:06 marostegui: Drop table trackbacks from s5 (only exists on dewiki) - T175051
05:48 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2047 - T176573 (duration: 00m 44s)
05:15 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Remove old db1055 comments (duration: 00m 46s)
02:29 l10nupdate@tin: ResourceLoader cache refresh completed at Tue Sep 26 02:29:31 UTC 2017 (duration 5m 52s)
02:23 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.19) (duration: 07m 26s)
2017-09-25
23:58 ejegg: updated payments-wiki from d2ce697 to 6104e3d
23:15 mutante: gerrit2001 reinstalled with stretch, revoked old puppet cert, accepted new puppet cert, initial run that will do base and all the gerrit things at once.. (T168562 )
22:53 mutante: gerrit2001 - PXE booting
22:34 mutante: gerrit2001 - schedule downtime for reinstall with strech, reboot imminent
20:57 hoo: Manually pruned the sites and site_identifiers table on hiwikivoyage, than ran populateSitesTable.php. (T173030 )
20:37 arlolra: Updated Parsoid to 4230c27b (T176425 , T173029 )
20:30 arlolra@tin: Finished deploy [parsoid/deploy@376faad]: Updating Parsoid to 4230c27b (duration: 16m 52s)
20:13 arlolra@tin: Started deploy [parsoid/deploy@376faad]: Updating Parsoid to 4230c27b
19:58 XioNoX: renumbering ams BGP communities - T167840
19:25 robh@puppetmaster1001: conftool action : set/pooled=yes; selector: name=cp4021.ulsfo.wmnet
19:02 robh: cp4021 coming down for memory swap
19:02 robh@puppetmaster1001: conftool action : set/pooled=no; selector: name=cp4021.ulsfo.wmnet
18:54 MaxSem: started populating ip_changes on group2 wikis
18:38 demon@tin: Synchronized scap/plugins/clean.py: no-op/consistency (duration: 00m 45s)
18:22 ejegg: updated CiviCRM from 45c56cf to 89b8007
17:58 demon@tin: Pruned MediaWiki: 1.30.0-wmf.15 (duration: 02m 46s)
17:07 gehel@tin: Finished deploy [wdqs/wdqs@3eec185]: Deploy new Blazegraph binary & GUI update (duration: 01m 51s)
17:05 gehel@tin: Started deploy [wdqs/wdqs@3eec185]: Deploy new Blazegraph binary & GUI update
17:02 mobrovac@tin: Synchronized wmf-config/InitialiseSettings.php: Disable the updateBetaFeaturesUserCounts logging channel - T175637 (duration: 00m 46s)
16:37 demon@tin: Synchronized php-1.30.0-wmf.19/maintenance/getConfiguration.php: getConfiguration: Don't bail when a valid variable is set null (duration: 00m 47s)
15:57 urandom: T169940 : Converting restbase-ng data tables to size-tiered compaction
14:39 gehel: dropping badly named apifeature index from cirrus elasticsearch codfw/eqiad - T176430
14:31 gehel: rolling restart of logstash for config change - T176430
14:13 gehel: rolling restart of logstash for config change - T176430
13:46 zeljkof: EU SWAT finished
13:45 zfilipin@tin: Synchronized static/images/project-logos/hiwiki.png: SWAT: Make hiwiki logo a little bit bigger (T175721) (duration: 00m 46s)
13:37 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Import sources on hr.wikibooks (T176320) (duration: 00m 46s)
13:30 addshore@tin: Synchronized php-1.30.0-wmf.19/extensions/WikimediaEvents/WikimediaEventsHooks.php: SWAT: gerrit:379749 onBeforeInitializeWMDECampaign, fix early bail condition T174948 (duration: 00m 46s)
13:18 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Add several rights to eliminators in fawiki (T176553) (duration: 00m 46s)
12:52 moritzm: removed salt master from neodymium
12:38 moritzm: revoked all salt keys on neodymium
11:58 godog: powercycle ms-be2020 - blk_update_request: I/O error, dev sdb, sector in consol
11:54 moritzm: removing salt-minion/salt-common from production
11:47 Dereckson: Repopulate sites tables for Wikidata clients (T173013 )
11:47 Dereckson: Create new database and tables for hi.wikivoyage (T173013 )
11:45 dereckson@tin: Synchronized wmf-config/interwiki.php: Interwiki map update (+hiwikivoyage and other changes previous month, T176572 ) (duration: 00m 46s)
11:39 marostegui: Run redact_sanitarium for hiwikivoyage on db1095 - T173027
11:39 moritzm: upgrading tor to latest stable release (0.3.1.7)
11:37 dereckson@tin: Synchronized wmf-config/InitialiseSettings.php: Initial configuration for hi.wikivoyage.org (T173013 ) (duration: 00m 46s)
11:37 dereckson@tin: Synchronized static/images/project-logos/: Logos for hi.wikivoyage.org (T173013 ) (duration: 00m 45s)
11:34 dereckson@tin: rebuilt wikiversions.php and synchronized wikiversions files: +hiwikivoyage (T173013 )
11:32 dereckson@tin: Synchronized dblists: Add hiwikivoyage to the set (T173013 ) (duration: 00m 46s)
10:42 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Restore db1055 weight (duration: 00m 46s)
10:14 elukey: add mw1323 to live traffic (mw appserver) - traffic weights will go from 5 to 20 incrementally
09:26 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1055 weight and remove main traffic from special slaves (duration: 00m 46s)
09:07 marostegui: Add 50GB to db1036 /srv partition - T176311
09:05 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1036 and full repool db1101 - T176311 (duration: 00m 46s)
08:49 moritzm: uploaded php-luasandbox 2.0.14, hhvm-tidy 0.1.3 and php-wikidiff2 1.4.1 for stretch-wikimedia to apt.wikimedia.org
08:36 mobrovac@tin: Finished deploy [restbase/deploy@ab24f70]: Switch mobile-sections to next-gen storage and stop storing the current-day feed in Cassandra - T169940 T176233 (duration: 10m 03s)
08:26 mobrovac@tin: Started deploy [restbase/deploy@ab24f70]: Switch mobile-sections to next-gen storage and stop storing the current-day feed in Cassandra - T169940 T176233
08:16 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1055 weight (duration: 00m 45s)
07:56 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase traffic for db1101 - T176311 (duration: 00m 46s)
07:42 moritzm: uploaded hhvm 3.18.2+dfsg-1+wmf5+deb9u1 for stretch-wikimedia to apt.wikimedia.org
07:39 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1055 weight (duration: 00m 47s)
07:37 elukey: add mw132[0,2] to live traffic (mw appservers) - traffic weights will go from 5 to 20 incrementally
06:38 moritzm: installing apache security updates on logstash*
02:36 l10nupdate@tin: ResourceLoader cache refresh completed at Mon Sep 25 02:36:48 UTC 2017 (duration 6m 42s)
02:30 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.19) (duration: 07m 17s)
2017-09-24
14:23 jynus: restarting db2047 for kernel upgrade
14:16 jynus@tin: Synchronized wmf-config/db-codfw.php: Depool db2047 (duration: 00m 48s)
off: restarted ircecho on einsteinium
2017-09-23
2017-09-22
23:36 mutante: gerrit2001 - rebooting .. wave a dead chicken http://www.catb.org/jargon/html/W/wave-a-dead-chicken.html
22:56 mutante: gerrit2001 - trying to manually start gerrit again, as opposed to puppet doing it.. debugging gerrit-ssh issue there, cobalt still untouched
22:34 mutante: gerrit2001 - stopping gerrit with gerrit.sh stop, letting puppet start it again
22:29 mutante: gerrit2001 - systemd says gerrit.service is failed. gerrit.sh start says "Already Running!!" :p - cobalt is fine
18:26 demon@tin: Pruned MediaWiki: 1.30.0-wmf.18 [keeping static files] (duration: 01m 29s)
15:21 hashar: Restarted Jenkins. Out of memory)
14:49 jynus@tin: Synchronized wmf-config/db-eqiad.php: repool db1101 with low weight (duration: 00m 47s)
14:43 moritzm: uploaded php-luasandbox build for src:php5.5 (required for CI tests on jessie)
14:42 moritzm: updated tor packages to 0.3.1.7 (new stable series)
13:49 elukey: mw1321 (new appserver) serving traffic (going to increase its weight up to 20)
12:02 hashar: apt-get upgrade on contint1001 / contint2001
09:42 elukey: mw1319 (new appserver) serving traffic (going to increase its weight up to 20)
09:14 jynus: stop mariadb at db1055 for upgrade and maintenance
06:32 moritzm: installing emacs security updates on trusty (Debian already fixed)
2017-09-21
23:56 ejegg: updated payments-wiki from 7ed7243 to d2ce697
23:23 ebernhardson@tin: Synchronized php-1.30.0-wmf.19/extensions/WikimediaEvents/extension.json: T175047 : Stop delivering search relevance survey javascript (duration: 00m 46s)
23:18 ebernhardson@tin: Synchronized wmf-config: T175047 : Turn off human search relevance survey (duration: 00m 48s)
23:16 ebernhardson@tin: Synchronized wmf-config/InitialiseSettings.php: T175047 : Turn off human search relevance survey (duration: 00m 46s)
22:59 jynus@tin: Synchronized wmf-config/db-eqiad.php: depool db1055 (duration: 00m 46s)
22:12 volans: uploaded cumin_1.1.0-1_amd64.deb to apt.wikimedia.org jessie-wikimedia
22:10 ejegg: updated payments-wiki from 9da8482 to 7ed7243
21:07 demon@tin: Synchronized wmf-config/CommonSettings.php: extdist settings (duration: 00m 46s)
21:04 ejegg: updated CiviCRM from fc56901 to 45c56cf
19:32 demon@tin: Synchronized wmf-config/InitialiseSettings.php: adjust logging on csp reports (duration: 00m 46s)
19:20 demon@tin: Synchronized php-1.30.0-wmf.19/extensions/VisualEditor/ApiVisualEditor.php: fix user/wgUser mixup (duration: 00m 46s)
19:16 gehel: upgrade to nodejs 6.11 on maps servers (including restart of tilerator / kartotherian) - T171707
19:16 bblack@neodymium: conftool action : set/pooled=yes; selector: name=cp4026.*
19:04 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group2 to wmf.19
18:56 urandom: T171772 : Applying locally hacked Prometheus exporter config to RESTBase dev Cassandra instances
18:44 thcipriani@tin: Synchronized php-1.30.0-wmf.19/resources/src/mediawiki.rcfilters: SWAT: WLFilters: Do not hide .watchlistDetails while loading T176300 RCFilters: Make the interface not jump around while loading (duration: 00m 49s)
18:41 urandom: T171772 : Restarting Cassandra restbase-dev1004-a to apply locally hacked Prometheus exporter config
18:34 thcipriani@tin: Synchronized php-1.30.0-wmf.19/extensions/WikimediaEvents/modules/ext.wikimediaEvents.searchSatisfaction.js: SWAT: Turning off Explore Similar AB test T175649 (duration: 00m 49s)
18:31 thcipriani@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable Print instrumentation T176341 (duration: 00m 52s)
14:56 moritzm: uploaded php-defaults and php-redis built against src:php5.5 to component/ci for jessie-wikimedia
13:45 gehel: upgrade to nodejs 6.11 on the full maps-test cluster - T171707
13:12 zeljkof: EU SWAT finished
13:09 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Revert "Switch elasticsearch active cluster to codfw" (duration: 00m 48s)
13:09 zfilipin@tin: Synchronized wmf-config/CommonSettings.php: SWAT: Revert "Switch elasticsearch active cluster to codfw" (duration: 00m 48s)
13:07 zfilipin@tin: Synchronized tests/cirrusTest.php: SWAT: Revert "Switch elasticsearch active cluster to codfw" (duration: 00m 49s)
12:55 _joe_: stopped long-running runjobs for refreshlinks on commons.
12:50 mobrovac@tin: Finished deploy [restbase/deploy@bc02191]: produce a diff for the mobile proxy (duration: 18m 34s)
12:32 mobrovac@tin: Started deploy [restbase/deploy@bc02191]: produce a diff for the mobile proxy
11:59 moritzm: upgrading apache on deployment servers and script runners
11:40 moritzm: upgrading apache on canary app servers
11:17 moritzm: installing apache security updates (our configurations are not affected, the upload of 2.4.10-10+deb8u11+wmf1 is mostly for the benefit of whatever people are doing in Cloud VPS, but we should still fix the underlying bug in production as well)
11:14 dcausse: reindexing all hebrew wikis to pickup the new hebmorph analyzer
11:09 dcausse: reindexing ilwikimedia in codfw to pickup and test new hebmorph analyzer
10:26 akosiaris: upload docker-ce_17.06.2~ce-0~debian_amd64.deb to apt.wikimedia.org jessie-wikimedia/thirdparty/ci T175293
09:59 dcausse: reindexing group1 wikis T176397
09:54 dcausse: reindexing group0 wikis T176397
08:35 moritzm: unbreak deployment-puppetmaster02 in deployment-prep (broken by unattended-upgrades update of apache T159254 )
08:35 akosiaris: pool all of wtp1025 to wtp1048 T165520
08:35 akosiaris@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wtp1044.eqiad.wmnet
08:35 akosiaris@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wtp1043.eqiad.wmnet
08:34 akosiaris@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wtp1042.eqiad.wmnet
08:34 akosiaris@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wtp1041.eqiad.wmnet
08:34 akosiaris@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wtp1040.eqiad.wmnet
08:34 akosiaris@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wtp1039.eqiad.wmnet
08:34 akosiaris@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wtp1038.eqiad.wmnet
08:34 akosiaris@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wtp1037.eqiad.wmnet
08:33 akosiaris@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wtp1036.eqiad.wmnet
08:33 akosiaris@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wtp1035.eqiad.wmnet
08:33 akosiaris@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wtp1034.eqiad.wmnet
08:33 akosiaris@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wtp1033.eqiad.wmnet
08:33 akosiaris@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wtp1032.eqiad.wmnet
08:33 akosiaris@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wtp1031.eqiad.wmnet
08:32 akosiaris@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wtp1030.eqiad.wmnet
08:32 akosiaris@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wtp1029.eqiad.wmnet
08:32 akosiaris@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wtp1028.eqiad.wmnet
08:32 akosiaris@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wtp1027.eqiad.wmnet
08:32 akosiaris@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wtp1026.eqiad.wmnet
08:32 akosiaris@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wtp1025.eqiad.wmnet
08:31 akosiaris@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wtp1025.eqiad.wmnet
08:31 akosiaris@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wtp1025.eqiad.wmnet
08:31 akosiaris@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wtp1025.eqiad.wmnet
08:31 akosiaris@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wtp1025.eqiad.wmnet
08:31 akosiaris@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wtp1025.eqiad.wmnet
08:30 akosiaris@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wtp1025.eqiad.wmnet
08:30 akosiaris@puppetmaster1001: conftool action : set/weight=10; selector: name=wtp1025.eqiad.wmnet
08:25 mobrovac@tin: Finished deploy [restbase/deploy@9fd380d]: Log only mismatches during updates, not live requests (duration: 03m 18s)
08:22 mobrovac@tin: Started deploy [restbase/deploy@9fd380d]: Log only mismatches during updates, not live requests
08:21 mobrovac@tin: Finished deploy [restbase/deploy@9fd380d]: Log only mismatches during updates, not live requests (duration: 00m 58s)
08:20 mobrovac@tin: Started deploy [restbase/deploy@9fd380d]: Log only mismatches during updates, not live requests
08:19 mobrovac@tin: Finished deploy [restbase/deploy@9fd380d]: Log only mismatches during updates, not live requests (duration: 05m 22s)
08:14 mobrovac@tin: Started deploy [restbase/deploy@9fd380d]: Log only mismatches during updates, not live requests
08:08 mobrovac@tin: Finished deploy [restbase/deploy@3e9bd9f]: New storage schema for mobile-sections - T169940 (duration: 10m 28s)
07:57 mobrovac@tin: Started deploy [restbase/deploy@3e9bd9f]: New storage schema for mobile-sections - T169940
07:31 _joe_: restarted aphlict on phab1001
07:27 mobrovac@tin: Finished deploy [restbase/deploy@3e9bd9f]: New storage schema for mobile-sections, canary deploy for schema creation, take 2 - T169940 (duration: 00m 30s)
07:27 mobrovac@tin: Started deploy [restbase/deploy@3e9bd9f]: New storage schema for mobile-sections, canary deploy for schema creation, take 2 - T169940
07:09 ema: bounce pybal on lvs1003 to clear stale alert T176388
06:59 ema: bounce pybal on lvs1006 to clear stale alert
06:55 ema: bounce pybal on lvs1009 to clear stale alert
03:24 bblack: depooled cp4026
03:12 l10nupdate@tin: ResourceLoader cache refresh completed at Thu Sep 21 03:12:41 UTC 2017 (duration 6m 36s)
03:06 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.19) (duration: 14m 44s)
02:29 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.18) (duration: 08m 50s)
00:41 eileen: update civicrm from da833ac to fc56901
00:23 ebernhardson@tin: Synchronized php-1.30.0-wmf.18/extensions/WikimediaEvents/modules/ext.wikimediaEvents.searchSatisfaction.js: Fix schema name for search satisfaction error logging (duration: 00m 53s)
00:18 ebernhardson@tin: Synchronized php-1.30.0-wmf.19/extensions/WikimediaEvents/modules/ext.wikimediaEvents.searchSatisfaction.js: Fix schema name for search satisfaction error logging (duration: 00m 49s)
00:09 catrope@tin: Synchronized wmf-config/InitialiseSettings.php: Enable jQuery 3 on commons (T124742 ) (duration: 00m 49s)
00:02 catrope@tin: Synchronized php-1.30.0-wmf.19/resources/src/mediawiki.rcfilters/: Lazy-load the RCFilters menu (T176250 ) (duration: 00m 48s)
2017-09-20
23:49 catrope@tin: Synchronized php-1.30.0-wmf.18/extensions/WikimediaEvents/: Log JS errors during search satisfaction tests (duration: 00m 48s)
23:47 catrope@tin: Synchronized php-1.30.0-wmf.19/extensions/WikimediaEvents/: Log JS errors during search satisfaction tests (duration: 00m 49s)
23:40 mutante: phabricator: aphlict (notification service) now running on prod server
23:36 mutante: phab1001 - re-enable puppet, run after follow-up fix, adding notification server config
23:31 catrope@tin: Synchronized wmf-config/InitialiseSettings.php: Add throttle rule for John Michael Kohler Art Center (T176287 ) (duration: 00m 48s)
23:29 catrope@tin: Synchronized wmf-config/InitialiseSettings.php: Enable user email blacklist on meta (T174694 ) (duration: 00m 48s)
23:21 catrope@tin: Synchronized dblists/categories-rdf.dblist: Add mlwiki to categories-rdf.dblist (duration: 00m 48s)
23:14 mutante: phab2001 - deluser aphlict, delgroup aphlict, try to let puppet create it again, still fails, just won't create the group
23:13 catrope@tin: Synchronized wmf-config/InitialiseSettings.php: Add patroller group on metawiki (T176079 ) (duration: 00m 48s)
23:11 mutante: phab2001 - groupadd aphlict,temp adduser aphlict to debug user creation issue
22:22 ebernhardson: elasticsearch eqiad completely recovery, all green. Returning recovery max_bytes_per_sec to 20mb
21:54 mutante: phab: disable puppet on phab1001 for a minute, apply notification server change on phab2001
21:45 eileen: update civicrm from 0fb4676 to da833ac
21:09 bblack@neodymium: conftool action : set/pooled=no; selector: name=cp4015.ulsfo.wmnet
20:58 bblack@neodymium: conftool action : set/pooled=no; selector: name=cp4007.ulsfo.wmnet
20:40 bsitzmann@tin: Finished deploy [mobileapps/deploy@e23bf66]: Update mobileapps to d46860e (T175759 T176263 ) (duration: 06m 05s)
20:36 bblack@neodymium: conftool action : set/pooled=no; selector: name=cp4014.ulsfo.wmnet
lunch: increase max recovery bytes/s to 80mb in eqiad elasitcsearch cluster
20:34 bsitzmann@tin: Started deploy [mobileapps/deploy@e23bf66]: Update mobileapps to d46860e (T175759 T176263 )
20:33 demon@tin: Synchronized php-1.30.0-wmf.19/includes/diff/DifferenceEngine.php: fix some warnings (duration: 00m 51s)
19:55 bblack@neodymium: conftool action : set/pooled=no; selector: name=cp4016.ulsfo.wmnet
19:54 eevans@tin: Finished deploy [cassandra/metrics-collector@df909a1] (dev-cluster): Deploy 4.1.0 artifact to dev cluster (duration: 00m 10s)
19:54 eevans@tin: Started deploy [cassandra/metrics-collector@df909a1] (dev-cluster): Deploy 4.1.0 artifact to dev cluster
19:53 bblack@neodymium: conftool action : set/pooled=no; selector: name=cp4006.ulsfo.wmnet
19:48 eevans@tin: Finished deploy [cassandra/metrics-collector@df909a1]: Pushing out 4.1.0 release (duration: 00m 47s)
19:47 eevans@tin: Started deploy [cassandra/metrics-collector@df909a1]: Pushing out 4.1.0 release
19:39 bblack@neodymium: conftool action : set/pooled=no; selector: name=cp4008.ulsfo.wmnet
19:26 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group1 to wmf.19
19:24 demon@tin: Synchronized php: symlink swap for wmf.19 (duration: 00m 49s)
19:02 catrope@tin: Finished scap: Patches for T176302 , T175962 , T173533 (duration: 21m 29s)
18:40 catrope@tin: Started scap: Patches for T176302 , T175962 , T173533
17:29 hoo: Started mwscript refreshLinks.php --wiki elwiki --namespace 0 in a screen on terbium. Can be KILLed at any time, if needed.
17:29 hoo: Started mwscript refreshLinks.php --wiki elwiki --namespace 0 in a screen on elwiki. Can be KILLed at any time, if needed.
17:13 hoo: Ran mwscript refreshLinks.php --wiki elwiki --namespace 0 -e 30 for testing purposes
17:06 ejegg: removed limit from donation queue consumer
17:05 otto@tin: Finished deploy [eventstreams/deploy@e62ab64]: (no justification provided) (duration: 04m 44s)
17:01 otto@tin: Started deploy [eventstreams/deploy@e62ab64]: (no justification provided)
16:53 madhuvishy: Upgraded and restarted apache2 on labs-puppetmaster.wikimedia.org
16:45 gehel@puppetmaster1001: conftool action : set/pooled=yes; selector: dc=eqiad,cluster=elasticsearch
16:42 gehel: rolling restart of logstash ingesters after cirrus eqiad recovery
16:34 gehel: elasticsearch eqiad is finally recovering, after the restart of elastic1051
16:34 gehel: cleanup jar hell and restart elastic1051
16:19 gehel: starting elasticsearch on all nodes on eqiad
16:10 gehel: restarting elasticsearch masters on eqiad (elastic1030, 36 and 40)
16:04 gehel@puppetmaster1001: conftool action : set/pooled=no; selector: dc=eqiad,cluster=elasticsearch
16:03 gehel: depooling all nodes in elasticsearch eqiad to investigate failed cluster restart (traffic is already directed to codfw)
15:46 ejegg: limited donations queue consumer to 100 per run
15:36 ejegg: updated payments-wiki from 896f16a to 9da8482
15:36 gehel: upgrading elasticsearch plugins on elasticsearch eqiad, including cold restart of the cluster - T173231
15:33 ejegg: limited donations queue consumer to 200 per run
14:03 akosiaris@tin: Synchronized wmf-config/InitialiseSettings.php: Whitelist wtp10[25-48] for Linter (duration: 00m 49s)
13:42 otto@tin: Synchronized /srv/mediawiki-staging/wmf-config/CommonSettings-labs.php: (no justification provided) (duration: 00m 49s)
13:24 ema: begin rolling codfw varnish-be restarts, 20m spacing
13:20 dereckson@tin: Synchronized wmf-config/CommonSettings.php: Switch elasticsearch active cluster to codfw (Gerrit:378850 , 3/3) (duration: 00m 48s)
13:18 dereckson@tin: Synchronized wmf-config/InitialiseSettings.php: Switch elasticsearch active cluster to codfw (Gerrit:378850 , 2/3) (duration: 00m 49s)
13:17 dereckson@tin: Synchronized tests/cirrusTest.php: Switch elasticsearch active cluster to codfw (Gerrit:378850 , 1/3, no-op in prod) (duration: 00m 49s)
12:51 ema: restarted varnish-be on cp4028
12:28 gehel: upgrading nodejs to 6.11 on maps-test2003 for testing - T171707
12:19 ema: restarted varnish-be on cp4027
12:18 moritzm: uploaded apache 2.4.10-10+deb8u11+wmf1 to jessie-wikimedia (rebuild of latest security update with our local patches on top)
12:08 hoo@tin: Synchronized wmf-config/Wikibase-production.php: Enable statement usage tracking on elwiki (T151717 ) (duration: 00m 49s)
11:58 mobrovac@tin: Finished deploy [restbase/deploy@dea2b41]: New storage schema for mobile-sections, canary deploy for schema creation - T169940 (duration: 07m 20s)
11:56 ema: restarted varnish-be on cp4018
11:51 mobrovac@tin: Started deploy [restbase/deploy@dea2b41]: New storage schema for mobile-sections, canary deploy for schema creation - T169940
11:49 jynus: stopping replication on db1101 for faster repartition work T176311
11:45 jynus@tin: Synchronized wmf-config/db-eqiad.php: depool db1101 (duration: 00m 58s)
11:34 ema: restarted varnish-be on cp4010
11:25 mobrovac@tin: Finished deploy [restbase/deploy@dea2b41]: (no justification provided) (duration: 00m 02s)
11:25 mobrovac@tin: Started deploy [restbase/deploy@dea2b41]: (no justification provided)
11:03 moritzm: rolling out new debdeploy release across the fleet
11:02 mobrovac@tin: Finished deploy [cassandra/metrics-collector@d0169ee]: (no justification provided) (duration: 00m 19s)
11:02 mobrovac@tin: Started deploy [cassandra/metrics-collector@d0169ee]: (no justification provided)
11:02 ema: restarted varnish-be on cp4008
11:02 mobrovac@tin: Finished deploy [cassandra/metrics-collector@d0169ee]: (no justification provided) (duration: 00m 04s)
11:01 mobrovac@tin: Started deploy [cassandra/metrics-collector@d0169ee]: (no justification provided)
11:01 mobrovac@tin: Finished deploy [cassandra/metrics-collector@d0169ee]: (no justification provided) (duration: 00m 03s)
11:01 mobrovac@tin: Started deploy [cassandra/metrics-collector@d0169ee]: (no justification provided)
11:00 mobrovac@tin: Finished deploy [cassandra/metrics-collector@d0169ee]: (no justification provided) (duration: 00m 04s)
11:00 mobrovac@tin: Started deploy [cassandra/metrics-collector@d0169ee]: (no justification provided)
09:55 bblack: restarted varnish-be on cp3042
09:29 bblack: restarted varnish-be on cp1052
09:23 ema: restarted varnish-be on cp3043
09:21 ema: restarted varnish-be on cp3041
09:19 ema: restarted varnish-be on cp3032
09:14 ema: restarted varnish-be on cp3040
08:42 bblack: restarted varnish-be on cp1053
08:41 bblack: restarted varnish-be on cp1068
08:40 bblack: restarted varnish-be on cp1054
08:40 bblack: restarted varnish-be on cp1065
07:11 ema: cp3030 varnish-be restart due to 503s
03:16 l10nupdate@tin: ResourceLoader cache refresh completed at Wed Sep 20 03:16:25 UTC 2017 (duration 7m 31s)
03:08 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.19) (duration: 15m 57s)
02:31 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.18) (duration: 08m 06s)
00:55 bblack: cp1066 - backend restart, mailbox lag, text
00:54 bblack: cp4016 - backend restart, mailbox lag, text
2017-09-19
23:57 thcipriani@tin: Synchronized php-1.30.0-wmf.18/extensions/VisualEditor/ApiVisualEditor.php: SWAT: ApiVisualEditor: Fix checkbox label message handling with Message objects T176249 (duration: 00m 49s)
23:56 thcipriani@tin: Synchronized php-1.30.0-wmf.19/extensions/VisualEditor/ApiVisualEditor.php: SWAT: ApiVisualEditor: Fix checkbox label message handling with Message objects T176249 (duration: 00m 48s)
23:46 thcipriani@tin: Synchronized php-1.30.0-wmf.18/extensions/WikimediaEvents/modules/ext.wikimediaEvents.humanSearchRelevance.js: SWAT: Javascript timestamps are in ms, not s T174106 (duration: 00m 48s)
23:45 thcipriani@tin: Synchronized php-1.30.0-wmf.19/extensions/WikimediaEvents/modules/ext.wikimediaEvents.humanSearchRelevance.js: SWAT: Javascript timestamps are in ms, not s T174106 (duration: 00m 50s)
23:39 thcipriani@tin: Synchronized php-1.30.0-wmf.18/includes/specials/SpecialRecentchangeslinked.php: SWAT: SpecialRecentchangeslinked: Unconditionally join on the page table T176228 (duration: 00m 49s)
23:36 thcipriani@tin: Synchronized php-1.30.0-wmf.19/includes/specials/SpecialRecentchangeslinked.php: SWAT: SpecialRecentchangeslinked: Unconditionally join on the page table T176228 (duration: 00m 49s)
23:29 thcipriani@tin: Synchronized php-1.30.0-wmf.18/includes/page/PageArchive.php: SWAT: Do not attempt to copy to ip_changes in PageArchive class (duration: 00m 49s)
23:27 thcipriani@tin: Synchronized php-1.30.0-wmf.19/includes/page/PageArchive.php: SWAT: Do not attempt to copy to ip_changes in PageArchive class (duration: 00m 50s)
23:17 thcipriani@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable jQuery 3 on meta.wikimedia.org T124742 (duration: 00m 50s)
22:58 eileen: manually running Omnirecipient.load from 2017-09-18 with accidentally low throttle setting
22:22 mutante: gerrit2001 - stopped gerrit with init.d script. moved init.d script to /root, started with systemctl
22:21 eileen: update civicrm from 9677cf7 to 0fb4676
22:15 mutante: gerrit2001 /bin/sh /var/lib/gerrit2/review_site/bin/gerrit.sh start
22:02 mutante: gerrit2001 - systemctl start gerrit
21:51 volans: disabled puppet on labpuppetmaster1001, testing cumin T175712
21:27 MaxSem: on terbium: mwscript populateIpChanges.php --wiki=enwikivoyage --force
21:23 bblack@neodymium: conftool action : set/pooled=yes; selector: cluster=cache_text,name=cp4008.ulsfo.wmnet
21:23 bblack@neodymium: conftool action : set/pooled=yes; selector: cluster=cache_text,name=cp4016.ulsfo.wmnet
21:22 bblack@neodymium: conftool action : set/pooled=yes; selector: cluster=cache_text,dc=eqiad,name=cp1066.*
21:18 MaxSem: re-ran populateIpChanges.php on group0 wikis
21:15 maxsem@tin: Synchronized php-1.30.0-wmf.19/maintenance/populateIpChanges.php: https://gerrit.wikimedia.org/r/#/c/379057/1 (duration: 00m 49s)
21:06 ema: cp4009: varnish-be restart, 503 spike
20:04 bblack@neodymium: conftool action : set/pooled=no; selector: cluster=cache_upload,name=cp4013.ulsfo.wmnet
20:03 bblack@neodymium: conftool action : set/pooled=no; selector: cluster=cache_text,name=cp4016.ulsfo.wmnet
19:50 bblack@neodymium: conftool action : set/pooled=no; selector: cluster=cache_upload,name=cp4005.ulsfo.wmnet
19:49 bblack@neodymium: conftool action : set/pooled=no; selector: cluster=cache_text,name=cp4008.ulsfo.wmnet
19:17 ejegg: updated payments-wiki from 26b16ea to 896f16a
19:08 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group0 to wmf.19
18:52 gehel: upgrading nodejs to 6.11 on maps-test2004 for testing - T171707
18:35 urandom: T160570 , T169940 : Upgrade restbase-ng environment to Cassandra 3.11.0-wmf5
18:26 arlolra: Updated Parsoid to 05a0965 (T122965 , T175101 , T173384 , T173061 , T172896 , T174977 , T159894 )
18:17 ladsgroup@tin: Synchronized php-1.30.0-wmf.18/extensions/Wikidata/extensions/Wikibase/repo/tests/phpunit/includes/Content/EntityContentTest.php: Fix undoing merge operations that turned Items into a redirects, part II (T175984 ) (duration: 00m 48s)
18:15 ladsgroup@tin: Synchronized php-1.30.0-wmf.18/extensions/Wikidata/extensions/Wikibase/repo/includes/Content/EntityContent.php: Fix undoing merge operations that turned Items into a redirects, part I (T175984 ) (duration: 00m 49s)
18:08 ladsgroup@tin: Synchronized php-1.30.0-wmf.19/extensions/Wikidata/extensions/Wikibase/repo/tests/phpunit/includes/Content/EntityContentTest.php: Fix undoing merge operations that turned Items into a redirects, part II (T175984 ) (duration: 00m 50s)
18:07 ladsgroup@tin: Synchronized php-1.30.0-wmf.19/extensions/Wikidata/extensions/Wikibase/repo/includes/Content/EntityContent.php: Fix undoing merge operations that turned Items into a redirects, part I (T175984 ) (duration: 00m 50s)
18:00 arlolra@tin: Finished deploy [parsoid/deploy@a01064d]: Updating Parsoid to 05a0965 (duration: 12m 26s)
17:54 ebernhardson: T173464 start cirrussearch reindex of all wikis starting with zh
17:48 arlolra@tin: Started deploy [parsoid/deploy@a01064d]: Updating Parsoid to 05a0965
17:43 jynus: stopping db2010 and cloning to db2078
17:16 demon@tin: Finished scap: bootstrap wmf.19 (duration: 31m 04s)
16:45 demon@tin: Started scap: bootstrap wmf.19
16:44 demon@tin: scap aborted: bootstrap wmf.19 (duration: 01m 52s)
16:42 demon@tin: Started scap: bootstrap wmf.19
16:38 jynus@tin: Synchronized wmf-config/db-codfw.php: decom db1018 (duration: 00m 51s)
16:37 jynus@tin: Synchronized wmf-config/db-eqiad.php: decom db1018 (duration: 01m 01s)
16:34 demon@tin: Pruned MediaWiki: 1.30.0-wmf.17 [keeping static files] (duration: 01m 18s)
16:31 demon@tin: Pruned MediaWiki: 1.30.0-wmf.14 (duration: 03m 15s)
16:29 jynus: disable puppet on db1009
16:27 catrope@tin: Synchronized wmf-config/InitialiseSettings.php: Trying that again, got an error the first time (duration: 00m 52s)
16:26 catrope@tin: Synchronized wmf-config/InitialiseSettings.php: Enable RCFilters by default on cawiki, frwiki and hewiki (T157642 ) (duration: 00m 59s)
16:19 catrope@tin: Synchronized wmf-config/InitialiseSettings.php: Enable structured filters on watchlist on all wikis (duration: 00m 45s)
16:11 ema: pybal 1.14.0 uploaded to apt.w.o
15:54 catrope@tin: Finished scap: core, GuidedTour and WikimediaMessages patches for T176191 , T167262 and T175765 (duration: 20m 32s)
15:33 catrope@tin: Started scap: core, GuidedTour and WikimediaMessages patches for T176191 , T167262 and T175765
15:22 jynus: enabling firewall on db1020 (m2-master)
15:01 ema: restart varnish-be on cp1055, 503 spike
14:55 jynus: deploying firewall to m5 hosts
14:39 jynus: disabling puppet on all misc active mysql hosts
13:35 zeljkof: EU SWAT finished
13:32 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Rename Wikisaurus namespace on Wiktionary to "Thesaurus" (T174264) (duration: 00m 45s)
13:26 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Rename Wikisaurus namespace on Wiktionary to "Thesaurus" (T174264) (duration: 02m 55s)
13:19 gehel: upgrading elasticsearch plugins on relforge, including cold restart of the cluster - T173231
13:17 gehel: upgrading elasticsearch plugins on elasticsearch codfw, including cold restart of the cluster - T173231
13:15 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Rename Wikisaurus namespace on Wiktionary to "Thesaurus" (T174264) (duration: 02m 56s)
13:04 gehel: upgrading elasticsearch plugins on elastic2001 - T173231
12:55 moritzm: rebooting kubestage1001
11:25 moritzm: uploaded debdeploy 0.99.1 to apt.wikimedia.org (for trusty, jessie and stretch)
11:23 kartik@tin: Finished deploy [cxserver/deploy@03c1eb1]: Update cxserver to 569b7b7 (duration: 02m 42s)
11:20 kartik@tin: Started deploy [cxserver/deploy@03c1eb1]: Update cxserver to 569b7b7
10:53 volans: restarted nagios-nrpe-server on stat1005
10:14 jynus: shutting down mysql on db1018
09:42 moritzm: installing libxml2 security updates on trusty (Debian already fixed)
09:40 elukey: powercycle analytics1062 - no ssh, console com2 frozen
09:34 mobrovac@tin: Synchronized wmf-config/InitialiseSettings.php: Enable the updateBetaFeaturesUserCounts logging channel - T175637 (duration: 00m 50s)
09:30 ema: cp1062 - varnish-backend-restart, mbox lag
08:40 mobrovac@tin: Finished deploy [cxserver/deploy@03c1eb1]: (no justification provided) (duration: 00m 52s)
08:39 mobrovac@tin: Started deploy [cxserver/deploy@03c1eb1]: (no justification provided)
08:09 mobrovac@tin: Finished deploy [cxserver/deploy@03c1eb1]: (no justification provided) (duration: 00m 13s)
08:09 mobrovac@tin: Started deploy [cxserver/deploy@03c1eb1]: (no justification provided)
07:58 kartik@tin: Finished deploy [cxserver/deploy@03c1eb1]: Update cxserver to 569b7b7 (duration: 00m 37s)
07:58 kartik@tin: Started deploy [cxserver/deploy@03c1eb1]: Update cxserver to 569b7b7
07:50 moritzm: installing bind9 regression update on trusty servers
07:35 kartik@tin: (no justification provided)
07:35 kartik@tin: Finished deploy [cxserver/deploy@03c1eb1]: Update cxserver to 569b7b7 (duration: 01m 04s)
07:34 kartik@tin: Started deploy [cxserver/deploy@03c1eb1]: Update cxserver to 569b7b7
02:28 l10nupdate@tin: ResourceLoader cache refresh completed at Tue Sep 19 02:28:56 UTC 2017 (duration 7m 6s)
02:21 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.18) (duration: 07m 04s)
01:52 demon@tin: Synchronized scap/plugins/updatewikiversions.py: Minor newline nitpick. I promise that's it (duration: 00m 46s)
01:41 demon@tin: Synchronized scap/plugins/updatewikiversions.py: Minor spacing nitpick, that's it for today (duration: 00m 45s)
01:17 demon@tin: Synchronized scap/plugins/updatewikiversions.py: add new hotness updatewikiversions.py (duration: 00m 45s)
01:15 demon@tin: Synchronized multiversion/: rm old and busted updateWikiversions (duration: 01m 07s)
00:12 foks: Remove 2FA from users Alan and Magicknight94 (retroactive log)
00:10 eileen: update civicrm from 187238f to b6f7dc3
2017-09-18
23:28 tstarling@tin: Synchronized php-1.30.0-wmf.18/includes/filerepo/file/LocalFile.php: (no justification provided) (duration: 00m 46s)
23:13 thcipriani@tin: Synchronized php-1.30.0-wmf.18/extensions/ArticleCreationWorkflow: SWAT: Only intercept users who can potentially create articles otherwise T176100 (duration: 00m 46s)
22:32 demon@tin: Synchronized wmf-config/CommonSettings.php: Use PrivateSettings.php directly (duration: 00m 46s)
21:57 bawolff: deployed patch T175900
21:18 krinkle@tin: Synchronized php-1.30.0-wmf.18/extensions/NavigationTiming/modules/ext.navigationTiming.js: Unbreak - T176105 (duration: 00m 48s)
21:16 mutante: repeating import of gerrit_2.13.8+git1-wmf.7 on install1002, followed by reprepro copy stretch-wikimedia jessie-wikimedia gerrit to make it available on stretch
21:06 mutante: uploaded gerrit 2.13.8+git1-wmf.7 for jessie-wikimedia on APT.wm.org
20:42 krinkle@tin: Finished deploy [jobrunner/jobrunner@57f5f47]: No-op sync - first time scap3 - T129148 (duration: 03m 23s)
20:38 krinkle@tin: Started deploy [jobrunner/jobrunner@57f5f47]: No-op sync - first time scap3 - T129148
20:17 arlolra@tin: (no justification provided)
20:15 arlolra@tin: Finished deploy [parsoid/deploy@a01064d]: Updating Parsoid to 05a0965 (duration: 01m 53s)
20:13 arlolra@tin: Started deploy [parsoid/deploy@a01064d]: Updating Parsoid to 05a0965
19:29 XioNoX: restarting pfw3-codfw for software upgrade
19:10 urandom: T160570 : Upgrading restbase-dev100[5-6].eqiad.wmnet to Cassandra 3.11.0-wmf5
19:07 smalyshev@tin: Finished deploy [wdqs/wdqs@2523836]: Deploy correct prefixes conf (duration: 01m 20s)
19:05 smalyshev@tin: Started deploy [wdqs/wdqs@2523836]: Deploy correct prefixes conf
19:03 smalyshev@tin: Finished deploy [wdqs/wdqs@ecdbd0d]: Deploying Blazegraph fixes and category scripts for WDQS, take 4 (duration: 02m 36s)
19:02 no_justification: gerrit: restarting service, back in a moment
19:00 smalyshev@tin: Started deploy [wdqs/wdqs@ecdbd0d]: Deploying Blazegraph fixes and category scripts for WDQS, take 4
18:56 urandom: T160570 : Upgrading restbase-dev1004.eqiad.wmnet to Cassandra 3.11.0-wmf5 (canary)
18:46 smalyshev@tin: Finished deploy [wdqs/wdqs@ecdbd0d]: Deploying Blazegraph fixes and category scripts for WDQS, take 3 (duration: 02m 25s)
18:44 smalyshev@tin: Started deploy [wdqs/wdqs@ecdbd0d]: Deploying Blazegraph fixes and category scripts for WDQS, take 3
18:37 smalyshev@tin: (no justification provided)
18:36 smalyshev@tin: Finished deploy [wdqs/wdqs@ecdbd0d]: Deploying Blazegraph fixes and category scripts for WDQS (duration: 02m 26s)
18:34 smalyshev@tin: Started deploy [wdqs/wdqs@ecdbd0d]: Deploying Blazegraph fixes and category scripts for WDQS
18:20 maxsem@tin: Synchronized wmf-config/: https://gerrit.wikimedia.org/r/#/c/378357/ https://gerrit.wikimedia.org/r/#/c/376791/ (duration: 00m 48s)
17:42 smalyshev@tin: Finished deploy [wdqs/wdqs@ecdbd0d]: Deploying Blazegraph fixes and category scripts for WDQS (duration: 00m 21s)
17:42 smalyshev@tin: Started deploy [wdqs/wdqs@ecdbd0d]: Deploying Blazegraph fixes and category scripts for WDQS
17:26 demon@tin: Synchronized wmf-config/ProductionServices.php: no-op/comment fix (duration: 00m 45s)
16:46 jynus: shuting down db1100 T175973
16:40 reedy@tin: Synchronized php-1.30.0-wmf.18/maintenance/populateIpChanges.php: perf improvements T175962 (duration: 00m 46s)
16:05 bblack: cp1063 - backend restart, mailbox lag
15:38 bblack: cp1072 - varnish backend restart, mailbox lag
15:28 bblack: cp1074 - backend restart for mailbox lag
15:25 bblack: cp1050 - backend restart for mailbox lahg
14:27 reedy@tin: Synchronized php-1.30.0-wmf.18/includes/filerepo/file/LocalFile.php: T175444 (duration: 00m 47s)
13:29 ppchelko@tin: Finished deploy [cpjobqueue/deploy@2c422f5]: Annotate the job event with pipeline property (duration: 00m 29s)
13:28 ppchelko@tin: Started deploy [cpjobqueue/deploy@2c422f5]: Annotate the job event with pipeline property
13:25 hashar@tin: Synchronized wmf-config/InitialiseSettings.php: Meta(Talk)Namespace configuration for be.wiktionary - T175950 (duration: 00m 46s)
13:18 hashar@tin: Synchronized wmf-config/InitialiseSettings.php: pagePreviews: Stop A/B test on enwiki and dewiki - T176068 (duration: 00m 45s)
13:10 hashar@tin: Synchronized php-1.30.0-wmf.18/extensions/BetaFeatures: Temporary log all the executions of the update job. - T175637 (duration: 00m 46s)
13:09 hashar@tin: Synchronized wmf-config: New 'abusefilter-helper' configuration for en.wikipedia - T175684 (duration: 00m 49s)
12:59 moritzm: installing libxen updates
12:36 moritzm: installing tomcat8 security updates
10:40 ema: cp1099 - backend restart, mailbox lag
06:42 moritzm: installing freexl security updates
06:18 TimStarling: testing EtcdConfig in beta cluster
05:40 tstarling@tin: Synchronized wmf-config: just for consistency, should have no effect (gerrit 375108) (duration: 00m 49s)
02:36 l10nupdate@tin: ResourceLoader cache refresh completed at Mon Sep 18 02:36:50 UTC 2017 (duration 7m 6s)
02:29 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.18) (duration: 08m 04s)
2017-09-17
12:49 _joe_: taking a full debug dump of mw1288 after depooling it
12:40 _joe_: restarting hhvm on some api appservers, to ease the cpu overlock
09:46 elukey: restart varnish backend on cp1052 - recurrent mailbox lag
08:11 elukey: restart varnish backend on cp1053 - recurrent mailbox lag
2017-09-16
22:05 godog: compress older otrs directories to reclaim inodes - T171490
16:53 elukey: restart varnish-backend on cp1073 (cache upload) for mailbox lag
15:40 andrewbogott: rebooting labvirt1017
15:34 andrewbogott: rebooting labvirt1015 for T176044
2017-09-15
21:50 mutante: contint2001 - systemctl start zuul
21:47 mutante: contint1001 - tmp disable puppet - contint2001 - test zuul unit file (gerrit 359016)
21:43 demon@tin: Finished deploy [gerrit/gerrit@48ad332]: 2.13.9 -- not being used yet though (duration: 00m 08s)
21:43 demon@tin: Started deploy [gerrit/gerrit@48ad332]: 2.13.9 -- not being used yet though
15:46 bblack: cp1049 - backend restart, mailbox lag
15:40 bblack: cp1099 - backend restart, mailbox lag
15:38 bblack: cp1072 - backend restart, mailbox lag
15:21 _joe_: uploaded td-agent-bit (fluentd-bit) to wikimedia-stretch T175527
13:13 moritzm: pruned hhvm bytecode cache on mw2256, led to log spam for an undefined variable after being out of service for hardware maintenance
13:10 jmm@puppetmaster1001: conftool action : set/pooled=yes; selector: mw2256.codfw.wmnet
13:10 addshore@tin: Synchronized wmf-config/CommonSettings-labs.php: beta: $wgWikiDiff2MovedParagraphDetectionCutoff = 25 BETA ONLY 2/2 (duration: 00m 45s)
13:08 addshore@tin: Synchronized wmf-config/InitialiseSettings-labs.php: beta: $wgWikiDiff2MovedParagraphDetectionCutoff = 25 BETA ONLY 1/2 (duration: 00m 47s)
11:22 jmm@puppetmaster1001: conftool action : set/pooled=no; selector: mw2256.codfw.wmnet
09:10 moritzm: installing tcpdump security updates on trusty hosts (Debian systems already fixed)
08:54 moritzm: installing libbluetooth updates
08:03 jynus: dropping mariadb analytics users on db1031, db1029
07:49 Amir1: running maintenance/updateCollation.php --force on Persian (fa) wikis (T173601 )
07:39 gehel: shutting down and masking elasticsearch on elastic1020 - T175951
07:35 gehel: depooling elastic1020 - T175951
07:28 jmm@puppetmaster1001: conftool action : set/pooled=yes; selector: mw2256.codfw.wmnet
00:46 jynus@tin: Synchronized wmf-config/db-eqiad.php: Depool db1100 (duration: 00m 46s)
2017-09-14
23:51 mutante: phabricator - tested gerrit 354247 before merging using apache-fast-test from tin -restarted apache
23:37 catrope@tin: Synchronized wmf-config/: VRS config for Electron (T175868 ) (duration: 00m 47s)
23:34 catrope@tin: Synchronized php-1.30.0-wmf.18/extensions/VisualEditor/lib/ve/src/ui/styles/ve.ui.TableLineContext.css: T169389 (duration: 00m 45s)
23:33 catrope@tin: Synchronized php-1.30.0-wmf.18/extensions/ContentTranslation/modules/widgets/translator/ext.cx.translator.js: Fix JS error in CX (duration: 00m 46s)
23:29 catrope@tin: Synchronized wmf-config/CommonSettings.php: Give sysops the flow-create-board right on Flow wikis (T175934 ) (duration: 00m 45s)
23:27 catrope@tin: Synchronized wmf-config/InitialiseSettings.php: Give sysops the flow-create-board right on Flow wikis (T175934 ) (duration: 00m 45s)
23:22 catrope@tin: Synchronized wmf-config/InitialiseSettings.php: Use RemexHtml on mediawikiwiki and testwiki (T175095 ) (duration: 00m 46s)
23:19 dzahn@neodymium: conftool action : set/pooled=yes; selector: name=acamar.wikimedia.org
23:18 mutante: acamar is back up and running - no failed units
23:13 catrope@tin: Synchronized wmf-config/InitialiseSettings.php: Allow bureaucrats to remove accountcreator on mediawikiwiki (T175903 ) (duration: 00m 46s)
23:12 mutante: acamar - staged BIOS upgrade in progress
23:09 mutante: acamar - done with upgrade - rebooting - it's depooled and removed from resolv.conf - T162850
22:56 mutante: depooled acamar (codfw dns recursor) for BIOS upgrade - installing firmware
22:56 dzahn@neodymium: conftool action : set/pooled=no; selector: name=acamar.*
22:55 maxsem@tin: Synchronized wmf-config/: Labs only: https://gerrit.wikimedia.org/r/378184 https://gerrit.wikimedia.org/r/378187 (duration: 00m 47s)
22:15 maxsem@tin: Synchronized wmf-config/InitialiseSettings.php: ACTRIAL going live: https://gerrit.wikimedia.org/r/#/c/378104/ (duration: 00m 46s)
21:22 maxsem@tin: Synchronized php-1.30.0-wmf.18/extensions/ArticleCreationWorkflow/: https://gerrit.wikimedia.org/r/#/c/378125/ (duration: 00m 47s)
20:34 demon@tin: Synchronized php-1.30.0-wmf.18/skins/MinervaNeue/resources/skins.minerva.userpage.icons/userpage.svg: I95d76339 (duration: 00m 45s)
20:18 demon@tin: Synchronized php-1.30.0-wmf.18/includes/api/ApiFeedWatchlist.php: Ibea5bd88 (duration: 00m 46s)
20:04 demon@tin: Synchronized php-1.30.0-wmf.18/extensions/OAuth/backend/MWOAuthDAO.php: I785230df (duration: 00m 46s)
20:02 ppchelko@tin: Finished deploy [cpjobqueue/deploy@af9b590]: Emit broker statistics more frequentl (duration: 00m 30s)
20:02 ppchelko@tin: Started deploy [cpjobqueue/deploy@af9b590]: Emit broker statistics more frequentl
19:58 dcausse: banning elastic1020 to see if T175951 is caused by mixed versions of the ltr plugin
19:36 demon@tin: Synchronized php-1.30.0-wmf.18/skins/MinervaNeue/resources/skins.minerva.icons.images.scripts/userNormal.svg: fix svg syntax (duration: 00m 46s)
19:31 demon@tin: Synchronized php-1.30.0-wmf.17/skins/MinervaNeue/resources/skins.minerva.icons.images.scripts/userNormal.svg: fix svg syntax (duration: 00m 45s)
19:07 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group2 to wmf.18
18:58 tgr@tin: Synchronized wmf-config/PrivateSettings.php: set Electron secret for T175868 (duration: 00m 48s)
18:57 tgr@tin: Synchronized private/PrivateSettings.php: set Electron secret for T175868 (duration: 00m 49s)
18:29 twentyafterfour: Phabricator: Deploying hotfix for T175942
18:10 thcipriani@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Configure enwiki to use CirrusSearch MLR by default T175772 (duration: 00m 50s)
18:03 Jamesofur: disabled oauth validation on metawiki/SUL for Teles after verification
17:57 bblack: cp2001 backend restart, mailbox lag
17:56 bblack: cp1074 backend restart, mailbox lag
17:18 eevans@tin: Started restart [electron-render/deploy@8dd5f13]: (no justification provided)
17:16 gehel@tin: Finished deploy [wikimedia/discovery/analytics@ab5d5c1]: moving discovery/analytics to scap3 (duration: 00m 04s)
17:16 gehel@tin: Started deploy [wikimedia/discovery/analytics@ab5d5c1]: moving discovery/analytics to scap3
16:59 krinkle@tin: Synchronized php-1.30.0-wmf.17/extensions/NavigationTiming: T104902 (duration: 01m 00s)
15:51 bblack: repool cp1063 (after wiping varnish storage + restarting all 3x main service daemons)
15:51 bblack: cp1064 backend restart, mailbox lag
15:46 bblack: depool cp1063 (upload eqiad) - seems to be having some iowait issues?
14:07 dereckson@tin: Synchronized php-1.30.0-wmf.18/extensions/CheckUser/specials/SpecialCheckUser.php: Fix Special cu for ip ranges (temp) (T175898 ) (duration: 00m 49s)
13:34 dcausse@tin: Synchronized wmf-config/CirrusSearch-production.php: Setup Cirrus MLR models for top 20 language AB test (duration: 00m 49s)
13:33 dcausse@tin: Synchronized wmf-config/CirrusSearch-common.php: Setup Cirrus MLR models for top 20 language AB test (duration: 00m 48s)
13:31 dcausse@tin: Synchronized wmf-config/InitialiseSettings.php: Setup Cirrus MLR models for top 20 language AB test (duration: 00m 50s)
11:14 jynus@tin: Synchronized wmf-config/db-eqiad.php: Remove all references to db1049, pool db1100 (duration: 00m 49s)
11:12 akosiaris: upload apertium-crh-tur_0.2.0~r81866-1+wmf1 to apt.wikimedia.org/jessie-wikimedia/main
11:12 jynus@tin: Synchronized wmf-config/db-codfw.php: Remove all references to db1049 (duration: 00m 50s)
09:51 akosiaris@puppetmaster1001: conftool action : set/pooled=no; selector: name=wtp1025.eqiad.wmnet
09:43 akosiaris@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wtp1025.eqiad.wmnet
09:43 akosiaris@puppetmaster1001: conftool action : set/weight=1; selector: name=wtp1025.eqiad.wmnet
09:41 gehel: re-enabling puppet on all cassandra nodes - T171704
09:41 akosiaris@tin: Finished deploy [parsoid/deploy@cec7d17]: test deploy using dsh groups. T165520 (duration: 09m 30s)
09:32 akosiaris@tin: Started deploy [parsoid/deploy@cec7d17]: test deploy using dsh groups. T165520
09:02 akosiaris@tin: Finished deploy [parsoid/deploy@cec7d17]: test deploy using dsh groups. T165520 (duration: 02m 25s)
09:00 akosiaris@tin: Started deploy [parsoid/deploy@cec7d17]: test deploy using dsh groups. T165520
08:13 gehel: merging cassandra refactoring for puppet - T171704
02:55 l10nupdate@tin: ResourceLoader cache refresh completed at Thu Sep 14 02:55:10 UTC 2017 (duration 7m 28s)
02:47 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.18) (duration: 07m 58s)
02:27 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.17) (duration: 09m 27s)
00:46 MaxSem: restarted populateIpChanges to use updated code
00:26 twentyafterfour: everything appears to be up and running. Phabricator upgrade complete.
00:23 twentyafterfour: running puppet on phab1001 to restore files which get lost by scap deploy
00:21 twentyafterfour@tin: Finished deploy [phabricator/deployment@7e3a0a8]: (no justification provided) (duration: 00m 01s)
00:21 twentyafterfour@tin: Started deploy [phabricator/deployment@7e3a0a8]: (no justification provided)
00:21 twentyafterfour@tin: Finished deploy [phabricator/deployment@7e3a0a8]: (no justification provided) (duration: 00m 25s)
00:20 twentyafterfour@tin: Started deploy [phabricator/deployment@7e3a0a8]: (no justification provided)
00:18 twentyafterfour: Begin phabricator upgrade #phab-2017-09-13
2017-09-13
23:48 dereckson@tin: Synchronized php-1.30.0-wmf.18/maintenance/populateIpChanges.php: Improve ip_changes update script to insert rows in batches (Gerrit:377918 ) (duration: 00m 49s)
23:29 dereckson@tin: Synchronized php-1.30.0-wmf.18/extensions/Wikidata/extensions/Wikibase/client: Split page set before constructing InjectRCRecordsJob (T175316 ) (duration: 00m 57s)
23:28 dereckson@tin: Synchronized php-1.30.0-wmf.18/extensions/Echo/modules/api/mw.echo.api.APIHandler.js: (no justification provided) (duration: 00m 48s)
23:26 dereckson@tin: Synchronized wmf-config/CommonSettings.php: Set $wgScoreSafeMode to false (T174413 ) (duration: 00m 50s)
22:06 maxsem@tin: Synchronized php-1.30.0-wmf.18/extensions/EventBus/: https://gerrit.wikimedia.org/r/#/c/377900/1 (duration: 00m 49s)
22:03 maxsem@tin: Synchronized php-1.30.0-wmf.18/extensions/ArticleCreationWorkflow/: https://gerrit.wikimedia.org/r/#/c/377900/1 (duration: 00m 49s)
21:55 maxsem@tin: Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/377874/ (duration: 00m 47s)
21:23 ejegg: updated payments-wiki from ed2e481 to 26b16ea
grrrr: populating ip_changes on group1 wikis
20:43 bblack: re-routed ulsfo->eqiad around possible codfw damage as well - https://gerrit.wikimedia.org/r/#/c/377871/
20:34 bblack: deployed https://gerrit.wikimedia.org/r/#/c/377845/ (no gerrit bot, jenkins lint failing, etc) due to possible codfw routing issues
19:39 andrewbogott: rebooting labtestvirt2001
19:35 ppchelko@tin: Finished deploy [changeprop/deploy@86e8103]: Separate kafka metrics by rule (duration: 01m 18s)
19:34 ppchelko@tin: Started deploy [changeprop/deploy@86e8103]: Separate kafka metrics by rule
19:21 urandom: T172384 : Upgrading restbase-dev100[5-6] to Cassandra 3.11.0-wmf4
19:19 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group1 to wmf.18
19:13 demon@tin: Synchronized php: symlink thingie (duration: 00m 49s)
19:11 urandom: T172384 : Upgrading restbase-dev1004 to Cassandra 3.11.0-wmf4 (canary)
18:34 niharika29@tin: Synchronized static/images/project-logos/: Change logo for huwiktionary T175483 (duration: 00m 49s)
18:27 mutante: gerrit2002 - re-enabled puppet - had forgotten to enable it again after recently tryning to test the gerrit->logstash change
18:26 niharika29@tin: Synchronized php-1.30.0-wmf.18/extensions/VisualEditor/: Make ve.dm.Change part of core module T175828 (duration: 00m 50s)
18:25 niharika29@tin: Synchronized docroot/mediawiki/ontology/: Add setup for https://www.mediawiki.org/ontology T171807 (duration: 00m 49s)
18:12 niharika29@tin: Synchronized wmf-config/InitialiseSettings.php: Enable Timeless on frwiki T154371 (duration: 00m 50s)
17:55 gwicke: rolling restart of pdfrender service in equiad after hang T174916 T172815
16:42 gehel@puppetmaster1001: conftool action : set/pooled=inactive; selector: name=wdqs1001.eqiad.wmnet
16:42 gehel@puppetmaster1001: conftool action : set/pooled=inactive; selector: name=wdqs1002.eqiad.wmnet
16:28 gehel: starting decommissioning of wdqs100[12] - T175595
16:25 gehel@puppetmaster1001: conftool action : set/pooled=no; selector: name=wdqs1002.eqiad.wmnet
16:25 gehel@puppetmaster1001: conftool action : set/pooled=no; selector: name=wdqs1001.eqiad.wmnet
16:22 andrewbogott: rebooting labtestvirt2002
16:08 ottomata: restarting druid-brokers with increase in query cache size
15:23 demon@tin: Synchronized php-1.30.0-wmf.18/includes/filerepo/file/LocalFile.php: I2ecb6030 (duration: 00m 49s)
15:22 demon@tin: Synchronized php-1.30.0-wmf.18/includes/diff/DifferenceEngine.php: I2ecb6030 (duration: 00m 49s)
15:21 demon@tin: Synchronized php-1.30.0-wmf.17/includes/filerepo/file/LocalFile.php: I2ecb6030 (duration: 00m 49s)
15:20 demon@tin: Synchronized php-1.30.0-wmf.17/includes/diff/DifferenceEngine.php: I2ecb6030 (duration: 00m 49s)
15:09 demon@tin: Synchronized php-1.30.0-wmf.17/extensions/AbuseFilter/: backporting I13051038 (duration: 00m 52s)
14:31 gehel@puppetmaster1001: conftool action : set/pooled=yes; selector: dc=eqiad,cluster=logstash
14:31 gehel: pooling new logstash servers (logstash100[7-9]) - T175045
14:23 mobrovac@tin: Finished deploy [cpjobqueue/deploy@60d0a78]: Start using the EventBus infrastructure for the updateBetaFeaturesUserCounts job - T175210 (duration: 00m 33s)
14:22 mobrovac@tin: Started deploy [cpjobqueue/deploy@60d0a78]: Start using the EventBus infrastructure for the updateBetaFeaturesUserCounts job - T175210
13:45 akosiaris: upload apertium-crh_0.1.0~r81872-1+wmf1 to apt.wikimedia.org/jessie-wikimedia/main
13:30 urandom: T169936 : Converting RESTBase dev environment to size-tiered compaction
13:29 elukey: update puppet compiler's facts via ./modules/puppet_compiler/files/compiler-update-facts
13:23 gehel: initial installation of logstash100[7-9] - T175045
13:21 zeljkof: EU SWAT finished
13:13 zfilipin@tin: Synchronized wmf-config/throttle.php: SWAT: Temporary lift account creation limits for WM United Kingdom workshop (T175700) (duration: 00m 49s)
12:14 moritzm: rebooting mc1015 (spare) for some tests
12:07 jynus: stopping wikidata rebuild maintenance script on terbium
12:07 bblack: cp1049 - backend restart, mailbox lag
11:57 jynus@tin: Synchronized wmf-config/db-eqiad.php: Set db1097 as the main api server for s4, Remove regular load from db1070 (duration: 00m 56s)
11:39 jynus: disabling semi-sync replication on the largest s5 database servers
11:03 ariel@tin: Finished deploy [dumps/dumps@3f119ab]: monitor will cd to dumps repo before each run (duration: 00m 02s)
11:03 ariel@tin: Started deploy [dumps/dumps@3f119ab]: monitor will cd to dumps repo before each run
11:02 joal@tin: Finished deploy [analytics/refinery@b2e8852]: Regular analytics weekly deploy (new jars, oozie patches) (duration: 03m 12s)
10:58 joal@tin: Started deploy [analytics/refinery@b2e8852]: Regular analytics weekly deploy (new jars, oozie patches)
10:52 volans: reboot lvs3001
10:06 akosiaris: upload apertium-cat-srd_0.9.0~r82238-1+wmf1 to apt.wikimedia.org/jessie-wikimedia/main
10:06 akosiaris: upload apertium-srd-ita_0.9.5~r82237-1+wmf1 to apt.wikimedia.org/jessie-wikimedia/main
09:36 moritzm: installing bind updates from Debian stable/oldstable SUA update (client tools only)
09:25 akosiaris: upload apertium-srd_0.10.0~r82237-1+wmf1_amd64.changes to apt.wikimedia.org/jessie-wikimedia/main T174988
09:24 akosiaris: upload apertium-tur_0.1.0~r81882-1+wmf1 to apt.wikimedia.org/jessie-wikimedia/main
09:23 akosiaris: upload apertium-ita_0.10.0~r82237-1+wmf1 to apt.wikimedia.org/jessie-wikimedia/main
09:23 akosiaris: upload apertium-cat_2.3.0~r82237-1+wmf1 to apt.wikimedia.org/jessie-wikimedia/main
09:05 akosiaris: upload apertium_3.4.2~r68466-2+wmf2 to apt.wikimedia.org/jessie-wikimedia
08:55 elukey: restart varnish backend on cp1072 (upload) - mailbox expiry lag
08:24 moritzm: installing emacs security updates
07:56 akosiaris: bounce varnish on cp1062, mailbox lag
07:47 moritzm: rolling out hhvm-luasandbox 2.0.14 to the remaining hosts in eqiad (along with HHVM restarts) (T173705 )
07:08 moritzm: installing tcpdump security updates (fixing 90 CVE IDs...)
05:18 kartik@tin: Finished deploy [cxserver/deploy@ee0081d]: Update cxserver to 3baaa36 (duration: 02m 56s)
05:15 kartik@tin: Started deploy [cxserver/deploy@ee0081d]: Update cxserver to 3baaa36
03:12 l10nupdate@tin: ResourceLoader cache refresh completed at Wed Sep 13 03:12:46 UTC 2017 (duration 7m 30s)
03:05 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.18) (duration: 16m 28s)
03:03 jynus: dropping old users from phabricator database
03:00 eileen: update civicrm from ee7dda3 to 187238f
02:41 demon@tin: Synchronized wmf-config/mobile.php: rm unused config var (duration: 00m 46s)
02:27 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.17) (duration: 08m 31s)
01:19 jynus: stopping mariadb @ db1048 to clone it to db1059
00:11 thcipriani@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable Special:PageLanguage on mul.wikisource Fix correct wiki name for wgPageLanguageUseDB on www.wikisource.org T175622 (duration: 00m 49s)
2017-09-12
23:53 thcipriani@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Revert "Do not deploy Timeless on fr.wiktionary for now" (duration: 00m 49s)
23:35 thcipriani@tin: Synchronized wmf-config: SWAT: Enable Flow on Newsletter_talk on mediawiki.org T175502 (duration: 00m 52s)
22:17 chasemp: labvirt1018:~# /sbin/reboot
22:00 bsitzmann@tin: Finished deploy [mobileapps/deploy@b11b75c]: Update mobileapps to 297b048 (T174707 T175305 ) (duration: 04m 28s)
21:55 bsitzmann@tin: Started deploy [mobileapps/deploy@b11b75c]: Update mobileapps to 297b048 (T174707 T175305 )
21:49 maxsem@tin: Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/377527/2 T175725 (duration: 00m 49s)
21:37 MaxSem: Populating ip_changes in group0 wikis
21:33 ejegg: changed contact de-duplication duty cycle to 25 minutes
21:05 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group0 to wmf.18
20:33 demon@tin: Finished scap: bootstrap wmf.18 (for real this time) (duration: 26m 23s)
20:06 demon@tin: Started scap: bootstrap wmf.18 (for real this time)
19:53 demon@tin: Finished scap: bootstrap wmf.18 (duration: 08m 14s)
19:45 demon@tin: Started scap: bootstrap wmf.18
19:05 papaul: maintenance complete on mw2256
17:51 bblack: cp3034 - reset fetch_maxchunksize to 0.25G (default) for fe+be
17:40 cwd: turned on dedupe
16:55 niharika29@tin: Finished deploy [iegreview/iegreview@69c4c3f]: Deploying iegreview with scap3 T129154 (duration: 00m 01s)
16:55 niharika29@tin: Started deploy [iegreview/iegreview@69c4c3f]: Deploying iegreview with scap3 T129154
16:40 ejegg: updated CiviCRM from 63288ae to ee7dda3
16:35 niharika29@tin: Finished deploy [scholarships/scholarships@004635d]: Deploying scholarships with scap3 T129134 (duration: 00m 03s)
16:35 niharika29@tin: Started deploy [scholarships/scholarships@004635d]: Deploying scholarships with scap3 T129134
16:26 volans@sarin: conftool action : set/pooled=yes; selector: name=mw2100.codfw.wmnet
16:18 ejegg: updated SmashPig from dce4f0a to dc62203910
15:43 _joe_: restarted ircecho on einsteinium
15:35 gehel: restart elasticsearch on relforge (last check of new plugin deployment) - T158560
15:22 bblack: manually setting fetch_maxchunksize=16MB via varnishadm on cp3034 fe+be instances
15:06 chasemp: disable puppet for lab* things for designate refactor rolleout (delayed a few minutes for a meeting)
15:04 moritzm: uploaded php5.5 5.5.38 to component/ci of jessie-wikimedia (for use in CI)
14:53 papaul: shutting down mw2256 for maintenance
14:43 elukey: add kafka-jumbo IPs to the kafka term of the analytics-in4 filter on cr1/cr2 eqiad
14:42 ema: cp1074 - backend restart, mailbox lag
14:39 bblack: manually setting fetch_maxchunksize=64MB via varnishadm on cp3034 fe+be instances
14:19 jynus@tin: Synchronized wmf-config/db-eqiad.php: Remove all references to db1059 (duration: 00m 44s)
14:18 jynus@tin: Synchronized wmf-config/db-codfw.php: Remove all references to db1059 (duration: 00m 45s)
14:10 gehel: restarting elasticsearch on elastic1020 to validate new plugin deployment - T158560
14:07 gehel: switching elasticsearch plugin deployment to .deb instead of scap on cirrus/eqiad - T158560
14:06 volans: testing reimage of mw2100 - T166300
14:06 gehel: restarting elasticsearch on elastic2001 to validate new plugin deployment - T158560
14:04 jynus@tin: Synchronized wmf-config/db-eqiad.php: Depool db1059, pool db1097 with low load (duration: 00m 45s)
14:02 gehel: switching elasticsearch plugin deployment to .deb instead of scap on cirrus/codfw - T158560
13:37 gehel@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wdqs1005.eqiad.wmnet
13:37 gehel@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wdqs1004.eqiad.wmnet
13:37 gehel: pooling wdqs100[45] - T171210
13:31 hashar@tin: Synchronized php-1.30.0-wmf.17/extensions/CentralAuth/includes/api/ApiSetGlobalAccountStatus.php: API: Unbreak setglobalaccountstatus locked=lock - T175462 (duration: 00m 44s)
13:26 hashar@tin: Synchronized wmf-config/InitialiseSettings.php: Enable WikidataPageBanner for Russian Wikimedia chapter wiki - T175356 (duration: 00m 45s)
13:25 hashar@tin: Synchronized php-1.30.0-wmf.17/includes/changes/ChangesListBooleanFilter.php: WLFilters: Respect default values - T174725 (duration: 00m 45s)
13:21 hashar@tin: Synchronized wmf-config/CommonSettings.php: Add Extension:Newsletter permissions to CommonSettings (duration: 00m 45s)
13:18 hashar@tin: Synchronized wmf-config/throttle.php: Lift account creation restrictions for WM Taiwan 10th anniversary - T175534 (duration: 00m 45s)
13:15 hashar@tin: Synchronized php-1.30.0-wmf.17/extensions/Wikidata/extensions/Wikibase/view/resources/jquery/wikibase/themes/default/jquery.wikibase.entityselector.css: Remove red highlighting on all entity selectors - T175525 (duration: 00m 45s)
13:12 hashar@tin: Synchronized wmf-config/Wikibase-production.php: Reduce wikiPageUpdaterDbBatchSize to 20 - T173710 (duration: 00m 45s)
09:33 moritzm: upgrading deployment servers to HHVM-Luasandbox 2.0.14
09:32 moritzm: upgrading mw2* to HHVM-Luasandbox 2.0.14
09:25 moritzm: upgrading mw1293-mw1295 (image scalers) to HHVM-Luasandbox 2.0.14 (along with HHVM restarts)
09:24 moritzm: upgrading mw1293-mw1205 (image scalers) to HHVM-Luasandbox 2.0.14 (along with HHVM restarts)
08:38 gehel: deploying elasticsearch plugins on relforge - T158560 (wrong ticket number before)
08:34 gehel: deploying elasticsearch plugins on relforge - T175159
08:28 moritzm: upgrading mw1189-mw1208 (API servers) to HHVM-Luasandbox 2.0.14 (along with HHVM restarts)
07:35 jynus@tin: Synchronized wmf-config/db-eqiad.php: Repool es1019 with full weight (duration: 00m 45s)
07:15 moritzm: upgrading mw1180-mw1188, mw1209-1220 (app servers) to HHVM-Luasandbox 2.0.14 (along with HHVM restarts)
02:29 l10nupdate@tin: ResourceLoader cache refresh completed at Tue Sep 12 02:29:20 UTC 2017 (duration 6m 45s)
02:22 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.17) (duration: 07m 31s)
02:10 bblack: cp1050 - backend restart, mailbox lag
02:08 bblack: cp1099 - backend restart, mailbox lagh
00:48 awight@tin: Finished deploy [ores/deploy@42c5663]: Cause ORES service restart (duration: 00m 18s)
00:48 awight@tin: Started deploy [ores/deploy@42c5663]: Cause ORES service restart
2017-09-11
23:06 mutante: jenkins-bot says: java.lang.OutofMemoryError: PermGen space
22:30 bblack@neodymium: conftool action : set/pooled=yes; selector: name=cp4026.*
22:30 dereckson@tin: Synchronized wmf-config/CommonSettings-labs.php: Fix MediaWiki\Auth\AbstractPreAuthenticationProvider case (Gerrit:377281 , no-op in prod) (duration: 00m 45s)
21:45 awight@tin: Finished deploy [ores/deploy@42c5663]: Try ORES filehandle fix on new cluster (duration: 13m 47s)
21:41 bblack@neodymium: conftool action : set/pooled=yes; selector: name=cp4025.*
21:31 awight@tin: Started deploy [ores/deploy@42c5663]: Try ORES filehandle fix on new cluster
21:03 bblack@neodymium: conftool action : set/pooled=yes; selector: name=cp4023.*
20:13 demon@tin: Pruned MediaWiki: 1.30.0-wmf.13 (duration: 02m 49s)
19:11 chasemp: adjust nodepool rate to 10
19:08 jynus@tin: Synchronized wmf-config/db-eqiad.php: Repool es1019 with low load (duration: 00m 46s)
19:01 mutante: restbase-dev1004:/srv# mv /home/eevans/cassandra-1502141526-pid13410.hprof /srv/eevans/ to free disk space
18:52 mutante: restbase - java[12418]: Error: Could not find or load main class org.wikimedia.cassandra.metrics.service.Service
18:40 niharika29@tin: Synchronized wmf-config/InitialiseSettings.php: Enable responsive reference columns on wiktionaries, wikivoyages and enwiki https://gerrit.wikimedia.org/r/#/c/376573/ , https://gerrit.wikimedia.org/r/#/c/371630/ (duration: 00m 46s)
18:33 robh: firmware update on scs-a1-codfw and scs-c1-codfw has been complete and tested good T174475
18:29 chasemp: bump nodepool rate to 15 and max-servers to 18 (no restart as it picks up config periodically)
18:29 robh: firmware update on scs-a1-codfw and scs-c1-codfw has been started, awaiting completion T174475
18:24 robh: updating firmware on scs-a1-codfw and scs-c1-codfw
18:10 robh: scs-ulsfo has successfully updated firmware T174475
18:03 robh: upgrading firmware on scs-ulsfo
17:09 gehel@tin: Finished deploy [wdqs/wdqs@177d20a]: (no justification provided) (duration: 02m 39s)
17:07 gehel: wdqs weekly deployment (logging and throttling fixes)
17:06 gehel@tin: Started deploy [wdqs/wdqs@177d20a]: (no justification provided)
15:20 bblack@neodymium: conftool action : set/pooled=yes; selector: name=cp4022.*
15:20 bblack@neodymium: conftool action : set/pooled=yes; selector: name=cp4028.*
14:40 godog: roll-restart thumbor to apply https://gerrit.wikimedia.org/r/#/c/377264/ and upgrade to 1.4 - T173580 T174997
14:24 hashar: tin.eqiad.wmnet: reverted patch "WLFilters: Respect default values" that got merged but left undeployed
14:23 hashar: tin.eqiad.wmnet: reverted all four mediawiki-config that got merged but left undeployed and rebased the workspace
14:05 hashar: European SWAT cancelled due to deploy infrastructure issue
13:55 hashar@tin: Synchronized wmf-config/InitialiseSettings.php: (no justification provided) (duration: 00m 27s)
13:54 hashar@tin: Synchronized wmf-config/InitialiseSettings.php: (no justification provided) (duration: 00m 27s)
13:50 hashar@tin: Synchronized wmf-config/InitialiseSettings.php: (no justification provided) (duration: 00m 28s)
13:43 hashar@tin: Synchronized wmf-config/InitialiseSettings.php: (no justification provided) (duration: 00m 28s)
13:40 hashar@tin: Synchronized static/images/project-logos/huwiktionary.png: Change logo for huwiktonary - T175483 (duration: 00m 28s)
11:43 moritzm: installing file/libmagic security updates (stretch-specific, does not affect older distros)
10:30 jynus: restarting mysql @ labsdb1001
09:40 godog: roll-restart thumbor to apply changes for gif max animated area - T173580
09:39 mobrovac@tin: Finished deploy [cpjobqueue/deploy@e73a10f]: Initial deploy of cpjobqueue - T175281 (duration: 00m 20s)
09:39 mobrovac@tin: Started deploy [cpjobqueue/deploy@e73a10f]: Initial deploy of cpjobqueue - T175281
08:44 mobrovac@tin: Finished deploy [zotero/translators@a0c41c3]: Update translators to upstream e03695273 - T174992 (duration: 00m 07s)
08:44 mobrovac@tin: Started deploy [zotero/translators@a0c41c3]: Update translators to upstream e03695273 - T174992
08:16 volans: reimage script tests continue on mc100[1-2] T166300
07:59 moritzm: installing systemd updates from stretch point update
07:41 moritzm: installing remaining libonig security updates
02:41 l10nupdate@tin: ResourceLoader cache refresh completed at Mon Sep 11 02:41:43 UTC 2017 (duration 7m 0s)
02:34 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.17) (duration: 08m 10s)
2017-09-10
21:00 bawolff: disabled Jforrester's gerrit account and flush sessions
13:41 elukey: restart Varnish backend on cp1073 (cache::upload) for mailbox expiry lag
13:13 elukey: restart cp1053's varnish backend for mailbox expiry lag and 503s - T175473
00:37 chasemp: labnodepool1001:~# service nodepool restart
00:36 bd808: wmcs: purged rabbitmq ceilometer_notifications.* queues
2017-09-09
23:42 chasemp: labnodepool1001:~# service nodepool start
23:06 chasemp: freeswap && labcontrol1001:/home/rush# service rabbitmq-server restart
22:59 chasemp: restart nova-api and nova-network
22:49 chasemp: labnodepool1001:~# service nodepool stop
22:44 madhuvishy: Running service rabbitmq-server restart on labcontrol1001
20:41 legoktm@tin: Synchronized php-1.30.0-wmf.17/includes/filerepo/file/LocalFile.php: Fix issues related to comments and files in UI and API - T175443 T175444 (duration: 00m 46s)
01:12 ebernhardson@tin: Synchronized php-1.30.0-wmf.17/extensions/WikimediaEvents/modules/ext.wikimediaEvents.humanSearchRelevance.js: T171740 : Reduce annoyance of survey by enforcing minimum 2 days between showing survey to same browser (duration: 00m 46s)
2017-09-08
23:46 ebernhardson@tin: Synchronized wmf-config/CirrusSearch-rel-survey.php: T171740 : Fix inverted sampling rates for human relevance survey (duration: 00m 47s)
23:38 mutante: netmon1002 - terminated unused screen session
23:09 mutante: phab1001, phab2001 - terminating 3 unused screen processes that were from iridium migration / data sync
21:54 mutante: gerrit2001 - restarting gerrit to test logstash config change (while not touching "live" gerrit)
20:24 demon@tin: Synchronized php-1.30.0-wmf.17/extensions/Flow/includes/: bugs and stuff (duration: 00m 54s)
20:14 mutante: heze - rebooting for firmware upgrade
20:10 mutante: heze (bacula storage) - installing BIOS upgrade (T162850 )
20:10 demon@tin: Synchronized scap/scap.cfg: drop git_repo for now, T175041 (duration: 00m 46s)
19:43 mutante: zosma.codfw.wmnet - delete salt key, puppet node clean, puppet node deactivate, remove from Icinga,... (T138650 )
19:37 mutante: removing ganeti instance 'zosma' on ganeti2001 (T138650 )
18:17 demon@tin: Synchronized wmf-config/FeaturedFeedsWMF.php: no-op (duration: 00m 46s)
16:04 demon@tin: Synchronized php-1.30.0-wmf.17/extensions/AbuseFilter/includes/Views/AbuseFilterViewExamine.php: T175338 , followup (duration: 00m 46s)
16:03 demon@tin: Synchronized php-1.30.0-wmf.17/includes/revisiondelete/RevDelLogList.php: typofix (duration: 00m 46s)
15:53 bblack@neodymium: conftool action : set/pooled=yes; selector: name=cp4027.*
15:47 ejegg: increased cURL timeout for PayPal IPN confirmation to 14 sec
15:07 akosiaris@tin: Finished deploy [librenms/librenms@5554213]: Testing new scap keyholder_key config (duration: 00m 04s)
15:06 akosiaris@tin: Started deploy [librenms/librenms@5554213]: Testing new scap keyholder_key config
15:02 bblack@neodymium: conftool action : set/pooled=yes; selector: name=cp4021.*
14:58 volans: testing wmf-auto-reimage also on mc1002 T166300 T164341
12:36 _joe_: restarting a check of the md2 devices on conf200* T162013 after reducing sync speed
12:36 _joe_: reduced raid resync max speed of raid devices on conf200* to 20000
10:35 hashar@tin: Synchronized php-1.30.0-wmf.17/extensions/ProofreadPage: Restore page status buttons - T175304 (duration: 00m 50s)
10:12 _joe_: stopped all md devices syncs on conf200* machines
10:09 _joe_: etcd cluster lost consensus T162013
10:07 volans: testing wmf-auto-reimage on mc1001 T166300 T164341
09:58 _joe_: running the same command on conf2002 too
09:48 _joe_: running `/usr/share/mdadm/checkarray --cron --all --idle --quiet` on conf2001 and conf2003 trying to reproduce consensus issues in T162013
09:25 moritzm: restarting apache on tegmen/einsteinium to pick up security update
09:09 moritzm: installing libonig security updates
09:08 mobrovac@tin: Finished deploy [restbase/deploy@0d39acf] (dev-cluster): Use double writing for mobileapps in the dev cluster, take #2 - T169940 (duration: 01m 01s)
09:07 mobrovac@tin: Started deploy [restbase/deploy@0d39acf] (dev-cluster): Use double writing for mobileapps in the dev cluster, take #2 - T169940
09:01 moritzm: installing remaining gnutls updates from jessie point release
08:13 moritzm: installing libgd2 security updates
08:11 mobrovac@tin: Started deploy [restbase/deploy@0d39acf] (dev-cluster): Use double writing for mobileapps in the dev cluster - T169940
08:11 mobrovac: restbase enabled back puppet in the dev cluster - T169940
07:53 moritzm: installing ruby2.3 security updates
07:32 mobrovac@tin: Finished deploy [restbase/deploy@0d39acf] (dev-cluster): (no justification provided) (duration: 07m 02s)
07:25 mobrovac@tin: Started deploy [restbase/deploy@0d39acf] (dev-cluster): (no justification provided)
07:21 mobrovac@tin: Finished deploy [restbase/deploy@e6aeeeb] (dev-cluster): (no justification provided) (duration: 00m 11s)
07:21 mobrovac@tin: Started deploy [restbase/deploy@e6aeeeb] (dev-cluster): (no justification provided)
00:42 demon@tin: Synchronized php-1.30.0-wmf.17/extensions/AbuseFilter/includes/Views/AbuseFilterViewExamine.php: fix comment stuff (duration: 00m 46s)
2017-09-07
23:42 bblack: cp1066 - depooled all traffic services, seems to have some bug causing 503 spikes
23:19 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group2 to wmf.17
23:15 XenoRyet: Updated Smashpig from 8eb98c1 to dce4f0a
22:41 niharika29@tin: Synchronized php-1.30.0-wmf.17/extensions/ArticleCreationWorkflow/: Bump extension version (duration: 00m 49s)
22:03 niharika29@tin: Finished scap: Deploying ArticleCreation Workflow on test2 wiki TT175302 (duration: 23m 00s)
21:40 niharika29@tin: Started scap: Deploying ArticleCreation Workflow on test2 wiki TT175302
21:23 demon@tin: Synchronized php-1.30.0-wmf.17/includes/api/ApiQueryRecentChanges.php: T175307 (duration: 00m 49s)
21:01 urandom: T169940 : Disabling puppet and restbase service in RESTBase dev
21:00 urandom: T169940 : Disabling changeprop in RESTBase dev environment
19:05 demon@tin: Synchronized php-1.30.0-wmf.17/extensions/MobileFrontend/includes/specials/SpecialMobileHistory.php: T175161 (duration: 00m 49s)
18:48 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group1 to wmf.17
18:47 demon@tin: Synchronized php: symlink update -> wmf.17 (duration: 00m 48s)
18:39 legoktm: restarted script to reparse all pages in parsoid for Linter (python3 parsoid_reparse.py http://parsoid.discovery.wmnet:8000 --sitematrix --linter-only --skip-closed https://aa.wikipedia.org/w/api.php ) - T161556
17:55 ejegg: disabled queue consumers for data cleaning job
17:52 ejegg: updated CiviCRM from c1ece1e to 63288ae
17:35 otto@tin: Finished deploy [analytics/refinery@6b60f2c]: (no justification provided) (duration: 03m 23s)
17:32 otto@tin: Started deploy [analytics/refinery@6b60f2c]: (no justification provided)
17:31 otto@tin: Finished deploy [analytics/refinery@bdf6754]: (no justification provided) (duration: 00m 23s)
17:31 otto@tin: Started deploy [analytics/refinery@bdf6754]: (no justification provided)
17:23 otto@tin: Finished deploy [analytics/refinery@bdf6754]: (no justification provided) (duration: 03m 25s)
17:20 otto@tin: Started deploy [analytics/refinery@bdf6754]: (no justification provided)
16:13 demon@tin: Synchronized wmf-config/InitialiseSettings.php: Remove Extension:RelatedSites from zhwikivoyage (duration: 00m 50s)
15:32 mobrovac@tin: Finished deploy [restbase/deploy@77961d0]: Revert using MCS for the summary end point - T168848 (duration: 06m 18s)
15:25 mobrovac@tin: Started deploy [restbase/deploy@77961d0]: Revert using MCS for the summary end point - T168848
14:45 _joe_: manually running /usr/share/mdadm/checkarray --cron --all --idle --quiet on conf2001, trying to reproduce consensus issues in T162013
14:40 jynus: rebooting and upgrading es1019
13:54 godog: powercycle ms-be2023 - load through the roof and no login possible
13:54 oblivian@puppetmaster1001: conftool action : set/pooled=true; selector: dnsdisc=jobrunner,name=eqiad
13:14 zeljkof: EU SWAT finished
13:13 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Add English Wiktionary as a client of Wikidata (T159316) (duration: 00m 49s)
13:12 zfilipin@tin: Synchronized dblists/wikidataclient.dblist: SWAT: Add English Wiktionary as a client of Wikidata (T159316) (duration: 00m 49s)
11:24 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Pool db1100 with 0 weight - T172679 (duration: 00m 49s)
09:15 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Add db1100 depooled to s5 array - T172679 (duration: 00m 49s)
08:59 ariel@tin: Finished deploy [dumps/dumps@da05e9f]: add general info to siteinfo dumps (duration: 00m 02s)
08:58 ariel@tin: Started deploy [dumps/dumps@da05e9f]: add general info to siteinfo dumps
08:44 elukey: restart varnish backend on cp1063 - mailbox expiry lag
07:57 elukey: force re-mount of /mnt/hdfs on stat1005
07:54 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Add db1100 - T172679 (duration: 00m 48s)
07:53 marostegui@tin: Synchronized wmf-config/db-codfw.php: Add db1100 - T172679 (duration: 00m 49s)
07:45 mobrovac@tin: Finished deploy [restbase/deploy@b51c58c]: Use MCS for the summary endpoint - T168848 (duration: 06m 36s)
07:39 mobrovac@tin: Started deploy [restbase/deploy@b51c58c]: Use MCS for the summary endpoint - T168848
07:35 mobrovac@tin: Finished deploy [restbase/deploy@b51c58c] (staging): Use MCS for the summary endpoint (duration: 01m 43s)
07:33 mobrovac@tin: Started deploy [restbase/deploy@b51c58c] (staging): Use MCS for the summary endpoint
07:20 oblivian@tin: Synchronized rpc/RunSingleJob.php: Adding the RunSingleJob.php rpc endpoint for jobrunners (duration: 00m 56s)
06:07 eileen: civicrm updated from 73a24b4 to c1ece1e
03:13 eileen: seconds behind=78
03:08 eileen: seconds behind 55
02:52 l10nupdate@tin: ResourceLoader cache refresh completed at Thu Sep 7 02:52:05 UTC 2017 (duration 7m 15s)
02:44 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.17) (duration: 06m 15s)
02:43 eileen: restarting groupmember.load after hacking out cache delete calls
02:40 eileen: update civicrm from 764bfe1 to 73a24b4
02:24 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.16) (duration: 07m 23s)
02:05 eileen: replag 217 behind
01:56 eileen: replag 197 seconds
01:47 eileen: replag 23 seconds
01:29 eileen: replag 0
01:22 eileen: replag 550 sec behind master
01:13 eileen: replag delay 422
01:12 eileen: job killed
00:49 eileen: 8389 contacts now
00:49 eileen: 7971 contacts so far
00:46 eileen: started Omnigroupmember.load
00:24 twentyafterfour: phabricator update deployed
00:12 twentyafterfour@tin: Finished deploy [phabricator/deployment@c265c1a]: (no justification provided) (duration: 00m 24s)
00:11 twentyafterfour@tin: Started deploy [phabricator/deployment@c265c1a]: (no justification provided)
00:07 twentyafterfour: deploying phabricator update 2017-09-06 https://phabricator.wikimedia.org/project/view/2980/
2017-09-06
23:41 twentyafterfour@tin: Synchronized php-1.30.0-wmf.16/extensions/WikimediaEvents/: Sync Change-Id: I7ae522 to all webservers refs T174387 (duration: 00m 49s)
23:28 twentyafterfour@tin: Synchronized php-1.30.0-wmf.17/extensions/WikimediaEvents/: Sync Change-Id: I7ae522 to wmf.17 refs T174387 (duration: 00m 49s)
23:23 twentyafterfour@tin: Synchronized wmf-config/CirrusSearch-common.php: CirrusSearch-common.php goes last (duration: 00m 48s)
23:21 twentyafterfour@tin: Synchronized wmf-config/InitialiseSettings.php: Now sync initializesettings (duration: 00m 49s)
23:19 twentyafterfour@tin: Synchronized wmf-config/CirrusSearch-rel-survey.php: sync the added file first (duration: 00m 49s)
23:17 twentyafterfour@tin: scap failed: average error rate on 9/11 canaries increased by 10x (rerun with --force to override this check, see https://logstash.wikimedia.org/goto/2cc7028226a539553178454fc2f14459 for details)
23:12 twentyafterfour@tin: Synchronized wmf-config/: roll-back due to huge error rate spike (duration: 00m 51s)
23:10 twentyafterfour@tin: Synchronized wmf-config/: deploy config for CirrusSearch human relevancy survey. Change-Id: I272c69 Bug: T174106 (duration: 00m 52s)
22:18 no_justification: prior deploy was no-op
22:18 demon@tin: Finished deploy [gerrit/gerrit@d4f9a77]: (no justification provided) (duration: 00m 07s)
22:18 demon@tin: Started deploy [gerrit/gerrit@d4f9a77]: (no justification provided)
22:11 bsitzmann@tin: Finished deploy [mobileapps/deploy@507a479]: Update mobileapps to 2cb6281 (T168848 T169277 T169274 T162179 T164033 T167921 T174698 T168848 T174808 ) (duration: 04m 53s)
22:06 bsitzmann@tin: Started deploy [mobileapps/deploy@507a479]: Update mobileapps to 2cb6281 (T168848 T169277 T169274 T162179 T164033 T167921 T174698 T168848 T174808 )
21:39 niharika29@tin: Synchronized wmf-config/CommonSettings-labs.php: Config for ArticleCreationWorkflow T175054 (duration: 00m 50s)
21:28 ejegg: updated payments-wiki from bf97cd2 to ed2e481
21:24 ejegg: updated CiviCRM from f9c373e to 764bfe1
21:17 ottomata: rebooting kafka-jumbo1004
20:34 arlolra: Updated Parsoid to f9d367ea (T169342 )
20:27 arlolra@tin: Finished deploy [parsoid/deploy@f07ac8c]: Updating Parsoid to f9d367ea (duration: 08m 27s)
20:18 arlolra@tin: Started deploy [parsoid/deploy@f07ac8c]: Updating Parsoid to f9d367ea
19:53 ottomata: reimaging kafka-jumbo* with stretch
19:48 chasemp: reboot labvirt1018
18:50 maxsem@tin: Synchronized wmf-config/: just to make sure... (duration: 00m 50s)
18:48 maxsem@tin: Synchronized wmf-config/CommonSettings.php: revert (duration: 00m 49s)
18:33 maxsem@tin: Synchronized wmf-config/CommonSettings.php: https://gerrit.wikimedia.org/r/#/c/370292/3 (duration: 00m 49s)
18:25 maxsem@tin: Synchronized wmf-config/CommonSettings-labs.php: https://gerrit.wikimedia.org/r/#/c/370291/2 (duration: 00m 49s)
18:19 maxsem@tin: Synchronized wmf-config/Wikibase.php: https://gerrit.wikimedia.org/r/#/c/376317/2 (duration: 00m 48s)
18:18 maxsem@tin: Synchronized wmf-config/Wikibase-production.php: https://gerrit.wikimedia.org/r/#/c/376317/2 (duration: 00m 48s)
18:09 maxsem@tin: Synchronized portals: (no justification provided) (duration: 00m 49s)
18:09 ejegg: rolled payments-wiki back to bf97cd2
18:08 maxsem@tin: Synchronized portals/prod/wikipedia.org/assets: (no justification provided) (duration: 00m 50s)
18:05 ejegg: updated payments-wiki from bf97cd2 to 6911158
17:53 bblack: cp1048 - varnish backend restart for mailbox lag...
17:23 demon@tin: Synchronized wmf-config/FeaturedFeedsWMF.php: code cleanup (duration: 00m 49s)
17:20 mutante: added LDAP user schoenbaechler to WMF group (T175156 )
17:16 mutante: modify-ldap-group on terbium is broken
17:02 ejegg: disabled creating CiviMail records for Thank You emails
16:51 demon@tin: Synchronized php-1.30.0-wmf.17/extensions/Flow/includes/: I284b5aa (duration: 01m 01s)
16:41 chasemp: disable puppet for cloud things to test changes
16:40 bblack: cp1072 - varnish backend restart (mailbox lag)
16:40 bblack: cp1062 - varnish backend restart (mailbox lag)
15:55 ejegg: re-enabled IPN notification at PayPal console (48 minutes ago)
15:53 akosiaris: powercycling lvs3001
15:28 papaul: firmware upgrade on mw2256
15:11 bblack: cp1049 - restart varnish backend (mailbox lag)
15:07 jynus@tin: Synchronized wmf-config/db-eqiad.php: Depool es1019 (duration: 00m 49s)
15:06 jynus@tin: Synchronized wmf-config/db-codfw.php: Repool db2040 (duration: 00m 49s)
14:17 gehel: wdqs1005 is coming up after a few reimaging issues, expect some icinga noise...
13:19 jynus: restarting and upgrading db2040
13:13 reedy@tin: Synchronized wmf-config/Wikibase-production.php: Move some wikidata config T174962 (duration: 00m 48s)
13:12 reedy@tin: Synchronized wmf-config/Wikibase-labs.php: Move some wikidata config T174962 (duration: 00m 49s)
13:07 reedy@tin: Synchronized wmf-config/throttle.php: Throttle exception T175113 (duration: 00m 49s)
12:08 moritzm: installing libapache2-mod-perl update from jessie 8.9 update
11:41 moritzm: installing gtk+2.0 update from jessie 8.9 update
11:24 elukey: temporarily raise kafka log4j authorizer verbosity to DEBUG on kafka1012 - T173493
11:21 moritzm: installing gnutls update from jessie 8.9 and stretch 9.1 point updates
11:20 jynus@tin: Synchronized wmf-config/db-codfw.php: Depool db2040 (duration: 00m 49s)
11:20 godog: reimage restbase1008 with cassandra 3 - T169939
11:15 marostegui: Disable puppet on db1100 for mydumper/myloader
10:26 moritzm: installing perl update from stretch 9.1 point release
10:25 moritzm: installing perl update from jessie 8.9 point release
10:12 godog: reimage restbase1010 with cassandra 3 - T169939
09:39 moritzm: installing libgd security updates
09:28 moritzm: installing libonig security uodates#
09:05 jynus: disabling puppet on most db hosts to merge firewall changes safely
08:53 godog: reimage restbase1009 with cassandra 3 - T169939
08:49 addshore@tin: Synchronized wmf-config/InitialiseSettings.php: T174948 Add WMDE log channel (duration: 00m 49s)
08:43 addshore@tin: Synchronized wmf-config/InitialiseSettings.php: T110170 Enable Newsletter on mediawikiwiki (duration: 00m 51s)
07:15 marostegui: Stop MySQL on db1049 to copy its content to db1100 - https://phabricator.wikimedia.org/T172679
06:46 moritzm: installing php-luasandbox 2.0.14 on API canaries along with HHVM restart (T173705 )
06:37 marostegui: Truncate l10n_cache table across production - T150306
04:08 demon@tin: Synchronized wmf-config/throttle.php: no-op (duration: 00m 49s)
03:14 l10nupdate@tin: ResourceLoader cache refresh completed at Wed Sep 6 03:14:16 UTC 2017 (duration 7m 6s)
03:07 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.17) (duration: 14m 49s)
02:31 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.16) (duration: 08m 18s)
away: disabled IPN messages at the PayPal console
00:14 Reedy: that was T174747 Adjust language icon color
00:14 reedy@tin: Synchronized php-1.30.0-wmf.16/skins/MinervaNeue: (no justification provided) (duration: 00m 49s)
00:10 reedy@tin: Synchronized php-1.30.0-wmf.16/extensions/Scribunto/tests/phpunit/: Fix broken test (duration: 00m 49s)
00:09 reedy@tin: Synchronized php-1.30.0-wmf.17/extensions/Scribunto/tests/phpunit/: Fix broken test (duration: 00m 50s)
2017-09-05
23:58 reedy@tin: Synchronized wmf-config/: Test ArticleCreationWorkflow extension on the Beta Cluster (duration: 00m 51s)
23:39 reedy@tin: Synchronized wmf-config/InitialiseSettings.php: Enable AbuseFilter runtime profile T161059 (duration: 00m 49s)
23:33 foks: 2FA disabled for Omshivaprakash (T175075 )
22:17 robh: rebooting mw1163 as its serial console is locked up
22:07 demon@tin: Synchronized php-1.30.0-wmf.16/extensions/FlaggedRevs/frontend/specialpages/reports/: kill some warnings and stuff (duration: 02m 55s)
22:06 demon@tin: Synchronized php-1.30.0-wmf.16/extensions/FlaggedRevs/frontend/specialpages/reports/: (no justification provided) (duration: 03m 13s)
20:48 ejegg: re-enabled donation queue consumer and thank you mailer
20:40 ejegg: updated civicrm from 820bd98 to f9c373e
20:37 ejegg: disabled Donation queue consumer and thank-you mail sender
19:38 gilles@tin: Synchronized private/PrivateSettings.php: Add Thumbor username to Swift configuration (duration: 00m 48s)
19:33 jynus: rebuilding querycache table on tewiktionary.dbstore1002, it had crashed
19:28 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: wmf.17 on group0
19:05 Reedy: deleted querycache rows where qc_type = '' on all wikis T174513
18:11 ejegg: updated fundraising tools from 9554229 to 8d39806
18:11 demon@tin: Finished scap: wmf.17 bootstrap (duration: 47m 00s)
17:38 bblack: cp1074 - restart varnish backend, mailbox lag
17:24 demon@tin: Started scap: wmf.17 bootstrap
17:14 demon@tin: Pruned MediaWiki: 1.30.0-wmf.12 (duration: 02m 32s)
16:59 andrewbogott: reducing the number of concurrent nova-conductor workers. May cause hiccups as services restart
16:58 elukey: ran authdns-update from ns1.w.o after https://gerrit.wikimedia.org/r/374385 to create electcom.wikimedia.org
16:51 jynus: reloading dbproxy1005 to repoing to db1009 again- things seem stable right now
16:45 elukey: add new virtualhost electcom.wikimedia.org to the appservers apache config - https://gerrit.wikimedia.org/r/374389 (implies apache config reload)
16:33 mattflaschen@tin: Synchronized php-1.30.0-wmf.16/: Prepare to enable RCFilters (WLFilters) on Watchlist, but without i18n changes (reverted) (duration: 12m 41s)
16:14 addshore: addshore@terbium:~$ mwscript extensions/WikimediaMaintenance/createExtensionTables.php --wiki mediawikiwiki --extension newsletter
16:08 akosiaris: reboot all kubernetes boxes T170119
15:43 matt_flaschen: 'Scap sync failed on i18n, so I'll deploy just the non-i18n ones'
15:36 mattflaschen@tin: scap aborted: Prepare to enable RCFilters (WLFilters) on Watchlist (duration: 08m 42s)
15:35 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1083 - T174509 (duration: 00m 47s)
15:27 mattflaschen@tin: Started scap: Prepare to enable RCFilters (WLFilters) on Watchlist
14:30 reedy@tin: Synchronized wmf-config/: phpcs (duration: 00m 47s)
14:28 reedy@tin: Synchronized tests/: phpcs (duration: 00m 46s)
14:26 reedy@tin: Synchronized errorpages/404.php: phpcs (duration: 00m 45s)
14:25 reedy@tin: Synchronized docroot/noc/conf/highlight.php: Fix links for dblists in highlight.php (duration: 00m 44s)
14:15 zeljkof: EU SWAT finished
14:13 zfilipin@tin: Synchronized php-1.30.0-wmf.16/includes/EditPage.php: SWAT: Re add wpScrolltop id in EditPage (T174723) (duration: 00m 45s)
13:47 zfilipin@tin: Synchronized php-1.30.0-wmf.16/extensions/ContentTranslation/: SWAT: encodeURIComponent title to escape / properly (T174792) (duration: 00m 48s)
13:30 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Add abusefilter-view-private to rollbackers in zhwiki (T174978) (duration: 00m 46s)
13:28 godog: reimage restbase2005 - T169939
13:17 marostegui: Drop pr_index tables from where ProofreadPage isn't enabled - T174782
13:14 zfilipin@tin: Synchronized static/images/project-logos/: SWAT: Update logo for sr.wikibooks.org (T172284) (duration: 00m 46s)
12:24 hashar: ci: switched mediawiki/core mediawiki/vendor php5.5 jobs from trusty to jessie - T161882
12:15 ladsgroup@tin: Synchronized wmf-config/Wikibase-production.php: Enable sendEchoNotification for enwiki, dewiki, frwiki (T142102 ) (duration: 00m 46s)
11:54 marostegui: Drop reader_feedback, reader_feedback_history, reader_feedback_pages tables - T174586
11:07 hoo@tin: Synchronized wmf-config/InitialiseSettings.php: (Actually) Enable the ArticlePlaceholder on sqwiki (T174335 ) (duration: 00m 45s)
11:05 hoo@tin: Synchronized wmf-config/InitialiseSettings.php: Enable the ArticlePlaceholder on sqwiki (T174335 ) (duration: 00m 46s)
10:38 hashar: Nodepool / CI are fully backup and processing queued jobs
10:25 hashar: Restarting Nodepool. I have managed to spawn instances manually for contintcloud tenant
10:18 _joe_: stopping puppet, nginx on mw2249 for experimentation
10:11 ema: reboot cp4024 T174891
10:08 godog: copy python-logstash from jessie-wikimedia to stretch-wikimedia
09:59 hashar: Stopped Nodepool on labnodepool1001.eqiad.wmnet
09:52 akosiaris: reimage kubernetes[12]00[1-4] T170119
08:49 moritzm: upgrading canary app servers to hhvm-luasandbox 2.0.14 (T173705 )
08:47 moritzm: uploaded php-luasandbox/hhvm-luasandbox 2.0.14 to apt.wikimedia.org (T173705 )
07:14 moritzm: installing libgd security updates on canary app servers (along with hhvm restart)
07:03 _joe_: launching manually 3 workers for refreshLinks jobs on commons, T173710
06:32 marostegui: Deploy alter table on db1083 - T174509
06:29 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1083 - T174509 (duration: 00m 46s)
06:25 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1089 - T174509 (duration: 00m 46s)
06:09 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1089 - T174509 (duration: 00m 55s)
02:30 l10nupdate@tin: ResourceLoader cache refresh completed at Tue Sep 5 02:30:03 UTC 2017 (duration 6m 37s)
02:23 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.16) (duration: 07m 28s)
2017-09-04
23:32 Dereckson: Set e-mail to Hexabot bot account to allow account recovery (T174973 )
22:07 legoktm: starting script to reparse all pages in parsoid for Linter (python2 parsoid-reparse.py http://parsoid.discovery.wmnet:8000 --sitematrix --linter-only --skip-closed https://aa.wikipedia.org/w/api.php ) - T161556
17:50 moritzm: uploaded debdeploy 0.0.99-2 to apt.wikimedia.org
17:19 gehel@tin: Finished deploy [wdqs/wdqs@1caaa30]: weekly WDQS deployment (duration: 02m 46s)
17:17 gehel@tin: Started deploy [wdqs/wdqs@1caaa30]: weekly WDQS deployment
16:29 mobrovac@tin: Finished deploy [restbase/deploy@8c9c436]: (no justification provided) (duration: 06m 23s)
16:23 mobrovac@tin: Started deploy [restbase/deploy@8c9c436]: (no justification provided)
16:10 mobrovac: restbase depooled restbase10(0[89]|10) and restbase2005 for T169939
16:07 _joe_: rolling restart of pybal in codfw/eqiad low-traffic pools for picking up the new jobrunner LVS endpoint
15:42 oblivian@puppetmaster1001: conftool action : set/pooled=yes; selector: cluster=jobrunner,service=nginx
14:51 godog: reimage restbase2003 for use with cassandra 3 / jbod - T169939
14:27 moritzm: enabled LDAP authentication for ganglia.wikimedia.org
13:37 gehel: rolling restart of elasticsearch/logstash to cleanup unused plugins - T174933
13:34 gehel: delete leftover grafana-dashboards index on elasticsearch logstash cluster
13:22 gehel: restart elasticsearch on logstash1006 to test plugin removal - T174933
12:26 moritzm: uploaded debdeploy 0.0.99-1+trusty/trusty-wikimedia to apt.wikimedia.org
12:18 marostegui: Stop MySQL on db1037 as it is going to be decommissioned - T174902
12:10 elukey: rolling restart of zookeeper on conf100[123] for jvm security updates
11:53 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Remove db1037 as it will be decommissioned - T174902 (duration: 00m 46s)
11:52 marostegui@tin: Synchronized wmf-config/db-codfw.php: Remove db1037 as it will be decommissioned - T174902 (duration: 00m 46s)
11:45 godog: restart varnish on cp1099 - mailbox lag
10:22 ariel@tin: Finished deploy [dumps/dumps@0b32eef]: fix abstract dumps variants regression (duration: 00m 02s)
10:22 ariel@tin: Started deploy [dumps/dumps@0b32eef]: fix abstract dumps variants regression
09:51 moritzm: uploaded debdeploy 0.0.99-1+jessie/jessie-wikimedia to apt.wikimedia.org
09:41 mobrovac@tin: Started restart [mathoid/deploy@44ea6d8]: Restart for libxml update
09:36 moritzm: uploaded debdeploy 0.0.99-1/stretch-wikimedia to apt.wikimedia.org
08:52 elukey: restart zookeeper on conf2003 for jvm security updates
08:48 marostegui: Stop MySQL on db1045 as it will be decommissioned - T174806
08:47 moritzm: installing mercurial security updates
08:40 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Remove db1045 as it will be decommissioned - T174806 (duration: 00m 46s)
08:39 moritzm: installing libgcrypt20 security updates
08:39 marostegui@tin: Synchronized wmf-config/db-codfw.php: Remove db1045 as it will be decommissioned - T174806 (duration: 00m 48s)
08:33 godog: upload scap 3.7.0-1 to apt.w.o - T127762
07:39 moritzm: installing gnupg security updates
07:03 marostegui: Deploy alter table on s3 codfw master with replication on kpbwiki - T168661
07:02 _joe_: starting additional runJobs instance for htmlcacheupdate on commons T173710
04:52 marostegui: Deploy alter table on db1052 - T168661
02:44 l10nupdate@tin: ResourceLoader cache refresh completed at Mon Sep 4 02:44:15 UTC 2017 (duration 6m 52s)
02:37 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.16) (duration: 08m 17s)
2017-09-03
17:52 elukey: depooled cp4024 (ulsfo upload) due to kernel errors in dmesg
15:57 ema: restart varnish backend on cp1073 (mailbox lag)
09:41 ema@neodymium: conftool action : set/pooled=yes; selector: name=cp4024.ulsfo.wmnet,service=varnish-fe
09:41 ema@neodymium: conftool action : set/pooled=yes; selector: name=cp4024.ulsfo.wmnet,service=nginx
09:25 ema: cp4024 is down, powercycled
09:13 _joe_: manually fixed value of /conftool/v1/pools/eqiad/api_appserver/apache2/mw1208.eqiad.wmnet in codfw to allow replication to restart
00:59 legoktm: legoktm@terbium:~$ mwscript extensions/WikimediaMaintenance/filebackend/setZoneAccess.php --wiki=hiwikiversity --backend=local-multiwrite # T174859
2017-09-01
16:18 bblack: restart varnish backend on cp1074 (mailbox lag)
{{safesubst:SAL entry|1=15:15 urandom: Restarting Cassandra: restbase-dev100[5-6]-{a,b}}}
{{safesubst:SAL entry|1=15:06 urandom: Restarting Cassandra: restbase-dev1004-{a,b}}}
15:00 reedy@tin: Synchronized php-1.30.0-wmf.16/extensions/Popups/: T174724 (duration: 00m 46s)
14:28 marostegui: Add 150G to /srv partition on labsdb1001
13:42 marostegui: Rename table pr_index on enwiki on db1089 - T174782
13:10 marostegui: Stop MySQL on db1026 as it will be decommissioned - T174763
12:32 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Restore db1081 original weight - T168661 (duration: 00m 43s)
12:19 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Remove db1026 as it will be decommissioned - T174763 (duration: 00m 43s)
12:17 marostegui@tin: Synchronized wmf-config/db-codfw.php: Remove db1026 as it will be decommissioned - T174763 (duration: 00m 43s)
11:50 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1081 weight - T168661 (duration: 00m 43s)
11:23 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1081 with low weight - T168661 (duration: 00m 44s)
10:35 elukey: stop puppet on thorium and disable root rsyncs - T174756
10:31 marostegui: Upgrade MariaDB to 10.0.32 on db1081 - T168661
10:29 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1081 - T168661 (duration: 00m 43s)
09:10 godog: bounce graphite-web on graphite1001, problematic query? using all CPU
09:04 ema: lvs1007 upgrade to pybal 1.13.11 - one-packet-scheduling, instrumentation fixes. T104442 , T103882
08:59 godog: depool restbase200[135] before reimage - T169939
07:22 elukey: restart apache2 and hue on thorium, Analytics sites down, investigating
07:06 marostegui: Power reset db2044 as it is unresponsive - T174764
03:54 krinkle@tin: Synchronized wmf-config/InitialiseSettings.php: I4dfc33f66c3 - Enable jQuery 3 on nlwiki sister projects (duration: 00m 43s)
00:57 tstarling@tin: Synchronized php-1.30.0-wmf.16/includes/parser/Parser.php: (no justification provided) (duration: 00m 44s)
00:13 RainbowSprinkles: gerrit: flushed all non-login caches, things might be sluggish for the next ~15mins or so
2017-08-31
20:03 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: (no justification provided)
19:27 mattflaschen@tin: Finished scap: Watchlist filters: Convert edit watchlist button to new UX and fix server-side tag filtering. T172030 (duration: 21m 23s)
19:17 cmjohnson1: powercycled mw1294
19:05 mattflaschen@tin: Started scap: Watchlist filters: Convert edit watchlist button to new UX and fix server-side tag filtering. T172030
18:53 mattflaschen@tin: Synchronized docroot/noc/conf/interwiki-labs.php.txt: Interwiki links on Beta Cluster: T69931 (duration: 00m 47s)
18:51 mattflaschen@tin: Synchronized wmf-config: Interwiki links on Beta Cluster: T69931 (duration: 00m 49s)
17:50 demon@tin: Synchronized php-1.30.0-wmf.16/skins/MinervaNeue/resources/skins.minerva.icons.images.scripts/watch.svg: sorta impromptu swat thing (duration: 00m 47s)
16:35 ejegg: re-enabled donation queue consumer and thank you mail sender
15:25 ejegg: updated civicrm from f00611d to 820bd98
15:20 ejegg: updated SmashPig from 8f4388d to 8eb98c1
15:19 ejegg: turned off donation queue consumer and thank you job
15:18 elukey: restart zookeeper on conf200[2,3] for jvm security updates
13:36 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Restore db1084 original weight - T168661 (duration: 00m 47s)
13:10 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1084 weight - T168661 (duration: 00m 47s)
13:07 elukey: restart zookeeper on conf2001 for security updates (canary node)
13:00 moritzm: installing augeas security updates
12:34 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1084 with low weight - T168661 (duration: 00m 47s)
12:14 marostegui: Upgrade MariaDB on db1084 - T168661
12:11 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1084 - T168661 (duration: 00m 47s)
12:05 akosiaris: reimage chlorine and kubernetes1004 T170119
12:04 akosiaris: reimage chlorine and kubernetes1004 T10119
11:58 moritzm: restarting nginx on dataset1001/ms1001 to pick up libxml security update
11:21 elukey: restart apache2 on bohrium for libxml + gnutls security updates
11:18 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Restore db1056 original weight - T168661 (duration: 00m 47s)
11:17 moritzm: restarting apache on dbmonitor* to pick up libxml security update
10:27 moritzm: restarting nginx on prometheus* to pick up libxml security update
10:22 moritzm: restarting apache on einsteinium/tegmen (Icinga hosts) to pick up libxml security update
10:14 moritzm: restarting nginx on meitnerium/archiva.wikimedia.org to pick up libxml security update
10:13 moritzm: restarting nginx on debug proxies to pick up libxml security update
10:10 moritzm: restarting apache on hafnium to pick up libxml security update
09:57 addshore: extensions/Cognate/maintenance/recalculateCognateNormalizedHashes.php run done, 6326 hashes recalculated, T172987
09:50 addshore: addshore@terbium:~$ mwscript extensions/Cognate/maintenance/recalculateCognateNormalizedHashes.php --wiki enwiktionary --batch-size 1000000 # Should upsert 6326 rows T172987
09:44 addshore: addshore@terbium:~$ mwscript extensions/Cognate/maintenance/recalculateCognateNormalizedHashes.php --wiki enwiktionary --batch-size 1000000 --dry-run # T172987
09:43 moritzm: restarting apache on auth* to pick up libxml security update
09:40 moritzm: installing libxml2 security updates
09:10 moritzm: installing experimental hhvm-luasandbox package on mw1261 (canary app server) with backported patch for T171392 / T173705
09:04 hashar: contint1001 upgrading git, restarting git-daemon and zuul-merger - T161086
09:01 godog: test reimage of restbase2001 - T169939
08:57 hashar: contint2001 upgrading git, restarting git-daemon and zuul-merger - T161086
08:45 godog: remove /srv/deployment/grafana on naos and tin, unused - T170881
08:41 gehel: restart postgresql / kartotherian / tilerator / tileratorui on maps@eqiad
08:40 addshore@tin: Synchronized php-1.30.0-wmf.16/extensions/Cognate/maintenance/recalculateCognateNormalizedHashes.php: T172987 Add waitForReplication to RecalculateCognateNormalizedHashes script (duration: 00m 47s)
08:20 gehel: restart postgresql / kartotherian / tilerator / tileratorui on maps@codfw
08:16 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase traffic on db1056 - T168661 (duration: 00m 47s)
08:01 marostegui: Rename reader_feedback, reader_feedback_history, reader_feedback_pages tables on dewiki on db1092 - T174586
07:32 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase traffic on db1056 - T168661 (duration: 00m 47s)
06:45 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1056 with low weight - T168661 (duration: 00m 46s)
06:19 marostegui: Upgrade MariaDB to 10.0.32 on db1056 - T168661
06:18 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1056 - T168661 (duration: 00m 47s)
06:16 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1056 - T168661 (duration: 00m 47s)
02:55 l10nupdate@tin: ResourceLoader cache refresh completed at Thu Aug 31 02:55:12 UTC 2017 (duration 7m 9s)
02:48 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.16) (duration: 06m 40s)
02:30 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.15) (duration: 10m 02s)
2017-08-30
23:10 demon@tin: Synchronized php-1.30.0-wmf.16/extensions/Newsletter/includes/specials/SpecialNewsletter.php: T174604 (duration: 00m 47s)
23:09 demon@tin: Synchronized php-1.30.0-wmf.15/extensions/Newsletter/includes/specials/SpecialNewsletter.php: T174604 (duration: 00m 48s)
23:05 demon@tin: Synchronized wmf-config/unitConversionConfig.json: swat (duration: 00m 47s)
21:05 thcipriani@tin: rebuilt wikiversions.php and synchronized wikiversions files: group1 back to 1.30.0-wmf.16
21:03 thcipriani@tin: Synchronized php-1.30.0-wmf.16/extensions/ProofreadPage/ProofreadPage.body.php: ProofreadPage: Avoids a stack overflow T173520 (duration: 00m 47s)
20:45 thcipriani@tin: rebuilt wikiversions.php and synchronized wikiversions files: group1 back to 1.30.0-wmf.15
20:22 ppchelko@tin: Finished deploy [changeprop/deploy@8d5fa29]: Start rejecting deduplicated messages (duration: 01m 05s)
20:21 ppchelko@tin: Started deploy [changeprop/deploy@8d5fa29]: Start rejecting deduplicated messages
20:07 urandom: T169939 : Decommission Cassandra: restbase1009-c.eqiad.wmnet
19:09 ppchelko@tin: Finished deploy [changeprop/deploy@ff93ea8]: Don't deduplicate retry messages (duration: 01m 17s)
19:07 ppchelko@tin: Started deploy [changeprop/deploy@ff93ea8]: Don't deduplicate retry messages
19:06 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group1 to wmf.16
19:04 demon@tin: Synchronized php: Symlink swap for group1/wmf.16 (duration: 00m 46s)
18:11 maxsem@tin: Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/374399/4 (duration: 00m 47s)
17:29 urandom: T169939 : Decommission Cassandra: restbase1009-b.eqiad.wmnet
17:27 hoo: Updated the Wikidata property suggester with data from Monday's JSON dump and applied the T132839 workarounds
17:12 ejegg: disabled creating CiviMail records for Thank You emails
16:49 XioNoX: rolling back vrrp priority change for the cr2<->asw-d-codfw link - T174366
16:34 XioNoX: start work on the cr2<->asw-d-codfw link - T174366
16:13 XioNoX: lowering vrrp priority for the cr2<->asw-d-codfw link - T174366
15:56 elukey: re-added analytics1055 among the hdfs/yarn worker after maintenance
15:49 bblack: cp1049 - restart varnish backend, mailbox lag
15:43 urandom: T169939 : Decommission Cassandra: restbase1009-a.eqiad.wmnet
15:02 jmm@puppetmaster1001: conftool action : set/pooled=yes; selector: mw1294.eqiad.wmnet
14:31 moritzm: restarting nginx on sodium to pick up libxml security update
14:07 elukey: restart java daemons on druid100[456] for jvm security updates
14:04 moritzm: restarting squid on install* servers to pick up libxml security update
14:04 moritzm: restarting squid on installurl downloaders to pick up libxml security update
14:02 moritzm: restarting apache on krypton to pick up libxml security update
13:57 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Restore db1053 original weight - T168661 (duration: 00m 46s)
13:55 hashar@tin: Synchronized wmf-config/InitialiseSettings.php: pagePreviews: Scale A/B test bucket sizes by 10 - T172291 (duration: 00m 46s)
13:50 moritzm: restarting squid on url downloaders to pick up libxml security update
13:34 moritzm: installing libxml2 security updates
13:23 moritzm: installing ffmpeg security upadtes
13:21 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1053 weight - T168661 (duration: 00m 46s)
13:19 hashar: European SWAT completed
13:16 gehel: restarting wdqs-blazegraph and wdqs-updater on all wdqs nodes for logback config change - T172710
13:11 hashar@tin: Synchronized wmf-config/InitialiseSettings.php: Enable SandboxLink on cywiki - T173054 (duration: 00m 48s)
12:55 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1053 weight - T168661 (duration: 00m 45s)
12:51 gehel: restart wdqs-updater on wdqs2001 for config change
12:40 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1053 weight - T168661 (duration: 00m 47s)
12:26 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1053 with low weight - T168661 (duration: 00m 46s)
12:05 marostegui: Upgrade MariaDB to 10.0.32 on db1053 - T168661
12:05 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1053 - T168661 (duration: 00m 47s)
11:06 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Restore db1055 original weight - T174265 (duration: 00m 46s)
10:24 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Give db1055 more traffic - T174265 (duration: 00m 47s)
09:49 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Give db1055 more traffic - T174265 (duration: 00m 47s)
09:17 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1059 - T168661 (duration: 00m 47s)
09:07 elukey: restart all jvm daemons on druid100[123] for security updates
08:51 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Give db1055 more traffic - T174265 (duration: 00m 48s)
08:06 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Give db1055 more traffic - T174265 (duration: 00m 47s)
07:47 jmm@puppetmaster1001: conftool action : set/pooled=yes; selector: mw1228.eqiad.wmnet
07:47 marostegui: Upgrade MariaDB on db1059 to 10.0.32 - T168661
07:44 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1059 for a MariaDB upgrade - T168661 (duration: 00m 53s)
07:14 moritzm: powering down mw1294 for hardware diagnostics (T1674069
07:12 jmm@puppetmaster1001: conftool action : set/pooled=inactive; selector: mw1294.eqiad.wmnet
07:12 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Slowly repool db1055 - T174265 (duration: 00m 52s)
03:16 l10nupdate@tin: ResourceLoader cache refresh completed at Wed Aug 30 03:16:09 UTC 2017 (duration 7m 19s)
03:08 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.16) (duration: 15m 42s)
02:32 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.15) (duration: 08m 28s)
02:17 demon@tin: Finished deploy [gerrit/gerrit@f33c63f]: Testing, no-op (duration: 00m 07s)
02:17 demon@tin: Started deploy [gerrit/gerrit@f33c63f]: Testing, no-op
01:27 demon@tin: Synchronized wmf-config/InitialiseSettings.php: remove education program from legalteamwiki (duration: 00m 47s)
00:53 demon@tin: Synchronized wmf-config/flaggedrevs.php: killing FR on testwiki (duration: 00m 46s)
00:52 demon@tin: Synchronized dblists/flaggedrevs.dblist: killing FR on testwiki (duration: 00m 47s)
00:05 cwd: turned dedupe off
2017-08-29
23:50 cmjohnson1: reinstalling mw1228
23:34 demon@tin: Synchronized php-1.30.0-wmf.15/extensions/Popups/: swat (duration: 00m 47s)
23:33 demon@tin: Synchronized php-1.30.0-wmf.16/extensions/Popups/: swat (duration: 00m 50s)
21:33 urandom: T169939 : Decommission Cassandra: restbase1008-c.eqiad.wmnet
21:16 andrewbogott: restarting designate-sink on labservices1001
21:01 ejegg: updated DjangoBannerStats from 5963e7c to ce95eab
20:56 ejegg: updated payments-wiki from 58d33b8 to bf97cd2
20:49 godog: bounce varnish on cp1074 / cp1099 / cp1072 - mailbox lag
20:38 ppchelko@tin: Finished deploy [changeprop/deploy@a57c79d]: Release a redis-based deduplicator in test mode. Attempt 2 (duration: 01m 17s)
20:37 ppchelko@tin: Started deploy [changeprop/deploy@a57c79d]: Release a redis-based deduplicator in test mode. Attempt 2
20:36 ppchelko@tin: Finished deploy [changeprop/deploy@ed0fadc]: Release a redis-based deduplicator in test mode (duration: 04m 28s)
20:31 ppchelko@tin: Started deploy [changeprop/deploy@ed0fadc]: Release a redis-based deduplicator in test mode
19:47 urandom: T169939 : Decommission Cassandra: restbase1008-b.eqiad.wmnet
19:34 ottomata: restarting analytics kafka brokers and all mirror maker processes to apply https://gerrit.wikimedia.org/r/#/c/374610/
19:12 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group0 to wmf.16
19:08 ppchelko@tin: Finished deploy [restbase/deploy@7f2e55f]: Update CXServer endpoints config (duration: 06m 48s)
19:02 ottomata: restarting main-eqiad -> analytics kafka mirror maker processes on analytics kafka brokers, something is not working...
19:01 XenoRyet: updated Smashpig from 98c5516 to 8f4388d
19:01 ppchelko@tin: Started deploy [restbase/deploy@7f2e55f]: Update CXServer endpoints config
18:19 robh: attempting firmware upgrade on scs-a8-eqiad
18:17 Pchelolo: depooling restbase1008,1009,1010,2003,2005 while cluster reshaping is going on
18:15 demon@tin: Finished scap: bootstrap wmf.16 (duration: 45m 27s)
18:15 urandom: T169939 : Decommission Cassandra: restbase1008-a.eqiad.wmnet
17:30 demon@tin: Started scap: bootstrap wmf.16
17:23 ejegg: restarted donation queue consumer and thank you mailer
17:20 Reedy: Disabled oathauth for KartikMistry on wikitech
16:33 ejegg: updated civicrm from 7dfea0d to f00611d
16:32 ejegg: turned off donation queue consumer and thank you mailer
16:04 ejegg: restarted ingenico recurring charge job
15:54 urandom: T169939 : Decommission Cassandra: restbase2005-c.codfw.wmnet
14:51 elukey: drop log.MobileWebUIClickTracking_10742159_15423246 from db1047 (archived on HDFS) - T172322
14:33 marostegui: Shutdown db1055 to replace its BBU - T174265
13:12 zeljkof: EU SWAT finished
13:10 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Add several HD logos (T150618) (duration: 00m 43s)
13:09 zfilipin@tin: Synchronized static/images/project-logos/: SWAT: Add several HD logos (T150618) (duration: 00m 43s)
12:37 elukey: restart kafka daemons on kafka1014 for jvm security updates
12:31 godog: bounce pybal on lvs1003 to pick up config changes
11:49 elukey: restart kafka daemons on kafka1013 for jvm security updates
11:43 jmm@puppetmaster1001: conftool action : set/pooled=yes; selector: mw1168.eqiad.wmnet
11:30 elukey: restart java daemons on analytics100[1,2] (Hadoop Master nodes) for jvm updates
11:08 marostegui: Stop MariaDB on db1097 to migrate it to file per table - T161088
10:57 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1097 - T161088 (duration: 00m 43s)
10:48 moritzm: installing libxml2 security updates
10:31 moritzm: reimaging mw1168 (video scaler) to jessie
10:16 jynus@tin: Synchronized wmf-config/db-eqiad.php: decom db1028 (duration: 00m 42s)
10:15 jynus@tin: Synchronized wmf-config/db-codfw.php: decom db1028 (duration: 02m 48s)
09:59 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1064 - T168661 (duration: 00m 43s)
09:39 marostegui: Update MariaDB on db1064 to 10.0.32 - T168661
09:38 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1064 for a mariadb upgrade - T168661 (duration: 00m 43s)
09:33 jmm@puppetmaster1001: conftool action : set/pooled=yes; selector: mw1169.eqiad.wmnet
09:29 elukey: re-installed pmacct/librdkafka1/kafkacat on rhenium with stretch versions - T173489
09:19 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Restore db1091 original weight - T168661 (duration: 00m 43s)
08:53 ema: upgrading cache_text to varnish 4.1.8-1wm1
08:45 akosiaris: reprepro copy calico, calico-cni from jessie-wikimedia to stretch-wikimedia (apt.wikimedia.org) T170119
08:43 hashar: Restarting Jenkins for openjdk update
08:42 elukey: restart yarn/hdfs daemons for openjdk security updates
08:42 akosiaris: upload kubernetes_1.7.4-1 to apt.wikimedia.org/stretch-wikimedia/main T170119
08:42 moritzm: restarting archiva to pick up openjdk security update
08:41 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Increase db1091 weight - T168661 (duration: 00m 43s)
08:35 gehel: restart wdqs-updater on wdqs2001
08:32 hashar: apt-get upgrade on contint1001 and contint2001
08:20 moritzm: reimaging mw1169 (video scaler) to jessie
08:09 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1091 with low weight - T168661 (duration: 00m 43s)
08:06 elukey: drop log.MobileWebUIClickTracking_10742159_15423246 from dbstore1002 to free space (table archived on HDFS) - T172322 T168303
07:41 marostegui: Upgrade MariaDB on db1091 to 10.0.32 - T168661
07:16 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1097 - T168661 (duration: 00m 42s)
06:56 moritzm: installing ghostscript security updates on trusty (Debian already fixed)
06:09 demon@tin: Synchronized scap/plugins/clean.py: no-op, for consistency (duration: 00m 43s)
06:05 marostegui: Restart MariaDB on db1102 and db1095 to pick up new replication filters - T174385
05:49 demon@tin: Pruned MediaWiki: 1.30.0-wmf.11 (duration: 02m 50s)
02:34 l10nupdate@tin: ResourceLoader cache refresh completed at Tue Aug 29 02:34:09 UTC 2017 (duration 6m 44s)
02:27 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.15) (duration: 08m 00s)
00:25 urandom: T169939 : Decommissioning restbase1010-a.eqiad.wmnet
2017-08-28
23:37 reedy@tin: Synchronized php-1.30.0-wmf.15/extensions/Score/: consistency (duration: 00m 43s)
23:31 reedy@tin: Synchronized php-1.30.0-wmf.15/extensions/Score/Score.body.php: output (duration: 00m 43s)
23:27 reedy@tin: Synchronized php-1.30.0-wmf.15/extensions/Score/Score.body.php: output (duration: 00m 42s)
23:23 reedy@tin: Synchronized wmf-config/CommonSettings.php: abc2ly in firejail (duration: 00m 43s)
23:22 reedy@tin: Synchronized php-1.30.0-wmf.15/extensions/Score/Score.body.php: Fix escaping (duration: 00m 43s)
23:13 cwd: re-enabled dedupe jobs
23:13 ebernhardson@tin: Synchronized php-1.30.0-wmf.15/extensions/WikimediaEvents/: Turn off two cirrus AB tests T171742 T171214 (duration: 00m 44s)
23:08 ebernhardson@tin: Synchronized docroot/noc/conf/categories-rdf.dblist: T173892 : Add list for wikis that would have categories dumped into RDF (duration: 00m 43s)
23:07 ebernhardson@tin: Synchronized dblists/categories-rdf.dblist: T173892 : Add list for wikis that would have categories dumped into RDF (duration: 00m 43s)
22:56 ejegg: turned fundraising orphan rectifier back on
22:46 urandom: T169939 : Decommissioning restbase1010-b.eqiad.wmnet
21:48 krinkle@tin: Synchronized wmf-config/InitialiseSettings.php: I1c9c9c0e - Enable jQuery 3 on nlwiki, svwiki, plwiki (duration: 00m 44s)
21:37 reedy@tin: Synchronized wmf-config/CommonSettings.php: Disable wgScoreAbc2Ly firejail T172582 (duration: 00m 44s)
21:06 ejegg: switched nightly Ingenico audit from wr1 to wx file parsing
21:06 reedy@tin: Synchronized wmf-config/CommonSettings.php: Run Score binaries in firejail T172582 (duration: 00m 43s)
21:01 reedy@tin: Synchronized wmf-config/CommonSettings.php: Move email blacklist to meta:Email_blacklist (duration: 00m 45s)
20:53 XioNoX: pushing "aggregate defaults discard" to all the cr* routers - T174364
20:52 urandom: T169939 : Decommissioning restbase1010-c.eqiad.wmnet
20:16 ayounsi@tin: Finished deploy [librenms/librenms@5fea59c]: (no justification provided) (duration: 00m 05s)
20:16 ayounsi@tin: Started deploy [librenms/librenms@5fea59c]: (no justification provided)
20:16 XioNoX: upgrading librenms to 1.31.01
20:08 bd808@tin: Finished deploy [striker/deploy@47b5e8a]: Deploying 47b5e8a (various bug fixes) (duration: 00m 26s)
20:07 bd808@tin: Started deploy [striker/deploy@47b5e8a]: Deploying 47b5e8a (various bug fixes)
19:24 ejegg: updated CiviCRM from 200f34b to 7dfea0d
19:24 ottomata: restarting eventbus in eqiad to apply https://gerrit.wikimedia.org/r/#/c/372179/
19:23 niharika29@tin: Synchronized wmf-config/InitialiseSettings.php: Grant arbcomers abusefilter-view-private and abusefilter-log-private at cswiki T174357 ; Enable popups on en and de wiki for A/B test T172291 (duration: 00m 43s)
19:15 herron: scb1001: restarted pdfrender service
19:14 ottomata: restarting eventbus in codfw to verify https://gerrit.wikimedia.org/r/#/c/372179/
19:09 ottomata: rolling restart and rebalances of main-* kafka clusters for https://gerrit.wikimedia.org/r/#/c/372179/
19:03 niharika29@tin: Synchronized php-1.30.0-wmf.15/extensions/CodeMirror/: Fix exception on some combination of quotes T174060 (duration: 00m 44s)
18:48 gwicke: restarted pdfrender instances in eqiad (T159922 )
18:48 niharika29@tin: Synchronized wmf-config/InitialiseSettings.php: pagePreviews: remove invalidated popup sampling rate variables https://gerrit.wikimedia.org/r/#/c/373171/7 (duration: 00m 43s)
18:47 ayounsi@tin: Finished deploy [librenms/librenms@8c9da11]: (no justification provided) (duration: 00m 02s)
18:47 ayounsi@tin: Started deploy [librenms/librenms@8c9da11]: (no justification provided)
18:43 aaron@tin: Synchronized php-1.30.0-wmf.15/includes/jobqueue/jobs/HTMLCacheUpdateJob.php: 2d83569397 - Fix old regression in HTMLCacheUpdate de-duplication (duration: 00m 44s)
18:37 niharika29@tin: Synchronized wmf-config/InitialiseSettings.php: Fix incorrect Special:UserLogin in Popups blacklist https://gerrit.wikimedia.org/r/#/c/373696/ (duration: 00m 43s)
18:33 XioNoX: upgrading librenms to 1.31
18:32 ayounsi@tin: Finished deploy [librenms/librenms@8c9da11]: (no justification provided) (duration: 00m 05s)
18:32 ayounsi@tin: Started deploy [librenms/librenms@8c9da11]: (no justification provided)
18:30 ejegg: updated CiviCRM from c30ec13 to 200f34b
18:21 niharika29@tin: Synchronized wmf-config/InitialiseSettings.php: Enable wgEchoPerUserBlacklist at all wikis T173838 (duration: 00m 43s)
18:15 niharika29@tin: Synchronized wmf-config/InitialiseSettings.php: Upload wikipedia wordmark zh https://gerrit.wikimedia.org/r/#/c/373948 (duration: 00m 45s)
18:14 niharika29@tin: Synchronized static/images/mobile/copyright/: Upload wikipedia wordmark zh https://gerrit.wikimedia.org/r/#/c/373948 (duration: 00m 44s)
17:59 XioNoX: pushing "aggregate defaults discard" to *ams - T174364
17:56 urandom: T169939 : Decommission Cassandra: restbase2005-b.codfw.wmnet
17:31 gehel: restarting wdqs-blazegraph and wdqs-updater for config change
17:19 gehel: restarting wdqs-blazegraph and wdqs-updater for config change
17:11 XioNoX: pushing "aggregate defaults discard" to cr2-knams - T174364
17:09 gehel@tin: Finished deploy [wdqs/wdqs@90f4e2d]: (no justification provided) (duration: 02m 27s)
17:07 gehel@tin: Started deploy [wdqs/wdqs@90f4e2d]: (no justification provided)
17:07 gehel@tin: Started deploy [wdqs/wdqs@90f4e2d]: (no justification provided)
16:46 demon@tin: Pruned MediaWiki: 1.30.0-wmf.10 (duration: 02m 41s)
16:39 demon@tin: Pruned MediaWiki: 1.30.0-wmf.14 [keeping static files] (duration: 02m 00s)
14:39 elukey: restart eventlogging_sync on dbstore1002 - issue after drop of old table
14:34 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Pool db1099 with normal weight on s5 - T172679 (duration: 00m 44s)
14:31 elukey: drop PageContentSaveComplete_5588433_15423246 from the log database on db1046 (m4-master) - T170720
14:28 zeljkof: EU SWAT finished
14:28 zfilipin@tin: Synchronized php-1.30.0-wmf.15/includes/jobqueue/jobs/HTMLCacheUpdateJob.php: SWAT: Disable rebound CDN purges for backlinks in HTMLCacheUpdateJob (T173710) (duration: 00m 45s)
14:01 zeljkof: extending EU SWAT for 10 or so minutes to deploy two more patches
13:55 elukey: restart kafka* daemons on kafka1012 for openjdk security updates (canary)
13:52 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Log WikibaseQualityConstraints (T171281) (duration: 00m 44s)
13:42 zfilipin@tin: Synchronized wmf-config/throttle-analyze.php: SWAT: Automatically include commons and wikidata in $wmgThrottlingExceptions (T163872) (duration: 00m 44s)
13:36 zfilipin@tin: Synchronized wmf-config/CommonSettings.php: SWAT: throttle.php: Separate the throttling definitions from the exception values itself (T167040) (duration: 00m 44s)
13:34 zfilipin@tin: Synchronized wmf-config/throttle.php: SWAT: throttle.php: Separate the throttling definitions from the exception values itself (T167040) (duration: 00m 44s)
13:33 zfilipin@tin: Synchronized wmf-config/throttle-analyze.php: SWAT: throttle.php: Separate the throttling definitions from the exception values itself (T167040) (duration: 00m 44s)
13:27 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Allow sysops to grant/remove transwiki user group in dtywiki (T174226) (duration: 00m 44s)
13:21 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Add 4 URLs to $wgCopyUploadsDomain whitelist (T174152) (duration: 00m 45s)
13:09 zfilipin@tin: Synchronized static/images/project-logos/: SWAT: Remove non-transparent background from dty.wiki logos (T174098) (duration: 00m 45s)
12:53 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Add more weight to db1099 on s5 - T172679 (duration: 00m 44s)
12:43 jmm@puppetmaster1001: conftool action : set/pooled=inactive; selector: mw1259.eqiad.wmnet
12:27 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Add more weight to db1099 on s5 - T172679 (duration: 00m 44s)
12:07 gehel: restarting nginx for upgrade on wdqs*
12:05 gehel: restarting nginx for upgrade on elastic* / relforge*
12:03 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Add db1099 pooled with low weight to s5 - T172679 (duration: 00m 45s)
11:05 moritzm: installing libxml2 security updates
08:41 moritzm: restarting squid3 on install1002
08:16 marostegui: Ugprade MariaDB on s4 codfw master - db2051 to 10.0.32 - T168661
08:10 marostegui: Stop MySQL on db1045 - T172679
08:04 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1045 to clone db1099 from it - T172679 (duration: 00m 46s)
07:51 moritzm: reimaging mw1259 to jessie (T145742 )
07:22 marostegui: Force re-learn cycle on db1055 - https://phabricator.wikimedia.org/T174265
07:12 moritzm: installing openjdk security updates on notebook* hosts
07:05 moritzm: installing openjdk security updates on meitnerium
02:36 l10nupdate@tin: ResourceLoader cache refresh completed at Mon Aug 28 02:36:23 UTC 2017 (duration 6m 54s)
02:29 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.15) (duration: 09m 36s)
2017-08-27
20:35 ariel@tin: Finished deploy [dumps/dumps@39f9b52]: write output files to temp location and move into place when complete (duration: 00m 02s)
20:34 ariel@tin: Started deploy [dumps/dumps@39f9b52]: write output files to temp location and move into place when complete
15:46 akosiaris: upload kubernetes_1.4.6-7 to apt.wikimedia.org/jessie-wikimedia/main T170346
05:30 marostegui: Force BBU relearn on db1055 - T174265
00:45 jynus: correction last log s/db1051/db1055/
00:42 jynus@tin: Synchronized wmf-config/db-eqiad.php: Depool db1051, hw issues, may get lag (duration: 00m 44s)
2017-08-26
15:46 godog: bounce varnish on cp1073 - mailbox lag
2017-08-25
19:09 gehel: actually not killing the "stuck discovery report-updater process on stat1005", it is already gone - T174110
19:07 gehel: kill stuck discovery report-updater process on stat1005 - T174110
16:03 moritzm: created cn=ciadmin group in LDAP to be used for privileged Jenkins access, initial members in ticket (T169557 )
14:29 godog: bounce varnish on cp1049 / cp1072 - mailbox lag
14:16 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db1097 - T168661 (duration: 00m 44s)
13:53 marostegui: Upgrade MariaDB on db1097 to 10.0.32 - T168661
13:53 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db1097 - T168661 (duration: 00m 44s)
13:40 marostegui: Upgrade MariaDB to 10.0.32 on db2019 - T168661
13:37 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2037 - T168661 (duration: 00m 45s)
13:24 marostegui: Upgrade MariaDB on db2037 to 10.0.32 - T168661
13:02 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db2037 - T168661 (duration: 00m 44s)
12:51 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2044 - T168661 (duration: 00m 44s)
12:29 marostegui: Upgrade MariaDB on db2044 - T168661
12:29 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db2044 - T168661 (duration: 00m 43s)
12:19 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2058 - T168661 (duration: 00m 48s)
11:48 jynus@tin: Synchronized wmf-config/db-eqiad.php: Depool db1028 (duration: 00m 47s)
09:19 moritzm: restart thumbor to pick up libxml2 security updates
09:12 godog: reimage restbase2001 to test new partman recipe - T169939
08:41 moritzm: restarting apache/hhvm on canary app servers to pick up libxml security update
08:00 volans: upgrading cumin to v1.0.0 on sarin/neodymium
07:55 gehel: reimaging logstash1006 after change of failed disk - T173679
07:49 jynus@tin: Synchronized wmf-config/db-eqiad.php: pool db1069 with more weight (duration: 00m 44s)
07:00 jynus@tin: Synchronized wmf-config/db-eqiad.php: decom db1033, pool db1069 with low weight (duration: 00m 44s)
06:55 jynus@tin: Synchronized wmf-config/db-codfw.php: decom db1033 (duration: 00m 43s)
06:30 jynus@tin: Synchronized wmf-config/db-eqiad.php: formatting fixes (duration: 00m 44s)
06:11 jynus@tin: Synchronized wmf-config/db-codfw.php: repool es2013, formatting fixes (duration: 00m 44s)
06:01 marostegui: Upgrade MariaDB to 10.0.32 on db2058 - T168661
06:00 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db2058 - T168661 (duration: 00m 43s)
05:30 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2065 - T168661 (duration: 00m 44s)
05:18 moritzm: re-enable mw1260 (video scaler) for active job processing
05:16 marostegui: Reboot db2065 to pick up new kernel
05:08 marostegui: Upgrade MariaDB on db2065 - T168661
05:03 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db2065 for a MariaDB upgrade - T168661 (duration: 00m 44s)
01:12 mattflaschen@tin: Synchronized php-1.30.0-wmf.15/extensions/Flow/modules/engine/components/board/features/flow-board-loadmore.js: T173807 : Fix Flow infinite scroll (duration: 00m 45s)
2017-08-24
23:17 thcipriani@tin: Synchronized php-1.30.0-wmf.15/extensions/TimedMediaHandler/TimedMediaHandler.php: SWAT: Disable Ogg Theora video transcodes in default config T172445 (duration: 00m 45s)
22:45 demon@tin: Pruned MediaWiki: 1.30.0-wmf.13 [keeping static files] (duration: 02m 01s)
22:24 ejegg: updated civicrm from fa882f2 to c30ec13 (pt-br and ja thank you letters)
21:59 gwicke: restarted pdfrender service instances in eqiad / T159922
21:54 maxsem@tin: Synchronized php-1.30.0-wmf.15/extensions/LoginNotify/: https://gerrit.wikimedia.org/r/#/c/373701/1 (duration: 00m 45s)
21:13 maxsem@tin: Synchronized php-1.30.0-wmf.15/extensions/LoginNotify/: Logging: https://gerrit.wikimedia.org/r/#/c/373691/ (duration: 00m 44s)
20:07 andrewbogott: stopping, starting, restarting etc. nodepool
19:58 urandom: T169939 : Decommission Cassandra: restbase2003-c.codfw.wmnet
19:24 thcipriani@tin: rebuilt wikiversions.php and synchronized wikiversions files: all wikis to 1.30.0-wmf.15
19:14 andrewbogott: restarting rabbitmq-server on labcontrol1001
18:38 ejegg: updated payments-wiki from bd9f730 to 58d33b8
18:38 bblack: varnish backend restart - cp1074 - mailbox lag
18:24 urandom: T169939 : Decommission Cassandra: restbase2003-b.codfw.wmnet
17:33 arlolra@tin: Finished deploy [parsoid/deploy@bd12f8a]: Updating Parsoid to 538dad7f (duration: 12m 24s)
17:21 arlolra@tin: Started deploy [parsoid/deploy@bd12f8a]: Updating Parsoid to 538dad7f
17:01 jynus: stopping and cloning db1033 to db1069
16:52 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Give normal weight to db1096 - T172679 (duration: 00m 47s)
16:43 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Give some more weight to db1096 - T172679 (duration: 00m 47s)
16:35 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Give some more weight to db1096 - T172679 (duration: 00m 47s)
16:34 urandom: T169939 : Decommission Cassandra: restbase2003-a.codfw.wmnet
16:24 andrewbogott: apt-get upgrade and updated to MW 1.29.1 on wikitech-static. Rebooting to pick up kernel updates.
15:58 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Give some weight to db1096 - T172679 (duration: 00m 47s)
15:17 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Add db1096 depooled to s5 - T172679 (duration: 00m 47s)
14:44 ladsgroup@tin: Synchronized php-1.30.0-wmf.15/extensions/Wikidata/extensions/Wikibase/client/includes/Changes/WikiPageUpdater.php: Reduce batch size in WikiPageUpdater (T173710 ) (duration: 00m 48s)
14:14 phuedx: refreshLinks on bawiki (T173994 )
14:13 volans: updated cumin package in our APT to version cumin_1.0.0-1
13:19 phuedx@tin: Synchronized wmf-config/InitialiseSettings-labs.php: T171853 : Re-enable Page Previews for enwiki and dewiki on the Beta Cluster (duration: 00m 47s)
13:13 phuedx@tin: Synchronized portals: (no justification provided) (duration: 00m 49s)
13:12 phuedx@tin: Synchronized portals/prod/wikipedia.org/assets: (no justification provided) (duration: 00m 49s)
12:45 gehel: killing discovery report updater on stats1005 (stuck since Aug 15) - T173333
12:30 marostegui: Stop MySQL on db1026 to clone db1096 from it - T172679
12:29 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1026 to clone db1096 from it T172679 (duration: 00m 47s)
11:36 jynus: disable puppet on db1069 in preparation for reimage
11:08 marostegui: Global rename Papa1234 â Karl-Heinz Jansen - T173859
09:56 jynus@tin: Synchronized wmf-config/db-eqiad.php: repool db1078 with full weight (duration: 00m 46s)
09:33 volans: executing https://wikitech.wikimedia.org/wiki/Cumin#Run_Puppet_only_if_last_run_failed to get rid of failed ones
08:54 jynus: 16 minute test run of rebuildTermSqlIndex.php on terbium
08:52 marostegui: Drop tables article_assessment, article_assessment_pages, article_assessment_ratings tables from testwiki - T173590
07:24 Amir1: starting the run for rebuildTermIndex (T171460 )
02:50 l10nupdate@tin: ResourceLoader cache refresh completed at Thu Aug 24 02:50:40 UTC 2017 (duration 7m 4s)
02:43 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.15) (duration: 05m 56s)
02:28 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.14) (duration: 09m 50s)
01:46 aaron@tin: Synchronized php-1.30.0-wmf.15/includes/jobqueue: Deploy 752580637 (T173710 ) (duration: 00m 49s)
2017-08-23
23:49 maxsem@tin: Synchronized php-1.30.0-wmf.14/extensions/LoginNotify/: SWAT fixes (duration: 00m 47s)
23:47 maxsem@tin: Synchronized php-1.30.0-wmf.15/extensions/LoginNotify/: SWAT fixes (duration: 00m 47s)
23:42 maxsem@tin: Synchronized php-1.30.0-wmf.14/extensions/CodeMirror/: https://gerrit.wikimedia.org/r/#/c/373324/ (duration: 00m 47s)
23:41 maxsem@tin: Synchronized php-1.30.0-wmf.15/extensions/CodeMirror/: https://gerrit.wikimedia.org/r/#/c/373324/ (duration: 00m 49s)
23:28 maxsem@tin: Synchronized php-1.30.0-wmf.15/resources/: https://gerrit.wikimedia.org/r/#/c/373391/ (duration: 00m 48s)
23:27 maxsem@tin: Synchronized wmf-config/MetaContactPages.php: https://gerrit.wikimedia.org/r/#/c/373369/ (duration: 00m 48s)
20:59 urandom: T169939 : Truncating MCS tables
20:26 thcipriani@tin: Synchronized php: group1 wikis to 1.30.0-wmf.15 (duration: 00m 46s)
20:25 thcipriani@tin: rebuilt wikiversions.php and synchronized wikiversions files: group1 wikis to 1.30.0-wmf.15
20:13 jynus@tin: Synchronized wmf-config/db-eqiad.php: repool db1078 with low weight (duration: 00m 47s)
20:12 thcipriani@tin: Finished scap: Revert ProofreadPage to db7507246665e69384c1d92af2aedc62263a5116 for wmf.15 T173520 (duration: 04m 11s)
20:08 thcipriani@tin: Started scap: Revert ProofreadPage to db7507246665e69384c1d92af2aedc62263a5116 for wmf.15 T173520
19:49 thcipriani@tin: Synchronized wmf-config/InitialiseSettings-labs.php: Beta-only change: pagePreviews: Enable A/B test (BC-only (duration: 00m 47s)
19:45 joal@tin: Finished deploy [analytics/refinery@f467ce1]: Bug fixing deploy (duration: 03m 16s)
19:44 thcipriani@tin: Synchronized php-1.30.0-wmf.14/extensions/MobileFrontend: Verify the existence of `url` key when parsing lang objects T172316 (duration: 00m 56s)
19:42 joal@tin: Started deploy [analytics/refinery@f467ce1]: Bug fixing deploy
19:36 thcipriani@tin: Synchronized php-1.30.0-wmf.15/extensions/TimedMediaHandler: Enable WebM playback via ogv.js T172444 (duration: 00m 50s)
19:22 kaldari: foreachwiki sql.php /srv/mediawiki/php/maintenance/archives/patch-ip_changes.sql
19:19 kaldari: mwscript sql.php --wiki=testwiki /srv/mediawiki/php/maintenance/archives/patch-ip_changes.sql
19:15 niharika29@tin: Synchronized wmf-config/InitialiseSettings.php: Remove CitThisPage from blacklist for page previews T173865 (duration: 00m 47s)
19:13 niharika29@tin: Synchronized php-1.30.0-wmf.15/extensions/TimedMediaHandler/: Enable WebM playback via ogv.js https://gerrit.wikimedia.org/r/#/c/373126/ (duration: 00m 49s)
19:12 niharika29@tin: Synchronized php-1.30.0-wmf.14/extensions/TimedMediaHandler/: Enable WebM playback via ogv.js https://gerrit.wikimedia.org/r/#/c/373126/ (duration: 00m 50s)
19:00 niharika29@tin: Synchronized php-1.30.0-wmf.14/extensions/MobileFrontend/: Verify the existence of key when parsing lang objects https://gerrit.wikimedia.org/r/#/c/373292/ (duration: 00m 49s)
18:57 niharika29@tin: Synchronized wmf-config/InitialiseSettings.php: Log channel for LoginNotify T173888 (duration: 00m 47s)
18:55 ppchelko@tin: Finished deploy [changeprop/deploy@1998c10]: [Config] Disable mobile rerenders for non-wikipedia domains (duration: 01m 33s)
18:54 ppchelko@tin: Started deploy [changeprop/deploy@1998c10]: [Config] Disable mobile rerenders for non-wikipedia domains
18:52 niharika29@tin: Synchronized php-1.30.0-wmf.15/extensions/LoginNotify/: Log everything T173888 (duration: 00m 48s)
18:43 gehel: manually running report updater for discovery golden data on stat1005 - T173333
18:40 niharika29@tin: Synchronized wmf-config/InitialiseSettings.php: Reinforce LoginNotify settings https://gerrit.wikimedia.org/r/#/c/372555/ (duration: 00m 47s)
18:40 gehel: resetting permissions on stat1005:/srv/published-datasets/discovery - T173333
18:39 niharika29@tin: Synchronized wmf-config/CommonSettings.php: Reinforce LoginNotify settings https://gerrit.wikimedia.org/r/#/c/372555/ (duration: 00m 47s)
18:32 thcipriani: jenkins restarted, zuul should pick-up queue
18:26 thcipriani: jenkins appears to be having some issues, trying to dump threads now and then restart to see if it helps
18:13 niharika29@tin: Synchronized wmf-config/InitialiseSettings.php: Allow crats to add people to accountcreator group on mw.or https://gerrit.wikimedia.org/r/#/c/373317/ (duration: 00m 49s)
17:40 bblack: reboot cp4021
16:38 cwd: disabled all dedupe
16:26 demon@tin: Pruned MediaWiki: 1.30.0-wmf.11 [keeping static files] (duration: 01m 32s)
16:24 bd808@tin: Finished deploy [striker/deploy@2f2dd7c]: Deploying 2f2dd7c "Tool account creation and more" (T128400 , T149458 , T159044 , T164847 , T167931 , T168480 , T173845 ) (duration: 00m 39s)
16:23 bd808@tin: Started deploy [striker/deploy@2f2dd7c]: Deploying 2f2dd7c "Tool account creation and more" (T128400 , T149458 , T159044 , T164847 , T167931 , T168480 , T173845 )
16:22 demon@tin: Pruned MediaWiki: 1.30.0-wmf.9 (duration: 03m 44s)
15:35 legoktm: added addshore to "integration" gerrit group (T173233 )
15:10 gehel: shutdown logstash1006 for disk replacement - T173679
14:49 moritzm: installing texlive security updates on trusty (Debian already fixed)
14:36 bblack@neodymium: conftool action : set/pooled=no; selector: name=cp4027.ulsfo.wmnet
14:34 bblack@neodymium: conftool action : set/pooled=yes; selector: name=cp4027.ulsfo.wmnet
13:31 marostegui: Stop MySQL on db1041 to get it ready for decommission - T173915
13:13 zeljkof: EU SWAT finished
13:10 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable Flow as a Beta feature on wawiki and wawikionary (T172947) (duration: 00m 48s)
12:16 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Remove db1041 to decommission it - T173915 (duration: 00m 48s)
12:15 marostegui@tin: Synchronized wmf-config/db-codfw.php: Remove db1041 to decommission it - T173915 (duration: 00m 48s)
10:39 jynus: stopping and deleting s1 and s4 from dbstore2001
10:06 joal@tin: Finished deploy [analytics/refinery@84d6ee4]: Regular deploy (1 month since last) (duration: 03m 30s)
10:03 moritzm: upgrading remaining image scalers to luasandbox 2.0.13
10:02 joal@tin: Started deploy [analytics/refinery@84d6ee4]: Regular deploy (1 month since last)
09:36 godog: kill older and running tasks from gerrit queue, it was stuck
09:10 moritzm: upgrading remaining API servers in to luasandbox 2.0.13
09:08 godog: upload prometheus-blackbox-exporter 0.7.0+ds1-1~wmf1 to jessie-wikimedia, backported - T169860
08:35 marostegui: Stop Upgrade MySQL on db2073 to 10.0.32 - T168661
08:22 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db2073 - T168661 (duration: 00m 59s)
08:18 moritzm: upgrading remaining job runners to luasandbox 2.0.13
07:12 marostegui: Global rename: Opdire657 â Sakiv - T173834
06:55 moritzm: upgrading remaining app servers in to luasandbox 2.0.13
03:15 l10nupdate@tin: ResourceLoader cache refresh completed at Wed Aug 23 03:15:36 UTC 2017 (duration 7m 6s)
03:08 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.15) (duration: 15m 11s)
02:32 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.14) (duration: 08m 03s)
2017-08-22
23:39 thcipriani@tin: Synchronized php-1.30.0-wmf.15/extensions/Linter/includes/LintErrorsPager.php: SWAT: Fix up 11f4a97ba6bcd0c1de (duration: 00m 49s)
23:35 thcipriani@tin: Finished scap: SWAT: Fix typo in the notification message (duration: 21m 48s)
23:13 thcipriani@tin: Started scap: SWAT: Fix typo in the notification message
23:10 thcipriani@tin: Synchronized php-1.30.0-wmf.14/extensions/Popups/includes/PopupsHooks.php: SWAT: Remove aborting of BeforePageDisplay hook T173411 (duration: 00m 49s)
21:18 niharika29@tin: Synchronized wmf-config/InitialiseSettings.php: Retry CodeMirror deployment T170966 (duration: 00m 49s)
20:41 ejegg: updated CiviCRM from 4aa177b to fa882f2
20:25 mutante: restbase-dev* - puppet runs fail due to E: Version '3.11.0' for 'cassandra' was not found
20:00 thcipriani@tin: rebuilt wikiversions.php and synchronized wikiversions files: group0 to 1.30.0-wmf.15
19:56 urandom: starting cassandra restbase2004-a and restbase2006-c, OOMs
19:46 thcipriani@tin: Finished scap: testwiki to php-1.30.0-wmf.15 and rebuild l10n cache (duration: 45m 55s)
19:00 thcipriani@tin: Started scap: testwiki to php-1.30.0-wmf.15 and rebuild l10n cache
18:50 robh: labservies1002 update completed (not 1003, typo)
18:50 robh: labservies1003 update completed
18:40 robh: firmware update of labservices1002 in progress
17:54 andrewbogott: removing obsolete apache2 and puppetmaster packages from labcontrol boxes for https://phabricator.wikimedia.org/T171786
17:07 thcipriani: starting branch cut for 1.30.0-wmf.15
13:58 godog: bounce varnish on cp1074 / cp1049 / cp1073 - mailbox problems
13:48 zeljkof: EU SWAT finished
13:45 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: relatedArticles: Tidy up config (T165991) (duration: 00m 44s)
13:44 zfilipin@tin: Synchronized wmf-config/CommonSettings.php: SWAT: relatedArticles: Tidy up config (T165991) (duration: 00m 44s)
13:29 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Set $wmgUseWikimediaShopLink to true for ptwiki (T173768) (duration: 00m 45s)
10:41 moritzm: upgrading hhvm-luasandbox on mw1293-mw1295 (image scalers)
10:21 Amir1: another run of rebuildTermSqlIndex (T171460 )
09:54 moritzm: upgrading hhvm-luasandbox on mw1189-mw1208 (API servers)
09:42 marostegui: Renaming user Darwinius â DarwIn - T173159
09:35 moritzm: upgrading hhvm-luasandbox on mw1180-1188 and mw1209-mw1220 (app servers)
09:34 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1064 - T172996 (duration: 00m 44s)
08:59 akosiaris: upload apertium-bel-rus_0.2.0~r81186-1+wmf to apt.wikimedia.org/jessie-wikimedia/main
08:55 moritzm: upgrading hhvm-luasandbox on mw1161-mw1167 (job runners)
08:43 akosiaris: upload apertium-rus_0.1.0~r81184-1+wmf to apt.wikimedia.org/jessie-wikimedia/main
08:43 akosiaris: upload apertium-bel_0.1.0~r81357-1+wmf to apt.wikimedia.org/jessie-wikimedia/main
08:35 godog: bounce varnish on cp1072 - mailbox problems
08:33 godog: bounce varnish on cp4015 - mailbox problems
08:24 marostegui: Drop s4 from db1095 with NO replication - T172996
08:23 moritzm: upgrading hhvm-luasandbox on deployment servers / script runners
07:34 marostegui: Stop replication on db1064 sanitarium2 and sanitarium3 master to move labsdb1009,10 and 11 s4 from db1095 to db1102 - https://phabricator.wikimedia.org/T172996
07:32 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1064 - T172996 (duration: 00m 52s)
07:28 moritzm: installing ruby security updates on trusty (Debian already fixed)
07:11 moritzm: installing c-ares security updates on trusty (Debian already fixed)
06:54 moritzm: installing graphite2 (the image library, not the metrics tool) security updates on trusty (Debian already fixed)
02:30 l10nupdate@tin: ResourceLoader cache refresh completed at Tue Aug 22 02:30:47 UTC 2017 (duration 6m 55s)
02:23 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.14) (duration: 07m 11s)
2017-08-21
22:50 robh: rebooting tin for firmware update since its idle
22:20 mutante: praseodymium - installing BIOS upgrade, reboot
22:11 mutante: xenon - installing BIOS upgrade
21:58 mutante: cerium (cassandra test) - rebooting for firmware upgrade
21:54 mutante: cerium - installing Dell BIOS upgrade (T162850 )
21:26 mutante: bast2001 - running Dell BIOS firmware upgrade (T162850 )
21:20 arlolra@tin: Finished deploy [parsoid/deploy@2210a38]: Updating Parsoid to 28a9a22b (duration: 10m 59s)
21:09 arlolra@tin: Started deploy [parsoid/deploy@2210a38]: Updating Parsoid to 28a9a22b
18:40 niharika29@tin: Synchronized dblists/pp_stage1.dblist: Roll page previews out to all wikis except en and de wiki T162672 (duration: 00m 44s)
18:34 niharika29@tin: Synchronized wmf-config/CommonSettings.php: Enable OOjs UI EditPage on all wikis https://gerrit.wikimedia.org/r/#/c/366868/ (duration: 00m 44s)
18:33 niharika29@tin: Synchronized wmf-config/InitialiseSettings.php: Enable OOjs UI EditPage on all wikis https://gerrit.wikimedia.org/r/#/c/366868/ (duration: 00m 44s)
18:30 niharika29@tin: Synchronized wmf-config/Wikibase.php: Wikibase on deployment-prep: Exclude non-existent wikis from clientDbList T173571 (duration: 00m 44s)
18:25 niharika29@tin: Synchronized wmf-config/InitialiseSettings.php: Increase AbuseFilter autodisable thresholds for Meta-Wiki T173633 (duration: 00m 44s)
18:22 niharika29@tin: Synchronized wmf-config/InitialiseSettings.php: Add CookBook and Cookbook Talk NS on hiwikibooks T173398 (duration: 00m 45s)
18:16 niharika29@tin: Synchronized wmf-config/InitialiseSettings.php: Administrators to add/remove 'transwiki' at nowiktionary T172365 (duration: 00m 45s)
18:12 niharika29@tin: Synchronized static/images/: Set project logo for wikimania2018wiki T173042 (duration: 00m 44s)
18:11 niharika29@tin: Synchronized wmf-config/InitialiseSettings.php: Set project logo for wikimania2018wiki T173042 (duration: 00m 44s)
17:22 mobrovac@tin: Finished deploy [cxserver/deploy@f43ef96]: Bring back cxserver on scb2001 to a stable state - T173038 (duration: 00m 14s)
17:22 mobrovac@tin: Started deploy [cxserver/deploy@f43ef96]: Bring back cxserver on scb2001 to a stable state - T173038
17:12 gehel@tin: Finished deploy [wdqs/wdqs@a1c4f1f]: (no justification provided) (duration: 02m 27s)
17:10 gehel@tin: Started deploy [wdqs/wdqs@a1c4f1f]: (no justification provided)
15:40 bblack: cp1099 - varnish backend restart for mailbox lag
14:55 mobrovac: cxserver depool scb2001 to debug failed checks - T173038
14:54 mobrovac@tin: Finished deploy [cxserver/deploy@1065ffe]: Deploy 1065ffe2 to canary scb2001 for debugging - T173038 (duration: 00m 18s)
14:53 mobrovac@tin: Started deploy [cxserver/deploy@1065ffe]: Deploy 1065ffe2 to canary scb2001 for debugging - T173038
14:52 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1079 - T163190 (duration: 00m 44s)
14:04 zeljkof: EU SWAT finished
14:04 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Add HD logos for srwikisource, update them too (T172268) (duration: 00m 44s)
14:02 zfilipin@tin: Synchronized static/images/project-logos/: SWAT: Add HD logos for srwikisource, update them too (T172268) (duration: 00m 44s)
13:53 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Update logos for srwikinews, add HD version for them (T172255) (duration: 00m 49s)
13:52 jynus: stop dbstore2001 mariadb@s4, start mariadb@s7
13:52 zfilipin@tin: Synchronized static/images/project-logos/: SWAT: Update logos for srwikinews, add HD version for them (T172255) (duration: 00m 44s)
13:46 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Set X-Frame-Options: SAMEORIGIN if UploadWizard enabled (T173631) (duration: 00m 44s)
13:46 ottomata: adding index on (database, rev_timestamp) on mediawiki_page_create_2 table on db1047: T170990
13:37 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Update logos for srwiktionary, add HD logos for srwiktionary (T172245) (duration: 00m 44s)
13:36 zfilipin@tin: Synchronized static/images/project-logos/: SWAT: Update logos for srwiktionary, add HD logos for srwiktionary (T172245) (duration: 00m 45s)
13:27 jynus: stop dbstore2001 mariadb@s7
13:27 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Add new logo for the Baskhir Wikibooks (T173471) (duration: 00m 44s)
13:26 zfilipin@tin: Synchronized static/images/project-logos/: SWAT: Add new logo for the Baskhir Wikibooks (T173471) (duration: 00m 44s)
13:26 ottomata: adding index on (database, rev_timestamp) on mediawiki_page_create_2 table on dbstore1002: T170990
13:11 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Reopen bawikibooks (T173471) (duration: 00m 44s)
13:10 zfilipin@tin: Synchronized dblists/closed.dblist: SWAT: Reopen bawikibooks (T173471) (duration: 00m 44s)
12:52 marostegui: Stop replication on db1079 and db1041 to compare their data - T163190
12:52 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db1079 - T163190 (duration: 00m 45s)
12:35 marostegui: Stop MySQL on db1015 to decommission it - T173570
09:50 jynus: restart dbstore2001 mariadb@s1
09:40 jynus: restart dbstore2001 mariadb@x1
09:29 moritzm: upgrading hhvm-luasandbox on mw1262-1265
09:17 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Remove db1015 - T173570 (duration: 00m 44s)
09:15 marostegui@tin: Synchronized wmf-config/db-codfw.php: Remove db1015 - T173570 (duration: 00m 44s)
09:01 moritzm: upgrading hhvm-luasandbox on mw1261
08:08 marostegui: Rename tables article_assessment, article_assessment_pages, article_assessment_ratings tables from testwiki on db1078 - T173590
07:42 marostegui: Drop cx_drafts table from x1 - T172364
07:13 marostegui: Stop s4 replication thread on db1095 - T172996
07:09 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2047 (duration: 00m 45s)
02:36 l10nupdate@tin: ResourceLoader cache refresh completed at Mon Aug 21 02:36:58 UTC 2017 (duration 7m 3s)
02:29 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.14) (duration: 08m 57s)
2017-08-19
13:17 Amir1: another run: (Hotwc3 â HotWC3) (Lamia Bahy â Albedo11) (MonĂłxido de carbono â Roquetero) (PaulMichaels â PaulBenario) (Rodrigo.dst â RodrigoTavares) (Sadia Tasnim (Moyna) â মŕ§ŕŚšŕŚžŕŚŽŕ§ŕŚŽŕŚŚ সŕ§ŕŚŽŕŚ¨ মাহমŕ§ŕŚŚ) (Syou 18331322 â Ms3102) (TzvetelinaOOD1 â Tzveti1) (World Para Taekwondo â TKD at World Para Taekwondo) (Yaellerner â Ya1levy777) (ĺšłäş äżĺ
â Toshimit) (T173419 )
13:00 Amir1: running the script for ("Clopper228" "CGminded") and ("Gregory.lussier" "StevenSmith83473") (T173419 )
12:47 Amir1: ladsgroup@terbium:~$ mwscript extensions/CentralAuth/maintenance/fixStuckGlobalRename.php --wiki=metawiki --logwiki=metawiki "BolsĂŠe" "Kathmandu2017" (T173419 )
03:21 krinkle@tin: Synchronized wmf-config/InitialiseSettings.php: Enable jQuery 3 on test.wikidata and mediawiki.org - Id4d42d8c53 (duration: 00m 45s)
2017-08-18
21:05 mutante: gerrit2001 - restarted gerrit again after reverting gerrit:372426, using systemctl commands, not 'service' or init.d
20:55 mutante: restarting gerrit on gerrit2001
20:48 demon@tin: Synchronized scap/plugins/clean.py: Completeness (duration: 00m 44s)
20:43 RainbowSprinkles: prior thing was a no-op, testing
20:42 demon@tin: Pruned MediaWiki: 1.30.0-wmf.9 [keeping static files] (duration: 01m 43s)
19:03 bblack: cp1074 - varnish backend restart (mailbox lag)
19:01 bblack: reboot cp4021-8
18:21 bblack: puppeting cp4021-8 - expect possibility of ipsec alerts, etc...
13:32 Amir1: another run of /usr/local/bin/mwscript extensions/Wikidata/extensions/Wikibase/repo/maintenance/rebuildTermSqlIndex.php --wiki wikidatawiki --entity-type=item --from-id $(tail -100 /tmp/rebuildTermSqlIndex.log | grep -E "Processed up to page (\d+?)" | sed -E "s/Processed up to page //; s/ \(Q.+?//" | tail -1) >>/tmp/rebuildTermSqlIndex.log 2>&1
13:26 Amir1: ladsgroup@terbium:~$ mwscript namespaceDupes.php --wiki=hiwikiversity (T172977 )
12:44 Amir1: start of ladsgroup@terbium:~$ time /usr/local/bin/mwscript extensions/Wikidata/extensions/Wikibase/repo/maintenance/rebuildTermSqlIndex.php --wiki wikidatawiki --entity-type=item
11:36 jynus: change topology of dbstore2001:x1 and dbstore2002:x1
11:00 elukey: reboot dbstore1002 for kernel updates
10:34 elukey: restart mysql on dbstore1002 - attempt to reclaim space after big table drop (stop slaves and el_sync, check running queries, stop mysql, check process, start mysql)
09:55 Amir1: one small pass of ladsgroup@terbium:~$ time /usr/local/bin/mwscript extensions/Wikidata/extensions/Wikibase/repo/maintenance/rebuildTermSqlIndex.php --wiki wikidatawiki --entity-type=entity (T171460 )
09:46 Amir1: ladsgroup@terbium:~$ time /usr/local/bin/mwscript extensions/Wikidata/extensions/Wikibase/repo/maintenance/rebuildTermSqlIndex.php --wiki wikidatawiki --entity-type=property (T171460 )
09:42 moritzm: installing libmspack security updates
09:16 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2077 - T168409 (duration: 00m 44s)
07:28 marostegui: Stop MySQL on db2077 to copy it to dbstore2001 - T168409
07:26 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db2077 - T168409 (duration: 00m 44s)
07:20 moritzm: installing openjdk-7 security updates on trusty
06:58 jynus: and nginx
06:55 jynus: systemctl restart squid3.service on install2002, apt seem stuck on some servers
06:24 jynus: stopping and upgrading db2033
05:43 jynus: upgrading and restarting all mariadb instances on dbstore2002
afk: updated fundraising dash from 696a3ff to 7dfc969
00:48 ebernhardson@tin: Synchronized php-1.30.0-wmf.14/extensions/WikimediaEvents/modules/ext.wikimediaEvents.searchSatisfaction.js: T171213 : Increase sampling rate of cirrus satisfaction schema (again) to 1k per bucket per day (duration: 00m 44s)
00:18 ebernhardson@tin: Synchronized php-1.30.0-wmf.14/extensions/WikimediaEvents/modules/ext.wikimediaEvents.searchSatisfaction.js: T171213 : Increase sampling rate of cirrus satisfaction schema (duration: 00m 44s)
2017-08-17
23:31 thcipriani@tin: Synchronized php-1.30.0-wmf.14/extensions/WikimediaEvents/modules/ext.wikimediaEvents.searchSatisfaction.js: SWAT: Revert "Disable cirrus MLR ab test" (duration: 00m 44s)
23:21 thcipriani@tin: Synchronized php-1.30.0-wmf.14/resources/src/mediawiki.rcfilters/dm/mw.rcfilters.dm.FilterGroup.js: SWAT: RCFilters: Fix validation for single_option groups T173303 (duration: 00m 44s)
23:14 thcipriani@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Reapply "Remove temporary wgStructuredChangeFiltersEnableExperimentalViews setting" (duration: 00m 45s)
22:10 ppchelko@tin: Finished deploy [changeprop/deploy@2c553a6]: Lower the concurrecy for transcludes to decrease cassandra load during cluster reshaping (duration: 01m 14s)
22:09 thcipriani@tin: Synchronized php-1.30.0-wmf.14/includes/specials/SpecialNewpages.php: Restore the newFromId() approach in SpecialNewpages::feedItemDesc T173541 (duration: 00m 46s)
22:09 ppchelko@tin: Started deploy [changeprop/deploy@2c553a6]: Lower the concurrecy for transcludes to decrease cassandra load during cluster reshaping
22:07 gwicke: restarting pdfrender on scb100* nodes
21:38 ejegg: updated dash from bec0077 to 696a3ff
21:28 niharika29@tin: Synchronized wmf-config/InitialiseSettings.php: Enable LoginNotify on sites without Echo too. :|https://gerrit.wikimedia.org/r/#/c/372456/ (duration: 00m 44s)
21:19 niharika29@tin: Synchronized wmf-config/InitialiseSettings.php: Enable LoginNotify finally! Yippeeeeee https://gerrit.wikimedia.org/r/#/c/372450/ (duration: 00m 45s)
21:17 bblack: cp1063: varnish backend restart (mailbox lag)
20:07 thcipriani@tin: rebuilt wikiversions.php and synchronized wikiversions files: all wikis to 1.30.0-wmf.14
19:28 chasemp: disable puppet for cloud things for some careful refactor merging
19:14 thcipriani@tin: Synchronized php: group1 wikis to wmf.14 (duration: 00m 46s)
19:12 thcipriani@tin: rebuilt wikiversions.php and synchronized wikiversions files: group1 wikis to wmf.14
19:07 thcipriani@tin: Finished scap: ProofReadPage Revert to db7507246665e69384c1d92af2aedc62263a5116 T173520 (duration: 06m 13s)
19:01 thcipriani@tin: Started scap: ProofReadPage Revert to db7507246665e69384c1d92af2aedc62263a5116 T173520
18:28 ebernhardson: restart logstash on logstash1003 to see why its not reading EventError messages from kafka
18:24 niharika29@tin: Synchronized php-1.30.0-wmf.14/extensions/RevisionSlider/: Revert Reintroduce hover and bar clicking - https://gerrit.wikimedia.org/r/#/c/372384/ (duration: 00m 48s)
18:16 jynus: upgrading and restarting all mariadb instances on dbstore2001
18:12 niharika29@tin: Synchronized php-1.30.0-wmf.14/extensions/LoginNotify/: Log usage statistics https://gerrit.wikimedia.org/r/#/c/372214/ (duration: 00m 51s)
18:03 thcipriani@tin: Synchronized php-1.30.0-wmf.14/extensions/UploadWizard/UploadWizard.config.php: SWAT: Preserve array keys (language keys) when sorting the language dropdown T173522 (duration: 00m 51s)
17:53 jynus: restarting mariadb (dbstore2001:x1) to test new buffer pool configuration
17:20 thcipriani@tin: rebuilt wikiversions.php and synchronized wikiversions files: group1 wikis back to wmf.13 now T173520
17:10 RainbowSprinkles: gerrit: cpu spikes, lots of large gc logs. Looking into it. (nb: things might be a little slow, but it is /up/)
16:54 thcipriani@tin: rebuilt wikiversions.php and synchronized wikiversions files: group1 wikis back to wmf.14 now for T164173
16:51 thcipriani@tin: Synchronized php-1.30.0-wmf.14/includes/jobqueue/jobs/RefreshLinksJob.php: Avoid lock acquisition errors for multi-title refreshlinks jobs T173462 (duration: 00m 51s)
16:45 ejegg: updated SmashPig from c501f53 to 98c5516
16:25 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Pool db2077 - T170662 (duration: 00m 49s)
16:24 marostegui@tin: Synchronized wmf-config/db-codfw.php: Pool db2077 - T170662 (duration: 00m 51s)
16:16 urandom: T169939 : RESTBase: converting (33) new keyspaces to time-windowed compaction
16:11 urandom: T169939 : Lower eqiad compaction throughput from 20MB/s to 15MB/s
15:46 godog: reimage restbase2001 - T169939
15:42 filippo@puppetmaster1001: conftool action : set/pooled=no; selector: name=restbase2001.codfw.wmnet
14:32 bblack@neodymium: conftool action : set/pooled=yes; selector: name=cp3036.*
13:30 zeljkof: EU SWAT finished
13:24 zfilipin@tin: Synchronized static/favicon/wikiversity.ico: SWAT: Update wikiversity favicon (T160491) (duration: 00m 50s)
13:15 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Change $wgArticleCountMethod to any for srwikiquote (T172974) (duration: 00m 51s)
13:08 zfilipin@tin: Synchronized wmf-config/throttle.php: SWAT: Add one throttling exception (T173444) (duration: 00m 51s)
12:38 moritzm: installing libgd/libsoup security updates
12:08 urandom: T169939 : Decommissioning Cassandra/restbase2001-c.codfw.wmnet
12:07 urandom: T169939 : Decommissioning Cassandra/restbase2001-b.codfw.wmnet
10:29 gilles: Thumbor stress test finished
10:20 gilles: Load testing Thumbor
10:13 godog: roll-restart nginx on thumbor to apply https://gerrit.wikimedia.org/r/372199
08:36 godog: roll-restart thumbor to apply webp support change
08:30 godog: restart varnish on cp1062 to fix mailbox lag
08:05 godog: reboot ms-be2021, unreachable on network
06:24 marostegui: Stop slave on db2047 to fix duplicate keys - T151029
05:55 marostegui: Stop replication on db2077 to fix duplicate entries - T151029
05:48 marostegui: Stop replication in sync on db1078 and db1015 - T164488
03:11 l10nupdate@tin: ResourceLoader cache refresh completed at Thu Aug 17 03:11:13 UTC 2017 (duration 7m 18s)
03:03 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.14) (duration: 03m 58s)
02:50 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.13) (duration: 08m 33s)
02:27 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.11) (duration: 08m 52s)
00:45 urandom: T169939 : Decommissioning Cassandra/restbase2001-b.codfw.wmnet
2017-08-16
22:30 urandom: T169939 : Decommissioning Cassandra/restbase2001-a.codfw.wmnet
22:14 urandom: T169939 : Cleaning up wikipedia parsoid snapshots
22:13 urandom: T169939 : Rolling restart of Cassandra complete
21:38 bawolff: deleting private info from securepoll_votes that the script missed due to ref-integ issues for old elections (T173393 )
21:37 thcipriani: train is on hold pending resolution of T173462
21:33 bawolff: deleting private info in enwiki arbcom1_vote table (T173393 )
21:30 urandom: T169939 : Rolling restart of Cassandra instances, eqiad, rack d
21:22 thcipriani@tin: rebuilt wikiversions.php and synchronized wikiversions files: revert group1 wikis to 1.30.0-wmf.14 for T173462
21:21 thcipriani@tin: Synchronized php: revert group1 wikis to 1.30.0-wmf.14 for T173462 (duration: 00m 47s)
20:46 urandom: T169939 : Rolling restart of Cassandra instances, eqiad, rack b
20:45 arlolra@tin: Finished deploy [parsoid/deploy@a9dc803]: Updating Parsoid to 1832a78e (duration: 08m 52s)
20:37 arlolra@tin: Started deploy [parsoid/deploy@a9dc803]: Updating Parsoid to 1832a78e
19:57 urandom: T169939 : Rolling restart of Cassandra instances, eqiad, rack a
19:52 MaxSem: Manually cleaning up PI on enwiki (T173393 )
19:37 thcipriani@tin: Synchronized php: group1 wikis to 1.30.0-wmf.14 (duration: 00m 46s)
19:35 thcipriani@tin: rebuilt wikiversions.php and synchronized wikiversions files: group1 wikis to 1.30.0-wmf.14
19:06 urandom: T169939 : Rolling restart of Cassandra instances, codfw, rack d
18:57 thcipriani@tin: Synchronized wmf-config/CirrusSearch-common.php: SWAT: cirrus Tune ordering of crossproject search results on enwiki T171803 PART II (duration: 00m 50s)
18:56 thcipriani@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: cirrus Tune ordering of crossproject search results on enwiki T171803 PART I (duration: 00m 51s)
18:41 thcipriani@tin: Synchronized wmf-config: SWAT: Apply token count limits to phrase queries on all wikis T172653 (duration: 00m 53s)
18:30 thcipriani@tin: Synchronized php-1.30.0-wmf.13/extensions/WikimediaEvents/modules/ext.wikimediaEvents.searchSatisfaction.js: SWAT: Disable cirrus MLR ab test T171214 (duration: 00m 50s)
18:29 thcipriani@tin: Synchronized php-1.30.0-wmf.14/extensions/WikimediaEvents/modules/ext.wikimediaEvents.searchSatisfaction.js: SWAT: Disable cirrus MLR ab test T171214 (duration: 00m 51s)
18:20 thcipriani@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: JobQueueEventBus: Enable group1 T163380 (duration: 00m 54s)
18:18 urandom: T169939 : Rolling restart of Cassandra instances, codfw, rack c
17:31 urandom: T169939 : Rolling restart of Cassandra instances, codfw, rack b
17:07 demon@tin: Synchronized php-1.30.0-wmf.14/extensions/AntiSpoof/SpoofUser.php: T173394 (duration: 00m 51s)
15:54 marostegui: Stop MySQL and shutdown db1078 for HW checks - T173365
15:43 elukey: drop PageContentSaveComplete_5588433_15423246 from db1047 and dbstore1002 (analytics-slaves)
15:41 godog: delete outdated CFs cassandra metrics from graphite2002 and graphite1003
15:35 marostegui: Rename cx_drafts table on db1029 - T172364
15:09 godog: restart varnish on cp1072 to clear mailbox lag
14:56 godog: restart varnish on cp1099 to clear mailbox lag
14:48 godog: restart varnish on cp1074 to clear mailbox lag
14:44 godog: restart varnish on cp1049 to clear mailbox lag
14:38 gilles: Thumbor stress test finished
14:29 gilles: Stress testing Thumbor from single IP
13:57 zeljkof: EU SWAT finished
13:35 zfilipin@tin: Synchronized dblists/pp_stage1.dblist: SWAT: pagePreviews: Deploy to next 100 stage 1 wikis (T162672) (duration: 00m 50s)
13:09 zfilipin@tin: Synchronized portals: (no justification provided) (duration: 00m 52s)
13:08 zfilipin@tin: Synchronized portals/prod/wikipedia.org/assets: (no justification provided) (duration: 00m 52s)
12:25 Amir1: ladsgroup@terbium:~$ time /usr/local/bin/mwscript extensions/Wikidata/extensions/Wikibase/repo/maintenance/rebuildTermSqlIndex.php --wiki testwikidatawiki --entity-type=property (T172776 , T171460 )
12:25 moritzm: powercycling cp3036
12:24 marostegui: Compressing InnoDB on db2077 - T168409
12:22 jmm@puppetmaster1001: conftool action : set/pooled=no; selector: cp3036.esams.wmnet
10:46 marostegui: Stop replication in sync on db1015 and db1078 - T164488
10:42 marostegui@tin: Synchronized wmf-config/db-codfw.php: Pool db2076 - T170662 (duration: 00m 51s)
10:03 godog: copy ubuntu-font-family-sources to stretch-wikimedia - T170817
08:37 marostegui: Drop wikigrok tables from s1, s3 and s5 - T172020
08:12 marostegui: Stop MySQL on db2047 to copy its content to db2077 - T170662
08:10 moritzm: bounced pdfrender on scb1004 (T159922 )
08:07 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db2047 - T170662 (duration: 01m 06s)
07:25 marostegui: Stop MySQL on db2076 to copy its content to dbstore2001 - T168409
07:08 elukey: executed sudo find -type f -mtime +30 -exec rm {} \; in /var/log/carbon to free some space
06:40 marostegui: Run pt-table-checksum on s3 for revision table - T164488
06:01 marostegui: Stop replication on db2076 to fix duplicate entries - T151029
03:35 l10nupdate@tin: ResourceLoader cache refresh completed at Wed Aug 16 03:35:43 UTC 2017 (duration 7m 21s)
03:28 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.14) (duration: 16m 32s)
02:51 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.13) (duration: 08m 08s)
02:28 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.11) (duration: 08m 51s)
00:43 dereckson@tin: Synchronized php-1.30.0-wmf.12/extensions/Wikidata/Wikidata.php: Explicitly load badges extension (Gerrit:372088 ) (duration: 00m 51s)
00:42 dereckson@tin: Synchronized php-1.30.0-wmf.14/extensions/Wikidata/Wikidata.php: Explicitly load badges extension (Gerrit:372051 ) (duration: 00m 51s)
00:38 dereckson@tin: Synchronized wmf-config/InitialiseSettings.php: Enable Timeless on three French wikis (T154371 ) + Fixes for Wikidata: Remove wbq_evaluation logging, Update Wikidata property blacklist (Gerrit:367913 and Gerrit:370846 ) (duration: 00m 53s)
00:02 dereckson@tin: Synchronized wmf-config/InitialiseSettings.php: Enable WikidataPageBanner on test wikis (T173388 ) (duration: 00m 51s)
2017-08-15
23:58 dereckson@tin: Synchronized php-1.30.0-wmf.14/skins/Timeless/resources/screen-common.less: Fix messed up recent changes/watchlist legends (T173151 ) (duration: 00m 50s)
23:55 dereckson@tin: Synchronized php-1.30.0-wmf.13/skins/Timeless/resources/screen-common.less: Fix messed up recent changes/watchlist legends (T173151 ) (duration: 00m 54s)
23:14 Dereckson: Ran updateArticleCount.php on sr.wikiquote (T172974 )
23:12 dereckson@tin: Synchronized wmf-config/InitialiseSettings.php: Create a few of namespace aliases for hiwikiversity (T172977 ) (duration: 00m 53s)
22:50 ppchelko@tin: Started restart [changeprop/deploy@444223d]: (no justification provided)
22:45 Pchelolo: scb - codfw: enabling puppet and starting changeprop back T169939
21:40 mobrovac: scb - codfw: disabling puppet and stopping changeprop to release load on Cassandra while dropping old tables - T169939
21:26 urandom: Starting Cassandra restbase2005-b
19:24 demon@tin: rebuilt wikiversions.php and synchronized wikiversions files: group0 to wmf.14
19:19 mobrovac@tin: Finished deploy [restbase/deploy@1139d00]: Use only new parsoid tables (before the removal of old ones), part #2 - T169939 (duration: 02m 50s)
19:16 mobrovac@tin: Started deploy [restbase/deploy@1139d00]: Use only new parsoid tables (before the removal of old ones), part #2 - T169939
19:14 demon@tin: Finished scap: bootstrap wmf.14 v2.0 (duration: 30m 56s)
19:04 mobrovac@tin: Started deploy [restbase/deploy@1139d00]: Use only new parsoid tables (before the removal of old ones) - T169939
19:03 mobrovac@tin: Finished deploy [restbase/deploy@1139d00] (staging): Use only new parsoid tables (before the removal of old ones) - T169939 (duration: 04m 39s)
18:58 mobrovac@tin: Started deploy [restbase/deploy@1139d00] (staging): Use only new parsoid tables (before the removal of old ones) - T169939
18:43 demon@tin: Started scap: bootstrap wmf.14 v2.0
18:39 demon@tin: scap failed: CalledProcessError Command '/usr/local/bin/mwscript rebuildLocalisationCache.php --wiki="testwiki" --outdir="/tmp/scap_l10n_3982832257" --threads=10 --lang en --quiet' returned non-zero exit status 255 (duration: 03m 05s)
18:35 demon@tin: Started scap: bootstrap wmf.14
18:18 demon@tin: Pruned MediaWiki: 1.30.0-wmf.12 [keeping static files] (duration: 02m 47s)
17:59 bsitzmann@tin: Finished deploy [mobileapps/deploy@34a1304]: Update mobileapps to 33b80dd (T172829 T152441 T172021 T103362 ) (duration: 04m 00s)
17:55 bsitzmann@tin: Started deploy [mobileapps/deploy@34a1304]: Update mobileapps to 33b80dd (T172829 T152441 T172021 T103362 )
15:21 mobrovac: restarting pdfrender on scb1001, added some debug messages to help us diagnose T159922
13:42 gehel: cleanup of old leftover logs on deployment-logstash2:/var/log/logstash
13:21 moritzm: bounced pdfrender on scb1001 (T159922 )
11:32 moritzm: installing Linux updates on stretch systems (no reboots yet)
10:45 moritzm: installing PHP security updates on trusty
10:09 jynus: rebooting db1078
09:39 jynus@tin: Synchronized wmf-config/db-eqiad.php: depool db1078 (duration: 00m 49s)
07:56 moritzm: installing cvs security updates
07:18 elukey: restart pdfrender on scb1003
02:56 l10nupdate@tin: ResourceLoader cache refresh completed at Tue Aug 15 02:56:14 UTC 2017 (duration 6m 42s)
02:49 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.13) (duration: 07m 53s)
02:28 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.11) (duration: 08m 58s)
2017-08-14
22:16 Dereckson: Previous log entry is related to Gerrit:371960 and Gerrit:371961 .
22:15 dereckson@tin: Synchronized php-1.30.0-wmf.11/extensions/EventBus/JobQueueEventBus.php: JobQueueEventBus: Populate the database field + not set properties are accessed (duration: 00m 48s)
20:44 ppchelko@tin: Finished deploy [restbase/deploy@4d6c706]: Temporary fallback to the new storage buckets before truncation (duration: 09m 35s)
20:35 ppchelko@tin: Started deploy [restbase/deploy@4d6c706]: Temporary fallback to the new storage buckets before truncation
17:09 gehel: resetting mode on stat1005:/srv/published-datasets/discovery recursively
16:29 elukey: execute sudo find -type f -mtime +60 -exec rm {} \; in /var/log/carbon on graphite2001 to free some space in /
13:32 elukey: Execute systemctl mask nfacctd on rhenium.wikimedia.org for T172681
13:08 zeljkof: nothing for EU SWAT
12:21 jynus: stopping replication on all instances of dbstore2001 T169516
11:01 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Add db2076 - T170662 T151029 (duration: 00m 47s)
11:00 marostegui@tin: Synchronized wmf-config/db-codfw.php: Add db2076 - T170662 T151029 (duration: 01m 02s)
10:34 marostegui: Restart MySQL on db1069 to pick up new replcation filters
10:27 marostegui: Restart MySQL on db1095 to pick up new replication filters
09:16 oblivian@puppetmaster1001: conftool action : set/pooled=yes; selector: service=thumbor,name=thumbor2001.codfw.wmnet
09:07 _joe_: stopping thumbor on thumbor2001 after depooling it for testing
09:05 oblivian@puppetmaster1001: conftool action : set/pooled=no; selector: service=thumbor,name=thumbor2001.codfw.wmnet
03:15 l10nupdate@tin: ResourceLoader cache refresh completed at Mon Aug 14 03:15:31 UTC 2017 (duration 7m 6s)
03:08 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.13) (duration: 06m 47s)
02:39 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.11) (duration: 08m 29s)
2017-08-13
22:45 ebernhardson: restart elsaticsearch on elastic1017 after setting md2 readahead to 256 to match md2 on 1032-152
19:36 godog: upload python-thumbor-wikimedia 1.2 - T161719
19:01 godog: bounce pdfrender on scb1001 and scb1003 - T159922
18:46 godog: bounce carbon and uwsgi on graphite1003
17:53 reedy@tin: Synchronized wmf-config/db-eqiad.php: fix comment (duration: 00m 47s)
17:52 reedy@tin: Synchronized wmf-config/db-codfw.php: fix comment (duration: 00m 47s)
17:51 reedy@tin: Synchronized refresh-dblist: phpcs (duration: 00m 48s)
2017-08-12
20:00 krinkle@tin: Synchronized php-1.30.0-wmf.13/includes/jobqueue/JobQueueGroup.php: T171371 - Log job pushes to bogus wikis (duration: 00m 53s)
16:09 Reedy: Deleted some bogus user languages from commonswiki.user_properties
15:25 elukey: powercycle mw2256 (able to use com2 but not to login as root, regular ssh hanging) - T163346
2017-08-11
21:47 legoktm@tin: Finished scap: (no justification provided) (duration: 07m 09s)
21:41 jmm@puppetmaster1001: conftool action : set/pooled=inactive; selector: mw2256.codfw.wmnet
21:40 legoktm@tin: Started scap: (no justification provided)
21:40 legoktm@tin: scap failed: LockFailedError Failed to acquire lock "/var/lock/scap.unknown-but-probably-mediawiki.lock"; owner is "legoktm"; reason is "Deploying Timeless (try 2) - T154371 " (duration: 00m 00s)
21:02 ebernhardson: unban elastic1017 from elasticsearch cluster
20:52 bblack: varnish backend restart on cp1099
20:46 legoktm@tin: Started scap: Deploying Timeless (try 2) - T154371
20:42 legoktm@tin: scap aborted: Deploying Timeless - T154371 (duration: 05m 10s)
20:36 legoktm@tin: Started scap: Deploying Timeless - T154371
20:19 bblack: varnish backend restart on cp1049 + cp1074 (mailbox lag)
19:59 ebernhardson: ban elastic1017 from eqiad search cluster
18:28 moritzm: installing subversion security updates
17:19 jynus@tin: Synchronized wmf-config/db-codfw.php: Repool db2075 (duration: 00m 47s)
17:05 twentyafterfour: Deploying phabricator security update
16:21 jynus_: stop db1069:s6 replication and dropping frwiki, jawiki, ruwiki
15:41 jynus_: stopping db2075 to clone it to dbstore2001
15:39 moritzm: installing git security updates on trusty (jessie/stretch already fixed)
15:38 jynus@tin: Synchronized wmf-config/db-codfw.php: Depool db2075 (duration: 00m 48s)
13:51 elukey: moved the eventbus scap deployment dirs on kafka[12]00[123] to deploy-service:deploy-service to allow scap to depool/pool - T171506
13:49 mobrovac@tin: Finished deploy [eventlogging/eventbus@41e3418]: (no justification provided) (duration: 00m 27s)
13:49 mobrovac@tin: Started deploy [eventlogging/eventbus@41e3418]: (no justification provided)
13:42 mobrovac@tin: Finished deploy [eventlogging/eventbus@41e3418]: (no justification provided) (duration: 00m 13s)
13:42 mobrovac@tin: Started deploy [eventlogging/eventbus@41e3418]: (no justification provided)
13:25 mobrovac@tin: Finished deploy [eventlogging/eventbus@41e3418]: (no justification provided) (duration: 00m 34s)
13:24 mobrovac@tin: Started deploy [eventlogging/eventbus@41e3418]: (no justification provided)
13:16 mobrovac@tin: Finished deploy [eventlogging/eventbus@41e3418]: (no justification provided) (duration: 00m 12s)
13:15 mobrovac@tin: Started deploy [eventlogging/eventbus@41e3418]: (no justification provided)
13:00 mobrovac@tin: Finished deploy [eventlogging/eventbus@41e3418]: (no justification provided) (duration: 00m 17s)
13:00 mobrovac@tin: Started deploy [eventlogging/eventbus@41e3418]: (no justification provided)
12:51 mobrovac@tin: Finished deploy [eventlogging/eventbus@41e3418]: (no justification provided) (duration: 00m 04s)
12:51 mobrovac@tin: Started deploy [eventlogging/eventbus@41e3418]: (no justification provided)
12:44 mobrovac@tin: Finished deploy [eventlogging/eventbus@41e3418]: (no justification provided) (duration: 00m 05s)
12:44 mobrovac@tin: Started deploy [eventlogging/eventbus@41e3418]: (no justification provided)
12:05 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2046 - T151029 (duration: 00m 48s)
11:25 jynus: stopping and upgrading labsdb1010
11:19 marostegui: Stop replication on db2046 to fix duplicate entries - T151029
09:28 jynus: stopping and restarting es2013 for upgrade
09:07 jynus: stopping and restarting db2046 for upgrade
07:35 elukey: restart pdfrender on scb1004
2017-08-10
23:43 maxsem@tin: Synchronized wmf-config/CommonSettings.php: https://gerrit.wikimedia.org/r/#/c/371196/2 (duration: 00m 47s)
23:26 maxsem@tin: Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/370310/ (duration: 00m 47s)
23:25 maxsem@tin: Synchronized wmf-config/CommonSettings.php: https://gerrit.wikimedia.org/r/#/c/370310/ (duration: 00m 46s)
22:31 twentyafterfour@tin: Synchronized php-1.30.0-wmf.13/includes/specials/SpecialDoubleRedirects.php: Hopefully fix T173045 (duration: 00m 48s)
22:17 moritzm: installing git security-updates
20:51 mobrovac@tin: Finished deploy [cassandra/metrics-collector@d0169ee] (staging): First Scap3 deployment, take #2.5 - T137371 (duration: 00m 21s)
20:51 mobrovac@tin: Started deploy [cassandra/metrics-collector@d0169ee] (staging): First Scap3 deployment, take #2.5 - T137371
20:50 mobrovac@tin: Finished deploy [cassandra/metrics-collector@d0169ee] (staging): First Scap3 deployment, take #2 - T137371 (duration: 00m 09s)
20:50 mobrovac@tin: Started deploy [cassandra/metrics-collector@d0169ee] (staging): First Scap3 deployment, take #2 - T137371
20:35 mobrovac@tin: Finished deploy [cassandra/metrics-collector@d0169ee] (staging): First Scap3 deployment - T137371 (duration: 00m 05s)
20:34 mobrovac@tin: Started deploy [cassandra/metrics-collector@d0169ee] (staging): First Scap3 deployment - T137371
20:00 mobrovac@tin: Finished deploy [cassandra/metrics-collector@5db1a43] (staging): First Scap3 deployment - T137371 (duration: 18m 17s)
19:42 mobrovac@tin: Started deploy [cassandra/metrics-collector@5db1a43] (staging): First Scap3 deployment - T137371
19:35 reedy@tin: Synchronized wmf-config/InitialiseSettings.php: touch (duration: 00m 47s)
19:33 reedy@tin: Synchronized wmf-config/: Newsletter on testwiki (duration: 00m 49s)
19:31 mobrovac@tin: Finished deploy [cassandra/logstash-logback-encoder@d085ffa] (aqs): first Scap3 deployment - T116340 (duration: 01m 08s)
19:30 mobrovac@tin: Started deploy [cassandra/logstash-logback-encoder@d085ffa] (aqs): first Scap3 deployment - T116340
19:30 mobrovac@tin: Finished deploy [cassandra/logstash-logback-encoder@d085ffa]: first Scap3 deployment - T116340 (duration: 00m 34s)
19:29 mobrovac@tin: Started deploy [cassandra/logstash-logback-encoder@d085ffa]: first Scap3 deployment - T116340
19:28 mobrovac@tin: Finished deploy [cassandra/logstash-logback-encoder@d085ffa] (staging): first Scap3 deployment (rest of the nodes) - T116340 (duration: 00m 12s)
19:28 mobrovac@tin: Started deploy [cassandra/logstash-logback-encoder@d085ffa] (staging): first Scap3 deployment (rest of the nodes) - T116340
19:27 twentyafterfour@tin: rebuilt wikiversions.php and synchronized wikiversions files: All wikis (except wikidata) to 1.30.0-wmf.13 refs T170631
19:16 godog: restart cassandra instances on xenon to test logstash-logback-encoder deploy
19:13 mobrovac@tin: Finished deploy [cassandra/logstash-logback-encoder@d085ffa] (staging): first Scap3 deployment - T116340 (duration: 00m 03s)
19:13 mobrovac@tin: Started deploy [cassandra/logstash-logback-encoder@d085ffa] (staging): first Scap3 deployment - T116340
18:56 Reedy: created newsletter tables on testwiki
18:55 reedy@tin: Synchronized php-1.30.0-wmf.13/extensions/Newsletter/: sql file updates (duration: 00m 52s)
18:53 reedy@tin: Synchronized php-1.30.0-wmf.13/extensions/WikimediaMaintenance/createExtensionTables.php: newsletter (duration: 00m 52s)
18:21 demon@tin: Synchronized wmf-config/InitialiseSettings.php: Fix commons/commonswiki snafu (duration: 00m 52s)
17:53 kartik@tin: Finished deploy [cxserver/deploy@f43ef96]: (no justification provided) (duration: 00m 44s)
17:52 kartik@tin: Started deploy [cxserver/deploy@f43ef96]: (no justification provided)
17:42 kartik@tin: Finished deploy [cxserver/deploy@1065ffe]: Update cxserver to 686f4f3 (duration: 00m 40s)
17:42 kartik@tin: Started deploy [cxserver/deploy@1065ffe]: Update cxserver to 686f4f3
16:23 ema: cp1072: restart varnish backend
15:48 XioNoX: troubleshoting interface errors between pfw3-codfw and fasw-codfw
15:09 marostegui: Stop replication on db2046 to fix duplicate entries - T151029
15:05 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db2046 - T151029 (duration: 00m 50s)
15:03 marostegui: Poweroff es2013 for maintenance - T172265
14:45 marostegui@tin: Synchronized wmf-config/db-codfw.php: Pool db2075 - T170662 (duration: 00m 51s)
14:32 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2045 - T151029 (duration: 00m 51s)
14:11 elukey: restart kafka1012 temporary with some logs to TRACE to debug T172681
away: updated civicrm from 200abc2 to 4aa177b
13:43 marostegui: Compress cebwiki on db1095 - T153058
13:32 zeljkof: EU SWAT finished
13:26 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Set $wgArticleCountMethod to any on srwikisource (T172974) (duration: 00m 53s)
13:18 zeljkof: EU SWAT, part two
13:17 marostegui: Drop m3 databases from dbstore1002 - T156758
13:16 zeljkof: EU SWAT finished
13:15 zfilipin@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable NewUserMessage on knwiki (T172894) (duration: 00m 52s)
12:56 marostegui: Stop MySQL on db2045 to upgrade socket location - T148507
12:07 elukey: restored varnishakafka on cp3032
11:17 elukey: disabled puppet on cp3032 and restarted varnishkafka with debug logging
09:58 jynus: continuing cloning of dbstore2002 to dbstore2001
09:30 gehel: repooling wdqs2001, long after data reload completed
09:30 gehel@puppetmaster1001: conftool action : set/pooled=yes; selector: name=wdqs2001.codfw.wmnet
09:27 gehel@tin: Finished deploy [wdqs/wdqs@c186e3e]: (no justification provided) (duration: 01m 31s)
09:25 gehel@tin: Started deploy [wdqs/wdqs@c186e3e]: (no justification provided)
09:17 jynus: disabling lag notification on all s4 replicas
08:59 elukey: update librdkafka1 to 0.9.4.1 on eventlog1001
08:15 elukey: add 50G to carbon lv on graphite1003 and 100G on graphite2002
07:32 jynus: enabling semisymc master replication on db1068
07:29 jynus: disabling semisync slave replication on all SSD hosts on s4
06:45 elukey: powercycle mw2256 - T163346
06:38 elukey: restart pdfrender on scb1004
06:04 Dereckson: Removed 2FA for GoldRingChip account (T172878 )
03:23 bblack: cp1008, restbase-dev100[456] have puppet disabled, manual "rm /etc/ssh/userkeys/dzahn"
03:20 bblack: batched cumin puppet agent run on all hosts (not forced)
02:28 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.11) (duration: 09m 12s)
2017-08-09
22:06 maxsem@tin: Synchronized wmf-config/InitialiseSettings.php: Get baaack you broken CodeMirror! (duration: 00m 51s)
21:20 maxsem@tin: Synchronized wmf-config/InitialiseSettings.php: Deploy CodeMirror https://gerrit.wikimedia.org/r/#/c/370943/ (duration: 00m 50s)
21:11 maxsem@tin: Synchronized php-1.30.0-wmf.12/extensions/CodeMirror: https://gerrit.wikimedia.org/r/#/c/370904/ (duration: 00m 51s)
21:05 ppchelko@tin: Finished deploy [restbase/deploy@f97beeb]: Rollback on canary (duration: 07m 30s)
21:01 godog: add 100G to carbon lv on graphite1003 and graphite2002
20:58 ppchelko@tin: Started deploy [restbase/deploy@f97beeb]: Rollback on canary
20:54 ppchelko@tin: Finished deploy [restbase/deploy@f97beeb]: Temporary fallback to the new storage buckets before truncation. All tables created, finish deploy (duration: 07m 12s)
20:51 marostegui: Remove m3 replication from dbstore1002 - T156758
20:47 ppchelko@tin: Started deploy [restbase/deploy@f97beeb]: Temporary fallback to the new storage buckets before truncation. All tables created, finish deploy
20:43 ppchelko@tin: Finished deploy [restbase/deploy@f97beeb]: Temporary fallback to the new storage buckets before truncation. Attempt 3 (duration: 08m 05s)
20:37 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2046 - T170662 (duration: 00m 49s)
20:35 ppchelko@tin: Started deploy [restbase/deploy@f97beeb]: Temporary fallback to the new storage buckets before truncation. Attempt 3
20:22 twentyafterfour: twentyafterfour@tin rebuilt wikiversions.php and synchronized wikiversions files: Actually sync all group1 wikis (except wikidata) to 1.30.0-wmf.13 refs T170631
18:57 ppchelko@tin: Started deploy [restbase/deploy@f97beeb]: Temporary fallback to the new storage buckets before truncation. Attempt 2
18:39 marostegui: Stop replication on db2045 to fix duplicate keys - T151029
18:36 urandom: Restarting Cassandra, restbase2001-b.codfw.wmnet (schema jankiness)
18:27 urandom: Restarting Cassandra, restbase2001-a.codfw.wmnet (schema jankiness)
18:10 ebernhardson: restart es on elastic1018 to test interleaved numa
17:50 marostegui: Add db2076 to tendril - T170662
17:41 marostegui: Compress innodb on s6 on db2076 - T170662
17:31 legoktm: changed mediawiki/vendor to fast forward only
17:24 ppchelko@tin: Finished deploy [restbase/deploy@f97beeb]: Rollback on canary (duration: 01m 27s)
17:23 ppchelko@tin: Started deploy [restbase/deploy@f97beeb]: Rollback on canary
16:35 ppchelko@tin: Finished deploy [restbase/deploy@f97beeb]: Temporary fallback to the new storage buckets before truncation (duration: 04m 46s)
16:30 ppchelko@tin: Started deploy [restbase/deploy@f97beeb]: Temporary fallback to the new storage buckets before truncation
16:29 urandom: T172384 : Upgrading Cassandra in RESTBase dev to 3.11.0-wmf2 (patched to disable use of FastThreadLocal)
16:15 marostegui: Stop mysql on db2046 to copy its content to db2076 - T170662
16:14 elukey: rolling restart of eventstream on scb hosts to deploy https://gerrit.wikimedia.org/r/370793
16:11 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db2046 - T170662 (duration: 00m 50s)
16:08 gehel: config update on logstash, a few logs might be lost during restart - T172713
15:19 XioNoX: removing old pfw related config from cr1-codfw - T171970
14:08 mutante: purging req.urls with "^/resources" from varnish cluster, to fix redirect with cached 404
14:04 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Add db2075 to s5 - T170662 (duration: 00m 50s)
14:03 marostegui@tin: Synchronized wmf-config/db-codfw.php: Add db2075 to s5 - T170662 (duration: 00m 51s)
13:55 marostegui: Reboot db2076 for maintenance - T170662
13:21 marostegui: Stop replication on db2075 for maintenance - T170662
13:18 ladsgroup@tin: Synchronized wmf-config/InitialiseSettings.php: Allow bureaucrats remove confirmed user group (T101983 ) (duration: 00m 51s)
13:10 gehel@tin: Finished deploy [wdqs/wdqs@6620e0f]: (no justification provided) (duration: 01m 34s)
13:09 gehel@tin: Started deploy [wdqs/wdqs@6620e0f]: (no justification provided)
12:41 gehel: restarting updater on wdqs1001 (real fix coming up soon
10:20 jynus: stopping dbstore2002's mysqls and cloning them to dbstore2001
08:39 jynus: disable puppet on dbstore2001, about to be reimaged
07:47 jynus@tin: Synchronized wmf-config/db-eqiad.php: Repool db1065, db1064, db1070 (duration: 01m 04s)
07:12 jynus: stopping replication in sync between db1069 and db1065, db1044, db1064, db1070
07:07 jynus@tin: Synchronized wmf-config/db-eqiad.php: Depool db1065, db1064, db1070 (duration: 00m 47s)
06:26 jynus@tin: Synchronized wmf-config/db-eqiad.php: Repool db1079 (duration: 00m 51s)
06:11 jynus: stopping db1069 (s7) and db1079 replication in sync
06:03 jynus@tin: Synchronized wmf-config/db-eqiad.php: Repool db1085, depool db1079 (duration: 00m 51s)
05:42 jynus: stopping db1069 (s6) and db1085 replication in sync
05:32 jynus@tin: Synchronized wmf-config/db-eqiad.php: Repool db1060, depool db1085 (duration: 00m 50s)
04:56 jynus: stopping db1069 (s2) and db1060 replication in sync
04:54 jynus@tin: Synchronized wmf-config/db-eqiad.php: Depool db1060 (duration: 00m 58s)
04:48 jynus: restarting pdfrender on scb1001, unresponsive
04:09 jynus: powercycling mw2256
03:23 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.13) (duration: 07m 33s)
02:56 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.12) (duration: 07m 45s)
02:38 ebernhardson: restart elasticsearch1017 with niofs store
02:28 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.11) (duration: 08m 38s)
00:27 ebernhardson: test vm.zone_reclaim_mode=0 on elastic1017
2017-08-08
23:23 dereckson@tin: Synchronized wmf-config/InitialiseSettings.php: Allow bureaucrats on WMF wikis to grant and remove 'confirmed' (T101983 ) (duration: 00m 51s)
21:25 twentyafterfour@tin: rebuilt wikiversions.php and synchronized wikiversions files: group0 wikis to 1.30.0-wmf.13 refs T170631
21:08 ppchelko@tin: Finished deploy [restbase/deploy@c16fb6b]: Update summary and licensing information for pageviews API (duration: 08m 31s)
21:05 twentyafterfour@tin: Finished scap: again: deploy 1.30.0-wmf.13 to testwikis and rebuild l10n refs T170631 (duration: 42m 55s)
20:59 ppchelko@tin: Started deploy [restbase/deploy@c16fb6b]: Update summary and licensing information for pageviews API
20:32 twentyafterfour: updated mediawiki.org changelog for 1.30.0-wmf.13
20:22 twentyafterfour@tin: Started scap: again: deploy 1.30.0-wmf.13 to testwikis and rebuild l10n refs T170631
19:46 ema: restart varnish backend on cp1074
19:45 godog: bounce thumbor-instances on thumbor1003 to make sure all memory limits are applied
19:44 twentyafterfour@tin: scap failed: CalledProcessError Command '/usr/local/bin/mwscript rebuildLocalisationCache.php --wiki="test2wiki" --outdir="/tmp/scap_l10n_224168097" --threads=10 --lang en --quiet' returned non-zero exit status 255 (duration: 07m 33s)
19:36 twentyafterfour@tin: Started scap: deploy 1.30.0-wmf.13 to testwikis and rebuild l10n refs T170631
18:28 XioNoX: frack-codfw moved to new infrastructure
18:27 elukey: re-enabled irc-echo after the puppet shower
18:11 elukey: stop ircecho to avoid puppet shower
16:59 twentyafterfour: Branching mediawiki/master to mediawiki/wmf/1.30.0-wmf.13 refs T170631
16:04 elukey: rolling restart of varnishkafka-webrequest to apply https://gerrit.wikimedia.org/r/#/c/370659/ (puppet automatically restarts)
15:03 XioNoX: starting pfw-codfw migration - T171970
14:19 elukey: restart of all the varnishkafka statsv/eventlogging instances on caching hosts to pick up https://gerrit.wikimedia.org/r/370644 (puppet automatic restarts)
14:16 elukey: set mw2256 pooled=inactive + downtime to allow BIOS upgrade - T163346
14:07 ladsgroup@tin: Synchronized wmf-config/Wikibase-production.php: SWAT: Add copyright info for Wikidata API (T112606 ) (duration: 00m 47s)
13:56 gehel: dump of task API for elasticsearch eqiad - T169498
13:54 ladsgroup@tin: Synchronized static/images/project-logos: SWAT: Fix srwiki logos (T150618 ) (duration: 00m 48s)
13:00 elukey: restart varnishkafka-webrequest with kafka.broker.version.fallback=0.9.0.1 + kafka.api.version.request=false on cp3032 (local test, to rollback remove the lines from /etc/varnishkafka/webrequest.conf)
12:49 Amir1: start of ladsgroup@terbium:~$ /usr/local/bin/mwscript extensions/Wikidata/extensions/Wikibase/repo/maintenance/rebuildTermSqlIndex.php --wiki wikidatawiki --entity-type=property --rebuild-all-terms (T172776 )
12:32 elukey: restart pdfrender on scb1002
12:20 Amir1: start of ladsgroup@terbium:~$ /usr/local/bin/mwscript extensions/Wikidata/extensions/Wikibase/repo/maintenance/rebuildTermSqlIndex.php --wiki wikidatawiki --entity-type=property (T172776 )
12:15 gehel: restarting wdqs-updater on wdqs1001
12:14 elukey: stop eventlogging on eventlog1001 to test kafka consumer failures
11:22 Amir1: start of ladsgroup@terbium:~$ timeout 3500s /usr/local/bin/mwscript extensions/Wikidata/extensions/Wikibase/repo/maintenance/rebuildTermSqlIndex.php --wiki wikidatawiki --entity-type=item >>/tmp/rebuildTermSqlIndex.log 2>&1 (T171460 )
10:12 elukey: update librdkafka1* on notebook100[12] and stat1003
09:09 jynus@tin: Synchronized wmf-config/db-eqiad.php: Pool db1098 as the main recentchanges/watchlist s6 role (duration: 00m 47s)
08:46 jynus@tin: Synchronized wmf-config/db-eqiad.php: Pool db1098 at 50% load (duration: 00m 46s)
08:28 jynus@tin: Synchronized wmf-config/db-eqiad.php: Pool db1098 with limited load (duration: 00m 46s)
07:53 jynus@tin: Synchronized wmf-config/db-codfw.php: Add db1098 (duration: 00m 47s)
07:41 elukey: stop puppet on cp3032 (cache::text) to set varnishkafka-webrequest logging to debug
07:34 Amir1: stopped the script and re-running without --deduplicate-terms (T171460 )
07:21 Amir1: start of ladsgroup@terbium:~$ time mwscript extensions/Wikidata/extensions/Wikibase/repo/maintenance/rebuildTermSqlIndex.php --wiki=wikidatawiki --entity-type=property --deduplicate-terms (T171460 )
06:12 elukey: alert users with big home directories for stat1005 disk alarms (will erase data later on only if they don't answer)
06:12 elukey: restart pdfrender on scb1003
03:55 mutante: phab1001 /usr/local/bin/community_metrics.sh | /usr/local/bin/project_changes.sh creating stats mails to admins (which failed before) (T163938 )
03:25 mutante: phab1001 /srv/phab/tools/public_task_dump.py to create dump, was failed cron due to missing /srv/dumps/
02:53 l10nupdate@tin: ResourceLoader cache refresh completed at Tue Aug 8 02:53:13 UTC 2017 (duration 6m 53s)
02:46 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.12) (duration: 06m 59s)
02:36 ejegg: enabled donation queue consumer
02:26 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.11) (duration: 08m 25s)
00:46 ejegg: disabled donation queue consumer
2017-08-07
21:28 urandom: T172384 : Upgrading Cassandra to 3.11.0-wmf1 in dev environment (build patched to disable in-built heap dumping)
19:51 ejegg: updated CiviCRM from f24ba78 to 200abc2
19:05 ebernhardson@tin: Synchronized php-1.30.0-wmf.12/extensions/WikimediaEvents/modules/ext.wikimediaEvents.searchSatisfaction.js: T171212 - Turn on CirrusSearch MLR AB test (duration: 00m 46s)
19:00 ebernhardson@tin: Synchronized wmf-config/CirrusSearch-common.php: T169498 - Enable max token count for phrase rescore on zh lang wikis (step 2) (duration: 00m 46s)
18:59 ebernhardson@tin: Synchronized wmf-config/InitialiseSettings.php: T169498 - Enable max token count for phrase rescore on zh lang wikis (step 1) (duration: 00m 46s)
18:53 ebernhardson@tin: Synchronized wmf-config/CirrusSearch-production.php: T171212 - Update CirrusSearch AB test rescore profiles (duration: 00m 46s)
18:44 ebernhardson@tin: Synchronized wmf-config/Wikibase-labs.php: T112606 - beta only - Add copyright info for Wikidata API (duration: 00m 46s)
18:43 ebernhardson@tin: Synchronized wmf-config/InitialiseSettings.php: T172630 - Enable wgMinervaEnableSiteNotice for kowiki (duration: 00m 46s)
18:38 ebernhardson@tin: Synchronized wmf-config/InitialiseSettings.php: T170687 - Exclude files from Special:ShortPages on commons (duration: 00m 46s)
18:23 ebernhardson@tin: Synchronized php-1.30.0-wmf.12/extensions/CirrusSearch/: T169498 limit phrase token count, T172464 constant boost ltr queries (duration: 00m 58s)
18:10 ebernhardson@tin: Synchronized wmf-config/InitialiseSettings.php: T172594 - Translate sitename for nl.wikinews (duration: 00m 47s)
18:05 ebernhardson@tin: Synchronized wmf-config/InitialiseSettings.php: Grant autopatrol to editor in en.wikibooks - T172561 (duration: 00m 47s)
17:57 mutante: phab2001 - re-enabling puppet, but closing firewall for 80/443
17:56 jynus: stopping slave and reparitioning db1098
17:10 marostegui: Restart s7 instance on db1102 to pick up new replication filters - T172693
17:05 gehel@tin: Finished deploy [wdqs/wdqs@da33919]: (no justification provided) (duration: 02m 28s)
17:02 gehel@tin: Started deploy [wdqs/wdqs@da33919]: (no justification provided)
16:44 marostegui: Restart s7 instance on db1069 to pick up new replication filters - T172693
16:37 XioNoX: manually restarted varnish on cp1099
15:10 thcipriani: restarting jenkins for plugin upgrade
14:51 gehel: reducing elasticsearch eqiad concurrent rebalance to 4 (from 8)
14:38 elukey: updated librdkafka1 and ++1 to 0.9.4.1 on hafnium
14:32 mutante: phab2001 - stopping Apache,schedule downtime for http and puppet
14:22 herron: mx[1,2]001, fermium: Installed libmail-dkim-perl and restarted spamassassin service - T172689
13:15 jynus: reboot db1098
12:39 _joe_: restarting pdfrender on scb1001, T159922
12:39 elukey: restart kafka on kafka1018 to force it out of the kafka topic leaders - T172681
12:26 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2074 - T171321 (duration: 00m 45s)
12:08 gehel: deploying https://gerrit.wikimedia.org/r/#/c/299825/ - some logs will be lost during logstash restart
10:02 marostegui: Add dbstore2002:3313 to tendril - T171321
09:47 jynus: stopping db1050's mysql and cloning it to db1089
09:06 elukey: set net.netfilter.nf_conntrack_tcp_timeout_time_wait=65 (was 120) on all the analytics kafka brokers - T136094
09:03 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2065 after fixing: linter, page and watchlist tables (duration: 00m 47s)
08:12 marostegui: Force BBU re-learn on db1016 - T166344
07:02 marostegui: Stop replication on db2065 to reimport: page, linter and watchlist tables
07:02 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db2065 to reimport: page, linter and watchlist tables (duration: 00m 47s)
06:38 marostegui: Stop MySQL on db2074 - T171321
06:37 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db2074 - T171321 (duration: 00m 46s)
06:33 marostegui: Stop replication on db2075 - T170662
06:27 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2073 - T171321 (duration: 00m 47s)
06:20 marostegui: Force BBU re-learn on db1016 - T166344
02:57 l10nupdate@tin: ResourceLoader cache refresh completed at Mon Aug 7 02:57:42 UTC 2017 (duration 6m 42s)
02:51 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.12) (duration: 07m 56s)
02:30 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.11) (duration: 10m 16s)
2017-08-06
13:17 elukey: powercycle mw2256 - com2 frozen - T163346
13:13 elukey: restart pdfrender on scb1002
06:18 ebernhardson@tin: Synchronized wmf-config/PoolCounterSettings.php: T169498 : Reduce cirrus search pool counter to 200 parallel requests cluster wide (duration: 02m 54s)
01:28 chasemp: conf2002:~# service etcdmirror-conftool-eqiad-wmnet restart (not sure what else to do the service failed)
2017-08-05
14:40 Reedy: created oauth tables on foundationwiki T172591
14:13 reedy@tin: Synchronized php-1.30.0-wmf.12/extensions/WikimediaMaintenance/createExtensionTables.php: add oauth (duration: 00m 48s)
2017-08-04
23:51 mutante: phab2001 - removed outdated /etc/hosts entries, that fixed rsync, syncing /srv/repos/ from phab1001
23:35 mutante: phab2001 rebooting
23:35 mutante: phab2001 - installing various package upgrades, apt-get autoremove old kernel images
23:12 mutante: "reserved" UID 498 for phd on https://wikitech.wikimedia.org/wiki/UID | phab2001: find -exec chown to fix all the files , restart cron
23:04 mutante: phab2001 - changing UID/GID for phd user from 997:997 to 498:498 to make it match phab1001, to fix rsync breaking permissions. (rsync forces --numeric-ids when fetching from and rsyncd configured with chroot=yes). chown -R phw:www-data /srv/repos/
22:37 ejegg: restarted donations and refund queue consumers
21:44 ejegg: stopped donations and refund queue consumers
21:24 urandom: T172384 : Disabling Puppet in dev environment to prevent unattended Cassandra restarts
20:19 mutante: renewing SSL cert for status.wm.org (just like wikitech-static, but that one didnt have monitoring?)
20:02 mutante: wikitech-static-ord - apt-get install certbot
19:41 ejegg: updated CiviCRM from f1fd7f0 to f24ba78
19:38 mutante: renaming graphite varnish director/fixing config, running puppet on cache misc, tested on cp1045
18:17 andrewbogott: switched most cloud instance to new puppetmasters, as per https://phabricator.wikimedia.org/T171786
11:46 marostegui: Deploy schema change directly on s3 master for maiwikimedia - T172485
11:30 marostegui: Deploy schema change directly on s3 master for kbpwiki - T172485
11:14 marostegui: Deploy schema change directly on s3 master for dinwiki - T172485
10:14 marostegui: Deploy schema change directly on s3 master for atjwiki - T172485
10:05 marostegui: Stop replication on db2073 for maintenance
09:22 marostegui: Add dbstore2002 to tendril - T171321
09:19 marostegui: Deploy schema change directly on s3 master for techconductwiki - T172485
08:35 marostegui: Deploy schema change directly on s3 master for hiwikiversity - T172485
08:19 marostegui: Deploy schema change directly on s3 master for wikimania2018wiki - T172485
08:04 marostegui: Sanitize wikimania2018wiki on sanitarium and sanitarium2 - T155041
07:47 marostegui: Stop MySQL on db2073 to copy its data to dbstore2002 - https://phabricator.wikimedia.org/T171321
07:47 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db2073 - T171321 (duration: 00m 47s)
07:07 moritzm: installing imagemagick regression security updates on trusty
06:47 marostegui: Sanitize hiwikiversity on sanitarium and sanitarium2 - T171829
05:23 mutante: phab1001 sudo ip addr del 10.64.32.186/32 dev eth0 (T172478 )
02:28 dzahn@neodymium: conftool action : set/pooled=yes; selector: name=phab1001-vcs.eqiad.wmnet
02:28 dzahn@neodymium: conftool action : set/pooled=no; selector: name=phab1001-vcs.eqiad.wmnet
02:15 mutante: phab1001 can't talk to mx servers via IPv6, but works via IPv4. iridium and other mailservers can also talk IPv6 to it. why? it did not change even when stopping ferm on client and on server it allows from anywhere. workaround for now was to hardcode IPv4 IP in phab config. (T163938 )
02:13 twentyafterfour: outgoing phab mail is working again
01:16 twentyafterfour: twentyafterfour@phab1001:/srv/repos$ sudo chown -R phd:www-data /srv/repos
01:16 twentyafterfour: twentyafterfour@phab1001:/srv/repos$ sudo chmod -x /usr/local/sbin/sync-srv-repos
01:12 mutante: phab1001 - stopped and started exim, which is now running with same options as iridium
00:53 twentyafterfour: starting phd to shut up icinga
00:40 ebernhardson@tin: Synchronized php-1.30.0-wmf.12/extensions/WikimediaEvents/modules/ext.wikimediaEvents.searchSatisfaction.js: Turn CirrusSearch MLR test back off (duration: 00m 47s)
00:35 twentyafterfour: phabricator web service is also up
00:34 mutante: phab1001 - service IPs switched - puppet ran - ssh-phab service up
00:30 twentyafterfour: phab1001 chown -R phd:phd /srv/repos
00:28 twentyafterfour: testing phd on phab1001
00:24 mutante: phab1001 sudo ip addr add 2620:0:861:103:10:64:32:186/128 dev eth0
00:24 mutante: phab1001 sudo ip addr add 10.64.32.186/32 dev eth0
00:23 mutante: iridium sudo ip addr del 2620:0:861:ed1a::3:16/128 dev lo
00:23 mutante: iridiium sudo ip addr del 208.80.154.250/32 dev lo
00:23 mutante: iridium sudo ip addr del 2620:0:861:103:10:64:32:186/128 dev eth0
00:23 mutante: iridium sudo ip addr del 10.64.32.186/32 dev eth0
00:07 twentyafterfour: stoped phd and apache on iridium
00:07 twentyafterfour: stopped ssh-phab on iridium
00:03 mutante: phab1001 - /usr/bin/rsync -av rsync://iridium.eqiad.wmnet/srv-repos /srv/repos/
00:02 twentyafterfour: Taking phabricator down for maintenance / migration to a new server: phab1001
2017-08-03
23:43 ebernhardson@tin: Synchronized php-1.30.0-wmf.12/extensions/WikimediaEvents/modules/ext.wikimediaEvents.searchSatisfaction.js: T171212 : Turn on CirrusSearch MLR AB test (duration: 00m 46s)
23:36 maxsem@tin: Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/370117/2 (duration: 00m 47s)
23:14 ebernhardson@tin: Synchronized php-1.30.0-wmf.12/extensions/CodeMirror/resources/ext.CodeMirror.js: Only show popup if CodeMirror button exists (duration: 00m 46s)
23:12 ebernhardson@tin: Synchronized php-1.30.0-wmf.12/extensions/WikimediaEvents/extension.json: (no justification provided) (duration: 00m 47s)
23:08 ebernhardson@tin: Synchronized wmf-config/CirrusSearch-production.php: CirrusSearch config for MLR test (step 2) (duration: 00m 46s)
23:07 ebernhardson@tin: Synchronized wmf-config/InitialiseSettings.php: CirrusSearch config for MLR test (step 1) (duration: 00m 47s)
22:06 ejegg: restarted donation and refund queue consumers
21:51 maxsem@tin: Synchronized wmf-config/CommonSettings-labs.php: https://gerrit.wikimedia.org/r/#/c/370096/ (duration: 00m 47s)
21:51 ejegg: enabled ingenico_audit drupal module
21:45 ejegg: updated civicrm from 5c741b1 to f1fd7f0
21:43 ejegg: disabled donation and refund queue consumers
21:31 chasemp: clear out nova-fullstack project so we can monitor fresh on new puppetmaster
21:30 maxsem@tin: Synchronized wmf-config/InitialiseSettings.php: Enable CodeMirror https://gerrit.wikimedia.org/r/#/c/370072/ (duration: 00m 47s)
21:20 maxsem@tin: Synchronized php-1.30.0-wmf.12/extensions/CodeMirror: https://gerrit.wikimedia.org/r/#/c/370066/ https://gerrit.wikimedia.org/r/#/c/370074/ (duration: 00m 47s)
21:04 herron: copper /var/cache/pbuilder 95% full - grew /dev/copper-vg/pbuilder fs by +5G and tune2fs -m 0. now at 85% full
20:40 milimetric@tin: Finished deploy [analytics/refinery@cc40bf2]: Fix sqoop script with updated scap config (duration: 11m 51s)
20:29 twentyafterfour@tin: rebuilt wikiversions.php and synchronized wikiversions files: all wikis except wikidata to 1.30.0-wmf.12, leaving wikidata behind due to T172320
20:28 milimetric@tin: Started deploy [analytics/refinery@cc40bf2]: Fix sqoop script with updated scap config
20:24 milimetric@tin: Finished deploy [analytics/refinery@cc40bf2]: Fix sqoop script (duration: 33m 17s)
19:51 milimetric@tin: Started deploy [analytics/refinery@cc40bf2]: Fix sqoop script
19:21 twentyafterfour@tin: Synchronized php-1.30.0-wmf.12/extensions/Wikidata/extensions/Wikibase/client/includes/Changes: be sure https://gerrit.wikimedia.org/r/#/c/369847/ is sync'd (duration: 00m 46s)
19:12 twentyafterfour@tin: rebuilt wikiversions.php and synchronized wikiversions files: group1 wikis to 1.30.0-wmf.12 refs T168053
19:08 twentyafterfour: deploying 1.30.0-wmf.12 to group1, will proceed to group2 after verifying the branch is stable.
18:59 arlolra: Updated Parsoid to 6e1a20d5 (T168765 , T155038 , T165977 )
18:53 demon@tin: Synchronized wmf-config/InitialiseSettings.php: Final gerrit 369960: "pagePreviews: Deploy to first 50 of stage 1 wikis" (duration: 00m 46s)
18:52 arlolra@tin: Finished deploy [parsoid/deploy@48be65b]: Updating Parsoid to 6e1a20d5 (duration: 06m 25s)
18:51 demon@tin: Synchronized wmf-config/CommonSettings.php: new dblist for gerrit 369960 (duration: 00m 46s)
18:46 arlolra@tin: Started deploy [parsoid/deploy@48be65b]: Updating Parsoid to 6e1a20d5
18:41 demon@tin: Synchronized docroot/: gerrit 369960 (duration: 00m 47s)
18:39 demon@tin: Synchronized dblists/: gerrit 369960 (duration: 00m 47s)
18:29 demon@tin: Synchronized wmf-config/InitialiseSettings.php: last part of gerrit 351287 (duration: 00m 47s)
18:23 mobrovac@tin: Finished deploy [restbase/deploy@65af18d]: Expose the recommendation API publicly and activate hiwikiversity - T170877 T168765 (duration: 08m 33s)
18:17 demon@tin: Synchronized wmf-config/CommonSettings.php: Add new dblist for page preview stuff, basically no-op (duration: 00m 47s)
18:15 demon@tin: Synchronized docroot/noc/conf/pp_stage0.dblist: tidy up pp config (duration: 00m 46s)
18:15 mobrovac@tin: Started deploy [restbase/deploy@65af18d]: Expose the recommendation API publicly and activate hiwikiversity - T170877 T168765
18:14 demon@tin: Synchronized dblists/pp_stage0.dblist: clean up config (duration: 00m 47s)
18:11 mobrovac@tin: Finished deploy [restbase/deploy@65af18d] (staging): (no justification provided) (duration: 01m 57s)
18:09 mobrovac@tin: Started deploy [restbase/deploy@65af18d] (staging): (no justification provided)
18:03 arlolra: Updated Parsoid to 651f12c2 (T119802 , T73386 , T170289 )
17:55 andrewbogott: restarting rabbitmq-server on labcontrol1001
17:50 arlolra@tin: Finished deploy [parsoid/deploy@612e711]: Updating Parsoid to 651f12c2 (duration: 11m 17s)
17:39 arlolra@tin: Started deploy [parsoid/deploy@612e711]: Updating Parsoid to 651f12c2
16:15 jynus: repointing dbproxy1008 back to db1043
16:06 demon@tin: Finished deploy [gerrit/gerrit@15f1544]: This is a test, disregard (duration: 00m 03s)
16:05 demon@tin: Started deploy [gerrit/gerrit@15f1544]: This is a test, disregard
16:03 demon@tin: Synchronized wmf-config/: doc fixes, no-op (duration: 00m 48s)
16:02 demon@tin: Synchronized scap/plugins/prep.py: doc fixes, no-op (duration: 00m 47s)
16:00 godog: silence cassandra-related alerts on restbase-dev cluster, known OOMs
15:44 urandom: T172384 : lower tombstone failure threshold in RESTBase dev to 1000
15:39 reedy@tin: Synchronized wmf-config/interwiki.php: Update for 3 new wikis (duration: 00m 46s)
15:31 reedy@tin: rebuilt wikiversions.php and synchronized wikiversions files: techconductwiki
15:27 reedy@tin: Synchronized wmf-config/InitialiseSettings.php: techconductwiki (duration: 00m 46s)
15:25 reedy@tin: Synchronized static/images/project-logos/: techconductwiki (duration: 00m 44s)
15:24 reedy@tin: Synchronized dblists/: techconductwiki (duration: 00m 47s)
15:03 gehel@puppetmaster1001: conftool action : set/pooled=yes; selector: name=elastic1028.eqiad.wmnet
15:02 gehel: unbanning and repooling elastic1028 - T168816
14:58 reedy@tin: Synchronized static/images/project-logos/: hiwikiversity (duration: 00m 46s)
14:57 reedy@tin: Synchronized wmf-config/InitialiseSettings.php: hiwikiversity (duration: 00m 48s)
14:55 reedy@tin: rebuilt wikiversions.php and synchronized wikiversions files: hiwikiversity
14:55 reedy@tin: Synchronized dblists: hiwikiversity (duration: 00m 49s)
14:53 gehel@puppetmaster1001: conftool action : set/pooled=yes; selector: name=elastic10(29|30|31|32).eqiad.wmnet
14:53 Dereckson: mwscript initSiteStats.php --wiki srwikiquote --update (T172241 )
14:52 gehel: unbanning and repooling elastic10(29|30|31|32) - T168816
14:29 reedy@tin: Synchronized wmf-config/InitialiseSettings.php: wikimania2018wiki (duration: 00m 47s)
14:28 reedy@tin: rebuilt wikiversions.php and synchronized wikiversions files: (no justification provided)
14:27 reedy@tin: Synchronized dblists: wikimania2018wiki (duration: 00m 47s)
14:21 marostegui: Restart MySQL on db2019 to get the new socket location updated
14:19 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2045 - T170662 (duration: 00m 46s)
14:06 marostegui: Compress s5 on db2075 - T170662
14:02 gehel@puppetmaster1001: conftool action : set/pooled=no; selector: name=elastic10(28|29|30|31|32).eqiad.wmnet
14:01 gehel: depooling and shutting down elastic10(28|29|30|31|32) - T168816
14:00 marostegui: Add db2075 to tendril - T170662
13:44 marostegui: Enable gtid back on codfw s4 slaves - T170351
13:35 marostegui@tin: Synchronized wmf-config/db-codfw.php: Promote db2051 as s4 codfw master - T170351 (duration: 00m 46s)
13:28 elukey: drop CookieBlock backup tables for T171883
13:26 marostegui: Starting the actual s4 codfw failover db2019 -> db2051 - T170351
13:22 gehel: banning elastic1032 - T168816
13:19 chasemp: disabling puppet acros wmcs things for testing
13:14 marostegui: Start topology change for s4 in codfw, slaves will be moved under db2051 - T170351
13:10 aude@tin: Synchronized php-1.30.0-wmf.12/extensions/Wikidata: Fix bug in InjectRCRecordsJob (duration: 02m 12s)
13:01 marostegui: Disable gtid on s4 codfw slaves to get ready for the topology change - T170351
12:15 marostegui: Restart MySQL on db2051 - T170351
10:21 elukey: removed /run/pacct_shadow.d on stat1005 to allow atopacct.service restart
10:01 moritzm: installing poppler security updates on trusty
09:57 _joe_: systemctl reset-failed puppetmaster.service on labpuppetmaster1002
09:52 moritzm: installing mysql 5.5 security updates (package as shipped by Debian jessie, not our internal wmf-mariadb package)
09:32 gehel: banning elastic103[01] - T168816
08:56 ema: upgrading cache_upload to varnish 4.1.8-1wm1
08:36 gehel: banning elastic102[89] - T168816
08:35 marostegui: Stop MySQL on db2045 to copy its data to db2075 - T170662
08:34 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db2045 - T170662 (duration: 00m 47s)
08:33 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Depool db2045 - T170662 (duration: 00m 47s)
07:58 marostegui: Add db2073 and db2074 to tendril - T170662
06:19 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Add db2074 to s3 - T170662 (duration: 00m 46s)
06:18 marostegui@tin: Synchronized wmf-config/db-codfw.php: Add db2074 to s3 - T170662 (duration: 00m 46s)
06:01 marostegui: Compress s7 on db1102 - T172169
05:57 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2051 - T170351 (duration: 00m 54s)
05:27 marostegui: Deploy alter table on enwiki - labsdb1010 - T166204
05:17 marostegui: Stop replication on labsdb1011 for maintenance - T153743
03:09 l10nupdate@tin: ResourceLoader cache refresh completed at Thu Aug 3 03:09:22 UTC 2017 (duration 7m 18s)
03:02 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.12) (duration: 05m 47s)
02:33 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.11) (duration: 07m 53s)
01:40 twentyafterfour: repositories synced for phabricator migration.
00:28 reedy@tin: Synchronized wmf-config/InitialiseSettings-labs.php: Simply ores (duration: 00m 47s)
00:26 twentyafterfour: scheduled downtime for phabricator migration
00:12 mutante: phab1001 starting ferm service
00:11 mutante: iridium restarted ferm - rsync fragment was in config but not applied, breaking data rsync to phab1001
00:05 mutante: rsyncing /srv/repos from iridium to phab1001 (T163938 )
2017-08-02
23:53 ebernhardson@tin: Finished scap: dblists/rtl.dblist T172305 : Adding RTL database list for project with default RTL languages (duration: 36m 21s)
23:17 ebernhardson@tin: Started scap: dblists/rtl.dblist T172305 : Adding RTL database list for project with default RTL languages
21:48 ejegg: updated SmashPig from f4ca53c to c501f53
20:24 bsitzmann@tin: Finished deploy [mobileapps/deploy@3b61ced]: Update mobileapps to 2d8e8f6 (T170325 ) (duration: 05m 38s)
20:19 bsitzmann@tin: Started deploy [mobileapps/deploy@3b61ced]: Update mobileapps to 2d8e8f6 (T170325 )
19:50 twentyafterfour@tin: rebuilt wikiversions.php and synchronized wikiversions files: group1 wikis to 1.30.0-wmf.11 refs T168053 - rollback due to T172320
19:24 twentyafterfour@tin: rebuilt wikiversions.php and synchronized wikiversions files: group1 wikis to 1.30.0-wmf.12 refs refs T168053
19:23 twentyafterfour: group1 wikis to 1.30.0-wmf.12 refs T168053
18:41 thcipriani@tin: Synchronized php-1.30.0-wmf.12/includes/specials/SpecialRecentchanges.php: SWAT: Follow-up 31be7d0: send tags list if experimental mode is disabled (duration: 00m 47s)
18:21 thcipriani@tin: Synchronized wmf-config/InitialiseSettings-labs.php: SWAT: beta-only change Enable HTML5 sections in betalabs (duration: 00m 46s)
18:20 gehel@puppetmaster1001: conftool action : set/pooled=yes; selector: name=elastic10(24|25|26|27).eqiad.wmnet
18:18 thcipriani@tin: Synchronized php-1.30.0-wmf.12/includes/specials/SpecialUndelete.php: SWAT: Fix Special:Undelete search - use variable and not request param (duration: 00m 46s)
18:18 gehel: un-banning and repooling elastic102[4567] - T168816
18:13 thcipriani@tin: Synchronized wmf-config/jobqueue.php: SWAT: JobQueueEventBus: Enable job events in group0 wikis T163380 Part II (duration: 00m 47s)
18:11 thcipriani@tin: Synchronized wmf-config/InitialiseSettings.php: SWAT: JobQueueEventBus: Enable job events in group0 wikis T163380 Part I (duration: 00m 47s)
17:56 ottomata: restart kafka1012 broker with listeners=PLAINTEXT://:9092 to verify https://gerrit.wikimedia.org/r/#/c/356232/ before merge. This should be a functional no-op
17:44 mobrovac@tin: Finished deploy [cxserver/deploy@f43ef96]: Revert canary scb2001 to f43ef963 , take #2 (duration: 00m 16s)
17:44 mobrovac@tin: Started deploy [cxserver/deploy@f43ef96]: Revert canary scb2001 to f43ef963 , take #2
17:44 mobrovac@tin: Finished deploy [cxserver/deploy@f43ef96]: Revert canary scb2001 to f43ef963 (duration: 00m 29s)
17:43 mobrovac@tin: Started deploy [cxserver/deploy@f43ef96]: Revert canary scb2001 to f43ef963
17:26 mobrovac@tin: Finished deploy [recommendation-api/deploy@8dfae34]: (no justification provided) (duration: 01m 36s)
17:25 mobrovac@tin: Started deploy [recommendation-api/deploy@8dfae34]: (no justification provided)
17:18 herron: installed libmail-spf-perl and restarted spamassassin service on mx[1,2]001 - T172299
16:45 gehel@puppetmaster1001: conftool action : set/pooled=no; selector: name=elastic10(24|25|26|27).eqiad.wmnet
16:43 gehel: depooling and shutting down elastic102[4567] - T168816
15:34 mobrovac@tin: Finished deploy [cxserver/deploy@cf3e280]: (no justification provided) (duration: 00m 22s)
15:34 mobrovac@tin: Started deploy [cxserver/deploy@cf3e280]: (no justification provided)
15:18 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Change db2051 IP - T169501 (duration: 00m 46s)
15:17 marostegui@tin: Synchronized wmf-config/db-codfw.php: Change db2051 IP - T169501 (duration: 00m 46s)
15:17 nschaaf@tin: Finished deploy [recommendation-api/deploy@baa11c0]: source parameter validation (duration: 03m 22s)
15:14 nschaaf@tin: Started deploy [recommendation-api/deploy@baa11c0]: source parameter validation
15:06 marostegui: Poweroff db2051 to get it move to another rack - T169501
14:46 ema: upgrading cache_misc to varnish 4.1.8-1wm1
14:31 ema: varnish 4.1.8-1wm1 (fixes VSV00001, DSA 3924-1) built and uploaded to apt.w.o
14:12 marostegui: Stop MySQL on db2051 in order to get it ready to move to another rack - T170351
13:16 hashar@tin: Synchronized wmf-config/Wikibase-production.php: Write to term_full_entity_id column in wb_terms table in prod too - T167229 (duration: 00m 46s)
13:07 hashar@tin: Synchronized php-1.30.0-wmf.12/extensions/Wikidata: fix constraint type checks - T169326 (duration: 02m 13s)
12:59 gehel: banning elastic102[67] - T168816
12:41 gehel: banning elastic102[45] - T168816
11:42 jynus: disabling autolearn on newest db1* hosts
11:39 jynus: disabling autolearn on newest db2* hosts
11:33 marostegui: Stop MySQL on db2051 for maintenance - T170351
11:29 jynus: setting all es hosts with disabled auto-learn bbu properties
11:22 kartik@tin: (no justification provided)
11:21 kartik@tin: (no justification provided)
11:19 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db2051 - T170351 (duration: 00m 46s)
11:18 kartik@tin: Finished deploy [cxserver/deploy@cf3e280]: Update cxserver to fe03ad7 (duration: 00m 56s)
11:17 kartik@tin: Started deploy [cxserver/deploy@cf3e280]: Update cxserver to fe03ad7
11:12 jynus@tin: Synchronized wmf-config/db-codfw.php: Depool es2013 (duration: 00m 47s)
11:02 hashar: Upgraded tox on CI to 2.5.0
10:49 marostegui: Force a re-learn cycle on es2013
10:18 marostegui: Rename wikigrok tables on enwiki on db1089 - T172020
09:03 marostegui: Drop table click_tracking_user_properties wherever it exists - T115982
09:02 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2057 - T170662 (duration: 00m 46s)
08:51 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Add db2073 to s4 - T170662 (duration: 00m 47s)
08:50 marostegui@tin: Synchronized wmf-config/db-codfw.php: Add db2073 to s4 - T170662 (duration: 00m 57s)
08:32 marostegui: Drop table click_tracking wherever it exists - T115982
08:15 godog: bounce pdfrender on scb1001, stuck
07:09 marostegui: Disable BBU auto-learn on es2013
06:58 marostegui: Stop MySQL on db2057 for maintenance - T148507
06:50 marostegui@tin: Synchronized wmf-config/db-eqiad.php: Repool db1055 - T166204 (duration: 00m 45s)
05:47 demon@tin: Synchronized dblists/: No-op (duration: 00m 47s)
02:59 l10nupdate@tin: ResourceLoader cache refresh completed at Wed Aug 2 02:59:33 UTC 2017 (duration 7m 3s)
02:52 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.12) (duration: 05m 54s)
02:36 eileen: update process-control to 3d3978a
02:27 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.11) (duration: 07m 35s)
2017-08-01
23:47 reedy@tin: Synchronized wmf-config/CommonSettings.php: Make Babel use databasey stuff T145366 (duration: 00m 46s)
23:46 eileen: update process control to33a0262d87aa1512888c82ac6b4ed91de3d3cac5
23:44 reedy@tin: Synchronized wmf-config/CommonSettings.php: wfLoadExtension Scribunto (duration: 00m 46s)
23:43 reedy@tin: Synchronized wmf-config/extension-list: json! (duration: 00m 46s)
23:41 reedy@tin: Synchronized wmf-config/CommonSettings.php: Unbreak ContactPage T172199 (duration: 00m 46s)
23:39 reedy@tin: Synchronized wmf-config/MetaContactPages.php: Unbreak ContactPage T172199 (duration: 00m 46s)
23:36 reedy@tin: Synchronized wmf-config/InitialiseSettings.php: T167071 (duration: 00m 47s)
23:30 reedy@tin: Synchronized wmf-config/InitialiseSettings.php: wikinews - T172211 (duration: 00m 46s)
23:26 reedy@tin: Synchronized php-1.30.0-wmf.12/resources/src/mediawiki.rcfilters/mw.rcfilters.Controller.js: T172156 T171514 (duration: 00m 46s)
23:24 reedy@tin: Synchronized static/images/project-logos/: optimise pngs (duration: 00m 46s)
23:23 eileen: update process_control to ecb6669
23:23 reedy@tin: Synchronized static/apple-touch/: optimise pngs (duration: 00m 46s)
23:21 reedy@tin: Synchronized docroot/noc/css/images/: optimise pngs (duration: 00m 47s)
23:13 reedy@tin: Synchronized wmf-config/InitialiseSettings.php: Disable popups on Special pages T170893 (duration: 00m 47s)
23:09 reedy@tin: Synchronized wmf-config/InitialiseSettings.php: New wordmarks T168203 T171769 (duration: 00m 47s)
23:08 reedy@tin: Synchronized static/images/mobile/copyright/: 2 new workmarks (duration: 00m 48s)
22:57 chasemp: push procs off swap for labcontrol1001 w/ swapoff -a & swapon -a
22:20 maxsem@tin: Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/369434/3 (duration: 00m 47s)
22:20 eileen: tools updated from 58bcbf3 to 9554229
22:17 eileen: update process_control to c198b7f
21:48 demon@tin: Started deploy [gerrit/gerrit@15f1544]: Initial deploy x2
21:47 demon@tin: Finished deploy [gerrit/gerrit@15f1544]: Initial deploy (duration: 00m 40s)
21:47 demon@tin: Started deploy [gerrit/gerrit@15f1544]: Initial deploy
21:46 demon@tin: Started deploy [gerrit/gerrit@15f1544]: (no justification provided)
21:45 demon@tin: Finished deploy [gerrit/gerrit@15f1544]: (no justification provided) (duration: 00m 06s)
21:45 demon@tin: Started deploy [gerrit/gerrit@15f1544]: (no justification provided)
21:44 demon@tin: Started deploy [gerrit/gerrit@15f1544]: (no justification provided)
20:40 twentyafterfour@tin: rebuilt wikiversions.php and synchronized wikiversions files: group0 wikis to 1.30.0-wmf.12 refs refs T168053
19:33 twentyafterfour@tin: Finished scap: Sync 1.30.0-wmf.12 and build l10n refs T168053 (duration: 29m 58s)
19:19 urandom: restarting Cassandra, restbase1004-a.eqiad.wmnet, aberrant read latency
19:03 twentyafterfour@tin: Started scap: Sync 1.30.0-wmf.12 and build l10n refs T168053
18:49 twentyafterfour@tin: Finished deploy [phabricator/deployment@3d728e1]: testing phab1001 deployment (duration: 00m 02s)
18:49 twentyafterfour@tin: Started deploy [phabricator/deployment@3d728e1]: testing phab1001 deployment
18:47 twentyafterfour@tin: Finished deploy [phabricator/deployment@3d728e1]: testing phab1001 deployment (duration: 00m 16s)
18:47 twentyafterfour@tin: Started deploy [phabricator/deployment@3d728e1]: testing phab1001 deployment
18:43 twentyafterfour@tin: Finished deploy [phabricator/deployment@3d728e1]: (no justification provided) (duration: 00m 48s)
18:42 twentyafterfour@tin: Started deploy [phabricator/deployment@3d728e1]: (no justification provided)
17:23 twentyafterfour: MediaWiki train for 1.30.0-wmf.12 - finished `scap prep` & `scap patch` refs T168053
16:41 ejegg: updated CiviCRM from 23f2bbf to 5c741b1
16:25 twentyafterfour: MediaWiki Train: Creating new branch wmf/1.30.0-wmf.12 from master. See T168053 for deployment blockers.
16:17 dcausse: restarting elastic on relforge100x servers to pick up new version of the plugins
15:54 bblack: varnish backend restart on cp1072 (mailbox lag)
15:43 bblack: rebooting lvs1002
15:42 marostegui: db1069: Migrate trwiktionary.page from TokuDB to InnoDB
15:40 bblack: stopping pybal on lvs1002 for impending reboot
15:39 bblack: stopping pybal on 1002 for impending reboot
15:33 marostegui: Stop s3 on db1069 - replication stuck
15:17 marostegui: Stop MySQL on db1055 for maintenance - https://phabricator.wikimedia.org/T148507
14:56 andrewbogott: rebooting labvirt1016
14:46 marostegui: Deploy InnoDB compression on s3 - db2074 for the following tables (revision, pagelinks and templatelinks) - T170662
14:45 ema: lvs1001-1003 (eqiad primaries): upgrade to pybal 1.13.11 - one-packet-scheduling, instrumentation fixes. T104442 , T103882
14:28 ema: lvs1004-1006 (eqiad secondaries): upgrade to pybal 1.13.11 - one-packet-scheduling, instrumentation fixes. T104442 , T103882
14:27 bblack: restart varnish backend on cp1049 (mailbox lag)
14:12 bblack: restart varnish backend on cp1074 (mailbox lag)
13:53 ema@neodymium: conftool action : set/pooled=yes; selector: name=achernar.wikimedia.org,service=pdns_recursor
13:26 ema@neodymium: conftool action : set/pooled=no; selector: name=achernar.wikimedia.org,service=pdns_recursor
13:10 hashar@tin: Synchronized wmf-config/InitialiseSettings.php: Enable OOjs UI EditPage on all wikis except Commons (duration: 00m 44s)
13:06 marostegui: Compress s2 on db1102 - T172169
12:49 elukey: restart hive daemons on analytics1003 to pick up new jvm settings (bigger Xmx, JMX ports)
12:06 dcausse: 100% cpu spike on elastic1023 caused percentiles to jump for a short period of time (T169498 )
12:04 elukey: stop eventlogging_sync on analytics-slaves && rename all CookieBlock* tables (log db) to CookieBlock*_backup - T171883
11:52 marostegui: Stop MySQL on db2057 to copy its data to db2074 - T170662
11:36 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db2057 - T170662 (duration: 00m 43s)
11:10 marostegui@tin: Synchronized wmf-config/db-codfw.php: Repool db2065 - T170662 (duration: 00m 43s)
10:31 hashar: Enabling Zuul/CI again and reenabling puppet on contint1001
10:24 hashar: contint1001 stopped puppet agent to prevent Zuul server to come back up
10:12 hashar: Stopped Zuul / CI for mass mediawiki extension changes
08:55 ema: lvs2001-2003 (codfw primaries): upgrade to pybal 1.13.11 - one-packet-scheduling, instrumentation fixes. T104442 , T103882
08:32 ema: lvs2004-2006 (codfw secondaries): upgrade to pybal 1.13.11 - one-packet-scheduling, instrumentation fixes. T104442 , T103882
08:03 ema: lvs3*: upgrade to pybal 1.13.11 - one-packet-scheduling, instrumentation fixes. T104442 , T103882
07:40 ema: lvs4001, lvs4002 (ulsfo primaries): upgrade to pybal 1.13.11 - one-packet-scheduling, instrumentation fixes. T104442 , T103882
07:35 ema: lvs4003, lvs4004 (ulsfo secondaries): upgrade to pybal 1.13.11 - one-packet-scheduling, instrumentation fixes. T104442 , T103882
07:33 ema: pybal 1.13.11 uploaded to apt.w.o T103882
06:20 marostegui: Stop MySQL on db2065 to copy its data to db2073 - T170662
06:13 marostegui@tin: Synchronized wmf-config/db-codfw.php: Depool db2065 - T170662 (duration: 00m 43s)
05:21 marostegui: Restart MySQL on labsdb1003 as it is totally stuck
03:50 mobrovac@tin: Finished deploy [restbase/deploy@0d12138]: Add nl.wikinews - T171897 (duration: 07m 57s)
03:42 mobrovac@tin: Started deploy [restbase/deploy@0d12138]: Add nl.wikinews - T171897
02:32 l10nupdate@tin: ResourceLoader cache refresh completed at Tue Aug 1 02:32:17 UTC 2017 (duration 6m 36s)
02:25 l10nupdate@tin: scap sync-l10n completed (1.30.0-wmf.11) (duration: 07m 46s)
2000s
2010s
2020s
Toggle limited content width