3901-3950 of 10000 results (40ms)
2021-02-19 §
13:43 <akosiaris@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'echostore' for release 'staging' . [production]
13:43 <akosiaris@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'echostore' for release 'production' . [production]
13:43 <akosiaris@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'echostore' for release 'staging' . [production]
13:43 <akosiaris@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'echostore' for release 'staging' . [production]
13:43 <akosiaris@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'echostore' for release 'production' . [production]
13:42 <akosiaris@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'echostore' for release 'staging' . [production]
13:42 <akosiaris@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'echostore' for release 'production' . [production]
13:41 <godog> reset-failed ifup@ens13 on prometheus5001 - T273026 [production]
13:39 <filippo@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host prometheus5001.eqsin.wmnet [production]
13:31 <gehel@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wdqs1010.eqiad.wmnet with reason: REIMAGE [production]
13:29 <gehel@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on wdqs1010.eqiad.wmnet with reason: REIMAGE [production]
13:22 <filippo@cumin1001> START - Cookbook sre.hosts.reboot-single for host prometheus5001.eqsin.wmnet [production]
09:27 <elukey@cumin1001> END (PASS) - Cookbook sre.hadoop.stop-cluster (exit_code=0) for Hadoop backup cluster: Stop the Hadoop cluster before maintenance. - elukey@cumin1001 [production]
09:16 <elukey@cumin1001> START - Cookbook sre.hadoop.stop-cluster for Hadoop backup cluster: Stop the Hadoop cluster before maintenance. - elukey@cumin1001 [production]
08:40 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-airflow1001.eqiad.wmnet [production]
08:34 <elukey@cumin1001> START - Cookbook sre.hosts.reboot-single for host an-airflow1001.eqiad.wmnet [production]
08:06 <godog> swift codfw-prod: more weight to ms-be20[58-61] - T269337 [production]
08:04 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-worker1108.eqiad.wmnet [production]
07:47 <elukey@cumin1001> START - Cookbook sre.hosts.reboot-single for host an-worker1108.eqiad.wmnet [production]
02:26 <robh@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-worker1133.eqiad.wmnet with reason: REIMAGE [production]
02:24 <robh@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1133.eqiad.wmnet with reason: REIMAGE [production]
01:22 <mutante> mwmaint2001 back on buster and back in scap dsh groups (if anything pops up you can revert 665175) [production]
01:19 <mutante> deleting my huge build from puppet-compiler that failed because it made the compiler instance run out of disk to run on * [production]
01:03 <urbanecm@deploy1001> Synchronized php-1.36.0-wmf.30/includes/ProtectionForm.php: d305308a5d46a3f86bf0b211e8a733c0a951ddc1: field descriptors in HTMLForm must have keys (T275018; T274980) (duration: 01m 08s) [production]
01:02 <urbanecm@deploy1001> Synchronized php-1.36.0-wmf.31/includes/ProtectionForm.php: 2487c253b090d93daf85adae8ceb9d255cbf4ff2: field descriptors in HTMLForm must have keys (T275018; T274980) (duration: 01m 10s) [production]
00:54 <mutante> mwmaint2001 - back from reimage - scap pull [production]
00:26 <urbanecm@deploy1001> Synchronized static/images/project-logos/wikimedia-cloud-services.svg: 686acba2f31df0d454c6f1c506c042af50b5cce0: Restore logos on Vector (classic version) and use cloud icon for labs (T274210) (duration: 01m 07s) [production]
00:13 <dpifke@deploy1001> Synchronized wmf-config/PhpAutoPrepend.php: Deploying excimer-wall profiler pipeline T253160 (duration: 01m 03s) [production]
00:12 <dpifke@deploy1001> Synchronized wmf-config/profiler.php: Deploying excimer-wall profiler pipeline T253160 (duration: 01m 02s) [production]
2021-02-18 §
23:48 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mwmaint2001.codfw.wmnet with reason: REIMAGE [production]
23:46 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mwmaint2001.codfw.wmnet with reason: REIMAGE [production]
23:26 <dancy@deploy1001> Synchronized wmf-config/: Syncing https://gerrit.wikimedia.org/r/c/operations/mediawiki-config/+/634552 (duration: 01m 07s) [production]
23:22 <dancy@deploy1001> Synchronized wmf-config/CommonSettings.php: Syncing https://gerrit.wikimedia.org/r/c/operations/mediawiki-config/+/634551 (duration: 01m 08s) [production]
23:15 <dancy@deploy1001> Synchronized src/ServiceConfig.php: (no justification provided) (duration: 03m 21s) [production]
23:11 <mutante> mwmaint2001 - will be rebooted for OS upgrade - T267607 [production]
23:10 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on mwmaint2001.codfw.wmnet with reason: OS upgrade [production]
23:10 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on mwmaint2001.codfw.wmnet with reason: OS upgrade [production]
23:04 <mutante> mwmaint1002 - rsyncing data from mwmaint2001 [production]
22:30 <mutante> mwmaint2001 - tar-gzipping a lot of old user home data I keep finding, partially museum worthy from several maintenance hosts ago, like places like /root/home-mwmaint1001/username/home-terbium/iron/ :p [production]
21:29 <marxarelli> 1.36.0-wmf.31 rolled back due to T275161 and new logspam (T271345) [production]
21:26 <dduvall@deploy1001> rebuilt and synchronized wikiversions files: Revert "all wikis to 1.36.0-wmf.31" [production]
20:09 <dduvall@deploy1001> rebuilt and synchronized wikiversions files: all wikis to 1.36.0-wmf.31 [production]
19:27 <urbanecm@deploy1001> Synchronized wmf-config/InitialiseSettings.php: f33f9f71b13d9b9276df88ef6384ec6028ee2e1d: Make DiscussionTools replytool available for everyone on gomwiktionary (T258554) (duration: 01m 05s) [production]
19:25 <mutante> mwmaint2001 - deleting 'home-terbium' from all home directories (yes, it's in Bacula if you really used that, hope you didn't, it's been years since terbium) [production]
19:25 <urbanecm@deploy1001> Synchronized wmf-config/InitialiseSettings.php: da7b8123ecb373c1de1634ae867fb2f5fbee89ad: Enable DiscussionTools beta feature for newtopictool on arwiki, cswiki, huwiki (T273145) (duration: 01m 12s) [production]
19:20 <urbanecm@deploy1001> Synchronized php-1.36.0-wmf.31/extensions/DiscussionTools/: 1cc29df: 6b88aff: DiscussionTools backports (T272666; T274949) (duration: 01m 08s) [production]
19:19 <urbanecm@deploy1001> sync-file aborted: 1cc29df DiscussionTools backports (T272666; T274949) (duration: 00m 00s) [production]
19:17 <urbanecm@deploy1001> Synchronized php-1.36.0-wmf.30/extensions/DiscussionTools/: 9c6cdf5: 97acef6: DiscussionTools backports (T272666; T274949) (duration: 01m 26s) [production]
19:04 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mwmaint2001.codfw.wmnet with reason: OS upgrade [production]
19:04 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mwmaint2001.codfw.wmnet with reason: OS upgrade [production]