2021-02-18
§
|
23:48 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mwmaint2001.codfw.wmnet with reason: REIMAGE |
[production] |
23:46 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mwmaint2001.codfw.wmnet with reason: REIMAGE |
[production] |
23:26 |
<dancy@deploy1001> |
Synchronized wmf-config/: Syncing https://gerrit.wikimedia.org/r/c/operations/mediawiki-config/+/634552 (duration: 01m 07s) |
[production] |
23:22 |
<dancy@deploy1001> |
Synchronized wmf-config/CommonSettings.php: Syncing https://gerrit.wikimedia.org/r/c/operations/mediawiki-config/+/634551 (duration: 01m 08s) |
[production] |
23:15 |
<dancy@deploy1001> |
Synchronized src/ServiceConfig.php: (no justification provided) (duration: 03m 21s) |
[production] |
23:11 |
<mutante> |
mwmaint2001 - will be rebooted for OS upgrade - T267607 |
[production] |
23:10 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on mwmaint2001.codfw.wmnet with reason: OS upgrade |
[production] |
23:10 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 1:00:00 on mwmaint2001.codfw.wmnet with reason: OS upgrade |
[production] |
23:04 |
<mutante> |
mwmaint1002 - rsyncing data from mwmaint2001 |
[production] |
22:30 |
<mutante> |
mwmaint2001 - tar-gzipping a lot of old user home data I keep finding, partially museum worthy from several maintenance hosts ago, like places like /root/home-mwmaint1001/username/home-terbium/iron/ :p |
[production] |
21:29 |
<marxarelli> |
1.36.0-wmf.31 rolled back due to T275161 and new logspam (T271345) |
[production] |
21:26 |
<dduvall@deploy1001> |
rebuilt and synchronized wikiversions files: Revert "all wikis to 1.36.0-wmf.31" |
[production] |
20:45 |
<wm-bot> |
<lucaswerkmeister> deployed a0ba7b84ab (quickfix) |
[tools.lexeme-forms] |
20:44 |
<wm-bot> |
<lucaswerkmeister> deployed 23ccbcf6f6 (work around T272319) |
[tools.lexeme-forms] |
20:09 |
<dduvall@deploy1001> |
rebuilt and synchronized wikiversions files: all wikis to 1.36.0-wmf.31 |
[production] |
19:27 |
<urbanecm@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: f33f9f71b13d9b9276df88ef6384ec6028ee2e1d: Make DiscussionTools replytool available for everyone on gomwiktionary (T258554) (duration: 01m 05s) |
[production] |
19:25 |
<mutante> |
mwmaint2001 - deleting 'home-terbium' from all home directories (yes, it's in Bacula if you really used that, hope you didn't, it's been years since terbium) |
[production] |
19:25 |
<urbanecm@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: da7b8123ecb373c1de1634ae867fb2f5fbee89ad: Enable DiscussionTools beta feature for newtopictool on arwiki, cswiki, huwiki (T273145) (duration: 01m 12s) |
[production] |
19:20 |
<urbanecm@deploy1001> |
Synchronized php-1.36.0-wmf.31/extensions/DiscussionTools/: 1cc29df: 6b88aff: DiscussionTools backports (T272666; T274949) (duration: 01m 08s) |
[production] |
19:19 |
<urbanecm@deploy1001> |
sync-file aborted: 1cc29df DiscussionTools backports (T272666; T274949) (duration: 00m 00s) |
[production] |
19:17 |
<urbanecm@deploy1001> |
Synchronized php-1.36.0-wmf.30/extensions/DiscussionTools/: 9c6cdf5: 97acef6: DiscussionTools backports (T272666; T274949) (duration: 01m 26s) |
[production] |
19:04 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mwmaint2001.codfw.wmnet with reason: OS upgrade |
[production] |
19:04 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mwmaint2001.codfw.wmnet with reason: OS upgrade |
[production] |
18:38 |
<razzi> |
rebalance kafka partition for webrequest_upload partition 1 |
[analytics] |
17:27 |
<elukey> |
an-coord1002 back in service with raid1 configured |
[analytics] |
16:51 |
<volans> |
uploaded python3-wmflib_0.0.7 to apt.wikimedia.org buster-wikimedia |
[production] |
16:23 |
<shdubsh> |
restart ircecho on kraz -- deploying new metrics endpoint T216611 |
[production] |
16:05 |
<moritzm> |
installing libmaxminddb updates from buster 10.8 point release |
[production] |
15:48 |
<elukey> |
stop hive/mysql on an-coord1002 as precautionary step to rebuild the md array |
[analytics] |
15:33 |
<_joe_> |
rebuilding base images for stretch,buster |
[production] |
15:30 |
<moritzm> |
installing PHP 7.3 security updates on buster |
[production] |
15:06 |
<godog> |
swift codfw-prod decrease HDD weight for ms-be20[16-27] - T272837 |
[production] |
14:50 |
<arturo> |
rebooting cloudnet1004 for T271058 |
[admin] |
14:35 |
<moritzm> |
installing libzstd security updates on Buster |
[production] |
13:59 |
<moritzm> |
installing intel-microcode security updates on buster |
[production] |
13:49 |
<jynus> |
restart db1150 T271913 |
[production] |
13:10 |
<elukey> |
failover analytics-hive to an-coord1001 after maintenance (DNS change) |
[analytics] |
12:24 |
<arturo> |
delete couple of VMs no longer in use (arturo-puppetmaster, arturo-cloudgw-test) |
[testlabs] |
12:20 |
<jynus> |
restart db1140 T271913 |
[production] |
12:01 |
<urbanecm@deploy1001> |
Synchronized php-1.36.0-wmf.31/includes/HookContainer/DeprecatedHooks.php: 28aa8718549b76c88e9757a273e0c602479b8d8b: Silent deprecate ProtectionForm::buildForm (T274889) (duration: 01m 14s) |
[production] |
11:49 |
<jynus> |
restart db1102 T271913 |
[production] |
11:32 |
<elukey> |
restart hive daemons on an-coord1001 to pick up new parquet settings |
[analytics] |
11:13 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-eqiad.php: Repool pc1009 (duration: 01m 09s) |
[production] |
11:04 |
<marostegui> |
Upgrade and reboot pc1009 |
[production] |