2021-11-16
§
|
12:29 |
<moritzm> |
installing Linux 4.19.208 updates on buster hosts (no reboots) |
[production] |
12:24 |
<mmandere@cumin1001> |
START - Cookbook sre.hosts.reimage for host cp6007.drmrs.wmnet with OS buster |
[production] |
12:22 |
<btullis@cumin1001> |
END (PASS) - Cookbook sre.druid.roll-restart-workers (exit_code=0) for Druid public cluster: Roll restart of Druid jvm daemons. |
[production] |
12:13 |
<mmandere@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp6006.drmrs.wmnet with OS buster |
[production] |
11:55 |
<moritzm> |
failover ganeti master in test cluster to ganeti-test2002 |
[production] |
11:34 |
<mmandere@cumin1001> |
START - Cookbook sre.hosts.reimage for host cp6006.drmrs.wmnet with OS buster |
[production] |
11:31 |
<btullis@cumin1001> |
START - Cookbook sre.druid.roll-restart-workers for Druid public cluster: Roll restart of Druid jvm daemons. |
[production] |
11:03 |
<btullis@cumin1001> |
END (PASS) - Cookbook sre.hadoop.roll-restart-masters (exit_code=0) restart masters for Hadoop analytics cluster: Restart of jvm daemons. |
[production] |
10:30 |
<mmandere@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp6005.drmrs.wmnet with OS buster |
[production] |
10:21 |
<ema> |
A:cp re-enable puppet after successful test on cp402[17] T293879 |
[production] |
10:20 |
<btullis@cumin1001> |
START - Cookbook sre.hadoop.roll-restart-masters restart masters for Hadoop analytics cluster: Restart of jvm daemons. |
[production] |
10:15 |
<moritzm> |
installing testvm2001 |
[production] |
10:06 |
<arturo> |
updating deb packages on stretch-wikimedia/thirdparty/kubeadm-k8s-1-21 (T282942) |
[production] |
10:02 |
<ema> |
A:cp disable puppet to test https://gerrit.wikimedia.org/r/c/operations/puppet/+/738910 on cp4021 T293879 |
[production] |
09:51 |
<mmandere@cumin1001> |
START - Cookbook sre.hosts.reimage for host cp6005.drmrs.wmnet with OS buster |
[production] |
09:48 |
<mmandere@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp6004.drmrs.wmnet with OS buster |
[production] |
09:40 |
<ayounsi@deploy1002> |
Finished deploy [homer/deploy@c570af3]: Homer CR738905 (duration: 01m 25s) |
[production] |
09:39 |
<ayounsi@deploy1002> |
Started deploy [homer/deploy@c570af3]: Homer CR738905 |
[production] |
09:09 |
<mmandere@cumin1001> |
START - Cookbook sre.hosts.reimage for host cp6004.drmrs.wmnet with OS buster |
[production] |
08:54 |
<mmandere@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp6003.drmrs.wmnet with OS buster |
[production] |
08:14 |
<mmandere@cumin1001> |
START - Cookbook sre.hosts.reimage for host cp6003.drmrs.wmnet with OS buster |
[production] |
08:04 |
<mmandere@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp6002.drmrs.wmnet with OS buster |
[production] |
07:25 |
<mmandere@cumin1001> |
START - Cookbook sre.hosts.reimage for host cp6002.drmrs.wmnet with OS buster |
[production] |
02:10 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
02:07 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
00:28 |
<urbanecm> |
UTC late window done |
[production] |
00:25 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
00:23 |
<urbanecm@deploy1002> |
Synchronized php-1.38.0-wmf.7/extensions/WikimediaEvents/: 738399: 739004: WikimediaEvents backports (T294738) (duration: 00m 56s) |
[production] |
00:21 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
00:19 |
<urbanecm@deploy1002> |
Synchronized wmf-config/InitialiseSettings.php: 50d9f2687cd11e6f838313a530c6bbd498d0b83e: GrowthExperiments: Set up GEHomepageNewAccountVariantsByPlatform (T294737) (duration: 00m 56s) |
[production] |
00:11 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
00:07 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
2021-11-15
§
|
23:10 |
<legoktm@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host thumbor1005.eqiad.wmnet |
[production] |
22:59 |
<legoktm@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host thumbor1005.eqiad.wmnet |
[production] |
22:58 |
<legoktm@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on thumbor1005.eqiad.wmnet with reason: reboot after first puppet run |
[production] |
22:58 |
<legoktm@cumin1001> |
START - Cookbook sre.hosts.downtime for 4:00:00 on thumbor1005.eqiad.wmnet with reason: reboot after first puppet run |
[production] |
21:46 |
<bblack> |
dns6002 - reboot for another round of bios fixups |
[production] |
21:32 |
<bblack> |
dns6001 - reboot for another round of bios fixups |
[production] |
21:21 |
<legoktm> |
uploaded php7.4_7.4.25-1+wmf2+buster1_amd64.changes to apt.wm.o with patch for T293568 |
[production] |
21:19 |
<mutante> |
removing mediawiki font packages from remaining regular appservers globally (T294378) |
[production] |
20:49 |
<mutante> |
retiring https://scholarships.wikimedia.org - removing from ATS (T243037) |
[production] |
20:49 |
<mmandere@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp6001.drmrs.wmnet with OS buster |
[production] |
20:09 |
<Amir1> |
revoked all grants from wikiadmin and gave back an explicit list on clouddb1013:3311 (T249683) |
[production] |
20:08 |
<Amir1> |
revoked all grants from wikiadmin and gave back an explicit list on clouddb1021:3311 (T249683) |
[production] |
20:07 |
<mmandere@cumin1001> |
START - Cookbook sre.hosts.reimage for host cp6001.drmrs.wmnet with OS buster |
[production] |
20:03 |
<Amir1> |
revoked all grants from wikiadmin and gave back an explicit list on db1102:3312 (T249683) |
[production] |
19:57 |
<mmandere@cumin1001> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cp6001.drmrs.wmnet with OS buster |
[production] |
19:49 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
19:46 |
<Amir1> |
revoked all grants from wikiadmins and gave back explicit list on db2101:3315 (T249683) |
[production] |
19:46 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |