2021-02-17
11:55 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host netbox-dev2001.wikimedia.org [production]
11:24 <marostegui@cumin1001> dbctl commit (dc=all): 'Slowly pool db1172 in s8 - T258361', diff saved to https://phabricator.wikimedia.org/P14389 and previous config saved to /var/cache/conftool/dbconfig/20210217-112422-marostegui.json [production]
11:08 <akosiaris@deploy1001> helmfile [staging-codfw] DONE helmfile.d/admin 'sync'. [production]
11:08 <akosiaris@deploy1001> helmfile [staging-codfw] START helmfile.d/admin 'sync'. [production]
11:04 <akosiaris@deploy1001> helmfile [staging-codfw] DONE helmfile.d/admin 'sync'. [production]
11:04 <akosiaris@deploy1001> helmfile [staging-codfw] START helmfile.d/admin 'sync'. [production]
11:04 <akosiaris@deploy1001> helmfile [staging-codfw] DONE helmfile.d/admin 'sync'. [production]
11:03 <akosiaris@deploy1001> helmfile [staging-codfw] START helmfile.d/admin 'sync'. [production]
10:52 <aborrero@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 30 days, 0:00:00 on cloudnet1004.eqiad.wmnet with reason: hardware failure [production]
10:52 <aborrero@cumin1001> START - Cookbook sre.hosts.downtime for 30 days, 0:00:00 on cloudnet1004.eqiad.wmnet with reason: hardware failure [production]
10:13 <_joe_> depooling mw1331 to perform some tests for T266855 [production]
10:08 <aborrero@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
10:01 <aborrero@cumin1001> START - Cookbook sre.dns.netbox [production]
09:32 <elukey> reboot dbstore100[3-5] for kernel upgrades [production]
08:44 <marostegui> upgrade es2020 es2021 es2022's kernel [production]
08:41 <marostegui@cumin1001> dbctl commit (dc=all): 'Slowly pool db1172 in s8 - T258361', diff saved to https://phabricator.wikimedia.org/P14388 and previous config saved to /var/cache/conftool/dbconfig/20210217-084120-marostegui.json [production]
08:08 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host stat1004.eqiad.wmnet [production]
08:04 <elukey@cumin1001> START - Cookbook sre.hosts.reboot-single for host stat1004.eqiad.wmnet [production]
07:41 <marostegui@cumin1001> dbctl commit (dc=all): 'Slowly pool db1172 in s8 - T258361', diff saved to https://phabricator.wikimedia.org/P14387 and previous config saved to /var/cache/conftool/dbconfig/20210217-074107-marostegui.json [production]
07:40 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host stat1007.eqiad.wmnet [production]
07:33 <elukey@cumin1001> START - Cookbook sre.hosts.reboot-single for host stat1007.eqiad.wmnet [production]
07:30 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host stat1006.eqiad.wmnet [production]
07:23 <elukey@cumin1001> START - Cookbook sre.hosts.reboot-single for host stat1006.eqiad.wmnet [production]
07:21 <marostegui@cumin1001> dbctl commit (dc=all): 'Pool db1172 in s8 for the first time - T258361', diff saved to https://phabricator.wikimedia.org/P14386 and previous config saved to /var/cache/conftool/dbconfig/20210217-072131-marostegui.json [production]
07:21 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host stat1004.eqiad.wmnet [production]
07:16 <marostegui> Add x1 to orchestrator [production]
07:04 <elukey@cumin1001> START - Cookbook sre.hosts.reboot-single for host stat1004.eqiad.wmnet [production]
07:01 <marostegui> Restart db1103 (x1) primary master DONE - T273758 [production]
07:00 <marostegui> Restart db1103 (x1) primary master - T273758 [production]
06:39 <marostegui@cumin1001> dbctl commit (dc=all): 'Add db1172 to dbctl, but not pooled yet T258361', diff saved to https://phabricator.wikimedia.org/P14385 and previous config saved to /var/cache/conftool/dbconfig/20210217-063915-marostegui.json [production]
01:41 <mutante> mwdebug1001 - back on buster and pooled [production]
01:41 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mwdebug1001.eqiad.wmnet [production]
01:39 <mutante> mwdebug1001 - rebooting [production]
01:04 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mw1345.eqiad.wmnet [production]
01:04 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mw1351.eqiad.wmnet [production]
01:00 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=mwdebug1001.eqiad.wmnet [production]
01:00 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mwdebug1001.eqiad.wmnet [production]
00:58 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=mw1345.eqiad.wmnet [production]
00:49 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=mw1351.eqiad.wmnet [production]
00:33 <mutante> mw1351 - powercycled [production]
00:27 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=mwdebug1001.eqiad.wmnet [production]
00:17 <legoktm@deploy1001> Synchronized php-1.36.0-wmf.30/extensions/timeline/: Add $wgTimelineFontDirectory to be passed as GDFONTPATH (T274822) (duration: 01m 06s) [production]
00:15 <legoktm@deploy1001> Synchronized php-1.36.0-wmf.31/extensions/timeline/: Add $wgTimelineFontDirectory to be passed as GDFONTPATH (T274822) (duration: 01m 02s) [production]
00:13 <legoktm@deploy1001> Synchronized wmf-config/timeline.php: Set $wgTimelineFontDirectory (T274822) (duration: 01m 05s) [production]
00:04 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1345.eqiad.wmnet with reason: REIMAGE [production]
00:02 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw1345.eqiad.wmnet with reason: REIMAGE [production]
2021-02-16
23:54 <mutante> puppetmaster1001 - puppet cert clean mwdebug1001, sign new request, initial puppet run, now on buster (T274023) [production]
23:54 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1351.eqiad.wmnet with reason: REIMAGE [production]
23:52 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw1351.eqiad.wmnet with reason: REIMAGE [production]
23:44 <dzahn@cumin1001> conftool action : set/pooled=inactive; selector: name=mwdebug1001.eqiad.wmnet [production]