6151-6200 of 10000 results (54ms)
2021-03-09 ยง
13:37 <filippo@cumin1001> START - Cookbook sre.hosts.reboot-single for host prometheus1004.eqiad.wmnet [production]
13:34 <aborrero@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 30 days, 0:00:00 on cloudvirt1038.eqiad.wmnet with reason: HW issue [production]
13:34 <aborrero@cumin1001> START - Cookbook sre.hosts.downtime for 30 days, 0:00:00 on cloudvirt1038.eqiad.wmnet with reason: HW issue [production]
13:31 <marostegui@cumin1001> dbctl commit (dc=all): 'db1168 (re)pooling @ 100%: 10', diff saved to https://phabricator.wikimedia.org/P14692 and previous config saved to /var/cache/conftool/dbconfig/20210309-133124-root.json [production]
13:28 <filippo@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host prometheus1003.eqiad.wmnet [production]
13:27 <elukey> reimage an-worker1102 and an-worker1080 (hdfs journal node) to Buster [production]
13:21 <jgleeson> updated payments-wiki from 65dbf0ed9d to 0e7800027a [production]
13:16 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1198:3316 for schema change', diff saved to https://phabricator.wikimedia.org/P14691 and previous config saved to /var/cache/conftool/dbconfig/20210309-131652-marostegui.json [production]
13:16 <marostegui@cumin1001> dbctl commit (dc=all): 'db1168 (re)pooling @ 60%: 10', diff saved to https://phabricator.wikimedia.org/P14690 and previous config saved to /var/cache/conftool/dbconfig/20210309-131620-root.json [production]
13:10 <filippo@cumin1001> START - Cookbook sre.hosts.reboot-single for host prometheus1003.eqiad.wmnet [production]
13:08 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-worker1103.eqiad.wmnet with reason: REIMAGE [production]
13:06 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1103.eqiad.wmnet with reason: REIMAGE [production]
13:03 <hnowlan@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on aqs1013.eqiad.wmnet with reason: REIMAGE [production]
13:01 <hnowlan@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on aqs1013.eqiad.wmnet with reason: REIMAGE [production]
13:01 <marostegui@cumin1001> dbctl commit (dc=all): 'db1168 (re)pooling @ 30%: 10', diff saved to https://phabricator.wikimedia.org/P14689 and previous config saved to /var/cache/conftool/dbconfig/20210309-130116-root.json [production]
12:59 <elukey> drain + reimage an-worker1103 to Buster [production]
12:59 <hnowlan@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on aqs1011.eqiad.wmnet with reason: REIMAGE [production]
12:57 <hnowlan@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on aqs1011.eqiad.wmnet with reason: REIMAGE [production]
12:56 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mw1403.eqiad.wmnet [production]
12:56 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mw1402.eqiad.wmnet [production]
12:50 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1168 for schema change', diff saved to https://phabricator.wikimedia.org/P14688 and previous config saved to /var/cache/conftool/dbconfig/20210309-125007-marostegui.json [production]
12:49 <marostegui@cumin1001> dbctl commit (dc=all): 'db1173 (re)pooling @ 100%: 10', diff saved to https://phabricator.wikimedia.org/P14687 and previous config saved to /var/cache/conftool/dbconfig/20210309-124931-root.json [production]
12:41 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host mw1403.eqiad.wmnet [production]
12:41 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host mw1402.eqiad.wmnet [production]
12:38 <hnowlan@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
12:34 <marostegui@cumin1001> dbctl commit (dc=all): 'db1173 (re)pooling @ 60%: 10', diff saved to https://phabricator.wikimedia.org/P14686 and previous config saved to /var/cache/conftool/dbconfig/20210309-123427-root.json [production]
12:33 <aborrero@cumin1001> START - Cookbook sre.hosts.reboot-single for host cloudvirt1038.eqiad.wmnet [production]
12:31 <hnowlan@cumin1001> START - Cookbook sre.dns.netbox [production]
12:30 <hnowlan> regenerating interfaces and reimaging aqs101[1-5] [production]
12:29 <marostegui> Upgrade db2084 kernel [production]
12:26 <marostegui> Upgrade db2094 kernel [production]
12:19 <marostegui@cumin1001> dbctl commit (dc=all): 'db1173 (re)pooling @ 30%: 10', diff saved to https://phabricator.wikimedia.org/P14685 and previous config saved to /var/cache/conftool/dbconfig/20210309-121924-root.json [production]
12:19 <marostegui@cumin1001> dbctl commit (dc=all): 'Repool db1166 entirely', diff saved to https://phabricator.wikimedia.org/P14684 and previous config saved to /var/cache/conftool/dbconfig/20210309-121913-marostegui.json [production]
12:18 <marostegui@cumin1001> dbctl commit (dc=all): 'db1166 (re)pooling @ 30%: 10', diff saved to https://phabricator.wikimedia.org/P14683 and previous config saved to /var/cache/conftool/dbconfig/20210309-121849-root.json [production]
12:16 <urbanecm@deploy1002> Synchronized php-1.36.0-wmf.33/extensions/GrowthExperiments/: dbd6f0cb299bcfb6648b351e1476100fe669cc58: Make help panel fallback to help desk if no mentor is available (T275908; T273782) (duration: 01m 01s) [production]
12:13 <marostegui> Upgrade db2080 kernel [production]
12:06 <marostegui> Upgrade db2077 kernel [production]
12:03 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1173 for schema change', diff saved to https://phabricator.wikimedia.org/P14682 and previous config saved to /var/cache/conftool/dbconfig/20210309-120326-marostegui.json [production]
12:00 <marostegui> Upgrade db2076 kernel [production]
11:56 <effie> restart envoy on mw1276 [production]
11:56 <hnowlan@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on aqs1010.eqiad.wmnet with reason: REIMAGE [production]
11:53 <hnowlan@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on aqs1010.eqiad.wmnet with reason: REIMAGE [production]
11:52 <jayme@deploy1002> helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. [production]
11:52 <jayme@deploy1002> helmfile [staging-codfw] START helmfile.d/admin 'apply'. [production]
11:45 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mw1307.eqiad.wmnet [production]
11:42 <filippo@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host prometheus2004.codfw.wmnet [production]
11:42 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mwdebug1003.eqiad.wmnet [production]
11:30 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host mw1307.eqiad.wmnet [production]
11:30 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host mwdebug1003.eqiad.wmnet [production]
11:29 <jayme@deploy1002> helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. [production]