651-700 of 10000 results (89ms)
2023-01-30 ยง
18:38 <sukhe@cumin2002> START - Cookbook sre.hosts.reimage for host cp3052.esams.wmnet with OS bullseye [production]
18:37 <sukhe@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cp3052.esams.wmnet with OS bullseye [production]
18:37 <sukhe@cumin2002> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['cp3052.esams.wmnet'] [production]
18:37 <sukhe@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['cp3052.esams.wmnet'] [production]
18:34 <brett@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp4051.ulsfo.wmnet with OS bullseye [production]
18:29 <aokoth@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 14 days, 0:00:00 on vrts2001.codfw.wmnet with reason: installation failed due to read-only database [production]
18:29 <aokoth@cumin1001> START - Cookbook sre.hosts.downtime for 14 days, 0:00:00 on vrts2001.codfw.wmnet with reason: installation failed due to read-only database [production]
18:19 <sukhe@cumin2002> START - Cookbook sre.hosts.reimage for host cp3052.esams.wmnet with OS bullseye [production]
18:19 <sukhe@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp3052.esams.wmnet with OS bullseye [production]
18:10 <sukhe@cumin2002> START - Cookbook sre.hosts.reimage for host cp3052.esams.wmnet with OS bullseye [production]
18:08 <sukhe@cumin2002> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts cp3052.esams.wmnet [production]
18:07 <brett@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp4051.ulsfo.wmnet with reason: host reimage [production]
18:04 <brett@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on cp4051.ulsfo.wmnet with reason: host reimage [production]
18:01 <urbanecm@deploy1002> Finished scap: Backport for [[gerrit:884427|[Growth] Remove wgGERecentChangesUnstarredMenteesFilterEnabled]] (duration: 07m 59s) [production]
17:53 <urbanecm@deploy1002> Started scap: Backport for [[gerrit:884427|[Growth] Remove wgGERecentChangesUnstarredMenteesFilterEnabled]] [production]
17:49 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2177 (T328255)', diff saved to https://phabricator.wikimedia.org/P43517 and previous config saved to /var/cache/conftool/dbconfig/20230130-174957-ladsgroup.json [production]
17:49 <sukhe@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts cp3052.esams.wmnet [production]
17:43 <brett@cumin1001> START - Cookbook sre.hosts.reimage for host cp4051.ulsfo.wmnet with OS bullseye [production]
17:43 <brett@cumin1001> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp4051.ulsfo.wmnet with OS bullseye [production]
17:36 <sukhe@puppetmaster1001> conftool action : set/pooled=yes; selector: name=cp5026.eqsin.wmnet,service=ats-be [production]
17:36 <sukhe@puppetmaster1001> conftool action : set/pooled=yes; selector: name=cp5026.eqsin.wmnet,service=cdn [production]
17:34 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2177', diff saved to https://phabricator.wikimedia.org/P43516 and previous config saved to /var/cache/conftool/dbconfig/20230130-173450-ladsgroup.json [production]
17:34 <bking@deploy1002> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
17:34 <bking@deploy1002> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
17:34 <sukhe@puppetmaster1001> conftool action : set/pooled=yes; selector: name=cp3050.esams.wmnet,service=ats-be [production]
17:34 <sukhe@puppetmaster1001> conftool action : set/pooled=yes; selector: name=cp3050.esams.wmnet,service=cdn [production]
17:31 <brett@cumin1001> START - Cookbook sre.hosts.reimage for host cp4051.ulsfo.wmnet with OS bullseye [production]
17:31 <brett@cumin1001> conftool action : set/pooled=yes; selector: name=cp4043.ulsfo.wmnet [production]
17:27 <brett@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp4043.ulsfo.wmnet with OS bullseye [production]
17:24 <inflatador> bking@build2001 rebuilding docker images for 884351 complete [production]
17:22 <inflatador> bking@build2001 rebuilding docker images for 884351 [production]
17:21 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5026.eqsin.wmnet with OS bullseye [production]
17:19 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2177', diff saved to https://phabricator.wikimedia.org/P43515 and previous config saved to /var/cache/conftool/dbconfig/20230130-171944-ladsgroup.json [production]
17:12 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp3050.esams.wmnet with OS bullseye [production]
17:04 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2177 (T328255)', diff saved to https://phabricator.wikimedia.org/P43514 and previous config saved to /var/cache/conftool/dbconfig/20230130-170437-ladsgroup.json [production]
16:59 <brett@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp4043.ulsfo.wmnet with reason: host reimage [production]
16:56 <brett@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on cp4043.ulsfo.wmnet with reason: host reimage [production]
16:53 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db2177 (T328255)', diff saved to https://phabricator.wikimedia.org/P43513 and previous config saved to /var/cache/conftool/dbconfig/20230130-165359-ladsgroup.json [production]
16:53 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2177.codfw.wmnet with reason: Maintenance [production]
16:53 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db2177.codfw.wmnet with reason: Maintenance [production]
16:53 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2156 (T328255)', diff saved to https://phabricator.wikimedia.org/P43512 and previous config saved to /var/cache/conftool/dbconfig/20230130-165348-ladsgroup.json [production]
16:50 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5026.eqsin.wmnet with reason: host reimage [production]
16:48 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp3050.esams.wmnet with reason: host reimage [production]
16:44 <sukhe@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cp5026.eqsin.wmnet with reason: host reimage [production]
16:44 <sukhe@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cp3050.esams.wmnet with reason: host reimage [production]
16:38 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2156', diff saved to https://phabricator.wikimedia.org/P43511 and previous config saved to /var/cache/conftool/dbconfig/20230130-163842-ladsgroup.json [production]
16:35 <brett@cumin1001> START - Cookbook sre.hosts.reimage for host cp4043.ulsfo.wmnet with OS bullseye [production]
16:35 <brett@cumin1001> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp4043.ulsfo.wmnet with OS bullseye [production]
16:30 <btullis@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-worker1084.eqiad.wmnet [production]
16:25 <brett@cumin1001> START - Cookbook sre.hosts.reimage for host cp4043.ulsfo.wmnet with OS bullseye [production]