251-300 of 10000 results (77ms)
2024-05-22 ยง
08:45 <arnaudb@cumin1002> START - Cookbook sre.hosts.ipmi-password-reset [production]
08:45 <arnaudb@cumin1002> END (FAIL) - Cookbook sre.hosts.ipmi-password-reset (exit_code=99) [production]
08:45 <arnaudb@cumin1002> START - Cookbook sre.hosts.ipmi-password-reset [production]
08:41 <aklapper@deploy1002> rebuilt and synchronized wikiversions files: group1 wikis to 1.43.0-wmf.6 refs T361400 [production]
08:39 <arnaudb@cumin1002> dbctl commit (dc=all): 'db1232 (re)pooling @ 5%: post reimage repool', diff saved to https://phabricator.wikimedia.org/P62848 and previous config saved to /var/cache/conftool/dbconfig/20240522-083937-arnaudb.json [production]
08:24 <arnaudb@cumin1002> dbctl commit (dc=all): 'db1232 (re)pooling @ 2%: post reimage repool', diff saved to https://phabricator.wikimedia.org/P62846 and previous config saved to /var/cache/conftool/dbconfig/20240522-082431-arnaudb.json [production]
08:16 <hashar@deploy1002> Finished scap: Backport for [[gerrit:1034182|Fix fatal error due to missing signature on very old comments (T365495)]] (duration: 16m 27s) [production]
08:13 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2173.codfw.wmnet with reason: reimage [production]
08:13 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on db2173.codfw.wmnet with reason: reimage [production]
08:11 <arnaudb@cumin1002> dbctl commit (dc=all): 'T364290 db2173', diff saved to https://phabricator.wikimedia.org/P62845 and previous config saved to /var/cache/conftool/dbconfig/20240522-081059-arnaudb.json [production]
08:09 <arnaudb@cumin1002> dbctl commit (dc=all): 'db1232 (re)pooling @ 1%: post reimage repool', diff saved to https://phabricator.wikimedia.org/P62844 and previous config saved to /var/cache/conftool/dbconfig/20240522-080924-arnaudb.json [production]
08:08 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host db1249.eqiad.wmnet [production]
08:02 <hashar@deploy1002> jforrester and hashar: Continuing with sync [production]
08:02 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db1232.eqiad.wmnet with OS bookworm [production]
08:02 <hashar@deploy1002> jforrester and hashar: Backport for [[gerrit:1034182|Fix fatal error due to missing signature on very old comments (T365495)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
08:00 <hashar@deploy1002> Started scap: Backport for [[gerrit:1034182|Fix fatal error due to missing signature on very old comments (T365495)]] [production]
07:56 <kartik@deploy1002> Finished scap: Backport for [[gerrit:1034610|SpecialNotifyTranslators: Fix group id in dropdown (T253984)]] (duration: 22m 42s) [production]
07:51 <marostegui@cumin1002> dbctl commit (dc=all): 'db1249 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P62843 and previous config saved to /var/cache/conftool/dbconfig/20240522-075142-root.json [production]
07:43 <kartik@deploy1002> abi and kartik: Continuing with sync [production]
07:42 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db1154.eqiad.wmnet with OS bookworm [production]
07:42 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1232.eqiad.wmnet with reason: host reimage [production]
07:39 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on db1232.eqiad.wmnet with reason: host reimage [production]
07:36 <marostegui@cumin1002> dbctl commit (dc=all): 'db1249 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P62842 and previous config saved to /var/cache/conftool/dbconfig/20240522-073636-root.json [production]
07:36 <kartik@deploy1002> abi and kartik: Backport for [[gerrit:1034610|SpecialNotifyTranslators: Fix group id in dropdown (T253984)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
07:33 <kartik@deploy1002> Started scap: Backport for [[gerrit:1034610|SpecialNotifyTranslators: Fix group id in dropdown (T253984)]] [production]
07:33 <ayounsi@cumin1002> END (PASS) - Cookbook sre.netbox.update-extras (exit_code=0) rolling restart_daemons on A:netbox [production]
07:33 <ayounsi@cumin1002> START - Cookbook sre.netbox.update-extras rolling restart_daemons on A:netbox [production]
07:32 <moritzm> installing postgresql-11 security updates [production]
07:30 <kartik@deploy1002> Finished scap: Backport for [[gerrit:1034642|Disable Section Translation on simplewiki (T361597)]] (duration: 19m 47s) [production]
07:26 <arnaudb@cumin1002> START - Cookbook sre.hosts.reimage for host db1232.eqiad.wmnet with OS bookworm [production]
07:25 <ayounsi@cumin1002> END (PASS) - Cookbook sre.netbox.update-extras (exit_code=0) rolling restart_daemons on A:netbox [production]
07:25 <ayounsi@cumin1002> START - Cookbook sre.netbox.update-extras rolling restart_daemons on A:netbox [production]
07:24 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1232.eqiad.wmnet with reason: reimage [production]
07:24 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on db1232.eqiad.wmnet with reason: reimage [production]
07:23 <moritzm> installing nodejs security updates [production]
07:23 <arnaudb@cumin1002> dbctl commit (dc=all): 'T364290 db1232', diff saved to https://phabricator.wikimedia.org/P62841 and previous config saved to /var/cache/conftool/dbconfig/20240522-072307-arnaudb.json [production]
07:21 <marostegui@cumin1002> dbctl commit (dc=all): 'db1249 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P62840 and previous config saved to /var/cache/conftool/dbconfig/20240522-072130-root.json [production]
07:20 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1154.eqiad.wmnet with reason: host reimage [production]
07:17 <kartik@deploy1002> kartik: Continuing with sync [production]
07:17 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on db1154.eqiad.wmnet with reason: host reimage [production]
07:13 <kartik@deploy1002> kartik: Backport for [[gerrit:1034642|Disable Section Translation on simplewiki (T361597)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
07:10 <kartik@deploy1002> Started scap: Backport for [[gerrit:1034642|Disable Section Translation on simplewiki (T361597)]] [production]
07:06 <marostegui@cumin1002> dbctl commit (dc=all): 'db1249 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P62839 and previous config saved to /var/cache/conftool/dbconfig/20240522-070624-root.json [production]
07:03 <marostegui@cumin1002> START - Cookbook sre.hosts.reimage for host db1154.eqiad.wmnet with OS bookworm [production]
07:02 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-host for host db1249.eqiad.wmnet [production]
07:01 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host db1248.eqiad.wmnet [production]
06:58 <marostegui> Reimage db1154 (sanitarium) there will be lag in s1, s3, s5 and s8 in wiki replicas [production]
06:53 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2201.codfw.wmnet with reason: Maintenance [production]
06:53 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2201.codfw.wmnet with reason: Maintenance [production]
06:53 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2192 (T352010)', diff saved to https://phabricator.wikimedia.org/P62838 and previous config saved to /var/cache/conftool/dbconfig/20240522-065340-ladsgroup.json [production]