3051-3100 of 10000 results (129ms)
2025-01-28 ยง
16:41 <jhancock@cumin2002> START - Cookbook sre.hosts.provision for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
16:41 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
16:41 <jhancock@cumin2002> START - Cookbook sre.hosts.provision for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
16:40 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
16:40 <jhancock@cumin2002> START - Cookbook sre.hosts.provision for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
16:37 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on lvs4010.ulsfo.wmnet with reason: host reimage [production]
16:33 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1227', diff saved to https://phabricator.wikimedia.org/P72642 and previous config saved to /var/cache/conftool/dbconfig/20250128-163336-marostegui.json [production]
16:33 <vgutierrez@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on lvs4010.ulsfo.wmnet with reason: host reimage [production]
16:26 <root@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2177.codfw.wmnet with reason: Index rebuild [production]
16:26 <marostegui@cumin1002> dbctl commit (dc=all): 'db1166 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P72641 and previous config saved to /var/cache/conftool/dbconfig/20250128-162649-root.json [production]
16:26 <root@cumin1002> END (PASS) - Cookbook sre.mysql.upgrade (exit_code=0) for db2177.codfw.wmnet [production]
16:25 <jhancock@cumin2002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host gerrit2002 [production]
16:25 <jhancock@cumin2002> START - Cookbook sre.network.configure-switch-interfaces for host gerrit2002 [production]
16:25 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2177.codfw.wmnet with reason: maintenance [production]
16:20 <reedy@deploy2002> Finished scap sync-world: Backport for [[gerrit:1114701|FormatMetadata: Prevent running preg_match() on null (T384879)]], [[gerrit:1114702|FormatMetadata: Prevent running preg_match() on null (T384879)]] (duration: 12m 12s) [production]
16:19 <root@cumin1002> START - Cookbook sre.mysql.upgrade for db2177.codfw.wmnet [production]
16:18 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool db2177 T382842', diff saved to https://phabricator.wikimedia.org/P72640 and previous config saved to /var/cache/conftool/dbconfig/20250128-161857-marostegui.json [production]
16:18 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1227 (T384592)', diff saved to https://phabricator.wikimedia.org/P72639 and previous config saved to /var/cache/conftool/dbconfig/20250128-161829-marostegui.json [production]
16:15 <cgoubert@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-videoscaler: apply [production]
16:15 <cgoubert@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-videoscaler: apply [production]
16:15 <vgutierrez@cumin1002> START - Cookbook sre.hosts.reimage for host lvs4010.ulsfo.wmnet with OS bookworm [production]
16:14 <jayme@cumin1002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker[2013,2036,2088].codfw.wmnet [production]
16:14 <jayme@cumin1002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for wikikube-worker[2013,2036,2088].codfw.wmnet [production]
16:14 <jayme@cumin1002> START - Cookbook sre.hosts.remove-downtime for wikikube-worker[2013,2036,2088].codfw.wmnet [production]
16:14 <jayme@cumin1002> START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker[2013,2036,2088].codfw.wmnet [production]
16:14 <reedy@deploy2002> reedy: Continuing with sync [production]
16:13 <reedy@deploy2002> reedy: Backport for [[gerrit:1114701|FormatMetadata: Prevent running preg_match() on null (T384879)]], [[gerrit:1114702|FormatMetadata: Prevent running preg_match() on null (T384879)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
16:11 <jhancock@cumin2002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host wikikube-worker2088 [production]
16:11 <marostegui@cumin1002> dbctl commit (dc=all): 'db1166 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P72638 and previous config saved to /var/cache/conftool/dbconfig/20250128-161143-root.json [production]
16:11 <jhancock@cumin2002> START - Cookbook sre.network.configure-switch-interfaces for host wikikube-worker2088 [production]
16:08 <reedy@deploy2002> Started scap sync-world: Backport for [[gerrit:1114701|FormatMetadata: Prevent running preg_match() on null (T384879)]], [[gerrit:1114702|FormatMetadata: Prevent running preg_match() on null (T384879)]] [production]
16:06 <jayme@cumin1002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-ctrl1003.eqiad.wmnet [production]
16:06 <jayme@cumin1002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for wikikube-ctrl1003.eqiad.wmnet [production]
16:06 <jayme@cumin1002> START - Cookbook sre.hosts.remove-downtime for wikikube-ctrl1003.eqiad.wmnet [production]
16:06 <jayme@cumin1002> START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-ctrl1003.eqiad.wmnet [production]
16:05 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool db1166', diff saved to https://phabricator.wikimedia.org/P72637 and previous config saved to /var/cache/conftool/dbconfig/20250128-160518-marostegui.json [production]
16:04 <root@cumin1002> END (ERROR) - Cookbook sre.mysql.pool (exit_code=97) db1166 gradually with 4 steps - Repooling after rebuild index T384807 [production]
16:03 <root@cumin1002> START - Cookbook sre.mysql.pool db1166 gradually with 4 steps - Repooling after rebuild index T384807 [production]
16:00 <jelto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on gerrit2002.wikimedia.org with reason: NIC port switch -t T383709 [production]
15:59 <cgoubert@deploy2002> helmfile [codfw] DONE helmfile.d/admin 'apply'. [production]
15:59 <cgoubert@deploy2002> helmfile [codfw] START helmfile.d/admin 'apply'. [production]
15:58 <vgutierrez> upload liberica 0.7 to apt.wm.o (bookworm-wikimedia) [production]
15:57 <cgoubert@deploy2002> helmfile [codfw] DONE helmfile.d/admin 'apply'. [production]
15:57 <cgoubert@deploy2002> helmfile [codfw] START helmfile.d/admin 'apply'. [production]
15:56 <cgoubert@deploy2002> helmfile [eqiad] DONE helmfile.d/admin 'apply'. [production]
15:56 <cgoubert@deploy2002> helmfile [eqiad] START helmfile.d/admin 'apply'. [production]
15:55 <cgoubert@deploy2002> helmfile [eqiad] DONE helmfile.d/admin 'apply'. [production]
15:54 <cgoubert@deploy2002> helmfile [eqiad] START helmfile.d/admin 'apply'. [production]
15:54 <cgoubert@deploy2002> helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'. [production]
15:53 <cgoubert@deploy2002> helmfile [staging-eqiad] START helmfile.d/admin 'apply'. [production]