3101-3150 of 10000 results (127ms)
2025-01-28 ยง
16:06 <jayme@cumin1002> START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-ctrl1003.eqiad.wmnet [production]
16:05 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool db1166', diff saved to https://phabricator.wikimedia.org/P72637 and previous config saved to /var/cache/conftool/dbconfig/20250128-160518-marostegui.json [production]
16:04 <root@cumin1002> END (ERROR) - Cookbook sre.mysql.pool (exit_code=97) db1166 gradually with 4 steps - Repooling after rebuild index T384807 [production]
16:03 <root@cumin1002> START - Cookbook sre.mysql.pool db1166 gradually with 4 steps - Repooling after rebuild index T384807 [production]
16:00 <jelto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on gerrit2002.wikimedia.org with reason: NIC port switch -t T383709 [production]
15:59 <cgoubert@deploy2002> helmfile [codfw] DONE helmfile.d/admin 'apply'. [production]
15:59 <cgoubert@deploy2002> helmfile [codfw] START helmfile.d/admin 'apply'. [production]
15:58 <vgutierrez> upload liberica 0.7 to apt.wm.o (bookworm-wikimedia) [production]
15:57 <cgoubert@deploy2002> helmfile [codfw] DONE helmfile.d/admin 'apply'. [production]
15:57 <cgoubert@deploy2002> helmfile [codfw] START helmfile.d/admin 'apply'. [production]
15:56 <cgoubert@deploy2002> helmfile [eqiad] DONE helmfile.d/admin 'apply'. [production]
15:56 <cgoubert@deploy2002> helmfile [eqiad] START helmfile.d/admin 'apply'. [production]
15:55 <cgoubert@deploy2002> helmfile [eqiad] DONE helmfile.d/admin 'apply'. [production]
15:54 <cgoubert@deploy2002> helmfile [eqiad] START helmfile.d/admin 'apply'. [production]
15:54 <cgoubert@deploy2002> helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'. [production]
15:53 <cgoubert@deploy2002> helmfile [staging-eqiad] START helmfile.d/admin 'apply'. [production]
15:52 <cgoubert@deploy2002> helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'. [production]
15:51 <cgoubert@deploy2002> helmfile [staging-eqiad] START helmfile.d/admin 'apply'. [production]
15:50 <jhancock@cumin2002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host wikikube-worker2036 [production]
15:50 <cgoubert@deploy2002> helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. [production]
15:50 <jhancock@cumin2002> START - Cookbook sre.network.configure-switch-interfaces for host wikikube-worker2036 [production]
15:49 <cgoubert@deploy2002> helmfile [staging-codfw] START helmfile.d/admin 'apply'. [production]
15:49 <jayme@cumin1002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host wikikube-ctrl1003.eqiad.wmnet [production]
15:49 <jayme@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-ctrl1003.eqiad.wmnet with reason: Depooled via sre.k8s.pool-depool-node [production]
15:49 <cgoubert@deploy2002> helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. [production]
15:49 <cgoubert@deploy2002> helmfile [staging-codfw] START helmfile.d/admin 'apply'. [production]
15:49 <jayme@cumin1002> START - Cookbook sre.k8s.pool-depool-node depool for host wikikube-ctrl1003.eqiad.wmnet [production]
15:48 <jayme@cumin1002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-ctrl1002.eqiad.wmnet [production]
15:48 <jayme@cumin1002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for wikikube-ctrl1002.eqiad.wmnet [production]
15:48 <jayme@cumin1002> START - Cookbook sre.hosts.remove-downtime for wikikube-ctrl1002.eqiad.wmnet [production]
15:48 <jayme@cumin1002> START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-ctrl1002.eqiad.wmnet [production]
15:48 <cgoubert@deploy2002> helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. [production]
15:48 <aqu> About to deploy analytics/refinery/source 0.2.57 [production]
15:48 <cgoubert@deploy2002> helmfile [staging-codfw] START helmfile.d/admin 'apply'. [production]
15:47 <cgoubert@deploy2002> helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. [production]
15:47 <cgoubert@deploy2002> helmfile [staging-codfw] START helmfile.d/admin 'apply'. [production]
15:46 <cgoubert@deploy2002> helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. [production]
15:45 <cgoubert@deploy2002> helmfile [staging-codfw] START helmfile.d/admin 'apply'. [production]
15:38 <jhancock@cumin2002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host wikikube-worker2013 [production]
15:38 <jhancock@cumin2002> START - Cookbook sre.network.configure-switch-interfaces for host wikikube-worker2013 [production]
15:22 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db1227 (T384592)', diff saved to https://phabricator.wikimedia.org/P72635 and previous config saved to /var/cache/conftool/dbconfig/20250128-152159-marostegui.json [production]
15:21 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1227.eqiad.wmnet with reason: Maintenance [production]
15:21 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1202 (T384592)', diff saved to https://phabricator.wikimedia.org/P72634 and previous config saved to /var/cache/conftool/dbconfig/20250128-152137-marostegui.json [production]
15:14 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2028.codfw.wmnet [production]
15:13 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of aux-k8s-etcd2003.codfw.wmnet to plain [production]
15:12 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of aux-k8s-etcd2003.codfw.wmnet to plain [production]
15:11 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2028.codfw.wmnet [production]
15:10 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2028.codfw.wmnet [production]
15:06 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1202', diff saved to https://phabricator.wikimedia.org/P72631 and previous config saved to /var/cache/conftool/dbconfig/20250128-150630-marostegui.json [production]
15:06 <jelto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on gerrit2002.wikimedia.org with reason: NIC port switch -t T383709 [production]