251-300 of 10000 results (26ms)
2025-06-19 ยง
10:25 <btullis@cumin1003> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host dse-k8s-worker2002 [production]
10:25 <btullis@cumin1003> START - Cookbook sre.network.configure-switch-interfaces for host dse-k8s-worker2002 [production]
10:25 <btullis@cumin1003> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) dse-k8s-worker2002.codfw.wmnet 86.48.192.10.in-addr.arpa 6.8.0.0.8.4.0.0.2.9.1.0.0.1.0.0.4.0.1.0.0.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors [production]
10:25 <btullis@cumin1003> START - Cookbook sre.dns.wipe-cache dse-k8s-worker2002.codfw.wmnet 86.48.192.10.in-addr.arpa 6.8.0.0.8.4.0.0.2.9.1.0.0.1.0.0.4.0.1.0.0.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors [production]
10:25 <btullis@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
10:25 <btullis@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host dse-k8s-worker2002 - btullis@cumin1003" [production]
10:25 <btullis@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host dse-k8s-worker2002 - btullis@cumin1003" [production]
10:23 <jgiannelos@deploy1003> helmfile [codfw] DONE helmfile.d/services/changeprop: apply [production]
10:23 <jgiannelos@deploy1003> helmfile [codfw] START helmfile.d/services/changeprop: apply [production]
10:23 <jgiannelos@deploy1003> helmfile [staging] DONE helmfile.d/services/changeprop: apply [production]
10:23 <oblivian@cumin1003> END (PASS) - Cookbook sre.deploy.hiddenparma (exit_code=0) Hiddenparma deployment to the alerting hosts with reason: "Fix api token loading - oblivian@cumin1003" [production]
10:23 <oblivian@cumin1003> END (PASS) - Cookbook sre.deploy.python-code (exit_code=0) hiddenparma to alert[1002,2002].wikimedia.org with reason: Fix api token loading - oblivian@cumin1003 [production]
10:23 <oblivian@cumin1003> START - Cookbook sre.deploy.python-code hiddenparma to alert[1002,2002].wikimedia.org with reason: Fix api token loading - oblivian@cumin1003 [production]
10:23 <oblivian@cumin1003> START - Cookbook sre.deploy.hiddenparma Hiddenparma deployment to the alerting hosts with reason: "Fix api token loading - oblivian@cumin1003" [production]
10:23 <jgiannelos@deploy1003> helmfile [staging] START helmfile.d/services/changeprop: apply [production]
10:20 <Amir1> dropping searchindex table in itwiki (T397367) [production]
10:19 <btullis@cumin1003> START - Cookbook sre.dns.netbox [production]
10:19 <btullis@cumin1003> START - Cookbook sre.hosts.move-vlan for host dse-k8s-worker2002 [production]
10:19 <Emperor> depool / restart / repool ms-fe1009 [some idle timeouts] [production]
10:19 <btullis@cumin1003> START - Cookbook sre.hosts.reimage for host dse-k8s-worker2002.codfw.wmnet with OS bookworm [production]
10:19 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host dse-k8s-worker2001.codfw.wmnet with OS bookworm [production]
10:18 <jgiannelos@deploy1003> helmfile [staging] DONE helmfile.d/services/changeprop: apply [production]
10:17 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2200.codfw.wmnet with reason: Maintenance [production]
10:17 <jgiannelos@deploy1003> helmfile [staging] START helmfile.d/services/changeprop: apply [production]
10:16 <akosiaris@cumin1003> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on aux-k8s-worker2006.codfw.wmnet with reason: host reimage [production]
10:16 <akosiaris@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on aux-k8s-worker2006.codfw.wmnet with reason: host reimage [production]
10:14 <jgiannelos@deploy1003> helmfile [eqiad] DONE helmfile.d/services/changeprop: apply [production]
10:14 <jgiannelos@deploy1003> helmfile [eqiad] START helmfile.d/services/changeprop: apply [production]
10:14 <jgiannelos@deploy1003> helmfile [codfw] DONE helmfile.d/services/changeprop: apply [production]
10:14 <jgiannelos@deploy1003> helmfile [codfw] START helmfile.d/services/changeprop: apply [production]
10:13 <marostegui@cumin1002> dbctl commit (dc=all): 'db2196 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P78430 and previous config saved to /var/cache/conftool/dbconfig/20250619-101317-root.json [production]
10:12 <godog> powercycle netmon1003 [production]
10:12 <jgiannelos@deploy1003> helmfile [staging] DONE helmfile.d/services/changeprop: apply [production]
10:12 <jgiannelos@deploy1003> helmfile [staging] START helmfile.d/services/changeprop: apply [production]
10:04 <akosiaris@cumin1003> START - Cookbook sre.hosts.reimage for host aux-k8s-worker2006.codfw.wmnet with OS bookworm [production]
10:02 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dse-k8s-worker2001.codfw.wmnet with reason: host reimage [production]
10:01 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2198.codfw.wmnet with reason: Maintenance [production]
10:01 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2182 (T396130)', diff saved to https://phabricator.wikimedia.org/P78429 and previous config saved to /var/cache/conftool/dbconfig/20250619-100102-marostegui.json [production]
09:58 <marostegui@cumin1002> dbctl commit (dc=all): 'db2196 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P78428 and previous config saved to /var/cache/conftool/dbconfig/20250619-095811-root.json [production]
09:57 <btullis@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on dse-k8s-worker2001.codfw.wmnet with reason: host reimage [production]
09:54 <cmooney@cumin1003> END (PASS) - Cookbook sre.deploy.python-code (exit_code=0) homer to cumin2002.codfw.wmnet,cumin[1002-1003].eqiad.wmnet with reason: Homer release to add wikikube-worker-exp - cmooney@cumin1003 [production]
09:52 <cmooney@cumin1003> START - Cookbook sre.deploy.python-code homer to cumin2002.codfw.wmnet,cumin[1002-1003].eqiad.wmnet with reason: Homer release to add wikikube-worker-exp - cmooney@cumin1003 [production]
09:46 <wmbot~nokibsarkar@tools-bastion-13> Test [tools.campwiz-bot]
09:45 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2182', diff saved to https://phabricator.wikimedia.org/P78427 and previous config saved to /var/cache/conftool/dbconfig/20250619-094554-marostegui.json [production]
09:45 <oblivian@cumin1003> END (PASS) - Cookbook sre.deploy.hiddenparma (exit_code=0) Hiddenparma deployment to the alerting hosts with reason: "New api endpoints for the requestctl client - oblivian@cumin1003" [production]
09:45 <oblivian@cumin1003> END (PASS) - Cookbook sre.deploy.python-code (exit_code=0) hiddenparma to alert[1002,2002].wikimedia.org with reason: New api endpoints for the requestctl client - oblivian@cumin1003 [production]
09:44 <oblivian@cumin1003> START - Cookbook sre.deploy.python-code hiddenparma to alert[1002,2002].wikimedia.org with reason: New api endpoints for the requestctl client - oblivian@cumin1003 [production]
09:44 <oblivian@cumin1003> START - Cookbook sre.deploy.hiddenparma Hiddenparma deployment to the alerting hosts with reason: "New api endpoints for the requestctl client - oblivian@cumin1003" [production]
09:43 <btullis@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-test-k8s: apply [production]
09:43 <marostegui@cumin1002> dbctl commit (dc=all): 'db2196 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P78426 and previous config saved to /var/cache/conftool/dbconfig/20250619-094306-root.json [production]