2251-2300 of 10000 results (27ms)
2025-06-19 ยง
10:39 <moritzm> installing Django security updates [production]
10:34 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db2208 (T396130)', diff saved to https://phabricator.wikimedia.org/P78431 and previous config saved to /var/cache/conftool/dbconfig/20250619-103400-marostegui.json [production]
10:33 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2208.codfw.wmnet with reason: Maintenance [production]
10:32 <akosiaris@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host aux-k8s-worker2006.codfw.wmnet with OS bookworm [production]
10:32 <jgiannelos@deploy1003> helmfile [codfw] START helmfile.d/services/changeprop: apply [production]
10:32 <moritzm> installing twisted security updates [production]
10:31 <jayme@cumin1002> START - Cookbook sre.k8s.pool-depool-cluster depool all services in codfw/codfw: maintenance [production]
10:31 <jgiannelos@deploy1003> helmfile [staging] DONE helmfile.d/services/changeprop: apply [production]
10:31 <jgiannelos@deploy1003> helmfile [staging] START helmfile.d/services/changeprop: apply [production]
10:28 <moritzm> installing postgresql-13 security updates [production]
10:25 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host dse-k8s-worker2002 [production]
10:25 <btullis@cumin1003> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host dse-k8s-worker2002 [production]
10:25 <btullis@cumin1003> START - Cookbook sre.network.configure-switch-interfaces for host dse-k8s-worker2002 [production]
10:25 <btullis@cumin1003> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) dse-k8s-worker2002.codfw.wmnet 86.48.192.10.in-addr.arpa 6.8.0.0.8.4.0.0.2.9.1.0.0.1.0.0.4.0.1.0.0.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors [production]
10:25 <btullis@cumin1003> START - Cookbook sre.dns.wipe-cache dse-k8s-worker2002.codfw.wmnet 86.48.192.10.in-addr.arpa 6.8.0.0.8.4.0.0.2.9.1.0.0.1.0.0.4.0.1.0.0.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors [production]
10:25 <btullis@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
10:25 <btullis@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host dse-k8s-worker2002 - btullis@cumin1003" [production]
10:25 <btullis@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host dse-k8s-worker2002 - btullis@cumin1003" [production]
10:23 <jgiannelos@deploy1003> helmfile [codfw] DONE helmfile.d/services/changeprop: apply [production]
10:23 <jgiannelos@deploy1003> helmfile [codfw] START helmfile.d/services/changeprop: apply [production]
10:23 <jgiannelos@deploy1003> helmfile [staging] DONE helmfile.d/services/changeprop: apply [production]
10:23 <oblivian@cumin1003> END (PASS) - Cookbook sre.deploy.hiddenparma (exit_code=0) Hiddenparma deployment to the alerting hosts with reason: "Fix api token loading - oblivian@cumin1003" [production]
10:23 <oblivian@cumin1003> END (PASS) - Cookbook sre.deploy.python-code (exit_code=0) hiddenparma to alert[1002,2002].wikimedia.org with reason: Fix api token loading - oblivian@cumin1003 [production]
10:23 <oblivian@cumin1003> START - Cookbook sre.deploy.python-code hiddenparma to alert[1002,2002].wikimedia.org with reason: Fix api token loading - oblivian@cumin1003 [production]
10:23 <oblivian@cumin1003> START - Cookbook sre.deploy.hiddenparma Hiddenparma deployment to the alerting hosts with reason: "Fix api token loading - oblivian@cumin1003" [production]
10:23 <jgiannelos@deploy1003> helmfile [staging] START helmfile.d/services/changeprop: apply [production]
10:20 <Amir1> dropping searchindex table in itwiki (T397367) [production]
10:19 <btullis@cumin1003> START - Cookbook sre.dns.netbox [production]
10:19 <btullis@cumin1003> START - Cookbook sre.hosts.move-vlan for host dse-k8s-worker2002 [production]
10:19 <Emperor> depool / restart / repool ms-fe1009 [some idle timeouts] [production]
10:19 <btullis@cumin1003> START - Cookbook sre.hosts.reimage for host dse-k8s-worker2002.codfw.wmnet with OS bookworm [production]
10:19 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host dse-k8s-worker2001.codfw.wmnet with OS bookworm [production]
10:18 <jgiannelos@deploy1003> helmfile [staging] DONE helmfile.d/services/changeprop: apply [production]
10:17 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2200.codfw.wmnet with reason: Maintenance [production]
10:17 <jgiannelos@deploy1003> helmfile [staging] START helmfile.d/services/changeprop: apply [production]
10:16 <akosiaris@cumin1003> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on aux-k8s-worker2006.codfw.wmnet with reason: host reimage [production]
10:16 <akosiaris@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on aux-k8s-worker2006.codfw.wmnet with reason: host reimage [production]
10:14 <jgiannelos@deploy1003> helmfile [eqiad] DONE helmfile.d/services/changeprop: apply [production]
10:14 <jgiannelos@deploy1003> helmfile [eqiad] START helmfile.d/services/changeprop: apply [production]
10:14 <jgiannelos@deploy1003> helmfile [codfw] DONE helmfile.d/services/changeprop: apply [production]
10:14 <jgiannelos@deploy1003> helmfile [codfw] START helmfile.d/services/changeprop: apply [production]
10:13 <marostegui@cumin1002> dbctl commit (dc=all): 'db2196 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P78430 and previous config saved to /var/cache/conftool/dbconfig/20250619-101317-root.json [production]
10:12 <godog> powercycle netmon1003 [production]
10:12 <jgiannelos@deploy1003> helmfile [staging] DONE helmfile.d/services/changeprop: apply [production]
10:12 <jgiannelos@deploy1003> helmfile [staging] START helmfile.d/services/changeprop: apply [production]
10:04 <akosiaris@cumin1003> START - Cookbook sre.hosts.reimage for host aux-k8s-worker2006.codfw.wmnet with OS bookworm [production]
10:02 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dse-k8s-worker2001.codfw.wmnet with reason: host reimage [production]
10:01 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2198.codfw.wmnet with reason: Maintenance [production]
10:01 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2182 (T396130)', diff saved to https://phabricator.wikimedia.org/P78429 and previous config saved to /var/cache/conftool/dbconfig/20250619-100102-marostegui.json [production]
09:58 <marostegui@cumin1002> dbctl commit (dc=all): 'db2196 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P78428 and previous config saved to /var/cache/conftool/dbconfig/20250619-095811-root.json [production]