301-350 of 10000 results (85ms)
2025-06-24 ยง
12:59 <vgutierrez@cumin1002> START - Cookbook sre.loadbalancer.admin config_reloading P{lvs3008.esams.wmnet} and A:liberica (T396561) [production]
12:59 <jhancock@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
12:56 <jhancock@cumin1003> START - Cookbook sre.dns.netbox [production]
12:54 <mvernon@cumin1002> START - Cookbook sre.swift.check-dbs Checking container DBs of wikipedia-commons-local-thumb.0a [production]
12:54 <mvernon@cumin1002> END (PASS) - Cookbook sre.swift.check-dbs (exit_code=0) Checking container DBs of wikipedia-commons-local-thumb.09 [production]
12:50 <jelto> bump kubernetes-client to newest version on aux-k8s-ctrl* - T387548 [production]
12:48 <jgiannelos@deploy1003> helmfile [codfw] DONE helmfile.d/services/push-notifications: apply [production]
12:47 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for lvs3008.esams.wmnet [production]
12:47 <vgutierrez@cumin1002> START - Cookbook sre.hosts.remove-downtime for lvs3008.esams.wmnet [production]
12:47 <jgiannelos@deploy1003> helmfile [codfw] START helmfile.d/services/push-notifications: apply [production]
12:45 <jgiannelos@deploy1003> helmfile [eqiad] DONE helmfile.d/services/push-notifications: apply [production]
12:44 <jgiannelos@deploy1003> helmfile [eqiad] START helmfile.d/services/push-notifications: apply [production]
12:43 <jelto> bump kubernetes-client to newest version on dse-k8s-ctrl100[12] - T387548 [production]
12:43 <mvernon@cumin1002> START - Cookbook sre.swift.check-dbs Checking container DBs of wikipedia-commons-local-thumb.09 [production]
12:43 <mvernon@cumin1002> END (PASS) - Cookbook sre.swift.check-dbs (exit_code=0) Checking container DBs of wikipedia-commons-local-thumb.08 [production]
12:42 <urbanecm@deploy1003> Finished scap sync-world: Backport for [[gerrit:1163292|[Growth] testwiki: Enable the get-started-experiment (T394958)]] (duration: 18m 18s) [production]
12:39 <jgiannelos@deploy1003> helmfile [eqiad] DONE helmfile.d/services/wikifeeds: apply [production]
12:39 <jgiannelos@deploy1003> helmfile [eqiad] START helmfile.d/services/wikifeeds: apply [production]
12:39 <jgiannelos@deploy1003> helmfile [codfw] DONE helmfile.d/services/wikifeeds: apply [production]
12:39 <jgiannelos@deploy1003> helmfile [codfw] START helmfile.d/services/wikifeeds: apply [production]
12:38 <jgiannelos@deploy1003> helmfile [eqiad] DONE helmfile.d/services/wikifeeds: apply [production]
12:38 <jgiannelos@deploy1003> helmfile [eqiad] START helmfile.d/services/wikifeeds: apply [production]
12:37 <jgiannelos@deploy1003> helmfile [staging] DONE helmfile.d/services/wikifeeds: apply [production]
12:37 <jgiannelos@deploy1003> helmfile [staging] START helmfile.d/services/wikifeeds: apply [production]
12:36 <jgreen@dns1004> END - running authdns-update [production]
12:35 <jgreen@dns1004> START - running authdns-update [production]
12:35 <urbanecm@deploy1003> urbanecm: Continuing with sync [production]
12:35 <vgutierrez@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on lvs3008.esams.wmnet with reason: switching to katran [production]
12:34 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.loadbalancer.admin (exit_code=0) depooling P{lvs3008.esams.wmnet} and A:liberica (T396561) [production]
12:34 <akosiaris@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host aux-k8s-worker1009.eqiad.wmnet with OS bookworm [production]
12:34 <vgutierrez@cumin1002> START - Cookbook sre.loadbalancer.admin depooling P{lvs3008.esams.wmnet} and A:liberica (T396561) [production]
12:33 <jgiannelos@deploy1003> helmfile [eqiad] DONE helmfile.d/services/proton: apply [production]
12:32 <jgiannelos@deploy1003> helmfile [eqiad] START helmfile.d/services/proton: apply [production]
12:32 <jgiannelos@deploy1003> helmfile [codfw] DONE helmfile.d/services/proton: apply [production]
12:32 <mvernon@cumin1002> START - Cookbook sre.swift.check-dbs Checking container DBs of wikipedia-commons-local-thumb.08 [production]
12:32 <mvernon@cumin1002> END (PASS) - Cookbook sre.swift.check-dbs (exit_code=0) Checking container DBs of wikipedia-commons-local-thumb.07 [production]
12:31 <jgiannelos@deploy1003> helmfile [codfw] START helmfile.d/services/proton: apply [production]
12:27 <urbanecm@deploy1003> urbanecm: Backport for [[gerrit:1163292|[Growth] testwiki: Enable the get-started-experiment (T394958)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
12:24 <urbanecm@deploy1003> Started scap sync-world: Backport for [[gerrit:1163292|[Growth] testwiki: Enable the get-started-experiment (T394958)]] [production]
12:23 <urbanecm@deploy1003> Finished scap sync-world: Backport for [[gerrit:1163027|[Growth] Disable the Surfacing Structured Tasks feature (T397515)]] (duration: 10m 51s) [production]
12:22 <jgiannelos@deploy1003> helmfile [codfw] DONE helmfile.d/services/proton: apply [production]
12:22 <jgiannelos@deploy1003> helmfile [codfw] START helmfile.d/services/proton: apply [production]
12:20 <mvernon@cumin1002> START - Cookbook sre.swift.check-dbs Checking container DBs of wikipedia-commons-local-thumb.07 [production]
12:20 <mvernon@cumin1002> END (PASS) - Cookbook sre.swift.check-dbs (exit_code=0) Checking container DBs of wikipedia-commons-local-thumb.06 [production]
12:19 <akosiaris@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on aux-k8s-worker1009.eqiad.wmnet with reason: host reimage [production]
12:18 <cmooney@cumin1003> END (PASS) - Cookbook sre.dns.netbox_snippets (exit_code=0) Generate and push DNS records from Netbox data [production]
12:18 <cmooney@cumin1003> START - Cookbook sre.dns.netbox_snippets Generate and push DNS records from Netbox data [production]
12:17 <cgoubert@cumin1003> START - Cookbook sre.k8s.reboot-nodes rolling reboot on A:wikikube-worker-codfw [production]
12:16 <akosiaris@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host aux-k8s-worker1008.eqiad.wmnet with OS bookworm [production]
12:16 <akosiaris@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on aux-k8s-worker1009.eqiad.wmnet with reason: host reimage [production]