2025-06-23
12:42 <jclark@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1002" [production]
12:42 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1245.eqiad.wmnet with reason: Maintenance [production]
12:42 <kamila@deploy1003> helmfile [codfw] FAIL (1) helmfile.d/services/machinetranslation: apply [production]
12:42 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1243 (T396130)', diff saved to https://phabricator.wikimedia.org/P78639 and previous config saved to /var/cache/conftool/dbconfig/20250623-124221-marostegui.json [production]
12:38 <akosiaris@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on aux-k8s-worker2007.codfw.wmnet with reason: host reimage [production]
12:34 <akosiaris@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on aux-k8s-worker2007.codfw.wmnet with reason: host reimage [production]
12:32 <kamila@deploy1003> helmfile [codfw] OK helmfile.d/services/linkrecommendation: apply [production]
12:31 <kamila@deploy1003> helmfile [codfw] OK helmfile.d/services/kartotherian: apply [production]
12:30 <kamila@deploy1003> helmfile [codfw] OK helmfile.d/services/ipoid: apply [production]
12:30 <kamila@deploy1003> helmfile [codfw] OK helmfile.d/services/image-suggestion: apply [production]
12:30 <kamila@deploy1003> helmfile [codfw] OK helmfile.d/services/geo-analytics: apply [production]
12:29 <kamila@deploy1003> helmfile [codfw] OK helmfile.d/services/eventstreams-internal: apply [production]
12:29 <kamila@deploy1003> helmfile [codfw] OK helmfile.d/services/eventstreams: apply [production]
12:28 <kamila@deploy1003> helmfile [codfw] OK helmfile.d/services/eventgate-main: apply [production]
12:28 <kamila@deploy1003> helmfile [codfw] OK helmfile.d/services/eventgate-logging-external: apply [production]
12:27 <kamila@deploy1003> helmfile [codfw] OK helmfile.d/services/eventgate-analytics-external: apply [production]
12:27 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1243', diff saved to https://phabricator.wikimedia.org/P78638 and previous config saved to /var/cache/conftool/dbconfig/20250623-122713-marostegui.json [production]
12:27 <jclark@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on aux-k8s-worker1008.eqiad.wmnet with reason: host reimage [production]
12:27 <kamila@deploy1003> helmfile [codfw] OK helmfile.d/services/eventgate-analytics: apply [production]
12:26 <kamila@deploy1003> helmfile [codfw] OK helmfile.d/services/editor-analytics: apply [production]
12:26 <kamila@deploy1003> helmfile [codfw] OK helmfile.d/services/edit-analytics: apply [production]
12:25 <kamila@deploy1003> helmfile [codfw] OK helmfile.d/services/echostore: apply [production]
12:25 <kamila@deploy1003> helmfile [codfw] OK helmfile.d/services/device-analytics: apply [production]
12:24 <kamila@deploy1003> helmfile [codfw] OK helmfile.d/services/developer-portal: apply [production]
12:24 <kamila@deploy1003> helmfile [codfw] OK helmfile.d/services/data-gateway: apply [production]
12:24 <kamila@deploy1003> helmfile [codfw] OK helmfile.d/services/cxserver: apply [production]
12:24 <jclark@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on aux-k8s-worker1008.eqiad.wmnet with reason: host reimage [production]
12:23 <kamila@deploy1003> helmfile [codfw] OK helmfile.d/services/commons-impact-analytics: apply [production]
12:23 <kamila@deploy1003> helmfile [codfw] OK helmfile.d/services/citoid: apply [production]
12:23 <kamila@deploy1003> helmfile [codfw] OK helmfile.d/services/cirrus-streaming-updater: apply [production]
12:23 <kamila@deploy1003> helmfile [codfw] OK helmfile.d/services/chart-renderer: apply [production]
12:22 <kamila@deploy1003> helmfile [codfw] OK helmfile.d/services/changeprop-jobqueue: apply [production]
12:22 <kamila@deploy1003> helmfile [codfw] OK helmfile.d/services/changeprop: apply [production]
12:22 <akosiaris@cumin1003> START - Cookbook sre.hosts.reimage for host aux-k8s-worker2007.codfw.wmnet with OS bookworm [production]
12:20 <kamila@deploy1003> helmfile [codfw] OK helmfile.d/services/api-gateway: apply [production]
12:20 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 15 hosts with reason: maintenance [production]
12:20 <kamila@deploy1003> helmfile [codfw] OK helmfile.d/services/apertium: apply [production]
12:17 <kamila@deploy1003> helmfile [codfw] DONE helmfile.d/admin 'sync'. [production]
12:12 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1243', diff saved to https://phabricator.wikimedia.org/P78637 and previous config saved to /var/cache/conftool/dbconfig/20250623-121206-marostegui.json [production]
12:12 <jclark@cumin1002> START - Cookbook sre.hosts.reimage for host aux-k8s-worker1008.eqiad.wmnet with OS bookworm [production]
12:11 <kamila@deploy1003> helmfile [codfw] START helmfile.d/admin 'sync'. [production]
12:09 <jclark@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host aux-k8s-worker1008.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
12:08 <jclark@cumin1002> START - Cookbook sre.hosts.provision for host aux-k8s-worker1008.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
12:06 <kartik@deploy1003> helmfile [staging] DONE helmfile.d/services/machinetranslation: apply [production]
11:57 <jiji@cumin1003> conftool action : set/pooled=false; selector: dnsdisc=swift-ro,name=codfw [production]
11:57 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1243 (T396130)', diff saved to https://phabricator.wikimedia.org/P78636 and previous config saved to /var/cache/conftool/dbconfig/20250623-115659-marostegui.json [production]
11:53 <jiji@cumin1003> conftool action : set/pooled=false; selector: dnsdisc=swift-rw,name=codfw [production]
11:52 <jmm@cumin1003> END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host debmonitor-dev2001.codfw.wmnet [production]
11:52 <jmm@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host debmonitor-dev2001.codfw.wmnet with OS bookworm [production]
11:50 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db1243 (T396130)', diff saved to https://phabricator.wikimedia.org/P78635 and previous config saved to /var/cache/conftool/dbconfig/20250623-115013-marostegui.json [production]