401-450 of 10000 results (98ms)
2025-12-10 ยง
14:51 <jclark@cumin1003> START - Cookbook sre.hosts.provision for host aqs1024.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
14:51 <jclark@cumin1003> START - Cookbook sre.hosts.provision for host aqs1023.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
14:49 <Lucas_WMDE> UTC afternoon backport+config window done [production]
14:49 <lucaswerkmeister-wmde@deploy2002> Finished scap sync-world: Backport for [[gerrit:1217124|Set wgEnableWatchlistLabels for beta (T411836)]] (duration: 07m 21s) [production]
14:45 <lucaswerkmeister-wmde@deploy2002> lucaswerkmeister-wmde, samtar: Continuing with sync [production]
14:44 <lucaswerkmeister-wmde@deploy2002> lucaswerkmeister-wmde, samtar: Backport for [[gerrit:1217124|Set wgEnableWatchlistLabels for beta (T411836)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
14:41 <lucaswerkmeister-wmde@deploy2002> Started scap sync-world: Backport for [[gerrit:1217124|Set wgEnableWatchlistLabels for beta (T411836)]] [production]
14:26 <arlolra@deploy2002> Finished scap sync-world: Backport for [[gerrit:1216674|ExtensionDistributor: mark 1.45 as stable (T408482)]] (duration: 06m 29s) [production]
14:22 <arlolra@deploy2002> arlolra, macfan4000: Continuing with sync [production]
14:22 <arlolra@deploy2002> arlolra, macfan4000: Backport for [[gerrit:1216674|ExtensionDistributor: mark 1.45 as stable (T408482)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
14:20 <arlolra@deploy2002> Started scap sync-world: Backport for [[gerrit:1216674|ExtensionDistributor: mark 1.45 as stable (T408482)]] [production]
14:13 <sbisson@deploy2002> Finished scap sync-world: Backport for [[gerrit:1217181|CX3 Build 1.0.0+20251209 (T384485 T408845 T409332 T409337 T409338 T411779)]] (duration: 09m 01s) [production]
14:10 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Depooling db2228 (T410589)', diff saved to https://phabricator.wikimedia.org/P86501 and previous config saved to /var/cache/conftool/dbconfig/20251210-141046-ladsgroup.json [production]
14:10 <ladsgroup@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on db2228.codfw.wmnet with reason: Maintenance [production]
14:10 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2223 (T410589)', diff saved to https://phabricator.wikimedia.org/P86500 and previous config saved to /var/cache/conftool/dbconfig/20251210-141022-ladsgroup.json [production]
14:08 <sbisson@deploy2002> sbisson: Continuing with sync [production]
14:07 <sbisson@deploy2002> sbisson: Backport for [[gerrit:1217181|CX3 Build 1.0.0+20251209 (T384485 T408845 T409332 T409337 T409338 T411779)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
14:04 <sbisson@deploy2002> Started scap sync-world: Backport for [[gerrit:1217181|CX3 Build 1.0.0+20251209 (T384485 T408845 T409332 T409337 T409338 T411779)]] [production]
13:55 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2223', diff saved to https://phabricator.wikimedia.org/P86499 and previous config saved to /var/cache/conftool/dbconfig/20251210-135514-ladsgroup.json [production]
13:53 <kart_> Updated Recommendation API to 2025-12-09-164214-production (T384485, T409338, T409332) [production]
13:51 <kartik@deploy2002> helmfile [ml-serve-codfw] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . [production]
13:47 <kartik@deploy2002> helmfile [ml-serve-eqiad] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . [production]
13:41 <kartik@deploy2002> helmfile [ml-staging-codfw] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . [production]
13:40 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2223', diff saved to https://phabricator.wikimedia.org/P86497 and previous config saved to /var/cache/conftool/dbconfig/20251210-134007-ladsgroup.json [production]
13:27 <hnowlan@cumin2002> END (PASS) - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies (exit_code=0) rolling restart_daemons on A:swift-fe-eqiad [production]
13:25 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2223 (T410589)', diff saved to https://phabricator.wikimedia.org/P86496 and previous config saved to /var/cache/conftool/dbconfig/20251210-132459-ladsgroup.json [production]
13:20 <hnowlan@cumin2002> START - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies rolling restart_daemons on A:swift-fe-eqiad [production]
12:53 <btullis@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/analytics-test: apply [production]
12:52 <btullis@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/analytics-test: apply [production]
11:50 <btullis@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/postgresql-airflow-analytics-test: apply [production]
11:50 <btullis@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/postgresql-airflow-analytics-test: apply [production]
11:41 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dse-k8s-worker1017.eqiad.wmnet [production]
11:35 <btullis@cumin1003> START - Cookbook sre.hosts.reboot-single for host dse-k8s-worker1017.eqiad.wmnet [production]
10:39 <dpogorzelski@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ml-build1001.eqiad.wmnet with reason: host reimage [production]
10:35 <dpogorzelski@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on ml-build1001.eqiad.wmnet with reason: host reimage [production]
10:19 <dpogorzelski@cumin1003> START - Cookbook sre.hosts.reimage for host ml-build1001.eqiad.wmnet with OS trixie [production]
10:14 <dpogorzelski@cumin1003> END (PASS) - Cookbook sre.hosts.rename (exit_code=0) from ml-lab1001 to ml-build1001 [production]
10:13 <dpogorzelski@cumin1003> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host ml-build1001 [production]
10:11 <jelto@puppetserver1001> conftool action : set/pooled=no; selector: cluster=tcp-proxy,service=gerrit [production]
10:11 <dpogorzelski@cumin1003> START - Cookbook sre.network.configure-switch-interfaces for host ml-build1001 [production]
10:11 <dpogorzelski@cumin1003> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) ml-build1001 on all recursors [production]
10:11 <dpogorzelski@cumin1003> START - Cookbook sre.dns.wipe-cache ml-build1001 on all recursors [production]
10:11 <dpogorzelski@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
10:11 <dpogorzelski@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Renaming ml-lab1001 to ml-build1001 - dpogorzelski@cumin1003" [production]
10:10 <dpogorzelski@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Renaming ml-lab1001 to ml-build1001 - dpogorzelski@cumin1003" [production]
10:04 <dpogorzelski@cumin1003> START - Cookbook sre.dns.netbox [production]
10:04 <dpogorzelski@cumin1003> START - Cookbook sre.hosts.rename from ml-lab1001 to ml-build1001 [production]
10:01 <jelto@puppetserver1001> conftool action : set/pooled=no; selector: cluster=tcp-proxy,service=gerrit,dc=drmrs [production]
09:50 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
09:50 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]