151-200 of 10000 results (108ms)
2025-12-10 ยง
14:07 <sbisson@deploy2002> sbisson: Backport for [[gerrit:1217181|CX3 Build 1.0.0+20251209 (T384485 T408845 T409332 T409337 T409338 T411779)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
14:04 <sbisson@deploy2002> Started scap sync-world: Backport for [[gerrit:1217181|CX3 Build 1.0.0+20251209 (T384485 T408845 T409332 T409337 T409338 T411779)]] [production]
13:55 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2223', diff saved to https://phabricator.wikimedia.org/P86499 and previous config saved to /var/cache/conftool/dbconfig/20251210-135514-ladsgroup.json [production]
13:53 <kart_> Updated Recommendation API to 2025-12-09-164214-production (T384485, T409338, T409332) [production]
13:51 <kartik@deploy2002> helmfile [ml-serve-codfw] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . [production]
13:47 <kartik@deploy2002> helmfile [ml-serve-eqiad] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . [production]
13:41 <kartik@deploy2002> helmfile [ml-staging-codfw] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . [production]
13:40 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2223', diff saved to https://phabricator.wikimedia.org/P86497 and previous config saved to /var/cache/conftool/dbconfig/20251210-134007-ladsgroup.json [production]
13:27 <hnowlan@cumin2002> END (PASS) - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies (exit_code=0) rolling restart_daemons on A:swift-fe-eqiad [production]
13:25 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2223 (T410589)', diff saved to https://phabricator.wikimedia.org/P86496 and previous config saved to /var/cache/conftool/dbconfig/20251210-132459-ladsgroup.json [production]
13:20 <hnowlan@cumin2002> START - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies rolling restart_daemons on A:swift-fe-eqiad [production]
12:53 <btullis@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/analytics-test: apply [production]
12:52 <btullis@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/analytics-test: apply [production]
11:50 <btullis@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/postgresql-airflow-analytics-test: apply [production]
11:50 <btullis@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/postgresql-airflow-analytics-test: apply [production]
11:41 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dse-k8s-worker1017.eqiad.wmnet [production]
11:35 <btullis@cumin1003> START - Cookbook sre.hosts.reboot-single for host dse-k8s-worker1017.eqiad.wmnet [production]
10:39 <dpogorzelski@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ml-build1001.eqiad.wmnet with reason: host reimage [production]
10:35 <dpogorzelski@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on ml-build1001.eqiad.wmnet with reason: host reimage [production]
10:19 <dpogorzelski@cumin1003> START - Cookbook sre.hosts.reimage for host ml-build1001.eqiad.wmnet with OS trixie [production]
10:14 <dpogorzelski@cumin1003> END (PASS) - Cookbook sre.hosts.rename (exit_code=0) from ml-lab1001 to ml-build1001 [production]
10:13 <dpogorzelski@cumin1003> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host ml-build1001 [production]
10:11 <jelto@puppetserver1001> conftool action : set/pooled=no; selector: cluster=tcp-proxy,service=gerrit [production]
10:11 <dpogorzelski@cumin1003> START - Cookbook sre.network.configure-switch-interfaces for host ml-build1001 [production]
10:11 <dpogorzelski@cumin1003> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) ml-build1001 on all recursors [production]
10:11 <dpogorzelski@cumin1003> START - Cookbook sre.dns.wipe-cache ml-build1001 on all recursors [production]
10:11 <dpogorzelski@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
10:11 <dpogorzelski@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Renaming ml-lab1001 to ml-build1001 - dpogorzelski@cumin1003" [production]
10:10 <dpogorzelski@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Renaming ml-lab1001 to ml-build1001 - dpogorzelski@cumin1003" [production]
10:04 <dpogorzelski@cumin1003> START - Cookbook sre.dns.netbox [production]
10:04 <dpogorzelski@cumin1003> START - Cookbook sre.hosts.rename from ml-lab1001 to ml-build1001 [production]
10:01 <jelto@puppetserver1001> conftool action : set/pooled=no; selector: cluster=tcp-proxy,service=gerrit,dc=drmrs [production]
09:50 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
09:50 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
09:47 <jelto@puppetserver1001> conftool action : set/pooled=no; selector: name=tcp-proxy6001.drmrs.wmnet [production]
09:15 <joal@deploy2002> Finished deploy [analytics/refinery@6e8f9d4] (thin): Regular analytics train THIN [analytics/refinery@6e8f9d4a] (duration: 01m 13s) [production]
09:14 <joal@deploy2002> Started deploy [analytics/refinery@6e8f9d4] (thin): Regular analytics train THIN [analytics/refinery@6e8f9d4a] [production]
09:14 <joal@deploy2002> Finished deploy [analytics/refinery@6e8f9d4]: Regular analytics train [analytics/refinery@6e8f9d4a] (duration: 02m 30s) [production]
09:11 <joal@deploy2002> Started deploy [analytics/refinery@6e8f9d4]: Regular analytics train [analytics/refinery@6e8f9d4a] [production]
09:11 <joal@deploy2002> Finished deploy [analytics/refinery@6e8f9d4] (hadoop-test): Regular analytics train TEST [analytics/refinery@6e8f9d4a] (duration: 01m 04s) [production]
09:10 <joal@deploy2002> Started deploy [analytics/refinery@6e8f9d4] (hadoop-test): Regular analytics train TEST [analytics/refinery@6e8f9d4a] [production]
05:56 <dpogorzelski@cumin1003> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host ml-lab1001.eqiad.wmnet with OS trixie [production]
05:51 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Depooling db2223 (T410589)', diff saved to https://phabricator.wikimedia.org/P86492 and previous config saved to /var/cache/conftool/dbconfig/20251210-055138-ladsgroup.json [production]
05:51 <ladsgroup@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on db2223.codfw.wmnet with reason: Maintenance [production]
05:51 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2213 (T410589)', diff saved to https://phabricator.wikimedia.org/P86491 and previous config saved to /var/cache/conftool/dbconfig/20251210-055125-ladsgroup.json [production]
05:36 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2213', diff saved to https://phabricator.wikimedia.org/P86490 and previous config saved to /var/cache/conftool/dbconfig/20251210-053618-ladsgroup.json [production]
05:21 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2213', diff saved to https://phabricator.wikimedia.org/P86489 and previous config saved to /var/cache/conftool/dbconfig/20251210-052110-ladsgroup.json [production]
05:06 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2213 (T410589)', diff saved to https://phabricator.wikimedia.org/P86488 and previous config saved to /var/cache/conftool/dbconfig/20251210-050603-ladsgroup.json [production]
01:57 <cstone> SmashPig upgraded from 1442d0a0 to 5c731f99 [production]
01:18 <mwpresync@deploy2002> Finished scap build-images: Publishing wmf/next image (duration: 17m 50s) [production]