301-350 of 10000 results (104ms)
2025-12-17 ยง
18:32 <cmooney@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host es2028.codfw.wmnet with OS trixie [production]
17:54 <cmooney@cumin1003> START - Cookbook sre.hosts.reimage for host es2028.codfw.wmnet with OS trixie [production]
17:51 <topranks> upgrading OS on lswtest-d8-eqiad T412733 [production]
17:51 <cmooney@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ssw1-d[1,8]-eqiad with reason: upgradiing sr-linux on lswtest-d8-eqiad [production]
17:50 <cmooney@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ssw1-d[1,8]-eqiad.mgmt with reason: upgradiing sr-linux on lswtest-d8-eqiad [production]
17:46 <cmooney@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host es2028.codfw.wmnet with OS trixie [production]
17:34 <cmooney@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on sretest1006.eqiad.wmnet with reason: upgrading connected switch [production]
17:33 <cmooney@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on lswtest-d8-eqiad,lswtest-d8-eqiad IPv6 with reason: upgradiing sr-linux on lswtest-d8-eqiad [production]
17:28 <cmooney@cumin1003> END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host es2028 [production]
17:28 <cmooney@cumin1003> START - Cookbook sre.hosts.move-vlan for host es2028 [production]
17:28 <cmooney@cumin1003> START - Cookbook sre.hosts.reimage for host es2028.codfw.wmnet with OS trixie [production]
17:27 <swfrench@deploy2002> Started scap sync-world: Rebuild deployment to pick up new production image [production]
17:24 <cmooney@cumin1003> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host es2028.codfw.wmnet with OS trixie [production]
17:12 <swfrench-wmf> reprepro include php8.3_8.3.28-1+wmf11u2 in component/php83 [production]
17:08 <fabfur@cumin1003> conftool action : set/pooled=yes; selector: name=cp7009.* [production]
17:03 <fabfur> enabling puppet and repooling cp7009 (T412785) [production]
16:38 <eevans@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host restbase1031.eqiad.wmnet [production]
16:31 <eevans@cumin1003> START - Cookbook sre.hosts.reboot-single for host restbase1031.eqiad.wmnet [production]
15:50 <eevans@cumin1003> conftool action : set/pooled=yes; selector: dc=eqiad,cluster=restbase,service=restbase-ssl [production]
15:50 <eevans@cumin1003> conftool action : set/pooled=yes; selector: dc=eqiad,cluster=restbase,service=restbase-https [production]
15:49 <eevans@cumin1003> conftool action : set/pooled=yes; selector: dc=eqiad,cluster=restbase,service=restbase-backend [production]
15:45 <eevans@cumin1003> conftool action : set/pooled=yes; selector: dc=eqiad,cluster=restbase,service=restbase-* [production]
15:28 <moritzm> upgrade Envoy on etherpad* T410975 [production]
15:12 <ayounsi@cumin1003> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) restbase1031.eqiad.wmnet on all recursors [production]
15:12 <ayounsi@cumin1003> START - Cookbook sre.dns.wipe-cache restbase1031.eqiad.wmnet on all recursors [production]
15:12 <ayounsi@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
15:12 <ayounsi@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add AAAA to restbase1031 - ayounsi@cumin1003" [production]
15:11 <ayounsi@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add AAAA to restbase1031 - ayounsi@cumin1003" [production]
15:11 <cmooney@cumin1003> END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host es2028 [production]
15:11 <cmooney@cumin1003> START - Cookbook sre.hosts.move-vlan for host es2028 [production]
15:11 <cmooney@cumin1003> START - Cookbook sre.hosts.reimage for host es2028.codfw.wmnet with OS trixie [production]
15:07 <ayounsi@cumin1003> START - Cookbook sre.dns.netbox [production]
15:06 <XioNoX> add AAAA record to restbase1031.eqiad.wmnet - T271140 [production]
15:05 <cgoubert@deploy2002> helmfile [codfw] DONE helmfile.d/services/api-gateway: apply [production]
15:05 <cgoubert@deploy2002> helmfile [codfw] START helmfile.d/services/api-gateway: apply [production]
15:04 <Lucas_WMDE> UTC afternoon backport+config window done [production]
15:03 <cgoubert@deploy2002> helmfile [eqiad] DONE helmfile.d/services/api-gateway: apply [production]
15:03 <cgoubert@deploy2002> helmfile [eqiad] START helmfile.d/services/api-gateway: apply [production]
15:01 <cgoubert@deploy2002> helmfile [staging] DONE helmfile.d/services/api-gateway: apply [production]
15:01 <cgoubert@deploy2002> helmfile [staging] START helmfile.d/services/api-gateway: apply [production]
14:59 <moritzm> installing nodejs security updates [production]
14:53 <cgoubert@deploy2002> helmfile [codfw] DONE helmfile.d/services/rest-gateway: apply [production]
14:53 <cgoubert@deploy2002> helmfile [codfw] START helmfile.d/services/rest-gateway: apply [production]
14:51 <lucaswerkmeister-wmde@deploy2002> Finished scap sync-world: Backport for [[gerrit:1219158|Revert "Enable v2 non-emergency workflow by default" (T410512 T412715)]], [[gerrit:1218806|Activate post-processing cache on some wikis (T348255)]] (duration: 18m 45s) [production]
14:50 <cgoubert@deploy2002> helmfile [eqiad] DONE helmfile.d/services/rest-gateway: apply [production]
14:50 <cgoubert@deploy2002> helmfile [eqiad] START helmfile.d/services/rest-gateway: apply [production]
14:47 <lucaswerkmeister-wmde@deploy2002> lucaswerkmeister-wmde, ihurbain: Continuing with sync [production]
14:41 <moritzm> installing tiff security updates [production]
14:35 <lucaswerkmeister-wmde@deploy2002> lucaswerkmeister-wmde, ihurbain: Backport for [[gerrit:1219158|Revert "Enable v2 non-emergency workflow by default" (T410512 T412715)]], [[gerrit:1218806|Activate post-processing cache on some wikis (T348255)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
14:33 <lucaswerkmeister-wmde@deploy2002> Started scap sync-world: Backport for [[gerrit:1219158|Revert "Enable v2 non-emergency workflow by default" (T410512 T412715)]], [[gerrit:1218806|Activate post-processing cache on some wikis (T348255)]] [production]