1-50 of 10000 results (151ms)
2026-03-13 §
07:56 <moritzm> installing Linux 6.12.74 on Trixie hosts [production]
07:55 <moritzm> installing 6.12.74 on Trixie hosts [production]
02:57 <cdobbins@cumin2002> conftool action : set/pooled=yes; selector: name=cp4044.ulsfo.wmnet [reason: trixie reimaging] [production]
02:09 <mwpresync@deploy2002> Finished scap build-images: Publishing wmf/next image (duration: 08m 18s) [production]
02:00 <mwpresync@deploy2002> Started scap build-images: Publishing wmf/next image [production]
01:41 <cdobbins@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp4044.ulsfo.wmnet with OS trixie [production]
01:37 <mutante> contint1003/contint2003 - every time(?) we setup machines with puppet using our httpd module and PHP - and puppet runs for the first time we run into the same old issue with "Exec[ensure_present_mod_php" failing and "Considering conflict mpm_worker for mpm_prefork"sudo a2dismod mpm_event". The fix is: 'sudo a2dismod mpm_event' and run puppet again. T418521 [production]
01:26 <dzahn@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5 days, 0:00:00 on contint1003.wikimedia.org with reason: T418521 [production]
01:26 <dzahn@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5 days, 0:00:00 on contint2003.wikimedia.org with reason: T418521 [production]
01:23 <dzahn@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on contint2003.wikimedia.org with reason: setup [production]
01:22 <dzahn@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on contint1003.wikimedia.org with reason: setup [production]
01:22 <brett@puppetserver1001> conftool action : set/pooled=yes; selector: name=cp4047.* [production]
01:09 <cdobbins@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp4044.ulsfo.wmnet with reason: host reimage [production]
01:08 <cdobbins@cumin2002> conftool action : set/pooled=yes; selector: name=cp4043.ulsfo.wmnet [reason: trixie reimaging] [production]
01:06 <cdobbins@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cp4044.ulsfo.wmnet with reason: host reimage [production]
01:05 <cdobbins@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp4043.ulsfo.wmnet with OS trixie [production]
00:55 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp4047.ulsfo.wmnet with OS trixie [production]
00:45 <cdobbins@cumin2002> START - Cookbook sre.hosts.reimage for host cp4044.ulsfo.wmnet with OS trixie [production]
00:45 <cdobbins@cumin2002> conftool action : set/pooled=no; selector: name=cp4044.ulsfo.wmnet [reason: trixie reimaging] [production]
00:42 <cdobbins@cumin2002> conftool action : set/pooled=yes; selector: name=cp4042.ulsfo.wmnet [reason: trixie reimaging] [production]
00:41 <cdobbins@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp4042.ulsfo.wmnet with OS trixie [production]
00:39 <cdobbins@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp4043.ulsfo.wmnet with reason: host reimage [production]
00:31 <cdobbins@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cp4043.ulsfo.wmnet with reason: host reimage [production]
00:30 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp4047.ulsfo.wmnet with reason: host reimage [production]
00:27 <rzl@deploy2002> Finished scap sync-world: https://gerrit.wikimedia.org/r/1251187 T419637 (duration: 07m 12s) [production]
00:23 <rzl@deploy2002> rzl: Continuing with sync [production]
00:23 <brett@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cp4047.ulsfo.wmnet with reason: host reimage [production]
00:22 <rzl@deploy2002> rzl: https://gerrit.wikimedia.org/r/1251187 T419637 synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
00:21 <rzl@deploy2002> Started scap sync-world: https://gerrit.wikimedia.org/r/1251187 T419637 [production]
00:15 <cdobbins@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp4042.ulsfo.wmnet with reason: host reimage [production]
00:14 <cdobbins@cumin2002> conftool action : set/pooled=yes; selector: name=cp4040.ulsfo.wmnet [reason: trixie reimaging] [production]
00:11 <cdobbins@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cp4042.ulsfo.wmnet with reason: host reimage [production]
00:11 <cdobbins@cumin2002> START - Cookbook sre.hosts.reimage for host cp4043.ulsfo.wmnet with OS trixie [production]
00:10 <cdobbins@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp4040.ulsfo.wmnet with OS trixie [production]
00:04 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host cp4047.ulsfo.wmnet with OS trixie [production]
00:03 <brett@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp4047.ulsfo.wmnet with OS trixie [production]
00:03 <cdobbins@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cp4043.ulsfo.wmnet with OS trixie [production]
2026-03-12 §
23:57 <herron@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host o11ytest1001.eqiad.wmnet with OS trixie [production]
23:53 <rzl@deploy2002> helmfile [codfw] DONE helmfile.d/services/rest-gateway: apply [production]
23:53 <rzl@deploy2002> helmfile [codfw] START helmfile.d/services/rest-gateway: apply [production]
23:50 <cdobbins@cumin2002> START - Cookbook sre.hosts.reimage for host cp4042.ulsfo.wmnet with OS trixie [production]
23:49 <rzl@deploy2002> helmfile [eqiad] DONE helmfile.d/services/rest-gateway: apply [production]
23:48 <rzl@deploy2002> helmfile [eqiad] START helmfile.d/services/rest-gateway: apply [production]
23:45 <rzl@deploy2002> helmfile [staging] DONE helmfile.d/services/rest-gateway: apply [production]
23:45 <rzl@deploy2002> helmfile [staging] START helmfile.d/services/rest-gateway: apply [production]
23:45 <cdobbins@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp4040.ulsfo.wmnet with reason: host reimage [production]
23:44 <cdobbins@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cp4042.ulsfo.wmnet with OS trixie [production]
23:41 <cdobbins@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cp4040.ulsfo.wmnet with reason: host reimage [production]
23:41 <rzl@deploy2002> helmfile [codfw] DONE helmfile.d/services/api-gateway: apply [production]
23:41 <rzl@deploy2002> helmfile [codfw] START helmfile.d/services/api-gateway: apply [production]