751-800 of 10000 results (25ms)
2026-05-11 §
12:47 <kharlan@deploy1003> kharlan: Continuing with deployment [production]
12:45 <kharlan@deploy1003> kharlan: Backport for [[gerrit:1285789|hCaptcha: Enable editing on group0 wikis (T425354)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
12:41 <kharlan@deploy1003> Started scap sync-world: Backport for [[gerrit:1285789|hCaptcha: Enable editing on group0 wikis (T425354)]] [production]
12:25 <jiji@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc1055.eqiad.wmnet with reason: host reimage [production]
12:18 <jiji@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on mc1055.eqiad.wmnet with reason: host reimage [production]
12:05 <jiji@cumin1003> START - Cookbook sre.hosts.reimage for host mc1055.eqiad.wmnet with OS trixie [production]
12:04 <topranks> push out updated ACL to Nokia switches for BGP connections (T425703) and add BFD config (T425813) [production]
11:48 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db2185.codfw.wmnet with reason: Reboot [production]
11:31 <moritzm> installing Linux 6.12.86 on Trixie hosts [production]
11:27 <jayme@deploy1003> helmfile [codfw] DONE helmfile.d/services/mw-videoscaler: apply [production]
11:27 <jayme@deploy1003> helmfile [codfw] START helmfile.d/services/mw-videoscaler: apply [production]
11:21 <jayme@deploy1003> Finished scap sync-world: upgrade rsyslog on all deployments T418200 (duration: 13m 28s) [production]
11:21 <jayme@deploy1003> Rolling back deployment [production]
11:08 <jayme@deploy1003> Started scap sync-world: upgrade rsyslog on all deployments T418200 [production]
11:03 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on dbstore1008.eqiad.wmnet with reason: Maintenance [production]
11:00 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on dbstore1009.eqiad.wmnet with reason: Maintenance [production]
10:59 <jayme> uprading rsyslog to 8.2504.0-1 in all mediawiki deployments - T418200 [production]
10:52 <taavi@cumin1003> DONE (PASS) - Cookbook sre.idm.logout (exit_code=0) Logging Clément Goubert out of all services on: 2459 hosts [production]
10:41 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on dbstore1008.eqiad.wmnet with reason: Maintenance [production]
10:35 <fnegri@cloudcumin1001> END (FAIL) - Cookbook wmcs.toolforge.component.deploy (exit_code=99) for component builds-api (T423417) [toolsbeta]
10:31 <fnegri@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component builds-api (T423417) [toolsbeta]
10:26 <jayme@deploy1003> Finished scap sync-world: update rsyslog image (duration: 03m 48s) [production]
10:23 <jayme@deploy1003> Started scap sync-world: update rsyslog image [production]
10:22 <jayme@deploy1003> helmfile [eqiad] DONE helmfile.d/services/ratelimit: apply [production]
10:21 <jayme@deploy1003> helmfile [eqiad] START helmfile.d/services/ratelimit: apply [production]
10:21 <jayme@deploy1003> helmfile [codfw] DONE helmfile.d/services/ratelimit: apply [production]
10:21 <jayme@deploy1003> helmfile [codfw] START helmfile.d/services/ratelimit: apply [production]
10:16 <slyngs> Migrate of lvs2012 due to hardware issues [production]
10:14 <jayme@deploy1003> helmfile [staging] DONE helmfile.d/services/ratelimit: apply [production]
10:13 <jayme@deploy1003> helmfile [staging] START helmfile.d/services/ratelimit: apply [production]
10:13 <jayme@deploy1003> helmfile [staging] DONE helmfile.d/services/ratelimit: apply [production]
10:13 <jayme@deploy1003> helmfile [staging] START helmfile.d/services/ratelimit: apply [production]
10:13 <jayme@deploy1003> helmfile [staging] DONE helmfile.d/services/ratelimit: apply [production]
10:12 <jayme@deploy1003> helmfile [staging] START helmfile.d/services/ratelimit: apply [production]
10:11 <kharlan@deploy1003> Finished scap sync-world: Backport for [[gerrit:1285731|hCaptcha: Enable for group0 wikis (T425354)]] (duration: 30m 15s) [production]
10:10 <moritzm> rebalance routed Ganeti cluster in eqsin T421863 [production]
10:06 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1245.eqiad.wmnet with reason: Maintenance [production]
10:04 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1245.eqiad.wmnet with reason: Maintenance [production]
10:01 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1245.eqiad.wmnet with reason: Maintenance [production]
10:01 <fceratto@cumin1003> DONE (ERROR) - Cookbook sre.hosts.downtime (exit_code=97) for 12:00:00 on db1245.eqiad.wmnet with reason: Maintenance [production]
09:59 <kharlan@deploy1003> kharlan: Continuing with deployment [production]
09:58 <jelto@deploy1003> helmfile [aux-k8s-codfw] DONE helmfile.d/services/miscweb: apply [production]
09:58 <jelto@deploy1003> helmfile [aux-k8s-codfw] START helmfile.d/services/miscweb: apply [production]
09:58 <jelto@deploy1003> helmfile [aux-k8s-eqiad] DONE helmfile.d/services/miscweb: apply [production]
09:58 <kharlan@deploy1003> kharlan: Backport for [[gerrit:1285731|hCaptcha: Enable for group0 wikis (T425354)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
09:57 <jelto@deploy1003> helmfile [aux-k8s-eqiad] START helmfile.d/services/miscweb: apply [production]
09:57 <slyngshede@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 30 days, 0:00:00 on lvs2012.codfw.wmnet with reason: Hardware failure [production]
09:57 <slyngshede@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on lvs2012.codfw.wmnet with reason: Hardware failure [production]
09:46 <jelto@deploy1003> helmfile [aux-k8s-eqiad] DONE helmfile.d/services/miscweb: apply [production]
09:46 <jelto@deploy1003> helmfile [aux-k8s-eqiad] START helmfile.d/services/miscweb: apply [production]