2026-06-04
12:05 <taavi@cloudcumin1001> Updating container image docker-registry.svc.toolforge.org/toolforge-kyverno-cleanup-controller:v1.16.4 [tools]
12:04 <marostegui@cumin1003> START - Cookbook sre.mysql.pool pool es2050: repool after upgrade [production]
12:04 <taavi@cloudcumin1001> Updating container image docker-registry.svc.toolforge.org/toolforge-kyverno-background-controller:v1.16.4 [tools]
12:04 <marostegui@cumin1003> END (FAIL) - Cookbook sre.mysql.major-upgrade (exit_code=99) [production]
12:04 <taavi@cloudcumin1001> Updating container image docker-registry.svc.toolforge.org/toolforge-kyverno-kyvernopre:v1.16.4 [tools]
12:04 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1220.eqiad.wmnet with reason: host reimage [production]
12:03 <taavi@cloudcumin1001> Updating container image docker-registry.svc.toolforge.org/toolforge-kyverno-kyverno-cli:v1.16.4 [tools]
12:03 <taavi@cloudcumin1001> Updating container image docker-registry.svc.toolforge.org/toolforge-kyverno-kyverno:v1.16.4 [tools]
12:03 <taavi@cloudcumin1001> START - Cookbook wmcs.toolforge.k8s.kyverno.copy_images_to_registry [tools]
12:02 <taavi@cloudcumin1001> END (FAIL) - Cookbook wmcs.toolforge.k8s.kyverno.copy_images_to_registry (exit_code=99) [tools]
12:02 <taavi@cloudcumin1001> Updating container image docker-registry.svc.toolforge.org/toolforge-kyverno-kyverno-cli:v1.16.4 [tools]
12:01 <taavi@cloudcumin1001> Updating container image docker-registry.svc.toolforge.org/toolforge-kyverno-kyverno:v1.16.4 [tools]
12:01 <taavi@cloudcumin1001> START - Cookbook wmcs.toolforge.k8s.kyverno.copy_images_to_registry [tools]
11:59 <marostegui@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on db1220.eqiad.wmnet with reason: host reimage [production]
11:42 <marostegui@cumin1003> START - Cookbook sre.hosts.reimage for host db1220.eqiad.wmnet with OS trixie [production]
11:40 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host es2050.codfw.wmnet with OS trixie [production]
11:40 <marostegui@cumin1003> END (PASS) - Cookbook sre.mysql.depool (exit_code=0) depool db1220: Upgrading db1220.eqiad.wmnet [production]
11:37 <marostegui@cumin1003> START - Cookbook sre.mysql.depool depool db1220: Upgrading db1220.eqiad.wmnet [production]
11:36 <marostegui@cumin1003> START - Cookbook sre.mysql.major-upgrade [production]
11:32 <jgleeson> payments-wiki upgraded from 3bc70a73 to aef3d25d [fundraising]
11:32 <marostegui@cumin1003> END (PASS) - Cookbook sre.mysql.major-upgrade (exit_code=0) [production]
11:32 <marostegui@cumin1003> END (PASS) - Cookbook sre.mysql.pool (exit_code=0) pool db1179: Migration of db1179.eqiad.wmnet completed [production]
11:23 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on es2050.codfw.wmnet with reason: host reimage [production]
11:16 <marostegui@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on es2050.codfw.wmnet with reason: host reimage [production]
11:13 <wmftkbot> Test Kitchen mw-user experiment (poll 84311) - adds: none; removes: none; fields: incident_reporting_system_interaction - xLab/MPIC/TK tips at https://w.wiki/FwuD [analytics]
11:00 <marostegui@cumin1003> START - Cookbook sre.hosts.reimage for host es2050.codfw.wmnet with OS trixie [production]
11:00 <marostegui@cumin1003> END (PASS) - Cookbook sre.mysql.depool (exit_code=0) depool es2050: Upgrading es2050.codfw.wmnet [production]
10:59 <marostegui@cumin1003> START - Cookbook sre.mysql.depool depool es2050: Upgrading es2050.codfw.wmnet [production]
10:59 <marostegui@cumin1003> START - Cookbook sre.mysql.major-upgrade [production]
10:59 <marostegui@cumin1003> END (PASS) - Cookbook sre.mysql.pool (exit_code=0) pool es2057: repool after upgrade [production]
10:58 <cmooney@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
10:55 <cmooney@cumin1003> START - Cookbook sre.dns.netbox [production]
10:46 <marostegui@cumin1003> START - Cookbook sre.mysql.pool pool db1179: Migration of db1179.eqiad.wmnet completed [production]
10:38 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db1179.eqiad.wmnet with OS trixie [production]
10:19 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1179.eqiad.wmnet with reason: host reimage [production]
10:16 <jgiannelos@deploy1003> helmfile [staging] DONE helmfile.d/services/tegola-vector-tiles: apply [production]
10:15 <jgiannelos@deploy1003> helmfile [staging] START helmfile.d/services/tegola-vector-tiles: apply [production]
10:15 <jgiannelos@deploy1003> helmfile [staging] DONE helmfile.d/services/kartotherian: apply [production]
10:15 <jgiannelos@deploy1003> helmfile [staging] START helmfile.d/services/kartotherian: apply [production]
10:15 <marostegui@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on db1179.eqiad.wmnet with reason: host reimage [production]
10:13 <marostegui@cumin1003> START - Cookbook sre.mysql.pool pool es2057: repool after upgrade [production]
10:13 <marostegui@cumin1003> END (FAIL) - Cookbook sre.mysql.major-upgrade (exit_code=99) [production]
10:11 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host es2057.codfw.wmnet with OS trixie [production]
09:59 <marostegui@cumin1003> START - Cookbook sre.hosts.reimage for host db1179.eqiad.wmnet with OS trixie [production]
09:58 <marostegui@cumin1003> END (PASS) - Cookbook sre.mysql.depool (exit_code=0) depool db1179: Upgrading db1179.eqiad.wmnet [production]
09:58 <jynus> redoing m2 backups after grant change T411111 [production]
09:57 <marostegui@cumin1003> START - Cookbook sre.mysql.depool depool db1179: Upgrading db1179.eqiad.wmnet [production]
09:56 <marostegui@cumin1003> START - Cookbook sre.mysql.major-upgrade [production]
09:54 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on es2057.codfw.wmnet with reason: host reimage [production]
09:53 <ozge@deploy1003> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]