2251-2300 of 10000 results (22ms)
2025-06-16 §
16:55 <eevans@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sessionstore2004.codfw.wmnet with OS bullseye [production]
16:50 <brett@cumin2002> START - Cookbook sre.cdn.roll-upgrade-varnish rolling upgrade of Varnish on A:cp-eqsin and A:cp - Fix VSLbs() assert error and upgrade libvmod-wmfuniq to 0.2.0 (T396581) [production]
16:50 <brett@cumin2002> END (ERROR) - Cookbook sre.cdn.roll-upgrade-varnish (exit_code=97) rolling upgrade of Varnish on A:cp-eqsin and A:cp - Fix VSLbs() assert error and upgrade libvmod-wmfuniq to 0.2.0 (T396581) [production]
16:50 <brett@cumin2002> START - Cookbook sre.cdn.roll-upgrade-varnish rolling upgrade of Varnish on A:cp-eqsin and A:cp - Fix VSLbs() assert error and upgrade libvmod-wmfuniq to 0.2.0 (T396581) [production]
16:46 <eevans@cumin1003> START - Cookbook sre.hosts.reimage for host sessionstore2004.codfw.wmnet with OS bullseye [production]
16:45 <eevans@cumin1003> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sessionstore2004.codfw.wmnet with OS bullseye [production]
16:43 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2182', diff saved to https://phabricator.wikimedia.org/P78071 and previous config saved to /var/cache/conftool/dbconfig/20250616-164317-fceratto.json [production]
16:43 <sukhe@cumin1002> START - Cookbook sre.hosts.reimage for host durum2001.codfw.wmnet with OS bookworm [production]
16:43 <sukhe@cumin1002> START - Cookbook sre.hosts.reimage for host durum1002.eqiad.wmnet with OS bookworm [production]
16:28 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2182', diff saved to https://phabricator.wikimedia.org/P78070 and previous config saved to /var/cache/conftool/dbconfig/20250616-162810-fceratto.json [production]
16:23 <sukhe@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host durum1001.eqiad.wmnet with OS bookworm [production]
16:13 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2182 (T395241)', diff saved to https://phabricator.wikimedia.org/P78069 and previous config saved to /var/cache/conftool/dbconfig/20250616-161303-fceratto.json [production]
16:12 <jgiannelos@deploy1003> helmfile [staging] DONE helmfile.d/services/changeprop: apply [production]
16:12 <jgiannelos@deploy1003> helmfile [staging] START helmfile.d/services/changeprop: apply [production]
16:12 <jgiannelos@deploy1003> helmfile [eqiad] DONE helmfile.d/services/changeprop: apply [production]
16:11 <jgiannelos@deploy1003> helmfile [eqiad] START helmfile.d/services/changeprop: apply [production]
16:11 <jgiannelos@deploy1003> helmfile [codfw] DONE helmfile.d/services/changeprop: apply [production]
16:11 <jgiannelos@deploy1003> helmfile [codfw] START helmfile.d/services/changeprop: apply [production]
16:10 <jgiannelos@deploy1003> helmfile [codfw] DONE helmfile.d/services/changeprop: apply [production]
16:09 <jgiannelos@deploy1003> helmfile [codfw] START helmfile.d/services/changeprop: apply [production]
16:09 <jgiannelos@deploy1003> helmfile [staging] DONE helmfile.d/services/changeprop: apply [production]
16:09 <jgiannelos@deploy1003> helmfile [staging] START helmfile.d/services/changeprop: apply [production]
16:06 <sukhe@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on durum1001.eqiad.wmnet with reason: host reimage [production]
16:03 <sukhe@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on durum1001.eqiad.wmnet with reason: host reimage [production]
16:02 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db2182 (T395241)', diff saved to https://phabricator.wikimedia.org/P78068 and previous config saved to /var/cache/conftool/dbconfig/20250616-160220-fceratto.json [production]
16:02 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2182.codfw.wmnet with reason: Maintenance [production]
16:02 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2168 (T395241)', diff saved to https://phabricator.wikimedia.org/P78067 and previous config saved to /var/cache/conftool/dbconfig/20250616-160203-fceratto.json [production]
15:58 <dancy@deploy1003> Installation of scap version "4.175.0" completed for 2 hosts [production]
15:56 <dancy@deploy1003> Installing scap version "4.175.0" for 2 host(s) [production]
15:55 <jdrewniak@deploy1003> Synchronized portals: Wikimedia Portals Update: [[gerrit:1159503| Bumping portals to master (T128546)]] (duration: 02m 19s) [production]
15:53 <jdrewniak@deploy1003> Synchronized portals/wikipedia.org/assets: Wikimedia Portals Update: [[gerrit:1159503| Bumping portals to master (T128546)]] (duration: 09m 21s) [production]
15:49 <eevans@cumin1003> START - Cookbook sre.hosts.reimage for host sessionstore2004.codfw.wmnet with OS bullseye [production]
15:47 <sukhe@cumin1002> START - Cookbook sre.hosts.reimage for host durum1001.eqiad.wmnet with OS bookworm [production]
15:47 <fnegri@cumin1003> conftool action : set/pooled=no; selector: name=clouddb1017.eqiad.wmnet [production]
15:47 <fnegri@cumin1003> conftool action : set/pooled=yes; selector: name=clouddb1013.eqiad.wmnet [production]
15:46 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2168', diff saved to https://phabricator.wikimedia.org/P78066 and previous config saved to /var/cache/conftool/dbconfig/20250616-154656-fceratto.json [production]
15:41 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephmon2005-dev.codfw.wmnet with reason: host reimage [production]
15:37 <andrew@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephmon2005-dev.codfw.wmnet with reason: host reimage [production]
15:31 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2168', diff saved to https://phabricator.wikimedia.org/P78065 and previous config saved to /var/cache/conftool/dbconfig/20250616-153148-fceratto.json [production]
15:30 <dreamyjazz@deploy1003> Finished scap sync-world: Backport for [[gerrit:1159477|Revert "Enable temporary accounts onboarding dialog on WMF wikis"]] (duration: 24m 48s) [production]
15:29 <urandom> decommissioning sessionstore2004-a/Cassandra — T391544 [production]
15:22 <dreamyjazz@deploy1003> dreamyjazz: Continuing with sync [production]
15:20 <fnegri@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on clouddb1013.eqiad.wmnet with reason: Upgrading clouddbs T394372 [production]
15:20 <fnegri@cumin1003> conftool action : set/pooled=no; selector: name=clouddb1013.eqiad.wmnet [production]
15:20 <taavi@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.quota_increase (exit_code=0) (T397059) [automation-framework]
15:19 <taavi@cloudcumin1001> START - Cookbook wmcs.openstack.quota_increase (T397059) [automation-framework]
15:17 <andrew@cumin1002> START - Cookbook sre.hosts.reimage for host cloudcephmon2005-dev.codfw.wmnet with OS bullseye [production]
15:17 <inflatador> bking@cumin2002:~$ sudo cumin A:lvs-low-traffic 'run-puppet-agent' T387569 [production]
15:16 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2168 (T395241)', diff saved to https://phabricator.wikimedia.org/P78064 and previous config saved to /var/cache/conftool/dbconfig/20250616-151641-fceratto.json [production]
15:15 <James_F> Docker: [quibble-bullseye] Add the MariaDB binaries to our path T366646 [releng]