2025-06-16
17:17 |
<cjming@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic-next: apply |
[production] |
17:16 |
<cjming@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic-next: apply |
[production] |
17:12 |
<swfrench@deploy1003> |
Finished scap sync-world: Scap run to test newly enabled dse-k8s-eqiad deployment - T389786 (duration: 02m 15s) |
[production] |
17:11 |
<swfrench@deploy1003> |
Started scap sync-world: Scap run to test newly enabled dse-k8s-eqiad deployment - T389786 |
[production] |
17:07 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2208 (T395241)', diff saved to https://phabricator.wikimedia.org/P78074 and previous config saved to /var/cache/conftool/dbconfig/20250616-170726-fceratto.json |
[production] |
17:07 |
<sukhe@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on durum2001.codfw.wmnet with reason: host reimage |
[production] |
17:06 |
<eevans@cumin1003> |
START - Cookbook sre.hosts.reimage for host sessionstore2004.codfw.wmnet with OS bullseye |
[production] |
17:03 |
<sukhe@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on durum2001.codfw.wmnet with reason: host reimage |
[production] |
17:03 |
<sukhe@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on durum1002.eqiad.wmnet with reason: host reimage |
[production] |
16:58 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db2208 (T395241)', diff saved to https://phabricator.wikimedia.org/P78073 and previous config saved to /var/cache/conftool/dbconfig/20250616-165855-fceratto.json |
[production] |
16:58 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2208.codfw.wmnet with reason: Maintenance |
[production] |
16:58 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2182 (T395241)', diff saved to https://phabricator.wikimedia.org/P78072 and previous config saved to /var/cache/conftool/dbconfig/20250616-165825-fceratto.json |
[production] |
16:58 |
<sukhe@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on durum1002.eqiad.wmnet with reason: host reimage |
[production] |
16:55 |
<eevans@cumin1003> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sessionstore2004.codfw.wmnet with OS bullseye |
[production] |
16:50 |
<brett@cumin2002> |
START - Cookbook sre.cdn.roll-upgrade-varnish rolling upgrade of Varnish on A:cp-eqsin and A:cp - Fix VSLbs() assert error and upgrade libvmod-wmfuniq to 0.2.0 (T396581) |
[production] |
16:50 |
<brett@cumin2002> |
END (ERROR) - Cookbook sre.cdn.roll-upgrade-varnish (exit_code=97) rolling upgrade of Varnish on A:cp-eqsin and A:cp - Fix VSLbs() assert error and upgrade libvmod-wmfuniq to 0.2.0 (T396581) |
[production] |
16:50 |
<brett@cumin2002> |
START - Cookbook sre.cdn.roll-upgrade-varnish rolling upgrade of Varnish on A:cp-eqsin and A:cp - Fix VSLbs() assert error and upgrade libvmod-wmfuniq to 0.2.0 (T396581) |
[production] |
16:46 |
<eevans@cumin1003> |
START - Cookbook sre.hosts.reimage for host sessionstore2004.codfw.wmnet with OS bullseye |
[production] |
16:45 |
<eevans@cumin1003> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sessionstore2004.codfw.wmnet with OS bullseye |
[production] |
16:43 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2182', diff saved to https://phabricator.wikimedia.org/P78071 and previous config saved to /var/cache/conftool/dbconfig/20250616-164317-fceratto.json |
[production] |
16:43 |
<sukhe@cumin1002> |
START - Cookbook sre.hosts.reimage for host durum2001.codfw.wmnet with OS bookworm |
[production] |
16:43 |
<sukhe@cumin1002> |
START - Cookbook sre.hosts.reimage for host durum1002.eqiad.wmnet with OS bookworm |
[production] |
16:28 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2182', diff saved to https://phabricator.wikimedia.org/P78070 and previous config saved to /var/cache/conftool/dbconfig/20250616-162810-fceratto.json |
[production] |
16:23 |
<sukhe@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host durum1001.eqiad.wmnet with OS bookworm |
[production] |
16:13 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2182 (T395241)', diff saved to https://phabricator.wikimedia.org/P78069 and previous config saved to /var/cache/conftool/dbconfig/20250616-161303-fceratto.json |
[production] |
16:12 |
<jgiannelos@deploy1003> |
helmfile [staging] DONE helmfile.d/services/changeprop: apply |
[production] |
16:12 |
<jgiannelos@deploy1003> |
helmfile [staging] START helmfile.d/services/changeprop: apply |
[production] |
16:12 |
<jgiannelos@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/changeprop: apply |
[production] |
16:11 |
<jgiannelos@deploy1003> |
helmfile [eqiad] START helmfile.d/services/changeprop: apply |
[production] |
16:11 |
<jgiannelos@deploy1003> |
helmfile [codfw] DONE helmfile.d/services/changeprop: apply |
[production] |
16:11 |
<jgiannelos@deploy1003> |
helmfile [codfw] START helmfile.d/services/changeprop: apply |
[production] |
16:10 |
<jgiannelos@deploy1003> |
helmfile [codfw] DONE helmfile.d/services/changeprop: apply |
[production] |
16:09 |
<jgiannelos@deploy1003> |
helmfile [codfw] START helmfile.d/services/changeprop: apply |
[production] |
16:09 |
<jgiannelos@deploy1003> |
helmfile [staging] DONE helmfile.d/services/changeprop: apply |
[production] |
16:09 |
<jgiannelos@deploy1003> |
helmfile [staging] START helmfile.d/services/changeprop: apply |
[production] |
16:06 |
<sukhe@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on durum1001.eqiad.wmnet with reason: host reimage |
[production] |
16:03 |
<sukhe@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on durum1001.eqiad.wmnet with reason: host reimage |
[production] |
16:02 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db2182 (T395241)', diff saved to https://phabricator.wikimedia.org/P78068 and previous config saved to /var/cache/conftool/dbconfig/20250616-160220-fceratto.json |
[production] |
16:02 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2182.codfw.wmnet with reason: Maintenance |
[production] |
16:02 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2168 (T395241)', diff saved to https://phabricator.wikimedia.org/P78067 and previous config saved to /var/cache/conftool/dbconfig/20250616-160203-fceratto.json |
[production] |
15:58 |
<dancy@deploy1003> |
Installation of scap version "4.175.0" completed for 2 hosts |
[production] |
15:56 |
<dancy@deploy1003> |
Installing scap version "4.175.0" for 2 host(s) |
[production] |
15:55 |
<jdrewniak@deploy1003> |
Synchronized portals: Wikimedia Portals Update: [[gerrit:1159503| Bumping portals to master (T128546)]] (duration: 02m 19s) |
[production] |
15:53 |
<jdrewniak@deploy1003> |
Synchronized portals/wikipedia.org/assets: Wikimedia Portals Update: [[gerrit:1159503| Bumping portals to master (T128546)]] (duration: 09m 21s) |
[production] |
15:49 |
<eevans@cumin1003> |
START - Cookbook sre.hosts.reimage for host sessionstore2004.codfw.wmnet with OS bullseye |
[production] |
15:47 |
<sukhe@cumin1002> |
START - Cookbook sre.hosts.reimage for host durum1001.eqiad.wmnet with OS bookworm |
[production] |
15:47 |
<fnegri@cumin1003> |
conftool action : set/pooled=no; selector: name=clouddb1017.eqiad.wmnet |
[production] |
15:47 |
<fnegri@cumin1003> |
conftool action : set/pooled=yes; selector: name=clouddb1013.eqiad.wmnet |
[production] |
15:46 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2168', diff saved to https://phabricator.wikimedia.org/P78066 and previous config saved to /var/cache/conftool/dbconfig/20250616-154656-fceratto.json |
[production] |
15:41 |
<andrew@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephmon2005-dev.codfw.wmnet with reason: host reimage |
[production] |