1401-1450 of 10000 results (42ms)
2025-06-23 §
15:28 <jhancock@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on rdb2012.codfw.wmnet with reason: host reimage [production]
15:28 <jhancock@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on rdb2011.codfw.wmnet with reason: host reimage [production]
15:25 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db1252 (T396130)', diff saved to https://phabricator.wikimedia.org/P78656 and previous config saved to /var/cache/conftool/dbconfig/20250623-152551-marostegui.json [production]
15:25 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1252.eqiad.wmnet with reason: Maintenance [production]
15:25 <sukhe@cumin1002> END (PASS) - Cookbook sre.cdn.roll-upgrade-ats (exit_code=0) Rolling upgrade of ATS on A:magru and A:cp - 9.2.11 upgrade (T397456) [production]
15:25 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1249 (T396130)', diff saved to https://phabricator.wikimedia.org/P78655 and previous config saved to /var/cache/conftool/dbconfig/20250623-152529-marostegui.json [production]
15:22 <lucaswerkmeister-wmde@deploy1003> anzx, lucaswerkmeister-wmde: Continuing with sync [production]
15:21 <lucaswerkmeister-wmde@deploy1003> anzx, lucaswerkmeister-wmde: Backport for [[gerrit:1162889|brwiki: add patroller usergroup (T397576)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
15:19 <lucaswerkmeister-wmde@deploy1003> Started scap sync-world: Backport for [[gerrit:1162889|brwiki: add patroller usergroup (T397576)]] [production]
15:18 <eevans@cumin1003> START - Cookbook sre.hosts.reimage for host sessionstore2005.codfw.wmnet with OS bullseye [production]
15:18 <eevans@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sessionstore2005.codfw.wmnet with OS bullseye [production]
15:18 <klausman@deploy1003> helmfile [staging] START helmfile.d/services/machinetranslation: apply [production]
15:16 <lucaswerkmeister-wmde@deploy1003> Finished scap sync-world: Backport for [[gerrit:1141852|Create feature flags to resolve Wikibase item labels on the Watchlist. (T388685)]] (duration: 12m 07s) [production]
15:15 <jhancock@cumin1003> START - Cookbook sre.hosts.reimage for host rdb2012.codfw.wmnet with OS bookworm [production]
15:15 <jhancock@cumin1003> START - Cookbook sre.hosts.reimage for host rdb2011.codfw.wmnet with OS bookworm [production]
15:10 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1249', diff saved to https://phabricator.wikimedia.org/P78654 and previous config saved to /var/cache/conftool/dbconfig/20250623-151021-marostegui.json [production]
15:09 <lucaswerkmeister-wmde@deploy1003> neslihanturan, lucaswerkmeister-wmde: Continuing with sync [production]
15:06 <lucaswerkmeister-wmde@deploy1003> neslihanturan, lucaswerkmeister-wmde: Backport for [[gerrit:1141852|Create feature flags to resolve Wikibase item labels on the Watchlist. (T388685)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
15:04 <lucaswerkmeister-wmde@deploy1003> Started scap sync-world: Backport for [[gerrit:1141852|Create feature flags to resolve Wikibase item labels on the Watchlist. (T388685)]] [production]
15:02 <eevans@cumin1003> START - Cookbook sre.hosts.reimage for host sessionstore2005.codfw.wmnet with OS bullseye [production]
15:00 <urandom> decommission Cassandra/sessionstore2005-a — T391544 [production]
14:59 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.loadbalancer.admin (exit_code=0) config_reloading P{lvs6001.drmrs.wmnet} and A:liberica (T396561) [production]
14:59 <vgutierrez> repool lvs6001 using katran - T396561 [production]
14:59 <vgutierrez@cumin1002> START - Cookbook sre.loadbalancer.admin config_reloading P{lvs6001.drmrs.wmnet} and A:liberica (T396561) [production]
14:58 <jgiannelos@deploy1003> helmfile [staging] DONE helmfile.d/services/mobileapps: apply [production]
14:55 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1249', diff saved to https://phabricator.wikimedia.org/P78653 and previous config saved to /var/cache/conftool/dbconfig/20250623-145514-marostegui.json [production]
14:48 <jgiannelos@deploy1003> helmfile [staging] START helmfile.d/services/mobileapps: apply [production]
14:40 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1249 (T396130)', diff saved to https://phabricator.wikimedia.org/P78652 and previous config saved to /var/cache/conftool/dbconfig/20250623-144005-marostegui.json [production]
14:39 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for lvs6001.drmrs.wmnet [production]
14:39 <vgutierrez@cumin1002> START - Cookbook sre.hosts.remove-downtime for lvs6001.drmrs.wmnet [production]
14:33 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db1249 (T396130)', diff saved to https://phabricator.wikimedia.org/P78651 and previous config saved to /var/cache/conftool/dbconfig/20250623-143311-marostegui.json [production]
14:33 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1249.eqiad.wmnet with reason: Maintenance [production]
14:33 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1248 (T396130)', diff saved to https://phabricator.wikimedia.org/P78650 and previous config saved to /var/cache/conftool/dbconfig/20250623-143300-marostegui.json [production]
14:31 <elukey@deploy1003> helmfile [codfw] DONE helmfile.d/services/machinetranslation: sync [production]
14:21 <elukey@deploy1003> helmfile [codfw] START helmfile.d/services/machinetranslation: sync [production]
14:19 <vgutierrez@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on lvs6001.drmrs.wmnet with reason: switching to katran [production]
14:18 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.loadbalancer.admin (exit_code=0) depooling P{lvs6001.drmrs.wmnet} and A:liberica (T396561) [production]
14:18 <mvernon@cumin2002> conftool action : set/pooled=true; selector: dnsdisc=swift-ro,name=codfw [production]
14:18 <vgutierrez@cumin1002> START - Cookbook sre.loadbalancer.admin depooling P{lvs6001.drmrs.wmnet} and A:liberica (T396561) [production]
14:17 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1248', diff saved to https://phabricator.wikimedia.org/P78649 and previous config saved to /var/cache/conftool/dbconfig/20250623-141753-marostegui.json [production]
14:17 <mvernon@cumin2002> conftool action : set/pooled=true; selector: dnsdisc=swift,name=codfw [production]
14:17 <Emperor> repool ms swift in codfw [production]
14:13 <kamila@cumin1003> END (PASS) - Cookbook sre.k8s.pool-depool-cluster (exit_code=0) pool all services in codfw/codfw: maintenance [production]
14:10 <kamila@deploy1003> helmfile [eqiad] DONE helmfile.d/admin 'apply'. [production]
14:10 <kamila@deploy1003> helmfile [eqiad] START helmfile.d/admin 'apply'. [production]
14:10 <kamila@deploy1003> helmfile [codfw] DONE helmfile.d/admin 'apply'. [production]
14:09 <kamila@deploy1003> helmfile [codfw] START helmfile.d/admin 'apply'. [production]
14:09 <kamila@deploy1003> helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'apply'. [production]
14:08 <kamila@deploy1003> helmfile [ml-serve-eqiad] START helmfile.d/admin 'apply'. [production]
14:08 <kamila@deploy1003> helmfile [ml-serve-codfw] DONE helmfile.d/admin 'apply'. [production]