1-50 of 10000 results (106ms)
2026-06-02 ยง
08:59 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on db2250.codfw.wmnet with reason: rack A3 maintenance [production]
08:56 <trueg@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/wdqs: apply [production]
08:56 <blake@cumin1003> START - Cookbook sre.hosts.reimage for host mc1055.eqiad.wmnet with OS trixie [production]
08:55 <dpogorzelski@deploy1003> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
08:54 <trueg@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/services/wdqs: apply [production]
08:54 <trueg@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/wdqs: apply [production]
08:53 <dpogorzelski@deploy1003> helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. [production]
08:52 <dpogorzelski@deploy1003> helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. [production]
08:51 <dpogorzelski@deploy1003> helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. [production]
08:50 <dpogorzelski@deploy1003> helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. [production]
08:50 <trueg@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/services/wdqs: apply [production]
08:47 <trueg@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/wdqs: apply [production]
08:46 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2045.codfw.wmnet [production]
08:41 <trueg@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/services/wdqs: apply [production]
08:39 <trueg@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/wdqs: apply [production]
08:37 <urbanecm> Reset user email of Barras@votewiki to the one of Barras@SUL [production]
08:30 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on dbstore1008.eqiad.wmnet with reason: Maintenance [production]
08:30 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1181 (T419635)', diff saved to https://phabricator.wikimedia.org/P93505 and previous config saved to /var/cache/conftool/dbconfig/20260602-083033-fceratto.json [production]
08:30 <trueg@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/services/wdqs: apply [production]
08:29 <slyngs> IDP, new configuration in preparation for webauthn [production]
08:20 <dpogorzelski@deploy1003> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
08:20 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1181', diff saved to https://phabricator.wikimedia.org/P93504 and previous config saved to /var/cache/conftool/dbconfig/20260602-082026-fceratto.json [production]
08:19 <dpogorzelski@deploy1003> helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. [production]
08:18 <dpogorzelski@deploy1003> helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. [production]
08:18 <dpogorzelski@deploy1003> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
08:17 <atsuko@deploy1003> Finished scap sync-world: Backport for [[gerrit:1296488|Revert "translate: adding separate read/write endpoints" (T425377)]] (duration: 03m 33s) [production]
08:16 <atsuko@deploy1003> atsuko: Rolling back deployment [production]
08:16 <marostegui@cumin1003> END (PASS) - Cookbook sre.mysql.pool (exit_code=0) pool es2053: repool after upgrade [production]
08:15 <atsuko@deploy1003> atsuko: Backport for [[gerrit:1296488|Revert "translate: adding separate read/write endpoints" (T425377)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
08:13 <atsuko@deploy1003> Started scap sync-world: Backport for [[gerrit:1296488|Revert "translate: adding separate read/write endpoints" (T425377)]] [production]
08:11 <dpogorzelski@deploy1003> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
08:10 <marostegui> Install mariadb 10.11.17 on es2053 T427345 [production]
08:10 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1181', diff saved to https://phabricator.wikimedia.org/P93502 and previous config saved to /var/cache/conftool/dbconfig/20260602-081018-fceratto.json [production]
08:09 <dpogorzelski@deploy1003> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
08:09 <fceratto@cumin1003> END (PASS) - Cookbook sre.mysql.pool (exit_code=0) pool db2241: Depool for rack maintenance [production]
08:03 <atsuko@deploy1003> Finished scap sync-world: Backport for [[gerrit:1296262|translate: fixing missed variable in credentials formatting closure (T425377)]] (duration: 14m 47s) [production]
08:00 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1181 (T419635)', diff saved to https://phabricator.wikimedia.org/P93499 and previous config saved to /var/cache/conftool/dbconfig/20260602-080011-fceratto.json [production]
07:59 <dpogorzelski@deploy1003> helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. [production]
07:59 <atsuko@deploy1003> atsuko: Rolling back deployment [production]
07:58 <dpogorzelski@deploy1003> helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. [production]
07:57 <fceratto@cumin1003> dbctl commit (dc=all): 'Depooling db1181 (T419635)', diff saved to https://phabricator.wikimedia.org/P93498 and previous config saved to /var/cache/conftool/dbconfig/20260602-075759-fceratto.json [production]
07:57 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1181.eqiad.wmnet with reason: Maintenance [production]
07:57 <fceratto@cumin1003> END (PASS) - Cookbook sre.mysql.pool (exit_code=0) pool db1180: Pooling [production]
07:50 <atsuko@deploy1003> atsuko: Backport for [[gerrit:1296262|translate: fixing missed variable in credentials formatting closure (T425377)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
07:49 <atsuko@deploy1003> Started scap sync-world: Backport for [[gerrit:1296262|translate: fixing missed variable in credentials formatting closure (T425377)]] [production]
07:48 <fceratto@cumin1003> END (ERROR) - Cookbook sre.mysql.pool (exit_code=97) pool db1181: Pooling [production]
07:47 <fceratto@cumin1003> START - Cookbook sre.mysql.pool pool db1181: Pooling [production]
07:44 <fceratto@cumin1003> END (PASS) - Cookbook sre.mysql.depool (exit_code=0) depool db1181: Reboot [production]
07:43 <fceratto@cumin1003> START - Cookbook sre.mysql.depool depool db1181: Reboot [production]
07:42 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on db1181.eqiad.wmnet with reason: Reboot [production]