1-50 of 10000 results (34ms)
2026-07-02 ยง
10:13 <fnegri@cumin1003> START - Cookbook sre.mysql.multiinstance_reboot for clouddb1017.eqiad.wmnet [production]
10:12 <fceratto@cumin1003> START - Cookbook sre.hosts.remove-downtime for db2214.codfw.wmnet [production]
10:12 <fceratto@cumin1003> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for db2214.codfw.wmnet [production]
10:12 <fceratto@cumin1003> START - Cookbook sre.mysql.pool pool db2214: Repooling [production]
10:11 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2213', diff saved to https://phabricator.wikimedia.org/P94693 and previous config saved to /var/cache/conftool/dbconfig/20260702-101130-fceratto.json [production]
10:10 <btullis@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s_services/services/datahub-next: apply [production]
10:10 <fceratto@cumin1003> START - Cookbook sre.dns.netbox [production]
10:04 <btullis@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s_services/services/datahub-next: apply [production]
10:03 <fceratto@cumin1003> START - Cookbook sre.hosts.decommission for hosts es1033.eqiad.wmnet [production]
10:03 <fceratto@cumin1003> START - Cookbook sre.mysql.decommission [production]
10:01 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2213 (T426633)', diff saved to https://phabricator.wikimedia.org/P94691 and previous config saved to /var/cache/conftool/dbconfig/20260702-100122-fceratto.json [production]
09:55 <fceratto@cumin1003> dbctl commit (dc=all): 'Depooling db2213 (T426633)', diff saved to https://phabricator.wikimedia.org/P94690 and previous config saved to /var/cache/conftool/dbconfig/20260702-095529-fceratto.json [production]
09:55 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2213.codfw.wmnet with reason: Maintenance [production]
09:54 <btullis@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s_services/services/datahub-next: apply [production]
09:52 <fceratto@cumin1003> END (ERROR) - Cookbook sre.mysql.pool (exit_code=97) pool db2213: Repooling after switchover [production]
09:51 <fceratto@cumin1003> START - Cookbook sre.mysql.pool pool db2213: Repooling after switchover [production]
09:44 <fceratto@cumin1003> END (FAIL) - Cookbook sre.mysql.pool (exit_code=99) pool db2213: Repooling after switchover [production]
09:39 <fceratto@cumin1003> START - Cookbook sre.mysql.pool pool db2213: Repooling after switchover [production]
09:39 <fceratto@cumin1003> dbctl commit (dc=all): 'Depool db2213 T430923', diff saved to https://phabricator.wikimedia.org/P94688 and previous config saved to /var/cache/conftool/dbconfig/20260702-093859-fceratto.json [production]
09:36 <fceratto@cumin1003> dbctl commit (dc=all): 'Promote db2192 to s5 primary T430923', diff saved to https://phabricator.wikimedia.org/P94687 and previous config saved to /var/cache/conftool/dbconfig/20260702-093650-fceratto.json [production]
09:36 <federico3> Starting s5 codfw failover from db2213 to db2192 - T430923 [production]
09:35 <godog> carry out nfs tests on toolsbeta-test-k8s-worker-nfs-8 - T411248 [toolsbeta]
09:30 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2220 (T426633)', diff saved to https://phabricator.wikimedia.org/P94686 and previous config saved to /var/cache/conftool/dbconfig/20260702-093004-fceratto.json [production]
09:24 <fceratto@cumin1003> dbctl commit (dc=all): 'Set db2192 with weight 0 T430923', diff saved to https://phabricator.wikimedia.org/P94685 and previous config saved to /var/cache/conftool/dbconfig/20260702-092455-fceratto.json [production]
09:24 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 23 hosts with reason: Primary switchover s5 T430923 [production]
09:19 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2220', diff saved to https://phabricator.wikimedia.org/P94684 and previous config saved to /var/cache/conftool/dbconfig/20260702-091957-fceratto.json [production]
09:16 <kharlan@deploy1003> Finished scap sync-world: Backport for [[gerrit:1307076|SourceEditorOverlay: Re-enable buttons after non-captcha save failure (T430518)]] (duration: 06m 57s) [production]
09:13 <moritzm> installing libgcrypt20 security updates [production]
09:12 <kharlan@deploy1003> kharlan: Continuing with deployment [production]
09:11 <kharlan@deploy1003> kharlan: Backport for [[gerrit:1307076|SourceEditorOverlay: Re-enable buttons after non-captcha save failure (T430518)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
09:09 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2220', diff saved to https://phabricator.wikimedia.org/P94683 and previous config saved to /var/cache/conftool/dbconfig/20260702-090950-fceratto.json [production]
09:09 <kharlan@deploy1003> Started scap sync-world: Backport for [[gerrit:1307076|SourceEditorOverlay: Re-enable buttons after non-captcha save failure (T430518)]] [production]
09:03 <btullis@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s_services/services/datahub-next: apply [production]
09:01 <kharlan@deploy1003> Finished scap sync-world: Backport for [[gerrit:1307075|build: Update required Node version from 24.14.1 to 24.18.0]] (duration: 07m 07s) [production]
08:59 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2220 (T426633)', diff saved to https://phabricator.wikimedia.org/P94682 and previous config saved to /var/cache/conftool/dbconfig/20260702-085942-fceratto.json [production]
08:57 <kharlan@deploy1003> kharlan: Continuing with deployment [production]
08:56 <kharlan@deploy1003> kharlan: Backport for [[gerrit:1307075|build: Update required Node version from 24.14.1 to 24.18.0]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
08:55 <dcaro@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component jobs-api [tools]
08:54 <kharlan@deploy1003> Started scap sync-world: Backport for [[gerrit:1307075|build: Update required Node version from 24.14.1 to 24.18.0]] [production]
08:52 <btullis@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s_services/services/datahub-next: apply [production]
08:52 <btullis@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s_services/services/datahub-next: apply [production]
08:52 <fceratto@cumin1003> dbctl commit (dc=all): 'Depooling db2220 (T426633)', diff saved to https://phabricator.wikimedia.org/P94681 and previous config saved to /var/cache/conftool/dbconfig/20260702-085237-fceratto.json [production]
08:52 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2220.codfw.wmnet with reason: Maintenance [production]
08:43 <btullis@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s_services/services/datahub-next: apply [production]
08:42 <dcaro@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component jobs-api [tools]
08:41 <dcaro@cloudcumin1001> END (FAIL) - Cookbook wmcs.toolforge.component.deploy (exit_code=99) for component jobs-api [tools]
08:40 <aklapper@deploy1003> rebuilt and synchronized wikiversions files: group2 to 1.47.0-wmf.9 refs T423918 [production]
08:39 <dcaro@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component jobs-api [tools]
08:37 <dcaro@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component jobs-api [toolsbeta]
08:25 <cscott@deploy1003> Finished scap sync-world: Backport for [[gerrit:1307059|Bump wikimedia/parsoid to 0.24.0-a14 (T387374 T430186 T430367 T430501)]], [[gerrit:1307061|Bump wikimedia/parsoid to 0.24.0-a14 (T430501)]] (duration: 11m 44s) [production]