2026-06-30
10:20 <daniel@deploy1003> helmfile [codfw] DONE helmfile.d/services/rest-gateway: apply [production]
10:19 <cwilliams@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1261.eqiad.wmnet with reason: host reimage [production]
10:18 <daniel@deploy1003> helmfile [codfw] START helmfile.d/services/rest-gateway: apply [production]
10:15 <cwilliams@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on db1261.eqiad.wmnet with reason: host reimage [production]
10:12 <daniel@deploy1003> helmfile [staging] DONE helmfile.d/services/rest-gateway: apply [production]
10:07 <daniel@deploy1003> helmfile [staging] START helmfile.d/services/rest-gateway: apply [production]
10:00 <cwilliams@cumin1003> START - Cookbook sre.hosts.reimage for host db1261.eqiad.wmnet with OS trixie [production]
09:56 <godog> restart pybal on A:lvs-high-traffic2-eqiad [production]
09:53 <godog> restart pybal on A:lvs-secondary-eqiad [production]
09:51 <aikochou@deploy1003> helmfile [staging] DONE helmfile.d/services/changeprop: sync [production]
09:51 <aikochou@deploy1003> helmfile [staging] START helmfile.d/services/changeprop: sync [production]
09:49 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2204 (T426633)', diff saved to https://phabricator.wikimedia.org/P94616 and previous config saved to /var/cache/conftool/dbconfig/20260630-094938-fceratto.json [production]
09:39 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2204', diff saved to https://phabricator.wikimedia.org/P94615 and previous config saved to /var/cache/conftool/dbconfig/20260630-093931-fceratto.json [production]
09:38 <aklapper@deploy1003> rebuilt and synchronized wikiversions files: group0 to 1.47.0-wmf.9 refs T423918 [production]
09:35 <cwilliams@cumin1003> END (PASS) - Cookbook sre.mysql.depool (exit_code=0) depool db1261: Upgrading db1261.eqiad.wmnet [production]
09:34 <cwilliams@cumin1003> START - Cookbook sre.mysql.depool depool db1261: Upgrading db1261.eqiad.wmnet [production]
09:34 <cwilliams@cumin1003> dbmaint on s4@eqiad T429893 [production]
09:34 <cwilliams@cumin1003> START - Cookbook sre.mysql.major-upgrade [production]
09:33 <dcaro@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component jobs-api [tools]
09:29 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2204', diff saved to https://phabricator.wikimedia.org/P94613 and previous config saved to /var/cache/conftool/dbconfig/20260630-092923-fceratto.json [production]
09:23 <aklapper@deploy1003> Finished scap sync-world: Backport for [[gerrit:1306629|Fix overflow menu for non-advanced users (T428220)]] (duration: 11m 40s) [production]
09:20 <dcaro@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component jobs-api [tools]
09:19 <filippo@puppetserver1001> conftool action : set/pooled=yes:weight=100; selector: service=dumps-nfs [production]
09:19 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2204 (T426633)', diff saved to https://phabricator.wikimedia.org/P94612 and previous config saved to /var/cache/conftool/dbconfig/20260630-091915-fceratto.json [production]
09:17 <aklapper@deploy1003> aklapper: Continuing with deployment [production]
09:16 <aklapper@deploy1003> aklapper: Backport for [[gerrit:1306629|Fix overflow menu for non-advanced users (T428220)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
09:13 <fceratto@cumin1003> dbctl commit (dc=all): 'Depooling db2204 (T426633)', diff saved to https://phabricator.wikimedia.org/P94611 and previous config saved to /var/cache/conftool/dbconfig/20260630-091307-fceratto.json [production]
09:13 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2204.codfw.wmnet with reason: Maintenance [production]
09:12 <aklapper@deploy1003> Started scap sync-world: Backport for [[gerrit:1306629|Fix overflow menu for non-advanced users (T428220)]] [production]
09:08 <fceratto@cumin1003> dbctl commit (dc=all): 'Set weight db2204 T430624', diff saved to https://phabricator.wikimedia.org/P94610 and previous config saved to /var/cache/conftool/dbconfig/20260630-090841-fceratto.json [production]
09:05 <fceratto@cumin1003> dbctl commit (dc=all): 'Promote db2207 to s2 primary T430624', diff saved to https://phabricator.wikimedia.org/P94609 and previous config saved to /var/cache/conftool/dbconfig/20260630-090530-fceratto.json [production]
09:04 <federico3> Starting s2 codfw failover from db2204 to db2207 - T430624 [production]
09:02 <marostegui@cumin1003> conftool action : set/pooled=no; selector: name=clouddb1014.eqiad.wmnet,service=s2 [production]
09:02 <marostegui@cumin1003> conftool action : set/pooled=no; selector: name=clouddb1014.eqiad.wmnet,service=s7 [production]
09:02 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on clouddb[1014,1027].eqiad.wmnet with reason: cloning [production]
09:01 <marostegui@cumin1003> conftool action : set/pooled=no; selector: name=clouddb1027.eqiad.wmnet,service=s7 [production]
09:01 <marostegui@cumin1003> conftool action : set/pooled=no; selector: name=clouddb1027.eqiad.wmnet,service=s2 [production]
08:57 <jmm@dns1004> END - running authdns-update [production]
08:55 <jmm@dns1004> START - running authdns-update [production]
08:55 <dcaro@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component jobs-api [toolsbeta]
08:46 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1223 (T426633)', diff saved to https://phabricator.wikimedia.org/P94606 and previous config saved to /var/cache/conftool/dbconfig/20260630-084632-fceratto.json [production]
08:44 <fceratto@cumin1003> dbctl commit (dc=all): 'Set db2207 with weight 0 T430624', diff saved to https://phabricator.wikimedia.org/P94605 and previous config saved to /var/cache/conftool/dbconfig/20260630-084436-fceratto.json [production]
08:43 <dcaro@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component jobs-api [toolsbeta]
08:40 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 25 hosts with reason: Primary switchover s2 T430624 [production]
08:39 <ryankemper@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cirrussearch2099.codfw.wmnet with OS trixie [production]
08:37 <jmm@dns1004> END - running authdns-update [production]
08:36 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1223', diff saved to https://phabricator.wikimedia.org/P94604 and previous config saved to /var/cache/conftool/dbconfig/20260630-083624-fceratto.json [production]
08:35 <jmm@dns1004> START - running authdns-update [production]
08:34 <filippo@dns1004> END - running authdns-update [production]
08:32 <filippo@dns1004> START - running authdns-update [production]