101-150 of 10000 results (28ms)
2026-07-02 ยง
14:06 <Tran> Deployed patch for T427287 [production]
14:04 <elukey@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host an-test-master1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:59 <fceratto@cumin1003> START - Cookbook sre.mysql.pool pool db2205: Repooling after switchover [production]
13:59 <fceratto@cumin1003> END (ERROR) - Cookbook sre.mysql.pool (exit_code=97) pool db2205: Repooling after switchover [production]
13:59 <elukey@cumin1003> START - Cookbook sre.hosts.provision for host an-test-master1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:55 <fceratto@cumin1003> START - Cookbook sre.mysql.pool pool db2205: Repooling after switchover [production]
13:55 <fceratto@cumin1003> dbctl commit (dc=all): 'Depool db2205 T430912', diff saved to https://phabricator.wikimedia.org/P94704 and previous config saved to /var/cache/conftool/dbconfig/20260702-135505-fceratto.json [production]
13:54 <moritzm> installing sed security updates [production]
13:53 <elukey@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host an-test-master1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:52 <fceratto@cumin1003> dbctl commit (dc=all): 'Promote db2209 to s3 primary T430912', diff saved to https://phabricator.wikimedia.org/P94703 and previous config saved to /var/cache/conftool/dbconfig/20260702-135235-fceratto.json [production]
13:52 <federico3> Starting s3 codfw failover from db2205 to db2209 - T430912 [production]
13:51 <blake@deploy1003> helmfile [codfw] DONE helmfile.d/services/mw-pretrain: apply [production]
13:51 <blake@deploy1003> helmfile [codfw] START helmfile.d/services/mw-pretrain: apply [production]
13:49 <btullis@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s_services/services/datahub-next: apply [production]
13:48 <elukey@cumin1003> START - Cookbook sre.hosts.provision for host an-test-master1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:47 <fceratto@cumin1003> dbctl commit (dc=all): 'Set db2209 with weight 0 T430912', diff saved to https://phabricator.wikimedia.org/P94702 and previous config saved to /var/cache/conftool/dbconfig/20260702-134719-fceratto.json [production]
13:47 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 24 hosts with reason: Primary switchover s3 T430912 [production]
13:44 <blake@deploy1003> helmfile [codfw] START helmfile.d/services/mw-pretrain: apply [production]
13:44 <blake@deploy1003> helmfile [codfw] DONE helmfile.d/services/mw-pretrain: apply [production]
13:44 <blake@deploy1003> helmfile [codfw] START helmfile.d/services/mw-pretrain: apply [production]
13:40 <bking@cumin2003> conftool action : set/pooled=true; selector: dnsdisc=search,name=codfw [production]
13:38 <btullis@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s_services/services/datahub-next: apply [production]
13:37 <bking@cumin2003> conftool action : set/pooled=false; selector: dnsdisc=search,name=codfw [production]
13:36 <bking@cumin2003> conftool action : set/pooled=true; selector: dnsdisc=search-psi,name=codfw [production]
13:36 <elukey@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host an-test-master1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:34 <bking@cumin2003> conftool action : set/pooled=false; selector: dnsdisc=search-psi,name=codfw [production]
13:30 <elukey@cumin1003> START - Cookbook sre.hosts.provision for host an-test-master1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:29 <elukey@cumin1003> END (ERROR) - Cookbook sre.hosts.provision (exit_code=97) for host an-test-master1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:29 <elukey@cumin1003> START - Cookbook sre.hosts.provision for host an-test-master1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:27 <elukey@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host an-test-master1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:27 <James_F> Docker: [composer-scratch] Upgrade composer to 2.10.2 and cascade, for T428570 [releng]
13:26 <elukey@cumin1003> START - Cookbook sre.hosts.provision for host an-test-master1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:25 <sukhe@cumin1003> END (PASS) - Cookbook sre.dns.roll-restart-reboot-wikimedia-dns (exit_code=0) rolling restart_daemons on A:wikidough [production]
13:23 <bking@cumin2003> conftool action : set/pooled=true; selector: dnsdisc=search-psi,name=codfw [production]
13:22 <bking@cumin2003> conftool action : set/pooled=true; selector: dnsdisc=search-omega,name=codfw [production]
13:17 <bking@cumin2003> conftool action : set/pooled=true; selector: dnsdisc=search,name=codfw [production]
13:17 <sukhe@puppetserver1001> conftool action : set/pooled=yes; selector: name=dns1004.wikimedia.org [production]
13:12 <sukhe@cumin1003> START - Cookbook sre.dns.roll-restart rolling restart_daemons on A:dnsbox and (A:dnsbox) [production]
13:11 <sukhe@cumin1003> END (ERROR) - Cookbook sre.dns.roll-restart (exit_code=97) rolling restart_daemons on A:dnsbox and (A:dnsbox) [production]
13:11 <sukhe@cumin1003> START - Cookbook sre.dns.roll-restart rolling restart_daemons on A:dnsbox and (A:dnsbox) [production]
13:11 <sukhe@cumin1003> START - Cookbook sre.dns.roll-restart-reboot-wikimedia-dns rolling restart_daemons on A:wikidough [production]
13:11 <sukhe@cumin1003> END (ERROR) - Cookbook sre.dns.roll-restart-reboot-wikimedia-dns (exit_code=97) rolling restart_daemons on A:wikidough [production]
13:11 <sukhe@cumin1003> START - Cookbook sre.dns.roll-restart-reboot-wikimedia-dns rolling restart_daemons on A:wikidough [production]
13:09 <aude@deploy1003> Finished scap sync-world: Backport for [[gerrit:1305773|Phase 3 Legal contact link deployments. (T430227)]] (duration: 07m 20s) [production]
13:05 <aude@deploy1003> jdrewniak, aude: Continuing with deployment [production]
13:04 <aude@deploy1003> jdrewniak, aude: Backport for [[gerrit:1305773|Phase 3 Legal contact link deployments. (T430227)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
13:02 <aude@deploy1003> Started scap sync-world: Backport for [[gerrit:1305773|Phase 3 Legal contact link deployments. (T430227)]] [production]
12:19 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts wdqs-categories1001.eqiad.wmnet [production]
12:19 <btullis@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
12:19 <btullis@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: wdqs-categories1001.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - btullis@cumin1003" [production]