2026-07-02 §
14:52 <oblivian@cumin1003> START - Cookbook sre.deploy.hiddenparma Hiddenparma deployment to the alerting hosts with reason: "Unblock taavi - oblivian@cumin1003" [production]
14:46 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2205 (T426633)', diff saved to https://phabricator.wikimedia.org/P94711 and previous config saved to /var/cache/conftool/dbconfig/20260702-144644-fceratto.json [production]
14:36 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2205', diff saved to https://phabricator.wikimedia.org/P94709 and previous config saved to /var/cache/conftool/dbconfig/20260702-143636-fceratto.json [production]
14:32 <moritzm> installing libdbi-perl security updates [production]
14:26 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2205', diff saved to https://phabricator.wikimedia.org/P94708 and previous config saved to /var/cache/conftool/dbconfig/20260702-142628-fceratto.json [production]
14:16 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2205 (T426633)', diff saved to https://phabricator.wikimedia.org/P94707 and previous config saved to /var/cache/conftool/dbconfig/20260702-141621-fceratto.json [production]
14:12 <moritzm> installing rsync security updates [production]
14:11 <sukhe@cumin1003> END (PASS) - Cookbook sre.dns.roll-restart (exit_code=0) rolling restart_daemons on A:dnsbox and (A:dnsbox) [production]
14:10 <fceratto@cumin1003> dbctl commit (dc=all): 'Depooling db2205 (T426633)', diff saved to https://phabricator.wikimedia.org/P94706 and previous config saved to /var/cache/conftool/dbconfig/20260702-140959-fceratto.json [production]
14:09 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2205.codfw.wmnet with reason: Maintenance [production]
14:09 <fceratto@cumin1003> END (ERROR) - Cookbook sre.mysql.pool (exit_code=97) pool db2205: Repooling after switchover [production]
14:07 <elukey@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host an-test-master1004.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
14:06 <elukey@cumin1003> START - Cookbook sre.hosts.provision for host an-test-master1004.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
14:06 <Tran> Deployed patch for T427287 [production]
14:04 <elukey@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host an-test-master1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:59 <fceratto@cumin1003> START - Cookbook sre.mysql.pool pool db2205: Repooling after switchover [production]
13:59 <fceratto@cumin1003> END (ERROR) - Cookbook sre.mysql.pool (exit_code=97) pool db2205: Repooling after switchover [production]
13:59 <elukey@cumin1003> START - Cookbook sre.hosts.provision for host an-test-master1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:55 <fceratto@cumin1003> START - Cookbook sre.mysql.pool pool db2205: Repooling after switchover [production]
13:55 <fceratto@cumin1003> dbctl commit (dc=all): 'Depool db2205 T430912', diff saved to https://phabricator.wikimedia.org/P94704 and previous config saved to /var/cache/conftool/dbconfig/20260702-135505-fceratto.json [production]
13:54 <moritzm> installing sed security updates [production]
13:53 <elukey@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host an-test-master1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:52 <fceratto@cumin1003> dbctl commit (dc=all): 'Promote db2209 to s3 primary T430912', diff saved to https://phabricator.wikimedia.org/P94703 and previous config saved to /var/cache/conftool/dbconfig/20260702-135235-fceratto.json [production]
13:52 <federico3> Starting s3 codfw failover from db2205 to db2209 - T430912 [production]
13:51 <blake@deploy1003> helmfile [codfw] DONE helmfile.d/services/mw-pretrain: apply [production]
13:51 <blake@deploy1003> helmfile [codfw] START helmfile.d/services/mw-pretrain: apply [production]
13:49 <btullis@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s_services/services/datahub-next: apply [production]
13:48 <elukey@cumin1003> START - Cookbook sre.hosts.provision for host an-test-master1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:47 <fceratto@cumin1003> dbctl commit (dc=all): 'Set db2209 with weight 0 T430912', diff saved to https://phabricator.wikimedia.org/P94702 and previous config saved to /var/cache/conftool/dbconfig/20260702-134719-fceratto.json [production]
13:47 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 24 hosts with reason: Primary switchover s3 T430912 [production]
13:44 <blake@deploy1003> helmfile [codfw] START helmfile.d/services/mw-pretrain: apply [production]
13:44 <blake@deploy1003> helmfile [codfw] DONE helmfile.d/services/mw-pretrain: apply [production]
13:44 <blake@deploy1003> helmfile [codfw] START helmfile.d/services/mw-pretrain: apply [production]
13:40 <bking@cumin2003> conftool action : set/pooled=true; selector: dnsdisc=search,name=codfw [production]
13:38 <btullis@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s_services/services/datahub-next: apply [production]
13:37 <bking@cumin2003> conftool action : set/pooled=false; selector: dnsdisc=search,name=codfw [production]
13:36 <bking@cumin2003> conftool action : set/pooled=true; selector: dnsdisc=search-psi,name=codfw [production]
13:36 <elukey@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host an-test-master1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:34 <bking@cumin2003> conftool action : set/pooled=false; selector: dnsdisc=search-psi,name=codfw [production]
13:30 <elukey@cumin1003> START - Cookbook sre.hosts.provision for host an-test-master1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:29 <elukey@cumin1003> END (ERROR) - Cookbook sre.hosts.provision (exit_code=97) for host an-test-master1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:29 <elukey@cumin1003> START - Cookbook sre.hosts.provision for host an-test-master1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:27 <elukey@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host an-test-master1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:26 <elukey@cumin1003> START - Cookbook sre.hosts.provision for host an-test-master1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:25 <sukhe@cumin1003> END (PASS) - Cookbook sre.dns.roll-restart-reboot-wikimedia-dns (exit_code=0) rolling restart_daemons on A:wikidough [production]
13:23 <bking@cumin2003> conftool action : set/pooled=true; selector: dnsdisc=search-psi,name=codfw [production]
13:22 <bking@cumin2003> conftool action : set/pooled=true; selector: dnsdisc=search-omega,name=codfw [production]
13:17 <bking@cumin2003> conftool action : set/pooled=true; selector: dnsdisc=search,name=codfw [production]
13:17 <sukhe@puppetserver1001> conftool action : set/pooled=yes; selector: name=dns1004.wikimedia.org [production]
13:12 <sukhe@cumin1003> START - Cookbook sre.dns.roll-restart rolling restart_daemons on A:dnsbox and (A:dnsbox) [production]