51-100 of 10000 results (23ms)
2026-04-27 ยง
21:14 <jasmine@dns1004> START - running authdns-update [production]
21:14 <vriley@cumin1003> START - Cookbook sre.hosts.reimage for host ganeti1056.eqiad.wmnet with OS bookworm [production]
21:14 <vriley@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ganeti1055.eqiad.wmnet with OS bookworm [production]
21:14 <vriley@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - vriley@cumin1003" [production]
21:13 <vriley@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - vriley@cumin1003" [production]
21:13 <jasmine@deploy1003> conftool action : set/pooled=true; selector: name=eqiad,dnsdisc=sophroid [production]
20:54 <vriley@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ganeti1055.eqiad.wmnet with reason: host reimage [production]
20:47 <vriley@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on ganeti1055.eqiad.wmnet with reason: host reimage [production]
20:45 <fceratto@cumin1003> END (PASS) - Cookbook sre.mysql.pool (exit_code=0) pool db1210: Repooling [production]
20:45 <James_F> Zuul: Restrict mw*-codehealth-patch jobs to master only, for T424573 [releng]
20:30 <fceratto@cumin1003> START - Cookbook sre.mysql.pool pool db1210: Repooling [production]
20:17 <vriley@cumin1003> START - Cookbook sre.hosts.reimage for host ganeti1055.eqiad.wmnet with OS bookworm [production]
20:16 <dancy@deploy1003> Finished scap sync-world: Backport for [[gerrit:1277656|JS SDK: Add aliases for compatibility with existing experiment code (T419513)]] (duration: 06m 38s) [production]
20:14 <vriley@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ganeti1058.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
20:13 <dancy@deploy1003> dancy, sfaci: Continuing with deployment [production]
20:11 <dancy@deploy1003> dancy, sfaci: Backport for [[gerrit:1277656|JS SDK: Add aliases for compatibility with existing experiment code (T419513)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
20:10 <dancy@deploy1003> Started scap sync-world: Backport for [[gerrit:1277656|JS SDK: Add aliases for compatibility with existing experiment code (T419513)]] [production]
20:04 <vriley@cumin1003> START - Cookbook sre.hosts.provision for host ganeti1058.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
20:03 <vriley@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ganeti1058.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
20:02 <vriley@cumin1003> START - Cookbook sre.hosts.provision for host ganeti1058.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
20:02 <vriley@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ganeti1057.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
19:49 <jasmine_> "Restarting pybal on primary LVS servers in codfw" [production]
19:45 <vriley@cumin1003> START - Cookbook sre.hosts.provision for host ganeti1057.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
19:45 <vriley@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ganeti1056.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
19:41 <jasmine_> "Restarting pybal on the backup LVS servers in codfw" [production]
19:33 <vriley@cumin1003> START - Cookbook sre.hosts.provision for host ganeti1056.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
19:32 <rzl> root@apt1002:~# reprepro --noskipold --restrict vopsbot update bookworm-wikimedia [production]
19:31 <vriley@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ganeti1055.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
19:21 <dancy@deploy1003> Installation of scap version "4.253.0" completed for 2 hosts [production]
19:20 <vriley@cumin1003> START - Cookbook sre.hosts.provision for host ganeti1055.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
19:19 <dancy@deploy1003> Installing scap version "4.253.0" for 2 host(s) [production]
19:06 <jasmine_> "Restarting pybal on primary LVS servers in eqiad" [production]
19:05 <alexsanford@deploy1003> Finished scap sync-world: Backport for [[gerrit:1277693|Add 2FA enforcement demotion config for phase 1 groups]] (duration: 09m 03s) [production]
19:04 <ejegg> fundraising civicrm upgraded from 3f8d49fa to be3bb76b [fundraising]
19:02 <ejegg> payments-wiki upgraded from b1a352af to 5265089d [fundraising]
19:00 <alexsanford@deploy1003> alexsanford: Continuing with deployment [production]
19:00 <jasmine_> "Restarting pybal on the backup LVS servers in eqiad" [production]
18:58 <ejegg> standalone (IPN listener) SmashPig upgraded from 572b69da to 88a1bcba [fundraising]
18:57 <alexsanford@deploy1003> alexsanford: Backport for [[gerrit:1277693|Add 2FA enforcement demotion config for phase 1 groups]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
18:56 <alexsanford@deploy1003> Started scap sync-world: Backport for [[gerrit:1277693|Add 2FA enforcement demotion config for phase 1 groups]] [production]
18:54 <fceratto@cumin1003> dbctl commit (dc=all): 'Depooling db1210 (T419635)', diff saved to https://phabricator.wikimedia.org/P91688 and previous config saved to /var/cache/conftool/dbconfig/20260427-185444-fceratto.json [production]
18:54 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1210.eqiad.wmnet with reason: Maintenance [production]
18:54 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1207 (T419635)', diff saved to https://phabricator.wikimedia.org/P91687 and previous config saved to /var/cache/conftool/dbconfig/20260427-185429-fceratto.json [production]
18:44 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1207', diff saved to https://phabricator.wikimedia.org/P91686 and previous config saved to /var/cache/conftool/dbconfig/20260427-184421-fceratto.json [production]
18:34 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1207', diff saved to https://phabricator.wikimedia.org/P91685 and previous config saved to /var/cache/conftool/dbconfig/20260427-183413-fceratto.json [production]
18:26 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1263 (T419961)', diff saved to https://phabricator.wikimedia.org/P91682 and previous config saved to /var/cache/conftool/dbconfig/20260427-182652-fceratto.json [production]
18:26 <jasmine@dns1004> END - running authdns-update [production]
18:24 <jasmine@dns1004> START - running authdns-update [production]
18:24 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1207 (T419635)', diff saved to https://phabricator.wikimedia.org/P91681 and previous config saved to /var/cache/conftool/dbconfig/20260427-182404-fceratto.json [production]
18:21 <fceratto@cumin1003> dbctl commit (dc=all): 'Depooling db1207 (T419635)', diff saved to https://phabricator.wikimedia.org/P91680 and previous config saved to /var/cache/conftool/dbconfig/20260427-182154-fceratto.json [production]