|
2026-05-19
ยง
|
| 07:07 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host ganeti-test2001.codfw.wmnet |
[production] |
| 07:07 |
<mlitn@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1288994|Squashed diff to master]] |
[production] |
| 07:07 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ldap-maint1001.eqiad.wmnet |
[production] |
| 07:04 |
<jiji@cumin1003> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mc1058.eqiad.wmnet with OS bookworm |
[production] |
| 07:03 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host ldap-maint1001.eqiad.wmnet |
[production] |
| 07:02 |
<jiji@cumin1003> |
START - Cookbook sre.hosts.reimage for host mc1056.eqiad.wmnet with OS bookworm |
[production] |
| 07:02 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netflow1003.eqiad.wmnet |
[production] |
| 06:59 |
<jiji@cumin1003> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mc1057.eqiad.wmnet with OS bookworm |
[production] |
| 06:57 |
<fceratto@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2212.codfw.wmnet with reason: Maintenance |
[production] |
| 06:56 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host netflow1003.eqiad.wmnet |
[production] |
| 06:56 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Depool db2212 T426703', diff saved to https://phabricator.wikimedia.org/P92584 and previous config saved to /var/cache/conftool/dbconfig/20260519-065637-fceratto.json |
[production] |
| 06:54 |
<moritzm> |
installing qemu security updates |
[production] |
| 06:52 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Promote db2203 to s1 primary T426703', diff saved to https://phabricator.wikimedia.org/P92583 and previous config saved to /var/cache/conftool/dbconfig/20260519-065224-fceratto.json |
[production] |
| 06:52 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on pc2021.codfw.wmnet,pc[1011,1021].eqiad.wmnet with reason: Maintenance on pc1 |
[production] |
| 06:51 |
<marostegui@cumin1003> |
END (PASS) - Cookbook sre.mysql.depool (exit_code=0) depool pc1011.eqiad.wmnet: Maintenance on pc1 |
[production] |
| 06:51 |
<marostegui@cumin1003> |
END (PASS) - Cookbook sre.mysql.parsercache (exit_code=0) |
[production] |
| 06:51 |
<federico3> |
Starting s1 codfw failover from db2212 to db2203 - T426703 |
[production] |
| 06:51 |
<marostegui@cumin1003> |
START - Cookbook sre.mysql.parsercache |
[production] |
| 06:51 |
<marostegui@cumin1003> |
START - Cookbook sre.mysql.depool depool pc1011.eqiad.wmnet: Maintenance on pc1 |
[production] |
| 06:50 |
<marostegui@cumin1003> |
END (FAIL) - Cookbook sre.mysql.parsercache (exit_code=99) |
[production] |
| 06:50 |
<marostegui@cumin1003> |
START - Cookbook sre.mysql.parsercache |
[production] |
| 06:48 |
<jiji@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc1058.eqiad.wmnet with reason: host reimage |
[production] |
| 06:45 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netflow1002.eqiad.wmnet |
[production] |
| 06:45 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Set db2203 with weight 0 T426703', diff saved to https://phabricator.wikimedia.org/P92581 and previous config saved to /var/cache/conftool/dbconfig/20260519-064500-fceratto.json |
[production] |
| 06:44 |
<fceratto@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 30 hosts with reason: Primary switchover s1 T426703 |
[production] |
| 06:44 |
<jiji@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc1057.eqiad.wmnet with reason: host reimage |
[production] |
| 06:41 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host netflow1002.eqiad.wmnet |
[production] |
| 06:40 |
<jiji@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mc1058.eqiad.wmnet with reason: host reimage |
[production] |
| 06:39 |
<jiji@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mc1057.eqiad.wmnet with reason: host reimage |
[production] |
| 06:33 |
<marostegui@cumin1003> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts pc2014.codfw.wmnet |
[production] |
| 06:33 |
<marostegui@cumin1003> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
| 06:33 |
<marostegui@cumin1003> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: pc2014.codfw.wmnet decommissioned, removing all IPs except the asset tag one - marostegui@cumin1003" |
[production] |
| 06:33 |
<marostegui@cumin1003> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: pc2014.codfw.wmnet decommissioned, removing all IPs except the asset tag one - marostegui@cumin1003" |
[production] |
| 06:29 |
<marostegui@cumin1003> |
START - Cookbook sre.dns.netbox |
[production] |
| 06:28 |
<jiji@cumin1003> |
START - Cookbook sre.hosts.reimage for host mc1058.eqiad.wmnet with OS bookworm |
[production] |
| 06:28 |
<fceratto@cumin1003> |
START - Cookbook sre.mysql.pool pool db1210: Repooling after switchover |
[production] |
| 06:28 |
<jiji@cumin1003> |
START - Cookbook sre.hosts.reimage for host mc1057.eqiad.wmnet with OS bookworm |
[production] |
| 06:24 |
<marostegui@cumin1003> |
START - Cookbook sre.hosts.decommission for hosts pc2014.codfw.wmnet |
[production] |
| 06:22 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Remove pc2014 from dbctl T426595', diff saved to https://phabricator.wikimedia.org/P92578 and previous config saved to /var/cache/conftool/dbconfig/20260519-062227-marostegui.json |
[production] |
| 06:22 |
<fceratto@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1210.eqiad.wmnet with reason: Maintenance |
[production] |
| 06:20 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Depool db1210 T426087', diff saved to https://phabricator.wikimedia.org/P92577 and previous config saved to /var/cache/conftool/dbconfig/20260519-062056-fceratto.json |
[production] |
| 06:19 |
<fceratto@dns1005> |
END - running authdns-update |
[production] |
| 06:18 |
<fceratto@dns1005> |
START - running authdns-update |
[production] |
| 06:15 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Promote db1230 to s5 primary and set section read-write T426087', diff saved to https://phabricator.wikimedia.org/P92576 and previous config saved to /var/cache/conftool/dbconfig/20260519-061524-fceratto.json |
[production] |
| 06:14 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Set s5 eqiad as read-only for maintenance - T426087', diff saved to https://phabricator.wikimedia.org/P92575 and previous config saved to /var/cache/conftool/dbconfig/20260519-061435-fceratto.json |
[production] |
| 06:14 |
<federico3> |
Starting s5 eqiad failover from db1210 to db1230 - T426087 |
[production] |
| 06:09 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Set db1230 with weight 0 T426087', diff saved to https://phabricator.wikimedia.org/P92574 and previous config saved to /var/cache/conftool/dbconfig/20260519-060929-fceratto.json |
[production] |
| 06:08 |
<fceratto@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 23 hosts with reason: Primary switchover s5 T426087 |
[production] |
| 05:11 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on pc2014.codfw.wmnet with reason: Maintenance on pc4 |
[production] |
| 04:02 |
<mwpresync@deploy1003> |
Pruned MediaWiki: 1.46.0-wmf.26 (duration: 02m 40s) |
[production] |