|
2026-05-20
ยง
|
| 16:20 |
<blake@cumin1003> |
START - Cookbook sre.k8s.pool-depool-node depool for host wikikube-worker[1322-1324].eqiad.wmnet |
[production] |
| 16:19 |
<blake@cumin1003> |
END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker[1318-1321].eqiad.wmnet |
[production] |
| 16:19 |
<blake@cumin1003> |
START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker[1318-1321].eqiad.wmnet |
[production] |
| 16:14 |
<btullis@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-mariadb1002.eqiad.wmnet |
[production] |
| 16:12 |
<cwilliams@cumin1003> |
END (PASS) - Cookbook sre.mysql.major-upgrade (exit_code=0) |
[production] |
| 16:12 |
<cwilliams@cumin1003> |
END (PASS) - Cookbook sre.mysql.pool (exit_code=0) pool db1257: Migration of db1257.eqiad.wmnet completed |
[production] |
| 16:12 |
<blake@cumin1003> |
END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host wikikube-worker[1318-1321].eqiad.wmnet |
[production] |
| 16:10 |
<pt1979@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cr4-ulsfo,cr4-ulsfo IPv6,cr4-ulsfo.mgmt with reason: switch refresh |
[production] |
| 16:10 |
<blake@cumin1003> |
START - Cookbook sre.k8s.pool-depool-node depool for host wikikube-worker[1318-1321].eqiad.wmnet |
[production] |
| 16:10 |
<urbanecm@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1289980|Fix newFromUserIdentity calls with interwiki users (T426832)]] (duration: 09m 12s) |
[production] |
| 16:09 |
<blake@cumin1003> |
END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker[1314-1317].eqiad.wmnet |
[production] |
| 16:09 |
<blake@cumin1003> |
START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker[1314-1317].eqiad.wmnet |
[production] |
| 16:08 |
<btullis@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host an-mariadb1002.eqiad.wmnet |
[production] |
| 16:07 |
<brouberol@cumin1003> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kafka-jumbo1015.eqiad.wmnet with OS trixie |
[production] |
| 16:07 |
<pt1979@cumin1003> |
END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for cr3-ulsfo,cr3-ulsfo IPv6,cr3-ulsfo.mgmt |
[production] |
| 16:07 |
<pt1979@cumin1003> |
START - Cookbook sre.hosts.remove-downtime for cr3-ulsfo,cr3-ulsfo IPv6,cr3-ulsfo.mgmt |
[production] |
| 16:05 |
<urbanecm@deploy1003> |
urbanecm, mszwarc: Continuing with deployment |
[production] |
| 16:05 |
<brett@cumin2002> |
cookbooks.sre.dns.roll-reboot finished rebooting dns1004.wikimedia.org |
[production] |
| 16:05 |
<eevans@cumin1003> |
START - Cookbook sre.cassandra.roll-reboot rolling reboot on A:cassandra-dev |
[production] |
| 16:02 |
<urbanecm@deploy1003> |
urbanecm, mszwarc: Backport for [[gerrit:1289980|Fix newFromUserIdentity calls with interwiki users (T426832)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 16:02 |
<blake@cumin1003> |
END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host wikikube-worker[1314-1317].eqiad.wmnet |
[production] |
| 16:00 |
<urbanecm@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1289980|Fix newFromUserIdentity calls with interwiki users (T426832)]] |
[production] |
| 16:00 |
<blake@cumin1003> |
START - Cookbook sre.k8s.pool-depool-node depool for host wikikube-worker[1314-1317].eqiad.wmnet |
[production] |
| 15:59 |
<blake@cumin1003> |
END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker[1310-1313].eqiad.wmnet |
[production] |
| 15:59 |
<blake@cumin1003> |
START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker[1310-1313].eqiad.wmnet |
[production] |
| 15:59 |
<btullis@cumin1003> |
END (PASS) - Cookbook sre.misc-clusters.roll-restart-reboot-eventschemas (exit_code=0) rolling reboot on A:schema |
[production] |
| 15:57 |
<bking@cumin2002> |
conftool action : set/pooled=true; selector: dnsdisc=wdqs-scholarly,name=eqiad |
[production] |
| 15:56 |
<bking@cumin2002> |
END (ERROR) - Cookbook sre.elasticsearch.rolling-operation (exit_code=97) Operation.RESTART (1 nodes at a time) for ElasticSearch cluster cloudelastic: T426560 - bking@cumin2002 |
[production] |
| 15:54 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
| 15:54 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. |
[production] |
| 15:52 |
<btullis@cumin1003> |
START - Cookbook sre.k8s.pool-depool-node depool for host dse-k8s-worker1028.eqiad.wmnet |
[production] |
| 15:52 |
<btullis@cumin1003> |
END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host dse-k8s-worker1027.eqiad.wmnet |
[production] |
| 15:52 |
<btullis@cumin1003> |
START - Cookbook sre.k8s.pool-depool-node pool for host dse-k8s-worker1027.eqiad.wmnet |
[production] |
| 15:52 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host wdqs1023.eqiad.wmnet |
[production] |
| 15:52 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host wdqs1024.eqiad.wmnet |
[production] |
| 15:51 |
<brett@cumin2002> |
cookbooks.sre.dns.roll-reboot begin reboot of dns1004.wikimedia.org |
[production] |
| 15:51 |
<brett@cumin2002> |
START - Cookbook sre.dns.roll-reboot rolling reboot on A:dnsbox and not P{dns6002.wikimedia.org} and not A:magru and (A:dnsbox) |
[production] |
| 15:51 |
<blake@cumin1003> |
END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host wikikube-worker[1310-1313].eqiad.wmnet |
[production] |
| 15:50 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.dns.roll-reboot (exit_code=0) rolling reboot on P{dns6002.wikimedia.org} and (A:dnsbox) |
[production] |
| 15:50 |
<brett@cumin2002> |
cookbooks.sre.dns.roll-reboot finished rebooting dns6002.wikimedia.org |
[production] |
| 15:50 |
<bking@cumin2002> |
START - Cookbook sre.elasticsearch.rolling-operation Operation.RESTART (1 nodes at a time) for ElasticSearch cluster cloudelastic: T426560 - bking@cumin2002 |
[production] |
| 15:49 |
<brouberol@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kafka-jumbo1015.eqiad.wmnet with reason: host reimage |
[production] |
| 15:49 |
<blake@cumin1003> |
START - Cookbook sre.k8s.pool-depool-node depool for host wikikube-worker[1310-1313].eqiad.wmnet |
[production] |
| 15:48 |
<btullis@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host matomo1003.eqiad.wmnet |
[production] |
| 15:48 |
<blake@cumin1003> |
END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker[1306-1309].eqiad.wmnet |
[production] |
| 15:48 |
<blake@cumin1003> |
START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker[1306-1309].eqiad.wmnet |
[production] |
| 15:46 |
<brett@cumin2002> |
END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) for hosts cp[2041-2042].codfw.wmnet |
[production] |
| 15:46 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
| 15:46 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: cp[2041-2042].codfw.wmnet decommissioned, removing all IPs except the asset tag one - brett@cumin2002" |
[production] |
| 15:46 |
<btullis@cumin1003> |
END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host dse-k8s-worker1027.eqiad.wmnet |
[production] |