2025-06-12
ยง
|
11:00 |
<jmm@cumin1003> |
END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) ncredir7004.magru.wmnet on all recursors |
[production] |
10:59 |
<jmm@cumin1003> |
START - Cookbook sre.dns.wipe-cache ncredir7004.magru.wmnet on all recursors |
[production] |
10:59 |
<jmm@cumin1003> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
10:59 |
<jmm@cumin1003> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM ncredir7004.magru.wmnet - jmm@cumin1003" |
[production] |
10:58 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2145', diff saved to https://phabricator.wikimedia.org/P77821 and previous config saved to /var/cache/conftool/dbconfig/20250612-105848-fceratto.json |
[production] |
10:57 |
<jmm@cumin1003> |
START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1051.eqiad.wmnet |
[production] |
10:56 |
<jmm@cumin1003> |
END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1050.eqiad.wmnet |
[production] |
10:56 |
<jmm@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1050.eqiad.wmnet |
[production] |
10:50 |
<jmm@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host ganeti1050.eqiad.wmnet |
[production] |
10:50 |
<jmm@cumin1003> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM ncredir7004.magru.wmnet - jmm@cumin1003" |
[production] |
10:50 |
<fceratto@deploy1003> |
helmfile [aux-k8s-eqiad] 'sync' command on namespace 'zarcillo' for release 'main' . |
[production] |
10:47 |
<jmm@cumin1003> |
START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1050.eqiad.wmnet |
[production] |
10:47 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1212', diff saved to https://phabricator.wikimedia.org/P77820 and previous config saved to /var/cache/conftool/dbconfig/20250612-104706-marostegui.json |
[production] |
10:44 |
<jmm@cumin1003> |
END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1049.eqiad.wmnet |
[production] |
10:43 |
<jmm@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1049.eqiad.wmnet |
[production] |
10:43 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2145', diff saved to https://phabricator.wikimedia.org/P77819 and previous config saved to /var/cache/conftool/dbconfig/20250612-104341-fceratto.json |
[production] |
10:43 |
<jmm@cumin1003> |
START - Cookbook sre.dns.netbox |
[production] |
10:43 |
<jmm@cumin1003> |
START - Cookbook sre.ganeti.makevm for new host ncredir7004.magru.wmnet |
[production] |
10:42 |
<jmm@cumin1003> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host ganeti2050.codfw.wmnet with OS bookworm |
[production] |
10:38 |
<jmm@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host ganeti1049.eqiad.wmnet |
[production] |
10:36 |
<dcaro> |
rebooting tools-prometheus-8 due to the VM having load issues (not responding to ssh) |
[tools] |
10:36 |
<cgoubert@deploy1003> |
Finished scap sync-world: 1156288: mediawiki: Add job history limit control - T395885 (duration: 02m 48s) |
[production] |
10:34 |
<wmbot~dcaro@acme> |
END (PASS) - Cookbook wmcs.openstack.cloudvirt.vm_console (exit_code=0) |
[tools] |
10:33 |
<cgoubert@deploy1003> |
Started scap sync-world: 1156288: mediawiki: Add job history limit control - T395885 |
[production] |
10:32 |
<jmm@cumin1003> |
START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1049.eqiad.wmnet |
[production] |
10:32 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1212 (T396130)', diff saved to https://phabricator.wikimedia.org/P77818 and previous config saved to /var/cache/conftool/dbconfig/20250612-103159-marostegui.json |
[production] |
10:28 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2145 (T395241)', diff saved to https://phabricator.wikimedia.org/P77817 and previous config saved to /var/cache/conftool/dbconfig/20250612-102834-fceratto.json |
[production] |
10:28 |
<wmbot~dcaro@acme> |
START - Cookbook wmcs.openstack.cloudvirt.vm_console |
[tools] |
10:27 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db1212 (T396130)', diff saved to https://phabricator.wikimedia.org/P77816 and previous config saved to /var/cache/conftool/dbconfig/20250612-102700-marostegui.json |
[production] |
10:26 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1013,1017].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance |
[production] |
10:26 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1212.eqiad.wmnet with reason: Maintenance |
[production] |
10:26 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1198 (T396130)', diff saved to https://phabricator.wikimedia.org/P77815 and previous config saved to /var/cache/conftool/dbconfig/20250612-102630-marostegui.json |
[production] |
10:25 |
<jmm@cumin1003> |
START - Cookbook sre.hosts.reimage for host ganeti2050.codfw.wmnet with OS bookworm |
[production] |
10:24 |
<jmm@cumin1003> |
END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) for new host ncredir7004.magru.wmnet |
[production] |
10:23 |
<jmm@cumin1003> |
END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) |
[production] |
10:23 |
<jmm@cumin1003> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host ganeti2050.codfw.wmnet with OS bookworm |
[production] |
10:16 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db2145 (T395241)', diff saved to https://phabricator.wikimedia.org/P77814 and previous config saved to /var/cache/conftool/dbconfig/20250612-101655-fceratto.json |
[production] |
10:16 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2145.codfw.wmnet with reason: Maintenance |
[production] |
10:14 |
<moritzm> |
installing Kerberos security updates |
[production] |
10:11 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1198', diff saved to https://phabricator.wikimedia.org/P77813 and previous config saved to /var/cache/conftool/dbconfig/20250612-101123-marostegui.json |
[production] |
10:11 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2141.codfw.wmnet with reason: Maintenance |
[production] |
10:07 |
<jmm@cumin1003> |
START - Cookbook sre.dns.netbox |
[production] |
10:07 |
<jmm@cumin1003> |
START - Cookbook sre.ganeti.makevm for new host ncredir7004.magru.wmnet |
[production] |
10:06 |
<jmm@cumin1003> |
START - Cookbook sre.hosts.reimage for host ganeti2050.codfw.wmnet with OS bookworm |
[production] |
09:53 |
<jmm@cumin1003> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ganeti2049.codfw.wmnet with OS bookworm |
[production] |
09:50 |
<esanders@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1156247|Support placeholders mangled by MF's HtmlFormatter (T396695)]] (duration: 10m 37s) |
[production] |
09:46 |
<cmooney@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1047.eqiad.wmnet |
[production] |
09:43 |
<esanders@deploy1003> |
esanders: Continuing with sync |
[production] |
09:42 |
<jmm@cumin1003> |
END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) for new host ncredir7004.magru.wmnet |
[production] |
09:42 |
<jmm@cumin1003> |
END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) ncredir7004.magru.wmnet on all recursors |
[production] |