2025-06-10
ยง
|
12:06 |
<taavi@cumin1003> |
START - Cookbook sre.dns.netbox |
[production] |
12:06 |
<taavi@cumin1003> |
END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) |
[production] |
12:06 |
<jmm@dns1004> |
END - running authdns-update |
[production] |
12:05 |
<jmm@dns1004> |
START - running authdns-update |
[production] |
12:04 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1168 (re)pooling @ 60%: Repooling', diff saved to https://phabricator.wikimedia.org/P77506 and previous config saved to /var/cache/conftool/dbconfig/20250610-120412-root.json |
[production] |
12:03 |
<taavi@cumin1003> |
START - Cookbook sre.dns.netbox |
[production] |
12:02 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1191 (T395241)', diff saved to https://phabricator.wikimedia.org/P77505 and previous config saved to /var/cache/conftool/dbconfig/20250610-120249-fceratto.json |
[production] |
12:01 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2189', diff saved to https://phabricator.wikimedia.org/P77504 and previous config saved to /var/cache/conftool/dbconfig/20250610-120103-marostegui.json |
[production] |
11:59 |
<jmm@cumin1003> |
END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=1) for host install7002.wikimedia.org |
[production] |
11:54 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db1191 (T395241)', diff saved to https://phabricator.wikimedia.org/P77503 and previous config saved to /var/cache/conftool/dbconfig/20250610-115444-fceratto.json |
[production] |
11:54 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1191.eqiad.wmnet with reason: Maintenance |
[production] |
11:54 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1181 (T395241)', diff saved to https://phabricator.wikimedia.org/P77502 and previous config saved to /var/cache/conftool/dbconfig/20250610-115419-fceratto.json |
[production] |
11:49 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1168 (re)pooling @ 40%: Repooling', diff saved to https://phabricator.wikimedia.org/P77501 and previous config saved to /var/cache/conftool/dbconfig/20250610-114906-root.json |
[production] |
11:48 |
<moritzm> |
installing qemu bugfix updates |
[production] |
11:47 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on dbstore1007.eqiad.wmnet with reason: Maintenance |
[production] |
11:47 |
<jmm@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host install7002.wikimedia.org |
[production] |
11:46 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1180 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P77500 and previous config saved to /var/cache/conftool/dbconfig/20250610-114617-root.json |
[production] |
11:45 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2189 (T396130)', diff saved to https://phabricator.wikimedia.org/P77499 and previous config saved to /var/cache/conftool/dbconfig/20250610-114556-marostegui.json |
[production] |
11:44 |
<cgoubert@deploy1003> |
Finished scap sync-world: mediawiki-cli: Fix the paths of some of the dumps scripts and config files - T394389 (duration: 08m 49s) |
[production] |
11:39 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1181', diff saved to https://phabricator.wikimedia.org/P77497 and previous config saved to /var/cache/conftool/dbconfig/20250610-113913-fceratto.json |
[production] |
11:37 |
<moritzm> |
failover Ganeti master in codfw to ganeti2032 |
[production] |
11:35 |
<cgoubert@deploy1003> |
Started scap sync-world: mediawiki-cli: Fix the paths of some of the dumps scripts and config files - T394389 |
[production] |
11:34 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1168 (re)pooling @ 20%: Repooling', diff saved to https://phabricator.wikimedia.org/P77495 and previous config saved to /var/cache/conftool/dbconfig/20250610-113401-root.json |
[production] |
11:33 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db2189 (T396130)', diff saved to https://phabricator.wikimedia.org/P77494 and previous config saved to /var/cache/conftool/dbconfig/20250610-113328-marostegui.json |
[production] |
11:33 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2189.codfw.wmnet with reason: Maintenance |
[production] |
11:33 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2175 (T396130)', diff saved to https://phabricator.wikimedia.org/P77493 and previous config saved to /var/cache/conftool/dbconfig/20250610-113306-marostegui.json |
[production] |
11:31 |
<jmm@cumin1003> |
END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2044.codfw.wmnet |
[production] |
11:31 |
<jmm@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2044.codfw.wmnet |
[production] |
11:31 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1180 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P77492 and previous config saved to /var/cache/conftool/dbconfig/20250610-113112-root.json |
[production] |
11:26 |
<jmm@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host ganeti2044.codfw.wmnet |
[production] |
11:24 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1181', diff saved to https://phabricator.wikimedia.org/P77491 and previous config saved to /var/cache/conftool/dbconfig/20250610-112406-fceratto.json |
[production] |
11:21 |
<jmm@cumin1003> |
START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2044.codfw.wmnet |
[production] |
11:21 |
<jmm@cumin1003> |
END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2043.codfw.wmnet |
[production] |
11:20 |
<jmm@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2043.codfw.wmnet |
[production] |
11:18 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1168 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P77490 and previous config saved to /var/cache/conftool/dbconfig/20250610-111856-root.json |
[production] |
11:17 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2175', diff saved to https://phabricator.wikimedia.org/P77489 and previous config saved to /var/cache/conftool/dbconfig/20250610-111759-marostegui.json |
[production] |
11:16 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1180 (re)pooling @ 60%: Repooling', diff saved to https://phabricator.wikimedia.org/P77488 and previous config saved to /var/cache/conftool/dbconfig/20250610-111606-root.json |
[production] |
11:15 |
<jmm@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host ganeti2043.codfw.wmnet |
[production] |
11:14 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db1168.eqiad.wmnet with reason: Maintenance |
[production] |
11:14 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool db1168 T395989', diff saved to https://phabricator.wikimedia.org/P77487 and previous config saved to /var/cache/conftool/dbconfig/20250610-111440-marostegui.json |
[production] |
11:10 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2033 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P77486 and previous config saved to /var/cache/conftool/dbconfig/20250610-111054-root.json |
[production] |
11:08 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1181 (T395241)', diff saved to https://phabricator.wikimedia.org/P77485 and previous config saved to /var/cache/conftool/dbconfig/20250610-110859-fceratto.json |
[production] |
11:04 |
<hnowlan@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/rest-gateway: apply |
[production] |
11:04 |
<hnowlan@deploy1003> |
helmfile [eqiad] START helmfile.d/services/rest-gateway: apply |
[production] |
11:02 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2175', diff saved to https://phabricator.wikimedia.org/P77484 and previous config saved to /var/cache/conftool/dbconfig/20250610-110252-marostegui.json |
[production] |
11:01 |
<hnowlan@deploy1003> |
helmfile [codfw] DONE helmfile.d/services/rest-gateway: apply |
[production] |
11:01 |
<hnowlan@deploy1003> |
helmfile [codfw] START helmfile.d/services/rest-gateway: apply |
[production] |
11:01 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1180 (re)pooling @ 40%: Repooling', diff saved to https://phabricator.wikimedia.org/P77483 and previous config saved to /var/cache/conftool/dbconfig/20250610-110101-root.json |
[production] |
10:59 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db1181 (T395241)', diff saved to https://phabricator.wikimedia.org/P77482 and previous config saved to /var/cache/conftool/dbconfig/20250610-105951-fceratto.json |
[production] |
10:59 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1181.eqiad.wmnet with reason: Maintenance |
[production] |