2024-02-22
|
08:58 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es1033 (re)pooling @ 100%: After migration', diff saved to https://phabricator.wikimedia.org/P57670 and previous config saved to /var/cache/conftool/dbconfig/20240222-085800-root.json |
[production] |
08:56 |
<arnaudb@cumin1002> |
START - Cookbook sre.mysql.upgrade for db2195.codfw.wmnet |
[production] |
08:55 |
<arnaudb@cumin1002> |
START - Cookbook sre.mysql.upgrade for db2143.codfw.wmnet |
[production] |
08:55 |
<arnaudb@cumin1002> |
START - Cookbook sre.mysql.upgrade for db1180.eqiad.wmnet |
[production] |
08:55 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'T356240 - depooling db1187 db2143 db2195', diff saved to https://phabricator.wikimedia.org/P57669 and previous config saved to /var/cache/conftool/dbconfig/20240222-085521-arnaudb.json |
[production] |
08:52 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on db[2143,2195].codfw.wmnet,db1187.eqiad.wmnet with reason: Silence for reboot T356240 |
[production] |
08:52 |
<jayme> |
rolling out prometheus-rsyslog-exporter 1.0.0+git20221110-1 to wikikube nodes - T357616 |
[production] |
08:52 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 0:30:00 on db[2143,2195].codfw.wmnet,db1187.eqiad.wmnet with reason: Silence for reboot T356240 |
[production] |
08:46 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2033 (re)pooling @ 5%: After migration to 10.6', diff saved to https://phabricator.wikimedia.org/P57668 and previous config saved to /var/cache/conftool/dbconfig/20240222-084616-root.json |
[production] |
08:44 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host puppetmaster1002.eqiad.wmnet |
[production] |
08:42 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es1033 (re)pooling @ 75%: After migration', diff saved to https://phabricator.wikimedia.org/P57667 and previous config saved to /var/cache/conftool/dbconfig/20240222-084255-root.json |
[production] |
08:42 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Depooling db1167 (T357189)', diff saved to https://phabricator.wikimedia.org/P57666 and previous config saved to /var/cache/conftool/dbconfig/20240222-084235-arnaudb.json |
[production] |
08:42 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on clouddb[1016,1020-1021].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance |
[production] |
08:42 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on clouddb[1016,1020-1021].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance |
[production] |
08:42 |
<jelto@cumin1002> |
START - Cookbook sre.hosts.reboot-single for host puppetmaster1002.eqiad.wmnet |
[production] |
08:42 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1167.eqiad.wmnet with reason: Maintenance |
[production] |
08:42 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1167.eqiad.wmnet with reason: Maintenance |
[production] |
08:31 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2033 (re)pooling @ 1%: After migration to 10.6', diff saved to https://phabricator.wikimedia.org/P57665 and previous config saved to /var/cache/conftool/dbconfig/20240222-083111-root.json |
[production] |
08:30 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host es2033.codfw.wmnet with OS bookworm |
[production] |
08:29 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'configure' for AS: 18779 |
[production] |
08:28 |
<ayounsi@cumin1002> |
START - Cookbook sre.network.peering with action 'configure' for AS: 18779 |
[production] |
08:27 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es1033 (re)pooling @ 50%: After migration', diff saved to https://phabricator.wikimedia.org/P57664 and previous config saved to /var/cache/conftool/dbconfig/20240222-082750-root.json |
[production] |
08:25 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'configure' for AS: 138997 |
[production] |
08:24 |
<ayounsi@cumin1002> |
START - Cookbook sre.network.peering with action 'configure' for AS: 138997 |
[production] |
08:24 |
<ayounsi@cumin1002> |
END (FAIL) - Cookbook sre.network.peering (exit_code=99) with action 'email' for AS: 138997 |
[production] |
08:23 |
<ayounsi@cumin1002> |
START - Cookbook sre.network.peering with action 'email' for AS: 138997 |
[production] |
08:21 |
<hoo@deploy2002> |
Finished scap: Backport for [[gerrit:1005467|Migrate to virtual domain mapping (T348526)]], [[gerrit:1005485|Migrate to virtual domain mapping (T348526)]] (duration: 14m 44s) |
[production] |
08:20 |
<marostegui@cumin1002> |
conftool action : set/pooled=yes; selector: name=clouddb1017.eqiad.wmnet,service=s1 |
[production] |
08:20 |
<marostegui@cumin1002> |
conftool action : set/pooled=yes; selector: name=clouddb1017.eqiad.wmnet,service=s3 |
[production] |
08:14 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on es2033.codfw.wmnet with reason: host reimage |
[production] |
08:13 |
<hoo@deploy2002> |
hoo: Continuing with sync |
[production] |
08:12 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es1033 (re)pooling @ 25%: After migration', diff saved to https://phabricator.wikimedia.org/P57663 and previous config saved to /var/cache/conftool/dbconfig/20240222-081243-root.json |
[production] |
08:12 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on es2033.codfw.wmnet with reason: host reimage |
[production] |
08:08 |
<hoo@deploy2002> |
hoo: Backport for [[gerrit:1005467|Migrate to virtual domain mapping (T348526)]], [[gerrit:1005485|Migrate to virtual domain mapping (T348526)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
08:06 |
<hoo@deploy2002> |
Started scap: Backport for [[gerrit:1005467|Migrate to virtual domain mapping (T348526)]], [[gerrit:1005485|Migrate to virtual domain mapping (T348526)]] |
[production] |
07:58 |
<taavi> |
taavi@puppetmaster1002 ~ $ sudo systemctl restart apache2 # lots of 'Error 500 on SERVER: Server Error: undefined method `content' for nil:NilClass' in the logs, seems to have helped |
[production] |
07:57 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es1033 (re)pooling @ 10%: After migration', diff saved to https://phabricator.wikimedia.org/P57662 and previous config saved to /var/cache/conftool/dbconfig/20240222-075738-root.json |
[production] |
07:54 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.reimage for host es2033.codfw.wmnet with OS bookworm |
[production] |
07:54 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2137 (re)pooling @ 100%: After migration to 10.6', diff saved to https://phabricator.wikimedia.org/P57661 and previous config saved to /var/cache/conftool/dbconfig/20240222-075448-root.json |
[production] |
07:42 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es1033 (re)pooling @ 5%: After migration', diff saved to https://phabricator.wikimedia.org/P57660 and previous config saved to /var/cache/conftool/dbconfig/20240222-074233-root.json |
[production] |
07:40 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool es2033 T358080', diff saved to https://phabricator.wikimedia.org/P57659 and previous config saved to /var/cache/conftool/dbconfig/20240222-074042-root.json |
[production] |
07:39 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2137 (re)pooling @ 75%: After migration to 10.6', diff saved to https://phabricator.wikimedia.org/P57658 and previous config saved to /var/cache/conftool/dbconfig/20240222-073943-root.json |
[production] |
07:30 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Promote es2026 as es2 codfw master T358080', diff saved to https://phabricator.wikimedia.org/P57657 and previous config saved to /var/cache/conftool/dbconfig/20240222-073017-marostegui.json |
[production] |
07:27 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es1033 (re)pooling @ 1%: After migration', diff saved to https://phabricator.wikimedia.org/P57656 and previous config saved to /var/cache/conftool/dbconfig/20240222-072729-root.json |
[production] |
07:24 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2137 (re)pooling @ 50%: After migration to 10.6', diff saved to https://phabricator.wikimedia.org/P57655 and previous config saved to /var/cache/conftool/dbconfig/20240222-072438-root.json |
[production] |
07:19 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host es1033.eqiad.wmnet with OS bookworm |
[production] |
07:09 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2137 (re)pooling @ 25%: After migration to 10.6', diff saved to https://phabricator.wikimedia.org/P57654 and previous config saved to /var/cache/conftool/dbconfig/20240222-070933-root.json |
[production] |
06:58 |
<marostegui@cumin1002> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on es1033.eqiad.wmnet with reason: host reimage |
[production] |
06:57 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on es1033.eqiad.wmnet with reason: host reimage |
[production] |
06:54 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2137 (re)pooling @ 10%: After migration to 10.6', diff saved to https://phabricator.wikimedia.org/P57653 and previous config saved to /var/cache/conftool/dbconfig/20240222-065428-root.json |
[production] |