2020-03-06
§
|
09:46 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.aqs.roll-restart (exit_code=0) |
[production] |
09:45 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
09:43 |
<elukey@cumin1001> |
START - Cookbook sre.aqs.roll-restart |
[production] |
09:42 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
09:21 |
<marostegui> |
Stop MySQL on db2084:3315, db2084:3314 for reimage T246604 |
[production] |
09:21 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db2084:3314, db2084:3315 for reimage to buster - T246604', diff saved to https://phabricator.wikimedia.org/P10645 and previous config saved to /var/cache/conftool/dbconfig/20200306-092103-marostegui.json |
[production] |
09:20 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Slowly repool db1074', diff saved to https://phabricator.wikimedia.org/P10644 and previous config saved to /var/cache/conftool/dbconfig/20200306-092026-marostegui.json |
[production] |
09:12 |
<moritzm> |
rolling restart of mw canaries to pick up libidn security updates |
[production] |
09:03 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Slowly repool db1074', diff saved to https://phabricator.wikimedia.org/P10643 and previous config saved to /var/cache/conftool/dbconfig/20200306-090328-marostegui.json |
[production] |
09:00 |
<moritzm> |
installing libidn security updates |
[production] |
08:56 |
<moritzm> |
rolling restart of kartotherian/tilerator/tileratorui to pick up OpenJPEG security updates |
[production] |
08:56 |
<marostegui> |
Stop MySQL on db1074 for upgrade T239791 |
[production] |
08:56 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1074 for upgrade T239791', diff saved to https://phabricator.wikimedia.org/P10642 and previous config saved to /var/cache/conftool/dbconfig/20200306-085435-marostegui.json |
[production] |
08:53 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repool db1113:3315, db1113:3316 after upgrade - T239791', diff saved to https://phabricator.wikimedia.org/P10641 and previous config saved to /var/cache/conftool/dbconfig/20200306-085332-marostegui.json |
[production] |
08:47 |
<marostegui> |
Stop mysql for db1113:3315, db1113:3316 for upgrade T239791 |
[production] |
08:44 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1113:3315, db1113:3316 for upgrade - T239791', diff saved to https://phabricator.wikimedia.org/P10640 and previous config saved to /var/cache/conftool/dbconfig/20200306-084439-marostegui.json |
[production] |
08:41 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Fully repool db1078 T246604', diff saved to https://phabricator.wikimedia.org/P10639 and previous config saved to /var/cache/conftool/dbconfig/20200306-084141-marostegui.json |
[production] |
08:28 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repool db2085:3311, db2085:3318 after reimage to buster - T246604', diff saved to https://phabricator.wikimedia.org/P10638 and previous config saved to /var/cache/conftool/dbconfig/20200306-082858-marostegui.json |
[production] |
08:25 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Slowly repool db1078 T246604', diff saved to https://phabricator.wikimedia.org/P10637 and previous config saved to /var/cache/conftool/dbconfig/20200306-082549-marostegui.json |
[production] |
08:19 |
<moritzm> |
installing openjpeg2 security updates |
[production] |
08:11 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
08:09 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
07:50 |
<marostegui> |
Stop MySQL on db2085:3311, db2085:3318 for reimage to buster T246604 |
[production] |
07:44 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db2085:3311, db2085:3318 for reimage to buster - T246604', diff saved to https://phabricator.wikimedia.org/P10636 and previous config saved to /var/cache/conftool/dbconfig/20200306-074427-marostegui.json |
[production] |
07:37 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Slowly repool db1078 T246604', diff saved to https://phabricator.wikimedia.org/P10635 and previous config saved to /var/cache/conftool/dbconfig/20200306-073707-marostegui.json |
[production] |
07:05 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Slowly repool db1078 T246604', diff saved to https://phabricator.wikimedia.org/P10634 and previous config saved to /var/cache/conftool/dbconfig/20200306-070538-marostegui.json |
[production] |
06:48 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Install 10.4 instead of 10.3 on db1078', diff saved to https://phabricator.wikimedia.org/P10633 and previous config saved to /var/cache/conftool/dbconfig/20200306-064800-marostegui.json |
[production] |
01:38 |
<mutante> |
added 9 more appservers to codfw pool split between appserver and API appservers, weight 15 (like all in codfw) T247021 |
[production] |
01:37 |
<mutante> |
added 9 more appservers to codfw pool |
[production] |
01:34 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw230[1-9].codfw.wmnet |
[production] |
01:34 |
<dzahn@cumin1001> |
conftool action : set/weight=15; selector: name=mw230[1-9].codfw.wmnet |
[production] |
01:01 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
00:58 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
00:33 |
<cdanis> |
repool esams T246338 |
[production] |
00:19 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
00:19 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
00:02 |
<cdanis> |
T246338 depool esams for router maintenance |
[production] |
2020-03-05
§
|
23:55 |
<mutante> |
pooled mw2290 - noticed it was the only API appserver in codfw not pooled but did not see why, fine in Icinga and no open tickets/SAL |
[production] |
23:55 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2290.codfw.wmnet |
[production] |
23:30 |
<rzl@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw1413.eqiad.wmnet |
[production] |
23:27 |
<rzl@cumin1001> |
conftool action : set/weight=30; selector: name=mw1413.eqiad.wmnet |
[production] |
23:26 |
<rlazarus> |
mw1413 test-reimage completed successfully, pooling |
[production] |
23:03 |
<rzl@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
23:01 |
<rzl@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
22:50 |
<mutante> |
added 8 new appservers to pool in eqiad |
[production] |
22:50 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw139[0-2].eqiad.wmnet |
[production] |
22:47 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw138[5-9].eqiad.wmnet |
[production] |
22:47 |
<dzahn@cumin1001> |
conftool action : set/weight=30; selector: name=mw138[5-9].eqiad.wmnet |
[production] |
22:46 |
<dzahn@cumin1001> |
conftool action : set/weight=30; selector: name=mw139[0-2].eqiad.wmnet |
[production] |
22:46 |
<dzahn@cumin1001> |
conftool action : set/weight=20; selector: name=mw139[0-2].eqiad.wmnet |
[production] |