2022-01-26
ยง
|
19:15 |
<lucaswerkmeister-wmde@deploy1002> |
Synchronized wmf-config/InitialiseSettings-labs.php: Config: [[gerrit:756985|[wmf-config] Undeploy gdi survey on cawiki beta (T299913)]] (no-op sync, beta only) (duration: 00m 52s) |
[production] |
19:10 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1146:3312 (T298559)', diff saved to https://phabricator.wikimedia.org/P19374 and previous config saved to /var/cache/conftool/dbconfig/20220126-191002-marostegui.json |
[production] |
18:54 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1146:3312', diff saved to https://phabricator.wikimedia.org/P19373 and previous config saved to /var/cache/conftool/dbconfig/20220126-185457-marostegui.json |
[production] |
18:39 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1146:3312', diff saved to https://phabricator.wikimedia.org/P19372 and previous config saved to /var/cache/conftool/dbconfig/20220126-183953-marostegui.json |
[production] |
18:24 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1146:3312 (T298559)', diff saved to https://phabricator.wikimedia.org/P19371 and previous config saved to /var/cache/conftool/dbconfig/20220126-182448-marostegui.json |
[production] |
18:23 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depooling db1146:3312 (T298559)', diff saved to https://phabricator.wikimedia.org/P19370 and previous config saved to /var/cache/conftool/dbconfig/20220126-182333-marostegui.json |
[production] |
18:23 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1146.eqiad.wmnet with reason: Maintenance |
[production] |
18:23 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 6:00:00 on db1146.eqiad.wmnet with reason: Maintenance |
[production] |
18:23 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1170:3312 (T298559)', diff saved to https://phabricator.wikimedia.org/P19369 and previous config saved to /var/cache/conftool/dbconfig/20220126-182325-marostegui.json |
[production] |
18:14 |
<hnowlan@puppetmaster1001> |
conftool action : set/pooled=yes; selector: name=restbase1019.eqiad.wmnet |
[production] |
18:14 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host restbase1019.eqiad.wmnet with OS buster |
[production] |
18:08 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1170:3312', diff saved to https://phabricator.wikimedia.org/P19368 and previous config saved to /var/cache/conftool/dbconfig/20220126-180819-marostegui.json |
[production] |
18:02 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-editquality' for release 'main' . |
[production] |
17:59 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. |
[production] |
17:59 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. |
[production] |
17:54 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'es1025 (re)pooling @ 100%: repooling after reimage', diff saved to https://phabricator.wikimedia.org/P19366 and previous config saved to /var/cache/conftool/dbconfig/20220126-175405-root.json |
[production] |
17:53 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. |
[production] |
17:53 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1170:3312', diff saved to https://phabricator.wikimedia.org/P19365 and previous config saved to /var/cache/conftool/dbconfig/20220126-175315-marostegui.json |
[production] |
17:52 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. |
[production] |
17:39 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'es1025 (re)pooling @ 75%: repooling after reimage', diff saved to https://phabricator.wikimedia.org/P19364 and previous config saved to /var/cache/conftool/dbconfig/20220126-173901-root.json |
[production] |
17:38 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1170:3312 (T298559)', diff saved to https://phabricator.wikimedia.org/P19363 and previous config saved to /var/cache/conftool/dbconfig/20220126-173810-marostegui.json |
[production] |
17:36 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depooling db1170:3312 (T298559)', diff saved to https://phabricator.wikimedia.org/P19361 and previous config saved to /var/cache/conftool/dbconfig/20220126-173654-marostegui.json |
[production] |
17:36 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1170.eqiad.wmnet with reason: Maintenance |
[production] |
17:36 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 6:00:00 on db1170.eqiad.wmnet with reason: Maintenance |
[production] |
17:36 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1156 (T298559)', diff saved to https://phabricator.wikimedia.org/P19360 and previous config saved to /var/cache/conftool/dbconfig/20220126-173647-marostegui.json |
[production] |
17:23 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'es1025 (re)pooling @ 60%: repooling after reimage', diff saved to https://phabricator.wikimedia.org/P19359 and previous config saved to /var/cache/conftool/dbconfig/20220126-172358-root.json |
[production] |
17:21 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1156', diff saved to https://phabricator.wikimedia.org/P19357 and previous config saved to /var/cache/conftool/dbconfig/20220126-172141-marostegui.json |
[production] |
17:21 |
<hnowlan@cumin1001> |
START - Cookbook sre.hosts.reimage for host restbase1019.eqiad.wmnet with OS buster |
[production] |
17:08 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'es1025 (re)pooling @ 50%: repooling after reimage', diff saved to https://phabricator.wikimedia.org/P19356 and previous config saved to /var/cache/conftool/dbconfig/20220126-170852-root.json |
[production] |
17:06 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1156', diff saved to https://phabricator.wikimedia.org/P19355 and previous config saved to /var/cache/conftool/dbconfig/20220126-170635-marostegui.json |
[production] |
16:53 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'es1025 (re)pooling @ 40%: repooling after reimage', diff saved to https://phabricator.wikimedia.org/P19354 and previous config saved to /var/cache/conftool/dbconfig/20220126-165349-root.json |
[production] |
16:53 |
<jayme> |
published image docker-registry.discovery.wmnet/cfssl-issuer:0.2.1-1 - T299906 |
[production] |
16:51 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1156 (T298559)', diff saved to https://phabricator.wikimedia.org/P19353 and previous config saved to /var/cache/conftool/dbconfig/20220126-165130-marostegui.json |
[production] |
16:51 |
<ryankemper> |
[WCQS Deploy] Restarted updaters across fleet: `ryankemper@cumin1001:~$ sudo cumin -b 6 'wcqs*' 'sudo systemctl restart wcqs-updater'` |
[production] |
16:47 |
<moritzm> |
draining instances off ganeti1007 for reimage |
[production] |
16:38 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'es1025 (re)pooling @ 25%: repooling after reimage', diff saved to https://phabricator.wikimedia.org/P19352 and previous config saved to /var/cache/conftool/dbconfig/20220126-163845-root.json |
[production] |
16:34 |
<hnowlan@puppetmaster1001> |
conftool action : set/pooled=no; selector: name=restbase1019.eqiad.wmnet |
[production] |
16:33 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on restbase1019.eqiad.wmnet with reason: Firmware upgrade |
[production] |
16:33 |
<hnowlan@cumin1001> |
START - Cookbook sre.hosts.downtime for 1:00:00 on restbase1019.eqiad.wmnet with reason: Firmware upgrade |
[production] |
16:30 |
<cmjohnson@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
16:28 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depooling db1156 (T298559)', diff saved to https://phabricator.wikimedia.org/P19351 and previous config saved to /var/cache/conftool/dbconfig/20220126-162810-marostegui.json |
[production] |
16:28 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on clouddb[1014,1018,1021].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance |
[production] |
16:28 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 12:00:00 on clouddb[1014,1018,1021].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance |
[production] |
16:28 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1156.eqiad.wmnet with reason: Maintenance |
[production] |
16:28 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 6:00:00 on db1156.eqiad.wmnet with reason: Maintenance |
[production] |
16:27 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1182 (T298559)', diff saved to https://phabricator.wikimedia.org/P19350 and previous config saved to /var/cache/conftool/dbconfig/20220126-162756-marostegui.json |
[production] |
16:26 |
<cmjohnson@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
16:23 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'es1025 (re)pooling @ 20%: repooling after reimage', diff saved to https://phabricator.wikimedia.org/P19349 and previous config saved to /var/cache/conftool/dbconfig/20220126-162342-root.json |
[production] |
16:23 |
<elukey> |
restart varnishkafka instances on cp1087 |
[production] |
16:17 |
<cmjohnson@cumin1001> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cloudvirt1047.eqiad.wmnet with OS bullseye |
[production] |