2024-04-03
§
|
16:29 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 16:00:00 on clouddb[1016,1020-1021].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance |
[production] |
16:29 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1167.eqiad.wmnet with reason: Maintenance |
[production] |
16:29 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 8:00:00 on db1167.eqiad.wmnet with reason: Maintenance |
[production] |
16:26 |
<effie> |
pooling back mw-web-ro in eqiad |
[production] |
16:26 |
<jayme@deploy1002> |
Started scap: (no justification provided) |
[production] |
16:26 |
<jiji@cumin1002> |
conftool action : set/pooled=true; selector: dnsdisc=mw-web-ro,name=eqiad |
[production] |
16:19 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2124', diff saved to https://phabricator.wikimedia.org/P59358 and previous config saved to /var/cache/conftool/dbconfig/20240403-161933-arnaudb.json |
[production] |
16:14 |
<jiji@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mw-web: apply |
[production] |
16:12 |
<jiji@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mw-web: apply |
[production] |
16:08 |
<akosiaris@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/changeprop: apply |
[production] |
16:07 |
<akosiaris@deploy1002> |
helmfile [codfw] START helmfile.d/services/changeprop: apply |
[production] |
16:05 |
<akosiaris@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/changeprop: apply |
[production] |
16:05 |
<akosiaris@deploy1002> |
helmfile [eqiad] START helmfile.d/services/changeprop: apply |
[production] |
16:04 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2124 (T360332)', diff saved to https://phabricator.wikimedia.org/P59357 and previous config saved to /var/cache/conftool/dbconfig/20240403-160425-arnaudb.json |
[production] |
16:02 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Depooling db2124 (T360332)', diff saved to https://phabricator.wikimedia.org/P59356 and previous config saved to /var/cache/conftool/dbconfig/20240403-160159-arnaudb.json |
[production] |
16:02 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2124.codfw.wmnet with reason: Maintenance |
[production] |
16:01 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 12:00:00 on db2124.codfw.wmnet with reason: Maintenance |
[production] |
16:01 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2114 (T360332)', diff saved to https://phabricator.wikimedia.org/P59355 and previous config saved to /var/cache/conftool/dbconfig/20240403-160136-arnaudb.json |
[production] |
15:53 |
<jiji@cumin1002> |
conftool action : set/pooled=false; selector: dnsdisc=mw-web-ro,name=eqiad |
[production] |
15:46 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2114', diff saved to https://phabricator.wikimedia.org/P59354 and previous config saved to /var/cache/conftool/dbconfig/20240403-154628-arnaudb.json |
[production] |
15:33 |
<jiji@cumin1002> |
END (PASS) - Cookbook sre.discovery.datacenter (exit_code=0) status all services in all: None - None |
[production] |
15:33 |
<jiji@cumin1002> |
START - Cookbook sre.discovery.datacenter status all services in all: None - None |
[production] |
15:31 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2114', diff saved to https://phabricator.wikimedia.org/P59353 and previous config saved to /var/cache/conftool/dbconfig/20240403-153121-arnaudb.json |
[production] |
15:22 |
<pfischer@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
15:22 |
<Dreamy_Jazz> |
Starting MediaModeration scanning script again - It crashed due to the outage |
[production] |
15:22 |
<pfischer@deploy1002> |
helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
15:16 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2114 (T360332)', diff saved to https://phabricator.wikimedia.org/P59352 and previous config saved to /var/cache/conftool/dbconfig/20240403-151614-arnaudb.json |
[production] |
15:13 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Depooling db2114 (T360332)', diff saved to https://phabricator.wikimedia.org/P59351 and previous config saved to /var/cache/conftool/dbconfig/20240403-151349-arnaudb.json |
[production] |
15:13 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2114.codfw.wmnet with reason: Maintenance |
[production] |
15:13 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 12:00:00 on db2114.codfw.wmnet with reason: Maintenance |
[production] |
15:13 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2097.codfw.wmnet with reason: Maintenance |
[production] |
15:12 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 12:00:00 on db2097.codfw.wmnet with reason: Maintenance |
[production] |
15:12 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on dbstore1009.eqiad.wmnet with reason: Maintenance |
[production] |
15:12 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 12:00:00 on dbstore1009.eqiad.wmnet with reason: Maintenance |
[production] |
15:12 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1231 (T360332)', diff saved to https://phabricator.wikimedia.org/P59350 and previous config saved to /var/cache/conftool/dbconfig/20240403-151233-arnaudb.json |
[production] |
15:04 |
<jynus@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2098.codfw.wmnet with reason: restart of mysqld |
[production] |
15:03 |
<jynus@cumin1002> |
START - Cookbook sre.hosts.downtime for 4:00:00 on db2098.codfw.wmnet with reason: restart of mysqld |
[production] |
15:02 |
<dreamyjazz@deploy1002> |
Finished scap: (no justification provided) (duration: 18m 54s) |
[production] |
15:01 |
<hnowlan@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/wikifeeds: sync |
[production] |
15:01 |
<aborrero@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1040.eqiad.wmnet with OS bookworm |
[production] |
15:01 |
<hnowlan@deploy1002> |
helmfile [eqiad] START helmfile.d/services/wikifeeds: sync |
[production] |
14:57 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1231', diff saved to https://phabricator.wikimedia.org/P59349 and previous config saved to /var/cache/conftool/dbconfig/20240403-145725-arnaudb.json |
[production] |
14:44 |
<dreamyjazz@deploy1002> |
Started scap: (no justification provided) |
[production] |
14:42 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1231', diff saved to https://phabricator.wikimedia.org/P59347 and previous config saved to /var/cache/conftool/dbconfig/20240403-144217-arnaudb.json |
[production] |
14:34 |
<aborrero@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1040.eqiad.wmnet with reason: host reimage |
[production] |
14:31 |
<aborrero@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt1040.eqiad.wmnet with reason: host reimage |
[production] |
14:27 |
<hnowlan@deploy1002> |
helmfile [eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
14:27 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1231 (T360332)', diff saved to https://phabricator.wikimedia.org/P59346 and previous config saved to /var/cache/conftool/dbconfig/20240403-142709-arnaudb.json |
[production] |
14:27 |
<hnowlan@deploy1002> |
helmfile [eqiad] START helmfile.d/admin 'apply'. |
[production] |
14:26 |
<hnowlan@deploy1002> |
helmfile [codfw] DONE helmfile.d/admin 'apply'. |
[production] |