2021-11-25
§
|
13:40 |
<ayounsi@cumin1001> |
END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host ping1002.eqiad.wmnet |
[production] |
13:32 |
<ayounsi@cumin1001> |
START - Cookbook sre.ganeti.makevm for new host ping1002.eqiad.wmnet |
[production] |
13:30 |
<ayounsi@cumin1001> |
END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host ping2002.codfw.wmnet |
[production] |
13:28 |
<Amir1> |
killing lingering process from mwmaint to depooled db1147 |
[production] |
13:20 |
<ayounsi@cumin1001> |
START - Cookbook sre.ganeti.makevm for new host ping2002.codfw.wmnet |
[production] |
13:14 |
<ayounsi@cumin1001> |
END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host ping3002.esams.wmnet |
[production] |
13:05 |
<ayounsi@cumin1001> |
START - Cookbook sre.ganeti.makevm for new host ping3002.esams.wmnet |
[production] |
12:27 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching restbase202[1-3].codfw.wmnet: Restarting for certificate updates - hnowlan@cumin1001 |
[production] |
12:14 |
<arturo> |
update repo bullseye-wikimedia/thirdparty/ceph-octopus (T296175) |
[production] |
12:14 |
<jynus> |
disable temp. gtid on db1163 |
[production] |
12:11 |
<jynus@cumin1001> |
dbctl commit (dc=all): 'Temp. depool db1163 fully', diff saved to https://phabricator.wikimedia.org/P17847 and previous config saved to /var/cache/conftool/dbconfig/20211125-121138-jynus.json |
[production] |
12:04 |
<jynus@cumin1001> |
dbctl commit (dc=all): 'Reduce db1163 load even more', diff saved to https://phabricator.wikimedia.org/P17846 and previous config saved to /var/cache/conftool/dbconfig/20211125-120435-jynus.json |
[production] |
11:56 |
<hnowlan@cumin1001> |
START - Cookbook sre.cassandra.roll-restart for nodes matching restbase202[1-3].codfw.wmnet: Restarting for certificate updates - hnowlan@cumin1001 |
[production] |
11:56 |
<jynus@cumin1001> |
dbctl commit (dc=all): 'Reduce db1163 load', diff saved to https://phabricator.wikimedia.org/P17845 and previous config saved to /var/cache/conftool/dbconfig/20211125-115602-jynus.json |
[production] |
11:04 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db1147 (T296143)', diff saved to https://phabricator.wikimedia.org/P17844 and previous config saved to /var/cache/conftool/dbconfig/20211125-110443-ladsgroup.json |
[production] |
11:04 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1147.eqiad.wmnet with reason: Maintenance T296143 |
[production] |
11:04 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 4:00:00 on db1147.eqiad.wmnet with reason: Maintenance T296143 |
[production] |
11:04 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'After maintenance db1146:3314 (T296143)', diff saved to https://phabricator.wikimedia.org/P17843 and previous config saved to /var/cache/conftool/dbconfig/20211125-110435-ladsgroup.json |
[production] |
10:49 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'After maintenance db1146:3314 (T296143)', diff saved to https://phabricator.wikimedia.org/P17842 and previous config saved to /var/cache/conftool/dbconfig/20211125-104930-ladsgroup.json |
[production] |
10:34 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'After maintenance db1146:3314 (T296143)', diff saved to https://phabricator.wikimedia.org/P17841 and previous config saved to /var/cache/conftool/dbconfig/20211125-103425-ladsgroup.json |
[production] |
10:25 |
<vgutierrez> |
rolling restart of varnish and HAProxy on cp2042.codfw.wmnet,cp1090.eqiad.wmnet,cp[5012].eqsin.wmnet,cp3065.esams.wmnet,cp[4026,4032].ulsfo.wmnet to disable PROXY protocol - T290005 |
[production] |
10:19 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'After maintenance db1146:3314 (T296143)', diff saved to https://phabricator.wikimedia.org/P17840 and previous config saved to /var/cache/conftool/dbconfig/20211125-101921-ladsgroup.json |
[production] |
09:55 |
<jelto@cumin1001> |
conftool action : set/pooled=true; selector: name=eqiad,dnsdisc=(apertium|api-gateway|apple-search|blubberoid|citoid|cxserver|echostore|eventgate-analytics|eventgate-analytics-external|eventgate-logging-external|eventstreams|eventstreams-internal|linkrecommendation|mathoid|mobileapps|proton|push-notifications|recommendation-api|sessionstore|shellbox|shellbox-constraints|shellbox-media|shellbox-syntaxh |
[production] |
09:45 |
<jelto@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'zotero' for release 'production' . |
[production] |
09:43 |
<jelto@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'wikifeeds' for release 'production' . |
[production] |
09:39 |
<jelto@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'toolhub' for release 'main' . |
[production] |
09:37 |
<jelto@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'termbox' for release 'production' . |
[production] |
09:34 |
<jelto@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'tegola-vector-tiles' for release 'main' . |
[production] |
09:31 |
<jelto@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'similar-users' for release 'main' . |
[production] |
09:29 |
<jelto@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'shellbox-timeline' for release 'main' . |
[production] |
09:27 |
<jelto@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'shellbox-syntaxhighlight' for release 'main' . |
[production] |
09:24 |
<jelto@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'shellbox-media' for release 'main' . |
[production] |
09:23 |
<jelto@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'shellbox-media' for release 'main' . |
[production] |
09:21 |
<jelto@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'shellbox-constraints' for release 'main' . |
[production] |
09:19 |
<jelto@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'shellbox' for release 'main' . |
[production] |
09:16 |
<jelto@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'recommendation-api' for release 'production' . |
[production] |
09:10 |
<jelto@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'rdf-streaming-updater' for release 'main' . |
[production] |
09:05 |
<jelto@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'push-notifications' for release 'main' . |
[production] |
09:02 |
<jelto@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'proton' for release 'production' . |
[production] |
08:59 |
<jelto@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
08:51 |
<jelto@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'mobileapps' for release 'production' . |
[production] |
08:50 |
<jelto@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'mobileapps' for release 'production' . |
[production] |
08:48 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db1146:3314 (T296143)', diff saved to https://phabricator.wikimedia.org/P17837 and previous config saved to /var/cache/conftool/dbconfig/20211125-084834-ladsgroup.json |
[production] |
08:48 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1146.eqiad.wmnet with reason: Maintenance T296143 |
[production] |
08:48 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 4:00:00 on db1146.eqiad.wmnet with reason: Maintenance T296143 |
[production] |
08:47 |
<jelto@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'miscweb' for release 'main' . |
[production] |
08:46 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1146.eqiad.wmnet with reason: Maintenance T296143 |
[production] |
08:46 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 4:00:00 on db1146.eqiad.wmnet with reason: Maintenance T296143 |
[production] |
08:44 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1146.eqiad.wmnet with reason: Maintenance T296143 |
[production] |
08:44 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 4:00:00 on db1146.eqiad.wmnet with reason: Maintenance T296143 |
[production] |