2021-03-25
09:26 | <moritzm> | drain ganeti2024 | [production]
09:25 | <jmm@cumin2001> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2023.codfw.wmnet | [production]
09:17 | <jmm@cumin2001> | START - Cookbook sre.hosts.reboot-single for host ganeti2023.codfw.wmnet | [production]
08:45 | <moritzm> | drain ganeti2023 | [production]
08:43 | <jmm@cumin2001> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2022.codfw.wmnet | [production]
08:35 | <jmm@cumin2001> | START - Cookbook sre.hosts.reboot-single for host ganeti2022.codfw.wmnet | [production]
08:12 | <elukey> | upgrade hive packages in thirdparty/bigtop15 to 2.3.6-2 for buster-wikimedia | [production]
08:11 | <elukey> | upgrade hive packages in thirdparty/bigtop15 to 2.3.6-2 | [production]
07:41 | <legoktm> | upgraded lists1002 to hyperkitty 1.2.2-1+wmf1 (T276687) | [production]
07:36 | <legoktm> | uploaded hyperkitty 1.2.2-1+wmf1 to buster-wikimedia (T276687) | [production]
07:35 | <jynus> | restart db2135 T278408 T273281 | [production]
07:05 | <effie> | enable puppet on all mediawiki servers | [production]
06:57 | <XioNoX> | Option 82: use-vlan-id | [production]
06:53 | <effie> | enable puppet on jobrunners | [production]
06:47 | <effie> | enable puppet on parsoid | [production]
06:40 | <effie> | disable puppet on all mediawiki servers to merge 673061 (service proxy to listen on ::1) | [production]
06:23 | <ryankemper@cumin1001> | END (FAIL) - Cookbook sre.elasticsearch.rolling-upgrade (exit_code=99) | [production]
05:19 | <ryankemper@cumin1001> | START - Cookbook sre.elasticsearch.rolling-upgrade | [production]
04:44 | <legoktm> | restarted exim4 on lists1002 so it listens on 0.0.0.0 instead of 127.0.0.1 | [production]
04:16 | <ryankemper@cumin1001> | END (FAIL) - Cookbook sre.elasticsearch.rolling-upgrade (exit_code=99) | [production]
03:10 | <ryankemper@cumin1001> | START - Cookbook sre.elasticsearch.rolling-upgrade | [production]
01:33 | <ryankemper@cumin1001> | END (FAIL) - Cookbook sre.elasticsearch.rolling-upgrade (exit_code=99) | [production]
01:10 | <legoktm> | mailman3: added lists-next.wikimedia.org domain | [production]
01:08 | <legoktm> | mailman3: renamed default site from "example.com" to "lists-next.wikimedia.org" | [production]
00:50 | <dzahn@cumin1001> | conftool action : set/pooled=yes; selector: name=mw2378.codfw.wmnet | [production]
00:35 | <dzahn@cumin1001> | conftool action : set/pooled=yes; selector: name=mw2377.codfw.wmnet | [production]
00:35 | <dzahn@cumin1001> | conftool action : set/pooled=yes; selector: name=mw2777.codfw.wmnet | [production]
00:34 | <mutante> | mw2377, mw2378 - first scap pull | [production]
00:33 | <dzahn@cumin1001> | conftool action : set/weight=10; selector: name=mw2378.codfw.wmnet | [production]
00:33 | <dzahn@cumin1001> | conftool action : set/weight=10; selector: name=mw2377.codfw.wmnet | [production]
00:32 | <dzahn@cumin1001> | conftool action : set/pooled=no; selector: name=mw2378.codfw.wmnet | [production]
00:32 | <dzahn@cumin1001> | conftool action : set/pooled=no; selector: name=mw2377.codfw.wmnet | [production]
00:29 | <legoktm> | syncing facts for puppet-compiler | [production]
00:23 | <mutante> | mw2377, mw2378 - reboot | [production]
00:14 | <twentyafterfour> | phabricator update complete | [production]
00:10 | <twentyafterfour> | deploying phabricator | [production]
00:05 | <ryankemper> | T274204 `sudo -i cookbook sre.elasticsearch.rolling-upgrade search_eqiad "eqiad cluster reboot" --task-id T274204 --nodes-per-run 3 --start-datetime 2021-03-24T23:55:35` on `ryankemper@cumin1001` tmux session `elasticsearch_rolling_upgrade_reboots` | [production]
2021-03-24
23:57 | <dzahn@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2378.codfw.wmnet with reason: new_install | [production]
23:57 | <dzahn@cumin1001> | START - Cookbook sre.hosts.downtime for 2:00:00 on mw2378.codfw.wmnet with reason: new_install | [production]
23:56 | <dzahn@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2377.codfw.wmnet with reason: new_install | [production]
23:56 | <dzahn@cumin1001> | START - Cookbook sre.hosts.downtime for 2:00:00 on mw2377.codfw.wmnet with reason: new_install | [production]
23:56 | <ryankemper@cumin1001> | START - Cookbook sre.elasticsearch.rolling-upgrade | [production]
23:48 | <mutante> | generating new mcrouter certs for mw2377, mw2378 | [production]
22:07 | <ryankemper@cumin1001> | END (PASS) - Cookbook sre.elasticsearch.rolling-upgrade (exit_code=0) | [production]
22:07 | <legoktm> | disabled puppet on lists1002 while mailman3-web is broken | [production]
21:49 | <ryankemper@cumin2001> | END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) | [production]
21:19 | <mutante> | webperf2001 - restarted apache | [production]
21:11 | <hashar@deploy1002> | Synchronized php: group1 wikis to 1.36.0-wmf.36 (duration: 01m 07s) | [production]
21:10 | <hashar@deploy1002> | rebuilt and synchronized wikiversions files: group1 wikis to 1.36.0-wmf.36 | [production]
21:08 | <kharlan@deploy1002> | helmfile [eqiad] Ran 'sync' command on namespace 'linkrecommendation' for release 'external' . | [production]