2021-05-26
|
05:09 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1148', diff saved to https://phabricator.wikimedia.org/P16214 and previous config saved to /var/cache/conftool/dbconfig/20210526-050919-marostegui.json |
[production] |
05:04 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1160 (re)pooling @ 75%: Repool db1160', diff saved to https://phabricator.wikimedia.org/P16213 and previous config saved to /var/cache/conftool/dbconfig/20210526-050431-root.json |
[production] |
04:49 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1160 (re)pooling @ 50%: Repool db1160', diff saved to https://phabricator.wikimedia.org/P16212 and previous config saved to /var/cache/conftool/dbconfig/20210526-044928-root.json |
[production] |
04:35 |
<marostegui> |
Deploy schema change on db1106; this will generate lag on the s1 (enwiki) wiki replicas T266486 T268392 T273360 |
[production] |
04:34 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1106', diff saved to https://phabricator.wikimedia.org/P16211 and previous config saved to /var/cache/conftool/dbconfig/20210526-043439-marostegui.json |
[production] |
04:34 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1160 (re)pooling @ 25%: Repool db1160', diff saved to https://phabricator.wikimedia.org/P16210 and previous config saved to /var/cache/conftool/dbconfig/20210526-043424-root.json |
[production] |
03:29 |
<eileen> |
process-control config revision is 7b646533da |
[production] |
02:43 |
<wm-bot> |
<legoktm> Shutdown freenode version |
[tools.wikibugs] |
02:06 |
<wm-bot> |
<bd808> Shutdown freenode bot |
[tools.jouncebot] |
02:05 |
<wm-bot> |
<bd808> Shutdown freenode bot |
[tools.stashbot] |
01:58 |
<wm-bot> |
<bd808> Disabled all freenode connections |
[tools.bridgebot] |
00:47 |
<eileen> |
civicrm revision changed from 584b96452a to eac772e9c9, config revision is 2ca92c3c3c |
[production] |
00:27 |
<mutante> |
phab2001 - restarted apache2 |
[production] |
2021-05-25
|
23:09 |
<razzi@cumin1001> |
END (PASS) - Cookbook sre.hadoop.roll-restart-masters (exit_code=0) |
[production] |
22:39 |
<razzi@cumin1001> |
START - Cookbook sre.hadoop.roll-restart-masters |
[production] |
22:21 |
<razzi@cumin1001> |
END (FAIL) - Cookbook sre.hadoop.roll-restart-masters (exit_code=99) |
[production] |
22:21 |
<razzi@cumin1001> |
START - Cookbook sre.hadoop.roll-restart-masters |
[production] |
22:21 |
<razzi@cumin1001> |
END (FAIL) - Cookbook sre.hadoop.roll-restart-masters (exit_code=99) |
[production] |
22:21 |
<razzi@cumin1001> |
START - Cookbook sre.hadoop.roll-restart-masters |
[production] |
22:04 |
<razzi@cumin1001> |
END (FAIL) - Cookbook sre.hadoop.roll-restart-masters (exit_code=99) |
[production] |
22:04 |
<razzi@cumin1001> |
START - Cookbook sre.hadoop.roll-restart-masters |
[production] |
21:58 |
<razzi@cumin1001> |
END (FAIL) - Cookbook sre.hadoop.roll-restart-masters (exit_code=99) |
[production] |
21:58 |
<razzi@cumin1001> |
START - Cookbook sre.hadoop.roll-restart-masters |
[production] |
21:13 |
<razzi@cumin1001> |
END (FAIL) - Cookbook sre.hadoop.roll-restart-masters (exit_code=99) |
[production] |
21:13 |
<razzi@cumin1001> |
START - Cookbook sre.hadoop.roll-restart-masters |
[production] |
21:13 |
<razzi@cumin1001> |
END (ERROR) - Cookbook sre.hadoop.roll-restart-workers (exit_code=97) |
[production] |
21:13 |
<razzi@cumin1001> |
START - Cookbook sre.hadoop.roll-restart-workers |
[production] |
20:40 |
<razzi@cumin1001> |
END (PASS) - Cookbook sre.hadoop.roll-restart-workers (exit_code=0) |
[production] |
20:28 |
<razzi@cumin1001> |
START - Cookbook sre.hadoop.roll-restart-workers |
[production] |
20:00 |
<twentyafterfour@deploy1002> |
rebuilt and synchronized wikiversions files: group0 wikis to 1.37.0-wmf.7 |
[production] |
19:20 |
<cmjohnson@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
19:17 |
<cmjohnson@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
19:17 |
<cmjohnson@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
19:12 |
<twentyafterfour@deploy1002> |
Finished scap: testwikis wikis to 1.37.0-wmf.7 (duration: 33m 29s) |
[production] |
19:12 |
<cmjohnson@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
18:38 |
<twentyafterfour@deploy1002> |
Started scap: testwikis wikis to 1.37.0-wmf.7 |
[production] |
18:16 |
<razzi> |
sudo systemctl start on all failed units listed by `systemctl list-units --state=failed` on an-launcher1002 |
[analytics] |
18:14 |
<razzi> |
sudo systemctl start eventlogging_to_druid_navigationtiming_hourly.service |
[analytics] |
18:08 |
<krinkle@deploy1002> |
Synchronized wmf-config/CommonSettings.php: I2ebe9674fb109f (duration: 00m 56s) |
[production] |
18:01 |
<razzi> |
manually edited /etc/hadoop/conf/capacity-scheduler.xml to set the queues to RUNNING, then ran sudo -u yarn kerberos-run-command yarn yarn rmadmin -refreshQueues |
[analytics] |
17:52 |
<razzi> |
sudo -u yarn kerberos-run-command yarn yarn rmadmin -refreshQueues on an-master1001 and an-master1002 |
[analytics] |
17:34 |
<Krinkle> |
mwmaint1002: Running purge-parsercache-now.php on server 2/4 (pc1007, depooled spare). Ref P16060, T280605, T282761. |
[production] |
17:30 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1164 (re)pooling @ 100%: Repool db1164', diff saved to https://phabricator.wikimedia.org/P16207 and previous config saved to /var/cache/conftool/dbconfig/20210525-173031-root.json |
[production] |
17:28 |
<razzi> |
sudo systemctl restart refine_eventlogging_legacy |
[analytics] |
17:28 |
<razzi> |
sudo -u yarn kerberos-run-command yarn yarn rmadmin -refreshQueues to enable submitting jobs once again |
[analytics] |
17:22 |
<effie> |
disable puppet on mc2019 (for tests) |
[production] |
17:15 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1164 (re)pooling @ 75%: Repool db1164', diff saved to https://phabricator.wikimedia.org/P16206 and previous config saved to /var/cache/conftool/dbconfig/20210525-171527-root.json |
[production] |
17:14 |
<andrewbogott> |
deleting old ingress controllers toolsbeta-test-k8s-ingress-1 and toolsbeta-test-k8s-ingress-2 |
[toolsbeta] |
17:13 |
<andrewbogott> |
created two new ingress nodes, toolsbeta-test-k8s-ingress-4 and toolsbeta-test-k8s-ingress-5 |
[toolsbeta] |
17:07 |
<razzi> |
re-enabled puppet on an-masters and an-launcher |
[analytics] |