2020-03-17
10:20 |
<godog> |
bounce squid on install1003 T247759 |
[production] |
10:07 |
<_joe_> |
sudo cumin -b2 -s 50 'A:mw-jobrunner' 'restart-php7.2-fpm' T247622 |
[production] |
10:03 |
<Amir1> |
warming up cache for Q60M to Q70M for new term store on db1111, db1126, db1104, db1092 (T219123) |
[production] |
10:02 |
<ema> |
create kafka topic atskafka_test_webrequest_text T247497 |
[production] |
09:57 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hadoop.roll-restart-workers (exit_code=0) |
[production] |
09:55 |
<ladsgroup@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: [[gerrit:579925|Set up read new term store up to Q60M (T219123)]], take II (duration: 01m 05s) |
[production] |
09:54 |
<ladsgroup@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: [[gerrit:579925|Set up read new term store up to Q60M (T219123)]] (duration: 01m 09s) |
[production] |
09:27 |
<elukey@cumin1001> |
START - Cookbook sre.hadoop.roll-restart-workers |
[production] |
09:21 |
<ema> |
cp: rolling varnish-frontend-restart to decrease memory usage and apply transient storage limits T185968 |
[production] |
09:09 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hadoop.roll-restart-workers (exit_code=0) |
[production] |
08:39 |
<elukey@cumin1001> |
START - Cookbook sre.hadoop.roll-restart-workers |
[production] |
03:32 |
<wm-bot> |
<bd808> Updated to 18f7e14 (Bump Elastica to 7.0.0-beta.3) |
[tools.bash] |
02:49 |
<wm-bot> |
<bd808> Update to d7de56f (Fix Elastica call to add a quip) |
[tools.bash] |
00:57 |
<krinkle@deploy1001> |
Synchronized php-1.35.0-wmf.23/extensions/Wikibase/lib/includes/Formatters/: Ic77b2c6b33a, T247458 (duration: 01m 12s) |
[production] |
00:08 |
<bstorm_> |
shut off tools-flannel-etcd-01/02/03 T246689 |
[tools] |
2020-03-16
23:43 |
<wm-bot> |
<root> Stopped jdk11 webservice stuck in CrashLoopBackOff because no java command was given when the webservice was started. |
[tools.wiper] |
23:35 |
<wm-bot> |
<root> Deleted CronJob esfichataxon. Set to run every minute with an invalid jar file path. |
[tools.esfichataxon] |
23:14 |
<tzatziki> |
reset email for "MNadrofsky (WMF)" on SUL and officewiki |
[production] |
22:01 |
<bstorm_> |
shut off tools-k8s-etcd-01/02/03 T246689 |
[tools] |
22:00 |
<bstorm_> |
shut off tools-k8s-master-01 T246689 |
[tools] |
21:59 |
<bstorm_> |
shut down tools-worker-1001 and tools-worker-1002 T246689 |
[tools] |
21:38 |
<bstorm_> |
removed lots of hiera related to the legacy k8s cluster T246689 |
[toolsbeta] |
20:58 |
<mutante> |
mw1223 power down |
[production] |
20:54 |
<mutante> |
powercycling mw1223 |
[production] |
20:52 |
<mutante> |
5 old API appservers in eqiad removed |
[production] |
20:45 |
<dzahn@cumin1001> |
END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) |
[production] |
20:43 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.decommission |
[production] |
20:42 |
<dzahn@cumin1001> |
conftool action : set/pooled=inactive; selector: name=mw122[1-6].eqiad.wmnet |
[production] |
20:37 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) |
[production] |
20:35 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.decommission |
[production] |
20:04 |
<mutante> |
depool (yes->no) mw1221 - mw1226 (T247780) |
[production] |
20:04 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw122[1-6].eqiad.wmnet |
[production] |
19:45 |
<bstorm_> |
deleting toolsbeta-worker-1001, toolsbeta-k8s-master, toolsbeta-flannel-etcd-01 and toolsbeta-k8s-etcd-01 T246689 |
[toolsbeta] |
19:43 |
<joal> |
Kill-restart wikidata-articleplaceholder_metrics-coord to fix yarn queue |
[analytics] |
19:28 |
<bsitzmann@deploy1001> |
Finished deploy [mobileapps/deploy@f5600d6]: Update mobileapps to 8a6e403 (duration: 06m 48s) |
[production] |
19:26 |
<otto@deploy1001> |
helmfile [EQIAD] Ran 'apply' command on namespace 'eventstreams' for release 'canary'. |
[production] |
19:24 |
<otto@deploy1001> |
helmfile [EQIAD] Ran 'apply' command on namespace 'eventstreams' for release 'canary'. |
[production] |
19:23 |
<jynus> |
stop replication at pc1010 at pos pc1007-bin.080617:259138670 |
[production] |
19:21 |
<bsitzmann@deploy1001> |
Started deploy [mobileapps/deploy@f5600d6]: Update mobileapps to 8a6e403 |
[production] |
19:11 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-eqiad.php: Pool pc1010 instead of pc1008 as pc1008 is overloaded (duration: 01m 06s) |
[production] |
19:07 |
<bstorm_> |
shutting down toolsbeta-flannel-etcd-01 T246689 |
[toolsbeta] |
19:06 |
<bstorm_> |
shutting down toolsbeta-worker-1001, toolsbeta-k8s-master and toolsbeta-k8s-etcd T246689 |
[toolsbeta] |
18:38 |
<krinkle@deploy1001> |
Synchronized wmf-config/: I2c3217fb3da8bb65 (duration: 01m 07s) |
[production] |
18:36 |
<krinkle@deploy1001> |
Synchronized wmf-config/CommonSettings.php: no-op, courtesy of opcache (duration: 01m 06s) |
[production] |
18:34 |
<krinkle@deploy1001> |
Synchronized docroot/noc/: I2c3217fb3 (duration: 01m 07s) |
[production] |
18:30 |
<mforns> |
Deployed refinery using scap, then deployed onto hdfs |
[analytics] |
18:18 |
<mforns@deploy1001> |
Finished deploy [analytics/refinery@1681b92]: deploying refinery to add forgotten artifacts for v0.0.118 (duration: 13m 01s) |
[production] |
18:05 |
<mforns@deploy1001> |
Started deploy [analytics/refinery@1681b92]: deploying refinery to add forgotten artifacts for v0.0.118 |
[production] |
17:08 |
<Amir1> |
warming up cache for Q50M to Q60M for new term store on db1111, db1126, db1104, db1092 (T219123) |
[production] |
17:06 |
<ladsgroup@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: [[gerrit:579925|Set up read new term store up to Q50M (T219123)]], take II (duration: 01m 08s) |
[production] |