2020-07-06
§
|
07:58 |
<qchris> |
Disable puppet on gerrit1002 (gerrit-test) to deploy Gerrit UI updates there to gather more feedback |
[production] |
07:51 |
<elukey> |
enable binlog on matomo's database on matomo1002 |
[analytics] |
07:51 |
<elukey> |
enable binlog on matomo's database on matomo1002 |
[production] |
07:46 |
<XioNoX> |
repool eqsin - T257154 |
[production] |
07:11 |
<XioNoX> |
reboot cr3-eqsin - T257154 |
[production] |
06:55 |
<XioNoX> |
depool eqsin for cr3-eqsin reboot/investigation - T257154 |
[production] |
06:54 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Fully repool db1089', diff saved to https://phabricator.wikimedia.org/P11740 and previous config saved to /var/cache/conftool/dbconfig/20200706-065437-marostegui.json |
[production] |
06:54 |
<elukey@cumin1001> |
END (FAIL) - Cookbook sre.hadoop.change-distro (exit_code=99) |
[production] |
06:22 |
<elukey@cumin1001> |
START - Cookbook sre.hadoop.change-distro |
[production] |
06:21 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hadoop.stop-cluster (exit_code=0) |
[production] |
06:14 |
<elukey@cumin1001> |
START - Cookbook sre.hadoop.stop-cluster |
[production] |
05:45 |
<kart_> |
Updated cxserver to 2020-07-01-044435-production (T254143) |
[production] |
05:40 |
<kartik@deploy1001> |
helmfile [CODFW] Ran 'sync' command on namespace 'cxserver' for release 'production' . |
[production] |
05:36 |
<kartik@deploy1001> |
helmfile [EQIAD] Ran 'sync' command on namespace 'cxserver' for release 'production' . |
[production] |
05:32 |
<kartik@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'cxserver' for release 'staging' . |
[production] |
05:13 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Slowly repool db1089', diff saved to https://phabricator.wikimedia.org/P11739 and previous config saved to /var/cache/conftool/dbconfig/20200706-051333-marostegui.json |
[production] |
05:03 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Slowly repool db1089', diff saved to https://phabricator.wikimedia.org/P11738 and previous config saved to /var/cache/conftool/dbconfig/20200706-050347-marostegui.json |
[production] |
04:49 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Slowly repool db1089', diff saved to https://phabricator.wikimedia.org/P11737 and previous config saved to /var/cache/conftool/dbconfig/20200706-044908-marostegui.json |
[production] |
2020-07-05
§
|
22:57 |
<wm-bot> |
<lucaswerkmeister> deployed f29663c2b2 (Norwegian Bokmål nouns) |
[tools.lexeme-forms] |
21:50 |
<qchris> |
Restarting gerrit on gerrit1001 to pick up new war and jars. |
[production] |
21:50 |
<qchris@deploy1001> |
Finished deploy [gerrit/gerrit@fbd0684]: Bump gerrit to 3.2.2-102-g3bbb138e13, zuul plugin to master-0-g7accc67, and gitiles to v3.2.2-1-g00c5ca0-with-0e3b533 on gerrit1001 (duration: 00m 07s) |
[production] |
21:50 |
<qchris@deploy1001> |
Started deploy [gerrit/gerrit@fbd0684]: Bump gerrit to 3.2.2-102-g3bbb138e13, zuul plugin to master-0-g7accc67, and gitiles to v3.2.2-1-g00c5ca0-with-0e3b533 on gerrit1001 |
[production] |
21:46 |
<qchris> |
Restarting gerrit on gerrit2001 to pick up new war and jars. |
[production] |
21:45 |
<qchris@deploy1001> |
Finished deploy [gerrit/gerrit@fbd0684]: Bump gerrit to 3.2.2-102-g3bbb138e13, zuul plugin to master-0-g7accc67, and gitiles to v3.2.2-1-g00c5ca0-with-0e3b533 on gerrit2001 (duration: 00m 10s) |
[production] |
21:45 |
<qchris@deploy1001> |
Started deploy [gerrit/gerrit@fbd0684]: Bump gerrit to 3.2.2-102-g3bbb138e13, zuul plugin to master-0-g7accc67, and gitiles to v3.2.2-1-g00c5ca0-with-0e3b533 on gerrit2001 |
[production] |
21:32 |
<qchris> |
Restarting gerrit on gerrit1002 to pick up new wars and jars. |
[production] |
21:32 |
<qchris@deploy1001> |
Finished deploy [gerrit/gerrit@fbd0684]: Bump gerrit to 3.2.2-102-g3bbb138e13 and zuul plugin to master-0-g7accc67 (duration: 00m 08s) |
[production] |
21:32 |
<qchris@deploy1001> |
Started deploy [gerrit/gerrit@fbd0684]: Bump gerrit to 3.2.2-102-g3bbb138e13 and zuul plugin to master-0-g7accc67 |
[production] |
21:20 |
<qchris> |
Enable puppet on gerrit1002 (gerrit-test) again to let it catch up again |
[production] |
16:01 |
<gehel> |
restart elastic-psi on elastic1052 (high GC rate) |
[production] |
15:56 |
<gehel> |
restart blazegraph + updater on wdqs1007 and depool to allow catching up on lag |
[production] |
2020-07-04
§
|
23:51 |
<Amir1> |
deleted deployment-sentry01 (T106915) |
[releng] |
19:23 |
<qchris@deploy1001> |
Finished deploy [gerrit/gerrit@b78914b]: Bump gitiles to v3.2.2-1-g00c5ca0-with-0e3b533 on gerrit1002 (duration: 00m 08s) |
[production] |
19:23 |
<qchris@deploy1001> |
Started deploy [gerrit/gerrit@b78914b]: Bump gitiles to v3.2.2-1-g00c5ca0-with-0e3b533 on gerrit1002 |
[production] |
16:04 |
<wm-bot> |
<lucaswerkmeister> deployed cbf5ad6440 (Norwegian Bokmål) |
[tools.lexeme-forms] |
14:05 |
<qchris> |
Disable puppet on gerrit1002 (gerrit-test) to deploy Gerrit UI updates there to gather feedback |
[production] |
12:42 |
<reedy@deploy1001> |
Synchronized wmf-config/interwiki.php: Update interwiki cache (duration: 02m 24s) |
[production] |
10:52 |
<joal> |
Rerun mediawiki-geoeditors-monthly-wf-2020-06 after heisenbug (patch provided for long-term fix) |
[analytics] |
08:56 |
<hashar> |
Fixed Jenkins collapsible section parsing for Quibble. A logger changed from quibble.cmd to quibble.commands. # T220586 |
[releng] |
02:28 |
<reedy@deploy1001> |
Synchronized php-1.35.0-wmf.39/extensions/Score/includes/Score.php: Short circuit lilypond version check to allow usage of cached files T257066 (duration: 00m 55s) |
[production] |
2020-07-03
§
|
21:49 |
<reedy@deploy1001> |
Synchronized php-1.35.0-wmf.39/extensions/Score/: Sync maintenance script (duration: 00m 58s) |
[production] |
21:44 |
<RhinosF1> |
decom sopel.bot |
[tools.zppixbot] |
19:20 |
<joal> |
restart failed webrequest-load job webrequest-load-wf-text-2020-7-3-17 with higher thresholds - error due to burst of requests in ulsfo |
[analytics] |
19:13 |
<joal> |
restart mediawiki-history-denormalize oozie job using 0.0.115 refinery-job jar |
[analytics] |
19:05 |
<joal> |
kill manual execution of mediawiki-history to save an-coord1001 (too big of a spark-driver) |
[analytics] |
18:53 |
<joal> |
restart webrequest-load-wf-text-2020-7-3-17 after hive server failure |
[analytics] |
18:52 |
<joal> |
restart data_quality_stats-wf-event.navigationtiming-useragent_entropy-hourly-2020-7-3-15 after have server failure |
[analytics] |
18:51 |
<joal> |
restart virtualpageview-hourly-wf-2020-7-3-15 after hive-server failure |
[analytics] |
18:47 |
<cdanis> |
✔️ cdanis@an-coord1001.eqiad.wmnet ~ 🕒☕ sudo systemctl restart hive-server2.service |
[production] |
16:51 |
<krinkle@deploy1001> |
Synchronized wmf-config/CommonSettings.php: Ifa929b2ad4 (duration: 00m 57s) |
[production] |