2021-05-13
§
|
15:46 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host poolcounter1004.eqiad.wmnet |
[production] |
15:46 |
<effie> |
restarting poolcounter1004 |
[production] |
15:27 |
<jiji@deploy1002> |
Synchronized wmf-config/ProductionServices.php: Config: [[gerrit:688239|ProductionServices: poolcounter1004 will be rebooted for updates (T273278)]] (duration: 01m 08s) |
[production] |
14:49 |
<Urbanecm> |
Start server-side upload for 1 video file (T282785) |
[production] |
14:07 |
<Urbanecm> |
Start server-side upload for 3 video files (T282558, T282556) |
[production] |
12:40 |
<tgr@deploy1002> |
Synchronized php-1.37.0-wmf.5/extensions/GrowthExperiments: Backport: instrumentation patches ([[gerrit:690070|]] [[gerrit:690071|]] [[gerrit:690072|]] [[gerrit:690073|]]) (T278116 T278117 T278114 T278177 T278487 T278112 T278111 T278118) (duration: 01m 09s) |
[production] |
11:00 |
<hnowlan> |
deleting packages still referenced by jessie components: `sudo -i reprepro clearvanished --delete` |
[production] |
10:46 |
<mvolz@deploy1002> |
helmfile [codfw] Ran 'sync' command on namespace 'citoid' for release 'production' . |
[production] |
10:40 |
<mvolz@deploy1002> |
helmfile [staging] Ran 'sync' command on namespace 'citoid' for release 'staging' . |
[production] |
10:31 |
<mvolz@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'zotero' for release 'production' . |
[production] |
10:25 |
<mvolz@deploy1002> |
helmfile [codfw] Ran 'sync' command on namespace 'zotero' for release 'production' . |
[production] |
10:11 |
<mvolz@deploy1002> |
helmfile [staging] Ran 'sync' command on namespace 'zotero' for release 'staging' . |
[production] |
08:47 |
<akosiaris@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'linkrecommendation' for release 'internal' . |
[production] |
08:47 |
<akosiaris@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'linkrecommendation' for release 'external' . |
[production] |
08:45 |
<akosiaris@deploy1002> |
helmfile [codfw] Ran 'sync' command on namespace 'linkrecommendation' for release 'external' . |
[production] |
08:45 |
<akosiaris@deploy1002> |
helmfile [codfw] Ran 'sync' command on namespace 'linkrecommendation' for release 'internal' . |
[production] |
08:21 |
<akosiaris@deploy1002> |
helmfile [staging] Ran 'sync' command on namespace 'linkrecommendation' for release 'staging' . |
[production] |
07:43 |
<kevinbazira@deploy1002> |
Finished deploy [ores/deploy@8fd23ed]: Regular ORES Deployment T278723 (duration: 32m 50s) |
[production] |
07:10 |
<kevinbazira@deploy1002> |
Started deploy [ores/deploy@8fd23ed]: Regular ORES Deployment T278723 |
[production] |
05:54 |
<_joe_> |
running docker image prune on contint1001, which has 722 unlinked images stored in its docker daemon |
[production] |
01:20 |
<ryankemper@cumin2001> |
END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) |
[production] |
2021-05-12
§
|
23:48 |
<urbanecm@deploy1002> |
Synchronized php-1.37.0-wmf.4/extensions/WikiEditor/includes/WikiEditorHooks.php: 2f6af514c49d47bbec5ce51f9f7263015e039003? PHP VisualEditorFeatureUse logging: properly record session id (T281409) (duration: 01m 07s) |
[production] |
23:40 |
<urbanecm@deploy1002> |
Synchronized php-1.37.0-wmf.5/extensions/WikiEditor/includes/WikiEditorHooks.php: ef4139628a36eb8b747c610c8d769a802faf2fc3: PHP VisualEditorFeatureUse logging: properly record session id (T281409) (duration: 01m 08s) |
[production] |
23:27 |
<ryankemper> |
T280382 `sudo -i cookbook sre.wdqs.data-transfer --source wdqs2001.codfw.wmnet --dest wdqs2007.codfw.wmnet --reason "transferring fresh wikidata journal following reimage" --blazegraph_instance blazegraph` on `ryankemper@cumin2001` tmux session `wdqs_reimage` |
[production] |
23:27 |
<ryankemper> |
T280382 `sudo -i cookbook sre.wdqs.data-transfer --source wdqs2001.codfw.wmnet --dest wdqs2007.codfw.wmnet --reason "transferring fresh wikidata journal following reimage" --blazegraph_instance blazegraph` on `ryankemper@cumin1001` tmux session `reimage` |
[production] |
23:27 |
<ryankemper@cumin2001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
22:01 |
<ryankemper@cumin2001> |
END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) |
[production] |
21:56 |
<ryankemper> |
T280382 `sudo -i cookbook sre.wdqs.data-transfer --source wdqs2001.codfw.wmnet --dest wdqs2007.codfw.wmnet --reason "transferring fresh categories journal following reimage" --blazegraph_instance categories` on `ryankemper@cumin1001` tmux session `reimage` |
[production] |
21:56 |
<ryankemper@cumin2001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
21:54 |
<ryankemper> |
T280382 `wdqs1012.eqiad.wmnet` has been re-imaged and had the appropriate wikidata/categories journal files transferred. `df -h` shows disk space is no longer an issue following the switch to `raid0`: `/dev/mapper/vg0-srv 2.7T 998G 1.6T 39% /srv` |
[production] |
20:57 |
<ottomata> |
starting new drop_event data purge job to drop all event data older than 90 days in the Hive event database - T273789 |
[production] |
20:33 |
<ryankemper@cumin1001> |
END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) |
[production] |
19:27 |
<ryankemper@cumin2001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wdqs2007.codfw.wmnet with reason: REIMAGE |
[production] |
19:25 |
<ryankemper@cumin2001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on wdqs2007.codfw.wmnet with reason: REIMAGE |
[production] |
19:15 |
<ryankemper> |
T280382 `sudo -i cookbook sre.wdqs.data-transfer --source wdqs1011.eqiad.wmnet --dest wdqs1012.eqiad.wmnet --reason "transferring fresh wikidata journal following reimage" --blazegraph_instance blazegraph` on `ryankemper@cumin1001` tmux session `wdqs_reimage` |
[production] |
19:15 |
<ryankemper@cumin1001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
19:15 |
<ryankemper@cumin1001> |
END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) |
[production] |
19:11 |
<dancy@deploy1002> |
Synchronized php: group1 wikis to 1.37.0-wmf.4 (duration: 01m 07s) |
[production] |
19:10 |
<dancy@deploy1002> |
rebuilt and synchronized wikiversions files: group1 wikis to 1.37.0-wmf.4 |
[production] |
19:10 |
<ryankemper> |
T280382 `sudo -i cookbook sre.wdqs.data-transfer --source wdqs1011.eqiad.wmnet --dest wdqs1012.eqiad.wmnet --reason "transferring fresh categories journal following reimage" --blazegraph_instance categories` on `ryankemper@cumin1001` tmux session `wdqs_reimage` |
[production] |
19:09 |
<ryankemper> |
T280382 `sudo -i cookbook sre.wdqs.data-transfer --source wdqs1011.eqiad.wmnet --dest wdqs1012.eqiad.wmnet --reason "transferring fresh categories journal following reimage" --blazegraph_instance categories` on `ryankemper@cumin1001` tmux session `reimage` |
[production] |
19:09 |
<ryankemper@cumin1001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
19:07 |
<ryankemper@cumin2001> |
END (PASS) - Cookbook sre.elasticsearch.rolling-operation (exit_code=0) reboot without plugin upgrade (3 nodes at a time) for ElasticSearch cluster search_codfw: codfw reboot - ryankemper@cumin2001 - T280563 |
[production] |
19:06 |
<dancy@deploy1002> |
Synchronized php: group1 wikis to 1.37.0-wmf.5 (duration: 01m 06s) |
[production] |
19:05 |
<dancy@deploy1002> |
rebuilt and synchronized wikiversions files: group1 wikis to 1.37.0-wmf.5 |
[production] |
19:05 |
<ryankemper> |
T280382 T281437 `sudo -i wmf-auto-reimage-host -p T280382 wdqs2007.codfw.wmnet` on `ryankemper@cumin2001` tmux session `wdqs_reimage` |
[production] |
19:00 |
<ryankemper> |
T280563 `sudo -i cookbook sre.elasticsearch.rolling-operation search_codfw "codfw reboot" --reboot --nodes-per-run 3 --start-datetime 2021-04-29T23:04:29 --task-id T280563` on `ryankemper@cumin2001` tmux session `elastic_restarts` |
[production] |
19:00 |
<ryankemper@cumin2001> |
START - Cookbook sre.elasticsearch.rolling-operation reboot without plugin upgrade (3 nodes at a time) for ElasticSearch cluster search_codfw: codfw reboot - ryankemper@cumin2001 - T280563 |
[production] |
18:59 |
<ryankemper> |
[Elastic] Restarted `*search*` services on `elastic2058` |
[production] |
18:48 |
<mutante> |
rsyncing home dirs of people1003 over to people2002 as well (T280989) |
[production] |