2021-10-14
ยง
|
17:32 |
<addshore@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'changeprop-jobqueue' for release 'production' . |
[production] |
17:31 |
<addshore@deploy1002> |
helmfile [codfw] Ran 'sync' command on namespace 'changeprop-jobqueue' for release 'production' . |
[production] |
17:29 |
<addshore@deploy1002> |
helmfile [staging] Ran 'sync' command on namespace 'changeprop-jobqueue' for release 'staging' . |
[production] |
16:44 |
<ryankemper@cumin1001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
16:44 |
<ryankemper> |
T288231 Manually killed dangling `pigz` / `nc` processes on `wdqs2008` (and `wdqs2005` implicitly). Should be in the right state to re-start the `data-transfer` cookbook from again |
[production] |
16:41 |
<ryankemper@cumin1001> |
END (ERROR) - Cookbook sre.wdqs.data-transfer (exit_code=97) |
[production] |
16:37 |
<elukey> |
drop kubeflow-kfserving* docker images from deneb |
[production] |
16:36 |
<ryankemper@cumin1001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
16:34 |
<ryankemper@cumin1001> |
END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) |
[production] |
16:33 |
<moritzm> |
installing node-ansi-regex security updates |
[production] |
16:28 |
<mbsantos@deploy1002> |
Finished deploy [kartotherian/deploy@4bff2d1]: Force mirrored traffic to 0% for everywhere (duration: 02m 24s) |
[production] |
16:25 |
<mbsantos@deploy1002> |
Started deploy [kartotherian/deploy@4bff2d1]: Force mirrored traffic to 0% for everywhere |
[production] |
16:24 |
<dancy@deploy1002> |
Synchronized php-1.38.0-wmf.4/extensions/Collection/includes/CollectionHooks.php: Backport: [[gerrit:730580|Check that the timestamp key/value is set to avoid undefined offset (T293300)]] (duration: 01m 04s) |
[production] |
16:16 |
<mbsantos@deploy1002> |
Finished deploy [kartotherian/deploy@071f7c3]: Increase mirrored traffic to 100% for eqiad (duration: 02m 41s) |
[production] |
16:14 |
<mbsantos@deploy1002> |
Started deploy [kartotherian/deploy@071f7c3]: Increase mirrored traffic to 100% for eqiad |
[production] |
16:08 |
<ryankemper@cumin1001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
16:07 |
<ryankemper@cumin1001> |
END (ERROR) - Cookbook sre.wdqs.data-transfer (exit_code=97) |
[production] |
16:07 |
<ryankemper> |
T288231 About to ctrl+c out of ongoing data transfer because puppet run following merge of https://gerrit.wikimedia.org/r/c/operations/puppet/+/730794 restarted blazegraph; we'll manually disable updater and kick off the transfer again |
[production] |
16:04 |
<ryankemper> |
T288231 `ryankemper@wdqs2005:~$ sudo run-puppet-agent --force` |
[production] |
15:56 |
<ryankemper@cumin1001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
15:54 |
<ryankemper> |
T288231 `ryankemper@wdqs2008:~$ sudo depool` |
[production] |
15:52 |
<ryankemper> |
T288231 `ryankemper@wdqs2005:~$ sudo depool` |
[production] |
15:22 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.addnode (exit_code=0) for new host ganeti2026.codfw.wmnet to ganeti-test01.svc.codfw.wmnet |
[production] |
15:20 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.addnode for new host ganeti2026.codfw.wmnet to ganeti-test01.svc.codfw.wmnet |
[production] |
15:13 |
<bd808@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'toolhub' for release 'main' . |
[production] |
15:06 |
<dancy@deploy1002> |
Synchronized php-1.38.0-wmf.4/extensions/VisualEditor/includes/VisualEditorHooks.php: Backport: [[gerrit:730729|Fix value of 'namespacesWithSubpages' in wgVisualEditorConfig (T293310)]] (duration: 01m 04s) |
[production] |
15:02 |
<dancy@deploy1002> |
Synchronized php-1.38.0-wmf.4/extensions/Collection/includes/CollectionHooks.php: Backport: [[gerrit:730580|Check that the timestamp key/value is set to avoid undefined offset (T293300)]] (duration: 01m 03s) |
[production] |
15:00 |
<jmm@cumin2002> |
END (FAIL) - Cookbook sre.ganeti.addnode (exit_code=99) for new host ganeti2026.codfw.wmnet to ganeti-test01.svc.codfw.wmnet |
[production] |
14:59 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.addnode for new host ganeti2026.codfw.wmnet to ganeti-test01.svc.codfw.wmnet |
[production] |
14:53 |
<kormat> |
upgrading orchestrator.wm.o to 3.2.6-1 T275784 |
[production] |
14:49 |
<jbond@cumin1001> |
conftool action : set/pooled=true; selector: name=eqiad,dnsdisc=apt |
[production] |
14:43 |
<jbond> |
migrate apt.w.o to a dns active/passiev discovery address (cc moritzm) |
[production] |
14:23 |
<moritzm> |
installing krb5 security updates on KDCs |
[production] |
14:19 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-editquality' for release 'main' . |
[production] |
14:10 |
<urbanecm@deploy1002> |
Synchronized dblists/growthexperiments.dblist: b35adfc59eec9c19b509bb9439cdfe33978a4f8b: Deploy Growth wikis to 4 wikis in dark mode (T291826; 2/2) (duration: 01m 03s) |
[production] |
14:07 |
<urbanecm> |
Run extensions/GrowthExperiments/initWikiConfig.php for ganwiki, iuwiki, tgwiki (T291826) |
[production] |
14:07 |
<urbanecm> |
Create growthexperiments DB tables for ganwiki, iuwiki, tgwiki (T291826) |
[production] |
14:06 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-editquality' for release 'main' . |
[production] |
14:05 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. |
[production] |
14:05 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. |
[production] |
14:04 |
<urbanecm@deploy1002> |
Synchronized wmf-config/InitialiseSettings.php: b35adfc59eec9c19b509bb9439cdfe33978a4f8b: Deploy Growth wikis to 4 wikis in dark mode (T291826; 1/2) (duration: 01m 04s) |
[production] |
14:03 |
<urbanecm@deploy1002> |
Synchronized dblists/visualeditor-nondefault.dblist: 82d0a4bf45126ecba2cfcd1a0c2081a00f58dca3: Enable VE by default on 4 more wikis (T290614) (duration: 01m 05s) |
[production] |
13:56 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-editquality' for release 'main' . |
[production] |
13:55 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. |
[production] |
13:54 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. |
[production] |
13:54 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. |
[production] |
13:54 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. |
[production] |
13:52 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. |
[production] |
13:52 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. |
[production] |
13:43 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2026.codfw.wmnet |
[production] |