2022-11-16
|
23:47 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1186', diff saved to https://phabricator.wikimedia.org/P40023 and previous config saved to /var/cache/conftool/dbconfig/20221116-234708-ladsgroup.json |
[production] |
23:43 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db2104 (T323214)', diff saved to https://phabricator.wikimedia.org/P40022 and previous config saved to /var/cache/conftool/dbconfig/20221116-234323-ladsgroup.json |
[production] |
23:43 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2104.codfw.wmnet with reason: Maintenance |
[production] |
23:43 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2104.codfw.wmnet with reason: Maintenance |
[production] |
23:37 |
<ejegg> |
civicrm upgraded from 85c98fc7 to 8683d375 |
[production] |
23:32 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1186', diff saved to https://phabricator.wikimedia.org/P40021 and previous config saved to /var/cache/conftool/dbconfig/20221116-233200-ladsgroup.json |
[production] |
23:26 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1102.eqiad.wmnet with reason: Maintenance |
[production] |
23:26 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1102.eqiad.wmnet with reason: Maintenance |
[production] |
23:25 |
<brennen@deploy1002> |
Synchronized php: group1 wikis to 1.40.0-wmf.8 refs T320515 (duration: 03m 43s) |
[production] |
23:21 |
<brennen@deploy1002> |
rebuilt and synchronized wikiversions files: group1 wikis to 1.40.0-wmf.8 refs T320515 |
[production] |
23:16 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1186 (T318605)', diff saved to https://phabricator.wikimedia.org/P40020 and previous config saved to /var/cache/conftool/dbconfig/20221116-231654-ladsgroup.json |
[production] |
23:15 |
<ladsgroup@deploy1002> |
Finished scap: Backport for [[gerrit:856030|Add w/api/index.html (T273179)]] (duration: 05m 26s) |
[production] |
23:12 |
<bking@cumin1001> |
END (ERROR) - Cookbook sre.wdqs.data-transfer (exit_code=97) |
[production] |
23:10 |
<ladsgroup@deploy1002> |
ladsgroup and ladsgroup: Backport for [[gerrit:856030|Add w/api/index.html (T273179)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet |
[production] |
23:09 |
<ladsgroup@deploy1002> |
Started scap: Backport for [[gerrit:856030|Add w/api/index.html (T273179)]] |
[production] |
23:07 |
<ladsgroup@deploy1002> |
Synchronized portals: (no justification provided) (duration: 03m 48s) |
[production] |
23:05 |
<bking@cumin1001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
23:04 |
<bking@cumin1001> |
END (ERROR) - Cookbook sre.wdqs.data-transfer (exit_code=97) |
[production] |
23:03 |
<ladsgroup@deploy1002> |
Synchronized portals/wikipedia.org/assets: (no justification provided) (duration: 03m 49s) |
[production] |
22:58 |
<bking@cumin1001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
22:58 |
<bking@cumin1001> |
END (ERROR) - Cookbook sre.wdqs.data-transfer (exit_code=97) |
[production] |
22:57 |
<bking@cumin1001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
22:53 |
<bking@cumin1001> |
END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) |
[production] |
22:52 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on dbstore1007.eqiad.wmnet with reason: Maintenance |
[production] |
22:52 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 12:00:00 on dbstore1007.eqiad.wmnet with reason: Maintenance |
[production] |
22:52 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1198 (T323214)', diff saved to https://phabricator.wikimedia.org/P40019 and previous config saved to /var/cache/conftool/dbconfig/20221116-225229-ladsgroup.json |
[production] |
22:46 |
<brennen@deploy1002> |
Synchronized php: group1 wikis to 1.40.0-wmf.10 refs T320515 (duration: 03m 54s) |
[production] |
22:45 |
<bking@cumin1001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
22:42 |
<brennen@deploy1002> |
rebuilt and synchronized wikiversions files: group1 wikis to 1.40.0-wmf.10 refs T320515 |
[production] |
22:37 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2097.codfw.wmnet with reason: Maintenance |
[production] |
22:37 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1198', diff saved to https://phabricator.wikimedia.org/P40018 and previous config saved to /var/cache/conftool/dbconfig/20221116-223722-ladsgroup.json |
[production] |
22:37 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2097.codfw.wmnet with reason: Maintenance |
[production] |
22:36 |
<brennen> |
train 1.40.0-wmf.10 (T320515) - blocker seems resolved, making one attempt to roll to group1 again. |
[production] |
22:33 |
<brennen@deploy1002> |
Finished scap: Backport for [[gerrit:857439|specialpage: Silence known violation unsafe RequestContext changes (T323184)]] (duration: 05m 50s) |
[production] |
22:28 |
<brennen@deploy1002> |
brennen and brennen: Backport for [[gerrit:857439|specialpage: Silence known violation unsafe RequestContext changes (T323184)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet |
[production] |
22:27 |
<brennen@deploy1002> |
Started scap: Backport for [[gerrit:857439|specialpage: Silence known violation unsafe RequestContext changes (T323184)]] |
[production] |
22:22 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1198', diff saved to https://phabricator.wikimedia.org/P40017 and previous config saved to /var/cache/conftool/dbconfig/20221116-222216-ladsgroup.json |
[production] |
22:20 |
<jhathaway@deploy1002> |
helmfile [aux-k8s-eqiad] DONE helmfile.d/admin 'sync'. |
[production] |
22:20 |
<jhathaway@deploy1002> |
helmfile [aux-k8s-eqiad] START helmfile.d/admin 'sync'. |
[production] |
22:20 |
<jhathaway@deploy1002> |
helmfile [aux-k8s-eqiad] DONE helmfile.d/admin 'sync'. |
[production] |
22:20 |
<jhathaway@deploy1002> |
helmfile [aux-k8s-eqiad] START helmfile.d/admin 'sync'. |
[production] |
22:07 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1198 (T323214)', diff saved to https://phabricator.wikimedia.org/P40016 and previous config saved to /var/cache/conftool/dbconfig/20221116-220710-ladsgroup.json |
[production] |
21:41 |
<urbanecm> |
Run `time mwscript extensions/GrowthExperiments/maintenance/updateIsActiveFlagForMentees.php` for all wikis in growthexperiments.dblist (T318457) |
[production] |
21:39 |
<mforns@deploy1002> |
Finished deploy [airflow-dags/analytics@e08e32e]: (no justification provided) (duration: 00m 20s) |
[production] |
21:39 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db1186 (T318605)', diff saved to https://phabricator.wikimedia.org/P40015 and previous config saved to /var/cache/conftool/dbconfig/20221116-213928-ladsgroup.json |
[production] |
21:39 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1186.eqiad.wmnet with reason: Maintenance |
[production] |
21:39 |
<mforns@deploy1002> |
Started deploy [airflow-dags/analytics@e08e32e]: (no justification provided) |
[production] |
21:39 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1186.eqiad.wmnet with reason: Maintenance |
[production] |
21:39 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1184 (T318605)', diff saved to https://phabricator.wikimedia.org/P40014 and previous config saved to /var/cache/conftool/dbconfig/20221116-213907-ladsgroup.json |
[production] |