2023-10-11
§
|
09:19 |
<jayme@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
09:15 |
<jbond@cumin1001> |
START - Cookbook sre.hosts.reimage for host sretest1001.eqiad.wmnet with OS bullseye |
[production] |
08:53 |
<aikochou@deploy2002> |
helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revertrisk' for release 'main' . |
[production] |
08:44 |
<hashar@deploy2002> |
Synchronized php: group1 wikis to 1.41.0-wmf.30 refs T347081 (duration: 06m 00s) |
[production] |
08:38 |
<hashar@deploy2002> |
rebuilt and synchronized wikiversions files: group1 wikis to 1.41.0-wmf.30 refs T347081 |
[production] |
08:00 |
<hashar@deploy2002> |
Synchronized php-1.41.0-wmf.30/skins/Vector: Backports for Vector styling issues T348572 T348530 (duration: 06m 16s) |
[production] |
07:35 |
<elukey@deploy2002> |
helmfile [ml-staging-codfw] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . |
[production] |
07:35 |
<sgimeno@deploy2002> |
Finished scap: Backport for [[gerrit:964949|GrowthExperiments: enable AddLink backend 15th round of wikis (T308141)]] (duration: 07m 45s) |
[production] |
07:29 |
<sgimeno@deploy2002> |
sgimeno: Continuing with sync |
[production] |
07:28 |
<sgimeno@deploy2002> |
sgimeno: Backport for [[gerrit:964949|GrowthExperiments: enable AddLink backend 15th round of wikis (T308141)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
07:27 |
<sgimeno@deploy2002> |
Started scap: Backport for [[gerrit:964949|GrowthExperiments: enable AddLink backend 15th round of wikis (T308141)]] |
[production] |
07:24 |
<sgimeno@deploy2002> |
Finished scap: Backport for [[gerrit:964929|GrowthExperiments: enable AddLink frontend 14th round of wikis (T308139)]] (duration: 09m 05s) |
[production] |
07:19 |
<sgimeno@deploy2002> |
sgimeno: Continuing with sync |
[production] |
07:17 |
<sgimeno@deploy2002> |
sgimeno: Backport for [[gerrit:964929|GrowthExperiments: enable AddLink frontend 14th round of wikis (T308139)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
07:15 |
<sgimeno@deploy2002> |
Started scap: Backport for [[gerrit:964929|GrowthExperiments: enable AddLink frontend 14th round of wikis (T308139)]] |
[production] |
05:46 |
<kartik@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/cxserver: apply |
[production] |
05:45 |
<kartik@deploy2002> |
helmfile [codfw] START helmfile.d/services/cxserver: apply |
[production] |
05:45 |
<kartik@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/cxserver: apply |
[production] |
05:45 |
<kartik@deploy2002> |
helmfile [eqiad] START helmfile.d/services/cxserver: apply |
[production] |
05:44 |
<kartik@deploy2002> |
helmfile [staging] DONE helmfile.d/services/cxserver: apply |
[production] |
05:44 |
<kartik@deploy2002> |
helmfile [staging] START helmfile.d/services/cxserver: apply |
[production] |
05:23 |
<kart_> |
Updated cxserver to 2023-10-11-045323-production (T341478, T344982, T338432, T347939) |
[production] |
05:21 |
<kartik@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/cxserver: apply |
[production] |
05:21 |
<kartik@deploy2002> |
helmfile [codfw] START helmfile.d/services/cxserver: apply |
[production] |
05:19 |
<kartik@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/cxserver: apply |
[production] |
05:18 |
<kartik@deploy2002> |
helmfile [eqiad] START helmfile.d/services/cxserver: apply |
[production] |
05:11 |
<kartik@deploy2002> |
helmfile [staging] DONE helmfile.d/services/cxserver: apply |
[production] |
05:10 |
<kartik@deploy2002> |
helmfile [staging] START helmfile.d/services/cxserver: apply |
[production] |
03:00 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Depooling db2170:3312 (T343198)', diff saved to https://phabricator.wikimedia.org/P52896 and previous config saved to /var/cache/conftool/dbconfig/20231011-030054-arnaudb.json |
[production] |
03:00 |
<arnaudb@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2170.codfw.wmnet with reason: Maintenance |
[production] |
03:00 |
<arnaudb@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2170.codfw.wmnet with reason: Maintenance |
[production] |
03:00 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2148 (T343198)', diff saved to https://phabricator.wikimedia.org/P52895 and previous config saved to /var/cache/conftool/dbconfig/20231011-030032-arnaudb.json |
[production] |
02:45 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2148', diff saved to https://phabricator.wikimedia.org/P52894 and previous config saved to /var/cache/conftool/dbconfig/20231011-024526-arnaudb.json |
[production] |
02:30 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2148', diff saved to https://phabricator.wikimedia.org/P52893 and previous config saved to /var/cache/conftool/dbconfig/20231011-023019-arnaudb.json |
[production] |
02:18 |
<vriley@cumin1001> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host cp1104.mgmt.eqiad.wmnet with reboot policy FORCED |
[production] |
02:15 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2148 (T343198)', diff saved to https://phabricator.wikimedia.org/P52892 and previous config saved to /var/cache/conftool/dbconfig/20231011-021513-arnaudb.json |
[production] |
02:03 |
<vriley@cumin1001> |
START - Cookbook sre.hosts.provision for host cp1104.mgmt.eqiad.wmnet with reboot policy FORCED |
[production] |
02:02 |
<vriley@cumin1001> |
END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host cp1104 |
[production] |
02:01 |
<vriley@cumin1001> |
START - Cookbook sre.network.configure-switch-interfaces for host cp1104 |
[production] |
2023-10-10
§
|
22:45 |
<brett@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host ncredir5001.eqsin.wmnet with OS bookworm |
[production] |
22:41 |
<pt1979@cumin1001> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=93) for host cloudvirt1064.eqiad.wmnet with OS bullseye |
[production] |
22:40 |
<cstone> |
SmashPig upgraded from a78a91d9 to 211284b9 |
[production] |
22:13 |
<pt1979@cumin1001> |
START - Cookbook sre.hosts.reimage for host cloudvirt1064.eqiad.wmnet with OS bullseye |
[production] |
21:45 |
<cmooney@cumin1001> |
END (PASS) - Cookbook sre.network.tls (exit_code=0) for network device lsw1-f6-eqiad |
[production] |
21:43 |
<cmooney@cumin1001> |
START - Cookbook sre.network.tls for network device lsw1-f6-eqiad |
[production] |
21:34 |
<brett@cumin2002> |
START - Cookbook sre.hosts.reimage for host ncredir5001.eqsin.wmnet with OS bookworm |
[production] |
21:33 |
<brett@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host ncredir5001.eqsin.wmnet with OS bookworm |
[production] |
20:48 |
<taavi@deploy2002> |
Finished scap: Backport for [[gerrit:963388|Set READ_NEW for CA wikis on OATHAuth multiple devices (T242031)]] (duration: 08m 24s) |
[production] |
20:43 |
<taavi@deploy2002> |
taavi: Continuing with sync |
[production] |
20:41 |
<taavi@deploy2002> |
taavi: Backport for [[gerrit:963388|Set READ_NEW for CA wikis on OATHAuth multiple devices (T242031)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |