2023-10-04
§
|
07:53 |
<filippo@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on thanos-fe2003.codfw.wmnet with reason: host reimage |
[production] |
07:34 |
<filippo@cumin1001> |
START - Cookbook sre.hosts.reimage for host thanos-fe2003.codfw.wmnet with OS bullseye |
[production] |
07:19 |
<XioNoX> |
Remove static routes for anycast prefixes - T347494 |
[production] |
06:30 |
<moritzm> |
installing glibc security updates |
[production] |
06:19 |
<Surbhi_> |
Deployed refinery using scap, then deployed onto hdfs |
[production] |
05:54 |
<sg912@deploy2002> |
Finished deploy [analytics/refinery@e954b12] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@e954b12a] (duration: 03m 00s) |
[production] |
05:51 |
<sg912@deploy2002> |
Started deploy [analytics/refinery@e954b12] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@e954b12a] |
[production] |
05:50 |
<sg912@deploy2002> |
Finished deploy [analytics/refinery@e954b12] (thin): Regular analytics weekly train THIN [analytics/refinery@e954b12a] (duration: 00m 06s) |
[production] |
05:50 |
<sg912@deploy2002> |
Started deploy [analytics/refinery@e954b12] (thin): Regular analytics weekly train THIN [analytics/refinery@e954b12a] |
[production] |
05:49 |
<sg912@deploy2002> |
Finished deploy [analytics/refinery@e954b12]: Regular analytics weekly train [analytics/refinery@e954b12a] (duration: 06m 02s) |
[production] |
05:43 |
<sg912@deploy2002> |
Started deploy [analytics/refinery@e954b12]: Regular analytics weekly train [analytics/refinery@e954b12a] |
[production] |
03:56 |
<kart_> |
Updated cxserver to 2023-09-28-043003-production (T343450, T347389, T338689) |
[production] |
03:56 |
<kartik@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/cxserver: apply |
[production] |
03:55 |
<kartik@deploy2002> |
helmfile [codfw] START helmfile.d/services/cxserver: apply |
[production] |
03:51 |
<kartik@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/cxserver: apply |
[production] |
03:51 |
<kartik@deploy2002> |
helmfile [eqiad] START helmfile.d/services/cxserver: apply |
[production] |
03:48 |
<kartik@deploy2002> |
helmfile [staging] DONE helmfile.d/services/cxserver: apply |
[production] |
03:48 |
<kartik@deploy2002> |
helmfile [staging] START helmfile.d/services/cxserver: apply |
[production] |
2023-10-03
§
|
23:43 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Depooling db2162 (T343198)', diff saved to https://phabricator.wikimedia.org/P52812 and previous config saved to /var/cache/conftool/dbconfig/20231003-234343-arnaudb.json |
[production] |
23:43 |
<arnaudb@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2162.codfw.wmnet with reason: Maintenance |
[production] |
23:43 |
<arnaudb@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2162.codfw.wmnet with reason: Maintenance |
[production] |
23:43 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2154 (T343198)', diff saved to https://phabricator.wikimedia.org/P52811 and previous config saved to /var/cache/conftool/dbconfig/20231003-234322-arnaudb.json |
[production] |
23:28 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2154', diff saved to https://phabricator.wikimedia.org/P52810 and previous config saved to /var/cache/conftool/dbconfig/20231003-232815-arnaudb.json |
[production] |
23:13 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2154', diff saved to https://phabricator.wikimedia.org/P52809 and previous config saved to /var/cache/conftool/dbconfig/20231003-231309-arnaudb.json |
[production] |
22:58 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2154 (T343198)', diff saved to https://phabricator.wikimedia.org/P52808 and previous config saved to /var/cache/conftool/dbconfig/20231003-225803-arnaudb.json |
[production] |
22:22 |
<jdrewniak@deploy2002> |
Finished scap: Backport for [[gerrit:963043|Web typography prototype survey (T347208)]], [[gerrit:963137|Correct a recently-added message]], [[gerrit:963138|[Prototype] Change i18n message (T347208)]] (duration: 39m 08s) |
[production] |
22:11 |
<jdrewniak@deploy2002> |
jdrewniak: Continuing with sync |
[production] |
22:01 |
<jdrewniak@deploy2002> |
jdrewniak: Backport for [[gerrit:963043|Web typography prototype survey (T347208)]], [[gerrit:963137|Correct a recently-added message]], [[gerrit:963138|[Prototype] Change i18n message (T347208)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
21:43 |
<jdrewniak@deploy2002> |
Started scap: Backport for [[gerrit:963043|Web typography prototype survey (T347208)]], [[gerrit:963137|Correct a recently-added message]], [[gerrit:963138|[Prototype] Change i18n message (T347208)]] |
[production] |
21:32 |
<jdrewniak@deploy2002> |
Finished scap: Backport for [[gerrit:962684|Promote several Wikipedias to Vector 2022 as default skin (T347321)]] (duration: 09m 26s) |
[production] |
21:26 |
<jdrewniak@deploy2002> |
jdlrobson and jdrewniak: Continuing with sync |
[production] |
21:24 |
<jdrewniak@deploy2002> |
jdlrobson and jdrewniak: Backport for [[gerrit:962684|Promote several Wikipedias to Vector 2022 as default skin (T347321)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
21:23 |
<jdrewniak@deploy2002> |
Started scap: Backport for [[gerrit:962684|Promote several Wikipedias to Vector 2022 as default skin (T347321)]] |
[production] |
20:56 |
<jclark@cumin1001> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-master1004.eqiad.wmnet with OS bullseye |
[production] |
20:56 |
<eileen> |
tools upgraded from 130ca87e to 2e19cd39 |
[production] |
20:50 |
<jdrewniak@deploy2002> |
Finished scap: Backport for [[gerrit:944978|Re-enable Extension:ParserMigration on labs (T333179)]] (duration: 38m 52s) |
[production] |
20:49 |
<jclark@cumin1001> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-master1003.eqiad.wmnet with OS bullseye |
[production] |
20:35 |
<jdrewniak@deploy2002> |
jdrewniak and sbailey: Continuing with sync |
[production] |
20:34 |
<jdrewniak@deploy2002> |
jdrewniak and sbailey: Backport for [[gerrit:944978|Re-enable Extension:ParserMigration on labs (T333179)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
20:16 |
<fabfur> |
merged https://gerrit.wikimedia.org/r/c/operations/puppet/+/963081 (T347837). `purged` daemon will be restarted by puppet in eqsin in the next 30m |
[production] |
20:11 |
<jdrewniak@deploy2002> |
Started scap: Backport for [[gerrit:944978|Re-enable Extension:ParserMigration on labs (T333179)]] |
[production] |
19:41 |
<jclark@cumin1001> |
START - Cookbook sre.hosts.reimage for host an-master1004.eqiad.wmnet with OS bullseye |
[production] |
19:38 |
<jclark@cumin1001> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-master1004.eqiad.wmnet with OS bullseye |
[production] |
19:16 |
<jclark@cumin1001> |
START - Cookbook sre.hosts.reimage for host an-master1004.eqiad.wmnet with OS bullseye |
[production] |
19:16 |
<jclark@cumin1001> |
START - Cookbook sre.hosts.reimage for host an-master1003.eqiad.wmnet with OS bullseye |
[production] |
19:16 |
<jclark@cumin1001> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-master1004.eqiad.wmnet with OS bullseye |
[production] |
19:15 |
<jclark@cumin1001> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-master1003.eqiad.wmnet with OS bullseye |
[production] |
19:15 |
<jclark@cumin1001> |
START - Cookbook sre.hosts.reimage for host an-master1004.eqiad.wmnet with OS bullseye |
[production] |
19:15 |
<jclark@cumin1001> |
START - Cookbook sre.hosts.reimage for host an-master1003.eqiad.wmnet with OS bullseye |
[production] |
18:48 |
<jclark@cumin1001> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-master1003.eqiad.wmnet with OS bullseye |
[production] |