2024-06-06
ยง
|
14:58 |
<hnowlan@deploy1002> |
helmfile [staging] START helmfile.d/services/thumbor: apply |
[production] |
14:56 |
<topranks> |
disable ssw1-f1-eqiad leaf-facing ports in advance of upgrade T366361 |
[production] |
14:56 |
<jforrester@deploy1002> |
Started scap: Backport for [[gerrit:1038828|Add wikilambda-edit-monolingual-text-placeholder message to extension.json (T359782)]] |
[production] |
14:54 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1193 (T352010)', diff saved to https://phabricator.wikimedia.org/P64187 and previous config saved to /var/cache/conftool/dbconfig/20240606-145440-ladsgroup.json |
[production] |
14:52 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1209 (T360332)', diff saved to https://phabricator.wikimedia.org/P64186 and previous config saved to /var/cache/conftool/dbconfig/20240606-145205-arnaudb.json |
[production] |
14:51 |
<elukey> |
kill sessionstore pod running on mw1390.eqiad.wmnet (no dedicated='kask' taint) |
[production] |
14:49 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Depooling db1209 (T360332)', diff saved to https://phabricator.wikimedia.org/P64185 and previous config saved to /var/cache/conftool/dbconfig/20240606-144943-arnaudb.json |
[production] |
14:49 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1209.eqiad.wmnet with reason: Maintenance |
[production] |
14:49 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 12:00:00 on db1209.eqiad.wmnet with reason: Maintenance |
[production] |
14:43 |
<sukhe> |
sudo cumin -b1 -s60 'A:cp and A:eqsin' 'run-puppet-agent --enable "merging CR 1038881"' |
[production] |
14:25 |
<TheresNoTime> |
close UTC afternoon backport window |
[production] |
14:18 |
<hashar@deploy1002> |
Finished deploy [integration/docroot@eee90e6]: Build dependencies updates (duration: 00m 10s) |
[production] |
14:18 |
<hashar@deploy1002> |
Started deploy [integration/docroot@eee90e6]: Build dependencies updates |
[production] |
14:17 |
<hashar@deploy1002> |
Finished deploy [integration/docroot@eee90e6]: Build dependencies updates (duration: 00m 09s) |
[production] |
14:17 |
<hashar@deploy1002> |
Started deploy [integration/docroot@eee90e6]: Build dependencies updates |
[production] |
14:17 |
<samtar@deploy1002> |
Finished scap: Backport for [[gerrit:1037006|commonswiki: Enable numeric wgCategoryCollation (T362494)]], [[gerrit:1037505|Add project namespace alias for Azerbaijani Wikisource (T365966)]] (duration: 12m 58s) |
[production] |
14:15 |
<cmooney@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ssw1-f1-eqiad,ssw1-f1-eqiad IPv6,ssw1-f1-eqiad.mgmt with reason: upgrading spine switches eqiad rows e and f |
[production] |
14:15 |
<cmooney@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on ssw1-f1-eqiad,ssw1-f1-eqiad IPv6,ssw1-f1-eqiad.mgmt with reason: upgrading spine switches eqiad rows e and f |
[production] |
14:14 |
<topranks> |
disabling BGP on cr2-eqiad towards ssw1-f1-eqiad prior to upgrade of ssw later T366361 |
[production] |
14:14 |
<ChrisDobbins901_> |
sudo cumin 'A:cp and A:eqsin' 'disable-puppet "merging CR 1038881"' |
[production] |
14:08 |
<samtar@deploy1002> |
samtar and anzx and nmw03: Continuing with sync |
[production] |
14:07 |
<fabfur@cumin1002> |
conftool action : set/pooled=yes; selector: name=cp4050.ulsfo.wmnet |
[production] |
14:06 |
<samtar@deploy1002> |
samtar and anzx and nmw03: Backport for [[gerrit:1037006|commonswiki: Enable numeric wgCategoryCollation (T362494)]], [[gerrit:1037505|Add project namespace alias for Azerbaijani Wikisource (T365966)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
14:06 |
<fabfur@cumin1002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cp4050.ulsfo.wmnet |
[production] |
14:05 |
<kamila@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-ctrl1001.eqiad.wmnet with reason: host reimage |
[production] |
14:04 |
<samtar@deploy1002> |
Started scap: Backport for [[gerrit:1037006|commonswiki: Enable numeric wgCategoryCollation (T362494)]], [[gerrit:1037505|Add project namespace alias for Azerbaijani Wikisource (T365966)]] |
[production] |
14:02 |
<kamila@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-ctrl1001.eqiad.wmnet with reason: host reimage |
[production] |
14:00 |
<kartik@deploy1002> |
Finished scap: Backport for [[gerrit:1039571|CX: Fix translation container max width for large screens (T366374)]] (duration: 13m 11s) |
[production] |
13:57 |
<fabfur@cumin1002> |
START - Cookbook sre.hosts.reboot-single for host cp4050.ulsfo.wmnet |
[production] |
13:56 |
<fabfur@cumin1002> |
conftool action : set/pooled=no; selector: name=cp4050.ulsfo.wmnet |
[production] |
13:52 |
<kartik@deploy1002> |
kartik: Continuing with sync |
[production] |
13:50 |
<kartik@deploy1002> |
kartik: Backport for [[gerrit:1039571|CX: Fix translation container max width for large screens (T366374)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
13:47 |
<kamila@cumin1002> |
START - Cookbook sre.hosts.reimage for host wikikube-ctrl1001.eqiad.wmnet with OS bullseye |
[production] |
13:47 |
<kartik@deploy1002> |
Started scap: Backport for [[gerrit:1039571|CX: Fix translation container max width for large screens (T366374)]] |
[production] |
13:46 |
<samtar@deploy1002> |
Finished scap: Backport for [[gerrit:1039612|[mswiktionary] Change the default Sitename value to Wikikamus (T366549)]] (duration: 16m 05s) |
[production] |
13:45 |
<kamila@cumin1002> |
END (PASS) - Cookbook sre.hosts.dhcp (exit_code=0) for host wikikube-ctrl1001.eqiad.wmnet |
[production] |
13:44 |
<kamila@cumin1002> |
START - Cookbook sre.hosts.dhcp for host wikikube-ctrl1001.eqiad.wmnet |
[production] |
13:44 |
<kamila@cumin1002> |
END (PASS) - Cookbook sre.hosts.dhcp (exit_code=0) for host wikikube-ctrl1001.eqiad.wmnet |
[production] |
13:37 |
<samtar@deploy1002> |
samtar and gergesshamon: Continuing with sync |
[production] |
13:32 |
<samtar@deploy1002> |
samtar and gergesshamon: Backport for [[gerrit:1039612|[mswiktionary] Change the default Sitename value to Wikikamus (T366549)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
13:30 |
<samtar@deploy1002> |
Started scap: Backport for [[gerrit:1039612|[mswiktionary] Change the default Sitename value to Wikikamus (T366549)]] |
[production] |
13:28 |
<samtar@deploy1002> |
Finished scap: Backport for [[gerrit:1038862|Activate campaignEvents extension on Igbo wiki. (T363199)]] (duration: 14m 07s) |
[production] |
13:19 |
<samtar@deploy1002> |
mhorsey and samtar: Continuing with sync |
[production] |
13:16 |
<samtar@deploy1002> |
mhorsey and samtar: Backport for [[gerrit:1038862|Activate campaignEvents extension on Igbo wiki. (T363199)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
13:15 |
<samtar@deploy1002> |
Started scap: Backport for [[gerrit:1038862|Activate campaignEvents extension on Igbo wiki. (T363199)]] |
[production] |
13:11 |
<taavi> |
taavi@deploy1002 ~ $ sudo kill 32174 # kill forgotten scap sync-world process |
[production] |
13:08 |
<klausman@cumin1002> |
END (PASS) - Cookbook sre.k8s.reboot-nodes (exit_code=0) rolling reboot on A:ml-serve-worker-eqiad |
[production] |
12:57 |
<vgutierrez> |
repool text@cofw with IPIP encapsulation enabled - T366466 |
[production] |
12:56 |
<jiji@cumin1002> |
END (PASS) - Cookbook sre.k8s.reboot-nodes (exit_code=0) rolling reboot on A:wikikube-worker-eqiad |
[production] |
12:56 |
<isaranto@deploy1002> |
helmfile [ml-staging-codfw] 'sync' command on namespace 'ores-legacy' for release 'main' . |
[production] |