|
2025-10-09
|
| 05:43 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Add es1050 and es1053 depooled T406488', diff saved to https://phabricator.wikimedia.org/P83687 and previous config saved to /var/cache/conftool/dbconfig/20251009-054347-marostegui.json |
[production] |
| 05:37 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depool db2155 for migration to mariadb 10.11', diff saved to https://phabricator.wikimedia.org/P83686 and previous config saved to /var/cache/conftool/dbconfig/20251009-053730-marostegui.json |
[production] |
| 05:37 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db2155.codfw.wmnet with reason: Maintenance |
[production] |
| 05:36 |
<marostegui@cumin1003> |
START - Cookbook sre.mysql.pool es1027 gradually with 4 steps - Pool es1027.eqiad.wmnet in after cloning |
[production] |
| 05:16 |
<marostegui@cumin1003> |
START - Cookbook sre.mysql.pool es1030 gradually with 4 steps - Pool es1030.eqiad.wmnet in after cloning |
[production] |
| 04:13 |
<eileen> |
civicrm upgraded from 6f24d513 to 132211d5 |
[production] |
| 02:11 |
<dzahn@cumin2002> |
END (PASS) - Cookbook sre.gitlab.upgrade (exit_code=0) on GitLab host gitlab2002.wikimedia.org with reason: security release 20251008 |
[production] |
| 02:02 |
<dzahn@cumin2002> |
START - Cookbook sre.gitlab.upgrade on GitLab host gitlab2002.wikimedia.org with reason: security release 20251008 |
[production] |
| 01:54 |
<mutante> |
[wdqs1020:~] $ sudo systemctl restart wdqs-blazegraph |
[production] |
| 01:32 |
<eileen> |
civicrm upgraded from 4c13f904 to 6f24d513 |
[production] |
| 01:18 |
<eileen> |
civicrm upgraded from 2c6fedc8 to 4c13f904 |
[production] |
| 01:15 |
<mwpresync@deploy2002> |
Finished scap build-images: Publishing wmf/next image (duration: 14m 20s) |
[production] |
| 01:00 |
<mwpresync@deploy2002> |
Started scap build-images: Publishing wmf/next image |
[production] |
|
2025-10-08
|
| 23:58 |
<ryankemper@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wdqs1019.eqiad.wmnet with reason: host reimage |
[production] |
| 23:54 |
<ryankemper@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wdqs1018.eqiad.wmnet with reason: host reimage |
[production] |
| 23:50 |
<ryankemper@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on wdqs1019.eqiad.wmnet with reason: host reimage |
[production] |
| 23:50 |
<ryankemper@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on wdqs1018.eqiad.wmnet with reason: host reimage |
[production] |
| 22:09 |
<ryankemper@cumin2002> |
END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) (T405978, transfer to freshly reimaged host) xfer scholarly_articles from wdqs2016.codfw.wmnet -> wdqs2017.codfw.wmnet w/ force delete existing files, repooling source-only afterwards |
[production] |
| 21:47 |
<wmbot~dcaro@acme> |
END (PASS) - Cookbook wmcs.toolforge.k8s.reboot_stuck_workers (exit_code=0) for tools-k8s-worker-nfs-36, tools-k8s-worker-nfs-76 |
[tools] |
| 21:25 |
<ryankemper@cumin2002> |
START - Cookbook sre.hosts.reimage for host wdqs1019.eqiad.wmnet with OS bullseye |
[production] |
| 21:19 |
<wmbot~dcaro@acme> |
START - Cookbook wmcs.toolforge.k8s.reboot_stuck_workers for tools-k8s-worker-nfs-36, tools-k8s-worker-nfs-76 |
[tools] |
| 21:19 |
<ryankemper@cumin2002> |
START - Cookbook sre.wdqs.data-transfer (T405978, transfer to freshly reimaged host) xfer scholarly_articles from wdqs2016.codfw.wmnet -> wdqs2017.codfw.wmnet w/ force delete existing files, repooling source-only afterwards |
[production] |
| 21:19 |
<ryankemper@cumin2002> |
END (ERROR) - Cookbook sre.wdqs.data-transfer (exit_code=97) (T405978, transfer to freshly reimaged host) xfer scholarly_articles from wdqs2016.codfw.wmnet -> wdqs2017.codfw.wmnet w/ force delete existing files, repooling source-only afterwards |
[production] |
| 21:18 |
<ryankemper@cumin2002> |
START - Cookbook sre.wdqs.data-transfer (T405978, transfer to freshly reimaged host) xfer scholarly_articles from wdqs2016.codfw.wmnet -> wdqs2017.codfw.wmnet w/ force delete existing files, repooling source-only afterwards |
[production] |
| 21:13 |
<ryankemper@deploy2002> |
Finished deploy [wdqs/wdqs@fea7794]: deploy to fresh internal-scholarly host T405978 (duration: 00m 12s) |
[production] |
| 21:13 |
<ryankemper@deploy2002> |
Started deploy [wdqs/wdqs@fea7794]: deploy to fresh internal-scholarly host T405978 |
[production] |
| 21:10 |
<ryankemper@cumin2002> |
START - Cookbook sre.hosts.reimage for host wdqs1018.eqiad.wmnet with OS bullseye |
[production] |
| 20:36 |
<tgr_> |
UTC late deploys done |
[production] |
| 20:35 |
<tgr@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1194622|Deploy JWT session cookies to group2 (T399631)]] (duration: 13m 53s) |
[production] |
| 20:31 |
<tgr@deploy2002> |
tgr: Continuing with sync |
[production] |
| 20:26 |
<tgr@deploy2002> |
tgr: Backport for [[gerrit:1194622|Deploy JWT session cookies to group2 (T399631)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 20:21 |
<tgr@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1194622|Deploy JWT session cookies to group2 (T399631)]] |
[production] |
| 20:19 |
<tgr@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1194650|eswiki, commonswiki: lift IP cap for workshop (T406655)]], [[gerrit:1194334|Launch VisualEditor EditCheck paste check a/b test to 22 wikis (T405422)]] (duration: 13m 03s) |
[production] |
| 20:15 |
<tgr@deploy2002> |
tgr, kemayo, anzx: Continuing with sync |
[production] |
| 20:11 |
<tgr@deploy2002> |
tgr, kemayo, anzx: Backport for [[gerrit:1194650|eswiki, commonswiki: lift IP cap for workshop (T406655)]], [[gerrit:1194334|Launch VisualEditor EditCheck paste check a/b test to 22 wikis (T405422)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 20:06 |
<tgr@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1194650|eswiki, commonswiki: lift IP cap for workshop (T406655)]], [[gerrit:1194334|Launch VisualEditor EditCheck paste check a/b test to 22 wikis (T405422)]] |
[production] |
| 20:02 |
<hashar> |
Disabled Gerrit Apache mod_qos by putting it to be logging only # T406774 |
[production] |
| 19:46 |
<andrew@cloudcumin1001> |
END (FAIL) - Cookbook wmcs.openstack.tofu (exit_code=99) running tofu plan+apply for main branch |
[admin] |
| 19:45 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.tofu running tofu plan+apply for main branch |
[admin] |
| 19:30 |
<krinkle@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1194562|Disable wmgUseMdotRouting on remaining Wikipedias except enwiki (T403510)]], [[gerrit:1194563|Disable wmgUseMdotRouting on enwiki (T403510)]] (duration: 09m 26s) |
[production] |
| 19:25 |
<krinkle@deploy2002> |
krinkle: Continuing with sync |
[production] |
| 19:25 |
<krinkle@deploy2002> |
krinkle: Backport for [[gerrit:1194562|Disable wmgUseMdotRouting on remaining Wikipedias except enwiki (T403510)]], [[gerrit:1194563|Disable wmgUseMdotRouting on enwiki (T403510)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 19:20 |
<krinkle@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1194562|Disable wmgUseMdotRouting on remaining Wikipedias except enwiki (T403510)]], [[gerrit:1194563|Disable wmgUseMdotRouting on enwiki (T403510)]] |
[production] |
| 19:10 |
<sukhe@cumin1003> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host hcaptcha1001.wikimedia.org with OS bookworm |
[production] |
| 18:58 |
<andrew@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcontrol2005-dev.codfw.wmnet with OS bookworm |
[production] |
| 18:56 |
<ssastry@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1194712|Revert "Add a DOM version of the TOC markers pass"]] (duration: 16m 00s) |
[production] |
| 18:54 |
<sukhe@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on hcaptcha1001.wikimedia.org with reason: host reimage |
[production] |
| 18:50 |
<ssastry@deploy2002> |
ssastry: Continuing with sync |
[production] |
| 18:48 |
<sukhe@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on hcaptcha1001.wikimedia.org with reason: host reimage |
[production] |
| 18:46 |
<ssastry@deploy2002> |
ssastry: Backport for [[gerrit:1194712|Revert "Add a DOM version of the TOC markers pass"]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |