|
2026-02-18
§
|
| 11:14 |
<arnaudb@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on gerrit1003.wikimedia.org with reason: host reimage |
[production] |
| 11:12 |
<kevinbazira@deploy2002> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revertrisk' for release 'main' . |
[production] |
| 11:09 |
<kevinbazira@deploy2002> |
helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revertrisk' for release 'main' . |
[production] |
| 11:07 |
<fabfur@cumin1003> |
END (PASS) - Cookbook sre.deploy.hiddenparma (exit_code=0) Hiddenparma deployment to the alerting hosts with reason: "New scope bots - fabfur@cumin1003" |
[production] |
| 11:07 |
<fabfur@cumin1003> |
END (PASS) - Cookbook sre.deploy.python-code (exit_code=0) hiddenparma to alert[1002,2002].wikimedia.org with reason: New scope bots - fabfur@cumin1003 |
[production] |
| 11:06 |
<fabfur@cumin1003> |
START - Cookbook sre.deploy.python-code hiddenparma to alert[1002,2002].wikimedia.org with reason: New scope bots - fabfur@cumin1003 |
[production] |
| 11:06 |
<fabfur@cumin1003> |
START - Cookbook sre.deploy.hiddenparma Hiddenparma deployment to the alerting hosts with reason: "New scope bots - fabfur@cumin1003" |
[production] |
| 10:56 |
<arnaudb@cumin1003> |
START - Cookbook sre.hosts.reimage for host gerrit1003.wikimedia.org with OS bookworm |
[production] |
| 10:51 |
<joal@deploy2002> |
Finished deploy [analytics/refinery@28fa1ea] (thin): Regular analytics weekly train THIN [analytics/refinery@28fa1eac] (duration: 01m 56s) |
[production] |
| 10:49 |
<joal@deploy2002> |
Started deploy [analytics/refinery@28fa1ea] (thin): Regular analytics weekly train THIN [analytics/refinery@28fa1eac] |
[production] |
| 10:49 |
<joal@deploy2002> |
Finished deploy [analytics/refinery@28fa1ea]: Regular analytics weekly train [analytics/refinery@28fa1eac] (duration: 04m 06s) |
[production] |
| 10:44 |
<joal@deploy2002> |
Started deploy [analytics/refinery@28fa1ea]: Regular analytics weekly train [analytics/refinery@28fa1eac] |
[production] |
| 10:44 |
<joal@deploy2002> |
Finished deploy [analytics/refinery@28fa1ea] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@28fa1eac] (duration: 01m 57s) |
[production] |
| 10:42 |
<joal@deploy2002> |
Started deploy [analytics/refinery@28fa1ea] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@28fa1eac] |
[production] |
| 10:41 |
<arnaudb@cumin1003> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host gerrit1003.wikimedia.org with OS bookworm |
[production] |
| 10:26 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cumin2003.codfw.wmnet |
[production] |
| 10:20 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host cumin2003.codfw.wmnet |
[production] |
| 09:53 |
<arnaudb@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on gerrit1003.wikimedia.org with reason: host reimage |
[production] |
| 09:46 |
<arnaudb@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on gerrit1003.wikimedia.org with reason: host reimage |
[production] |
| 09:28 |
<arnaudb@cumin1003> |
START - Cookbook sre.hosts.reimage for host gerrit1003.wikimedia.org with OS bookworm |
[production] |
| 08:58 |
<marostegui@cumin1003> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host dbproxy1029.eqiad.wmnet with OS trixie |
[production] |
| 08:38 |
<marostegui@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dbproxy1029.eqiad.wmnet with reason: host reimage |
[production] |
| 08:32 |
<marostegui@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on dbproxy1029.eqiad.wmnet with reason: host reimage |
[production] |
| 08:19 |
<marostegui@cumin1003> |
START - Cookbook sre.hosts.reimage for host dbproxy1029.eqiad.wmnet with OS trixie |
[production] |
| 05:32 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db2179 (T415786)', diff saved to https://phabricator.wikimedia.org/P88861 and previous config saved to /var/cache/conftool/dbconfig/20260218-053229-marostegui.json |
[production] |
| 05:32 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2179.codfw.wmnet with reason: Maintenance |
[production] |
| 05:32 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2172 (T415786)', diff saved to https://phabricator.wikimedia.org/P88860 and previous config saved to /var/cache/conftool/dbconfig/20260218-053204-marostegui.json |
[production] |
| 05:28 |
<kart_> |
Updated cxserver to 2026-01-20-115813-production (T415038, T415046, T414558) |
[production] |
| 05:25 |
<kartik@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/cxserver: apply |
[production] |
| 05:25 |
<kartik@deploy2002> |
helmfile [eqiad] START helmfile.d/services/cxserver: apply |
[production] |
| 05:24 |
<kartik@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/cxserver: apply |
[production] |
| 05:24 |
<kartik@deploy2002> |
helmfile [codfw] START helmfile.d/services/cxserver: apply |
[production] |
| 05:18 |
<kartik@deploy2002> |
helmfile [staging] DONE helmfile.d/services/cxserver: apply |
[production] |
| 05:17 |
<kartik@deploy2002> |
helmfile [staging] START helmfile.d/services/cxserver: apply |
[production] |
| 05:16 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2172', diff saved to https://phabricator.wikimedia.org/P88859 and previous config saved to /var/cache/conftool/dbconfig/20260218-051656-marostegui.json |
[production] |
| 05:01 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2172', diff saved to https://phabricator.wikimedia.org/P88858 and previous config saved to /var/cache/conftool/dbconfig/20260218-050148-marostegui.json |
[production] |
| 04:46 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2172 (T415786)', diff saved to https://phabricator.wikimedia.org/P88857 and previous config saved to /var/cache/conftool/dbconfig/20260218-044639-marostegui.json |
[production] |
| 03:23 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db1199 (T415786)', diff saved to https://phabricator.wikimedia.org/P88856 and previous config saved to /var/cache/conftool/dbconfig/20260218-032324-marostegui.json |
[production] |
| 03:23 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1199.eqiad.wmnet with reason: Maintenance |
[production] |
| 03:22 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1190 (T415786)', diff saved to https://phabricator.wikimedia.org/P88855 and previous config saved to /var/cache/conftool/dbconfig/20260218-032258-marostegui.json |
[production] |
| 03:07 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1190', diff saved to https://phabricator.wikimedia.org/P88854 and previous config saved to /var/cache/conftool/dbconfig/20260218-030750-marostegui.json |
[production] |
| 02:52 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1190', diff saved to https://phabricator.wikimedia.org/P88853 and previous config saved to /var/cache/conftool/dbconfig/20260218-025242-marostegui.json |
[production] |
| 02:37 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1190 (T415786)', diff saved to https://phabricator.wikimedia.org/P88852 and previous config saved to /var/cache/conftool/dbconfig/20260218-023733-marostegui.json |
[production] |
| 01:02 |
<zabe@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1239762|Add small comment pointing to ForeignDBViaLBRepo above file migration (T416548)]] (duration: 11m 16s) |
[production] |
| 00:58 |
<Krinkle> |
Edit Module:Date on various wikis in attempt to mitigate T416616, T416540. Details at https://phabricator.wikimedia.org/T416616#11625838. |
[production] |
| 00:55 |
<zabe@deploy2002> |
zabe: Continuing with sync |
[production] |
| 00:55 |
<zabe@deploy2002> |
zabe: Backport for [[gerrit:1239762|Add small comment pointing to ForeignDBViaLBRepo above file migration (T416548)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 00:50 |
<zabe@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1239762|Add small comment pointing to ForeignDBViaLBRepo above file migration (T416548)]] |
[production] |