|
2025-11-28
§
|
| 10:37 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
| 10:36 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. |
[production] |
| 10:26 |
<ayounsi@cumin1003> |
END (PASS) - Cookbook sre.netbox.update-extras (exit_code=0) rolling restart_daemons on A:netbox |
[production] |
| 10:26 |
<ayounsi@cumin1003> |
START - Cookbook sre.netbox.update-extras rolling restart_daemons on A:netbox |
[production] |
| 10:24 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1218', diff saved to https://phabricator.wikimedia.org/P86078 and previous config saved to /var/cache/conftool/dbconfig/20251128-102412-marostegui.json |
[production] |
| 10:09 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1218 (T410531)', diff saved to https://phabricator.wikimedia.org/P86077 and previous config saved to /var/cache/conftool/dbconfig/20251128-100905-marostegui.json |
[production] |
| 10:05 |
<bwojtowicz@deploy2002> |
helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revise-tone-task-generator' for release 'main' . |
[production] |
| 10:04 |
<klausman@cumin1003> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ml-serve1001.eqiad.wmnet with OS trixie |
[production] |
| 10:02 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db1218 (T410531)', diff saved to https://phabricator.wikimedia.org/P86076 and previous config saved to /var/cache/conftool/dbconfig/20251128-100258-marostegui.json |
[production] |
| 10:02 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1218.eqiad.wmnet with reason: Maintenance |
[production] |
| 10:02 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1206 (T410531)', diff saved to https://phabricator.wikimedia.org/P86075 and previous config saved to /var/cache/conftool/dbconfig/20251128-100234-marostegui.json |
[production] |
| 10:02 |
<bwojtowicz@deploy2002> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revise-tone-task-generator' for release 'main' . |
[production] |
| 09:52 |
<jnuche> |
temporarily disabled beta-scap-sync-world to avoid spamming in channel |
[releng] |
| 09:48 |
<klausman@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ml-serve1001.eqiad.wmnet with reason: host reimage |
[production] |
| 09:47 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1206', diff saved to https://phabricator.wikimedia.org/P86074 and previous config saved to /var/cache/conftool/dbconfig/20251128-094727-marostegui.json |
[production] |
| 09:44 |
<klausman@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on ml-serve1001.eqiad.wmnet with reason: host reimage |
[production] |
| 09:32 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1206', diff saved to https://phabricator.wikimedia.org/P86073 and previous config saved to /var/cache/conftool/dbconfig/20251128-093219-marostegui.json |
[production] |
| 09:27 |
<klausman@cumin1003> |
START - Cookbook sre.hosts.reimage for host ml-serve1001.eqiad.wmnet with OS trixie |
[production] |
| 09:23 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db2158 (T411163 T411164)', diff saved to https://phabricator.wikimedia.org/P86072 and previous config saved to /var/cache/conftool/dbconfig/20251128-092341-marostegui.json |
[production] |
| 09:23 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2158.codfw.wmnet with reason: Maintenance |
[production] |
| 09:23 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2151 (T411163 T411164)', diff saved to https://phabricator.wikimedia.org/P86071 and previous config saved to /var/cache/conftool/dbconfig/20251128-092318-marostegui.json |
[production] |
| 09:17 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1206 (T410531)', diff saved to https://phabricator.wikimedia.org/P86070 and previous config saved to /var/cache/conftool/dbconfig/20251128-091712-marostegui.json |
[production] |
| 09:11 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db1206 (T410531)', diff saved to https://phabricator.wikimedia.org/P86069 and previous config saved to /var/cache/conftool/dbconfig/20251128-091116-marostegui.json |
[production] |
| 09:11 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1206.eqiad.wmnet with reason: Maintenance |
[production] |
| 09:10 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1196 (T410531)', diff saved to https://phabricator.wikimedia.org/P86068 and previous config saved to /var/cache/conftool/dbconfig/20251128-091052-marostegui.json |
[production] |
| 09:08 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2151', diff saved to https://phabricator.wikimedia.org/P86067 and previous config saved to /var/cache/conftool/dbconfig/20251128-090810-marostegui.json |
[production] |
| 08:59 |
<jynus@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2239.codfw.wmnet with reason: Upgrade and reboot |
[production] |
| 08:55 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1196', diff saved to https://phabricator.wikimedia.org/P86066 and previous config saved to /var/cache/conftool/dbconfig/20251128-085544-marostegui.json |
[production] |
| 08:53 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2151', diff saved to https://phabricator.wikimedia.org/P86065 and previous config saved to /var/cache/conftool/dbconfig/20251128-085303-marostegui.json |
[production] |
| 08:50 |
<brouberol@dns1004> |
END - running authdns-update |
[production] |
| 08:49 |
<brouberol@dns1004> |
START - running authdns-update |
[production] |
| 08:40 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1196', diff saved to https://phabricator.wikimedia.org/P86064 and previous config saved to /var/cache/conftool/dbconfig/20251128-084037-marostegui.json |
[production] |
| 08:37 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2151 (T411163 T411164)', diff saved to https://phabricator.wikimedia.org/P86063 and previous config saved to /var/cache/conftool/dbconfig/20251128-083755-marostegui.json |
[production] |
| 08:25 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1196 (T410531)', diff saved to https://phabricator.wikimedia.org/P86062 and previous config saved to /var/cache/conftool/dbconfig/20251128-082529-marostegui.json |
[production] |
| 08:18 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db1196 (T410531)', diff saved to https://phabricator.wikimedia.org/P86061 and previous config saved to /var/cache/conftool/dbconfig/20251128-081852-marostegui.json |
[production] |
| 08:18 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1013,1017].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance |
[production] |
| 08:18 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1196.eqiad.wmnet with reason: Maintenance |
[production] |
| 08:18 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1195 (T410531)', diff saved to https://phabricator.wikimedia.org/P86060 and previous config saved to /var/cache/conftool/dbconfig/20251128-081820-marostegui.json |
[production] |
| 08:08 |
<moritzm> |
installing Linux 6.1.158 kernel on Bookworm hosts |
[production] |
| 08:05 |
<arnaudb@cumin1003> |
END (PASS) - Cookbook sre.gitlab.upgrade (exit_code=0) on GitLab host gitlab1004.wikimedia.org with reason: Upgrade gitlab |
[production] |
| 08:03 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1195', diff saved to https://phabricator.wikimedia.org/P86059 and previous config saved to /var/cache/conftool/dbconfig/20251128-080312-marostegui.json |
[production] |
| 07:48 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1195', diff saved to https://phabricator.wikimedia.org/P86058 and previous config saved to /var/cache/conftool/dbconfig/20251128-074804-marostegui.json |
[production] |
| 07:32 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1195 (T410531)', diff saved to https://phabricator.wikimedia.org/P86057 and previous config saved to /var/cache/conftool/dbconfig/20251128-073257-marostegui.json |
[production] |
| 07:26 |
<marostegui@cumin1003> |
END (PASS) - Cookbook sre.mysql.pool (exit_code=0) db1184 gradually with 4 steps - After testing |
[production] |
| 07:26 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1195.eqiad.wmnet with reason: Maintenance |
[production] |
| 07:25 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1186 (T410531)', diff saved to https://phabricator.wikimedia.org/P86055 and previous config saved to /var/cache/conftool/dbconfig/20251128-072551-marostegui.json |
[production] |
| 07:10 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1186', diff saved to https://phabricator.wikimedia.org/P86053 and previous config saved to /var/cache/conftool/dbconfig/20251128-071043-marostegui.json |
[production] |
| 06:57 |
<arnaudb@cumin1003> |
START - Cookbook sre.gitlab.upgrade on GitLab host gitlab1004.wikimedia.org with reason: Upgrade gitlab |
[production] |
| 06:55 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1186', diff saved to https://phabricator.wikimedia.org/P86051 and previous config saved to /var/cache/conftool/dbconfig/20251128-065536-marostegui.json |
[production] |
| 06:40 |
<marostegui@cumin1003> |
START - Cookbook sre.mysql.pool db1184 gradually with 4 steps - After testing |
[production] |