|
2026-01-20
ยง
|
| 09:54 |
<marostegui@cumin1003> |
START - Cookbook sre.hosts.reimage for host dbproxy1024.eqiad.wmnet with OS trixie |
[production] |
| 09:50 |
<kevinbazira@deploy2002> |
helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revertrisk' for release 'main' . |
[production] |
| 09:46 |
<kevinbazira@deploy2002> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revertrisk' for release 'main' . |
[production] |
| 09:40 |
<dpogorzelski@cumin1003> |
END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for registry[2004-2005].codfw.wmnet,registry[1004-1005].eqiad.wmnet |
[production] |
| 09:40 |
<dpogorzelski@cumin1003> |
START - Cookbook sre.hosts.remove-downtime for registry[2004-2005].codfw.wmnet,registry[1004-1005].eqiad.wmnet |
[production] |
| 09:36 |
<ayounsi@cumin1003> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host sretest2003.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL |
[production] |
| 09:35 |
<ayounsi@cumin1003> |
START - Cookbook sre.hosts.provision for host sretest2003.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL |
[production] |
| 09:21 |
<aklapper@deploy2002> |
rebuilt and synchronized wikiversions files: group0 to 1.46.0-wmf.12 refs T413803 |
[production] |
| 09:07 |
<dpogorzelski@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on registry[2004-2005].codfw.wmnet,registry[1004-1005].eqiad.wmnet with reason: testing ml changes |
[production] |
| 08:56 |
<slyngshede@dns1004> |
END - running authdns-update |
[production] |
| 08:55 |
<kharlan@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1228993|Hooks: Log the security log context for edit errors (T410877)]] (duration: 07m 24s) |
[production] |
| 08:55 |
<slyngshede@dns1004> |
START - running authdns-update |
[production] |
| 08:55 |
<slyngshede@dns1004> |
START - running authdns-update |
[production] |
| 08:54 |
<dpogorzelski@deploy2002> |
helmfile [ml-serve-codfw] DONE helmfile.d/admin 'sync'. |
[production] |
| 08:53 |
<dpogorzelski@deploy2002> |
helmfile [ml-serve-codfw] START helmfile.d/admin 'sync'. |
[production] |
| 08:52 |
<dpogorzelski@deploy2002> |
helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. |
[production] |
| 08:51 |
<kharlan@deploy2002> |
kharlan: Continuing with sync |
[production] |
| 08:51 |
<dpogorzelski@deploy2002> |
helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. |
[production] |
| 08:50 |
<kharlan@deploy2002> |
kharlan: Backport for [[gerrit:1228993|Hooks: Log the security log context for edit errors (T410877)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 08:48 |
<kharlan@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1228993|Hooks: Log the security log context for edit errors (T410877)]] |
[production] |
| 08:45 |
<kharlan@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1223636|IPReputation: Enable OpenSearch IPoid provider on testwiki (T410615)]] (duration: 11m 43s) |
[production] |
| 08:44 |
<mvernon@cumin2002> |
END (PASS) - Cookbook sre.swift.remove-ghost-objects (exit_code=0) from container wikipedia-commons-local-public.0c in codfw |
[production] |
| 08:41 |
<mvernon@cumin2002> |
START - Cookbook sre.swift.remove-ghost-objects from container wikipedia-commons-local-public.0c in codfw |
[production] |
| 08:40 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on db2202.codfw.wmnet with reason: Maintenance |
[production] |
| 08:40 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2188 (T411163 T411164)', diff saved to https://phabricator.wikimedia.org/P87786 and previous config saved to /var/cache/conftool/dbconfig/20260120-084001-marostegui.json |
[production] |
| 08:39 |
<kharlan@deploy2002> |
kharlan: Continuing with sync |
[production] |
| 08:36 |
<kharlan@deploy2002> |
kharlan: Backport for [[gerrit:1223636|IPReputation: Enable OpenSearch IPoid provider on testwiki (T410615)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 08:33 |
<kharlan@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1223636|IPReputation: Enable OpenSearch IPoid provider on testwiki (T410615)]] |
[production] |
| 08:29 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2188', diff saved to https://phabricator.wikimedia.org/P87785 and previous config saved to /var/cache/conftool/dbconfig/20260120-082952-marostegui.json |
[production] |
| 08:19 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2188', diff saved to https://phabricator.wikimedia.org/P87784 and previous config saved to /var/cache/conftool/dbconfig/20260120-081944-marostegui.json |
[production] |
| 08:10 |
<bwojtowicz@deploy2002> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revise-tone-task-generator' for release 'main' . |
[production] |
| 08:09 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2188 (T411163 T411164)', diff saved to https://phabricator.wikimedia.org/P87783 and previous config saved to /var/cache/conftool/dbconfig/20260120-080935-marostegui.json |
[production] |
| 08:08 |
<bwojtowicz@deploy2002> |
helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revise-tone-task-generator' for release 'main' . |
[production] |
| 07:53 |
<cgoubert@deploy2002> |
helmfile [codfw] DONE helmfile.d/admin 'apply'. |
[production] |
| 07:50 |
<moritzm> |
installing unbound security updates |
[production] |
| 07:49 |
<cgoubert@deploy2002> |
helmfile [codfw] START helmfile.d/admin 'apply'. |
[production] |
| 07:48 |
<cgoubert@deploy2002> |
helmfile [eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
| 07:46 |
<cgoubert@deploy2002> |
helmfile [eqiad] START helmfile.d/admin 'apply'. |
[production] |
| 07:45 |
<cgoubert@deploy2002> |
helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
| 07:42 |
<cgoubert@deploy2002> |
helmfile [staging-eqiad] START helmfile.d/admin 'apply'. |
[production] |
| 07:42 |
<cgoubert@deploy2002> |
helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. |
[production] |
| 07:41 |
<cgoubert@deploy2002> |
helmfile [staging-codfw] START helmfile.d/admin 'apply'. |
[production] |
| 06:49 |
<marostegui> |
Deploy schema change on s8 sanitarium master - s8 wikireplicas will be lagging for many hours T411164 T411163 |
[production] |
| 06:48 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db2152 (T411163 T411164)', diff saved to https://phabricator.wikimedia.org/P87781 and previous config saved to /var/cache/conftool/dbconfig/20260120-064801-marostegui.json |
[production] |
| 06:47 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on db2152.codfw.wmnet with reason: Maintenance |
[production] |
| 06:47 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db1167 (T411163 T411164)', diff saved to https://phabricator.wikimedia.org/P87780 and previous config saved to /var/cache/conftool/dbconfig/20260120-064708-marostegui.json |
[production] |
| 06:47 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6 days, 0:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1016,1020].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance |
[production] |
| 06:46 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on db1167.eqiad.wmnet with reason: Maintenance |
[production] |
| 06:37 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1160.eqiad.wmnet with reason: Schema change |
[production] |
| 06:29 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2179.codfw.wmnet with reason: Maintenance |
[production] |