2026-01-20
09:54 <marostegui@cumin1003> START - Cookbook sre.hosts.reimage for host dbproxy1024.eqiad.wmnet with OS trixie [production]
09:50 <kevinbazira@deploy2002> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revertrisk' for release 'main' . [production]
09:46 <kevinbazira@deploy2002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revertrisk' for release 'main' . [production]
09:40 <dpogorzelski@cumin1003> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for registry[2004-2005].codfw.wmnet,registry[1004-1005].eqiad.wmnet [production]
09:40 <dpogorzelski@cumin1003> START - Cookbook sre.hosts.remove-downtime for registry[2004-2005].codfw.wmnet,registry[1004-1005].eqiad.wmnet [production]
09:36 <ayounsi@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host sretest2003.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL [production]
09:35 <ayounsi@cumin1003> START - Cookbook sre.hosts.provision for host sretest2003.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL [production]
09:21 <aklapper@deploy2002> rebuilt and synchronized wikiversions files: group0 to 1.46.0-wmf.12 refs T413803 [production]
09:07 <dpogorzelski@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on registry[2004-2005].codfw.wmnet,registry[1004-1005].eqiad.wmnet with reason: testing ml changes [production]
08:56 <slyngshede@dns1004> END - running authdns-update [production]
08:55 <kharlan@deploy2002> Finished scap sync-world: Backport for [[gerrit:1228993|Hooks: Log the security log context for edit errors (T410877)]] (duration: 07m 24s) [production]
08:55 <slyngshede@dns1004> START - running authdns-update [production]
08:55 <slyngshede@dns1004> START - running authdns-update [production]
08:54 <dpogorzelski@deploy2002> helmfile [ml-serve-codfw] DONE helmfile.d/admin 'sync'. [production]
08:53 <dpogorzelski@deploy2002> helmfile [ml-serve-codfw] START helmfile.d/admin 'sync'. [production]
08:52 <dpogorzelski@deploy2002> helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. [production]
08:51 <kharlan@deploy2002> kharlan: Continuing with sync [production]
08:51 <dpogorzelski@deploy2002> helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. [production]
08:50 <kharlan@deploy2002> kharlan: Backport for [[gerrit:1228993|Hooks: Log the security log context for edit errors (T410877)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
08:48 <kharlan@deploy2002> Started scap sync-world: Backport for [[gerrit:1228993|Hooks: Log the security log context for edit errors (T410877)]] [production]
08:45 <kharlan@deploy2002> Finished scap sync-world: Backport for [[gerrit:1223636|IPReputation: Enable OpenSearch IPoid provider on testwiki (T410615)]] (duration: 11m 43s) [production]
08:44 <mvernon@cumin2002> END (PASS) - Cookbook sre.swift.remove-ghost-objects (exit_code=0) from container wikipedia-commons-local-public.0c in codfw [production]
08:41 <mvernon@cumin2002> START - Cookbook sre.swift.remove-ghost-objects from container wikipedia-commons-local-public.0c in codfw [production]
08:40 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on db2202.codfw.wmnet with reason: Maintenance [production]
08:40 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2188 (T411163 T411164)', diff saved to https://phabricator.wikimedia.org/P87786 and previous config saved to /var/cache/conftool/dbconfig/20260120-084001-marostegui.json [production]
08:39 <kharlan@deploy2002> kharlan: Continuing with sync [production]
08:36 <kharlan@deploy2002> kharlan: Backport for [[gerrit:1223636|IPReputation: Enable OpenSearch IPoid provider on testwiki (T410615)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
08:33 <kharlan@deploy2002> Started scap sync-world: Backport for [[gerrit:1223636|IPReputation: Enable OpenSearch IPoid provider on testwiki (T410615)]] [production]
08:29 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2188', diff saved to https://phabricator.wikimedia.org/P87785 and previous config saved to /var/cache/conftool/dbconfig/20260120-082952-marostegui.json [production]
08:19 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2188', diff saved to https://phabricator.wikimedia.org/P87784 and previous config saved to /var/cache/conftool/dbconfig/20260120-081944-marostegui.json [production]
08:10 <bwojtowicz@deploy2002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revise-tone-task-generator' for release 'main' . [production]
08:09 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2188 (T411163 T411164)', diff saved to https://phabricator.wikimedia.org/P87783 and previous config saved to /var/cache/conftool/dbconfig/20260120-080935-marostegui.json [production]
08:08 <bwojtowicz@deploy2002> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revise-tone-task-generator' for release 'main' . [production]
07:53 <cgoubert@deploy2002> helmfile [codfw] DONE helmfile.d/admin 'apply'. [production]
07:50 <moritzm> installing unbound security updates [production]
07:49 <cgoubert@deploy2002> helmfile [codfw] START helmfile.d/admin 'apply'. [production]
07:48 <cgoubert@deploy2002> helmfile [eqiad] DONE helmfile.d/admin 'apply'. [production]
07:46 <cgoubert@deploy2002> helmfile [eqiad] START helmfile.d/admin 'apply'. [production]
07:45 <cgoubert@deploy2002> helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'. [production]
07:42 <cgoubert@deploy2002> helmfile [staging-eqiad] START helmfile.d/admin 'apply'. [production]
07:42 <cgoubert@deploy2002> helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. [production]
07:41 <cgoubert@deploy2002> helmfile [staging-codfw] START helmfile.d/admin 'apply'. [production]
06:49 <marostegui> Deploy schema change on s8 sanitarium master - s8 wikireplicas will be lagging for many hours T411164 T411163 [production]
06:48 <marostegui@cumin1003> dbctl commit (dc=all): 'Depooling db2152 (T411163 T411164)', diff saved to https://phabricator.wikimedia.org/P87781 and previous config saved to /var/cache/conftool/dbconfig/20260120-064801-marostegui.json [production]
06:47 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on db2152.codfw.wmnet with reason: Maintenance [production]
06:47 <marostegui@cumin1003> dbctl commit (dc=all): 'Depooling db1167 (T411163 T411164)', diff saved to https://phabricator.wikimedia.org/P87780 and previous config saved to /var/cache/conftool/dbconfig/20260120-064708-marostegui.json [production]
06:47 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6 days, 0:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1016,1020].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance [production]
06:46 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on db1167.eqiad.wmnet with reason: Maintenance [production]
06:37 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1160.eqiad.wmnet with reason: Schema change [production]
06:29 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2179.codfw.wmnet with reason: Maintenance [production]