151-200 of 10000 results (107ms)
2026-02-18 §
11:14 <arnaudb@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on gerrit1003.wikimedia.org with reason: host reimage [production]
11:12 <kevinbazira@deploy2002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revertrisk' for release 'main' . [production]
11:09 <kevinbazira@deploy2002> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revertrisk' for release 'main' . [production]
11:07 <fabfur@cumin1003> END (PASS) - Cookbook sre.deploy.hiddenparma (exit_code=0) Hiddenparma deployment to the alerting hosts with reason: "New scope bots - fabfur@cumin1003" [production]
11:07 <fabfur@cumin1003> END (PASS) - Cookbook sre.deploy.python-code (exit_code=0) hiddenparma to alert[1002,2002].wikimedia.org with reason: New scope bots - fabfur@cumin1003 [production]
11:06 <fabfur@cumin1003> START - Cookbook sre.deploy.python-code hiddenparma to alert[1002,2002].wikimedia.org with reason: New scope bots - fabfur@cumin1003 [production]
11:06 <fabfur@cumin1003> START - Cookbook sre.deploy.hiddenparma Hiddenparma deployment to the alerting hosts with reason: "New scope bots - fabfur@cumin1003" [production]
10:56 <arnaudb@cumin1003> START - Cookbook sre.hosts.reimage for host gerrit1003.wikimedia.org with OS bookworm [production]
10:51 <joal@deploy2002> Finished deploy [analytics/refinery@28fa1ea] (thin): Regular analytics weekly train THIN [analytics/refinery@28fa1eac] (duration: 01m 56s) [production]
10:49 <joal@deploy2002> Started deploy [analytics/refinery@28fa1ea] (thin): Regular analytics weekly train THIN [analytics/refinery@28fa1eac] [production]
10:49 <joal@deploy2002> Finished deploy [analytics/refinery@28fa1ea]: Regular analytics weekly train [analytics/refinery@28fa1eac] (duration: 04m 06s) [production]
10:44 <joal@deploy2002> Started deploy [analytics/refinery@28fa1ea]: Regular analytics weekly train [analytics/refinery@28fa1eac] [production]
10:44 <joal@deploy2002> Finished deploy [analytics/refinery@28fa1ea] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@28fa1eac] (duration: 01m 57s) [production]
10:42 <joal@deploy2002> Started deploy [analytics/refinery@28fa1ea] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@28fa1eac] [production]
10:41 <arnaudb@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host gerrit1003.wikimedia.org with OS bookworm [production]
10:26 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cumin2003.codfw.wmnet [production]
10:20 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host cumin2003.codfw.wmnet [production]
09:53 <arnaudb@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on gerrit1003.wikimedia.org with reason: host reimage [production]
09:46 <arnaudb@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on gerrit1003.wikimedia.org with reason: host reimage [production]
09:28 <arnaudb@cumin1003> START - Cookbook sre.hosts.reimage for host gerrit1003.wikimedia.org with OS bookworm [production]
08:58 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host dbproxy1029.eqiad.wmnet with OS trixie [production]
08:38 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dbproxy1029.eqiad.wmnet with reason: host reimage [production]
08:32 <marostegui@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on dbproxy1029.eqiad.wmnet with reason: host reimage [production]
08:19 <marostegui@cumin1003> START - Cookbook sre.hosts.reimage for host dbproxy1029.eqiad.wmnet with OS trixie [production]
05:32 <marostegui@cumin1003> dbctl commit (dc=all): 'Depooling db2179 (T415786)', diff saved to https://phabricator.wikimedia.org/P88861 and previous config saved to /var/cache/conftool/dbconfig/20260218-053229-marostegui.json [production]
05:32 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2179.codfw.wmnet with reason: Maintenance [production]
05:32 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2172 (T415786)', diff saved to https://phabricator.wikimedia.org/P88860 and previous config saved to /var/cache/conftool/dbconfig/20260218-053204-marostegui.json [production]
05:28 <kart_> Updated cxserver to 2026-01-20-115813-production (T415038, T415046, T414558) [production]
05:25 <kartik@deploy2002> helmfile [eqiad] DONE helmfile.d/services/cxserver: apply [production]
05:25 <kartik@deploy2002> helmfile [eqiad] START helmfile.d/services/cxserver: apply [production]
05:24 <kartik@deploy2002> helmfile [codfw] DONE helmfile.d/services/cxserver: apply [production]
05:24 <kartik@deploy2002> helmfile [codfw] START helmfile.d/services/cxserver: apply [production]
05:18 <kartik@deploy2002> helmfile [staging] DONE helmfile.d/services/cxserver: apply [production]
05:17 <kartik@deploy2002> helmfile [staging] START helmfile.d/services/cxserver: apply [production]
05:16 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2172', diff saved to https://phabricator.wikimedia.org/P88859 and previous config saved to /var/cache/conftool/dbconfig/20260218-051656-marostegui.json [production]
05:01 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2172', diff saved to https://phabricator.wikimedia.org/P88858 and previous config saved to /var/cache/conftool/dbconfig/20260218-050148-marostegui.json [production]
04:46 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2172 (T415786)', diff saved to https://phabricator.wikimedia.org/P88857 and previous config saved to /var/cache/conftool/dbconfig/20260218-044639-marostegui.json [production]
03:23 <marostegui@cumin1003> dbctl commit (dc=all): 'Depooling db1199 (T415786)', diff saved to https://phabricator.wikimedia.org/P88856 and previous config saved to /var/cache/conftool/dbconfig/20260218-032324-marostegui.json [production]
03:23 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1199.eqiad.wmnet with reason: Maintenance [production]
03:22 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1190 (T415786)', diff saved to https://phabricator.wikimedia.org/P88855 and previous config saved to /var/cache/conftool/dbconfig/20260218-032258-marostegui.json [production]
03:07 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1190', diff saved to https://phabricator.wikimedia.org/P88854 and previous config saved to /var/cache/conftool/dbconfig/20260218-030750-marostegui.json [production]
02:52 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1190', diff saved to https://phabricator.wikimedia.org/P88853 and previous config saved to /var/cache/conftool/dbconfig/20260218-025242-marostegui.json [production]
02:37 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1190 (T415786)', diff saved to https://phabricator.wikimedia.org/P88852 and previous config saved to /var/cache/conftool/dbconfig/20260218-023733-marostegui.json [production]
01:02 <zabe@deploy2002> Finished scap sync-world: Backport for [[gerrit:1239762|Add small comment pointing to ForeignDBViaLBRepo above file migration (T416548)]] (duration: 11m 16s) [production]
00:58 <Krinkle> Edit Module:Date on various wikis in attempt to mitigate T416616, T416540. Details at https://phabricator.wikimedia.org/T416616#11625838. [production]
00:55 <zabe@deploy2002> zabe: Continuing with sync [production]
00:55 <zabe@deploy2002> zabe: Backport for [[gerrit:1239762|Add small comment pointing to ForeignDBViaLBRepo above file migration (T416548)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
00:50 <zabe@deploy2002> Started scap sync-world: Backport for [[gerrit:1239762|Add small comment pointing to ForeignDBViaLBRepo above file migration (T416548)]] [production]
2026-02-17 §
22:03 <kemayo@deploy2002> Finished scap sync-world: Backport for [[gerrit:1240041|EditCheck: update shown stats on initial page load (T417452)]], [[gerrit:1240043|EditCheck: adjust editsuggestion-seen tag (T413419)]] (duration: 40m 26s) [production]
21:50 <kemayo@deploy2002> caro, kemayo: Continuing with sync [production]