1501-1550 of 10000 results (89ms)
2023-11-27 ยง
14:25 <jhancock@cumin2002> START - Cookbook sre.hosts.provision for host elastic2087.mgmt.codfw.wmnet with reboot policy FORCED [production]
14:24 <urbanecm@deploy2002> urbanecm and anzx: Backport for [[gerrit:975377|bjnwikiquote: add timezone, wgSitename (T350235)]], [[gerrit:975594|dgawiki: add logos, timezone and sitename (T350229)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
14:23 <urbanecm@deploy2002> Started scap: Backport for [[gerrit:975377|bjnwikiquote: add timezone, wgSitename (T350235)]], [[gerrit:975594|dgawiki: add logos, timezone and sitename (T350229)]] [production]
14:21 <urbanecm@deploy2002> Finished scap: Backport for [[gerrit:977644|GrowthExperiments: enable frontend for 15th round of wikis (T308141)]], [[gerrit:975378|zghwiki: add timezone, wgSitename (T350241)]], [[gerrit:975376|bbcwiki: add timezone, wgSitename (T350373)]] (duration: 11m 23s) [production]
14:15 <urbanecm@deploy2002> sgimeno and anzx and urbanecm: Continuing with sync [production]
14:11 <urbanecm@deploy2002> sgimeno and anzx and urbanecm: Backport for [[gerrit:977644|GrowthExperiments: enable frontend for 15th round of wikis (T308141)]], [[gerrit:975378|zghwiki: add timezone, wgSitename (T350241)]], [[gerrit:975376|bbcwiki: add timezone, wgSitename (T350373)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
14:10 <urbanecm@deploy2002> Started scap: Backport for [[gerrit:977644|GrowthExperiments: enable frontend for 15th round of wikis (T308141)]], [[gerrit:975378|zghwiki: add timezone, wgSitename (T350241)]], [[gerrit:975376|bbcwiki: add timezone, wgSitename (T350373)]] [production]
14:10 <urbanecm@deploy2002> Finished scap: Backport for [[gerrit:977614|UserImpact: Bump VERSION to 10 (T329700)]] (duration: 07m 56s) [production]
14:04 <stevemunene@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host druid1007.eqiad.wmnet with OS bullseye [production]
14:03 <urbanecm@deploy2002> urbanecm: Continuing with sync [production]
14:03 <urbanecm@deploy2002> urbanecm: Backport for [[gerrit:977614|UserImpact: Bump VERSION to 10 (T329700)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
14:03 <isaranto@deploy2002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revertrisk' for release 'main' . [production]
14:02 <isaranto@deploy2002> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revertrisk' for release 'main' . [production]
14:02 <urbanecm@deploy2002> Started scap: Backport for [[gerrit:977614|UserImpact: Bump VERSION to 10 (T329700)]] [production]
13:59 <isaranto@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revertrisk' for release 'main' . [production]
13:45 <stevemunene@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on druid1007.eqiad.wmnet with reason: host reimage [production]
13:45 <isaranto@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'llm' for release 'main' . [production]
13:45 <isaranto@deploy2002> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'llm' for release 'main' . [production]
13:45 <isaranto@deploy2002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'llm' for release 'main' . [production]
13:43 <stevemunene@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on druid1007.eqiad.wmnet with reason: host reimage [production]
13:38 <isaranto@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revscoring-editquality-reverted' for release 'main' . [production]
13:38 <isaranto@deploy2002> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-editquality-reverted' for release 'main' . [production]
13:38 <isaranto@deploy2002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-editquality-reverted' for release 'main' . [production]
13:37 <godog> roll-restart prometheus/ops in eqiad/codfw to apply space-based retention - T351179 [production]
13:32 <isaranto@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revscoring-editquality-goodfaith' for release 'main' . [production]
13:31 <isaranto@deploy2002> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-editquality-goodfaith' for release 'main' . [production]
13:30 <isaranto@deploy2002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-editquality-goodfaith' for release 'main' . [production]
13:26 <stevemunene@cumin1001> START - Cookbook sre.hosts.reimage for host druid1007.eqiad.wmnet with OS bullseye [production]
13:20 <cgoubert@deploy2002> helmfile [codfw] DONE helmfile.d/services/mw-api-int: apply [production]
13:19 <cgoubert@deploy2002> helmfile [codfw] START helmfile.d/services/mw-api-int: apply [production]
13:19 <cgoubert@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-api-int: apply [production]
13:19 <cgoubert@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-api-int: apply [production]
13:19 <cgoubert@deploy2002> helmfile [codfw] DONE helmfile.d/services/mw-web: apply [production]
13:09 <cgoubert@deploy2002> helmfile [codfw] START helmfile.d/services/mw-web: apply [production]
13:09 <cgoubert@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-web: apply [production]
13:09 <cgoubert@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-web: apply [production]
13:04 <urbanecm@deploy2002> Finished scap: Backport for [[gerrit:977613|Compress geui_data json blobs (T351898)]], [[gerrit:977645|User impact: timezone cleanup (T329700)]], [[gerrit:977629|UserImpact: Make smaller SQL queries (T351898)]] (duration: 07m 37s) [production]
12:56 <urbanecm@deploy2002> Started scap: Backport for [[gerrit:977613|Compress geui_data json blobs (T351898)]], [[gerrit:977645|User impact: timezone cleanup (T329700)]], [[gerrit:977629|UserImpact: Make smaller SQL queries (T351898)]] [production]
12:34 <jayme@deploy2002> helmfile [eqiad] DONE helmfile.d/services/rest-gateway: apply [production]
12:34 <jayme@deploy2002> helmfile [eqiad] START helmfile.d/services/rest-gateway: apply [production]
12:18 <kart_> Updated cxserver to 2023-11-24-152117-production (T351932) [production]
12:15 <kartik@deploy2002> helmfile [codfw] DONE helmfile.d/services/cxserver: apply [production]
12:15 <kartik@deploy2002> helmfile [codfw] START helmfile.d/services/cxserver: apply [production]
12:14 <kartik@deploy2002> helmfile [eqiad] DONE helmfile.d/services/cxserver: apply [production]
12:13 <kartik@deploy2002> helmfile [eqiad] START helmfile.d/services/cxserver: apply [production]
12:08 <kartik@deploy2002> helmfile [staging] DONE helmfile.d/services/cxserver: apply [production]
12:08 <jayme@deploy2002> helmfile [codfw] DONE helmfile.d/services/rest-gateway: apply [production]
12:08 <kartik@deploy2002> helmfile [staging] START helmfile.d/services/cxserver: apply [production]
12:08 <isaranto@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revscoring-editquality-damaging' for release 'main' . [production]
12:08 <jayme@deploy2002> helmfile [codfw] START helmfile.d/services/rest-gateway: apply [production]