production SAL

2026-03-31
17:39	<brett@cumin2002>	DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 30 days, 0:00:00 on hcaptcha[1001-1002,2001-2002].wikimedia.org with reason: Decommissioning	[production]
17:37	<akhatun@deploy1003>	helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mw-page-edit-type-enrich-next: apply	[production]
17:37	<akhatun@deploy1003>	helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mw-page-edit-type-enrich-next: apply	[production]
17:36	<brett@dns1006>	END - running authdns-update	[production]
17:35	<eevans@cumin1003>	END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching restbase[1031,2034]*: Upgrade Cassandra to 4.1.11 — T418417 - eevans@cumin1003	[production]
17:34	<brett@dns1006>	START - running authdns-update	[production]
17:29	<brett@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp1110.eqiad.wmnet with reason: host reimage	[production]
17:25	<brett@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on cp1110.eqiad.wmnet with reason: host reimage	[production]
17:18	<eevans@cumin1003>	START - Cookbook sre.cassandra.roll-restart for nodes matching restbase[1031,2034]*: Upgrade Cassandra to 4.1.11 — T418417 - eevans@cumin1003	[production]
17:16	<btullis@cumin1003>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dse-k8s-worker1003.eqiad.wmnet	[production]
17:13	<btullis@deploy1003>	helmfile [dse-k8s-eqiad] DONE helmfile.d/services/mediawiki-dumps-legacy: apply	[production]
17:12	<btullis@deploy1003>	helmfile [dse-k8s-eqiad] START helmfile.d/services/mediawiki-dumps-legacy: apply	[production]
17:10	<btullis@cumin1003>	START - Cookbook sre.hosts.reboot-single for host dse-k8s-worker1003.eqiad.wmnet	[production]
17:10	<joal@deploy1003>	Finished deploy [analytics/refinery@8d91f24] (thin): Regular analytics weekly train THIN [analytics/refinery@8d91f242] (duration: 01m 56s)	[production]
17:08	<brett@cumin2002>	START - Cookbook sre.hosts.reimage for host cp1110.eqiad.wmnet with OS trixie	[production]
17:08	<joal@deploy1003>	Started deploy [analytics/refinery@8d91f24] (thin): Regular analytics weekly train THIN [analytics/refinery@8d91f242]	[production]
17:08	<brett@cumin2002>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp1110.eqiad.wmnet with OS trixie	[production]
17:08	<joal@deploy1003>	Finished deploy [analytics/refinery@8d91f24]: Regular analytics weekly train [analytics/refinery@8d91f242] (duration: 07m 47s)	[production]
17:07	<otto@deploy1003>	helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mw-page-html-content-change-enrich: apply	[production]
17:07	<otto@deploy1003>	helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mw-page-html-content-change-enrich: apply	[production]
17:07	<javiermonton@deploy1003>	helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mw-page-html-content-change-enrich: apply	[production]
17:07	<javiermonton@deploy1003>	helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mw-page-html-content-change-enrich: apply	[production]
17:07	<javiermonton@deploy1003>	helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mw-page-html-content-change-enrich: apply	[production]
17:06	<javiermonton@deploy1003>	helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mw-page-html-content-change-enrich: apply	[production]
17:00	<joal@deploy1003>	Started deploy [analytics/refinery@8d91f24]: Regular analytics weekly train [analytics/refinery@8d91f242]	[production]
16:59	<joal@deploy1003>	Finished deploy [analytics/refinery@8d91f24] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@8d91f242] (duration: 01m 53s)	[production]
16:58	<joal@deploy1003>	Started deploy [analytics/refinery@8d91f24] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@8d91f242]	[production]
16:56	<brett@cumin2002>	START - Cookbook sre.hosts.reimage for host cp1110.eqiad.wmnet with OS trixie	[production]
16:55	<eevans@cumin1003>	END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching aqs2001.codfw.wmnet: Upgrade Cassandra to 4.1.11 — T418417 - eevans@cumin1003	[production]
16:54	<javiermonton@deploy1003>	helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mw-page-html-content-change-enrich: apply	[production]
16:54	<javiermonton@deploy1003>	helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mw-page-html-content-change-enrich: apply	[production]
16:51	<moritzm>	installing Bind security updates (client-side tools and libs)	[production]
16:48	<eevans@cumin1003>	START - Cookbook sre.cassandra.roll-restart for nodes matching aqs2001.codfw.wmnet: Upgrade Cassandra to 4.1.11 — T418417 - eevans@cumin1003	[production]
16:45	<eevans@cumin1003>	END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching aqs1010.eqiad.wmnet: Upgrade Cassandra to 4.1.11 — T418417 - eevans@cumin1003	[production]
16:45	<ayounsi@cumin1003>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1365.eqiad.wmnet with OS trixie	[production]
16:40	<fceratto@cumin1003>	dbctl commit (dc=all): 'Repooling after maintenance db2238 (T419635)', diff saved to https://phabricator.wikimedia.org/P90094 and previous config saved to /var/cache/conftool/dbconfig/20260331-164004-fceratto.json	[production]
16:37	<eevans@cumin1003>	START - Cookbook sre.cassandra.roll-restart for nodes matching aqs1010.eqiad.wmnet: Upgrade Cassandra to 4.1.11 — T418417 - eevans@cumin1003	[production]
16:31	<eevans@cumin1003>	END (PASS) - Cookbook sre.misc-clusters.roll-restart-restbase (exit_code=0) rolling restart_daemons on A:restbase	[production]
16:29	<fceratto@cumin1003>	dbctl commit (dc=all): 'Repooling after maintenance db2238', diff saved to https://phabricator.wikimedia.org/P90092 and previous config saved to /var/cache/conftool/dbconfig/20260331-162956-fceratto.json	[production]
16:29	<ayounsi@cumin1003>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1365.eqiad.wmnet with reason: host reimage	[production]
16:28	<cdobbins@cumin2002>	conftool action : set/pooled=yes; selector: name=cp1114.eqiad.wmnet [reason: trixie reimaging]	[production]
16:22	<ayounsi@cumin1003>	START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1365.eqiad.wmnet with reason: host reimage	[production]
16:22	<akhatun@deploy1003>	helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mw-page-edit-type-enrich-next: apply	[production]
16:22	<akhatun@deploy1003>	helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mw-page-edit-type-enrich-next: apply	[production]
16:19	<fceratto@cumin1003>	dbctl commit (dc=all): 'Repooling after maintenance db2238', diff saved to https://phabricator.wikimedia.org/P90091 and previous config saved to /var/cache/conftool/dbconfig/20260331-161947-fceratto.json	[production]
16:15	<eevans@cumin1003>	START - Cookbook sre.misc-clusters.roll-restart-restbase rolling restart_daemons on A:restbase	[production]
16:14	<ebysans@deploy1003>	helmfile [staging] DONE helmfile.d/services/media-analytics: apply	[production]
16:12	<rzl@deploy1003>	helmfile [codfw] DONE helmfile.d/services/mw-experimental: apply	[production]
16:12	<rzl@deploy1003>	helmfile [codfw] START helmfile.d/services/mw-experimental: apply	[production]
16:11	<rzl@deploy1003>	helmfile [eqiad] DONE helmfile.d/services/mw-experimental: apply	[production]