production SAL

1151-1200 of 10000 results (77ms)

2024-02-27 §
20:50	<cdanis@deploy2002>	helmfile [aux-k8s-eqiad] START helmfile.d/aus-k8s-eqiad-services/jaeger: apply	[production]
20:48	<cdanis@deploy2002>	helmfile [aux-k8s-eqiad] START helmfile.d/aus-k8s-eqiad-services/jaeger: sync	[production]
20:48	<cdanis@deploy2002>	helmfile [aux-k8s-eqiad] DONE helmfile.d/aus-k8s-eqiad-services/jaeger: sync	[production]
20:48	<cdanis@deploy2002>	helmfile [aux-k8s-eqiad] START helmfile.d/aus-k8s-eqiad-services/jaeger: sync	[production]
20:47	<cdanis@deploy2002>	helmfile [aux-k8s-eqiad] DONE helmfile.d/aus-k8s-eqiad-services/jaeger: apply	[production]
20:47	<cdanis@deploy2002>	helmfile [aux-k8s-eqiad] START helmfile.d/aus-k8s-eqiad-services/jaeger: apply	[production]
20:45	<cdanis@deploy2002>	helmfile [aux-k8s-eqiad] DONE helmfile.d/aus-k8s-eqiad-services/jaeger: apply	[production]
20:45	<cdanis@deploy2002>	helmfile [aux-k8s-eqiad] START helmfile.d/aus-k8s-eqiad-services/jaeger: apply	[production]
20:43	<cdanis@deploy2002>	helmfile [aux-k8s-eqiad] START helmfile.d/aus-k8s-eqiad-services/jaeger: apply	[production]
20:41	<cdanis@deploy2002>	helmfile [aux-k8s-eqiad] DONE helmfile.d/aus-k8s-eqiad-services/jaeger: apply	[production]
20:40	<cdanis@deploy2002>	helmfile [aux-k8s-eqiad] START helmfile.d/aus-k8s-eqiad-services/jaeger: apply	[production]
19:47	<ryankemper@cumin2002>	END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) (T347624, testing 961878 patch) xfer categories from wdqs2024.codfw.wmnet -> wdqs2025.codfw.wmnet w/ force delete existing files, repooling source-only afterwards	[production]
19:40	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Depooling db2177 (T352010)', diff saved to https://phabricator.wikimedia.org/P58012 and previous config saved to /var/cache/conftool/dbconfig/20240227-194021-ladsgroup.json	[production]
19:40	<ladsgroup@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2177.codfw.wmnet with reason: Maintenance	[production]
19:40	<ladsgroup@cumin1002>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2177.codfw.wmnet with reason: Maintenance	[production]
19:36	<ryankemper@cumin2002>	START - Cookbook sre.wdqs.data-transfer (T347624, testing 961878 patch) xfer categories from wdqs2024.codfw.wmnet -> wdqs2025.codfw.wmnet w/ force delete existing files, repooling source-only afterwards	[production]
19:26	<dduvall@deploy2002>	rebuilt and synchronized wikiversions files: group0 wikis to 1.42.0-wmf.20 refs T354438	[production]
18:57	<tchin>	finished deploying refinery successfully	[production]
18:53	<tchin@deploy2002>	Finished deploy [analytics/refinery@ac9fd7b] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@ac9fd7b4] (duration: 03m 42s)	[production]
18:50	<tchin@deploy2002>	Started deploy [analytics/refinery@ac9fd7b] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@ac9fd7b4]	[production]
18:50	<tchin@deploy2002>	Finished deploy [analytics/refinery@ac9fd7b] (thin): Regular analytics weekly train THIN [analytics/refinery@ac9fd7b4] (duration: 00m 06s)	[production]
18:49	<tchin@deploy2002>	Started deploy [analytics/refinery@ac9fd7b] (thin): Regular analytics weekly train THIN [analytics/refinery@ac9fd7b4]	[production]
18:49	<tchin@deploy2002>	Finished deploy [analytics/refinery@ac9fd7b]: Regular analytics weekly train [analytics/refinery@ac9fd7b4] (duration: 00m 18s)	[production]
18:49	<tchin@deploy2002>	Started deploy [analytics/refinery@ac9fd7b]: Regular analytics weekly train [analytics/refinery@ac9fd7b4]	[production]
18:48	<jclark@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host logging-hd1003.eqiad.wmnet with OS bookworm	[production]
18:48	<jclark@cumin1002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1002"	[production]
18:48	<jclark@cumin1002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host logging-hd1001.eqiad.wmnet with OS bookworm	[production]
18:48	<jclark@cumin1002>	START - Cookbook sre.hosts.reimage for host logging-hd1001.eqiad.wmnet with OS bookworm	[production]
18:46	<jclark@cumin1002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1002"	[production]
18:46	<jclark@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host logging-hd1001.eqiad.wmnet with OS bookworm	[production]
18:46	<jclark@cumin1002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1002"	[production]
18:44	<jclark@cumin1002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1002"	[production]
18:38	<tchin>	rollbacked refinery deployment, failed on stat1010 and stat1011	[production]
18:37	<tchin@deploy2002>	Finished deploy [analytics/refinery@ac9fd7b]: Regular analytics weekly train [analytics/refinery@ac9fd7b4] (duration: 09m 51s)	[production]
18:27	<tchin@deploy2002>	Started deploy [analytics/refinery@ac9fd7b]: Regular analytics weekly train [analytics/refinery@ac9fd7b4]	[production]
18:25	<tchin>	deploying refinery	[production]
18:25	<jclark@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host logging-hd1002.eqiad.wmnet with OS bookworm	[production]
18:25	<jclark@cumin1002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1002"	[production]
18:24	<jclark@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on logging-hd1003.eqiad.wmnet with reason: host reimage	[production]
18:23	<jclark@cumin1002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1002"	[production]
18:22	<jclark@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on logging-hd1001.eqiad.wmnet with reason: host reimage	[production]
18:22	<tchin@deploy2002>	helmfile [codfw] DONE helmfile.d/services/eventstreams: apply	[production]
18:21	<tchin@deploy2002>	helmfile [codfw] START helmfile.d/services/eventstreams: apply	[production]
18:19	<jclark@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on logging-hd1003.eqiad.wmnet with reason: host reimage	[production]
18:19	<jclark@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on logging-hd1001.eqiad.wmnet with reason: host reimage	[production]
18:18	<tchin@deploy2002>	helmfile [eqiad] DONE helmfile.d/services/eventstreams: apply	[production]
18:17	<tchin@deploy2002>	helmfile [eqiad] START helmfile.d/services/eventstreams: apply	[production]
18:15	<tchin@deploy2002>	helmfile [staging] DONE helmfile.d/services/eventstreams: apply	[production]
18:15	<tchin@deploy2002>	helmfile [staging] START helmfile.d/services/eventstreams: apply	[production]
18:01	<jclark@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on logging-hd1002.eqiad.wmnet with reason: host reimage	[production]