production SAL

4851-4900 of 10000 results (111ms)

2024-06-13 §
00:19	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1182', diff saved to https://phabricator.wikimedia.org/P64753 and previous config saved to /var/cache/conftool/dbconfig/20240613-001937-ladsgroup.json	[production]
00:04	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1182', diff saved to https://phabricator.wikimedia.org/P64752 and previous config saved to /var/cache/conftool/dbconfig/20240613-000430-ladsgroup.json	[production]
2024-06-12 §
23:49	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1182 (T352010)', diff saved to https://phabricator.wikimedia.org/P64751 and previous config saved to /var/cache/conftool/dbconfig/20240612-234923-ladsgroup.json	[production]
22:17	<brett@puppetmaster1001>	conftool action : set/pooled=yes; selector: name=cp4037.ulsfo.wmnet	[production]
22:13	<krinkle@deploy1002>	Finished scap: Backport for [[gerrit:891733\|Move etcd.php from wmf-config/ to src/ (T308932)]] (duration: 13m 42s)	[production]
22:10	<eevans@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/data-gateway: apply	[production]
22:08	<eevans@deploy1002>	helmfile [eqiad] START helmfile.d/services/data-gateway: apply	[production]
22:07	<brett@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp4037.ulsfo.wmnet with OS bullseye	[production]
22:06	<eevans@deploy1002>	helmfile [codfw] DONE helmfile.d/services/data-gateway: apply	[production]
22:04	<krinkle@deploy1002>	krinkle: Continuing with sync	[production]
22:04	<eevans@deploy1002>	helmfile [codfw] START helmfile.d/services/data-gateway: apply	[production]
22:03	<krinkle@deploy1002>	krinkle: Backport for [[gerrit:891733\|Move etcd.php from wmf-config/ to src/ (T308932)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug)	[production]
21:59	<krinkle@deploy1002>	Started scap: Backport for [[gerrit:891733\|Move etcd.php from wmf-config/ to src/ (T308932)]]	[production]
21:44	<brett@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp4037.ulsfo.wmnet with reason: host reimage	[production]
21:42	<eevans@cumin1002>	END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:cassandra-dev: Apply remote logging fix (r1042273) - eevans@cumin1002	[production]
21:41	<brett@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on cp4037.ulsfo.wmnet with reason: host reimage	[production]
21:36	<swfrench@deploy1002>	helmfile [staging] DONE helmfile.d/services/device-analytics: sync	[production]
21:36	<swfrench@deploy1002>	helmfile [staging] START helmfile.d/services/device-analytics: sync	[production]
21:36	<swfrench@deploy1002>	helmfile [staging] DONE helmfile.d/services/device-analytics: apply	[production]
21:35	<swfrench@deploy1002>	helmfile [staging] START helmfile.d/services/device-analytics: apply	[production]
21:34	<swfrench@deploy1002>	helmfile [staging] DONE helmfile.d/services/edit-analytics: apply	[production]
21:33	<swfrench@deploy1002>	helmfile [staging] START helmfile.d/services/edit-analytics: apply	[production]
21:33	<swfrench@deploy1002>	helmfile [staging] DONE helmfile.d/services/editor-analytics: apply	[production]
21:32	<swfrench@deploy1002>	helmfile [staging] START helmfile.d/services/editor-analytics: apply	[production]
21:31	<swfrench@deploy1002>	helmfile [staging] DONE helmfile.d/services/geo-analytics: sync	[production]
21:31	<swfrench@deploy1002>	helmfile [staging] START helmfile.d/services/geo-analytics: sync	[production]
21:30	<swfrench@deploy1002>	helmfile [staging] DONE helmfile.d/services/geo-analytics: apply	[production]
21:30	<swfrench@deploy1002>	helmfile [staging] START helmfile.d/services/geo-analytics: apply	[production]
21:28	<swfrench@deploy1002>	helmfile [codfw] START helmfile.d/services/image-suggestion: sync	[production]
21:28	<swfrench@deploy1002>	helmfile [staging] DONE helmfile.d/services/image-suggestion: apply	[production]
21:28	<swfrench@deploy1002>	helmfile [staging] START helmfile.d/services/image-suggestion: apply	[production]
21:27	<swfrench@deploy1002>	helmfile [staging] DONE helmfile.d/services/media-analytics: apply	[production]
21:26	<swfrench@deploy1002>	helmfile [staging] START helmfile.d/services/media-analytics: apply	[production]
21:25	<swfrench@deploy1002>	helmfile [staging] DONE helmfile.d/services/page-analytics: apply	[production]
21:24	<swfrench@deploy1002>	helmfile [staging] START helmfile.d/services/page-analytics: apply	[production]
21:22	<swfrench@deploy1002>	helmfile [codfw] DONE helmfile.d/services/data-gateway: sync	[production]
21:22	<swfrench@deploy1002>	helmfile [codfw] START helmfile.d/services/data-gateway: sync	[production]
21:21	<eevans@cumin1002>	START - Cookbook sre.cassandra.roll-restart for nodes matching A:cassandra-dev: Apply remote logging fix (r1042273) - eevans@cumin1002	[production]
21:20	<eevans@cumin1002>	END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching aqs1010.eqiad.wmnet: Apply remote logging fix (r1042273) - eevans@cumin1002	[production]
21:19	<brett@cumin2002>	START - Cookbook sre.hosts.reimage for host cp4037.ulsfo.wmnet with OS bullseye	[production]
21:18	<brett@cumin2002>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp4037.ulsfo.wmnet with OS bullseye	[production]
21:17	<swfrench@deploy1002>	helmfile [codfw] DONE helmfile.d/services/data-gateway: apply	[production]
21:17	<swfrench@deploy1002>	helmfile [codfw] START helmfile.d/services/data-gateway: apply	[production]
21:13	<eevans@cumin1002>	START - Cookbook sre.cassandra.roll-restart for nodes matching aqs1010.eqiad.wmnet: Apply remote logging fix (r1042273) - eevans@cumin1002	[production]
21:11	<ryankemper@cumin2002>	END (PASS) - Cookbook sre.wdqs.data-reload (exit_code=0) reloading wikidata_full on wdqs2023.codfw.wmnet from DumpsSource.HDFS (hdfs:///wmf/discovery/wdqs-reload-cookbook-test-T349069/ using stat1009.eqiad.wmnet)	[production]
21:05	<ryankemper@cumin2002>	START - Cookbook sre.hadoop.reboot-workers for Hadoop analytics cluster	[production]
21:05	<ryankemper@cumin2002>	START - Cookbook sre.wdqs.data-reload reloading wikidata_full on wdqs2023.codfw.wmnet from DumpsSource.HDFS (hdfs:///wmf/discovery/wdqs-reload-cookbook-test-T349069/ using stat1009.eqiad.wmnet)	[production]
21:04	<brett@cumin2002>	START - Cookbook sre.hosts.reimage for host cp4037.ulsfo.wmnet with OS bullseye	[production]
20:53	<cjming>	end of UTC late backport window	[production]
20:52	<cjming@deploy1002>	Finished scap: Backport for [[gerrit:1041674\|Don't squish images in non-responsive skins e.g. Vector 2010 (T113101)]] (duration: 12m 52s)	[production]