__all__ SAL

9251-9300 of 10000 results (98ms)

2022-10-07 §
17:51	<ryankemper>	[Elastic] Updated list of cross-cluster remote seeds for all eqiad/codfw elastic clusters; should resolve `ElasticSearch setting check` alerts	[production]
17:43	<wm-bot>	<lucaswerkmeister> commented out static-cleaner cronjob, created toolforge-jobs periodic job instead (T319609)	[tools.bridgebot]
17:20	<sukhe>	sudo gnt-node evacuate -s ganeti4004.ulsfo.wmnet	[production]
17:13	<sukhe>	migrate ganeti4004: T317249	[production]
17:03	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: apply	[production]
17:02	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply	[production]
17:02	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply	[production]
17:01	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply	[production]
16:59	<brennen@deploy1002>	rebuilt and synchronized wikiversions files: all wikis to 1.40.0-wmf.4 refs T314193	[production]
16:56	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: apply	[production]
16:52	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply	[production]
16:52	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply	[production]
16:51	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply	[production]
16:50	<brennen@deploy1002>	Finished scap: Backport for [[gerrit:840041\|RecentSignificantEditStore: Force section titles to be an index array (T319799)]] (duration: 06m 41s)	[production]
16:46	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: apply	[production]
16:46	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply	[production]
16:46	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply	[production]
16:44	<brennen@deploy1002>	brennen and kartik: Backport for [[gerrit:840041\|RecentSignificantEditStore: Force section titles to be an index array (T319799)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet	[production]
16:43	<brennen@deploy1002>	Started scap: Backport for [[gerrit:840041\|RecentSignificantEditStore: Force section titles to be an index array (T319799)]]	[production]
16:42	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply	[production]
16:42	<brennen@deploy1002>	Finished scap: Backport for [[gerrit:840180\|Check whether title actually exists (T319798)]] (duration: 05m 47s)	[production]
16:36	<brennen@deploy1002>	brennen and brennen: Backport for [[gerrit:840180\|Check whether title actually exists (T319798)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet	[production]
16:36	<brennen@deploy1002>	Started scap: Backport for [[gerrit:840180\|Check whether title actually exists (T319798)]]	[production]
16:15	<brennen>	train 1.40.0-wmf.4 (T314193) blockers have patches; after discussion in releng, going ahead with friday deploy in interest of avoiding a scramble during the coming holiday week	[production]
15:09	<btullis@cumin1001>	END (PASS) - Cookbook sre.aqs.roll-restart (exit_code=0) for AQS aqs cluster: Roll restart of all AQS's nodejs daemons.	[production]
14:57	<btullis@cumin1001>	START - Cookbook sre.aqs.roll-restart for AQS aqs cluster: Roll restart of all AQS's nodejs daemons.	[production]
14:35	<jmm@cumin2002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest1002.eqiad.wmnet with OS buster	[production]
13:40	<andrewbogott>	dhinus is resetting rabbitmq cluster in an attempt to resolve a suspected (by Andrew) split-brain	[admin]
13:27	<James_F>	Zuul: Add two former contractors to the CI allowlist	[releng]
13:26	<jmm@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on sretest1002.eqiad.wmnet with reason: host reimage	[production]
13:24	<jmm@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on sretest1002.eqiad.wmnet with reason: host reimage	[production]
13:11	<jmm@cumin2002>	START - Cookbook sre.hosts.reimage for host sretest1002.eqiad.wmnet with OS buster	[production]
13:08	<jmm@cumin2002>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest1001.eqiad.wmnet with OS buster	[production]
13:02	<taavi>	taavi@cloudcontrol1005 ~ $ sudo mark_tool --disable oncall # T320240	[tools]
12:40	<jmm@cumin2002>	START - Cookbook sre.hosts.reimage for host sretest1001.eqiad.wmnet with OS buster	[production]
12:39	<jmm@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host sretest1001.eqiad.wmnet with OS buster	[production]
12:20	<jmm@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on sretest1001.eqiad.wmnet with reason: host reimage	[production]
12:17	<jmm@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on sretest1001.eqiad.wmnet with reason: host reimage	[production]
12:02	<jmm@cumin2002>	START - Cookbook sre.hosts.reimage for host sretest1001.eqiad.wmnet with OS buster	[production]
11:57	<jmm@cumin2002>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest1001.eqiad.wmnet with OS buster	[production]
11:57	<jmm@cumin2002>	START - Cookbook sre.hosts.reimage for host sretest1001.eqiad.wmnet with OS buster	[production]
11:56	<jmm@cumin2002>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest1001.eqiad.wmnet with OS buster	[production]
11:51	<jmm@cumin2002>	START - Cookbook sre.hosts.reimage for host sretest1001.eqiad.wmnet with OS buster	[production]
11:50	<jmm@cumin2002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest1001.eqiad.wmnet with OS buster	[production]
11:50	<jmm@cumin2002>	START - Cookbook sre.hosts.reimage for host sretest1001.eqiad.wmnet with OS buster	[production]
11:50	<jmm@cumin2002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest1001.eqiad.wmnet with OS buster	[production]
11:50	<jmm@cumin2002>	START - Cookbook sre.hosts.reimage for host sretest1001.eqiad.wmnet with OS buster	[production]
11:49	<jmm@cumin2002>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest1001.eqiad.wmnet with OS buster	[production]
11:33	<arturo>	rabbitmq-server.service @ cloudrabbit1002 is again up and running (T320232)	[admin]
11:27	<jmm@cumin2002>	START - Cookbook sre.hosts.reimage for host sretest1001.eqiad.wmnet with OS buster	[production]