production SAL

2022-08-05
14:43	<jbond>	upload fressian to puppet7 component	[production]
14:40	<pt1979@cumin1001>	START - Cookbook sre.hosts.reimage for host db1185.eqiad.wmnet with OS bullseye	[production]
14:40	<jbond>	upload test-generative-clojure to puppet7 component	[production]
14:35	<pt1979@cumin2002>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
14:34	<jbond>	upload data-generators-clojure to puppet7 component	[production]
14:31	<pt1979@cumin2002>	START - Cookbook sre.dns.netbox	[production]
14:23	<jbond>	upload encore-clojure to puppet7 component	[production]
14:17	<jbond>	upload truss-clojure to puppet7 component	[production]
14:13	<jbond>	upload structured-logging-clojure to puppet7 component	[production]
14:06	<jbond>	upload murphy-clojure to puppet7 component	[production]
13:57	<jbond>	upload logstash-logback-encoder-7.2 to puppet7 component	[production]
13:49	<jbond>	upload kitchensink-clojure to puppet7 component	[production]
13:27	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depool hosts with fragile power supply (T314559 T314628)', diff saved to https://phabricator.wikimedia.org/P32292 and previous config saved to /var/cache/conftool/dbconfig/20220805-132709-ladsgroup.json	[production]
13:12	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 12:00:00 on db2095.codfw.wmnet with reason: Maintenance	[production]
13:12	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 2 days, 12:00:00 on db2095.codfw.wmnet with reason: Maintenance	[production]
13:09	<sukhe>	repool codfw	[production]
13:02	<jbond>	upload honeysql-clojure to puppet7 component	[production]
12:53	<_joe_>	progressive repool of services in codfw	[production]
12:24	<moritzm>	installing nano bugfix updates from bullseye point release	[production]
11:50	<hnowlan@deploy1002>	helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: sync	[production]
11:40	<hnowlan@deploy1002>	helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: sync	[production]
11:37	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repool after PDU maint on D3 (T310146)', diff saved to https://phabricator.wikimedia.org/P32291 and previous config saved to /var/cache/conftool/dbconfig/20220805-113729-ladsgroup.json	[production]
11:35	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repool after PDU maint on C6 (T310145)', diff saved to https://phabricator.wikimedia.org/P32290 and previous config saved to /var/cache/conftool/dbconfig/20220805-113555-ladsgroup.json	[production]
11:34	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repool after PDU maint on C5 (T310145)', diff saved to https://phabricator.wikimedia.org/P32289 and previous config saved to /var/cache/conftool/dbconfig/20220805-113436-ladsgroup.json	[production]
10:46	<hnowlan@deploy1002>	helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: sync	[production]
10:36	<hnowlan@deploy1002>	helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: sync	[production]
10:17	<hnowlan@deploy1002>	helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: sync	[production]
10:12	<Amir1>	dbmaint at s4@codfw (T312863)	[production]
10:07	<hnowlan@deploy1002>	helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: sync	[production]
09:04	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on 12 hosts with reason: Maintenance	[production]
09:03	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on 12 hosts with reason: Maintenance	[production]
09:03	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2110.codfw.wmnet with reason: Maintenance	[production]
09:03	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2110.codfw.wmnet with reason: Maintenance	[production]
00:53	<dzahn@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8 days, 0:00:00 on gerrit2001.wikimedia.org with reason: decom, replaced by gerrit2002	[production]
00:53	<dzahn@cumin1001>	START - Cookbook sre.hosts.downtime for 8 days, 0:00:00 on gerrit2001.wikimedia.org with reason: decom, replaced by gerrit2002	[production]
00:53	<dzahn@cumin1001>	END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for gerrit2002.wikimedia.org	[production]
00:53	<dzahn@cumin1001>	START - Cookbook sre.hosts.remove-downtime for gerrit2002.wikimedia.org	[production]
00:52	<dzahn@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8 days, 0:00:00 on gerrit2002.wikimedia.org with reason: decom, replaced by gerrit2002	[production]
00:52	<dzahn@cumin1001>	START - Cookbook sre.hosts.downtime for 8 days, 0:00:00 on gerrit2002.wikimedia.org with reason: decom, replaced by gerrit2002	[production]
00:18	<mutante>	restarting gerrit for config change - removing old replica T313250	[production]
2022-08-04
23:06	<mutante>	switching gerrit-replica.wikimedia.org to new machine gerrit2002, dropping gerrit-replica-new.wikimedia.org T313250	[production]
21:07	<ryankemper@deploy1002>	helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: apply	[production]
20:59	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: apply	[production]
20:57	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply	[production]
20:57	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply	[production]
20:56	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply	[production]
20:56	<thcipriani@deploy1002>	Finished scap: Backport for [[gerrit:819774]] tkwiki: Update wordmark (duration: 06m 12s)	[production]
20:51	<ryankemper@deploy1002>	helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: apply	[production]
20:51	<ryankemper@deploy1002>	helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: apply	[production]
20:51	<ryankemper@deploy1002>	helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: apply	[production]