production SAL

5351-5400 of 10000 results (100ms)

2023-05-02 §
14:59	<jiji@cumin1001>	START - Cookbook sre.discovery.datacenter pool all active/active services in codfw: codfw row C switches upgrade - T334049	[production]
14:59	<jclark@cumin1001>	START - Cookbook sre.dns.netbox	[production]
14:58	<hnowlan@deploy1002>	helmfile [codfw] DONE helmfile.d/services/thumbor: apply	[production]
14:56	<jclark@cumin1001>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
14:55	<hnowlan@deploy1002>	helmfile [codfw] START helmfile.d/services/thumbor: apply	[production]
14:54	<jclark@cumin1001>	START - Cookbook sre.dns.netbox	[production]
14:53	<hnowlan@deploy1002>	helmfile [staging] DONE helmfile.d/services/thumbor: apply	[production]
14:52	<hnowlan@deploy1002>	helmfile [staging] START helmfile.d/services/thumbor: apply	[production]
14:40	<moritzm>	installing intel-microcode security updates on bullseye servers	[production]
14:40	<akosiaris>	emergency disabling of puppet on parse hosts	[production]
14:33	<akosiaris@deploy1002>	helmfile [staging] DONE helmfile.d/services/machinetranslation: sync	[production]
14:33	<claime>	Merging new internal certs for api, jobrunner, appservers, parsoid - T313227	[production]
14:29	<akosiaris@deploy1002>	helmfile [staging] START helmfile.d/services/machinetranslation: sync	[production]
14:27	<denisse>	sync prometheus3001 -> prometheus3002	[production]
14:27	<akosiaris@deploy1002>	helmfile [staging] DONE helmfile.d/services/machinetranslation: apply	[production]
14:23	<_joe_>	also on contint1002, the current ci master	[production]
14:22	<_joe_>	restarted zuul on contint2001	[production]
14:07	<akosiaris@deploy1002>	helmfile [staging] START helmfile.d/services/machinetranslation: apply	[production]
13:51	<sukhe>	run authdns-update to repool codfw	[production]
13:47	<cgoubert@cumin1001>	conftool action : set/pooled=yes; selector: dc=codfw,cluster=parsoid	[production]
13:47	<cgoubert@cumin1001>	conftool action : set/pooled=yes; selector: dc=codfw,cluster=appserver	[production]
13:47	<cgoubert@cumin1001>	conftool action : set/pooled=yes; selector: dc=codfw,cluster=api_appserver	[production]
13:45	<akosiaris@deploy1002>	helmfile [staging] DONE helmfile.d/services/machinetranslation: apply	[production]
13:37	<urbanecm@deploy1002>	Finished scap: Backport for [[gerrit:914267\|Enable Kartographer Nearby on mobile (T333137)]], [[gerrit:914286\|Fix clearing wrong container when closing fullscreen map (T335648)]], [[gerrit:914287\|Fix clearing wrong container when closing fullscreen map (T335648)]] (duration: 14m 54s)	[production]
13:25	<akosiaris@deploy1002>	helmfile [staging] START helmfile.d/services/machinetranslation: apply	[production]
13:24	<jmm@puppetmaster1001>	conftool action : set/pooled=yes; selector: name=ldap-replica2005.wikimedia.org	[production]
13:24	<urbanecm@deploy1002>	wmde-fisch and urbanecm: Backport for [[gerrit:914267\|Enable Kartographer Nearby on mobile (T333137)]], [[gerrit:914286\|Fix clearing wrong container when closing fullscreen map (T335648)]], [[gerrit:914287\|Fix clearing wrong container when closing fullscreen map (T335648)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet	[production]
13:22	<urbanecm@deploy1002>	Started scap: Backport for [[gerrit:914267\|Enable Kartographer Nearby on mobile (T333137)]], [[gerrit:914286\|Fix clearing wrong container when closing fullscreen map (T335648)]], [[gerrit:914287\|Fix clearing wrong container when closing fullscreen map (T335648)]]	[production]
13:16	<urbanecm@deploy1002>	Sync cancelled.	[production]
13:16	<urbanecm@deploy1002>	urbanecm and wmde-fisch: Backport for [[gerrit:914267\|Enable Kartographer Nearby on mobile (T333137)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet	[production]
13:07	<urbanecm@deploy1002>	Started scap: Backport for [[gerrit:914267\|Enable Kartographer Nearby on mobile (T333137)]]	[production]
13:05	<XioNoX>	rebooting asw-c-codfw for software upgrade - T334049	[production]
13:03	<ayounsi@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on 185 hosts with reason: codfw row C upgrade	[production]
13:01	<ayounsi@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on 185 hosts with reason: codfw row C upgrade	[production]
12:54	<ayounsi@cumin1001>	END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on 186 hosts with reason: codfw row C upgrade	[production]
12:54	<ayounsi@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on 186 hosts with reason: codfw row C upgrade	[production]
12:31	<eevans@cumin1001>	END (PASS) - Cookbook sre.discovery.service-route (exit_code=0) check 2 services: maintenance	[production]
12:31	<eevans@cumin1001>	START - Cookbook sre.discovery.service-route check 2 services: maintenance	[production]
12:30	<cmooney@cumin1001>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
12:29	<cmooney@cumin1001>	START - Cookbook sre.dns.netbox	[production]
12:28	<elukey@deploy1002>	helmfile [ml-staging-codfw] 'sync' command on namespace 'ores-legacy' for release 'main' .	[production]
12:24	<moritzm>	installing LInux 5.10.178 on bullseye hosts	[production]
12:20	<sukhe>	run authdns-update to depool codfwL T334049	[production]
12:17	<Amir1>	stop slave on eqiad masters of s1, x1, s8 (T334049)	[production]
12:10	<jmm@puppetmaster1001>	conftool action : set/pooled=no; selector: name=ldap-replica2005.wikimedia.org	[production]
12:05	<Amir1>	stop slave again on db1130 (eqiad master of s5) (T334049)	[production]
12:03	<jclark@cumin1001>	END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host backup1011	[production]
12:03	<jclark@cumin1001>	START - Cookbook sre.network.configure-switch-interfaces for host backup1011	[production]
11:57	<jmm@cumin2002>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host poolcounter2003.codfw.wmnet	[production]
11:53	<jmm@cumin2002>	START - Cookbook sre.hosts.reboot-single for host poolcounter2003.codfw.wmnet	[production]