production SAL

2023-11-03
11:13	<jynus@cumin1001>	START - Cookbook sre.hosts.reimage for host backup1010.eqiad.wmnet with OS bookworm	[production]
11:08	<fnegri@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudnet1006.eqiad.wmnet with OS bookworm	[production]
11:07	<jynus@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host backup1011.eqiad.wmnet with OS bookworm	[production]
10:53	<jynus@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on backup1011.eqiad.wmnet with reason: host reimage	[production]
10:50	<jynus@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on backup1011.eqiad.wmnet with reason: host reimage	[production]
10:34	<jynus@cumin1001>	START - Cookbook sre.hosts.reimage for host backup1011.eqiad.wmnet with OS bookworm	[production]
10:08	<mvernon@cumin1001>	END (PASS) - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies (exit_code=0) rolling restart_daemons on A:swift-fe	[production]
09:59	<mvernon@cumin1001>	START - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies rolling restart_daemons on A:swift-fe	[production]
09:59	<Emperor>	roll-restart swift frontends	[production]
04:01	<eileen>	civicrm upgraded from 84ec2957 to 5be02f1b	[production]
01:23	<thcipriani@deploy2002>	Finished scap: Backport for [[gerrit:971287|Disable namespaceDupes.php for now (T350443)]] (duration: 10m 29s)	[production]
01:18	<thcipriani@deploy2002>	thcipriani: Continuing with sync	[production]
01:14	<thcipriani@deploy2002>	thcipriani: Backport for [[gerrit:971287|Disable namespaceDupes.php for now (T350443)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug)	[production]
01:13	<thcipriani@deploy2002>	Started scap: Backport for [[gerrit:971287|Disable namespaceDupes.php for now (T350443)]]	[production]
2023-11-02
22:31	<Amir1>	killed update collation on s5	[production]
22:13	<brett>	import acme-chief 0.36-2 into bookworm-wikimedia repo	[production]
21:22	<inflatador>	bking@cumin2002 enabling elastic snapshots on eqiad clusters T348686	[production]
20:32	<mabualruz@deploy2002>	Finished scap: Backport for [[gerrit:971244|Enable native math rendering mode on testwiki (T311620)]] (duration: 14m 06s)	[production]
20:27	<mabualruz@deploy2002>	mabualruz and physikerwelt: Continuing with sync	[production]
20:20	<mabualruz@deploy2002>	mabualruz and physikerwelt: Backport for [[gerrit:971244|Enable native math rendering mode on testwiki (T311620)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug)	[production]
20:18	<mabualruz@deploy2002>	Started scap: Backport for [[gerrit:971244|Enable native math rendering mode on testwiki (T311620)]]	[production]
20:04	<cmooney@cumin1001>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
20:04	<cmooney@cumin1001>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update DNS entries for codfw CR IPs moved to new interfaces. - cmooney@cumin1001"	[production]
20:01	<cmooney@cumin1001>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update DNS entries for codfw CR IPs moved to new interfaces. - cmooney@cumin1001"	[production]
19:59	<cmooney@cumin1001>	START - Cookbook sre.dns.netbox	[production]
18:52	<sukhe@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host doh1001.wikimedia.org with OS bookworm	[production]
18:46	<topranks>	shutting down uplink from asw-b-codfw et-2/0/51 to cr1-codfw in advance of cable move (T347191)	[production]
18:44	<topranks>	Making cr2-codfw VRRP Master for row B traffic over new link from ssw1-a8-codfw (T347191)	[production]
18:35	<sukhe@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on doh1001.wikimedia.org with reason: host reimage	[production]
18:32	<sukhe@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on doh1001.wikimedia.org with reason: host reimage	[production]
18:22	<dduvall@deploy2002>	rebuilt and synchronized wikiversions files: group2 wikis to 1.42.0-wmf.3 refs T348356	[production]
18:22	<sukhe@cumin2002>	START - Cookbook sre.hosts.reimage for host doh1001.wikimedia.org with OS bookworm	[production]
18:21	<topranks>	Shutting asw-b-codfw uplink to cr2-codfw down in advance of cable move (T347191)	[production]
18:09	<ebernhardson@deploy2002>	helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply	[production]
18:09	<ebernhardson@deploy2002>	helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply	[production]
18:07	<topranks>	Making cr1-codfw VRRP Master for row A traffic again on ssw1-a1-codfw interface (T347191)	[production]
17:50	<topranks>	Shutting asw-a-codfw uplink to cr1-codfw down in advance of cable move (T347191)	[production]
17:45	<topranks>	Moving row A outbound traffic from direct CR link to routing via Spinie (T347191)	[production]
17:45	<fnegri@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcontrol1005.eqiad.wmnet with OS bookworm	[production]
17:42	<vgutierrez>	repool cp4051 and cp5030	[production]
17:40	<ebernhardson@deploy2002>	helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply	[production]
17:40	<ebernhardson@deploy2002>	helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply	[production]
17:23	<vgutierrez>	depool cp5030	[production]
17:19	<vgutierrez>	restart haproxy on cp4051	[production]
17:14	<bd808@deploy2002>	helmfile [eqiad] DONE helmfile.d/services/toolhub: apply	[production]
17:14	<fnegri@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcontrol1005.eqiad.wmnet with reason: host reimage	[production]
17:13	<bd808@deploy2002>	helmfile [eqiad] START helmfile.d/services/toolhub: apply	[production]
17:13	<bd808@deploy2002>	helmfile [codfw] DONE helmfile.d/services/toolhub: apply	[production]
17:12	<bd808@deploy2002>	helmfile [codfw] START helmfile.d/services/toolhub: apply	[production]
17:11	<fnegri@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcontrol1005.eqiad.wmnet with reason: host reimage	[production]