production SAL

201-250 of 10000 results (75ms)

2023-07-12 §
16:47	<sukhe@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on durum6001.drmrs.wmnet with reason: host reimage	[production]
16:42	<btullis@puppetmaster1001>	conftool action : set/pooled=no; selector: service=wikireplicas-a,name=dbproxy1018.eqiad.wmnet	[production]
16:42	<btullis@puppetmaster1001>	conftool action : set/pooled=yes; selector: service=wikireplicas-a,name=dbproxy1019.eqiad.wmnet	[production]
16:41	<btullis@puppetmaster1001>	conftool action : set/pooled=no; selector: service=wikireplicas-b,name=dbproxy1018.eqiad.wmnet	[production]
16:41	<btullis@puppetmaster1001>	conftool action : set/pooled=yes; selector: service=wikireplicas-b,name=dbproxy1019.eqiad.wmnet	[production]
16:40	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts dbproxy1013.eqiad.wmnet	[production]
16:40	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
16:40	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: dbproxy1013.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - ladsgroup@cumin1001"	[production]
16:38	<ladsgroup@cumin1001>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: dbproxy1013.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - ladsgroup@cumin1001"	[production]
16:37	<btullis@puppetmaster1001>	conftool action : set/pooled=no; selector: service=wikireplicas-b,name=dbproxy1019.eqiad.wmnet	[production]
16:37	<btullis@puppetmaster1001>	conftool action : set/pooled=yes; selector: service=wikireplicas-b,name=dbproxy1018.eqiad.wmnet	[production]
16:35	<ladsgroup@cumin1001>	START - Cookbook sre.dns.netbox	[production]
16:32	<jnuche@deploy1002>	Finished deploy [releng/jenkins-deploy@a0e00cb] (releasing): (no justification provided) (duration: 00m 58s)	[production]
16:31	<jnuche@deploy1002>	Started deploy [releng/jenkins-deploy@a0e00cb] (releasing): (no justification provided)	[production]
16:30	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.decommission for hosts dbproxy1013.eqiad.wmnet	[production]
16:21	<sukhe@cumin2002>	START - Cookbook sre.hosts.reimage for host durum6001.drmrs.wmnet with OS bullseye	[production]
16:21	<sukhe@cumin2002>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host durum6001.drmrs.wmnet with OS bookworm	[production]
16:01	<sukhe@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on durum6001.drmrs.wmnet with reason: host reimage	[production]
15:57	<sukhe@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on durum6001.drmrs.wmnet with reason: host reimage	[production]
15:43	<jiji@deploy1002>	helmfile [staging] DONE helmfile.d/services/thumbor: apply	[production]
15:43	<jiji@deploy1002>	helmfile [staging] START helmfile.d/services/thumbor: apply	[production]
15:42	<tchin@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mw-page-content-change-enrich: apply	[production]
15:42	<tchin@deploy1002>	helmfile [eqiad] START helmfile.d/services/mw-page-content-change-enrich: apply	[production]
15:35	<tchin@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mw-page-content-change-enrich: apply	[production]
15:34	<tchin@deploy1002>	helmfile [codfw] START helmfile.d/services/mw-page-content-change-enrich: apply	[production]
15:11	<btullis@cumin1001>	END (PASS) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=0) for Zookeeper A:zookeeper-druid-analytics cluster: Roll restart of jvm daemons.	[production]
15:08	<bking@cumin1001>	END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0)	[production]
15:07	<sukhe@cumin2002>	START - Cookbook sre.hosts.reimage for host durum6001.drmrs.wmnet with OS bookworm	[production]
15:07	<jiji@deploy1002>	helmfile [staging] DONE helmfile.d/services/thumbor: apply	[production]
15:05	<btullis@cumin1001>	START - Cookbook sre.zookeeper.roll-restart-zookeeper for Zookeeper A:zookeeper-druid-analytics cluster: Roll restart of jvm daemons.	[production]
15:03	<jiji@deploy1002>	helmfile [staging] START helmfile.d/services/thumbor: apply	[production]
14:49	<lucaswerkmeister-wmde@deploy1002>	Finished scap: Backport for [[gerrit:937471\|Temporarily allow OAuth on non-API entry points again (T341656)]] (duration: 08m 03s)	[production]
14:48	<sukhe>	upgrade dns2004 to gdnsd 3.99.0~alpha2	[production]
14:42	<lucaswerkmeister-wmde@deploy1002>	tgr and lucaswerkmeister-wmde: Backport for [[gerrit:937471\|Temporarily allow OAuth on non-API entry points again (T341656)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet	[production]
14:41	<lucaswerkmeister-wmde@deploy1002>	Started scap: Backport for [[gerrit:937471\|Temporarily allow OAuth on non-API entry points again (T341656)]]	[production]
14:17	<btullis@cumin1001>	END (PASS) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=0) for Zookeeper A:zookeeper-druid-public cluster: Roll restart of jvm daemons.	[production]
14:11	<btullis@cumin1001>	START - Cookbook sre.zookeeper.roll-restart-zookeeper for Zookeeper A:zookeeper-druid-public cluster: Roll restart of jvm daemons.	[production]
14:07	<sukhe>	dns4003: upgrade to pdns-rec 4.8.4: T341611	[production]
13:59	<jiji@deploy1002>	helmfile [staging] DONE helmfile.d/services/thumbor: apply	[production]
13:57	<jiji@deploy1002>	helmfile [staging] START helmfile.d/services/thumbor: apply	[production]
13:56	<jiji@deploy1002>	helmfile [staging] DONE helmfile.d/services/thumbor: apply	[production]
13:56	<jiji@deploy1002>	helmfile [staging] START helmfile.d/services/thumbor: apply	[production]
13:46	<sukhe>	doh6001: upgrade to pdns-rec 4.8.4: T341611	[production]
13:44	<jiji@deploy1002>	helmfile [staging] DONE helmfile.d/services/thumbor: apply	[production]
13:43	<jiji@deploy1002>	helmfile [staging] START helmfile.d/services/thumbor: apply	[production]
13:42	<sukhe>	reprepro -C main include bullseye-wikimedia pdns-recursor_4.8.4-1+wmf11u1_amd64.changes: T341611	[production]
13:40	<Lucas_WMDE>	UTC afternoon backport+config window done	[production]
13:37	<lucaswerkmeister-wmde@deploy1002>	Finished scap: Backport for [[gerrit:937123\|Add new campaign_events.event_answers_status column (T341142)]] (duration: 07m 59s)	[production]
13:34	<bking@cumin1001>	START - Cookbook sre.wdqs.data-transfer	[production]
13:31	<btullis@cumin1001>	END (PASS) - Cookbook sre.druid.roll-restart-workers (exit_code=0) for Druid analytics cluster: Roll restart of Druid jvm daemons.	[production]