production SAL

1251-1300 of 10000 results (64ms)

2022-03-14 §
22:19	<andrew@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1023.eqiad.wmnet with reason: host reimage	[production]
22:16	<andrew@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt1023.eqiad.wmnet with reason: host reimage	[production]
22:04	<andrew@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudvirt1023.eqiad.wmnet with OS bullseye	[production]
22:03	<bking@puppetmaster1001>	conftool action : set/pooled=false; selector: dnsdisc=wdqs-internal,name=eqiad	[production]
22:03	<bking@puppetmaster1001>	conftool action : set/pooled=false; selector: dnsdisc=wdqs,name=eqiad	[production]
22:03	<inflatador>	T302494 bking@puppetmaster1001 depooling eqiad in DNS-discovery for wdqs and wdqs-internal services	[production]
21:47	<andrew@cumin1001>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudvirt1025.eqiad.wmnet with OS bullseye	[production]
21:40	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: apply	[production]
21:39	<andrew@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudvirt1025.eqiad.wmnet with OS bullseye	[production]
21:39	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply	[production]
21:39	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply	[production]
21:39	<andrew@cumin1001>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudvirt1025.eqiad.wmnet with OS bullseye	[production]
21:38	<inflatador>	T302494 bking@puppetmaster1001 conftool action : set/pooled=true; selector: dnsdisc=wdqs-internal,name=codfw	[production]
21:38	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply	[production]
21:37	<bking@puppetmaster1001>	conftool action : set/pooled=true; selector: dnsdisc=wdqs,name=codfw	[production]
21:36	<inflatador>	bking@cumin pooling codfw in DNS-discovery for wdqs and wdqs-internal services	[production]
21:31	<sbassett>	Deployed security fix for T160800	[production]
21:30	<andrew@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudvirt1025.eqiad.wmnet with OS bullseye	[production]
21:07	<andrew@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1023.eqiad.wmnet with OS bullseye	[production]
20:58	<andrew@cumin1001>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudvirt1024.eqiad.wmnet with OS bullseye	[production]
20:57	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: apply	[production]
20:56	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply	[production]
20:56	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply	[production]
20:55	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply	[production]
20:54	<urbanecm>	UTC late B&C completed	[production]
20:53	<urbanecm@deploy1002>	Synchronized wmf-config/InitialiseSettings.php: bca9c94c9d0bec83cb777bc474fde564c441349c: liwiktionary: Change timezone to CET/CEST (T303734) (duration: 00m 49s)	[production]
20:45	<ebernhardson@deploy1002>	Synchronized php-1.38.0-wmf.25/extensions/CirrusSearch/profiles/SaneitizeProfiles.config.php: Backport: [[gerrit:770056\|Cut saneitizer re-indexing rate in half (T302733)]] (duration: 00m 49s)	[production]
20:38	<andrew@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1023.eqiad.wmnet with reason: host reimage	[production]
20:35	<andrew@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt1023.eqiad.wmnet with reason: host reimage	[production]
20:35	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: apply	[production]
20:34	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply	[production]
20:34	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply	[production]
20:33	<andrew@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudvirt1024.eqiad.wmnet with OS bullseye	[production]
20:33	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply	[production]
20:31	<andrew@cumin1001>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudvirt1024.eqiad.wmnet with OS bullseye	[production]
20:31	<andrew@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudvirt1024.eqiad.wmnet with OS bullseye	[production]
20:31	<andrew@cumin1001>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudvirt1024.eqiad.wmnet with OS bullseye	[production]
20:31	<andrew@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudvirt1024.eqiad.wmnet with OS bullseye	[production]
20:30	<andrew@cumin1001>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=93) for host cloudvirt1024.eqiad.wmnet with OS bullseye	[production]
20:22	<andrew@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudvirt1024.eqiad.wmnet with OS bullseye	[production]
20:22	<andrew@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudvirt1023.eqiad.wmnet with OS bullseye	[production]
19:44	<marostegui@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1139.eqiad.wmnet with reason: Maintenance	[production]
19:44	<marostegui@cumin1001>	START - Cookbook sre.hosts.downtime for 8:00:00 on db1139.eqiad.wmnet with reason: Maintenance	[production]
19:44	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1129 (T300775)', diff saved to https://phabricator.wikimedia.org/P22457 and previous config saved to /var/cache/conftool/dbconfig/20220314-194404-marostegui.json	[production]
19:29	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1129', diff saved to https://phabricator.wikimedia.org/P22456 and previous config saved to /var/cache/conftool/dbconfig/20220314-192859-marostegui.json	[production]
19:24	<andrew@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1022.eqiad.wmnet with OS bullseye	[production]
19:22	<ejegg>	updated civicrm from 252269c8 to 52c45874	[production]
19:13	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1129', diff saved to https://phabricator.wikimedia.org/P22455 and previous config saved to /var/cache/conftool/dbconfig/20220314-191354-marostegui.json	[production]
19:07	<andrew@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1022.eqiad.wmnet with reason: host reimage	[production]
19:04	<andrew@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt1022.eqiad.wmnet with reason: host reimage	[production]