production SAL

6251-6300 of 10000 results (37ms)

2021-02-04 §
07:45	<marostegui@cumin1001>	dbctl commit (dc=all): 'db1137 (re)pooling @ 50%: Repool db1137 after daemon restart', diff saved to https://phabricator.wikimedia.org/P14189 and previous config saved to /var/cache/conftool/dbconfig/20210204-074558-root.json	[production]
07:37	<marostegui@cumin1001>	dbctl commit (dc=all): 'db1173 (re)pooling @ 5%: Slowly pooling db1173 for the first time in s6', diff saved to https://phabricator.wikimedia.org/P14188 and previous config saved to /var/cache/conftool/dbconfig/20210204-073733-root.json	[production]
07:30	<marostegui@cumin1001>	dbctl commit (dc=all): 'db1137 (re)pooling @ 25%: Repool db1137 after daemon restart', diff saved to https://phabricator.wikimedia.org/P14187 and previous config saved to /var/cache/conftool/dbconfig/20210204-073054-root.json	[production]
07:22	<marostegui@cumin1001>	dbctl commit (dc=all): 'db1173 (re)pooling @ 3%: Slowly pooling db1173 for the first time in s6', diff saved to https://phabricator.wikimedia.org/P14186 and previous config saved to /var/cache/conftool/dbconfig/20210204-072229-root.json	[production]
07:16	<elukey@cumin1001>	END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) for hosts an-worker1117.eqiad.wmnet	[production]
07:15	<marostegui@cumin1001>	dbctl commit (dc=all): 'db1137 (re)pooling @ 20%: Repool db1137 after daemon restart', diff saved to https://phabricator.wikimedia.org/P14185 and previous config saved to /var/cache/conftool/dbconfig/20210204-071551-root.json	[production]
07:13	<elukey@cumin1001>	START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1117.eqiad.wmnet	[production]
07:07	<marostegui@cumin1001>	dbctl commit (dc=all): 'db1173 (re)pooling @ 2%: Slowly pooling db1173 for the first time in s6', diff saved to https://phabricator.wikimedia.org/P14184 and previous config saved to /var/cache/conftool/dbconfig/20210204-070726-root.json	[production]
07:00	<marostegui@cumin1001>	dbctl commit (dc=all): 'db1137 (re)pooling @ 10%: Repool db1137 after daemon restart', diff saved to https://phabricator.wikimedia.org/P14183 and previous config saved to /var/cache/conftool/dbconfig/20210204-070047-root.json	[production]
06:45	<marostegui@cumin1001>	dbctl commit (dc=all): 'db1137 (re)pooling @ 5%: Repool db1137 after daemon restart', diff saved to https://phabricator.wikimedia.org/P14182 and previous config saved to /var/cache/conftool/dbconfig/20210204-064544-root.json	[production]
06:42	<marostegui>	Restart mysql on db1137	[production]
06:41	<marostegui@cumin1001>	dbctl commit (dc=all): 'Depool db1137 T266483', diff saved to https://phabricator.wikimedia.org/P14181 and previous config saved to /var/cache/conftool/dbconfig/20210204-064157-marostegui.json	[production]
06:30	<marostegui@cumin1001>	dbctl commit (dc=all): 'db1173 (re)pooling @ 1%: Slowly pooling db1173 for the first time in s6', diff saved to https://phabricator.wikimedia.org/P14180 and previous config saved to /var/cache/conftool/dbconfig/20210204-063033-root.json	[production]
06:28	<marostegui@cumin1001>	dbctl commit (dc=all): 'Add db1173 to dbctl - depooled T258361', diff saved to https://phabricator.wikimedia.org/P14179 and previous config saved to /var/cache/conftool/dbconfig/20210204-062836-marostegui.json	[production]
02:02	<legoktm@deploy1001>	Synchronized logos/config.yaml: Update and recompress logos for nowiki, cawiki, fiwiki, ukwiki, cswiki, huwiki, trwiki (2/2) (duration: 01m 06s)	[production]
02:00	<legoktm@deploy1001>	Synchronized static/images/project-logos/: Update and recompress logos for nowiki, cawiki, fiwiki, ukwiki, cswiki, huwiki, trwiki (1/2) (duration: 01m 10s)	[production]
01:15	<ebernhardson@deploy1001>	Finished deploy [wikimedia/discovery/analytics@4b4872d]: transfer_to_es: Increase timeout waiting for source data to three hours (duration: 01m 16s)	[production]
01:13	<ebernhardson@deploy1001>	Started deploy [wikimedia/discovery/analytics@4b4872d]: transfer_to_es: Increase timeout waiting for source data to three hours	[production]
01:04	<dzahn@cumin1001>	conftool action : set/pooled=yes; selector: name=mw1310.eqiad.wmnet	[production]
00:55	<dzahn@cumin1001>	conftool action : set/pooled=yes; selector: name=mw1318.eqiad.wmnet	[production]
00:51	<dzahn@cumin1001>	conftool action : set/pooled=no; selector: name=mw1310.eqiad.wmnet	[production]
00:44	<dzahn@cumin1001>	conftool action : set/pooled=no; selector: name=mw1318.eqiad.wmnet	[production]
00:22	<dzahn@cumin1001>	conftool action : set/pooled=yes; selector: name=mw2279.codfw.wmnet	[production]
00:19	<dzahn@cumin1001>	conftool action : set/pooled=yes; selector: name=mw2280.codfw.wmnet	[production]
00:17	<eileen>	civicrm revision changed from dfb2ea2148 to 1e9a86dd6e, config revision is 01ea3062f4	[production]
00:12	<dzahn@cumin1001>	conftool action : set/pooled=no; selector: name=mw2279.codw.wmnet	[production]
00:11	<dzahn@cumin1001>	conftool action : set/pooled=no; selector: name=mw2280.codfw.wmnet	[production]
00:05	<dzahn@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1310.eqiad.wmnet with reason: REIMAGE	[production]
00:03	<dzahn@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on mw1310.eqiad.wmnet with reason: REIMAGE	[production]
2021-02-03 §
23:59	<dzahn@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1318.eqiad.wmnet with reason: REIMAGE	[production]
23:56	<dzahn@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on mw1318.eqiad.wmnet with reason: REIMAGE	[production]
23:50	<mutante>	installservers: replacing squid proxy logrotate cron with systemd timer	[production]
23:50	<dzahn@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2279.codfw.wmnet with reason: REIMAGE	[production]
23:48	<dzahn@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2280.codfw.wmnet with reason: REIMAGE	[production]
23:48	<dzahn@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on mw2279.codfw.wmnet with reason: REIMAGE	[production]
23:46	<dzahn@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on mw2280.codfw.wmnet with reason: REIMAGE	[production]
22:53	<razzi@cumin1001>	END (PASS) - Cookbook sre.kafka.reboot-workers (exit_code=0) for Kafka test cluster: Reboot kafka nodes - razzi@cumin1001	[production]
22:06	<crusnov@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netbox1001.wikimedia.org	[production]
21:53	<crusnov@cumin1001>	START - Cookbook sre.hosts.reboot-single for host netbox1001.wikimedia.org	[production]
21:53	<crusnov@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netboxdb1001.eqiad.wmnet	[production]
21:46	<crusnov@cumin1001>	START - Cookbook sre.hosts.reboot-single for host netboxdb1001.eqiad.wmnet	[production]
21:44	<crusnov@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netbox2001.wikimedia.org	[production]
21:40	<crusnov@cumin1001>	START - Cookbook sre.hosts.reboot-single for host netbox2001.wikimedia.org	[production]
21:39	<crusnov@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netboxdb2001.codfw.wmnet	[production]
21:34	<crusnov@cumin1001>	START - Cookbook sre.hosts.reboot-single for host netboxdb2001.codfw.wmnet	[production]
21:33	<chaomodus>	rebooting Netbox cluster	[production]
21:05	<razzi@cumin1001>	START - Cookbook sre.kafka.reboot-workers for Kafka test cluster: Reboot kafka nodes - razzi@cumin1001	[production]
20:34	<cmjohnson@cumin1001>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
20:24	<cmjohnson@cumin1001>	START - Cookbook sre.dns.netbox	[production]
20:03	<dzahn@cumin1001>	conftool action : set/pooled=yes; selector: name=mw1334.eqiad.wmnet	[production]