production SAL

5901-5950 of 10000 results (86ms)

2022-12-07 §
22:44	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db1201.eqiad.wmnet with reason: Maintenance	[production]
22:44	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1187 (T322618)', diff saved to https://phabricator.wikimedia.org/P42540 and previous config saved to /var/cache/conftool/dbconfig/20221207-224440-ladsgroup.json	[production]
22:41	<ryankemper>	T301167 Downtimed `wdqs20[09-12]` for 7 days	[production]
22:37	<bking@cumin2002>	END (FAIL) - Cookbook sre.wdqs.data-reload (exit_code=99)	[production]
22:36	<ryankemper@puppetmaster1001>	conftool action : set/weight=10:pooled=no; selector: name=wdqs2009.*	[production]
22:36	<ryankemper@puppetmaster1001>	conftool action : set/weight=10:pooled=no; selector: name=wdqs2010.*	[production]
22:35	<bking@cumin2002>	START - Cookbook sre.wdqs.data-reload	[production]
22:32	<bking@cumin2002>	END (FAIL) - Cookbook sre.wdqs.data-reload (exit_code=99)	[production]
22:30	<bking@cumin2002>	START - Cookbook sre.wdqs.data-reload	[production]
22:29	<bking@cumin2002>	END (FAIL) - Cookbook sre.wdqs.data-reload (exit_code=99)	[production]
22:29	<bking@cumin2002>	START - Cookbook sre.wdqs.data-reload	[production]
22:29	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1187', diff saved to https://phabricator.wikimedia.org/P42539 and previous config saved to /var/cache/conftool/dbconfig/20221207-222934-ladsgroup.json	[production]
22:29	<bking@cumin2002>	END (FAIL) - Cookbook sre.wdqs.data-reload (exit_code=99)	[production]
22:28	<bking@cumin2002>	START - Cookbook sre.wdqs.data-reload	[production]
22:26	<bking@cumin2002>	END (ERROR) - Cookbook sre.wdqs.data-transfer (exit_code=97)	[production]
22:25	<bking@cumin2002>	START - Cookbook sre.wdqs.data-transfer	[production]
22:25	<bking@cumin2002>	END (ERROR) - Cookbook sre.wdqs.data-transfer (exit_code=97)	[production]
22:23	<bking@cumin2002>	START - Cookbook sre.wdqs.data-transfer	[production]
22:14	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1187', diff saved to https://phabricator.wikimedia.org/P42538 and previous config saved to /var/cache/conftool/dbconfig/20221207-221427-ladsgroup.json	[production]
22:01	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2180 (T322618)', diff saved to https://phabricator.wikimedia.org/P42537 and previous config saved to /var/cache/conftool/dbconfig/20221207-220110-ladsgroup.json	[production]
21:59	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1187 (T322618)', diff saved to https://phabricator.wikimedia.org/P42536 and previous config saved to /var/cache/conftool/dbconfig/20221207-215921-ladsgroup.json	[production]
21:57	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db1187 (T322618)', diff saved to https://phabricator.wikimedia.org/P42535 and previous config saved to /var/cache/conftool/dbconfig/20221207-215712-ladsgroup.json	[production]
21:57	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1187.eqiad.wmnet with reason: Maintenance	[production]
21:56	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db1187.eqiad.wmnet with reason: Maintenance	[production]
21:56	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1180 (T322618)', diff saved to https://phabricator.wikimedia.org/P42534 and previous config saved to /var/cache/conftool/dbconfig/20221207-215651-ladsgroup.json	[production]
21:56	<TheresNoTime>	UTC late backport window done	[production]
21:51	<samtar@deploy1002>	backport aborted: (duration: 00m 15s)	[production]
21:49	<samtar@deploy1002>	Sync cancelled.	[production]
21:47	<sukhe>	homer "cr-eqsin" commit "running homer for Gerrit: 865773"	[production]
21:46	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2180', diff saved to https://phabricator.wikimedia.org/P42533 and previous config saved to /var/cache/conftool/dbconfig/20221207-214603-ladsgroup.json	[production]
21:44	<sukhe@cumin2002>	END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts lvs5003.eqsin.wmnet	[production]
21:44	<sukhe@cumin2002>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
21:44	<sukhe@cumin2002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: lvs5003.eqsin.wmnet decommissioned, removing all IPs except the asset tag one - sukhe@cumin2002"	[production]
21:43	<samtar@deploy1002>	samtar and stang: Backport for [[gerrit:865766\|specieswiki: Install GeoData extension (T324348)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet	[production]
21:43	<sukhe@cumin2002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: lvs5003.eqsin.wmnet decommissioned, removing all IPs except the asset tag one - sukhe@cumin2002"	[production]
21:41	<samtar@deploy1002>	Started scap: Backport for [[gerrit:865766\|specieswiki: Install GeoData extension (T324348)]]	[production]
21:41	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1180', diff saved to https://phabricator.wikimedia.org/P42532 and previous config saved to /var/cache/conftool/dbconfig/20221207-214145-ladsgroup.json	[production]
21:41	<sukhe@cumin2002>	START - Cookbook sre.dns.netbox	[production]
21:39	<samtar@deploy1002>	Finished scap: Backport for [[gerrit:865737\|Remove Research Incentive survey from frwiki (T321930)]] (duration: 09m 04s)	[production]
21:36	<sukhe@cumin2002>	START - Cookbook sre.hosts.decommission for hosts lvs5003.eqsin.wmnet	[production]
21:36	<sukhe@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on lvs5003.eqsin.wmnet with reason: downtimed, in the process of decom	[production]
21:36	<sukhe@cumin2002>	START - Cookbook sre.hosts.downtime for 4:00:00 on lvs5003.eqsin.wmnet with reason: downtimed, in the process of decom	[production]
21:34	<sukhe>	homer "cr-eqsin" commit "running homer for Gerrit: 865742"	[production]
21:32	<samtar@deploy1002>	samtar and dani: Backport for [[gerrit:865737\|Remove Research Incentive survey from frwiki (T321930)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet	[production]
21:32	<sukhe@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host lvs5006.eqsin.wmnet with OS buster	[production]
21:32	<sukhe@cumin2002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - sukhe@cumin2002"	[production]
21:30	<sukhe@cumin2002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - sukhe@cumin2002"	[production]
21:30	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2180', diff saved to https://phabricator.wikimedia.org/P42530 and previous config saved to /var/cache/conftool/dbconfig/20221207-213057-ladsgroup.json	[production]
21:30	<samtar@deploy1002>	Started scap: Backport for [[gerrit:865737\|Remove Research Incentive survey from frwiki (T321930)]]	[production]
21:28	<samtar@deploy1002>	Finished scap: Backport for [[gerrit:865070\|hewiki: enable parser cache writes for parsoid's page/html endpoint. (T322672 T320534 T320529)]], [[gerrit:865071\|Page 5% of calls to parsoid's page/html endpoint write to PC (T322672)]] (duration: 20m 35s)	[production]