production SAL

2951-3000 of 10000 results (57ms)

2022-08-03 §
16:08	<mvernon@cumin1001>	END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for aqs[2005-2008].codfw.wmnet	[production]
16:08	<mvernon@cumin1001>	START - Cookbook sre.hosts.remove-downtime for aqs[2005-2008].codfw.wmnet	[production]
15:59	<Emperor>	shutdown ms-be20[33,47],thanos-be2002 prior to PDU work T310070	[production]
15:58	<mvernon@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on ms-be[2033,2047].codfw.wmnet,thanos-be2002.codfw.wmnet with reason: PDU work	[production]
15:58	<mvernon@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on ms-be[2033,2047].codfw.wmnet,thanos-be2002.codfw.wmnet with reason: PDU work	[production]
15:52	<jelto>	pooling mw2259-2270 again	[production]
15:45	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1172 (T312972)', diff saved to https://phabricator.wikimedia.org/P32242 and previous config saved to /var/cache/conftool/dbconfig/20220803-154515-marostegui.json	[production]
15:38	<vgutierrez>	clearing ats-be cache on cp6008 - T309651	[production]
15:38	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: apply	[production]
15:38	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply	[production]
15:37	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply	[production]
15:37	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply	[production]
15:36	<elukey>	powercycle kafka-logging2003 - not responsive to serial console	[production]
15:36	<urbanecm@deploy1002>	Synchronized php-1.39.0-wmf.22/extensions/GrowthExperiments/includes/NewcomerTasks/AddImage/ServiceImageRecommendationProvider.php: 4438957e78e0012aff646e52dc16a4fb796cfd6b: ServiceImageRecommendationProvider: Add extra logging when no JSON response received (T313973) (duration: 03m 04s)	[production]
15:35	<hnowlan@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on maps2009.codfw.wmnet with reason: PDU maintenance	[production]
15:35	<hnowlan@cumin1001>	START - Cookbook sre.hosts.downtime for 3:00:00 on maps2009.codfw.wmnet with reason: PDU maintenance	[production]
15:34	<hnowlan@puppetmaster1001>	conftool action : set/pooled=no; selector: name=maps2009.codfw.wmnet	[production]
15:32	<hnowlan@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on restbase2024.codfw.wmnet with reason: PDU maintenance	[production]
15:32	<hnowlan@cumin1001>	START - Cookbook sre.hosts.downtime for 12:00:00 on restbase2024.codfw.wmnet with reason: PDU maintenance	[production]
15:32	<hnowlan@puppetmaster1001>	conftool action : set/pooled=no; selector: name=restbase2024.codfw.wmnet	[production]
15:30	<vgutierrez>	clearing ats-be cache on cp6016 - T309651	[production]
15:30	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1172', diff saved to https://phabricator.wikimedia.org/P32241 and previous config saved to /var/cache/conftool/dbconfig/20220803-153009-marostegui.json	[production]
15:24	<jayme@cumin1001>	END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) _etcd._tcp.eqsin.wmnet on all recursors	[production]
15:24	<jayme@cumin1001>	START - Cookbook sre.dns.wipe-cache _etcd._tcp.eqsin.wmnet on all recursors	[production]
15:24	<jayme@cumin1001>	END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) _etcd._tcp.ulsfo.wmnet on all recursors	[production]
15:24	<jayme@cumin1001>	START - Cookbook sre.dns.wipe-cache _etcd._tcp.ulsfo.wmnet on all recursors	[production]
15:24	<jayme@cumin1001>	END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) _etcd._tcp.codfw.wmnet on all recursors	[production]
15:24	<jayme@cumin1001>	START - Cookbook sre.dns.wipe-cache _etcd._tcp.codfw.wmnet on all recursors	[production]
15:21	<hnowlan@puppetmaster1001>	conftool action : set/pooled=yes; selector: name=restbase2021.codfw.wmnet	[production]
15:19	<bking@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on elastic2030.codfw.wmnet with reason: T310070	[production]
15:19	<bking@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on elastic2030.codfw.wmnet with reason: T310070	[production]
15:15	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1172', diff saved to https://phabricator.wikimedia.org/P32240 and previous config saved to /var/cache/conftool/dbconfig/20220803-151502-marostegui.json	[production]
15:10	<jayme@cumin1001>	END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for conf2004.codfw.wmnet	[production]
15:10	<jayme@cumin1001>	START - Cookbook sre.hosts.remove-downtime for conf2004.codfw.wmnet	[production]
15:04	<jelto>	power off mc2023	[production]
14:59	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1172 (T312972)', diff saved to https://phabricator.wikimedia.org/P32239 and previous config saved to /var/cache/conftool/dbconfig/20220803-145956-marostegui.json	[production]
14:59	<jayme@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on mc2023.codfw.wmnet with reason: PDU swap	[production]
14:59	<jayme@cumin1001>	START - Cookbook sre.hosts.downtime for 0:30:00 on mc2023.codfw.wmnet with reason: PDU swap	[production]
14:58	<marostegui@cumin1001>	dbctl commit (dc=all): 'Depooling db1172 (T312972)', diff saved to https://phabricator.wikimedia.org/P32238 and previous config saved to /var/cache/conftool/dbconfig/20220803-145849-marostegui.json	[production]
14:58	<marostegui@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1172.eqiad.wmnet with reason: Maintenance	[production]
14:58	<marostegui@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db1172.eqiad.wmnet with reason: Maintenance	[production]
14:58	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1109 (T312972)', diff saved to https://phabricator.wikimedia.org/P32237 and previous config saved to /var/cache/conftool/dbconfig/20220803-145828-marostegui.json	[production]
14:56	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: apply	[production]
14:56	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply	[production]
14:56	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply	[production]
14:53	<dancy@deploy1002>	Pruned MediaWiki: 1.39.0-wmf.19 (duration: 05m 37s)	[production]
14:51	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply	[production]
14:47	<dancy@deploy1002>	Pruned MediaWiki: 1.39.0-wmf.21 (duration: 06m 13s)	[production]
14:46	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: apply	[production]
14:46	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply	[production]