production SAL

1501-1550 of 10000 results (101ms)

2024-08-26 §
13:34	<sukhe@cumin1002>	START - Cookbook sre.cdn.roll-upgrade-ats Rolling upgrade/restart of Apache Traffic Server on A:cp-eqsin and A:cp for 9.2.5-1wm2	[production]
13:34	<ayounsi@cumin1002>	END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) rpki2003.codfw.wmnet on all recursors	[production]
13:34	<ayounsi@cumin1002>	START - Cookbook sre.dns.wipe-cache rpki2003.codfw.wmnet on all recursors	[production]
13:34	<ayounsi@cumin1002>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
13:34	<ayounsi@cumin1002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM rpki2003.codfw.wmnet - ayounsi@cumin1002"	[production]
13:34	<ayounsi@cumin1002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM rpki2003.codfw.wmnet - ayounsi@cumin1002"	[production]
13:30	<ayounsi@cumin1002>	START - Cookbook sre.dns.netbox	[production]
13:30	<ayounsi@cumin1002>	START - Cookbook sre.ganeti.makevm for new host rpki2003.codfw.wmnet	[production]
13:29	<ayounsi@cumin1002>	END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) for new host rpki2003.codfw.wmnet	[production]
13:29	<ayounsi@cumin1002>	START - Cookbook sre.ganeti.makevm for new host rpki2003.codfw.wmnet	[production]
13:28	<urbanecm@deploy1003>	hnowlan, urbanecm, ihurbain: Continuing with sync	[production]
13:27	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Depooling db1190 (T371742)', diff saved to https://phabricator.wikimedia.org/P67782 and previous config saved to /var/cache/conftool/dbconfig/20240826-132738-ladsgroup.json	[production]
13:27	<ladsgroup@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1190.eqiad.wmnet with reason: Maintenance	[production]
13:27	<ladsgroup@cumin1002>	START - Cookbook sre.hosts.downtime for 12:00:00 on db1190.eqiad.wmnet with reason: Maintenance	[production]
13:24	<urbanecm@deploy1003>	hnowlan, urbanecm, ihurbain: Backport for [[gerrit:1064795\|Rollout Parsoid Kartographer support on all wikis (T342871)]], [[gerrit:1059394\|scripts: add script for running jobs from stdin rather than http (T369048)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug)	[production]
13:20	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1173', diff saved to https://phabricator.wikimedia.org/P67781 and previous config saved to /var/cache/conftool/dbconfig/20240826-132016-ladsgroup.json	[production]
13:09	<urbanecm@deploy1003>	Started scap sync-world: Backport for [[gerrit:1064795\|Rollout Parsoid Kartographer support on all wikis (T342871)]], [[gerrit:1059394\|scripts: add script for running jobs from stdin rather than http (T369048)]]	[production]
13:07	<hnowlan@deploy1003>	helmfile [eqiad] DONE helmfile.d/services/shellbox-video: apply	[production]
13:06	<hnowlan@deploy1003>	helmfile [eqiad] START helmfile.d/services/shellbox-video: apply	[production]
13:06	<hnowlan@deploy1003>	helmfile [eqiad] DONE helmfile.d/admin 'apply'.	[production]
13:05	<hnowlan@deploy1003>	helmfile [eqiad] START helmfile.d/admin 'apply'.	[production]
13:05	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1173 (T370903)', diff saved to https://phabricator.wikimedia.org/P67780 and previous config saved to /var/cache/conftool/dbconfig/20240826-130510-ladsgroup.json	[production]
13:04	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Depooling db1173 (T370903)', diff saved to https://phabricator.wikimedia.org/P67779 and previous config saved to /var/cache/conftool/dbconfig/20240826-130401-ladsgroup.json	[production]
13:03	<ladsgroup@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1173.eqiad.wmnet with reason: Maintenance	[production]
13:03	<ladsgroup@cumin1002>	START - Cookbook sre.hosts.downtime for 8:00:00 on db1173.eqiad.wmnet with reason: Maintenance	[production]
13:03	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1168 (T370903)', diff saved to https://phabricator.wikimedia.org/P67778 and previous config saved to /var/cache/conftool/dbconfig/20240826-130350-ladsgroup.json	[production]
12:48	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1168', diff saved to https://phabricator.wikimedia.org/P67777 and previous config saved to /var/cache/conftool/dbconfig/20240826-124843-ladsgroup.json	[production]
12:33	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1168', diff saved to https://phabricator.wikimedia.org/P67776 and previous config saved to /var/cache/conftool/dbconfig/20240826-123336-ladsgroup.json	[production]
12:32	<arnaudb@cumin1002>	dbctl commit (dc=all): 'Weight db2214 T373174', diff saved to https://phabricator.wikimedia.org/P67775 and previous config saved to /var/cache/conftool/dbconfig/20240826-123205-arnaudb.json	[production]
12:29	<arnaudb@cumin1002>	dbctl commit (dc=all): 'Promote db2129 to s6 primary T373174', diff saved to https://phabricator.wikimedia.org/P67774 and previous config saved to /var/cache/conftool/dbconfig/20240826-122925-arnaudb.json	[production]
12:28	<arnaudb>	Starting s6 codfw failover from db2214 to db2129 - T373174	[production]
12:25	<marostegui@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 6:00:00 on db1125.eqiad.wmnet with reason: Testing	[production]
12:25	<marostegui@cumin1002>	START - Cookbook sre.hosts.downtime for 1 day, 6:00:00 on db1125.eqiad.wmnet with reason: Testing	[production]
12:21	<godog>	move to /root unused and about to expire cert on puppetmaster1001:/var/lib/puppet/server/ssl/ca/signed/webperf.discovery.wmnet.pem	[production]
12:18	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1168 (T370903)', diff saved to https://phabricator.wikimedia.org/P67773 and previous config saved to /var/cache/conftool/dbconfig/20240826-121828-ladsgroup.json	[production]
12:18	<ayounsi@cumin1002>	END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 268434	[production]
12:17	<ayounsi@cumin1002>	START - Cookbook sre.network.peering with action 'email' for AS: 268434	[production]
12:17	<ayounsi@cumin1002>	END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 263903	[production]
12:17	<ayounsi@cumin1002>	START - Cookbook sre.network.peering with action 'email' for AS: 263903	[production]
12:17	<ayounsi@cumin1002>	END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 61754	[production]
12:17	<ayounsi@cumin1002>	START - Cookbook sre.network.peering with action 'email' for AS: 61754	[production]
12:16	<ayounsi@cumin1002>	END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 269115	[production]
12:16	<ayounsi@cumin1002>	START - Cookbook sre.network.peering with action 'email' for AS: 269115	[production]
12:16	<ayounsi@cumin1002>	END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 274607	[production]
12:16	<ayounsi@cumin1002>	START - Cookbook sre.network.peering with action 'email' for AS: 274607	[production]
12:14	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Depooling db1168 (T370903)', diff saved to https://phabricator.wikimedia.org/P67772 and previous config saved to /var/cache/conftool/dbconfig/20240826-121419-ladsgroup.json	[production]
12:14	<ladsgroup@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1168.eqiad.wmnet with reason: Maintenance	[production]
12:14	<ladsgroup@cumin1002>	START - Cookbook sre.hosts.downtime for 8:00:00 on db1168.eqiad.wmnet with reason: Maintenance	[production]
12:14	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1165 (T370903)', diff saved to https://phabricator.wikimedia.org/P67771 and previous config saved to /var/cache/conftool/dbconfig/20240826-121408-ladsgroup.json	[production]
12:12	<ayounsi@cumin1002>	END (PASS) - Cookbook sre.netbox.update-extras (exit_code=0) rolling restart_daemons on A:netbox	[production]