production SAL

2501-2550 of 10000 results (54ms)

2022-02-10 §
19:12	<jgiannelos@deploy1002>	Started deploy [kartotherian/deploy@828a428] (eqiad): Configure geoshapes postgres max conns	[production]
19:11	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply on pinkunicorn	[production]
19:11	<bblack>	lvs1017 rebooting for sanity-check after prod config - T301142	[production]
19:08	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1181 (T300382)', diff saved to https://phabricator.wikimedia.org/P20576 and previous config saved to /var/cache/conftool/dbconfig/20220210-190840-marostegui.json	[production]
19:03	<otto@deploy1002>	Finished deploy [airflow-dags/research@b871faf]: (no justification provided) (duration: 00m 03s)	[production]
19:03	<otto@deploy1002>	Started deploy [airflow-dags/research@b871faf]: (no justification provided)	[production]
19:01	<otto@deploy1002>	Finished deploy [airflow-dags/research@b871faf]: (no justification provided) (duration: 00m 27s)	[production]
19:01	<otto@deploy1002>	Started deploy [airflow-dags/research@b871faf]: (no justification provided)	[production]
18:59	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1179 (T298554)', diff saved to https://phabricator.wikimedia.org/P20575 and previous config saved to /var/cache/conftool/dbconfig/20220210-185956-ladsgroup.json	[production]
18:53	<ebernhardson>	restart all mjolnir daemons on search-loader1001 and 2001 to purge old cached node lists	[production]
18:53	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1181', diff saved to https://phabricator.wikimedia.org/P20574 and previous config saved to /var/cache/conftool/dbconfig/20220210-185336-marostegui.json	[production]
18:52	<jgiannelos@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mobileapps: sync on production	[production]
18:51	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: sync on pinkunicorn	[production]
18:50	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply on pinkunicorn	[production]
18:49	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: sync on pinkunicorn	[production]
18:49	<jgiannelos@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mobileapps: apply on staging	[production]
18:49	<jgiannelos@deploy1002>	helmfile [eqiad] START helmfile.d/services/mobileapps: apply on production	[production]
18:49	<jgiannelos@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mobileapps: sync on production	[production]
18:49	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply on pinkunicorn	[production]
18:46	<jgiannelos@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mobileapps: apply on staging	[production]
18:46	<jgiannelos@deploy1002>	helmfile [codfw] START helmfile.d/services/mobileapps: apply on production	[production]
18:45	<jgiannelos@deploy1002>	helmfile [staging] DONE helmfile.d/services/mobileapps: sync on staging	[production]
18:45	<cmjohnson@cumin1001>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host restbase1031.eqiad.wmnet with OS buster	[production]
18:45	<cmjohnson@cumin1001>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host restbase1032.eqiad.wmnet with OS buster	[production]
18:45	<jgiannelos@deploy1002>	helmfile [staging] DONE helmfile.d/services/mobileapps: apply on production	[production]
18:45	<cmjohnson@cumin1001>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host restbase1033.eqiad.wmnet with OS buster	[production]
18:45	<jgiannelos@deploy1002>	helmfile [staging] START helmfile.d/services/mobileapps: apply on staging	[production]
18:44	<jgiannelos@deploy1002>	helmfile [staging] START helmfile.d/services/mobileapps: apply on staging	[production]
18:43	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: sync on pinkunicorn	[production]
18:43	<bblack>	lvs1013 - stopping puppet+pybal for move to lvs1017, high-traffic1 traffic fails over to lvs1020 for now - T301142	[production]
18:42	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply on pinkunicorn	[production]
18:42	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: sync on pinkunicorn	[production]
18:42	<ladsgroup@deploy1002>	Synchronized php-1.38.0-wmf.21/includes/content/ContentHandler.php: Backport: [[gerrit:761419\|ContentHandler: Avoding saving in ParserCache in search index jobs (T285993)]] (duration: 00m 50s)	[production]
18:41	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply on pinkunicorn	[production]
18:40	<ladsgroup@deploy1002>	Synchronized php-1.38.0-wmf.20/includes/content/ContentHandler.php: Backport: [[gerrit:761420\|ContentHandler: Avoding saving in ParserCache in search index jobs (T285993)]] (duration: 00m 50s)	[production]
18:40	<marostegui@cumin1001>	dbctl commit (dc=all): 'Depooling db1096:3315 (T300775)', diff saved to https://phabricator.wikimedia.org/P20573 and previous config saved to /var/cache/conftool/dbconfig/20220210-184012-marostegui.json	[production]
18:40	<marostegui@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1096.eqiad.wmnet with reason: Maintenance	[production]
18:40	<marostegui@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1096.eqiad.wmnet with reason: Maintenance	[production]
18:40	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1110 (T300775)', diff saved to https://phabricator.wikimedia.org/P20572 and previous config saved to /var/cache/conftool/dbconfig/20220210-184004-marostegui.json	[production]
18:38	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1181', diff saved to https://phabricator.wikimedia.org/P20571 and previous config saved to /var/cache/conftool/dbconfig/20220210-183831-marostegui.json	[production]
18:31	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2088:3312 (T300510)', diff saved to https://phabricator.wikimedia.org/P20570 and previous config saved to /var/cache/conftool/dbconfig/20220210-183107-ladsgroup.json	[production]
18:30	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db1179 (T298554)', diff saved to https://phabricator.wikimedia.org/P20569 and previous config saved to /var/cache/conftool/dbconfig/20220210-182959-ladsgroup.json	[production]
18:29	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1179.eqiad.wmnet with reason: Maintenance	[production]
18:29	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db1179.eqiad.wmnet with reason: Maintenance	[production]
18:29	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1175 (T298554)', diff saved to https://phabricator.wikimedia.org/P20568 and previous config saved to /var/cache/conftool/dbconfig/20220210-182952-ladsgroup.json	[production]
18:29	<bblack@cumin1001>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
18:28	<jgiannelos@deploy1002>	Finished deploy [kartotherian/deploy@a5be8ac] (eqiad): Remove references to cassandra `storage_id` (duration: 01m 01s)	[production]
18:27	<jgiannelos@deploy1002>	Started deploy [kartotherian/deploy@a5be8ac] (eqiad): Remove references to cassandra `storage_id`	[production]
18:26	<bblack@cumin1001>	START - Cookbook sre.dns.netbox	[production]
18:25	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2088:3311 (T300510)', diff saved to https://phabricator.wikimedia.org/P20567 and previous config saved to /var/cache/conftool/dbconfig/20220210-182547-ladsgroup.json	[production]