production SAL

2022-04-05
11:38	<mmandere@cumin1001>	START - Cookbook sre.hosts.reimage for host cp6007.drmrs.wmnet with OS buster	[production]
11:31	<mmandere>	depool cp6007 for reimage - T290005	[production]
11:25	<btullis@deploy1002>	helmfile [staging] DONE helmfile.d/services/datahub: sync on main	[production]
11:23	<mmandere@cumin1001>	START - Cookbook sre.hosts.reimage for host cp5015.eqsin.wmnet with OS buster	[production]
11:15	<mmandere>	depool cp5015 for reimage - T290005	[production]
11:13	<btullis@deploy1002>	helmfile [staging] START helmfile.d/services/datahub: apply on main	[production]
11:12	<btullis@deploy1002>	helmfile [staging] DONE helmfile.d/services/datahub: sync on main	[production]
11:10	<aborrero@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudgw1001.eqiad.wmnet	[production]
11:06	<aborrero@cumin1001>	START - Cookbook sre.hosts.reboot-single for host cloudgw1001.eqiad.wmnet	[production]
11:06	<aborrero@cumin1001>	END (ERROR) - Cookbook sre.hosts.reboot-single (exit_code=97) for host cloudgw1001.eqiad.wmnet	[production]
11:06	<aborrero@cumin1001>	START - Cookbook sre.hosts.reboot-single for host cloudgw1001.eqiad.wmnet	[production]
11:03	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db1119 (T298565)', diff saved to https://phabricator.wikimedia.org/P24111 and previous config saved to /var/cache/conftool/dbconfig/20220405-110232-ladsgroup.json	[production]
11:03	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1119.eqiad.wmnet with reason: Maintenance	[production]
11:03	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db1119.eqiad.wmnet with reason: Maintenance	[production]
11:03	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1135 (T298565)', diff saved to https://phabricator.wikimedia.org/P24110 and previous config saved to /var/cache/conftool/dbconfig/20220405-110224-ladsgroup.json	[production]
11:03	<btullis@deploy1002>	helmfile [staging] START helmfile.d/services/datahub: apply on main	[production]
10:56	<volans>	installed spicerack v2.4.0 on the cumin hosts	[production]
10:55	<aborrero@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudgw1001.eqiad.wmnet with OS bullseye	[production]
10:47	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1135', diff saved to https://phabricator.wikimedia.org/P24109 and previous config saved to /var/cache/conftool/dbconfig/20220405-104719-ladsgroup.json	[production]
10:45	<aborrero@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudgw1001.eqiad.wmnet with reason: host reimage	[production]
10:42	<aborrero@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on cloudgw1001.eqiad.wmnet with reason: host reimage	[production]
10:32	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1135', diff saved to https://phabricator.wikimedia.org/P24108 and previous config saved to /var/cache/conftool/dbconfig/20220405-103214-ladsgroup.json	[production]
10:30	<btullis@deploy1002>	helmfile [staging] DONE helmfile.d/services/datahub: sync on main	[production]
10:30	<aborrero@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudgw1001.eqiad.wmnet with OS bullseye	[production]
10:30	<aborrero@cumin1001>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cloudgw1001.eqiad.wmnet with OS bullseye	[production]
10:19	<btullis@deploy1002>	helmfile [staging] START helmfile.d/services/datahub: apply on main	[production]
10:18	<btullis@deploy1002>	helmfile [staging] DONE helmfile.d/services/datahub: sync on main	[production]
10:17	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1135 (T298565)', diff saved to https://phabricator.wikimedia.org/P24107 and previous config saved to /var/cache/conftool/dbconfig/20220405-101709-ladsgroup.json	[production]
09:49	<btullis@deploy1002>	helmfile [staging] START helmfile.d/services/datahub: apply on main	[production]
09:22	<btullis@deploy1002>	helmfile [staging] DONE helmfile.d/services/datahub: sync on main	[production]
09:22	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: apply	[production]
09:21	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply	[production]
09:21	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply	[production]
09:20	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply	[production]
09:19	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db1135 (T298565)', diff saved to https://phabricator.wikimedia.org/P24105 and previous config saved to /var/cache/conftool/dbconfig/20220405-091947-ladsgroup.json	[production]
09:19	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1135.eqiad.wmnet with reason: Maintenance	[production]
09:19	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db1135.eqiad.wmnet with reason: Maintenance	[production]
09:19	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1134 (T298565)', diff saved to https://phabricator.wikimedia.org/P24104 and previous config saved to /var/cache/conftool/dbconfig/20220405-091939-ladsgroup.json	[production]
09:12	<btullis@deploy1002>	helmfile [staging] START helmfile.d/services/datahub: apply on main	[production]
09:11	<jnuche@deploy1002>	rebuilt and synchronized wikiversions files: Revert "group0 wikis to 1.39.0-wmf.6"	[production]
09:04	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1134', diff saved to https://phabricator.wikimedia.org/P24103 and previous config saved to /var/cache/conftool/dbconfig/20220405-090434-ladsgroup.json	[production]
08:52	<btullis@deploy1002>	helmfile [staging] DONE helmfile.d/services/datahub: sync on main	[production]
08:49	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1134', diff saved to https://phabricator.wikimedia.org/P24102 and previous config saved to /var/cache/conftool/dbconfig/20220405-084928-ladsgroup.json	[production]
08:49	<aborrero@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudgw1001.eqiad.wmnet with reason: host reimage	[production]
08:46	<aborrero@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on cloudgw1001.eqiad.wmnet with reason: host reimage	[production]
08:41	<btullis@deploy1002>	helmfile [staging] START helmfile.d/services/datahub: apply on main	[production]
08:35	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: apply	[production]
08:35	<aborrero@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudgw1001.eqiad.wmnet with OS bullseye	[production]
08:34	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1134 (T298565)', diff saved to https://phabricator.wikimedia.org/P24101 and previous config saved to /var/cache/conftool/dbconfig/20220405-083423-ladsgroup.json	[production]
08:34	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply	[production]