production SAL

1951-2000 of 10000 results (52ms)

2022-04-05 §
10:55	<aborrero@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudgw1001.eqiad.wmnet with OS bullseye	[production]
10:47	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1135', diff saved to https://phabricator.wikimedia.org/P24109 and previous config saved to /var/cache/conftool/dbconfig/20220405-104719-ladsgroup.json	[production]
10:45	<aborrero@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudgw1001.eqiad.wmnet with reason: host reimage	[production]
10:42	<aborrero@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on cloudgw1001.eqiad.wmnet with reason: host reimage	[production]
10:32	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1135', diff saved to https://phabricator.wikimedia.org/P24108 and previous config saved to /var/cache/conftool/dbconfig/20220405-103214-ladsgroup.json	[production]
10:30	<btullis@deploy1002>	helmfile [staging] DONE helmfile.d/services/datahub: sync on main	[production]
10:30	<aborrero@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudgw1001.eqiad.wmnet with OS bullseye	[production]
10:30	<aborrero@cumin1001>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cloudgw1001.eqiad.wmnet with OS bullseye	[production]
10:19	<btullis@deploy1002>	helmfile [staging] START helmfile.d/services/datahub: apply on main	[production]
10:18	<btullis@deploy1002>	helmfile [staging] DONE helmfile.d/services/datahub: sync on main	[production]
10:17	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1135 (T298565)', diff saved to https://phabricator.wikimedia.org/P24107 and previous config saved to /var/cache/conftool/dbconfig/20220405-101709-ladsgroup.json	[production]
09:49	<btullis@deploy1002>	helmfile [staging] START helmfile.d/services/datahub: apply on main	[production]
09:22	<btullis@deploy1002>	helmfile [staging] DONE helmfile.d/services/datahub: sync on main	[production]
09:22	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: apply	[production]
09:21	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply	[production]
09:21	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply	[production]
09:20	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply	[production]
09:19	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db1135 (T298565)', diff saved to https://phabricator.wikimedia.org/P24105 and previous config saved to /var/cache/conftool/dbconfig/20220405-091947-ladsgroup.json	[production]
09:19	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1135.eqiad.wmnet with reason: Maintenance	[production]
09:19	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db1135.eqiad.wmnet with reason: Maintenance	[production]
09:19	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1134 (T298565)', diff saved to https://phabricator.wikimedia.org/P24104 and previous config saved to /var/cache/conftool/dbconfig/20220405-091939-ladsgroup.json	[production]
09:12	<btullis@deploy1002>	helmfile [staging] START helmfile.d/services/datahub: apply on main	[production]
09:11	<jnuche@deploy1002>	rebuilt and synchronized wikiversions files: Revert "group0 wikis to 1.39.0-wmf.6"	[production]
09:04	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1134', diff saved to https://phabricator.wikimedia.org/P24103 and previous config saved to /var/cache/conftool/dbconfig/20220405-090434-ladsgroup.json	[production]
08:52	<btullis@deploy1002>	helmfile [staging] DONE helmfile.d/services/datahub: sync on main	[production]
08:49	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1134', diff saved to https://phabricator.wikimedia.org/P24102 and previous config saved to /var/cache/conftool/dbconfig/20220405-084928-ladsgroup.json	[production]
08:49	<aborrero@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudgw1001.eqiad.wmnet with reason: host reimage	[production]
08:46	<aborrero@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on cloudgw1001.eqiad.wmnet with reason: host reimage	[production]
08:41	<btullis@deploy1002>	helmfile [staging] START helmfile.d/services/datahub: apply on main	[production]
08:35	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: apply	[production]
08:35	<aborrero@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudgw1001.eqiad.wmnet with OS bullseye	[production]
08:34	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1134 (T298565)', diff saved to https://phabricator.wikimedia.org/P24101 and previous config saved to /var/cache/conftool/dbconfig/20220405-083423-ladsgroup.json	[production]
08:34	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply	[production]
08:34	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply	[production]
08:33	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply	[production]
08:31	<jnuche@deploy1002>	rebuilt and synchronized wikiversions files: group0 wikis to 1.39.0-wmf.6 refs T305212	[production]
08:28	<aborrero@cumin1001>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudgw1001.eqiad.wmnet with OS bullseye	[production]
08:26	<jayme@cumin1001>	END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=1) for host dragonfly-supernode2001.codfw.wmnet	[production]
08:23	<aborrero@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudgw1001.eqiad.wmnet with OS bullseye	[production]
08:21	<jnuche@deploy1002>	Finished scap: testwikis wikis to 1.39.0-wmf.6 refs T305212 (duration: 42m 53s)	[production]
08:19	<jayme@cumin1001>	START - Cookbook sre.hosts.reboot-single for host dragonfly-supernode2001.codfw.wmnet	[production]
08:13	<jayme@deploy1002>	helmfile [eqiad] DONE helmfile.d/admin 'apply'.	[production]
08:13	<jayme@deploy1002>	helmfile [eqiad] START helmfile.d/admin 'apply'.	[production]
08:12	<jayme@deploy1002>	helmfile [codfw] DONE helmfile.d/admin 'apply'.	[production]
08:12	<jayme@deploy1002>	helmfile [codfw] START helmfile.d/admin 'apply'.	[production]
07:52	<XioNoX>	disable BGP to Tata in drmrs for circuit move - T298208	[production]
07:47	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: apply	[production]
07:46	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply	[production]
07:46	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply	[production]
07:45	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply	[production]