production SAL

101-150 of 10000 results (80ms)

2023-03-08 §
18:13	<bking@cumin2002>	END (ERROR) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=97) generate netbox hiera data: "update locatoin of elastic1065 - bking@cumin2002 - T322082"	[production]
18:13	<bking@cumin2002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "update locatoin of elastic1065 - bking@cumin2002 - T322082"	[production]
18:13	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2176', diff saved to https://phabricator.wikimedia.org/P45565 and previous config saved to /var/cache/conftool/dbconfig/20230308-181316-marostegui.json	[production]
18:12	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2179', diff saved to https://phabricator.wikimedia.org/P45564 and previous config saved to /var/cache/conftool/dbconfig/20230308-181220-marostegui.json	[production]
18:12	<bking@cumin2002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "update locatoin of elastic1064 - bking@cumin2002 - T322082"	[production]
18:11	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2154', diff saved to https://phabricator.wikimedia.org/P45563 and previous config saved to /var/cache/conftool/dbconfig/20230308-181131-marostegui.json	[production]
18:09	<hnowlan@deploy2002>	helmfile [staging] START helmfile.d/services/thumbor: apply	[production]
18:09	<hnowlan@deploy2002>	helmfile [staging] DONE helmfile.d/services/thumbor: apply	[production]
18:09	<hnowlan@deploy2002>	helmfile [staging] START helmfile.d/services/thumbor: apply	[production]
18:05	<bking@cumin2002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "update locatoin of elastic1064 - bking@cumin2002 - T322082"	[production]
18:05	<bking@cumin2002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "update location of elastic1066 - bking@cumin2002 - T322082"	[production]
18:04	<bking@cumin2002>	END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host elastic1064.mgmt.eqiad.wmnet with reboot policy GRACEFUL	[production]
18:02	<hnowlan@deploy2002>	helmfile [staging] DONE helmfile.d/services/thumbor: apply	[production]
18:02	<hnowlan@deploy2002>	helmfile [staging] START helmfile.d/services/thumbor: apply	[production]
18:02	<bking@cumin2002>	END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host elastic1065.mgmt.eqiad.wmnet with reboot policy GRACEFUL	[production]
18:02	<hnowlan@deploy2002>	helmfile [staging] DONE helmfile.d/services/thumbor: apply	[production]
18:02	<hnowlan@deploy2002>	helmfile [staging] START helmfile.d/services/thumbor: apply	[production]
18:00	<hnowlan@deploy2002>	helmfile [staging] DONE helmfile.d/services/thumbor: sync	[production]
18:00	<hnowlan@deploy2002>	helmfile [staging] START helmfile.d/services/thumbor: sync	[production]
18:00	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1109', diff saved to https://phabricator.wikimedia.org/P45562 and previous config saved to /var/cache/conftool/dbconfig/20230308-180008-ladsgroup.json	[production]
17:59	<hnowlan@deploy2002>	helmfile [staging] DONE helmfile.d/services/thumbor: sync	[production]
17:59	<hnowlan@deploy2002>	helmfile [staging] START helmfile.d/services/thumbor: sync	[production]
17:59	<bking@cumin2002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "update location of elastic1066 - bking@cumin2002 - T322082"	[production]
17:59	<brett@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on acmechief-test2001.codfw.wmnet with reason: host reimage	[production]
17:58	<hnowlan@deploy2002>	helmfile [staging] DONE helmfile.d/services/thumbor: apply	[production]
17:58	<hnowlan@deploy2002>	helmfile [staging] START helmfile.d/services/thumbor: apply	[production]
17:58	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2176 (T328817)', diff saved to https://phabricator.wikimedia.org/P45561 and previous config saved to /var/cache/conftool/dbconfig/20230308-175810-marostegui.json	[production]
17:58	<herron>	failing grafana over from codfw to eqiad	[production]
17:57	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2179', diff saved to https://phabricator.wikimedia.org/P45560 and previous config saved to /var/cache/conftool/dbconfig/20230308-175714-marostegui.json	[production]
17:56	<brett@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on acmechief-test2001.codfw.wmnet with reason: host reimage	[production]
17:56	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2154 (T329260)', diff saved to https://phabricator.wikimedia.org/P45559 and previous config saved to /var/cache/conftool/dbconfig/20230308-175625-marostegui.json	[production]
17:52	<bking@cumin2002>	END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host elastic1066.mgmt.eqiad.wmnet with reboot policy GRACEFUL	[production]
17:51	<btullis@deploy2002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'.	[production]
17:51	<btullis@deploy2002>	helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'.	[production]
17:50	<btullis@deploy2002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'.	[production]
17:50	<btullis@deploy2002>	helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'.	[production]
17:48	<bking@cumin2002>	START - Cookbook sre.hosts.provision for host elastic1066.mgmt.eqiad.wmnet with reboot policy GRACEFUL	[production]
17:47	<bking@cumin2002>	START - Cookbook sre.hosts.provision for host elastic1064.mgmt.eqiad.wmnet with reboot policy GRACEFUL	[production]
17:47	<brett@cumin2002>	START - Cookbook sre.ganeti.reimage for host acmechief-test2001.codfw.wmnet with OS bullseye	[production]
17:46	<bking@cumin2002>	END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['elastic1064.eqiad.wmnet']	[production]
17:45	<marostegui@cumin1001>	dbctl commit (dc=all): 'Depooling db2176 (T328817)', diff saved to https://phabricator.wikimedia.org/P45558 and previous config saved to /var/cache/conftool/dbconfig/20230308-174535-marostegui.json	[production]
17:45	<marostegui@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2176.codfw.wmnet with reason: Maintenance	[production]
17:45	<bking@cumin2002>	START - Cookbook sre.hosts.provision for host elastic1065.mgmt.eqiad.wmnet with reboot policy GRACEFUL	[production]
17:45	<marostegui@cumin1001>	START - Cookbook sre.hosts.downtime for 12:00:00 on db2176.codfw.wmnet with reason: Maintenance	[production]
17:45	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2174 (T328817)', diff saved to https://phabricator.wikimedia.org/P45557 and previous config saved to /var/cache/conftool/dbconfig/20230308-174514-marostegui.json	[production]
17:45	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1109 (T318605)', diff saved to https://phabricator.wikimedia.org/P45556 and previous config saved to /var/cache/conftool/dbconfig/20230308-174501-ladsgroup.json	[production]
17:43	<bking@cumin2002>	END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['elastic1066.eqiad.wmnet']	[production]
17:42	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2179 (T329203)', diff saved to https://phabricator.wikimedia.org/P45555 and previous config saved to /var/cache/conftool/dbconfig/20230308-174208-marostegui.json	[production]
17:38	<bking@cumin2002>	END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['elastic1065.eqiad.wmnet']	[production]
17:37	<marostegui@cumin1001>	dbctl commit (dc=all): 'Depooling db2154 (T329260)', diff saved to https://phabricator.wikimedia.org/P45554 and previous config saved to /var/cache/conftool/dbconfig/20230308-173701-marostegui.json	[production]