production SAL

2024-05-02
18:10	<sukhe@cumin1002>	START - Cookbook sre.dns.netbox	[production]
18:10	<sukhe@cumin1002>	START - Cookbook sre.ganeti.makevm for new host doh7001.wikimedia.org	[production]
18:09	<sukhe@cumin1002>	END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) for new host doh7001.wikimedia.org	[production]
18:09	<sukhe@cumin1002>	END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) doh7001.wikimedia.org on all recursors	[production]
18:09	<sukhe@cumin1002>	START - Cookbook sre.dns.wipe-cache doh7001.wikimedia.org on all recursors	[production]
18:09	<sukhe@cumin1002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Remove records for VM doh7001.wikimedia.org - sukhe@cumin1002"	[production]
18:09	<sukhe@cumin1002>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
18:08	<sukhe@cumin1002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Remove records for VM doh7001.wikimedia.org - sukhe@cumin1002"	[production]
18:05	<eevans@cumin1002>	END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:restbase-codfw: Apply updated JDK 8 - eevans@cumin1002	[production]
18:01	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1241 (T361627)', diff saved to https://phabricator.wikimedia.org/P61756 and previous config saved to /var/cache/conftool/dbconfig/20240502-180136-marostegui.json	[production]
17:58	<sfaci@deploy1002>	helmfile [staging] START helmfile.d/services/editor-analytics: apply	[production]
17:55	<sukhe@cumin1002>	START - Cookbook sre.dns.netbox	[production]
17:55	<sukhe@cumin1002>	END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) doh7001.wikimedia.org on all recursors	[production]
17:55	<sukhe@cumin1002>	START - Cookbook sre.dns.wipe-cache doh7001.wikimedia.org on all recursors	[production]
17:55	<sukhe@cumin1002>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
17:53	<sukhe@cumin1002>	START - Cookbook sre.dns.netbox	[production]
17:53	<sfaci@deploy1002>	helmfile [staging] START helmfile.d/services/editor-analytics: apply	[production]
17:52	<sukhe@cumin1002>	END (FAIL) - Cookbook sre.dns.netbox (exit_code=99)	[production]
17:50	<sfaci@deploy1002>	helmfile [staging] START helmfile.d/services/editor-analytics: apply	[production]
17:49	<marostegui@cumin1002>	dbctl commit (dc=all): 'Depooling db1241 (T361627)', diff saved to https://phabricator.wikimedia.org/P61755 and previous config saved to /var/cache/conftool/dbconfig/20240502-174920-marostegui.json	[production]
17:49	<marostegui@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1241.eqiad.wmnet with reason: Maintenance	[production]
17:49	<marostegui@cumin1002>	START - Cookbook sre.hosts.downtime for 4:00:00 on db1241.eqiad.wmnet with reason: Maintenance	[production]
17:48	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1221 (T361627)', diff saved to https://phabricator.wikimedia.org/P61754 and previous config saved to /var/cache/conftool/dbconfig/20240502-174856-marostegui.json	[production]
17:33	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1221', diff saved to https://phabricator.wikimedia.org/P61753 and previous config saved to /var/cache/conftool/dbconfig/20240502-173349-marostegui.json	[production]
17:24	<brett@cumin2002>	START - Cookbook sre.dns.netbox	[production]
17:24	<brett@cumin2002>	START - Cookbook sre.ganeti.makevm for new host ncredir7001.magru.wmnet	[production]
17:18	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1221', diff saved to https://phabricator.wikimedia.org/P61752 and previous config saved to /var/cache/conftool/dbconfig/20240502-171840-marostegui.json	[production]
17:15	<sfaci@deploy1002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic-next: apply	[production]
17:15	<sfaci@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic-next: apply	[production]
17:05	<sukhe@cumin1002>	START - Cookbook sre.dns.netbox	[production]
17:05	<sukhe@cumin1002>	START - Cookbook sre.ganeti.makevm for new host doh7001.wikimedia.org	[production]
17:03	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1221 (T361627)', diff saved to https://phabricator.wikimedia.org/P61751 and previous config saved to /var/cache/conftool/dbconfig/20240502-170332-marostegui.json	[production]
16:53	<sfaci@deploy1002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic-next: apply	[production]
16:52	<sfaci@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic-next: apply	[production]
16:52	<marostegui@cumin1002>	dbctl commit (dc=all): 'Depooling db1221 (T361627)', diff saved to https://phabricator.wikimedia.org/P61750 and previous config saved to /var/cache/conftool/dbconfig/20240502-165211-marostegui.json	[production]
16:52	<marostegui@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on clouddb[1015,1019,1021].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance	[production]
16:51	<marostegui@cumin1002>	START - Cookbook sre.hosts.downtime for 8:00:00 on clouddb[1015,1019,1021].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance	[production]
16:51	<marostegui@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1221.eqiad.wmnet with reason: Maintenance	[production]
16:51	<marostegui@cumin1002>	START - Cookbook sre.hosts.downtime for 4:00:00 on db1221.eqiad.wmnet with reason: Maintenance	[production]
16:51	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1199 (T361627)', diff saved to https://phabricator.wikimedia.org/P61749 and previous config saved to /var/cache/conftool/dbconfig/20240502-165129-marostegui.json	[production]
16:40	<sukhe@cumin1002>	END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host durum7001.magru.wmnet	[production]
16:40	<sukhe@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host durum7001.magru.wmnet with OS bookworm	[production]
16:39	<sfaci@deploy1002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic-next: apply	[production]
16:38	<sfaci@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic-next: apply	[production]
16:36	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1199', diff saved to https://phabricator.wikimedia.org/P61748 and previous config saved to /var/cache/conftool/dbconfig/20240502-163622-marostegui.json	[production]
16:21	<amastilovic@deploy1002>	Finished deploy [airflow-dags/analytics@7513bfa]: (no justification provided) (duration: 00m 44s)	[production]
16:21	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1199', diff saved to https://phabricator.wikimedia.org/P61747 and previous config saved to /var/cache/conftool/dbconfig/20240502-162114-marostegui.json	[production]
16:20	<amastilovic@deploy1002>	Started deploy [airflow-dags/analytics@7513bfa]: (no justification provided)	[production]
16:16	<sukhe@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on durum7001.magru.wmnet with reason: host reimage	[production]
16:15	<sukhe>	running authdns-update once again to confirm state of dns700[12]	[production]