production SAL

201-250 of 10000 results (96ms)

2025-07-24 §
09:58	<cgoubert@dns1004>	END - running authdns-update	[production]
09:58	<hnowlan@deploy1003>	helmfile [staging] DONE helmfile.d/services/thumbor: sync	[production]
09:58	<hnowlan@deploy1003>	helmfile [staging] START helmfile.d/services/thumbor: sync	[production]
09:57	<hnowlan@deploy1003>	helmfile [staging] DONE helmfile.d/services/thumbor: apply	[production]
09:57	<hnowlan@deploy1003>	helmfile [staging] START helmfile.d/services/thumbor: apply	[production]
09:57	<vgutierrez@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host lvs1013.eqiad.wmnet with OS bookworm	[production]
09:57	<cgoubert@dns1004>	START - running authdns-update	[production]
09:55	<hnowlan@deploy1003>	helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'.	[production]
09:54	<hnowlan@deploy1003>	helmfile [staging-eqiad] START helmfile.d/admin 'apply'.	[production]
09:47	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1170', diff saved to https://phabricator.wikimedia.org/P79803 and previous config saved to /var/cache/conftool/dbconfig/20250724-094706-marostegui.json	[production]
09:42	<vgutierrez@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on lvs1013.eqiad.wmnet with reason: host reimage	[production]
09:37	<vgutierrez>	disable BGP for lvs1013 on lsw1-e1-eqiad.mgmt.eqiad.wmnet - T400259	[production]
09:36	<vgutierrez@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on lvs1013.eqiad.wmnet with reason: host reimage	[production]
09:31	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1170 (T399249)', diff saved to https://phabricator.wikimedia.org/P79801 and previous config saved to /var/cache/conftool/dbconfig/20250724-093158-marostegui.json	[production]
09:22	<vgutierrez@cumin1002>	START - Cookbook sre.hosts.reimage for host lvs1013.eqiad.wmnet with OS bookworm	[production]
09:13	<vgutierrez@cumin1002>	END (PASS) - Cookbook sre.loadbalancer.admin (exit_code=0) depooling P{lvs1013.eqiad.wmnet} and A:liberica (T400259)	[production]
09:12	<vgutierrez@cumin1002>	START - Cookbook sre.loadbalancer.admin depooling P{lvs1013.eqiad.wmnet} and A:liberica (T400259)	[production]
08:22	<marostegui@cumin1002>	dbctl commit (dc=all): 'Depooling db1170 (T399249)', diff saved to https://phabricator.wikimedia.org/P79800 and previous config saved to /var/cache/conftool/dbconfig/20250724-082213-marostegui.json	[production]
08:22	<marostegui@cumin1002>	DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1170.eqiad.wmnet with reason: Maintenance	[production]
08:21	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1158 (T399249)', diff saved to https://phabricator.wikimedia.org/P79799 and previous config saved to /var/cache/conftool/dbconfig/20250724-082150-marostegui.json	[production]
08:06	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1158', diff saved to https://phabricator.wikimedia.org/P79798 and previous config saved to /var/cache/conftool/dbconfig/20250724-080643-marostegui.json	[production]
08:06	<marostegui@cumin1002>	dbctl commit (dc=all): 'db1227 (re)pooling @ 100%: 10', diff saved to https://phabricator.wikimedia.org/P79797 and previous config saved to /var/cache/conftool/dbconfig/20250724-080617-root.json	[production]
07:51	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1158', diff saved to https://phabricator.wikimedia.org/P79796 and previous config saved to /var/cache/conftool/dbconfig/20250724-075135-marostegui.json	[production]
07:51	<marostegui@cumin1002>	dbctl commit (dc=all): 'db1227 (re)pooling @ 75%: 10', diff saved to https://phabricator.wikimedia.org/P79795 and previous config saved to /var/cache/conftool/dbconfig/20250724-075112-root.json	[production]
07:36	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1158 (T399249)', diff saved to https://phabricator.wikimedia.org/P79794 and previous config saved to /var/cache/conftool/dbconfig/20250724-073628-marostegui.json	[production]
07:36	<marostegui@cumin1002>	dbctl commit (dc=all): 'db1227 (re)pooling @ 50%: 10', diff saved to https://phabricator.wikimedia.org/P79793 and previous config saved to /var/cache/conftool/dbconfig/20250724-073606-root.json	[production]
07:21	<marostegui@cumin1002>	dbctl commit (dc=all): 'db1227 (re)pooling @ 25%: 10', diff saved to https://phabricator.wikimedia.org/P79792 and previous config saved to /var/cache/conftool/dbconfig/20250724-072100-root.json	[production]
07:13	<marostegui@cumin1002>	dbctl commit (dc=all): 'Depool db1227 for migration to mariadb 10.11', diff saved to https://phabricator.wikimedia.org/P79791 and previous config saved to /var/cache/conftool/dbconfig/20250724-071300-marostegui.json	[production]
07:12	<marostegui@cumin1002>	DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db1227.eqiad.wmnet with reason: Maintenance	[production]
06:52	<marostegui@cumin1002>	dbctl commit (dc=all): 'Depooling db1158 (T399249)', diff saved to https://phabricator.wikimedia.org/P79790 and previous config saved to /var/cache/conftool/dbconfig/20250724-065222-marostegui.json	[production]
06:52	<marostegui@cumin1002>	DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 16:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1014,1018].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance	[production]
06:51	<marostegui@cumin1002>	DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1158.eqiad.wmnet with reason: Maintenance	[production]
06:33	<marostegui@cumin1002>	dbctl commit (dc=all): 'es2035 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P79789 and previous config saved to /var/cache/conftool/dbconfig/20250724-063300-root.json	[production]
06:17	<marostegui@cumin1002>	dbctl commit (dc=all): 'es2035 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P79788 and previous config saved to /var/cache/conftool/dbconfig/20250724-061755-root.json	[production]
06:02	<marostegui@cumin1002>	dbctl commit (dc=all): 'es2035 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P79787 and previous config saved to /var/cache/conftool/dbconfig/20250724-060249-root.json	[production]
05:47	<marostegui@cumin1002>	dbctl commit (dc=all): 'es2035 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P79786 and previous config saved to /var/cache/conftool/dbconfig/20250724-054743-root.json	[production]
05:32	<marostegui@cumin1002>	dbctl commit (dc=all): 'es2035 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P79785 and previous config saved to /var/cache/conftool/dbconfig/20250724-053236-root.json	[production]
05:28	<marostegui@cumin1002>	DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2218.codfw.wmnet with reason: Maintenance	[production]
01:28	<ryankemper>	[Cirrus] `ryankemper@cirrussearch2071:~$ sudo systemctl restart opensearch-disable-readahead-production-search-psi-codfw.service`	[production]
01:01	<ryankemper@cumin1002>	END (PASS) - Cookbook sre.elasticsearch.rolling-operation (exit_code=0) Operation.RESTART (3 nodes at a time) for ElasticSearch cluster search_codfw: activate new plugins packages - ryankemper@cumin1002 - T397227	[production]
2025-07-23 §
23:54	<dzahn@cumin2002>	END (PASS) - Cookbook sre.gitlab.upgrade (exit_code=0) on GitLab host gitlab2002.wikimedia.org with reason: security release 20250723	[production]
23:48	<ryankemper@cumin1002>	START - Cookbook sre.elasticsearch.rolling-operation Operation.RESTART (3 nodes at a time) for ElasticSearch cluster search_codfw: activate new plugins packages - ryankemper@cumin1002 - T397227	[production]
23:46	<ryankemper>	[Cirrus] Depooled codfw in anticipation of rolling restart. Hopefully minimal noise on this one :)	[production]
23:46	<ryankemper@cumin1002>	conftool action : set/pooled=false; selector: dnsdisc=search,name=codfw	[production]
23:15	<inflatador>	pool cirrussearch eqiad, will resume investigations tomorrow T400160	[production]
23:14	<bking@cumin2002>	conftool action : set/pooled=true; selector: dnsdisc=search,name=eqiad	[production]
23:08	<bking@cumin1002>	DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 55 hosts with reason: testing cluster quorum	[production]
22:53	<bking@cumin1002>	END (PASS) - Cookbook sre.elasticsearch.rolling-operation (exit_code=0) Operation.RESTART (3 nodes at a time) for ElasticSearch cluster search_eqiad: activate new plugins packages - bking@cumin1002 - T397227	[production]
22:17	<vriley@cumin1002>	END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host clouddb1022.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART	[production]
22:05	<vriley@cumin1002>	START - Cookbook sre.hosts.provision for host clouddb1022.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART	[production]