production SAL

2024-07-10
13:01	<btullis@cumin1002>	START - Cookbook sre.hosts.reimage for host an-mariadb1001.eqiad.wmnet with OS bookworm	[production]
12:59	<marostegui@cumin1002>	dbctl commit (dc=all): 'db1183 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P66122 and previous config saved to /var/cache/conftool/dbconfig/20240710-125928-root.json	[production]
12:44	<marostegui@cumin1002>	dbctl commit (dc=all): 'db1183 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P66121 and previous config saved to /var/cache/conftool/dbconfig/20240710-124422-root.json	[production]
12:38	<arnaudb@cumin1002>	dbctl commit (dc=all): 'Depooling db1167 (T367781)', diff saved to https://phabricator.wikimedia.org/P66120 and previous config saved to /var/cache/conftool/dbconfig/20240710-123844-arnaudb.json	[production]
12:38	<arnaudb@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1016,1020-1021].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance	[production]
12:38	<arnaudb@cumin1002>	START - Cookbook sre.hosts.downtime for 8:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1016,1020-1021].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance	[production]
12:38	<arnaudb@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1167.eqiad.wmnet with reason: Maintenance	[production]
12:38	<arnaudb@cumin1002>	START - Cookbook sre.hosts.downtime for 4:00:00 on db1167.eqiad.wmnet with reason: Maintenance	[production]
12:30	<topranks>	removing unused wmcs vlans from asw2-b-eqiad	[production]
12:29	<marostegui@cumin1002>	dbctl commit (dc=all): 'db1183 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P66119 and previous config saved to /var/cache/conftool/dbconfig/20240710-122917-root.json	[production]
12:23	<logmsgbot>	lucaswerkmeister-wmde@deploy1002 helmfile [codfw] DONE helmfile.d/services/termbox: apply	[production]
12:22	<logmsgbot>	lucaswerkmeister-wmde@deploy1002 helmfile [codfw] START helmfile.d/services/termbox: apply	[production]
12:22	<logmsgbot>	lucaswerkmeister-wmde@deploy1002 helmfile [eqiad] DONE helmfile.d/services/termbox: apply	[production]
12:21	<logmsgbot>	lucaswerkmeister-wmde@deploy1002 helmfile [eqiad] START helmfile.d/services/termbox: apply	[production]
12:21	<logmsgbot>	lucaswerkmeister-wmde@deploy1002 helmfile [staging] DONE helmfile.d/services/termbox: apply	[production]
12:20	<logmsgbot>	lucaswerkmeister-wmde@deploy1002 helmfile [staging] START helmfile.d/services/termbox: apply	[production]
12:14	<marostegui@cumin1002>	dbctl commit (dc=all): 'db1183 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P66118 and previous config saved to /var/cache/conftool/dbconfig/20240710-121411-root.json	[production]
11:59	<marostegui@cumin1002>	dbctl commit (dc=all): 'db1183 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P66117 and previous config saved to /var/cache/conftool/dbconfig/20240710-115906-root.json	[production]
11:53	<btullis@deploy1002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s_services/services/datahub: sync on production	[production]
11:50	<marostegui@cumin1002>	dbctl commit (dc=all): 'Pool db2136 into api with small weight T365805', diff saved to https://phabricator.wikimedia.org/P66116 and previous config saved to /var/cache/conftool/dbconfig/20240710-115046-marostegui.json	[production]
11:50	<claime>	cleaned up leftover media files on videoscalers	[production]
11:50	<btullis@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s_services/services/datahub: apply on production	[production]
11:44	<marostegui@cumin1002>	dbctl commit (dc=all): 'db1183 (re)pooling @ 5%: Repooling', diff saved to https://phabricator.wikimedia.org/P66115 and previous config saved to /var/cache/conftool/dbconfig/20240710-114401-root.json	[production]
11:30	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Depooling db1162 (T352010)', diff saved to https://phabricator.wikimedia.org/P66114 and previous config saved to /var/cache/conftool/dbconfig/20240710-113010-ladsgroup.json	[production]
11:30	<ladsgroup@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1162.eqiad.wmnet with reason: Maintenance	[production]
11:29	<ladsgroup@cumin1002>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1162.eqiad.wmnet with reason: Maintenance	[production]
11:28	<marostegui@cumin1002>	dbctl commit (dc=all): 'db1183 (re)pooling @ 1%: Repooling', diff saved to https://phabricator.wikimedia.org/P66113 and previous config saved to /var/cache/conftool/dbconfig/20240710-112856-root.json	[production]
11:22	<mnz@deploy1002>	Finished deploy [airflow-dags/research@5121748]: (no justification provided) (duration: 00m 41s)	[production]
11:21	<mnz@deploy1002>	Started deploy [airflow-dags/research@5121748]: (no justification provided)	[production]
10:43	<btullis@deploy1002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic: apply	[production]
10:43	<btullis@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic: apply	[production]
10:43	<btullis@deploy1002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic-next: apply	[production]
10:43	<btullis@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic-next: apply	[production]
10:42	<btullis@deploy1002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s_services/services/datahub-next: sync on staging	[production]
10:39	<btullis@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s_services/services/datahub-next: apply on staging	[production]
10:38	<btullis@deploy1002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/superset: apply	[production]
10:38	<btullis@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/superset: apply	[production]
10:34	<btullis@deploy1002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'.	[production]
10:34	<btullis@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'.	[production]
10:29	<btullis@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/superset-next: apply	[production]
10:26	<mnz@deploy1002>	Finished deploy [airflow-dags/research@5121748]: (no justification provided) (duration: 00m 04s)	[production]
10:26	<mnz@deploy1002>	Started deploy [airflow-dags/research@5121748]: (no justification provided)	[production]
10:23	<arnaudb@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1208.eqiad.wmnet with reason: corruption issue	[production]
10:22	<arnaudb@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on db1208.eqiad.wmnet with reason: corruption issue	[production]
10:21	<jiji@deploy1002>	Finished scap: Switch mediawiki everywhere to use node-local mcrouter ds - T346690 (duration: 05m 15s)	[production]
10:15	<jiji@deploy1002>	Started scap sync-world: Switch mediawiki everywhere to use node-local mcrouter ds - T346690	[production]
09:29	<kevinbazira@deploy1002>	helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'experimental' for release 'main' .	[production]
08:51	<aklapper@deploy1002>	rebuilt and synchronized wikiversions files: group1 wikis to 1.43.0-wmf.13 refs T366958	[production]
08:41	<hashar>	On deployment server, unblocked train by manually editing /var/lib/scap/scap/lib/python3.7/site-packages/scap/train.py to allow train blocker task with "progress" status instead of just "open" # T369689	[production]
08:08	<kostajh>	UTC morning deploys done	[production]