production SAL

2024-08-19
14:31	<jclark@cumin1002>	START - Cookbook sre.hosts.reimage for host wikikube-worker1303.eqiad.wmnet with OS bullseye	[production]
14:30	<jclark@cumin1002>	START - Cookbook sre.hosts.reimage for host wikikube-worker1301.eqiad.wmnet with OS bullseye	[production]
14:30	<jclark@cumin1002>	START - Cookbook sre.hosts.reimage for host wikikube-worker1300.eqiad.wmnet with OS bullseye	[production]
14:30	<brouberol@deploy1003>	helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-test-k8s: apply	[production]
14:30	<brouberol@deploy1003>	helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-test-k8s: apply	[production]
14:29	<jclark@cumin1002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wikikube-worker1303.eqiad.wmnet with OS bullseye	[production]
14:29	<jclark@cumin1002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wikikube-worker1301.eqiad.wmnet with OS bullseye	[production]
14:29	<jclark@cumin1002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wikikube-worker1300.eqiad.wmnet with OS bullseye	[production]
14:29	<jclark@cumin1002>	START - Cookbook sre.hosts.reimage for host wikikube-worker1303.eqiad.wmnet with OS bullseye	[production]
14:29	<jclark@cumin1002>	START - Cookbook sre.hosts.reimage for host wikikube-worker1301.eqiad.wmnet with OS bullseye	[production]
14:29	<jclark@cumin1002>	START - Cookbook sre.hosts.reimage for host wikikube-worker1300.eqiad.wmnet with OS bullseye	[production]
14:27	<bking@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on wdqs[1022,1024].eqiad.wmnet with reason: noisy alerts, will look at later in the day	[production]
14:27	<bking@cumin2002>	START - Cookbook sre.hosts.downtime for 5:00:00 on wdqs[1022,1024].eqiad.wmnet with reason: noisy alerts, will look at later in the day	[production]
13:34	<Lucas_WMDE>	UTC afternoon backport+config window done (except for the T195546 maintenance script which is expected to keep running for a few more hours, currently at commonswiki)	[production]
13:31	<brouberol@deploy1003>	helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-test-k8s: apply	[production]
13:31	<brouberol@deploy1003>	helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-test-k8s: apply	[production]
13:27	<logmsgbot>	lucaswerkmeister-wmde@deploy1003 Finished scap sync-world: Backport for [[gerrit:1062979|(de|uk|ja|he|fi)wiki: enable shellbox-video (T356241)]] (duration: 06m 57s)	[production]
13:23	<fnegri@cumin1002>	conftool action : set/pooled=yes; selector: name=clouddb1015.eqiad.wmnet,service=s4	[production]
13:23	<fnegri@cumin1002>	conftool action : set/pooled=yes; selector: name=clouddb1015.eqiad.wmnet,service=s6	[production]
13:22	<logmsgbot>	lucaswerkmeister-wmde@deploy1003 lucaswerkmeister-wmde, hnowlan: Continuing with sync	[production]
13:22	<logmsgbot>	lucaswerkmeister-wmde@deploy1003 lucaswerkmeister-wmde, hnowlan: Backport for [[gerrit:1062979|(de|uk|ja|he|fi)wiki: enable shellbox-video (T356241)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug)	[production]
13:21	<brouberol@deploy1003>	helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-test-k8s: apply	[production]
13:20	<logmsgbot>	lucaswerkmeister-wmde@deploy1003 Started scap sync-world: Backport for [[gerrit:1062979|(de|uk|ja|he|fi)wiki: enable shellbox-video (T356241)]]	[production]
13:17	<logmsgbot>	lucaswerkmeister-wmde@deploy1003 Finished scap sync-world: Backport for [[gerrit:1059422|Define wgVirtualDomainsMapping for virtual-checkuser-global (T371724)]] (duration: 10m 23s)	[production]
13:17	<marostegui@cumin1002>	dbctl commit (dc=all): 'Depooling db2162 (T367856)', diff saved to https://phabricator.wikimedia.org/P67386 and previous config saved to /var/cache/conftool/dbconfig/20240819-131702-marostegui.json	[production]
13:16	<marostegui@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 7:00:00 on db2162.codfw.wmnet with reason: Maintenance	[production]
13:16	<marostegui@cumin1002>	START - Cookbook sre.hosts.downtime for 2 days, 7:00:00 on db2162.codfw.wmnet with reason: Maintenance	[production]
13:16	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db2154 (T367856)', diff saved to https://phabricator.wikimedia.org/P67385 and previous config saved to /var/cache/conftool/dbconfig/20240819-131640-marostegui.json	[production]
13:16	<fnegri@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host clouddb1015.eqiad.wmnet with OS bookworm	[production]
13:13	<logmsgbot>	lucaswerkmeister-wmde@deploy1003 dreamyjazz, lucaswerkmeister-wmde: Continuing with sync	[production]
13:12	<cgoubert@cumin1002>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
13:10	<logmsgbot>	lucaswerkmeister-wmde@deploy1003 dreamyjazz, lucaswerkmeister-wmde: Backport for [[gerrit:1059422|Define wgVirtualDomainsMapping for virtual-checkuser-global (T371724)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug)	[production]
13:10	<cgoubert@cumin1002>	START - Cookbook sre.dns.netbox	[production]
13:09	<cgoubert@cumin1002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "rdb1014 back to active - cgoubert@cumin1002 - T370633"	[production]
13:09	<cgoubert@cumin1002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "rdb1014 back to active - cgoubert@cumin1002 - T370633"	[production]
13:07	<logmsgbot>	lucaswerkmeister-wmde@deploy1003 Started scap sync-world: Backport for [[gerrit:1059422|Define wgVirtualDomainsMapping for virtual-checkuser-global (T371724)]]	[production]
13:02	<brouberol@deploy1003>	helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-test-k8s: apply	[production]
13:02	<Lucas_WMDE>	START lucaswerkmeister-wmde@mwmaint1002:~$ foreachwiki maintenance/cleanupTitles.php --prefix=T195546 --reporting-interval=1000000000 2>&1 | tee ~/T195546.log	[production]
13:01	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db2154', diff saved to https://phabricator.wikimedia.org/P67384 and previous config saved to /var/cache/conftool/dbconfig/20240819-130132-marostegui.json	[production]
13:00	<cgoubert@cumin1002>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
12:57	<cgoubert@cumin1002>	START - Cookbook sre.dns.netbox	[production]
12:49	<fnegri@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on clouddb1015.eqiad.wmnet with reason: host reimage	[production]
12:46	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db2154', diff saved to https://phabricator.wikimedia.org/P67383 and previous config saved to /var/cache/conftool/dbconfig/20240819-124625-marostegui.json	[production]
12:45	<fnegri@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on clouddb1015.eqiad.wmnet with reason: host reimage	[production]
12:41	<cgoubert@cumin1002>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
12:39	<pfischer@deploy1003>	helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply	[production]
12:39	<cgoubert@cumin1002>	START - Cookbook sre.dns.netbox	[production]
12:38	<pfischer@deploy1003>	helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply	[production]
12:38	<pfischer@deploy1003>	helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply	[production]
12:37	<pfischer@deploy1003>	helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply	[production]