production SAL

3801-3850 of 10000 results (98ms)

2024-06-11 §
18:14	<marostegui@cumin1002>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2159.codfw.wmnet with reason: Maintenance	[production]
18:14	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db2150 (T364069)', diff saved to https://phabricator.wikimedia.org/P64641 and previous config saved to /var/cache/conftool/dbconfig/20240611-181448-marostegui.json	[production]
18:10	<brennen>	1.43.0-wmf.9 train (T361403): no blockers, rolling to group0	[production]
18:08	<ejegg>	stopped fundraising scheduled jobs	[production]
17:59	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db2150', diff saved to https://phabricator.wikimedia.org/P64640 and previous config saved to /var/cache/conftool/dbconfig/20240611-175941-marostegui.json	[production]
17:59	<bking@deploy1002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'.	[production]
17:58	<bking@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'.	[production]
17:56	<bking@deploy1002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'.	[production]
17:56	<taavi@deploy1002>	Finished scap: Backport for [[gerrit:1038750\|wikitech: Stop loading OpenStackManager (T161553 T338477 T359544)]] (duration: 12m 00s)	[production]
17:56	<bking@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'.	[production]
17:47	<taavi@deploy1002>	taavi: Continuing with sync	[production]
17:46	<taavi@deploy1002>	taavi: Backport for [[gerrit:1038750\|wikitech: Stop loading OpenStackManager (T161553 T338477 T359544)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug)	[production]
17:45	<bking@deploy1002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'.	[production]
17:45	<bking@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'.	[production]
17:44	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db2150', diff saved to https://phabricator.wikimedia.org/P64639 and previous config saved to /var/cache/conftool/dbconfig/20240611-174434-marostegui.json	[production]
17:44	<taavi@deploy1002>	Started scap: Backport for [[gerrit:1038750\|wikitech: Stop loading OpenStackManager (T161553 T338477 T359544)]]	[production]
17:37	<rzl@deploy1002>	Finished scap: (no justification provided) (duration: 11m 40s)	[production]
17:33	<rzl>	rzl@cumin2002:~$ sudo cumin 'C:profile::mediawiki::webserver' 'enable-puppet T366649'	[production]
17:33	<rzl@deploy1002>	rzl: Continuing with sync	[production]
17:30	<rzl@deploy1002>	rzl: (no justification provided) synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug)	[production]
17:29	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db2150 (T364069)', diff saved to https://phabricator.wikimedia.org/P64638 and previous config saved to /var/cache/conftool/dbconfig/20240611-172928-marostegui.json	[production]
17:26	<rzl@deploy1002>	Started scap: (no justification provided)	[production]
17:14	<rzl>	rzl@cumin2002:~$ sudo cumin 'C:profile::mediawiki::webserver' 'disable-puppet T366649'	[production]
17:11	<ejegg>	fundraising civicrm upgraded from ebfbad86 to 7252b1b9	[production]
17:09	<ebernhardson@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply	[production]
17:09	<ebernhardson@deploy1002>	helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply	[production]
17:09	<kamila@cumin1002>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host wikikube-ctrl1002.eqiad.wmnet with OS bullseye	[production]
17:08	<ebernhardson@deploy1002>	helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply	[production]
17:08	<ebernhardson@deploy1002>	helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply	[production]
17:04	<ebernhardson@deploy1002>	helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply	[production]
17:04	<ebernhardson@deploy1002>	helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply	[production]
17:04	<bking@cumin2002>	END (PASS) - Cookbook sre.elasticsearch.ban (exit_code=0) Unbanning all hosts in search_eqiad	[production]
17:04	<bking@cumin2002>	START - Cookbook sre.elasticsearch.ban Unbanning all hosts in search_eqiad	[production]
16:59	<kamila@cumin1002>	START - Cookbook sre.hosts.reimage for host wikikube-ctrl1002.eqiad.wmnet with OS bullseye	[production]
16:56	<kamila@cumin1002>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host wikikube-ctrl1002.eqiad.wmnet with OS bullseye	[production]
16:56	<brouberol@deploy1002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/superset: apply	[production]
16:56	<brouberol@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/superset: apply	[production]
16:53	<brouberol@deploy1002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/superset-next: apply	[production]
16:53	<brouberol@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/superset-next: apply	[production]
16:51	<kamila@cumin1002>	START - Cookbook sre.hosts.reimage for host wikikube-ctrl1002.eqiad.wmnet with OS bullseye	[production]
16:47	<ryankemper@cumin2002>	END (PASS) - Cookbook sre.hadoop.reboot-workers (exit_code=0) for Hadoop test cluster	[production]
16:40	<eevans@cumin1002>	END (PASS) - Cookbook sre.cassandra.roll-reboot (exit_code=0) rolling reboot on A:restbase-codfw	[production]
16:37	<kamila@cumin1002>	START - Cookbook sre.hosts.reimage for host wikikube-ctrl1002.eqiad.wmnet with OS bullseye	[production]
16:36	<kamila@cumin1002>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host wikikube-ctrl1002.eqiad.wmnet with OS bullseye	[production]
16:35	<ebernhardson@deploy1002>	helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply	[production]
16:35	<ebernhardson@deploy1002>	helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply	[production]
16:33	<kamila@cumin1002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "updated wikikube-ctrl1002 status - kamila@cumin1002 - T366204"	[production]
16:31	<cgoubert@cumin1002>	conftool action : set/weight=10:pooled=yes; selector: name=(wikikube-worker1013.eqiad.wmnet\|wikikube-worker1014.eqiad.wmnet\|wikikube-worker1017.eqiad.wmnet\|wikikube-worker1018.eqiad.wmnet),cluster=kubernetes,service=kubesvc	[production]
16:31	<claime>	pool and uncordon wikikube-worker1013.eqiad.wmnet,wikikube-worker1014.eqiad.wmnet,wikikube-worker1017.eqiad.wmnet,wikikube-worker1018.eqiad.wmnet - T351074	[production]
16:31	<kamila@cumin1002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "updated wikikube-ctrl1002 status - kamila@cumin1002 - T366204"	[production]