production SAL

2023-01-04
07:38	<oblivian@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mw-web: apply	[production]
07:38	<oblivian@deploy1002>	helmfile [codfw] START helmfile.d/services/mw-web: apply	[production]
07:38	<marostegui>	Switch x1 back to RBR T255174	[production]
07:35	<marostegui>	dbmaint codfw deploy schema change on x1 T255174	[production]
07:35	<marostegui>	dbmaint eqiad deploy schema change on x1 T255174	[production]
07:29	<marostegui@cumin1001>	dbctl commit (dc=all): 'db2131 (re)pooling @ 10%: After testing', diff saved to https://phabricator.wikimedia.org/P42751 and previous config saved to /var/cache/conftool/dbconfig/20230104-072922-root.json	[production]
07:20	<oblivian@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mw-debug: apply	[production]
07:20	<oblivian@deploy1002>	helmfile [eqiad] START helmfile.d/services/mw-debug: apply	[production]
07:19	<oblivian@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mw-debug: apply	[production]
07:19	<oblivian@deploy1002>	helmfile [codfw] START helmfile.d/services/mw-debug: apply	[production]
07:14	<marostegui@cumin1001>	dbctl commit (dc=all): 'db2131 (re)pooling @ 5%: After testing', diff saved to https://phabricator.wikimedia.org/P42750 and previous config saved to /var/cache/conftool/dbconfig/20230104-071417-root.json	[production]
06:59	<marostegui@cumin1001>	dbctl commit (dc=all): 'db2131 (re)pooling @ 1%: After testing', diff saved to https://phabricator.wikimedia.org/P42749 and previous config saved to /var/cache/conftool/dbconfig/20230104-065912-root.json	[production]
2023-01-03
22:47	<eileen>	config 34754c69 -> 03c4d7a6	[production]
22:33	<eileen>	config revision changed from 5c73975a to 34754c69	[production]
21:55	<mutante>	gitlab-runner* - correction: allowing connections TO kubestagemaster.svc.eqiad.wmnet port 6443 FROM trusted runners, of course - T325385	[production]
21:53	<mutante>	gitlab-runner* - allowing kubestagemaster.svc.eqiad.wmnet to connect to port 6443, run puppet via cumin, deploy gerrit:868737 - T325385	[production]
21:47	<taavi>	UTC late backports done	[production]
21:46	<taavi@deploy1002>	Finished scap: Backport for [[gerrit:869226\|Specify Citoid RESTBase URL separately (T325425)]], [[gerrit:874855\|Use new DiscussionTools heading markup on group1 wikis (T314714)]] (duration: 12m 12s)	[production]
21:35	<taavi@deploy1002>	taavi and matmarex: Backport for [[gerrit:869226\|Specify Citoid RESTBase URL separately (T325425)]], [[gerrit:874855\|Use new DiscussionTools heading markup on group1 wikis (T314714)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet	[production]
21:34	<taavi@deploy1002>	Started scap: Backport for [[gerrit:869226\|Specify Citoid RESTBase URL separately (T325425)]], [[gerrit:874855\|Use new DiscussionTools heading markup on group1 wikis (T314714)]]	[production]
21:30	<taavi@deploy1002>	Finished scap: Backport for [[gerrit:874443\|Start writing to cuc_comment_id on test wikis (T233004)]] (duration: 12m 54s)	[production]
21:19	<taavi@deploy1002>	taavi and zabe: Backport for [[gerrit:874443\|Start writing to cuc_comment_id on test wikis (T233004)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet	[production]
21:17	<taavi@deploy1002>	Started scap: Backport for [[gerrit:874443\|Start writing to cuc_comment_id on test wikis (T233004)]]	[production]
21:15	<taavi@deploy1002>	Finished scap: Backport for [[gerrit:873880\|Stop setting $wgActorTableSchemaMigrationStage (T215466)]], [[gerrit:873887\|Pin $wgCommentTempTableSchemaMigrationStage to default value (T299954)]], [[gerrit:874418\|Pin cu_changes comment migration to old schema (T233004)]] (duration: 08m 49s)	[production]
21:08	<taavi@deploy1002>	taavi and zabe: Backport for [[gerrit:873880\|Stop setting $wgActorTableSchemaMigrationStage (T215466)]], [[gerrit:873887\|Pin $wgCommentTempTableSchemaMigrationStage to default value (T299954)]], [[gerrit:874418\|Pin cu_changes comment migration to old schema (T233004)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet	[production]
21:06	<taavi@deploy1002>	Started scap: Backport for [[gerrit:873880\|Stop setting $wgActorTableSchemaMigrationStage (T215466)]], [[gerrit:873887\|Pin $wgCommentTempTableSchemaMigrationStage to default value (T299954)]], [[gerrit:874418\|Pin cu_changes comment migration to old schema (T233004)]]	[production]
19:27	<dduvall@deploy1002>	rebuilt and synchronized wikiversions files: group0 wikis to 1.40.0-wmf.17 refs T325580	[production]
19:18	<dduvall@deploy1002>	deploy-promote aborted: (duration: 08m 55s)	[production]
19:13	<btullis@cumin1001>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cephosd1001.eqiad.wmnet with OS bullseye	[production]
17:37	<claime>	Finished parse reboots in eqiad	[production]
17:36	<cgoubert@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-cluster (exit_code=0)	[production]
17:30	<sukhe>	sudo cumin -b 1 -s 5 'A:codfw and P{O:swift::proxy}' 'depool && sleep 3 && systemctl restart swift-proxy && sleep 3 && pool'	[production]
16:40	<ejegg>	fundraising EOY receipt calculation finished, restarted scheduled jobs	[production]
16:21	<ejegg>	fundraising scheduled jobs disabled for EOY receipt calculation	[production]
15:37	<btullis@cumin1001>	START - Cookbook sre.hosts.reimage for host cephosd1001.eqiad.wmnet with OS bullseye	[production]
15:30	<btullis@cumin1001>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cephosd1001.eqiad.wmnet with OS bullseye	[production]
15:14	<cgoubert@cumin1001>	START - Cookbook sre.hosts.reboot-cluster	[production]
15:13	<cgoubert@cumin1001>	END (ERROR) - Cookbook sre.hosts.reboot-cluster (exit_code=97)	[production]
15:13	<cgoubert@cumin1001>	START - Cookbook sre.hosts.reboot-cluster	[production]
15:13	<cgoubert@cumin1001>	END (ERROR) - Cookbook sre.hosts.reboot-cluster (exit_code=97)	[production]
15:13	<cgoubert@cumin1001>	START - Cookbook sre.hosts.reboot-cluster	[production]
15:11	<cgoubert@cumin1001>	END (FAIL) - Cookbook sre.hosts.reboot-cluster (exit_code=1)	[production]
15:10	<andrewbogott>	upgrading and rebooting wikitech-static	[production]
15:07	<cgoubert@cumin1001>	START - Cookbook sre.hosts.reboot-cluster	[production]
15:06	<claime>	Starting rolling reboot of parse* hosts in eqiad	[production]
15:05	<taavi>	UTC afternoon backports done	[production]
15:04	<taavi@deploy1002>	Finished scap: Backport for [[gerrit:874871\|SecurePoll: Add files for UCoC 2023 vote (T324793)]], [[gerrit:874872\|ucoc2023: Update populateEditCount to count Flow edits (T324793)]], [[gerrit:874873\|ucoc2023: Update populateEditCount to count Flow edits (T324793)]] (duration: 08m 10s)	[production]
15:00	<filippo@cumin1001>	END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts graphite1004.eqiad.wmnet	[production]
14:59	<filippo@cumin1001>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
14:59	<filippo@cumin1001>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: graphite1004.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - filippo@cumin1001"	[production]