production SAL

5651-5700 of 10000 results (98ms)

2024-04-17 §
10:08	<hnowlan@deploy1002>	helmfile [eqiad] START helmfile.d/services/wikifeeds: sync	[production]
10:06	<marostegui@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on dbstore1007.eqiad.wmnet with reason: Maintenance	[production]
10:06	<marostegui@cumin1002>	START - Cookbook sre.hosts.downtime for 6:00:00 on dbstore1007.eqiad.wmnet with reason: Maintenance	[production]
10:00	<akosiaris>	manually bump coredns in eqiad to 6	[production]
09:59	<akosiaris>	manually bump coredns in codfw to 6	[production]
09:57	<marostegui@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1240.eqiad.wmnet with reason: Maintenance	[production]
09:57	<marostegui@cumin1002>	START - Cookbook sre.hosts.downtime for 6:00:00 on db1240.eqiad.wmnet with reason: Maintenance	[production]
09:57	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1223 (T361627)', diff saved to https://phabricator.wikimedia.org/P60753 and previous config saved to /var/cache/conftool/dbconfig/20240417-095731-marostegui.json	[production]
09:44	<cgoubert@cumin1002>	conftool action : set/pooled=false; selector: dnsdisc=mw-api-ext-ro,name=eqiad	[production]
09:44	<cgoubert@cumin1002>	conftool action : set/pooled=false; selector: dnsdisc=mw-api-int-ro,name=eqiad	[production]
09:44	<cgoubert@cumin1002>	conftool action : set/pooled=false; selector: dnsdisc=mw-web-ro,name=eqiad	[production]
09:42	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1223', diff saved to https://phabricator.wikimedia.org/P60750 and previous config saved to /var/cache/conftool/dbconfig/20240417-094223-marostegui.json	[production]
09:31	<jiji@deploy1002>	scap failed: KeyError 'production' (duration: 22m 21s)	[production]
09:29	<marostegui@cumin1002>	dbctl commit (dc=all): 'db2150 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P60749 and previous config saved to /var/cache/conftool/dbconfig/20240417-092923-root.json	[production]
09:27	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1223', diff saved to https://phabricator.wikimedia.org/P60748 and previous config saved to /var/cache/conftool/dbconfig/20240417-092714-marostegui.json	[production]
09:14	<marostegui@cumin1002>	dbctl commit (dc=all): 'db2150 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P60747 and previous config saved to /var/cache/conftool/dbconfig/20240417-091418-root.json	[production]
09:12	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1223 (T361627)', diff saved to https://phabricator.wikimedia.org/P60746 and previous config saved to /var/cache/conftool/dbconfig/20240417-091203-marostegui.json	[production]
09:08	<jiji@deploy1002>	Started scap: Switch mediawiki in eqiad to use node-local mcrouter ds - T346690	[production]
09:05	<marostegui@cumin1002>	dbctl commit (dc=all): 'Depooling db1223 (T361627)', diff saved to https://phabricator.wikimedia.org/P60745 and previous config saved to /var/cache/conftool/dbconfig/20240417-090539-marostegui.json	[production]
09:05	<marostegui@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1223.eqiad.wmnet with reason: Maintenance	[production]
09:05	<jiji@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mw-api-int: apply	[production]
09:05	<marostegui@cumin1002>	START - Cookbook sre.hosts.downtime for 6:00:00 on db1223.eqiad.wmnet with reason: Maintenance	[production]
09:05	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1212 (T361627)', diff saved to https://phabricator.wikimedia.org/P60744 and previous config saved to /var/cache/conftool/dbconfig/20240417-090516-marostegui.json	[production]
09:03	<jiji@deploy1002>	helmfile [eqiad] START helmfile.d/services/mw-api-int: apply	[production]
08:59	<marostegui@cumin1002>	dbctl commit (dc=all): 'db2150 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P60743 and previous config saved to /var/cache/conftool/dbconfig/20240417-085912-root.json	[production]
08:57	<hashar@deploy1002>	Finished scap: Backport for [[gerrit:1019267\|logging: pluralize $wmgDefaultMonologHandler (T238838)]] (duration: 16m 37s)	[production]
08:50	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1212', diff saved to https://phabricator.wikimedia.org/P60742 and previous config saved to /var/cache/conftool/dbconfig/20240417-085009-marostegui.json	[production]
08:44	<hashar@deploy1002>	hashar: Continuing with sync	[production]
08:44	<hashar@deploy1002>	hashar: Backport for [[gerrit:1019267\|logging: pluralize $wmgDefaultMonologHandler (T238838)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug)	[production]
08:44	<marostegui@cumin1002>	dbctl commit (dc=all): 'db2150 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P60741 and previous config saved to /var/cache/conftool/dbconfig/20240417-084407-root.json	[production]
08:41	<hashar@deploy1002>	Started scap: Backport for [[gerrit:1019267\|logging: pluralize $wmgDefaultMonologHandler (T238838)]]	[production]
08:40	<aqu>	Deployed refinery using scap, then deployed onto hdfs	[production]
08:35	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1212', diff saved to https://phabricator.wikimedia.org/P60739 and previous config saved to /var/cache/conftool/dbconfig/20240417-083501-marostegui.json	[production]
08:29	<marostegui@cumin1002>	dbctl commit (dc=all): 'db2150 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P60738 and previous config saved to /var/cache/conftool/dbconfig/20240417-082901-root.json	[production]
08:26	<aqu@deploy1002>	Finished deploy [analytics/refinery@c4e197f] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@c4e197fa] (duration: 02m 23s)	[production]
08:24	<aqu@deploy1002>	Started deploy [analytics/refinery@c4e197f] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@c4e197fa]	[production]
08:19	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1212 (T361627)', diff saved to https://phabricator.wikimedia.org/P60737 and previous config saved to /var/cache/conftool/dbconfig/20240417-081953-marostegui.json	[production]
08:16	<aqu@deploy1002>	Finished deploy [analytics/refinery@c4e197f] (thin): Regular analytics weekly train THIN [analytics/refinery@c4e197fa] (duration: 03m 39s)	[production]
08:13	<marostegui@cumin1002>	dbctl commit (dc=all): 'db2150 (re)pooling @ 5%: Repooling', diff saved to https://phabricator.wikimedia.org/P60736 and previous config saved to /var/cache/conftool/dbconfig/20240417-081356-root.json	[production]
08:13	<marostegui@cumin1002>	dbctl commit (dc=all): 'Depooling db1212 (T361627)', diff saved to https://phabricator.wikimedia.org/P60735 and previous config saved to /var/cache/conftool/dbconfig/20240417-081326-marostegui.json	[production]
08:13	<marostegui@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on clouddb[1013,1017,1021].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance	[production]
08:13	<marostegui@cumin1002>	START - Cookbook sre.hosts.downtime for 12:00:00 on clouddb[1013,1017,1021].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance	[production]
08:13	<marostegui@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1212.eqiad.wmnet with reason: Maintenance	[production]
08:13	<aqu@deploy1002>	Started deploy [analytics/refinery@c4e197f] (thin): Regular analytics weekly train THIN [analytics/refinery@c4e197fa]	[production]
08:13	<marostegui@cumin1002>	START - Cookbook sre.hosts.downtime for 6:00:00 on db1212.eqiad.wmnet with reason: Maintenance	[production]
08:12	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1198 (T361627)', diff saved to https://phabricator.wikimedia.org/P60734 and previous config saved to /var/cache/conftool/dbconfig/20240417-081256-marostegui.json	[production]
08:10	<jayme@cumin1002>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host kubestage2002.codfw.wmnet	[production]
08:07	<aqu@deploy1002>	Finished deploy [analytics/refinery@c4e197f]: Regular analytics weekly train [analytics/refinery@c4e197fa] (duration: 27m 57s)	[production]
08:03	<marostegui@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db2150.codfw.wmnet with OS bookworm	[production]
08:00	<jayme@cumin1002>	START - Cookbook sre.hosts.reboot-single for host kubestage2002.codfw.wmnet	[production]