2022-12-08
11:29 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1171.eqiad.wmnet with reason: Maintenance [production]
11:29 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1170:3317 (T322618)', diff saved to https://phabricator.wikimedia.org/P42619 and previous config saved to /var/cache/conftool/dbconfig/20221208-112951-ladsgroup.json [production]
11:26 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2169:3317', diff saved to https://phabricator.wikimedia.org/P42618 and previous config saved to /var/cache/conftool/dbconfig/20221208-112612-ladsgroup.json [production]
11:23 <aqu@deploy1002> Finished deploy [airflow-dags/analytics@73d1267]: Create dag generating weekly snapshot of HDFS usage - analytics [airflow-dags@73d1267] (duration: 00m 18s) [production]
11:22 <aqu@deploy1002> Started deploy [airflow-dags/analytics@73d1267]: Create dag generating weekly snapshot of HDFS usage - analytics [airflow-dags@73d1267] [production]
11:21 <moritzm> drain ganeti5002 for eventual decom T324610 [production]
11:20 <aqu@deploy1002> Finished deploy [airflow-dags/analytics_test@73d1267]: Create dag generating weekly snapshot of HDFS usage - analytics_test [airflow-dags@73d1267] (duration: 00m 09s) [production]
11:20 <aqu@deploy1002> Started deploy [airflow-dags/analytics_test@73d1267]: Create dag generating weekly snapshot of HDFS usage - analytics_test [airflow-dags@73d1267] [production]
11:14 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1170:3317', diff saved to https://phabricator.wikimedia.org/P42617 and previous config saved to /var/cache/conftool/dbconfig/20221208-111444-ladsgroup.json [production]
11:11 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2169:3317 (T322618)', diff saved to https://phabricator.wikimedia.org/P42616 and previous config saved to /var/cache/conftool/dbconfig/20221208-111105-ladsgroup.json [production]
11:10 <steve_munene> batch restarting varnishkafka-webrequest.service in batches of 3, 30 seconds in between T323771 [production]
11:09 <steve_munene> batch restarting varnishkafka-webrequest.service in batches of 3, 30 seconds in between T323771 [production]
11:08 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db2169:3317 (T322618)', diff saved to https://phabricator.wikimedia.org/P42615 and previous config saved to /var/cache/conftool/dbconfig/20221208-110849-ladsgroup.json [production]
11:08 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2169.codfw.wmnet with reason: Maintenance [production]
11:08 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db2169.codfw.wmnet with reason: Maintenance [production]
11:08 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2168:3317 (T322618)', diff saved to https://phabricator.wikimedia.org/P42614 and previous config saved to /var/cache/conftool/dbconfig/20221208-110828-ladsgroup.json [production]
10:59 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1170:3317', diff saved to https://phabricator.wikimedia.org/P42613 and previous config saved to /var/cache/conftool/dbconfig/20221208-105938-ladsgroup.json [production]
10:57 <jmm@cumin2002> END (FAIL) - Cookbook sre.ganeti.addnode (exit_code=99) for new host ganeti5005.eqsin.wmnet to cluster eqsin and group 1 [production]
10:56 <steve_munene> batch restarting varnishkafka-statsv.service in batches of 3, 30 seconds in between T323771 [production]
10:54 <jmm@cumin2002> START - Cookbook sre.ganeti.addnode for new host ganeti5005.eqsin.wmnet to cluster eqsin and group 1 [production]
10:53 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2168:3317', diff saved to https://phabricator.wikimedia.org/P42612 and previous config saved to /var/cache/conftool/dbconfig/20221208-105321-ladsgroup.json [production]
10:50 <jmm@cumin2002> END (FAIL) - Cookbook sre.ganeti.addnode (exit_code=99) for new host ganeti5005.eqsin.wmnet to cluster eqsin and group 1 [production]
10:50 <jmm@cumin2002> START - Cookbook sre.ganeti.addnode for new host ganeti5005.eqsin.wmnet to cluster eqsin and group 1 [production]
10:44 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1170:3317 (T322618)', diff saved to https://phabricator.wikimedia.org/P42611 and previous config saved to /var/cache/conftool/dbconfig/20221208-104432-ladsgroup.json [production]
10:43 <steve_munene> batch restarting varnishkafka-eventlogging.service in batches of 3, 30 seconds in between T323771 [production]
10:43 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1170:3317 (T322618)', diff saved to https://phabricator.wikimedia.org/P42610 and previous config saved to /var/cache/conftool/dbconfig/20221208-104322-ladsgroup.json [production]
10:43 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1170.eqiad.wmnet with reason: Maintenance [production]
10:43 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1170.eqiad.wmnet with reason: Maintenance [production]
10:43 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1158 (T322618)', diff saved to https://phabricator.wikimedia.org/P42609 and previous config saved to /var/cache/conftool/dbconfig/20221208-104300-ladsgroup.json [production]
10:38 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2168:3317', diff saved to https://phabricator.wikimedia.org/P42608 and previous config saved to /var/cache/conftool/dbconfig/20221208-103815-ladsgroup.json [production]
10:36 <ladsgroup@deploy1002> Finished scap: Backport for [[gerrit:865828|Set externallinks migration to WRITE_BOTH in testwiki (T321662)]] (duration: 09m 17s) [production]
10:35 <steve_munene> batch restarting varnishkafka-eventlogging.service in batches of 3, 30 seconds in between [production]
10:28 <ladsgroup@deploy1002> ladsgroup and ladsgroup: Backport for [[gerrit:865828|Set externallinks migration to WRITE_BOTH in testwiki (T321662)]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet [production]
10:27 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1158', diff saved to https://phabricator.wikimedia.org/P42606 and previous config saved to /var/cache/conftool/dbconfig/20221208-102754-ladsgroup.json [production]
10:26 <ladsgroup@deploy1002> Started scap: Backport for [[gerrit:865828|Set externallinks migration to WRITE_BOTH in testwiki (T321662)]] [production]
10:25 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti5005.eqsin.wmnet [production]
10:23 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2168:3317 (T322618)', diff saved to https://phabricator.wikimedia.org/P42605 and previous config saved to /var/cache/conftool/dbconfig/20221208-102308-ladsgroup.json [production]
10:20 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db2168:3317 (T322618)', diff saved to https://phabricator.wikimedia.org/P42604 and previous config saved to /var/cache/conftool/dbconfig/20221208-102052-ladsgroup.json [production]
10:20 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2168.codfw.wmnet with reason: Maintenance [production]
10:20 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db2168.codfw.wmnet with reason: Maintenance [production]
10:20 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2159 (T322618)', diff saved to https://phabricator.wikimedia.org/P42603 and previous config saved to /var/cache/conftool/dbconfig/20221208-102030-ladsgroup.json [production]
10:18 <hashar> contint1002: activated Icinga monitoring, all services are up and running # T313832 [production]
10:12 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1158', diff saved to https://phabricator.wikimedia.org/P42602 and previous config saved to /var/cache/conftool/dbconfig/20221208-101247-ladsgroup.json [production]
10:05 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti5005.eqsin.wmnet [production]
10:05 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2159', diff saved to https://phabricator.wikimedia.org/P42600 and previous config saved to /var/cache/conftool/dbconfig/20221208-100524-ladsgroup.json [production]
10:01 <claime> Deploying puppet enforcement of zuul-merger on contint1002 [production]
09:57 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1158 (T322618)', diff saved to https://phabricator.wikimedia.org/P42599 and previous config saved to /var/cache/conftool/dbconfig/20221208-095741-ladsgroup.json [production]
09:57 <slyngshede@cumin1001> END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host test-reimage2001.codfw.wmnet [production]