production SAL

851-900 of 10000 results (51ms)

2019-10-24 §
10:55	<aborrero@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]
10:55	<aborrero@cumin1001>	START - Cookbook sre.hosts.downtime	[production]
10:55	<ema@cumin1001>	END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99)	[production]
10:52	<ema@cumin1001>	START - Cookbook sre.hosts.downtime	[production]
10:18	<marostegui@cumin1001>	dbctl commit (dc=all): 'Adjust s6 weights for db1093 and db1085', diff saved to https://phabricator.wikimedia.org/P9466 and previous config saved to /var/cache/conftool/dbconfig/20191024-101810-marostegui.json	[production]
09:59	<hashar>	Converting CI jobs to use the new PostBuildScript plugin config \| https://gerrit.wikimedia.org/r/#/c/integration/config/+/544907/ \| T188398	[production]
09:57	<hashar>	Restarting CI Jenkins	[production]
09:35	<vgutierrez@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]
09:33	<vgutierrez@cumin1001>	START - Cookbook sre.hosts.downtime	[production]
09:14	<vgutierrez@cumin1001>	END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99)	[production]
09:12	<vgutierrez@cumin1001>	START - Cookbook sre.hosts.downtime	[production]
09:12	<gilles@deploy1001>	Synchronized wmf-config/InitialiseSettings.php: T234853 Re-enable performance perception survey on ruwiki (duration: 01m 04s)	[production]
08:39	<vgutierrez@cumin1001>	END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99)	[production]
08:37	<vgutierrez@cumin1001>	START - Cookbook sre.hosts.downtime	[production]
08:36	<godog>	roll restart rsyslog in codfw/eqiad to pick up new kafka partitions	[production]
08:18	<godog>	roll restart rsyslog in ulsfo/esams/eqsin to pick up new kafka partitions	[production]
08:15	<marostegui@cumin1001>	dbctl commit (dc=all): 'Depool db2092 for analyze table', diff saved to https://phabricator.wikimedia.org/P9465 and previous config saved to /var/cache/conftool/dbconfig/20191024-081519-marostegui.json	[production]
07:57	<XioNoX>	reboot mr1-esams	[production]
07:42	<godog>	bump rsyslog- topics partitions to 6 and roll-restart logstash frontends	[production]
07:24	<vgutierrez@cumin1001>	END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99)	[production]
07:22	<vgutierrez@cumin1001>	START - Cookbook sre.hosts.downtime	[production]
07:22	<XioNoX>	drain Telia link on cr2-esams	[production]
06:32	<oblivian@puppetmaster1001>	conftool action : set/pooled=true; selector: dnsdisc=parsoid-php,name=eqiad	[production]
05:20	<marostegui@cumin1001>	dbctl commit (dc=all): 'Fully repool db1097:3315 after compression', diff saved to https://phabricator.wikimedia.org/P9463 and previous config saved to /var/cache/conftool/dbconfig/20191024-052002-marostegui.json	[production]
05:18	<marostegui>	Run analyze enwiki.revision on db2092 T223151	[production]
04:59	<marostegui@cumin1001>	dbctl commit (dc=all): 'More traffic to db1097:3315 after compression', diff saved to https://phabricator.wikimedia.org/P9462 and previous config saved to /var/cache/conftool/dbconfig/20191024-045954-marostegui.json	[production]
04:59	<marostegui@cumin1001>	dbctl commit (dc=all): 'Remove db1089 from special slaves group and leave it with its original pooling options T223151', diff saved to https://phabricator.wikimedia.org/P9461 and previous config saved to /var/cache/conftool/dbconfig/20191024-045924-marostegui.json	[production]
04:55	<marostegui@cumin1001>	dbctl commit (dc=all): 'Slowly repool db1097:3315 after compression', diff saved to https://phabricator.wikimedia.org/P9460 and previous config saved to /var/cache/conftool/dbconfig/20191024-045544-marostegui.json	[production]
04:48	<marostegui@cumin1001>	END (PASS) - Cookbook sre.hosts.decommission (exit_code=0)	[production]
04:48	<marostegui@cumin1001>	START - Cookbook sre.hosts.decommission	[production]
03:55	<shdubsh>	temporarily turn down accept delay on fermium - T235983	[production]
00:03	<mutante>	restarting gerrit to increase heap_size from 20G to 32G (T225166 T222391)	[production]
2019-10-23 §
22:55	<brennen@deploy1001>	Synchronized php-1.35.0-wmf.3/extensions/AbuseFilter: SWAT: [[gerrit:545620\|Unbreak filter edit form (T236286)]] (duration: 01m 05s)	[production]
22:20	<twentyafterfour@deploy1001>	Finished deploy [phabricator/deployment@e4e2b22]: deploy to phab1001 (currently a warm spare server) (duration: 00m 21s)	[production]
22:20	<twentyafterfour@deploy1001>	Started deploy [phabricator/deployment@e4e2b22]: deploy to phab1001 (currently a warm spare server)	[production]
22:20	<twentyafterfour@deploy1001>	Finished deploy [phabricator/deployment@e4e2b22]: deploy to phab1001 (currently a warm spare server) (duration: 00m 05s)	[production]
22:19	<twentyafterfour@deploy1001>	Started deploy [phabricator/deployment@e4e2b22]: deploy to phab1001 (currently a warm spare server)	[production]
22:15	<twentyafterfour@deploy1001>	Finished deploy [phabricator/deployment@e4e2b22]: deploy to phab1001 (currently a warm spare server) (duration: 01m 10s)	[production]
22:14	<twentyafterfour@deploy1001>	Started deploy [phabricator/deployment@e4e2b22]: deploy to phab1001 (currently a warm spare server)	[production]
22:00	<twentyafterfour@deploy1001>	Finished deploy [phabricator/deployment@e4e2b22]: deploy to phab1001 (currently a warm spare server) (duration: 00m 21s)	[production]
22:00	<twentyafterfour@deploy1001>	Started deploy [phabricator/deployment@e4e2b22]: deploy to phab1001 (currently a warm spare server)	[production]
21:32	<mutante>	webperf1002/2002 - starting bacula-fd service that is failed after initial puppet run turning them into backup::hosts	[production]
21:14	<ejegg>	updated Fundraising python tools from b3c7453be2 to ffc7bf764b	[production]
20:37	<shdubsh>	restart nagios-nrpe-server on stat1007	[production]
18:56	<milimetric@deploy1001>	Finished deploy [analytics/refinery@3aaabf6]: Minor: fix two scripts (duration: 07m 53s)	[production]
18:49	<milimetric@deploy1001>	Started deploy [analytics/refinery@3aaabf6]: Minor: fix two scripts	[production]
18:29	<mforns@deploy1001>	Finished deploy [analytics/refinery@1110d59]: deploying refinery up to 1110d59c3983bcff4986bce1baf885f05ee06ba5 (duration: 06m 40s)	[production]
18:22	<mforns@deploy1001>	Started deploy [analytics/refinery@1110d59]: deploying refinery up to 1110d59c3983bcff4986bce1baf885f05ee06ba5	[production]
17:31	<akosiaris>	restart varnish-be on cp1089 as a response to HTTP availability alerts. High mailbox lag	[production]
17:25	<akosiaris>	restart varnish-be on cp1081 as a response to HTTP availability alerts	[production]