production SAL

2016-06-08
16:36	<hashar>	Disabled puppet on contint1001 to prevent it from bringing back Jenkins	[production]
16:32	<otto@palladium>	conftool action : set/pooled=no; selector: scb1002.eqiad.wmnet (tags: ['dc=eqiad', 'cluster=scb', 'service=mathoid'])	[production]
16:32	<otto@palladium>	conftool action : set/pooled=no; selector: scb1002.eqiad.wmnet (tags: ['dc=eqiad', 'cluster=scb', 'service=ores'])	[production]
16:32	<otto@palladium>	conftool action : set/pooled=no; selector: scb1002.eqiad.wmnet (tags: ['dc=eqiad', 'cluster=scb', 'service=mobileapps'])	[production]
16:32	<otto@palladium>	conftool action : set/pooled=no; selector: scb1002.eqiad.wmnet (tags: ['dc=eqiad', 'cluster=scb', 'service=cxserver'])	[production]
16:32	<otto@palladium>	conftool action : set/pooled=no; selector: scb1002.eqiad.wmnet (tags: ['dc=eqiad', 'cluster=scb', 'service=citoid'])	[production]
16:32	<otto@palladium>	conftool action : set/pooled=no; selector: scb1002.eqiad.wmnet (tags: ['dc=eqiad', 'cluster=scb', 'service=graphoid'])	[production]
16:24	<ottomata>	restarting hadoop-yarn-resourcemanager on analytics1002 to make analytics1001 active	[production]
16:07	<mobrovac>	re-enabling puppet on scb1002	[production]
16:02	<elukey>	temporarily set a 10TB upper bound on the Kafka webrequest_text topic to free space (T136690)	[production]
15:43	<ottomata>	restarting zk in codfw and eqiad 1 by 1 to apply maxClientCnxns=1024	[production]
15:12	<ottomata>	restarting zookeeper 1 by 1 in eqiad	[production]
15:03	<_joe_>	contint1001: systemctl mask zuul,zuul-merger	[production]
14:57	<elukey>	rolling out the new Varnishkafka version in cache misc (didn't do it before since there was an outage ongoing)	[production]
14:53	<jynus>	rebooting gallium with netboot for hardware maintenance	[production]
14:44	<mobrovac>	enabling and running puppet on scb1001	[production]
13:44	<jynus>	running fsck.ext3 /dev/sda2 in read-write mode for gallium	[production]
13:42	<ottomata>	powercycling scb2001 and scb2002	[production]
13:30	<akosiaris>	disabling puppet on scb1001 & scb1002	[production]
13:30	<mobrovac>	change-prop stopped on scb1002	[production]
13:29	<akosiaris>	stopping changeprop on scb1001	[production]
13:26	<ottomata>	powercycling scb1002	[production]
13:18	<ottomata>	powercycling scb1001	[production]
13:08	<elukey>	rolling out new varnishkafka package in cache misc	[production]
12:09	<jynus>	temporarily mounted the / partition from gallium's sda on db1085:/mnt	[production]
10:40	<moritzm>	uploaded jenkins 1.651.2 for jessie-wikimedia to carbon	[production]
10:13	<elukey>	rolling out the new varnishkafka package to cache maps	[production]
10:04	<aaron@tin>	Synchronized php-1.28.0-wmf.5/includes/deferred/LinksDeletionUpdate.php: fd44d649787ede78687b4cd2ef21e44a4c8b843b (duration: 00m 33s)	[production]
08:28	<hashar>	stopping Jenkins / zuul / zuul-merger / puppet on gallium	[production]
08:15	<elukey>	lowering the webrequest_text Kafka topic retention time from 7 days to 4 days to free disk space (T136690)	[production]
08:14	<hashar>	Jenkins has a bunch of dead executors for whatever reason, preventing jobs from running :(	[production]
07:53	<mobrovac>	change-prop deploying 84d56e53a	[production]
06:59	<moritzm>	enabling ferm on palladium (will lead to temporary puppet failures)	[production]
02:58	<l10nupdate@tin>	ResourceLoader cache refresh completed at Wed Jun 8 02:58:28 UTC 2016 (duration 6m 31s)	[production]
02:51	<mwdeploy@tin>	scap sync-l10n completed (1.28.0-wmf.5) (duration: 06m 49s)	[production]
02:51	<legoktm>	/ on gallium is currently read-only for some reason	[production]
02:29	<mwdeploy@tin>	scap sync-l10n completed (1.28.0-wmf.4) (duration: 11m 11s)	[production]
00:11	<awight_>	update fundraising-tools from b2425aef2154d6b689900f4848cca02880321230 to 28bc2da677caa795c58f906db76a1f8d612ac899	[production]
2016-06-07
23:46	<aaron@tin>	Synchronized php-1.28.0-wmf.5/includes/deferred/LinksUpdate.php: 6d85caaa9bb5918cb2888fc82f2c7c346cf746a2 (duration: 00m 25s)	[production]
23:35	<SMalyshev>	redeploying WDQS to update the Updater for T128947 fix	[production]
23:35	<tgr@tin>	Synchronized wmf-config/InitialiseSettings.php: SWAT [[gerrit:292518]] User rights configuration for meta. wmf-supportsafety group (duration: 00m 26s)	[production]
23:20	<tgr@tin>	Finished scap: (no message) (duration: 24m 51s)	[production]
23:02	<awight>	update paymentswiki from 28e10141454ef53085aed4c6619a34d3a4b43c58 to de11bfe2273d0bcaa0e713389b2d91e8b3567a1d; add PP cert	[production]
22:56	<tgr>	scapping AuthManager backports + feature switch enabled on group0 T135504	[production]
22:56	<tgr@tin>	Started scap: (no message)	[production]
22:10	<mutante>	icinga config broken: Error: Could not find any host matching 'relforge1001'	[production]
21:35	<twentyafterfour>	restarted apache on iridium to deploy D250	[production]
20:02	<andrewbogott>	dist-upgrade on labvirt1010, in hopes of resolving a nova-compute lockup (possibly related to a kvm upgrade earlier today)	[production]
20:00	<thcipriani@tin>	rebuilt wikiversions.php and synchronized wikiversions files: group0 to 1.28.0-wmf.5	[production]
19:44	<jynus>	restarting es2017 due to a bunch of ACPI errors (probably memory-caused)	[production]