2015-04-20 §
16:10 <hashar> deployment-salt: kill -9 of puppetmaster processes [releng]
16:08 <hashar> deployment-salt: killed git-sync-upstream, netcat to labmon1001.eqiad.wmnet 8125 was eating all memory [releng]
16:04 <hashar> beta: manually rebasing operations/puppet on deployment-salt. Might have killed some live hacks in the process :/ [releng]
13:58 <hashar> In Gerrit, hid the integration/jenkins-job-builder-config and integration/zuul-config historical repositories. Suggested by addshore on {{bug:T96522}} [releng]
03:39 <legoktm> deploying https://gerrit.wikimedia.org/r/205174 [releng]
2015-04-19 §
06:12 <legoktm> deploying https://gerrit.wikimedia.org/r/205076 [releng]
2015-04-18 §
05:18 <legoktm> deploying https://gerrit.wikimedia.org/r/204995 [releng]
03:09 <Krinkle> Finished set up of integration-slave-trusty-1017. Pooled. [releng]
2015-04-17 §
17:52 <Krinkle> Reloading Zuul to deploy https://gerrit.wikimedia.org/r/204812 [releng]
17:45 <Krinkle> Creating integration-slave-trusty-1017 [releng]
16:29 <Krinkle> Reloading Zuul to deploy https://gerrit.wikimedia.org/r/204791 [releng]
16:00 <Krinkle> Reloading Zuul to deploy https://gerrit.wikimedia.org/r/204783 [releng]
12:42 <hashar> restarting Jenkins [releng]
12:38 <hashar> Switching zuul on lanthanum.eqiad.wmnet to the Debian package version [releng]
12:14 <hashar> Switching Zuul scheduler on gallium.wikimedia.org to the Debian package version [releng]
12:12 <hashar> Jenkins: enabled plugin "ZMQ Event Publisher" and publishing all job results on TCP port 8888 [releng]
05:37 <legoktm> deploying https://gerrit.wikimedia.org/r/204706 [releng]
01:11 <Krinkle> Repool integration-slave-precise-1013 and integration-slave-trusty-1015 (live hack with libeatmydata enabled for mysql; T96308) [releng]
2015-04-16 §
22:08 <Krinkle> Rebooting integration-slave-precise-1013 (depooled; experimenting with libeatmydata) [releng]
22:07 <Krinkle> Rebooted integration-slave-trusty-1015 (experimenting with libeatmydata) [releng]
18:31 <Krinkle> Rebooting integration-slave-precise-1012 and integration-slave-trusty-1012 [releng]
17:57 <Krinkle> Repooled instances. Conversion of mysql.datadir to tmpfs worked, but puppet run has errors. Coren and Krinkle working on it. https://gerrit.wikimedia.org/r/#/c/204528/ (T96230) [releng]
17:22 <Krinkle> Gracefully depool integration slaves to deploy https://gerrit.wikimedia.org/r/#/c/204528/ (T96230) [releng]
14:35 <thcipriani> running dpkg --configure -a on deployment-bastion to correct puppet failures [releng]
2015-04-15 §
23:21 <Krinkle> beta-update-databases-eqiad stuck waiting for executors on a node that has plenty of executors available [releng]
21:15 <hashar> Jenkins browser test jobs sometimes deadlock because of the IRC notification plugin https://phabricator.wikimedia.org/T96183 [releng]
20:34 <hashar> hard restarting Jenkins [releng]
19:24 <Krinkle> Aborting browser test jobs. Stuck for over 5 hours. [releng]
19:24 <Krinkle> Aborting beta-scap-eqiad. Has been stuck for 2 hours on "Notifying IRC" after "Connection time out" from scap. [releng]
08:22 <hashar> restarted Jenkins [releng]
08:20 <hashar> Exception in thread "RequestHandlerThread[#2]" java.lang.OutOfMemoryError: Java heap space [releng]
08:16 <hashar> Jenkins process went wild, keeping all CPUs busy on gallium [releng]
2015-04-14 §
20:43 <legoktm> starting SULF on beta cluster [releng]
20:42 <marktraceur> stopping all beta jobs, aborting running (and stuck) beta DB update, kicking bastion, to try and get beta to update [releng]
19:49 <Krinkle> All systems go. [releng]
19:48 <Krinkle> Jenkins configuration panel won't load ("Loading..." stays indefinitely, "Uncaught TypeError: Cannot convert to object at prototype.js:195") [releng]
19:46 <Krinkle> Jenkins restarted. Relaunching Gearman [releng]
19:42 <Krinkle> Jenkins still unable to obtain Gearman connection. (HTTP 503 error from /configure). Have to force restart Jenkins. [releng]
19:42 <Krinkle> deployment-bastion jobs were stuck. marktraceur cancelled queue and relaunched slave. Now processing again. [releng]
15:27 <Krinkle> puppetmaster: Re-apply I05c49e5248cb operations/puppet patch to re-fix T91524. Somehow the patch got lost. [releng]
08:46 <hashar> does qa-morebots work? [releng]
2015-04-13 §
20:14 <Krinkle> Restarting Zuul, Jenkins and aborting all builds. Everything got stuck following NFS outage in lab [releng]
17:01 <legoktm> deploying https://gerrit.wikimedia.org/r/203858 [releng]
13:56 <Krinkle> Delete old integration-slave1001...1004 (T94916) [releng]
10:43 <hashar> reducing number of executors on Precise instances from 5 to 4 and on Trusty instances from 6 to 4. The Jenkins scheduler tends to assign the unified jobs to the same slave, which overloads a single slave while others are idling. [releng]
10:43 <hashar> reducing number of executors from 5 to 4 [releng]
08:46 <hashar> Jenkins: removed #wikimedia-qa IRC channel from the global configuration [releng]
08:42 <hashar> kill -9 jenkins because it was stuck in some deadlock related to the IRC plugin :( [releng]
08:34 <zeljkof> restarting stuck Jenkins [releng]
2015-04-12 §
23:58 <bd808> sudo ln -s /srv/l10nupdate/mediawiki /var/lib/l10nupdate/mediawiki on deployment-bastion [releng]