2016-02-09
§
|
22:02 |
<legoktm> |
reloading zuul to see if it'll pick up the new composer-php53 job |
[releng] |
21:53 |
<legoktm> |
enabling puppet on just integration-slave-trusty-1012 |
[releng] |
21:52 |
<legoktm> |
cherry-picked https://gerrit.wikimedia.org/r/#/c/269370/ onto integration-puppetmaster |
[releng] |
21:50 |
<legoktm> |
disabling puppet on all trusty/precise CI slaves |
[releng] |
21:40 |
<legoktm> |
deploying https://gerrit.wikimedia.org/r/269533 |
[releng] |
21:33 |
<Krenair> |
ran package upgrades on wikitech-static |
[production] |
20:37 |
<bblack> |
restarting nginx for libssl update on cp1049.eqiad.wmnet,cp4008.ulsfo.wmnet,cp3042.esams.wmnet,cp3049.esams.wmnet |
[production] |
20:32 |
<demon@mira> |
Finished scap: all group0 to wmf.13 (duration: 29m 45s) |
[production] |
20:25 |
<bblack> |
cache kernel reboots done (all on '3.19.0-2-amd64 #1 SMP Debian 3.19.3-9 (2016-01-04)', except 4x canaries on '4.4.0-1-amd64 #1 SMP Debian 4.4-1~wmf1 (2016-01-26)') |
[production] |
20:11 |
<bblack> |
cp1067, cp1071 (text, upload in eqiad) -> 4.4 canaries (rebooting over the next ~8 mins or so) |
[production] |
20:02 |
<demon@mira> |
Started scap: all group0 to wmf.13 |
[production] |
19:55 |
<hoo> |
Updated operations/dumps/dcat on snapshot1003 from 0a71deb232 to 92ab37d94e |
[production] |
19:37 |
<demon@mira> |
Finished scap: pruning tons of stale branches + sync wmf.13 files for later + testwiki to wmf.13 to build l10n cache (try 2) (duration: 27m 24s) |
[production] |
19:35 |
<bblack> |
cp3048 (upload esams) rebooting -> kernel 4.4 canary |
[production] |
19:13 |
<mutante> |
gerrit - add ppchelko to mediawiki-services |
[production] |
19:11 |
<bblack> |
cp4006 (upload ulsfo) rebooting -> kernel 4.4 canary |
[production] |
19:09 |
<demon@mira> |
Started scap: pruning tons of stale branches + sync wmf.13 files for later + testwiki to wmf.13 to build l10n cache (try 2) |
[production] |
19:07 |
<demon@mira> |
scap failed: CalledProcessError Command '/usr/local/bin/mwscript rebuildLocalisationCache.php --wiki="testwiki" --outdir="/tmp/scap_l10n_2315818744" --threads=10 --lang en --quiet' returned non-zero exit status 255 (duration: 00m 34s) |
[production] |
19:07 |
<demon@mira> |
Started scap: pruning tons of stale branches + sync wmf.13 files for later + testwiki to wmf.13 to build l10n cache |
[production] |
18:57 |
<yurik> |
deployed graphoid |
[production] |
18:05 |
<jynus> |
bringing down db1048's mysql for cloning to db2012 |
[production] |
17:54 |
<Krenair> |
ssh: connect to host mw1037.eqiad.wmnet port 22: Connection timed out |
[production] |
17:53 |
<krenair@mira> |
Synchronized php-1.27.0-wmf.12/extensions/OpenStackManager: https://gerrit.wikimedia.org/r/#/c/269439/ (duration: 03m 15s) |
[production] |
17:49 |
<marxarelli> |
disabled/enabled gearman in jenkins, connection works this time |
[releng] |
17:49 |
<marxarelli> |
performed stop/start of zuul on gallium to restore zuul and gearman |
[releng] |
17:45 |
<marxarelli> |
"Failed: Unable to Connect" in jenkins when testing gearman connection |
[releng] |
17:40 |
<marxarelli> |
killed old zuul process manually and restarted service |
[releng] |
17:39 |
<marxarelli> |
restart of zuul fails as well. old process cannot be killed |
[releng] |
17:38 |
<marxarelli> |
reloading zuul fails with "failed to kill 13660: Operation not permitted" |
[releng] |
17:26 |
<elukey> |
mc1004.eqiad put back into redis/memcached pool |
[production] |
17:23 |
<godog> |
nodetool-a removenode ec0c5a3d-2648-4933-8434-a8d163b92188 in preparation for restbase1007 bootstrap |
[production] |
17:22 |
<bblack> |
rebooting cp1008/pinkunicorn for 4.4-rt kernel test |
[production] |
17:19 |
<_joe_> |
powered down mw1037 |
[production] |
17:07 |
<godog> |
start cassandra-a on restbase1007 with replace_address=10.64.0.230 |
[production] |
16:57 |
<thcipriani@mira> |
Finished scap: SWAT: Clarify and expand messages mentioning loss of session data [[gerrit:269424]] (duration: 27m 36s) |
[production] |
16:53 |
<bblack> |
rebooting cp1008/pinkunicorn for 4.4 kernel |
[production] |
16:34 |
<jynus> |
reimage db2012 |
[production] |
16:30 |
<thcipriani@mira> |
Started scap: SWAT: Clarify and expand messages mentioning loss of session data [[gerrit:269424]] |
[production] |
16:18 |
<thcipriani@mira> |
Synchronized wmf-config: SWAT: Enable ArticlePlaceholder on test wikis [[gerrit:269399]] (duration: 01m 19s) |
[production] |
16:15 |
<thcipriani> |
mw1037.eqiad.wmnet error during SWAT rsync: failed to set times on "/srv/mediawiki/.": Read-only file system (30) |
[production] |
16:09 |
<thcipriani@mira> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable math data type on Wikidata and everywhere [[gerrit:269398]] (duration: 02m 31s) |
[production] |
16:06 |
<bd808> |
Deleted corrupt integration-slave-precise-1003:/mnt/jenkins-workspace/workspace/mediawiki-core-php53lint/.git |
[releng] |
15:59 |
<elukey> |
puppet re-enabled on kafka1012 |
[production] |
15:56 |
<paravoid> |
"power"cycling alsafi |
[production] |
15:55 |
<moritzm> |
uploaded linux 4.4-1~wmf1 (jessie-wikimedia/experimental) to carbon |
[production] |
15:53 |
<elukey> |
restarted kafka1012 with 48hrs of log retention |
[analytics] |
15:47 |
<_joe_> |
re-removed the puppet facts for protactinium |
[production] |
15:41 |
<elukey> |
kafka broker restarted - kafka1012 |
[analytics] |
15:40 |
<paravoid> |
echo 1 > /proc/sys/net/ipv4/vs/schedule_icmp on lvs3001 |
[production] |
15:36 |
<elukey> |
disabled puppet on kafka1012, changing temporary kafka retention to purge some extra logs |
[production] |