8601-8650 of 10000 results (43ms)
2015-08-18 §
15:08 <thcipriani@tin> Synchronized wmf-config/CommonSettings.php: SWAT: Remove extra transcode enablings [[gerrit:232228]] (duration: 00m 13s) [production]
15:04 <andrewbogott> rebooting labvirt1006 [production]
13:57 <valhallasw`cloud> same issue seems to happen with the other hosts: tools-exec-1401.tools.eqiad.wmflabs vs tools-exec-1401.eqiad.wmflabs and tools-exec-catscan.tools.eqiad.wmflabs vs tools-exec-catscan.eqiad.wmflabs. [tools]
13:55 <valhallasw`cloud> no, wait, that's ''tools-webgrid-lighttpd-1411.eqiad.wmflabs'', not the actual host ''tools-webgrid-lighttpd-1411.tools.eqiad.wmflabs''. We should fix that dns mess as well. [tools]
13:54 <valhallasw`cloud> tried to restart gridengine-exec on tools-exec-1401, no effect. tools-webgrid-lighttpd-1411 also just went into 'au' state. [tools]
13:47 <valhallasw`cloud> that brought tools-exec-1403, tools-exec-1406 and tools-webgrid-generic-1402 back up, tools-exec-1401 and tools-exec-catscan are still in 'au' state [tools]
13:46 <valhallasw`cloud> starting gridengine-exec on hosts with queues in 'au' (=alarm, unknown) state using <code>for i in $(qstat -f -xml | grep "<state>au" -B 6 | grep "<name>" | cut -d'@' -f2 | cut -d. -f1); do echo $i; ssh $i sudo service gridengine-exec start; done</code> [tools]
08:37 <valhallasw`cloud> sudo service gridengine-exec start on tools-webgrid-lighttpd-1404.eqiad.wmflabs" tools-webgrid-lighttpd-1406.eqiad.wmflabs" tools-webgrid-lighttpd-1411.tools.eqiad.wmflabs" [tools]
08:33 <valhallasw`cloud> tools-webgrid-lighttpd-1403.eqiad.wmflabs, tools-webgrid-lighttpd-1404.eqiad.wmflabs and tools-webgrid-lighttpd-1411.tools.eqiad.wmflabs are all broken (queue dropped because it is temporarily not available) [tools]
08:30 <valhallasw`cloud> hostname mismatch: host is called tools-webgrid-lighttpd-1411.tools.eqiad.wmflabs in config, but it was named tools-webgrid-lighttpd-1411.eqiad.wmflabs in the hostgroup config [tools]
08:21 <valhallasw`cloud> still sudo qmod -e "*@tools-webgrid-lighttpd-1411.tools.eqiad.wmflabs" -> invalid queue "*@tools-webgrid-lighttpd-1411.tools.eqiad.wmflabs" [tools]
08:20 <valhallasw`cloud> sudo qconf -mhgrp "@webgrid", added tools-webgrid-lighttpd-1411.eqiad.wmflabs [tools]
08:18 <_joe_> reimaging mw1152 [production]
08:14 <godog> restart cassandra on restbase100[569] to pick up latest openjdk [production]
08:14 <valhallasw`cloud> and the hostgroup @webgrid doesn't even exist? (╯°□°)╯︵ ┻━┻ [tools]
08:10 <valhallasw`cloud> /var/lib/gridengine/etc/queues/webgrid-lighttpd does not seem to be the correct configuration as the current config refers to '@webgrid' as host list. [tools]
08:07 <valhallasw`cloud> sudo qconf -Ae /var/lib/gridengine/etc/exechosts/tools-webgrid-lighttpd-1411.tools.eqiad.wmflabs -> root@tools-bastion-01.eqiad.wmflabs added "tools-webgrid-lighttpd-1411.tools.eqiad.wmflabs" to exechost list [tools]
08:06 <valhallasw`cloud> ok, success. /var/lib/gridengine/etc/exechosts/tools-webgrid-lighttpd-1411.tools.eqiad.wmflabs now exists. Do I still have to add it manually to the grid? I suppose so. [tools]
08:04 <_joe_> depooling mw1152 from the imagescalers pool [production]
08:04 <valhallasw`cloud> installing packages from /data/project/.system/deb-trusty seems to fail. sudo apt-get update helps. [tools]
08:03 <godog> restart cassandra on restbase100[348] to pick up latest openjdk [production]
08:00 <valhallasw`cloud> running puppet agent -tv again [tools]
07:55 <valhallasw`cloud> argh. Disabling toollabs::node::web::generic again and enabling toollabs::node::web::lighttpd [tools]
07:54 <valhallasw`cloud> various issues such as Error: /Stage[main]/Gridengine::Submit_host/File[/var/lib/gridengine/default/common/accounting]/ensure: change from absent to link failed: Could not set 'link' on ensure: No such file or directory - /var/lib/gridengine/default/common at 17:/etc/puppet/modules/gridengine/manifests/submit_host.pp; probably an ordering issue in [tools]
07:53 <valhallasw`cloud> Setting up adminbot (1.7.8) ... chmod: cannot access '/usr/lib/adminbot/README': No such file or directory --- ran sudo touch /usr/lib/adminbot/README [tools]
07:37 <valhallasw`cloud> applying role::labs::tools::compute and toollabs::node::web::generic to \tools-webgrid-lighttpd-1411 [tools]
07:31 <valhallasw`cloud> reading puppet suggests I should qconf -ah /var/lib/gridengine/etc/exechosts/tools-webgrid-lighttpd-1411.tools.eqiad.wmflabs but that file is missing? [tools]
07:26 <valhallasw`cloud> andrewbogott built tools-webgrid-lighttpd-1411 yesterday but it's not actually added as exec host. Trying to figure out how to do that... [tools]
07:23 <legoktm> live hacking on mw1017 for T109236 [production]
05:45 <l10nupdate@tin> ResourceLoader cache refresh completed at Tue Aug 18 05:45:47 UTC 2015 (duration 45m 46s) [production]
02:25 <l10nupdate@tin> LocalisationUpdate completed (1.26wmf18) at 2015-08-18 02:25:28+00:00 [production]
02:21 <l10nupdate@tin> Synchronized php-1.26wmf18/cache/l10n: l10nupdate for 1.26wmf18 (duration: 06m 50s) [production]
00:02 <ottomata> analytics1041 down, attempting power cycle [production]
2015-08-17 §
22:19 <matt_flaschen> LQT->Flow done on MediaWiki.org. [production]
22:18 <legoktm> running schema change for [[gerrit:202344]] on beta [releng]
21:57 <mattflaschen@tin> Synchronized wmf-config: LQT->Flow: Make frozen wikis no longer able to create LQT pages (duration: 00m 13s) [production]
21:31 <chasemp> remove php5-xdebug from terbium per mattflaschen [production]
21:09 <MaxSem> renamed Gadget:Invention, Travel, & Adventure --> Gadget Invention, Travel, & Adventure on enwiki using moveBatch.php to work around a permissions screwup [production]
20:53 <bd808> T109369: Restarted logstash on logstash1003; parsoid gelf events not being recorded since 2015-08-15 [production]
20:16 <subbu> deployed parsoid version 4b656b72 [production]
19:19 <legoktm> freeing up disk space on 1012 [releng]
19:19 <ottomata> stopping kafka on analytics1012, preparing to reinstall with Jessie and rename to kafka1012 [production]
19:15 <legoktm> [11:45:39] <legoktm> !log freeing up disk space on 1017 [releng]
19:15 <legoktm> restarted qa-morebots [releng]
18:45 <legoktm> freeing up disk space on 1017 [releng]
16:17 <andrewbogott> disable queues for tools-exec-1205 tools-exec-1207 tools-exec-1208 tools-exec-140 tools-exec-1404 tools-exec-1409 tools-exec-1410 tools-exec-catscan tools-web-static-01 tools-webgrid-lighttpd-1201 tools-webgrid-lighttpd-1205 tools-webgrid lighttpd-1206 tools-webgrid-lighttpd-1406 tools-webproxy-02 [tools]
15:44 <krenair@tin> Synchronized wmf-config/extension-list: https://gerrit.wikimedia.org/r/#/c/232051/ - remove WikiGrok from extension-list, extension is no longer deployed (duration: 00m 11s) [production]
15:40 <krenair@tin> Synchronized multiversion/MWMultiVersion.php: https://gerrit.wikimedia.org/r/#/c/231982/2 - clear up --wiki usage to mwscript (duration: 00m 12s) [production]
15:34 <krenair@tin> Synchronized php-1.26wmf18/extensions/VisualEditor/modules/ve-mw/init/ve.init.mw.Target.js: https://gerrit.wikimedia.org/r/#/c/232048/ (duration: 00m 11s) [production]
15:33 <andrewbogott> re-enabling the queue on tools-exec-1211 tools-exec-1212 tools-exec-1215 tools-exec-1403 tools-exec-1406 tools-master tools-shadow tools-webgrid-generic-1402 tools-webgrid-lighttpd-1203 tools-webgrid-lighttpd-1208 tools-webgrid-lighttpd-1403 tools-webgrid-lighttpd-1404 tools-webproxy-01 [tools]