2015-08-18
§
|
08:37 |
<valhallasw`cloud> |
sudo service gridengine-exec start on tools-webgrid-lighttpd-1404.eqiad.wmflabs" tools-webgrid-lighttpd-1406.eqiad.wmflabs" tools-webgrid-lighttpd-1411.tools.eqiad.wmflabs" |
[tools] |
08:33 |
<valhallasw`cloud> |
tools-webgrid-lighttpd-1403.eqiad.wmflabs, tools-webgrid-lighttpd-1404.eqiad.wmflabs and tools-webgrid-lighttpd-1411.tools.eqiad.wmflabs are all broken (queue dropped because it is temporarily not available) |
[tools] |
08:30 |
<valhallasw`cloud> |
hostname mismatch: host is called tools-webgrid-lighttpd-1411.tools.eqiad.wmflabs in config, but it was named tools-webgrid-lighttpd-1411.eqiad.wmflabs in the hostgroup config |
[tools] |
08:21 |
<valhallasw`cloud> |
still sudo qmod -e "*@tools-webgrid-lighttpd-1411.tools.eqiad.wmflabs" -> invalid queue "*@tools-webgrid-lighttpd-1411.tools.eqiad.wmflabs" |
[tools] |
08:20 |
<valhallasw`cloud> |
sudo qconf -mhgrp "@webgrid", added tools-webgrid-lighttpd-1411.eqiad.wmflabs |
[tools] |
08:18 |
<_joe_> |
reimaging mw1152 |
[production] |
08:14 |
<godog> |
restart cassandra on restbase100[569] to pick up latest openjdk |
[production] |
08:14 |
<valhallasw`cloud> |
and the hostgroup @webgrid doesn't even exist? (╯°□°)╯︵ ┻━┻ |
[tools] |
08:10 |
<valhallasw`cloud> |
/var/lib/gridengine/etc/queues/webgrid-lighttpd does not seem to be the correct configuration as the current config refers to '@webgrid' as host list. |
[tools] |
08:07 |
<valhallasw`cloud> |
sudo qconf -Ae /var/lib/gridengine/etc/exechosts/tools-webgrid-lighttpd-1411.tools.eqiad.wmflabs -> root@tools-bastion-01.eqiad.wmflabs added "tools-webgrid-lighttpd-1411.tools.eqiad.wmflabs" to exechost list |
[tools] |
08:06 |
<valhallasw`cloud> |
ok, success. /var/lib/gridengine/etc/exechosts/tools-webgrid-lighttpd-1411.tools.eqiad.wmflabs now exists. Do I still have to add it manually to the grid? I suppose so. |
[tools] |
08:04 |
<_joe_> |
depooling mw1152 from the imagescalers pool |
[production] |
08:04 |
<valhallasw`cloud> |
installing packages from /data/project/.system/deb-trusty seems to fail. sudo apt-get update helps. |
[tools] |
08:03 |
<godog> |
restart cassandra on restbase100[348] to pick up latest openjdk |
[production] |
08:00 |
<valhallasw`cloud> |
running puppet agent -tv again |
[tools] |
07:55 |
<valhallasw`cloud> |
argh. Disabling toollabs::node::web::generic again and enabling toollabs::node::web::lighttpd |
[tools] |
07:54 |
<valhallasw`cloud> |
various issues such as Error: /Stage[main]/Gridengine::Submit_host/File[/var/lib/gridengine/default/common/accounting]/ensure: change from absent to link failed: Could not set 'link' on ensure: No such file or directory - /var/lib/gridengine/default/common at 17:/etc/puppet/modules/gridengine/manifests/submit_host.pp; probably an ordering issue in |
[tools] |
07:53 |
<valhallasw`cloud> |
Setting up adminbot (1.7.8) ... chmod: cannot access '/usr/lib/adminbot/README': No such file or directory --- ran sudo touch /usr/lib/adminbot/README |
[tools] |
07:37 |
<valhallasw`cloud> |
applying role::labs::tools::compute and toollabs::node::web::generic to \tools-webgrid-lighttpd-1411 |
[tools] |
07:31 |
<valhallasw`cloud> |
reading puppet suggests I should qconf -ah /var/lib/gridengine/etc/exechosts/tools-webgrid-lighttpd-1411.tools.eqiad.wmflabs but that file is missing? |
[tools] |
07:26 |
<valhallasw`cloud> |
andrewbogott built tools-webgrid-lighttpd-1411 yesterday but it's not actually added as exec host. Trying to figure out how to do that... |
[tools] |
07:23 |
<legoktm> |
live hacking on mw1017 for T109236 |
[production] |
05:45 |
<l10nupdate@tin> |
ResourceLoader cache refresh completed at Tue Aug 18 05:45:47 UTC 2015 (duration 45m 46s) |
[production] |
02:25 |
<l10nupdate@tin> |
LocalisationUpdate completed (1.26wmf18) at 2015-08-18 02:25:28+00:00 |
[production] |
02:21 |
<l10nupdate@tin> |
Synchronized php-1.26wmf18/cache/l10n: l10nupdate for 1.26wmf18 (duration: 06m 50s) |
[production] |
00:02 |
<ottomata> |
analytics1041 down, attempting power cycle |
[production] |
2015-08-17
§
|
22:19 |
<matt_flaschen> |
LQT->Flow done on MediaWiki.org. |
[production] |
22:18 |
<legoktm> |
running schema change for [[gerrit:202344]] on beta |
[releng] |
21:57 |
<mattflaschen@tin> |
Synchronized wmf-config: LQT->Flow: Make frozen wikis no longer able to create LQT pages (duration: 00m 13s) |
[production] |
21:31 |
<chasemp> |
remove php5-xdebug from terbium per mattflaschen |
[production] |
21:09 |
<MaxSem> |
renamed Gadget:Invention, Travel, & Adventure --> Gadget Invention, Travel, & Adventure on enwiki using moveBatch.php to work around a permissions screwup |
[production] |
20:53 |
<bd808> |
T109369: Restarted logstash on logstash1003; parsoid gelf events not being recorded since 2015-08-15 |
[production] |
20:16 |
<subbu> |
deployed parsoid version 4b656b72 |
[production] |
19:19 |
<legoktm> |
freeing up disk space on 1012 |
[releng] |
19:19 |
<ottomata> |
stopping kafka on analytics1012, preparing to reinstall with Jessie and rename to kafka1012 |
[production] |
19:15 |
<legoktm> |
[11:45:39] <legoktm> !log freeing up disk space on 1017 |
[releng] |
19:15 |
<legoktm> |
restarted qa-morebots |
[releng] |
18:45 |
<legoktm> |
freeing up disk space on 1017 |
[releng] |
16:17 |
<andrewbogott> |
disable queues for tools-exec-1205 tools-exec-1207 tools-exec-1208 tools-exec-140 tools-exec-1404 tools-exec-1409 tools-exec-1410 tools-exec-catscan tools-web-static-01 tools-webgrid-lighttpd-1201 tools-webgrid-lighttpd-1205 tools-webgrid lighttpd-1206 tools-webgrid-lighttpd-1406 tools-webproxy-02 |
[tools] |
15:44 |
<krenair@tin> |
Synchronized wmf-config/extension-list: https://gerrit.wikimedia.org/r/#/c/232051/ - remove WikiGrok from extension-list, extension is no longer deployed (duration: 00m 11s) |
[production] |
15:40 |
<krenair@tin> |
Synchronized multiversion/MWMultiVersion.php: https://gerrit.wikimedia.org/r/#/c/231982/2 - clear up --wiki usage to mwscript (duration: 00m 12s) |
[production] |
15:34 |
<krenair@tin> |
Synchronized php-1.26wmf18/extensions/VisualEditor/modules/ve-mw/init/ve.init.mw.Target.js: https://gerrit.wikimedia.org/r/#/c/232048/ (duration: 00m 11s) |
[production] |
15:33 |
<andrewbogott> |
re-enabling the queue on tools-exec-1211 tools-exec-1212 tools-exec-1215 tools-exec-1403 tools-exec-1406 tools-master tools-shadow tools-webgrid-generic-1402 tools-webgrid-lighttpd-1203 tools-webgrid-lighttpd-1208 tools-webgrid-lighttpd-1403 tools-webgrid-lighttpd-1404 tools-webproxy-01 |
[tools] |
15:07 |
<krenair@tin> |
Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/231984/ (duration: 00m 13s) |
[production] |
15:05 |
<andrewbogott> |
rebooting labvirt1004 |
[production] |
15:03 |
<krenair@tin> |
Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/232021/ (duration: 00m 12s) |
[production] |
14:50 |
<andrewbogott> |
killing remaining jobs on tools-exec-1211 tools-exec-1212 tools-exec-1215 tools-exec-1403 tools-exec-1406 tools-master tools-shadow tools-webgrid-generic-1402 tools-webgrid-lighttpd-1203 tools-webgrid-lighttpd-1208 tools-webgrid-lighttpd-1403 tools-webgrid-lighttpd-1404 tools-webproxy-01 |
[tools] |
14:30 |
<mobrovac> |
restbase updated production cluster to ed17952 |
[production] |
14:12 |
<mobrovac> |
restbase deployed ed17952 on restbase1001 |
[production] |
13:58 |
<mobrovac> |
restbase deploying ed17952 on staging |
[production] |