2016-03-21
|
11:25 |
<elukey> |
beta: cherry picked https://gerrit.wikimedia.org/r/#/c/278713/ to test an update to the cdh module (analytics) |
[releng] |
11:13 |
<hashar> |
beta: rebased puppet master, which had a conflict on https://gerrit.wikimedia.org/r/#/c/274711/ which got merged meanwhile (saves Elukey) |
[releng] |
11:02 |
<hashar> |
beta: added Elukey (wikimedia ops) to the project as member and admin |
[releng] |
10:52 |
<hashar> |
Live hacked puppet compiler on compiler02.puppet3-diffs.eqiad.wmflabs to debug it not processing submodules. Reinstalled it from the last tag in the process |
[production] |
10:24 |
<jynus> |
Altering user_properties engine to InnoDB on db1069:3313 |
[production] |
09:26 |
<jynus> |
Altering change_tag engine to InnoDB on db1069:3313 |
[production] |
09:09 |
<elukey> |
restarted hhvm on mw1116 |
[production] |
02:33 |
<l10nupdate@tin> |
ResourceLoader cache refresh completed at Mon Mar 21 02:33:31 UTC 2016 (duration 8m 41s) |
[production] |
02:24 |
<mwdeploy@tin> |
sync-l10n completed (1.27.0-wmf.17) (duration: 10m 57s) |
[production] |
2016-03-19
|
22:28 |
<jynus> |
powercycling oxygen, looks kernel-dead |
[production] |
22:16 |
<urandom> |
removing 22G of heap dumps from restbase2004.codfw.wmnet |
[production] |
22:16 |
<urandom> |
removing 22G of heap dumps |
[production] |
22:07 |
<urandom> |
clearing snapshots on restbase2004.codfw.wmnet |
[production] |
20:04 |
<halfak> |
ores-web-01/02 and ores-worker-01/02/03/04 deleted. ores-web-03/04/05 and ores-worker-05/06/07/08/09/10 started and configured as replacements. |
[ores] |
16:43 |
<halfak> |
Manually ran `sudo apt-get install aspell-ar aspell-pl` across web and worker nodes |
[ores] |
15:48 |
<halfak> |
Ran puppet and restarted uwsgi on web-01 with 28 forks rather than 32 |
[ores] |
15:43 |
<reedy@tin> |
Synchronized wmf-config/throttle.php: Throttle rules for event T130447 (duration: 00m 26s) |
[production] |
15:32 |
<halfak> |
workers_per_core 32 --> 28 |
[ores] |
13:04 |
<hashar> |
Jenkins: added ldap-labs-codfw.wikimedia.org as a fallback LDAP server T130446 |
[releng] |
12:34 |
<godog> |
service supervisor stop, causing high traffic from ldap server T130446 |
[zulip] |
12:30 |
<godog> |
restart nslcd on zulip-01 |
[zulip] |
11:38 |
<godog> |
restart slapd on seaborgium, oom-killed |
[production] |
10:51 |
<hashar> |
Labs LDAP is probably down. T130446 Can't log in to tools-login.wmflabs.org / Jenkins interface, and Nodepool yields error 500 communicating with OpenStack API |
[production] |
02:31 |
<l10nupdate@tin> |
ResourceLoader cache refresh completed at Sat Mar 19 02:31:46 UTC 2016 (duration 8m 31s) |
[production] |
02:23 |
<mwdeploy@tin> |
sync-l10n completed (1.27.0-wmf.17) (duration: 10m 07s) |
[production] |
01:54 |
<urandom> |
bootstrapping restbase1013-b.eqiad.wmnet : T125842 |
[production] |
2016-03-18
|
23:35 |
<krinkle@tin> |
Synchronized php-1.27.0-wmf.17/extensions/WikimediaEvents/modules/ext.wikimediaEvents.deprecate.js: (no message) (duration: 00m 35s) |
[production] |
21:11 |
<ostriches> |
cleaned up stale /srv/mediawiki/php-1.27.0-wmf.{10,11} from the apaches. |
[production] |
21:09 |
<krinkle@tin> |
Synchronized wmf-config/missing.php: (no message) (duration: 00m 25s) |
[production] |
20:53 |
<ottomata> |
reenabling puppet on krypton |
[production] |
19:52 |
<ottomata> |
temporarily disabling puppet on krypton |
[production] |
19:21 |
<ori> |
rebooting bohrium |
[production] |
19:20 |
<ori> |
upgraded bohrium VM: vcpus 2 => 8, ram 4 => 8g |
[production] |
19:06 |
<ori@tin> |
Synchronized wmf-config/logging.php: Iabca8858e: Allow finer-grained control over debug logging via XWD (duration: 00m 32s) |
[production] |
18:56 |
<demon@tin> |
Synchronized .arclint: no op really, co master sync (duration: 00m 39s) |
[production] |
18:13 |
<gehel> |
activating automatic deployment of portals (https://gerrit.wikimedia.org/r/#/c/276397/) |
[deployment-prep] |
18:08 |
<gehel> |
restarting elasticsearch server elastic1031.eqiad.wmnet |
[production] |
17:59 |
<mutante> |
netmon1001: failed torrus service - recovery steps as outlined on wikitech [[Torrus]] |
[production] |
17:55 |
<ori> |
on bohrium: /etc/apache2/sites-enabled/.links2 ; was causing puppet to refresh apache2 on each run |
[production] |
17:30 |
<gehel> |
restarting elasticsearch server elastic1030.eqiad.wmnet |
[production] |
17:16 |
<jzerebecki> |
reloading zuul for e33494f..89a9659 |
[releng] |
17:05 |
<gehel> |
restarting elasticsearch server elastic1029.eqiad.wmnet |
[production] |
16:53 |
<jynus> |
starting enwiki import to labs from dbstore1002 (expect lag and consistency problems during the hot import) |
[production] |
16:37 |
<moritzm> |
restarted hhvm on mw1205 |
[production] |