2017-06-27
|
06:40 |
<marostegui> |
Deploy alter table s7 - dbstore1002 - no_replicate_T166208.sh |
[production] |
05:58 |
<elukey> |
restored rdb2004 as slave of rdb2003 (end of experiment) |
[production] |
05:08 |
<marostegui> |
Global rename of Green Cardamom → GreenC - T168776 |
[production] |
05:04 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Repool db1079 - T166208 (duration: 00m 43s) |
[production] |
03:43 |
<mutante> |
smokeping on stretch means 2.6.11-3 vs 2.6.9-1 we had before |
[production] |
03:35 |
<mutante> |
smokeping - stop/rsync/fix permissions/start one more time to minimize gaps in graphs - now fully migrated netmon1001->netmon1002, historic data has been copied (T159756) |
[production] |
03:28 |
<mutante> |
netmon1002 - ganglia apache_status.py broken in stretch (?), ganglia deprecated, stopping gmond, aggregator role got removed, was for torrus |
[production] |
03:03 |
<mutante> |
netmon1002 - fixing permissions on /var/lib/smokeping rrd files (rsynced, inconsistent UIDs) |
[production] |
02:29 |
<l10nupdate@tin> |
ResourceLoader cache refresh completed at Tue Jun 27 02:29:22 UTC 2017 (duration 6m 25s) |
[production] |
02:22 |
<l10nupdate@tin> |
scap sync-l10n completed (1.30.0-wmf.6) (duration: 07m 46s) |
[production] |
00:39 |
<mutante> |
netmon1001 - rsyncing smokeping data (/var/lib/smokeping) over to netmon1002 |
[production] |
2017-06-26
|
23:56 |
<zhuyifei1999_> |
oops, depooled encoding02 instead |
[video] |
23:54 |
<zhuyifei1999_> |
depool encoding03 |
[video] |
23:51 |
<maxsem@tin> |
Synchronized php-1.30.0-wmf.6/extensions/Kartographer/: https://gerrit.wikimedia.org/r/#/c/361584/ (duration: 00m 44s) |
[production] |
23:50 |
<zhuyifei1999_> |
Doing that again, should be finally working |
[video] |
23:38 |
<maxsem@tin> |
Synchronized fonts/: https://gerrit.wikimedia.org/r/361195 (duration: 00m 45s) |
[production] |
23:34 |
<zhuyifei1999_> |
Doing that again :( |
[video] |
23:24 |
<twentyafterfour@tin> |
Synchronized php-1.30.0-wmf.6/extensions/Scribunto/engines/LuaSandbox/Engine.php: deploy https://gerrit.wikimedia.org/r/#/c/361508 (duration: 00m 43s) |
[production] |
23:24 |
<zhuyifei1999_> |
Fixing tmpfiles configuration on encoding0{1..3}, and rebooting; expecting to see workers back alive after reboot |
[video] |
23:23 |
<twentyafterfour> |
deploying https://gerrit.wikimedia.org/r/#/c/361508 |
[production] |
23:16 |
<zhuyifei1999_> |
failed to start because systemd-tmpfiles-setup failed. investigating |
[video] |
23:00 |
<zhuyifei1999_> |
starting v2c workers at encoding02 & 03, somehow they didn't start automatically after reboot |
[video] |
22:56 |
<halfak@tin> |
Finished deploy [ores/deploy@82dfd56]: Unscheduled/urgent deploy (T168099) (duration: 30m 55s) |
[production] |
22:49 |
<bd808> |
Updated LDAP loginShell to /bin/bash for 969 accounts that were still set to /usr/local/bin/sillyshell (T86668) |
[production] |
22:34 |
<legoktm@tin> |
Synchronized php-1.30.0-wmf.6/extensions/Linter/includes/ApiRecordLint.php: Add debug logging for missing 'dsr' - T168900 (duration: 00m 43s) |
[production] |
22:32 |
<legoktm@tin> |
Synchronized wmf-config/InitialiseSettings.php: Enable 'Linter' debug log channel (duration: 00m 44s) |
[production] |
22:27 |
<mutante> |
netmon1001 - deactivate rancid crons - now running on netmon1002 instead - avoid duplicate mails (T159756) |
[production] |
22:25 |
<halfak@tin> |
Started deploy [ores/deploy@82dfd56]: Unscheduled/urgent deploy (T168099) |
[production] |
22:24 |
<halfak> |
deploying ores-prod-deploy:82dfd56 to beta (note: T168099) |
[releng] |
22:20 |
<halfak> |
deploying ores-prod-deploy:82dfd56 to beta |
[releng] |
21:50 |
<robh> |
shutting down and decommissioning mw117[0-9] per T168271 |
[production] |
21:27 |
<bawolff> |
deployed patch for T128209 |
[production] |
21:00 |
<robh> |
attempting firmware update on lvs1007, which is currently offline |
[production] |
20:38 |
<bsitzmann@tin> |
Finished deploy [mobileapps/deploy@07066c7]: Update mobileapps to 0b05026 (duration: 03m 41s) |
[production] |
20:34 |
<bsitzmann@tin> |
Started deploy [mobileapps/deploy@07066c7]: Update mobileapps to 0b05026 |
[production] |
20:33 |
<bearND> |
Update mobileapps to 0b05026 |
[releng] |
19:56 |
<herron> |
updated ops list accept_these_nonmembers regex (T168903) |
[production] |
19:41 |
<hashar> |
Restarted Jenkins to lower console log spam ( https://gerrit.wikimedia.org/r/#/c/359116/ ) |
[production] |
19:35 |
<urandom> |
T160570: Upgrading restbase-dev1003 to Cassandra 3.11.0 (release) |
[production] |
19:30 |
<urandom> |
T160570: Upgrading restbase-dev1002 to Cassandra 3.11.0 (release) |
[production] |
19:05 |
<mobrovac@tin> |
Finished deploy [restbase/deploy@3975ab2]: Update Parsoid HTML version to 1.5.0 - T39902 (duration: 06m 16s) |
[production] |
18:59 |
<mobrovac@tin> |
Started deploy [restbase/deploy@3975ab2]: Update Parsoid HTML version to 1.5.0 - T39902 |
[production] |
18:51 |
<arlolra> |
Updated Parsoid to b59045f2 (T39902, T149794) |
[production] |
18:44 |
<hashar> |
nodepool image-delete 1636 # Deletes snapshot-ci-trusty-1498491445 which lacks nodejs when we still need it. |
[releng] |
18:33 |
<milimetric> |
Restarted celery workers on quarry-runner-01 and quarry-runner-02 (systemctl restart celery-quarry-worker.service) |
[quarry] |
18:32 |
<urandom> |
T160570: Upgrading restbase-dev1001 to Cassandra 3.11.0 (release) |
[production] |
18:31 |
<arlolra@tin> |
Finished deploy [parsoid/deploy@70538a6]: Updating Parsoid to b59045f2 (duration: 11m 13s) |
[production] |
18:23 |
<twentyafterfour> |
renamed previously active image to 'image-ci-trusty_bad_20170626' |
[releng] |
18:22 |
<twentyafterfour> |
reverted nodepool image-ci-trusty to previous version 'image-ci-trusty-old_20170626' |
[releng] |
18:20 |
<arlolra@tin> |
Started deploy [parsoid/deploy@70538a6]: Updating Parsoid to b59045f2 |
[production] |