2016-06-29
§
|
12:32 |
<moritzm> |
powercycling elastic1010, stuck on reboot |
[production] |
12:11 |
<moritzm> |
powercycling mw1260, stuck on reboot |
[production] |
11:38 |
<jynus> |
halfway moving otrs backups from dbstore1001 to es2001 |
[production] |
11:27 |
<gehel> |
powercycling elastic1009 - stuck in reboot |
[production] |
11:11 |
<moritzm> |
powercycling mw1223, stuck on reboot |
[production] |
11:09 |
<gehel> |
deleting broken dewiki_titlesuggest index from codfw (T138811) |
[production] |
10:31 |
<elukey> |
rebooting analytics100[12] (Hadoop Yarn/HDFS master and standby) - One at the time forcing failover manually with daemon restarts |
[production] |
09:54 |
<moritzm> |
powercycling mw1163, stuck on reboot |
[production] |
09:23 |
<gehel> |
banning elastic1001 to 1016 from cluster to prepare their decommissioning (T138329) |
[production] |
09:20 |
<ema> |
upgrading diamond to 3.5-6 (T138758) |
[production] |
09:01 |
<elukey> |
rebooting analytics1028->1057 for kernel upgrades (Hadoop worker nodes) |
[production] |
08:55 |
<moritzm> |
powercycling mw1111, stuck on reboot |
[production] |
08:44 |
<elukey> |
puppet stopped on analytics1027 to prevent Camus job to run (prep step for Hadoop kernel upgrades) |
[production] |
08:40 |
<moritzm> |
powercycling mw1108, stuck on reboot |
[production] |
08:12 |
<moritzm> |
powercycling mw1099, stuck on reboot |
[production] |
08:12 |
<moritzm> |
powercycling mw1097, stuck on reboot |
[production] |
08:05 |
<moritzm> |
powercycling mw1092, stuck on reboot |
[production] |
07:47 |
<moritzm> |
rolling reboot of appservers in eqiad for kernel security update |
[production] |
07:16 |
<moritzm> |
powercycling snapshot1002, reboot stuck |
[production] |
07:11 |
<moritzm> |
powercycling snapshot1001, reboot stuck |
[production] |
06:58 |
<moritzm> |
rebooting most snapshot hosts for kernel security update |
[production] |
03:28 |
<krinkle@tin> |
Synchronized wmf-config/InitialiseSettings.php: test2wiki (duration: 00m 33s) |
[production] |
02:56 |
<l10nupdate@tin> |
ResourceLoader cache refresh completed at Wed Jun 29 02:56:32 UTC 2016 (duration 6m 30s) |
[production] |
02:50 |
<mwdeploy@tin> |
scap sync-l10n completed (1.28.0-wmf.8) (duration: 04m 48s) |
[production] |
02:30 |
<chasemp> |
labstore1004 is replicating NFS/DRBD shares to labstore1005 and they are large and it's taking a long time |
[production] |
02:29 |
<mwdeploy@tin> |
scap sync-l10n completed (1.28.0-wmf.7) (duration: 09m 21s) |
[production] |
02:18 |
<twentyafterfour@tin> |
rebuilt wikiversions.php and synchronized wikiversions files: sync wikiversions.json - group0 to 1.28.0-wmf.8 refs T137492 |
[production] |
02:16 |
<twentyafterfour> |
promoting group0 to 1.28.0-wmf.8 |
[production] |
00:02 |
<twentyafterfour@tin> |
Finished scap: sync new branch, testwiki to php-1.28.0-wmf.8 refs T137492 (duration: 51m 59s) |
[production] |
2016-06-28
§
|
23:10 |
<twentyafterfour@tin> |
Started scap: sync new branch, testwiki to php-1.28.0-wmf.8 refs T137492 |
[production] |
23:10 |
<Krenair> |
wikitech-static working now, poke me on IRC or file a #wikitech.wikimedia.org ticket if you find any issues |
[production] |
23:10 |
<twentyafterfour> |
syncing new branch 1.28.0-wmf.8 refs T137492 |
[production] |
23:04 |
<ebernhardson@tin> |
Synchronized php-1.28.0-wmf.7/extensions/EventBus/EventBus.php: SWAT: EventBus: Match the expected format of response log key (duration: 00m 31s) |
[production] |
23:01 |
<Krenair> |
Updating MW version on wikitech-static to 1.27 (LTS) - https://lists.wikimedia.org/pipermail/mediawiki-announce/2016-June/000191.html |
[production] |
21:59 |
<halfak> |
deploying ores beec291 |
[production] |
21:33 |
<twentyafterfour@tin> |
rebuilt wikiversions.php and synchronized wikiversions files: all wikis to 1.28.0-wmf.7 |
[production] |
21:31 |
<twentyafterfour@tin> |
Synchronized php-1.28.0-wmf.7/extensions/AbuseFilter/: deploy https://gerrit.wikimedia.org/r/#/c/296464/ refs T138550 T136973 (duration: 00m 36s) |
[production] |
21:24 |
<twentyafterfour> |
deploying wmf.7 yet again, once CI finishes testing https://gerrit.wikimedia.org/r/#/c/296464/ refs T138550 T136973 |
[production] |
20:24 |
<twentyafterfour@tin> |
rebuilt wikiversions.php and synchronized wikiversions files: once again rolling back to wmf.6 refs T136973 T138550 |
[production] |
20:11 |
<twentyafterfour@tin> |
rebuilt wikiversions.php and synchronized wikiversions files: all wikis to 1.28.0-wmf.7 |
[production] |
20:09 |
<twentyafterfour@tin> |
Synchronized php-1.28.0-wmf.7/extensions/AbuseFilter/: deploying https://gerrit.wikimedia.org/r/#/c/296440/ refs T138550, T136973 (duration: 02m 06s) |
[production] |
20:09 |
<twentyafterfour> |
deploying https://gerrit.wikimedia.org/r/#/c/296440/ to hopefully unblock wmf.7 deployments. refs T138550, T136973 |
[production] |
20:08 |
<gehel> |
disabling puppet on wdqs100[12] to cleanup after failed scap3 deplyoment |
[production] |
19:33 |
<twentyafterfour@tin> |
rebuilt wikiversions.php and synchronized wikiversions files: Rolling back to wmf.6: save time regression is still present in wmf.7 |
[production] |
19:32 |
<twentyafterfour> |
Rolling back to wmf.6: T138550 is still a problem |
[production] |
19:23 |
<twentyafterfour@tin> |
rebuilt wikiversions.php and synchronized wikiversions files: all wikis to 1.28.0-wmf.7 |
[production] |
19:23 |
<twentyafterfour> |
Deploying 1.28.0-wmf.7 to all wikis |
[production] |
18:23 |
<mutante> |
zosma - fresh install, sign puppet certs, initial puppet run |
[production] |
16:16 |
<gehel> |
starting rolling restart of elasticsearch codfw cluster (T138811) |
[production] |
15:25 |
<thcipriani@tin> |
Synchronized portals: SWAT: [[gerrit:296399|Bumping portals to master (T136874)]] (duration: 00m 29s) |
[production] |