751-800 of 10000 results (30ms)
2016-06-29 §
10:31 <elukey> rebooting analytics100[12] (Hadoop Yarn/HDFS master and standby) - One at the time forcing failover manually with daemon restarts [production]
09:54 <moritzm> powercycling mw1163, stuck on reboot [production]
09:23 <gehel> banning elastic1001 to 1016 from cluster to prepare their decommissioning (T138329) [production]
09:20 <ema> upgrading diamond to 3.5-6 (T138758) [production]
09:01 <elukey> rebooting analytics1028->1057 for kernel upgrades (Hadoop worker nodes) [production]
08:55 <moritzm> powercycling mw1111, stuck on reboot [production]
08:44 <elukey> puppet stopped on analytics1027 to prevent Camus job to run (prep step for Hadoop kernel upgrades) [production]
08:40 <moritzm> powercycling mw1108, stuck on reboot [production]
08:12 <moritzm> powercycling mw1099, stuck on reboot [production]
08:12 <moritzm> powercycling mw1097, stuck on reboot [production]
08:05 <moritzm> powercycling mw1092, stuck on reboot [production]
07:47 <moritzm> rolling reboot of appservers in eqiad for kernel security update [production]
07:16 <moritzm> powercycling snapshot1002, reboot stuck [production]
07:11 <moritzm> powercycling snapshot1001, reboot stuck [production]
06:58 <moritzm> rebooting most snapshot hosts for kernel security update [production]
03:28 <krinkle@tin> Synchronized wmf-config/InitialiseSettings.php: test2wiki (duration: 00m 33s) [production]
02:56 <l10nupdate@tin> ResourceLoader cache refresh completed at Wed Jun 29 02:56:32 UTC 2016 (duration 6m 30s) [production]
02:50 <mwdeploy@tin> scap sync-l10n completed (1.28.0-wmf.8) (duration: 04m 48s) [production]
02:30 <chasemp> labstore1004 is replicating NFS/DRBD shares to labstore1005 and they are large and it's taking a long time [production]
02:29 <mwdeploy@tin> scap sync-l10n completed (1.28.0-wmf.7) (duration: 09m 21s) [production]
02:18 <twentyafterfour@tin> rebuilt wikiversions.php and synchronized wikiversions files: sync wikiversions.json - group0 to 1.28.0-wmf.8 refs T137492 [production]
02:16 <twentyafterfour> promoting group0 to 1.28.0-wmf.8 [production]
00:02 <twentyafterfour@tin> Finished scap: sync new branch, testwiki to php-1.28.0-wmf.8 refs T137492 (duration: 51m 59s) [production]
2016-06-28 §
23:10 <twentyafterfour@tin> Started scap: sync new branch, testwiki to php-1.28.0-wmf.8 refs T137492 [production]
23:10 <Krenair> wikitech-static working now, poke me on IRC or file a #wikitech.wikimedia.org ticket if you find any issues [production]
23:10 <twentyafterfour> syncing new branch 1.28.0-wmf.8 refs T137492 [production]
23:04 <ebernhardson@tin> Synchronized php-1.28.0-wmf.7/extensions/EventBus/EventBus.php: SWAT: EventBus: Match the expected format of response log key (duration: 00m 31s) [production]
23:01 <Krenair> Updating MW version on wikitech-static to 1.27 (LTS) - https://lists.wikimedia.org/pipermail/mediawiki-announce/2016-June/000191.html [production]
21:59 <halfak> deploying ores beec291 [production]
21:33 <twentyafterfour@tin> rebuilt wikiversions.php and synchronized wikiversions files: all wikis to 1.28.0-wmf.7 [production]
21:31 <twentyafterfour@tin> Synchronized php-1.28.0-wmf.7/extensions/AbuseFilter/: deploy https://gerrit.wikimedia.org/r/#/c/296464/ refs T138550 T136973 (duration: 00m 36s) [production]
21:24 <twentyafterfour> deploying wmf.7 yet again, once CI finishes testing https://gerrit.wikimedia.org/r/#/c/296464/ refs T138550 T136973 [production]
20:24 <twentyafterfour@tin> rebuilt wikiversions.php and synchronized wikiversions files: once again rolling back to wmf.6 refs T136973 T138550 [production]
20:11 <twentyafterfour@tin> rebuilt wikiversions.php and synchronized wikiversions files: all wikis to 1.28.0-wmf.7 [production]
20:09 <twentyafterfour@tin> Synchronized php-1.28.0-wmf.7/extensions/AbuseFilter/: deploying https://gerrit.wikimedia.org/r/#/c/296440/ refs T138550, T136973 (duration: 02m 06s) [production]
20:09 <twentyafterfour> deploying https://gerrit.wikimedia.org/r/#/c/296440/ to hopefully unblock wmf.7 deployments. refs T138550, T136973 [production]
20:08 <gehel> disabling puppet on wdqs100[12] to cleanup after failed scap3 deplyoment [production]
19:33 <twentyafterfour@tin> rebuilt wikiversions.php and synchronized wikiversions files: Rolling back to wmf.6: save time regression is still present in wmf.7 [production]
19:32 <twentyafterfour> Rolling back to wmf.6: T138550 is still a problem [production]
19:23 <twentyafterfour@tin> rebuilt wikiversions.php and synchronized wikiversions files: all wikis to 1.28.0-wmf.7 [production]
19:23 <twentyafterfour> Deploying 1.28.0-wmf.7 to all wikis [production]
18:23 <mutante> zosma - fresh install, sign puppet certs, initial puppet run [production]
16:16 <gehel> starting rolling restart of elasticsearch codfw cluster (T138811) [production]
15:25 <thcipriani@tin> Synchronized portals: SWAT: [[gerrit:296399|Bumping portals to master (T136874)]] (duration: 00m 29s) [production]
15:24 <thcipriani@tin> Synchronized portals/prod/wikipedia.org/assets: SWAT: [[gerrit:296399|Bumping portals to master (T136874)]] (duration: 00m 24s) [production]
15:16 <thcipriani@tin> Synchronized dblists/visualeditor-default.dblist: SWAT: Enable VisualEditor by default for all users of the [[gerrit:292750|French (T136993)]], [[gerrit:292751|English (T136992)]], and [[gerrit:292752|German (T136991)]] Wikivoyage (duration: 00m 24s) [production]
15:09 <thcipriani@tin> Synchronized dblists/visualeditor-default.dblist: SWAT: [[gerrit:292749|Enable VisualEditor by default for all users of the Italian Wikivoyage (T136994)]] (duration: 00m 25s) [production]
14:52 <gehel> powercycling elastic1004 (server not coming up during restart - T138811) [production]
13:46 <godog> bounce carbon on graphite machines after applying https://gerrit.wikimedia.org/r/266567 [production]
13:40 <elukey@palladium> conftool action : set/pooled=yes; selector: aqs1001.eqiad.wmnet [production]