2014-08-19 §
07:17 <hashar> Jenkins: manually cleared out a tmpfs partition on lanthanum.eqiad.wmnet which was causing all MediaWiki / extensions jobs to fail completely. {{bug|69731}}. We need disk space monitoring which is {{bug|69733}}. [production]
07:10 <bblack> ... and strontium passenger is failing to start up correctly again. icinga-wm disabled to avoid spam [production]
07:07 <bblack> restarted apache2 service on strontium/palladium, expect another small spike of puppet fail->ok [production]
03:21 <LocalisationUpdate> ResourceLoader cache refresh completed at Tue Aug 19 03:20:21 UTC 2014 (duration 20m 20s) [production]
02:37 <LocalisationUpdate> completed (1.24wmf17) at 2014-08-19 02:36:21+00:00 [production]
02:16 <LocalisationUpdate> completed (1.24wmf16) at 2014-08-19 02:15:14+00:00 [production]
2014-08-18 §
23:03 <andrewbogott> isolated virt1006, re-enabling puppet on virt1000 and virt1006 [production]
22:36 <andrewbogott> disabling puppet on virt1000 and virt1006 while I try to convince the scheduler to overlook virt1006 [production]
22:22 <^d> dropped apache01/02 instances, unused and need the resources [releng]
22:01 <bblack> done futzing w/ puppetmasters+neon, all agents enabled and bot back online [production]
21:28 <hashar> Zuul processing again. Definitely need to write doc about how to unstuck it [production]
21:02 <hashar> Zuul / Jenkins stalled again :-/ [production]
19:35 <bblack> testing new passenger perf params on strontium/palladium. agents on those two and icinga-wm still disabled [production]
19:04 <bblack> restarted service apache2 on strontium - passenger for puppet master was dead again [production]
18:23 <manybubbles> finished upgrading elasticsearch in beta - everything seems ok so far [releng]
18:15 <bd808> Restarted salt-minion on deployment-mediawiki01 & deployment-rsync01 [releng]
18:15 <bd808> Ran `sudo pkill python` on deployment-rsync01 to kill hundreds of grain-ensure processes [releng]
18:12 <bd808> Ran `sudo pkill python` on deployment-mediawiki01 to kill hundreds of grain-ensure processes [releng]
18:10 <manybubbles> finally restarting beta's elasticsearch servers now that they have new jars [releng]
17:56 <bd808> Manually ran trebuchet fetches on deployment-elastic0* [releng]
17:49 <bd808> Forcing puppet run on deployment-elastic01 [releng]
17:47 <godog> upgraded hhvm on mediawiki02 to 3.3-dev+20140728+wmf5 [releng]
17:44 <bd808> Trying to restart minions again with `salt '*' -b 1 service.restart salt-minion` [releng]
17:39 <bd808> Restarting minions via `salt '*' service.restart salt-minion` [releng]
17:38 <bd808> Restarted salt-master service on deployment-salt [releng]
17:19 <bd808> 16:37 Restarted Apache and HHVM on deployment-mediawiki02 to pick up removal of /etc/php5/conf.d/mail.ini (logged in prod SAL by mistake) [releng]
17:00 <andrewbogott> added a (yuvi-built) python-txstatsd package to trusty on Carbon. [production]
16:59 <manybubbles|lunc> upgrading Elasticsearch in beta to 1.3.2 [releng]
16:37 <bd808> deployment-prep Restarted Apache and HHVM on deployment-mediawiki02 to pick up removal of /etc/php5/conf.d/mail.ini [production]
16:27 <yurik> Synchronized php-1.24wmf17/extensions: Syncing JsonConfig,ZeroPortal,ZeroBanner (duration: 01m 13s) [production]
16:22 <yurik> Synchronized php-1.24wmf16/extensions: Syncing JsonConfig,ZeroPortal,ZeroBanner (duration: 01m 22s) [production]
16:18 <legoktm> migrateAccount.php finished, 2014-08-18 15:42:12 processed 1528652 usernames (22.9/sec), 10 (0.0%) fully migrated, 7938 (0.5%) partially migrated [production]
16:11 <bd808> Manually applied https://gerrit.wikimedia.org/r/#/c/141287/12/templates/mail/exim4.minimal.erb on deployment-mediawiki02 and restarted exim4 service [releng]
16:05 <hashar> Jenkins tox based jobs are now runnable in parallel {{gerrit|154834}} [production]
15:36 <manybubbles> swat complete [production]
15:29 <manybubbles> Synchronized wmf-config/InitialiseSettings.php: SWAT - enable cirrus optimization - weighted all fields - on group0 wikis (duration: 00m 07s) [production]
15:29 <manybubbles> Synchronized wmf-config/CirrusSearch-common.php: SWAT - drop unused Cirrus parameter (duration: 00m 05s) [production]
15:28 <bd808> Puppet failing for deployment-mathoid due to duplicate definition error in trebuchet config [releng]
15:25 <manybubbles> Synchronized php-1.24wmf16/extensions/CentralAuth: SWAT - two centralauth fixes (duration: 00m 05s) [production]
15:22 <bblack> resuming slowly wiping varnish caches for mmap update (49 hosts to go), expect small 5xx spikes every ~1.5 hrs for the next few days [production]
15:22 <manybubbles> Synchronized wmf-config/: SWAT - noop - sync files adding bouncehandler to betalabs (duration: 00m 04s) [production]
15:19 <manybubbles> Synchronized wmf-config/InitialiseSettings.php: SWAT - create portal/portal talk namespaces on kowikisource (duration: 00m 04s) [production]
15:18 <manybubbles> Synchronized php-1.24wmf17/extensions/CentralAuth/: SWAT - two centralauth fixes (duration: 00m 04s) [production]
15:15 <bd808> Reinstated puppet patch to depool deployment-mediawiki01 and forced puppet run on all deployment-cache-* hosts [releng]
15:13 <manybubbles> Synchronized wmf-config/InitialiseSettings.php: SWAT - create eliminator role on viwiki (duration: 00m 05s) [production]
15:11 <manybubbles> Synchronized php-1.24wmf17/extensions/Wikidata/: (no message) (duration: 00m 07s) [production]
15:08 <manybubbles> Synchronized wmf-config/InitialiseSettings.php: SWAT - Add global-renamer group to metawiki (duration: 00m 04s) [production]
15:04 <bd808> Puppet run failing on deployment-mediawiki01 (apache won't start); Puppet disabled on deployment-mediawiki02 ('reason not specified') Probably needs to wait until Giuseppe is back from vacation for fixing. [releng]
15:00 <bd808> Rebooting deployment-eventlogging02 via wikitech; console filling with OOM killer messages and puppet runs failing with "Cannot allocate memory - fork(2)" [releng]