51-100 of 10000 results (26ms)
2015-12-23 §
10:11 <jynus> restarting and reconfiguring mysql at db2035 [production]
09:52 <gwicke> rebuilding restbase1004 [production]
09:51 <gwicke> wiped & started boostrap on restbase1008 [production]
09:18 <gwicke> nodetool removenode e2813bb9-f1f2-4d21-ac19-95a7a35b4513 in preparation for adding 1004 to the cluster without bootstrap [production]
02:30 <l10nupdate@tin> ResourceLoader cache refresh completed at Wed Dec 23 02:30:25 UTC 2015 (duration 7m 1s) [production]
02:23 <mwdeploy@tin> sync-l10n completed (1.27.0-wmf.9) (duration: 09m 18s) [production]
00:40 <krenair@tin> Synchronized wmf-config/CommonSettings-labs.php: https://gerrit.wikimedia.org/r/260696 & https://gerrit.wikimedia.org/r/260699 (duration: 05m 28s) [production]
00:37 <mutante> mw1133 - powercycle [production]
00:36 <legoktm> manually fixed up stuck global rename of "RCJU-ArCJ" -> "Archives cantonales jurassiennes" [production]
00:31 <matt_flaschen> Ran UPDATE flow_workflow SET workflow_page_id = 41854369 WHERE workflow_wiki = 'enwiki' AND workflow_namespace = 5 AND workflow_title_text = 'Flow/Developer_test_page' AND workflow_page_id = 48099373; to work around DB inconsistency (T117812) [production]
2015-12-22 §
21:46 <gwicke> restbase1004: tune2fs -m 0 /dev/mapper/restbase1004--vg-srv [production]
21:45 <gwicke> restbase1004: restarted bootstrap [production]
21:22 <gwicke> restbase1003: restarting cassandra to clear up disk space from old stream [production]
21:11 <gwicke> restbase1008: restarting cassandra to clear up disk space from old stream [production]
18:36 <robh> silver returned to normal service, wikitech.w.o certificate renewed. [production]
18:26 <robh> silver puppet staying stalled during toollabs issue (we dont want to rehup silver web serivce) [production]
18:17 <robh> puppet disabled on silver, going to update wikitech.wikimedia.org certificate [production]
18:10 <jynus> disabling event scheduling on db1046 [production]
18:03 <jynus> rolling schema change (ALTER TABLE ENGINE=TokuDB) on m4-master (db1046) log (eventlogging) [production]
16:44 <godog> bounce cassandra on restbase1004, restart bootstrap [production]
16:42 <mutante> powercycling crashed mw1144 [production]
16:41 <jynus> converting dbstore2001 (delayed slave) into an actual delayed slave, adding redundancy to dbstore1002 [production]
16:40 <godog> bounce cassandra on restbase1003 [production]
16:15 <akosiaris> upgrade cassandra on maps-test2001 [production]
16:15 <akosiaris> upgrade cassandra on maps-test2002 [production]
15:53 <mutante> kafka1001,1002 - crit - eventlogging not running (?) [production]
15:52 <mutante> restbase1003 - disk space, restbase1008 - disk space, restbase1004 - cassandra cql refused [production]
15:23 <akosiaris> upgrade cassandra on maps-test2003 [production]
15:06 <jynus> restarting and reconfiguring mysql at dbstore2001 [production]
15:06 <mutante> labtestcontrol2001 - puppet had not been running for a while, a bunch of changes have been applied incl. keys and passwords [production]
15:04 <mutante> enabling puppet on labtestcontrol2001 [production]
15:04 <akosiaris> upgraded cassandra on maps-test2004 [production]
11:54 <apergos> salt packages with wmf packages precise running on ms-{bf}e* in esams; trusty running on analytics103* in eqiad; jessie running on restbase2* in codfw [production]
11:43 <godog> restart cassandra bootstrap on restbase1004 [production]
10:09 <jynus> online resizing /srv/postgres on labsdb1006 +100GB [production]
10:06 <hashar> Restarting Jenkins [production]
09:54 <apergos> precise and trusty salt packages with wmf patches deployed manually on dataset1001 and analytics1001, seem to work fine [production]
08:42 <jynus> restarting and reconfiguring mysql at db2036 [production]
02:30 <l10nupdate@tin> ResourceLoader cache refresh completed at Tue Dec 22 02:30:28 UTC 2015 (duration 6m 54s) [production]
02:23 <mwdeploy@tin> sync-l10n completed (1.27.0-wmf.9) (duration: 09m 47s) [production]
00:29 <krenair@tin> Synchronized php-1.27.0-wmf.9/extensions/VisualEditor: https://gerrit.wikimedia.org/r/#/c/260492/ (duration: 00m 32s) [production]
00:22 <krenair@tin> Synchronized php-1.27.0-wmf.9/extensions/SyntaxHighlight_GeSHi/modules/ve-syntaxhighlight/ve.ui.MWSyntaxHighlightDialogTool.js: https://gerrit.wikimedia.org/r/#/c/260429/ (duration: 00m 30s) [production]
2015-12-21 §
20:49 <godog> restbase1004 bootstrap failed, restbase1007-a is down java.lang.RuntimeException: A node required to move the data consistently is down (/10.64.0.230). [production]
19:27 <legoktm> running checkLocalUser.php --delete=1 for real this time on terbium [production]
19:22 <godog> reimage restbase1004 [production]
19:14 <paravoid> powercycling mw1011 [production]
19:11 <paravoid> rolling restart of hhvm on the eqiad jobrunners [production]
18:47 <jynus> common-sync: Copying to mw1016.eqiad.wmnet from tin.eqiad.wmnet [production]
18:35 <ori> correction: previous log message was for mw1015, not mw1017 [production]
18:27 <ori> mw1017: enabled jemalloc profiling, restarted hhvm, now running hhvm-collect-heaps [production]