2015-07-28
|
17:49 |
<valhallasw`cloud> |
Jobs were drained at 19:43, but this did not decrease the rate, which is still at ~50k/minute. Now running "sysctl -w sunrpc.nfs_debug=1023 && sleep 2 && sysctl -w sunrpc.nfs_debug=0" which hopefully doesn't kill the server |
[tools] |
17:43 |
<valhallasw`cloud> |
rescheduled all webservice jobs on tools-webgrid-lighttpd-1401.eqiad.wmflabs, server is now empty |
[tools] |
17:16 |
<valhallasw`cloud> |
disabled queue "webgrid-lighttpd@tools-webgrid-lighttpd-1401.eqiad.wmflabs" |
[tools] |
17:04 |
<godog> |
start cassandra on restbase1007, tentative bootstrap |
[production] |
16:24 |
<YuviPanda> |
bounced create-dbusers on labstore1002 |
[production] |
16:03 |
<bd808> |
logstash1002 conversion to jessie done; log event volume returning to normal in index |
[production] |
16:01 |
<godog> |
bounce cassandra on xenon to test logstash logging |
[production] |
15:52 |
<bd808> |
installed logstash on logstash1002; forced puppet run |
[production] |
15:03 |
<thcipriani> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable VisualEditor for 5% of new accounts on enwiki [[gerrit:226338]] (duration: 00m 12s) |
[production] |
14:43 |
<cmjohnson1> |
powering down logstash1002 to remove disk and install jessie |
[production] |
14:28 |
<moritzm> |
restarted zookeeper on conf1003 to effect OpenJDK security update |
[production] |
14:16 |
<_joe_> |
re-enabled puppet on mw1152 for testing |
[production] |
14:16 |
<moritzm> |
restarted zookeeper on conf1002 to effect OpenJDK security update |
[production] |
13:58 |
<paravoid> |
upgrading baham to gdnsd 2.2.0 |
[production] |
13:41 |
<_joe_> |
disabled puppet on mw1152, thumb_handler testing |
[production] |
13:40 |
<moritzm> |
restarted zookeeper on conf1001 to effect OpenJDK security update |
[production] |
13:13 |
<jynus> |
temporarily changing master of db1069(s1) to db1051 in order to fix some labsdb inconsistencies on enwiki_p |
[production] |
12:29 |
<godog> |
reenable puppet on restbase1001 after merging https://gerrit.wikimedia.org/r/#/c/227355/ |
[production] |
11:18 |
<hashar> |
Assigning label "BetaClusterBastion" to https://integration.wikimedia.org/ci/computer/deployment-bastion.eqiad/ |
[releng] |
11:12 |
<hashar> |
Jenkins jobs for the beta cluster ended up stuck again. Found a workaround by removing the Jenkins label on deployment-bastion node and reinstating it. Seems to get rid of the deadlock ( ref: https://phabricator.wikimedia.org/T72597#1487801 ) |
[releng] |
10:31 |
<paravoid> |
merging a series of mail-related patches; ping me personally if problems arise |
[production] |
10:03 |
<mobrovac> |
citoid deploying d57ec96 |
[production] |
09:50 |
<hashar> |
deployment-apertium01 is back! The ferm rules were outdated / not maintained by puppet, dropped ferm entirely. |
[releng] |
09:41 |
<jynus> |
Synchronized wmf-config/db-eqiad.php: Increasing db1035 weight (duration: 00m 13s) |
[production] |
09:40 |
<hashar> |
rebooting deployment-apertium01 to ensure its ferm rules are properly loaded on boot ( https://phabricator.wikimedia.org/T106658 ) |
[releng] |
08:13 |
<moritzm> |
added elasticsearch-1.7.0 to carbon for jessie and trusty |
[production] |
07:30 |
<YuviPanda> |
dropped others20150724190859 on labstore1002 |
[production] |
06:53 |
<LocalisationUpdate> |
ResourceLoader cache refresh completed at Tue Jul 28 06:53:21 UTC 2015 (duration 53m 20s) |
[production] |
02:30 |
<LocalisationUpdate> |
completed (1.26wmf15) at 2015-07-28 02:30:24+00:00 |
[production] |
02:26 |
<l10nupdate> |
Synchronized php-1.26wmf15/cache/l10n: (no message) (duration: 07m 29s) |
[production] |
02:07 |
<LocalisationUpdate> |
ResourceLoader cache refresh completed at Tue Jul 28 02:07:52 UTC 2015 (duration 7m 51s) |
[production] |
02:06 |
<YuviPanda> |
removed pacct files from tools-bastion-01 |
[tools] |
02:03 |
<LocalisationUpdate> |
failed (1.26wmf15) at 2015-07-28 02:03:41+00:00 |
[production] |
01:11 |
<krenair> |
Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/227371/ (duration: 00m 11s) |
[production] |
00:46 |
<legoktm> |
deploying https://gerrit.wikimedia.org/r/227383 |
[releng] |
00:35 |
<krenair> |
Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/227381/ (duration: 00m 13s) |
[production] |
00:30 |
<krenair> |
Synchronized php-1.26wmf15/extensions/SiteMatrix/SiteMatrix_body.php: https://gerrit.wikimedia.org/r/#/c/227379/ (duration: 00m 12s) |
[production] |
00:00 |
<catrope> |
Finished scap: SWAT (duration: 22m 15s) |
[production] |
2015-07-27
|
23:53 |
<ori> |
Re-pooling mw1159 and mw1160 |
[production] |
23:38 |
<catrope> |
Started scap: SWAT |
[production] |
23:24 |
<catrope> |
Synchronized wmf-config/InitialiseSettings.php: SWAT (duration: 00m 12s) |
[production] |
23:23 |
<catrope> |
Synchronized w/static/images/project-logos/suwikiquote.png: Localized logo for suwikiquote (duration: 00m 12s) |
[production] |
23:04 |
<marxarelli> |
running `jenkins-jobs update config/ 'browsertests-*'` to deploy I3c61ff4089791375e21aadfa045d503dfd73ca0e |
[releng] |
22:19 |
<andrewbogott> |
rebooting labvirt1005 |
[production] |
21:50 |
<bd808> |
updated scap to dc8eda5 (Don't exclude PHP files from being synced) |
[production] |
21:34 |
<ori> |
Synchronized php-1.26wmf15/extensions/AbuseFilter: I13d29ea6: Revert "Conversion to using getMainStashInstance()" (duration: 00m 12s) |
[production] |
21:27 |
<valhallasw`cloud> |
turned off process accounting on tools-login while we try to find the root cause of [[phab:T107052]]: <pre>accton off</pre> |
[tools] |
21:24 |
<andrewbogott> |
rebooting labnet1002, just to see if I can |
[production] |
20:57 |
<ori> |
Synchronized wmf-config/CommonSettings.php: I1ca47ebc4: $wgEventLoggingSchemaApiUri: http -> https (duration: 00m 12s) |
[production] |
20:54 |
<bd808> |
installed libbcprov-java and restarted logstash on logstash1001 |
[production] |