2014-09-22
§
|
21:04 |
<bd808> |
production-logstash-eqiad healed by restarting elasticsearch on logstash1002 after OOM + split brain |
[production] |
20:54 |
<bd808> |
split brain on logstash1002 preceded by by java OOM for elasticsearch |
[production] |
20:52 |
<bd808> |
logstash1002 went split brain from rest of logstash elastic search cluster. restarting |
[production] |
20:24 |
<subbu> |
deployed Parsoid ff9476f9 |
[production] |
19:31 |
<hashar> |
Jenkins is broken for extensions patches proposed against the wmf branches {{bug|71133}} |
[production] |
18:32 |
<Krinkle> |
lanthanum tmpfs filled up again, purged manually (bug 71128) |
[production] |
17:22 |
<ori> |
updated HHVM on beta cluster to HHVM to 3.3.0-20140918+wmf1 |
[production] |
17:00 |
<demon> |
Synchronized wmf-config/InitialiseSettings.php: Push Cirrus' non-content enwiki shards apart (no-op) (duration: 00m 04s) |
[production] |
16:09 |
<bd808> |
Ori updating HHVM to 3.3.0-20140918+wmf1 (from deployment-prep SAL) |
[releng] |
16:08 |
<ori> |
updating HHVM to 3.3.0-20140918+wmf1 |
[releng] |
15:52 |
<godog> |
reboot ms-be2001 into PXE to test a re-install |
[production] |
15:07 |
<anomie> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable Graph extension on mediawiki.org [[gerrit:161908]] (duration: 00m 09s) |
[production] |
15:02 |
<anomie> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: Add securepoll-create-poll right to sysop on testwiki [[gerrit:161653]] (duration: 00m 09s) |
[production] |
15:01 |
<anomie> |
Synchronized wmf-config/CommonSettings.php: SWAT: Add REL1_24 as branch in ExtensionDistributor [[gerrit:161666]] (duration: 00m 10s) |
[production] |
14:12 |
<hashar> |
Jenkins deleted job mediawiki-core-lint , replaced by mediawiki-core-phplint |
[production] |
12:10 |
<apergos> |
shutdown of db1050 to install trusty |
[production] |
10:04 |
<hashar> |
Jenkins back and fully operational |
[production] |
09:55 |
<hashar> |
restarting jenkins |
[production] |
09:37 |
<hashar_> |
Jenkins: deleting old mediawiki extensions jobs (<tt>rm -fR /var/lib/jenkins/jobs/*testextensions-master</tt>). They are no more triggered and superseded by the <tt>*-testextension</tt> jobs. |
[releng] |
09:37 |
<hashar_> |
Jenkins: deleting old mediawiki extensions jobs (<tt>rm -fR /var/lib/jenkins/jobs/*testextensions-master</tt>). They are no more triggered and superseded by the <tt>*-testextension</tt> jobs. |
[production] |
03:36 |
<LocalisationUpdate> |
ResourceLoader cache refresh completed at Mon Sep 22 03:36:40 UTC 2014 (duration 36m 39s) |
[production] |
02:41 |
<LocalisationUpdate> |
completed (1.24wmf22) at 2014-09-22 02:41:29+00:00 |
[production] |
02:29 |
<LocalisationUpdate> |
completed (1.24wmf21) at 2014-09-22 02:29:09+00:00 |
[production] |
02:16 |
<LocalisationUpdate> |
completed (1.24wmf20) at 2014-09-22 02:16:20+00:00 |
[production] |
2014-09-21
§
|
22:43 |
<ori> |
ms-be1008 overloaded starting 18:00:24 UTC, syslog says "BUG: soft lockup - CPU#1 stuck for 22s! [kworker/1:1:2196]". machine became unresponsive at 21:35, coinciding with a spike of 5xxs, lasting until Coren powercycled it at 22:10. |
[production] |
03:37 |
<LocalisationUpdate> |
ResourceLoader cache refresh completed at Sun Sep 21 03:37:31 UTC 2014 (duration 37m 30s) |
[production] |
03:16 |
<springle> |
labsdb1001 mysqld restarted in gdb; crash loop with a labs user's table |
[production] |
02:46 |
<ori> |
Synchronized wmf-config/throttle.php: I7bb42b49a: Increase account creation throttle on enwiki for Cochrane colloquium. (duration: 00m 07s) |
[production] |
02:41 |
<LocalisationUpdate> |
completed (1.24wmf22) at 2014-09-21 02:41:36+00:00 |
[production] |
02:29 |
<LocalisationUpdate> |
completed (1.24wmf21) at 2014-09-21 02:29:51+00:00 |
[production] |
02:17 |
<LocalisationUpdate> |
completed (1.24wmf20) at 2014-09-21 02:16:56+00:00 |
[production] |
2014-09-20
§
|
22:28 |
<Krinkle> |
Reloading Zuul to deploy I0170766cfc06b8e6 |
[production] |
21:30 |
<bd808> |
Deleted /var/log/atop.* on deployment-bastion to free some disk space in /var |
[releng] |
21:29 |
<bd808> |
Deleted /var/log/account/pacct.* on deployment-bastion to free some disk space in /var |
[releng] |
20:30 |
<andrewbogott> |
rebooting virt1006 to make good and sure it doesn't spontaneously re-enter the compute pool |
[production] |
20:30 |
<andrewbogott_afk> |
moved all VMs off of virt1006, disabled compute service |
[production] |
14:43 |
<andrewbogott> |
movingdeployment-pdf02 to virt1009 |
[releng] |
03:46 |
<LocalisationUpdate> |
ResourceLoader cache refresh completed at Sat Sep 20 03:46:00 UTC 2014 (duration 45m 59s) |
[production] |
02:46 |
<LocalisationUpdate> |
completed (1.24wmf22) at 2014-09-20 02:46:05+00:00 |
[production] |
02:33 |
<LocalisationUpdate> |
completed (1.24wmf21) at 2014-09-20 02:33:34+00:00 |
[production] |
02:19 |
<LocalisationUpdate> |
completed (1.24wmf20) at 2014-09-20 02:19:34+00:00 |
[production] |
00:36 |
<mutante> |
raised instance quota to 43 |
[releng] |