2016-07-25
|
09:18 |
<godog> |
swift eqiad-prod: ms-be102[3456] weight 1500 |
[production] |
03:26 |
<hashar> |
scandium: migrating zuul-merger repos from lead to gerrit.wikimedia.org: find /srv/ssd/zuul/git -path '*/.git/config' -print -execdir sed -i -e 's/lead.wikimedia.org/gerrit.wikimedia.org/' config \; |
[production] |
02:28 |
<l10nupdate@tin> |
ResourceLoader cache refresh completed at Mon Jul 25 02:28:21 UTC 2016 (duration 5m 52s) |
[production] |
02:22 |
<mwdeploy@tin> |
scap sync-l10n completed (1.28.0-wmf.11) (duration: 09m 09s) |
[production] |
02:03 |
<ostriches> |
gerrit: reindexing lucene now that we have new data. searches/dashboards may look a tad weird for a bit |
[production] |
01:53 |
<hashar> |
starting Zuul |
[production] |
01:51 |
<mutante> |
restarted grrrit-wm |
[production] |
01:39 |
<ostriches> |
lead: turning puppet back on, here we go |
[production] |
01:38 |
<jynus> |
m2 replication on db2011 stopped, master binlog pos: db1020-bin.000968:1013334195 |
[production] |
01:37 |
<hashar> |
scandium: restarted zuul-merger |
[production] |
01:36 |
<ostriches> |
ytterbium: Stopped puppet, stopped gerrit process. |
[production] |
01:34 |
<mutante> |
switched gerrit-new to gerrit in DNS |
[production] |
01:30 |
<ostriches> |
lead: stopped puppet for a few minutes |
[production] |
01:17 |
<hashar> |
scandium: migrating zuul-merger repos to lead: find /srv/ssd/zuul/git -path '*/.git/config' -print -execdir sed -i -e 's/ytterbium.wikimedia.org/lead.wikimedia.org/' config \; |
[production] |
01:10 |
<hashar> |
stopping CI |
[production] |
01:09 |
<jynus> |
reviewdb backup finished, available on db1020:/srv/tmp/2016-07-25_00-54-31/ |
[production] |
01:02 |
<ostriches> |
rsyncing latest git data from ytterbium to lead |
[production] |
00:57 |
<mutante> |
manually deleted reviewer-counts cron from gerrit2 user; the job runs as root, and puppet does not remove crons unless ensure=>absent |
[production] |
00:55 |
<jynus> |
starting hot backup of db1020's reviewdb |
[production] |
2016-07-23
|
15:38 |
<godog> |
stop swift in esams test cluster, lots of logging from there |
[production] |
15:37 |
<godog> |
lithium: sudo lvextend --size +10G -r /dev/mapper/lithium--vg-syslog |
[production] |
04:58 |
<ori> |
Gerrit is back up after service restart; was unavailable between ~ 04:29 - 04:57 UTC |
[production] |
04:56 |
<ori> |
Restarting Gerrit on ytterbium |
[production] |
04:48 |
<ori> |
Users report Gerrit is down; on ytterbium java is occupying two cores at 100% |
[production] |
03:48 |
<chasemp> |
gnt-instance reboot seaborgium.wikimedia.org |
[production] |
02:26 |
<l10nupdate@tin> |
ResourceLoader cache refresh completed at Sat Jul 23 02:26:49 UTC 2016 (duration 5m 41s) |
[production] |
02:21 |
<mwdeploy@tin> |
scap sync-l10n completed (1.28.0-wmf.11) (duration: 08m 24s) |
[production] |
01:02 |
<tgr@tin> |
Synchronized php-1.28.0-wmf.11/extensions/CentralAuth/includes/CentralAuthPlugin.php: T141160 (duration: 00m 29s) |
[production] |
01:01 |
<tgr@tin> |
Synchronized php-1.28.0-wmf.11/extensions/CentralAuth/includes/CentralAuthHooks.php: T141160 (duration: 00m 27s) |
[production] |
01:00 |
<tgr@tin> |
Synchronized php-1.28.0-wmf.11/extensions/CentralAuth/includes/CentralAuthPrimaryAuthenticationProvider.php: T141160 (duration: 00m 28s) |
[production] |
00:37 |
<tgr> |
doing an emergency deploy of https://gerrit.wikimedia.org/r/#/c/300679 for T141160, which causes dozens of new users per hour to be unattached on loginwiki and probably has weird consequences |
[production] |
2016-07-22
|
22:19 |
<aaron@tin> |
Synchronized wmf-config/InitialiseSettings.php: Enable debug logging for DBTransaction (duration: 00m 38s) |
[production] |
21:10 |
<ejegg> |
updated civicrm from 2f4805fa2d2a7c57881408be2b3a017d26d8f43e to d657255e1edebeccfc0a03bea70b78eb11375cf8 |
[production] |
20:58 |
<ejegg> |
disabled Worldpay audit parser job |
[production] |
18:59 |
<ejegg> |
rolled back payments from 79d2b67067fd7e579372b63e0d619eccfa3b9143 to 79cb53998c41f72d0fa49130ed1f66dc112b478c |
[production] |
18:54 |
<mutante> |
restart grrrit-wm |
[production] |
16:05 |
<Jeff_Green> |
running authdns-update to correct a DKIM public key on wikipedia.org |
[production] |
15:24 |
<anomie> |
Starting script to populate empty gu_auth_token [[phab:T140478]] |
[production] |
15:16 |
<urandom> |
T140825: Restarting Cassandra to apply 8MB trickle_fsync (restbase1015-a.eqiad.wmnet) |
[production] |
14:21 |
<gehel> |
rolling restart of logstash100[1-3] - T141063 |
[production] |
14:19 |
<urandom> |
T134016: Bootstrapping restbase2004-c.codfw.wmnet |
[production] |
12:42 |
<jynus> |
applying new m5 db grants |
[production] |
11:12 |
<jynus> |
reimage dbproxy1009 T140983 |
[production] |
11:04 |
<jynus> |
applying new m2 db grants |
[production] |
10:47 |
<jynus> |
reimage dbproxy1007 T140983 |
[production] |
10:36 |
<jynus> |
applying new m1 db grants |
[production] |
10:27 |
<hashar> |
Restarting Jenkins entirely (deadlocked) |
[production] |
10:23 |
<hashar> |
Jenkins has some random deadlock. Will probably reboot it |
[production] |