2017-06-20
ยง
|
17:29 |
<elukey> |
running a script in tmux on rdb200[34] called "check" to dump periodically LLEN enwiki:jobqueue:enqueue:l-unclaimed |
[production] |
17:21 |
<elukey> |
restart redis-instance-tcp_6380.service on rdb2003 to force sync with its master |
[production] |
17:16 |
<elukey> |
restart redis-instance-tcp_6380.service on rdb2004 to force sync with its master |
[production] |
17:04 |
<XioNoX> |
re-enable igmp-snooping on asw-d-codfw |
[production] |
17:01 |
<bd808> |
Ran maintain-meta_p --all-databases on labsdb1003 |
[production] |
16:55 |
<bd808> |
Ran maintain-meta_p --all-databases on labsdb1001 |
[production] |
16:53 |
<paravoid> |
updating the d-i image for stretch in puppet volatile |
[production] |
16:09 |
<chasemp> |
openstack server delete admin-monitoring openstack project instances (we have leaked 7) |
[production] |
16:05 |
<elukey> |
reboot kafka1013 for kernel upgrade |
[production] |
15:08 |
<XioNoX> |
starting asw-d-codfw switch upgrade - T167274 |
[production] |
14:47 |
<elukey> |
rolling restart of druid100[123] for kernel upgrades |
[production] |
14:32 |
<XioNoX> |
depooled codfw - T167274 |
[production] |
14:27 |
<moritzm> |
rebooting scb1001 for kernel update |
[production] |
14:17 |
<hashar> |
CI is fully backup (following reboot of contint1001 / labnodepool1001 ) |
[production] |
14:16 |
<hashar> |
Upgraded Jenkins plugins |
[production] |
14:05 |
<hashar> |
Starting Jenkins on contint1001 |
[production] |
14:05 |
<elukey> |
reboot kafka2001 for kernel upgrade |
[production] |
14:02 |
<hashar> |
Rebooting contint1001 |
[production] |
14:00 |
<hashar> |
Stopping Nodepool service to prevent new builds |
[production] |
13:55 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Depool db1087 - T166207 (duration: 01m 41s) |
[production] |
13:55 |
<marostegui> |
Deploy alter table db1087 - s5 - T166207 |
[production] |
13:47 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Repool db1071 - T166207 (duration: 00m 41s) |
[production] |
13:44 |
<aude@tin> |
Synchronized wmf-config/Wikibase-production.php: Enable Wiktionary site links on test.wikidata (duration: 00m 43s) |
[production] |
13:42 |
<_joe_> |
manually started nrpe on ms-be1016 |
[production] |
13:39 |
<marostegui> |
Deploy alter table on db1049 - s5 - T166207 |
[production] |
13:39 |
<moritzm> |
rebooting labnodepool1001 for kernel update |
[production] |
13:37 |
<hashar> |
Restarting Jenkins |
[production] |
13:36 |
<akosiaris@puppetmaster1001> |
conftool action : set/pooled=true; selector: name=sca1004.eqiad.wmnet |
[production] |
13:36 |
<akosiaris@puppetmaster1001> |
conftool action : set/pooled=true; selector: name=mwdebug1002.eqiad.wmnet |
[production] |
13:33 |
<godog> |
pool thumbor100[34] into service - T168297 |
[production] |
13:26 |
<marostegui> |
Deploy alter table labsdb1010 - s5 - T166207 |
[production] |
13:14 |
<moritzm> |
rebooting restbase staging cluster (cerium/praseodymium/xenon) for kernel update |
[production] |
12:09 |
<gehel> |
starting cluster restart elasticsearch eqiad |
[production] |
12:00 |
<elukey> |
reboot analytics1029 -> analytics1069 for kernel upgrades (Hadoop worker nodes) |
[production] |
11:36 |
<moritzm> |
installing libgcrypt security updates |
[production] |
11:29 |
<moritzm> |
rebooting mediawiki app servers in codfw for kernel update |
[production] |
11:13 |
<akosiaris> |
renumber sca1004, mwdebug1002. Downtime should be a few minutes |
[production] |
11:08 |
<akosiaris@puppetmaster1001> |
conftool action : set/pooled=inactive; selector: name=mwdebug1002.eqiad.wmnet |
[production] |
10:56 |
<akosiaris@puppetmaster1001> |
conftool action : set/pooled=inactive; selector: name=sca1004.eqiad.wmnet |
[production] |
10:07 |
<moritzm> |
rebooting mwdebug servers for kernel update |
[production] |
10:03 |
<elukey> |
reboot kafka1012, analytics1028, aqs1004 for kernel upgrades (canary hosts) |
[production] |
10:00 |
<godog> |
reimage ms-be1016 with stretch |
[production] |
09:53 |
<godog> |
reset ms-be1014 idrac via ipmitool |
[production] |
09:46 |
<moritzm> |
rebooting app server canaries for kernel update |
[production] |
09:40 |
<godog> |
roll-restart thumbor to increase swift timeout |
[production] |
09:29 |
<marostegui> |
Rename table on db1089 enwiki.wikilove_image_log - T127219 |
[production] |
08:46 |
<marostegui> |
Drop table titlekey from s1 - T164949 |
[production] |
08:35 |
<godog> |
roll restart swift-proxy on ms-fe* to pick up thumbor changes |
[production] |
08:30 |
<_joe_> |
restarting gerrit T168360 |
[production] |
08:25 |
<_joe_> |
manually patching gerrit's systemd unit file to allow more open files |
[production] |