2017-06-20
§
|
17:52 |
<mutante> |
cobalt (gerrit) - re-enabling puppet, running it. nothing should change, the system unit file mentioned in T168360#3362314 does not get installed by puppet, it comes from the deb |
[production] |
17:49 |
<subbu> |
Since arlolra noticed some unexpected warnings from the canaries, the Parsoid deploy was rolled back, so Parsoid was not updated to e2e2b5f6 (contrary to what scap said above). |
[production] |
17:48 |
<gehel@tin> |
Finished deploy [wdqs/wdqs@b60d224]: (no justification provided) (duration: 01m 41s) |
[production] |
17:47 |
<XioNoX> |
repool codfw - T167274 |
[production] |
17:46 |
<gehel@tin> |
Started deploy [wdqs/wdqs@b60d224]: (no justification provided) |
[production] |
17:45 |
<gehel> |
deploying wdqs blazegraph and GUI updates |
[production] |
17:43 |
<mutante> |
RT - ununpentium - upgraded rt4-db-mysql |
[production] |
17:42 |
<arlolra@tin> |
Finished deploy [parsoid/deploy@4b60bf9]: Updating Parsoid to e2e2b5f6 (duration: 07m 57s) |
[production] |
17:40 |
<mutante> |
mwreleases1001 - puppet node clean, puppet node deactivate - was reinstalled as releases1001 |
[production] |
17:34 |
<arlolra@tin> |
Started deploy [parsoid/deploy@4b60bf9]: Updating Parsoid to e2e2b5f6 |
[production] |
17:29 |
<elukey> |
running a script called "check" in tmux on rdb200[34] to periodically dump LLEN enwiki:jobqueue:enqueue:l-unclaimed |
[production] |
17:21 |
<elukey> |
restart redis-instance-tcp_6380.service on rdb2003 to force sync with its master |
[production] |
17:16 |
<elukey> |
restart redis-instance-tcp_6380.service on rdb2004 to force sync with its master |
[production] |
17:04 |
<XioNoX> |
re-enable igmp-snooping on asw-d-codfw |
[production] |
17:01 |
<bd808> |
Ran maintain-meta_p --all-databases on labsdb1003 |
[production] |
16:55 |
<bd808> |
Ran maintain-meta_p --all-databases on labsdb1001 |
[production] |
16:53 |
<paravoid> |
updating the d-i image for stretch in puppet volatile |
[production] |
16:09 |
<chasemp> |
openstack server delete admin-monitoring openstack project instances (we have leaked 7) |
[production] |
16:05 |
<elukey> |
reboot kafka1013 for kernel upgrade |
[production] |
15:08 |
<XioNoX> |
starting asw-d-codfw switch upgrade - T167274 |
[production] |
14:47 |
<elukey> |
rolling restart of druid100[123] for kernel upgrades |
[production] |
14:32 |
<XioNoX> |
depooled codfw - T167274 |
[production] |
14:27 |
<moritzm> |
rebooting scb1001 for kernel update |
[production] |
14:17 |
<hashar> |
CI is fully back up (following reboot of contint1001 / labnodepool1001) |
[production] |
14:16 |
<hashar> |
Upgraded Jenkins plugins |
[production] |
14:05 |
<hashar> |
Starting Jenkins on contint1001 |
[production] |
14:05 |
<elukey> |
reboot kafka2001 for kernel upgrade |
[production] |
14:02 |
<hashar> |
Rebooting contint1001 |
[production] |
14:00 |
<hashar> |
Stopping Nodepool service to prevent new builds |
[production] |
13:55 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Depool db1087 - T166207 (duration: 01m 41s) |
[production] |
13:55 |
<marostegui> |
Deploy alter table db1087 - s5 - T166207 |
[production] |
13:47 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Repool db1071 - T166207 (duration: 00m 41s) |
[production] |
13:44 |
<aude@tin> |
Synchronized wmf-config/Wikibase-production.php: Enable Wiktionary site links on test.wikidata (duration: 00m 43s) |
[production] |
13:42 |
<_joe_> |
manually started nrpe on ms-be1016 |
[production] |
13:39 |
<marostegui> |
Deploy alter table on db1049 - s5 - T166207 |
[production] |
13:39 |
<moritzm> |
rebooting labnodepool1001 for kernel update |
[production] |
13:37 |
<hashar> |
Restarting Jenkins |
[production] |
13:36 |
<akosiaris@puppetmaster1001> |
conftool action : set/pooled=true; selector: name=sca1004.eqiad.wmnet |
[production] |
13:36 |
<akosiaris@puppetmaster1001> |
conftool action : set/pooled=true; selector: name=mwdebug1002.eqiad.wmnet |
[production] |
13:33 |
<godog> |
pool thumbor100[34] into service - T168297 |
[production] |
13:26 |
<marostegui> |
Deploy alter table labsdb1010 - s5 - T166207 |
[production] |
13:14 |
<moritzm> |
rebooting restbase staging cluster (cerium/praseodymium/xenon) for kernel update |
[production] |
12:09 |
<gehel> |
starting cluster restart elasticsearch eqiad |
[production] |
12:00 |
<elukey> |
reboot analytics1029 -> analytics1069 for kernel upgrades (Hadoop worker nodes) |
[production] |
11:36 |
<moritzm> |
installing libgcrypt security updates |
[production] |
11:29 |
<moritzm> |
rebooting mediawiki app servers in codfw for kernel update |
[production] |
11:13 |
<akosiaris> |
renumber sca1004, mwdebug1002. Downtime should be a few minutes |
[production] |
11:08 |
<akosiaris@puppetmaster1001> |
conftool action : set/pooled=inactive; selector: name=mwdebug1002.eqiad.wmnet |
[production] |
10:56 |
<akosiaris@puppetmaster1001> |
conftool action : set/pooled=inactive; selector: name=sca1004.eqiad.wmnet |
[production] |
10:07 |
<moritzm> |
rebooting mwdebug servers for kernel update |
[production] |