2018-03-05
|
10:24 |
<elukey> |
drain + reboot analytics10[46-49] for kernel updates |
[production] |
10:23 |
<moritzm> |
rolling reboot of logstash* for kernel security update |
[production] |
09:33 |
<godog> |
roll restart swift in codfw to add thumbor private user |
[production] |
09:15 |
<marostegui> |
Deploy schema change on s7 codfw master (db2040), this will generate lag on codfw - T187089 T185128 T153182 |
[production] |
09:01 |
<godog> |
roll-restart thumbor to apply https://gerrit.wikimedia.org/r/416240 |
[production] |
08:54 |
<marostegui> |
Stop mariadb on db2037 to copy it to db1073 |
[production] |
08:25 |
<marostegui> |
Stop MySQL on db2078 for mariadb and kernel upgrade |
[production] |
07:20 |
<marostegui@tin> |
Synchronized wmf-config/db-codfw.php: Remove db1073 from config (duration: 00m 58s) |
[production] |
07:18 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Remove db1073 from config (duration: 00m 59s) |
[production] |
07:06 |
<marostegui> |
Deploy schema change on s2 primary master db1054 - T185128 T153182 |
[production] |
02:08 |
<l10nupdate@tin> |
LocalisationUpdate failed: git pull of extensions failed |
[production] |
2018-03-03
|
14:16 |
<akosiaris> |
13:56:20 ema: powercycle ganeti1005 T181121 |
[production] |
13:56 |
<ema> |
powercycle ganeti1005 |
[production] |
13:25 |
<andrewbogott> |
forced quota update in admin-monitoring as well; the reserved fixed_ip value was incorrect |
[production] |
13:23 |
<andrewbogott> |
forcing quota update in nova with update quota_usages set reserved='-1' where project_id='contintcloud'; |
[production] |
13:10 |
<andrewbogott> |
restarting rabbitmq-server on labcontrol1001 |
[production] |
13:08 |
<andrewbogott> |
restarting nodepool |
[production] |
13:05 |
<andrewbogott> |
restarting nova-conductor |
[production] |
13:02 |
<andrewbogott> |
stopping nodepool for a bit while investigating openstack issues |
[production] |
02:14 |
<chasemp> |
labnodepool1001:~# service nodepool start |
[production] |
01:30 |
<chasemp> |
root@labnet1001:~# service nova-fullstack restart |
[production] |
01:21 |
<chasemp> |
labnodepool1001:~# service nodepool stop |
[production] |
2018-03-02
|
19:44 |
<jynus> |
restarting labsdb1010 |
[production] |
17:22 |
<mepps> |
updated payments-wiki 498f49a758 to ce68e8e80b |
[production] |
15:19 |
<elukey> |
drain + reboot analytics10[41-45] for kernel updates |
[production] |
15:15 |
<moritzm> |
rebooting auth* for kernel security updates |
[production] |
13:46 |
<elukey> |
drain + reboot analytics10[38,39,40,41] for kernel updates |
[production] |
13:22 |
<elukey> |
drain + reboot analytics10[33,34,36,37] for kernel updates |
[production] |
13:17 |
<moritzm> |
upgrading labtest trusty hosts to latest 4.4 kernel |
[production] |
12:23 |
<moritzm> |
rebooting kubetcd/kubestagetcd for kernel security update |
[production] |
12:00 |
<moritzm> |
rebooting etcd* for kernel security updates |
[production] |
11:58 |
<elukey> |
drain + reboot analytics10[29,31,32] for kernel updates |
[production] |
11:33 |
<moritzm> |
draining restbase1018 for eventual reboot for kernel security update |
[production] |
11:28 |
<akosiaris> |
upload to apt.wikimedia.org component thirdparty/ci distro jessie-wikimedia docker-ce_17.12.1~ce-0~debian_amd64 T177499 |
[production] |
11:07 |
<moritzm> |
rebooting mwdebug* for kernel security update |
[production] |
10:54 |
<ema> |
spare LVSs lvs[1011-1012], lvs[4001-4004]: reboot for retpoline kernel updates T188092 |
[production] |
10:53 |
<moritzm> |
draining restbase1017 for eventual reboot for kernel security update |
[production] |
10:46 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Fully repool db1114 (duration: 00m 57s) |
[production] |
10:18 |
<moritzm> |
draining restbase1016 for eventual reboot for kernel security update |
[production] |
10:18 |
<jynus> |
shutting down labsdb1010 |
[production] |
10:17 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Increase traffic for db1114 (duration: 00m 56s) |
[production] |
10:01 |
<elukey> |
deleted /etc/burrow/* from zookeeper main eqiad/codfw after https://gerrit.wikimedia.org/r/415818 (garbage to clean up) |
[production] |
09:57 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Increase traffic for db1114 (duration: 00m 57s) |
[production] |
09:40 |
<moritzm> |
draining restbase1015 for eventual reboot for kernel security update |
[production] |
09:27 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Slowly pool db1114 in s1 after cloning it from db1073 - T183469 (duration: 01m 01s) |
[production] |
08:57 |
<moritzm> |
rebooting scb1004 for kernel security update (was omitted from earlier reboots due to hardware issues on scb1003) |
[production] |