2018-01-10
ยง
|
16:56 |
<elukey> |
reboot analytics1048->50 for kernel updates |
[analytics] |
16:55 |
<elukey> |
reboot analytics1047->50 for kernel updates |
[production] |
16:43 |
<akosiaris> |
wtp* rolling restarts for meltdown finished |
[production] |
16:39 |
<filippo@puppetmaster1001> |
conftool action : set/pooled=yes; selector: name=ms-fe1006.eqiad.wmnet |
[production] |
16:38 |
<filippo@puppetmaster1001> |
conftool action : set/pooled=yes; selector: name=ms-fe1008.eqiad.wmnet |
[production] |
16:35 |
<godog> |
bounce thumbor-instances on thumbor1001 |
[production] |
16:26 |
<anomie> |
Running cleanupUsersWithNoId.php on dewiki and wikidatawiki |
[production] |
16:23 |
<ottomata> |
restarting kafka jumbo brokers to apply java.security certpath restrictions |
[analytics] |
16:22 |
<ottomata> |
restarting kafka jumbo brokers to apply java.security certpath restrictions |
[production] |
16:08 |
<godog> |
roll-restart swift frontend in eqiad for kernel upgrade |
[production] |
16:06 |
<moritzm> |
migrating instances off ganeti2001 for subsequent reboot for kernel security update |
[production] |
16:05 |
<moritzm> |
switched ganeti master node in codfw to ganeti2004 |
[production] |
16:03 |
<marostegui> |
Deploy schema change on db1095.s5 - https://phabricator.wikimedia.org/T174569 |
[production] |
16:02 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Depool db1096:3315 - T174569 (duration: 01m 02s) |
[production] |
15:59 |
<godog> |
start cassandra-a on restbase1011 |
[production] |
15:37 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Repool db1097:3315 - T174569 (duration: 01m 03s) |
[production] |
15:32 |
<moritzm> |
rebooting yubico auth servers for kernel security update |
[production] |
15:14 |
<chasemp> |
tools-clushmaster-01:~$ clush -f 1 -w @k8s-worker "sudo puppet agent --enable && sudo puppet agent --test" |
[tools] |
15:14 |
<moritzm> |
reboot netmon1002 / netmon2001 for kernel security update |
[production] |
15:03 |
<chasemp> |
tools-k8s-master-01:~# for n in `kubectl get nodes | awk '{print $1}' | grep -v -e tools-worker-1001 -e tools-worker-1016 -e tools-worker-1016`; do kubectl cordon $n; done |
[tools] |
14:54 |
<ema> |
codfw LVSs: upgrade to latest jessie point release (8.10) T182656 and linux kernel 4.9.65-3+deb9u1~bpo8+2 (KPTI) T184267 |
[production] |
14:51 |
<godog> |
start cassandra-a on restbase1011 - T184100 |
[production] |
14:50 |
<zeljkof> |
EU SWAT finished |
[production] |
14:50 |
<jynus> |
dropping dewiki from dbstore2001:3318 T184599 |
[production] |
14:48 |
<_joe_> |
shutting down deployment-puppetdb01.deployment-prep.eqiad.wmflabs, unused |
[releng] |
14:47 |
<zfilipin@tin> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:402780|translationadmin: remove configuration equal to CommonSettings.php (T184314)]] (duration: 01m 02s) |
[production] |
14:46 |
<zfilipin@tin> |
Synchronized wmf-config/CommonSettings.php: SWAT: [[gerrit:403410|translationadmin: typo fix]] (duration: 01m 03s) |
[production] |
14:42 |
<chasemp> |
new meltdown images are live in cloud land |
[production] |
14:41 |
<chasemp> |
tools-clushmaster-01:~$ clush -w @k8s-worker "sudo puppet agent --disable 'chase rollout'" |
[tools] |
14:34 |
<jynus> |
dropping wikidatawiki from dbstore2001:3315 T184599 |
[production] |
14:09 |
<zfilipin@tin> |
Synchronized wmf-config/throttle.php: SWAT: [[gerrit:403342|Lift the cap on IP address to create accounts on mrwiki (T184579)]] (duration: 01m 04s) |
[production] |
14:05 |
<moritzm> |
migrating instances off ganeti2002 for subsequent reboot for kernel security update |
[production] |
14:01 |
<chasemp> |
tools-k8s-master-01:~# kubectl uncordon tools-worker-1001.tools.eqiad.wmflabs |
[tools] |
13:57 |
<arturo> |
T184604 cleaned stalled log files that prevented logrotate from working. Triggered a couple of logrorate runs by hand in tools-worker-1020.tools.eqiad.wmflabs |
[tools] |
13:46 |
<arturo> |
T184604 aborrero@tools-k8s-master-01:~$ sudo kubectl uncordon tools-worker-1020.tools.eqiad.wmflabs |
[tools] |
13:45 |
<arturo> |
T184604 aborrero@tools-worker-1020:/var/log$ sudo mkdir /var/lib/kubelet/pods/bcb36fe1-7d3d-11e7-9b1a-fa163edef48a/volumes |
[tools] |
13:37 |
<moritzm> |
migrating instances off ganeti2003 for subsequent reboot for kernel security update |
[production] |
13:26 |
<arturo> |
sudo kubectl drain tools-worker-1020.tools.eqiad.wmflabs |
[tools] |
13:26 |
<_joe_> |
restarting pybal on lvs2003 |
[production] |
13:22 |
<arturo> |
empty by hand syslog and daemon.log files. They are so big that logrotate won't handle them |
[tools] |
13:20 |
<arturo> |
aborrero@tools-worker-1020:~$ sudo service kubelet restart |
[tools] |
13:18 |
<arturo> |
aborrero@tools-k8s-master-01:~$ sudo kubectl cordon tools-worker-1020.tools.eqiad.wmflabs for T184604 |
[tools] |
13:13 |
<arturo> |
detected low space in tools-worker-1020, big files in /var/log due to kubelet issue. Opened T184604 |
[tools] |
13:03 |
<mobrovac@tin> |
Finished deploy [restbase/deploy@a2aabfb]: API: add top-by-country, change recommendation route, fix duplicates in onthisday - T181520 T170877 T175974 (duration: 08m 00s) |
[production] |
12:55 |
<mobrovac@tin> |
Started deploy [restbase/deploy@a2aabfb]: API: add top-by-country, change recommendation route, fix duplicates in onthisday - T181520 T170877 T175974 |
[production] |
12:54 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Depool db1097:3315 - T174569 (duration: 01m 03s) |
[production] |
12:54 |
<marostegui> |
Deploy schema change on db1097:3315 - https://phabricator.wikimedia.org/T174569 |
[production] |
12:46 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Repool db1106 - T174569 (duration: 01m 03s) |
[production] |
12:38 |
<moritzm> |
migrating instances off ganeti2004 for subsequent reboot for kernel security update |
[production] |
12:19 |
<moritzm> |
migrating instances off ganeti2005 for subsequent reboot for kernel security update |
[production] |