2018-01-10
16:03 <marostegui> Deploy schema change on db1095.s5 - https://phabricator.wikimedia.org/T174569 [production]
16:02 <marostegui@tin> Synchronized wmf-config/db-eqiad.php: Depool db1096:3315 - T174569 (duration: 01m 02s) [production]
15:59 <godog> start cassandra-a on restbase1011 [production]
15:37 <marostegui@tin> Synchronized wmf-config/db-eqiad.php: Repool db1097:3315 - T174569 (duration: 01m 03s) [production]
15:32 <moritzm> rebooting yubico auth servers for kernel security update [production]
15:14 <chasemp> tools-clushmaster-01:~$ clush -f 1 -w @k8s-worker "sudo puppet agent --enable && sudo puppet agent --test" [tools]
15:14 <moritzm> reboot netmon1002 / netmon2001 for kernel security update [production]
15:03 <chasemp> tools-k8s-master-01:~# for n in `kubectl get nodes | awk '{print $1}' | grep -v -e tools-worker-1001 -e tools-worker-1016 -e tools-worker-1016`; do kubectl cordon $n; done [tools]
14:54 <ema> codfw LVSs: upgrade to latest jessie point release (8.10) T182656 and linux kernel 4.9.65-3+deb9u1~bpo8+2 (KPTI) T184267 [production]
14:51 <godog> start cassandra-a on restbase1011 - T184100 [production]
14:50 <zeljkof> EU SWAT finished [production]
14:50 <jynus> dropping dewiki from dbstore2001:3318 T184599 [production]
14:48 <_joe_> shutting down deployment-puppetdb01.deployment-prep.eqiad.wmflabs, unused [releng]
14:47 <zfilipin@tin> Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:402780|translationadmin: remove configuration equal to CommonSettings.php (T184314)]] (duration: 01m 02s) [production]
14:46 <zfilipin@tin> Synchronized wmf-config/CommonSettings.php: SWAT: [[gerrit:403410|translationadmin: typo fix]] (duration: 01m 03s) [production]
14:42 <chasemp> new meltdown images are live in cloud land [production]
14:41 <chasemp> tools-clushmaster-01:~$ clush -w @k8s-worker "sudo puppet agent --disable 'chase rollout'" [tools]
14:34 <jynus> dropping wikidatawiki from dbstore2001:3315 T184599 [production]
14:09 <zfilipin@tin> Synchronized wmf-config/throttle.php: SWAT: [[gerrit:403342|Lift the cap on IP address to create accounts on mrwiki (T184579)]] (duration: 01m 04s) [production]
14:05 <moritzm> migrating instances off ganeti2002 for subsequent reboot for kernel security update [production]
14:01 <chasemp> tools-k8s-master-01:~# kubectl uncordon tools-worker-1001.tools.eqiad.wmflabs [tools]
13:57 <arturo> T184604 cleaned stalled log files that prevented logrotate from working. Triggered a couple of logrotate runs by hand in tools-worker-1020.tools.eqiad.wmflabs [tools]
13:46 <arturo> T184604 aborrero@tools-k8s-master-01:~$ sudo kubectl uncordon tools-worker-1020.tools.eqiad.wmflabs [tools]
13:45 <arturo> T184604 aborrero@tools-worker-1020:/var/log$ sudo mkdir /var/lib/kubelet/pods/bcb36fe1-7d3d-11e7-9b1a-fa163edef48a/volumes [tools]
13:37 <moritzm> migrating instances off ganeti2003 for subsequent reboot for kernel security update [production]
13:26 <arturo> sudo kubectl drain tools-worker-1020.tools.eqiad.wmflabs [tools]
13:26 <_joe_> restarting pybal on lvs2003 [production]
13:22 <arturo> empty by hand syslog and daemon.log files. They are so big that logrotate won't handle them [tools]
13:20 <arturo> aborrero@tools-worker-1020:~$ sudo service kubelet restart [tools]
13:18 <arturo> aborrero@tools-k8s-master-01:~$ sudo kubectl cordon tools-worker-1020.tools.eqiad.wmflabs for T184604 [tools]
13:13 <arturo> detected low space in tools-worker-1020, big files in /var/log due to kubelet issue. Opened T184604 [tools]
13:03 <mobrovac@tin> Finished deploy [restbase/deploy@a2aabfb]: API: add top-by-country, change recommendation route, fix duplicates in onthisday - T181520 T170877 T175974 (duration: 08m 00s) [production]
12:55 <mobrovac@tin> Started deploy [restbase/deploy@a2aabfb]: API: add top-by-country, change recommendation route, fix duplicates in onthisday - T181520 T170877 T175974 [production]
12:54 <marostegui@tin> Synchronized wmf-config/db-eqiad.php: Depool db1097:3315 - T174569 (duration: 01m 03s) [production]
12:54 <marostegui> Deploy schema change on db1097:3315 - https://phabricator.wikimedia.org/T174569 [production]
12:46 <marostegui@tin> Synchronized wmf-config/db-eqiad.php: Repool db1106 - T174569 (duration: 01m 03s) [production]
12:38 <moritzm> migrating instances off ganeti2004 for subsequent reboot for kernel security update [production]
12:19 <moritzm> migrating instances off ganeti2005 for subsequent reboot for kernel security update [production]
12:11 <moritzm> rebooting einsteinium for kernel security update [production]
11:51 <moritzm> migrating instances off ganeti2006 for subsequent reboot for kernel security update [production]
11:51 <elukey> re-run webrequest-load-wf-upload-2018-1-10-10 (failed due to reboots) [analytics]
11:45 <godog> downtime decommissioned restbase cassandra 2 hosts [production]
11:39 <moritzm> rebooting mw1201-mw1208 for kernel security update (along with update to HHVM 3.18.6) [production]
11:33 <marostegui> Deploy schema change on db1106 - T174569 [production]
11:27 <elukey> re-run webrequest-load-wf-text-2018-1-10-10 (failed due to reboots) [analytics]
11:26 <elukey> reboot analytics1044->47 for kernel updates [analytics]
11:26 <elukey> reboot analytics1044->47 for kernel updates [production]
11:23 <moritzm> migrating instances off ganeti2007 for subsequent reboot for kernel security update [production]
11:19 <volans> Icinga failover to tegmen completed - T170353 [production]
11:12 <moritzm> migrating instances off ganeti2008 for subsequent reboot for kernel security update [production]