2019-04-09
|
18:36 |
<andrewbogott> |
moving tools-worker-1016, tools-worker-1017 to eqiad1-r |
[tools] |
18:05 |
<andrewbogott> |
migrating tools-k8s-etcd-02 to eqiad1-r |
[tools] |
18:00 |
<andrewbogott> |
migrating tools-flannel-etcd-01 to eqiad1-r |
[tools] |
17:36 |
<andrewbogott> |
moving tools-worker-1014, tools-worker-1015 to eqiad1-r |
[tools] |
17:05 |
<andrewbogott> |
migrating tools-k8s-etcd-01 to eqiad1-r |
[tools] |
15:56 |
<andrewbogott> |
moving tools-worker-1012, tools-worker-1013 to eqiad1-r |
[tools] |
14:56 |
<bstorm_> |
cleared 4 queues on gridengine of E status (ldap again) |
[tools] |
14:07 |
<andrewbogott> |
moving tools-worker-1010, tools-worker-1011, tools-worker-1001 to eqiad1-r |
[tools] |
03:48 |
<andrewbogott> |
moving tools-worker-1008 and tools-worker-1009 to eqiad1-r |
[tools] |
02:07 |
<bstorm_> |
reloaded ferm on tools-flannel-etcd-0[1-3] to get the k8s node moves to register |
[tools] |
2019-04-04
|
21:21 |
<bd808> |
Uncordoned tools-worker-1013.tools.eqiad.wmflabs after reboot and forced puppet run |
[tools] |
20:53 |
<bd808> |
Rebooting tools-worker-1013 |
[tools] |
20:50 |
<bd808> |
Draining tools-worker-1013.tools.eqiad.wmflabs |
[tools] |
20:29 |
<bd808> |
Released floating IP and deleted instance tools-checker-01 via Horizon |
[tools] |
20:28 |
<bd808> |
Shut down tools-checker-01 via Horizon |
[tools] |
20:17 |
<bd808> |
Repooled tools-webgrid-lighttpd-0906 after reboot, apt-get dist-upgrade, and forced puppet run |
[tools] |
20:13 |
<bd808> |
Hard reboot of tools-sgewebgrid-lighttpd-0906 via Horizon |
[tools] |
20:09 |
<bd808> |
Repooled tools-webgrid-lighttpd-0912 after reboot, apt-get dist-upgrade, and forced puppet run |
[tools] |
20:05 |
<bd808> |
Depooled and rebooted tools-sgewebgrid-lighttpd-0912 |
[tools] |
20:05 |
<bstorm_> |
rebooted tools-webgrid-lighttpd-0912 |
[tools] |
20:03 |
<bstorm_> |
depooled tools-webgrid-lighttpd-0912 |
[tools] |
19:59 |
<bstorm_> |
depooling and rebooting tools-webgrid-lighttpd-0906 |
[tools] |
19:43 |
<bd808> |
Repooled tools-sgewebgrid-lighttpd-0926 after reboot, apt-get dist-upgrade, and forced puppet run |
[tools] |
19:36 |
<bd808> |
Hard reboot of tools-sgewebgrid-lighttpd-0926 via Horizon |
[tools] |
19:30 |
<bd808> |
Rebooting tools-sgewebgrid-lighttpd-0926 |
[tools] |
19:28 |
<bd808> |
Depooled tools-sgewebgrid-lighttpd-0926 |
[tools] |
19:13 |
<bstorm_> |
cleared E state from 7 queues |
[tools] |
17:32 |
<andrewbogott> |
moving tools-static-12 to cloudvirt1023 to keep the two static nodes off the same host |
[tools] |
2019-03-29
|
21:13 |
<bstorm_> |
depooled tools-sgewebgrid-generic-0903 because of some stuck jobs and odd load characteristics |
[tools] |
21:08 |
<bd808> |
Updated cherry-pick of https://gerrit.wikimedia.org/r/#/c/operations/puppet/+/500095/ on tools-puppetmaster-01 (T219243) |
[tools] |
20:48 |
<bd808> |
Using root console to fix broken initial puppet run on tools-checker-03. |
[tools] |
20:32 |
<bd808> |
Creating tools-checker-03 with role::wmcs::toolforge::checker (T219243) |
[tools] |
20:24 |
<bd808> |
Cherry-picked https://gerrit.wikimedia.org/r/#/c/operations/puppet/+/500095/ to tools-puppetmaster-01 for testing (T219243) |
[tools] |
20:22 |
<bd808> |
Disabled puppet on tools-checker-0{1,2} to make testing new role::wmcs::toolforge::checker easier (T219243) |
[tools] |
17:25 |
<bd808> |
Cleared the "Eqw" state of 44 jobs with `qstat -u '*' | grep Eqw | awk '{print $1;}' | xargs -L1 sudo qmod -cj` on tools-sgegrid-master |
[tools] |
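The pipeline in the 17:25 entry matches any line containing "Eqw", which would also catch a job whose name or owner happens to include that string. A slightly stricter variant is sketched below. It assumes the default `qstat` column layout (job ID in column 1, state in column 5) and job names without spaces; the helper name `eqw_job_ids` is hypothetical and not part of Grid Engine.

```shell
# List the IDs of jobs whose state column is exactly "Eqw".
# Reads `qstat -u '*'` style output on stdin.
# Assumption: default qstat layout, column 1 = job ID, column 5 = state.
eqw_job_ids() {
    awk '$5 == "Eqw" { print $1 }'
}

# Dry run: print the qmod command for each matching job instead of running it.
# Remove the `echo` to actually clear the error state.
#   qstat -u '*' | eqw_job_ids | xargs -L1 echo sudo qmod -cj
```

Running the dry-run form first shows which jobs would be touched before any state is cleared.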
17:16 |
<andrewbogott> |
aborted move of tools-static-12; will wait until tomorrow and give DNS caches more time to update |
[tools] |
17:11 |
<bd808> |
Restarted nginx on tools-static-13 |
[tools] |
16:53 |
<andrewbogott> |
moving tools-static-12 to eqiad1-r |
[tools] |