2021-04-16 §
23:15 <bstorm> cleaned up all source files for the grid with the old domain name to enable future node creation T277653 [tools]
14:38 <dcaro> added 'will get out of space in X days' panel to the dashboard https://grafana-labs.wikimedia.org/goto/kBlGd0uGk (T279990), we got <5days xd [tools]
11:35 <arturo> running `grid-configurator --all-domains` which basically added tools-sgebastion-10,11 as submit hosts and removed tools-sgegrid-master,shadow as submit hosts [tools]
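Note on the 14:38 entry: the log does not record the query behind the 'will get out of space in X days' panel. As a rough illustration only, an estimate of that kind can be produced by fitting a straight line to recent usage samples and extrapolating to the volume capacity. The function name, sample format and units below are hypothetical and not taken from the dashboard.

```python
def days_until_full(samples, capacity_bytes):
    """Estimate days until a volume fills, using a least-squares linear fit.

    samples: list of (day, used_bytes) pairs (hypothetical input format).
    Returns None when usage is flat or shrinking.
    """
    n = len(samples)
    if n < 2:
        raise ValueError("need at least two samples")
    mean_t = sum(t for t, _ in samples) / n
    mean_u = sum(u for _, u in samples) / n
    denom = sum((t - mean_t) ** 2 for t, _ in samples)
    if denom == 0:
        raise ValueError("samples must cover more than one point in time")
    # Slope of the fitted line, in bytes per day.
    slope = sum((t - mean_t) * (u - mean_u) for t, u in samples) / denom
    if slope <= 0:
        return None
    intercept = mean_u - slope * mean_t
    # Day on which the fitted line reaches the capacity.
    full_day = (capacity_bytes - intercept) / slope
    last_day = max(t for t, _ in samples)
    return max(0.0, full_day - last_day)
```

A linear fit is the simplest choice and ignores bursts or cleanups, so the result is best read as an early warning and not a deadline.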
2021-04-15 §
17:45 <bstorm> cleared error state from tools-sgeexec-0920.tools.eqiad.wmflabs for a failed job [tools]
2021-04-13 §
13:26 <dcaro> upgrade puppet and python-wmflib on tools-prometheus-03 [tools]
11:23 <arturo> deleted shutoff VM tools-package-builder-02 (T275864) [tools]
11:21 <arturo> deleted shutoff VM tools-sge-services-03,04 (T278354) [tools]
11:20 <arturo> deleted shutoff VM tools-docker-registry-03,04 (T278303) [tools]
11:18 <arturo> deleted shutoff VM tools-mail-02 (T278538) [tools]
11:17 <arturo> deleted shutoff VMs tools-static-12,13 (T278539) [tools]
2021-04-11 §
16:07 <bstorm> cleared E state from tools-sgeexec-0917 tools-sgeexec-0933 tools-sgeexec-0934 tools-sgeexec-0937 from failures of jobs 761759, 815031, 815056, 855676, 898936 [tools]
2021-04-08 §
18:25 <bstorm> cleaned up the deprecated entries in /data/project/.system_sge/gridengine/etc/submithosts for tools-sgegrid-master and tools-sgegrid-shadow using the old fqdns T277653 [tools]
09:24 <arturo> allocate & associate floating IP for tools-sgebastion-11, also with DNS A record `dev-buster.toolforge.org` (T275865) [tools]
09:22 <arturo> create DNS A record `login-buster.toolforge.org` pointing to (tools-sgebastion-10) (T275865) [tools]
09:20 <arturo> associate floating IP to tools-sgebastion-10 (T275865) [tools]
09:12 <arturo> created tools-sgebastion-11 (buster) (T275865) [tools]
2021-04-07 §
04:35 <andrewbogott> replacing the mx record '10 mail.tools.wmcloud.org' with '10 mail.tools.wmcloud.org.' — trying to fix axfr for the tools.wmcloud.org zone [tools]
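Note on the 04:35 entry: in the standard zone file format (RFC 1035), a name without a trailing dot is relative and gets the zone origin appended, while a name ending in a dot is taken as absolute. The sketch below shows only that rule. Whether the DNS backend in use here applied it to the MX target is an assumption, since the entry says only that the change was an attempt to fix AXFR. The `qualify` helper is hypothetical.

```python
def qualify(name, origin):
    """Expand a zone-file name against the zone origin (RFC 1035 rules)."""
    if not origin.endswith("."):
        raise ValueError("origin must be an absolute name ending in '.'")
    if name == "@":
        # '@' stands for the origin itself.
        return origin
    if name.endswith("."):
        # Already absolute: used as written.
        return name
    # Relative: the origin is appended.
    return name + "." + origin


origin = "tools.wmcloud.org."
print(qualify("mail.tools.wmcloud.org", origin))
print(qualify("mail.tools.wmcloud.org.", origin))
```

Under this rule the first form expands to `mail.tools.wmcloud.org.tools.wmcloud.org.`, while the second stays `mail.tools.wmcloud.org.`.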
2021-04-06 §
15:16 <bstorm> cleared queue state since a few had "errored" for failed jobs. [tools]
12:59 <dcaro> Removing etcd member tools-k8s-etcd-7.tools.eqiad1.wikimedia.cloud to get an odd number (T267082) [tools]
11:45 <arturo> upgrading jobutils & misctools to 1.42 everywhere [tools]
11:39 <arturo> cleaning up aptly: old package versions, old repos (jessie, trusty, precise) etc [tools]
10:31 <dcaro> Removing etcd member tools-k8s-etcd-6.tools.eqiad.wmflabs (T267082) [tools]
10:21 <arturo> published jobutils & misctools 1.42 (T278748) [tools]
10:21 <arturo> aptly repo had some weirdness due to the cinder volume: hardlinks created by aptly were broken, solved with `sudo aptly publish --skip-signing repo stretch-tools -force-overwrite` [tools]
10:07 <dcaro> adding new etcd member using the cookbook wmcs.toolforge.add_etcd_node (T267082) [tools]
10:05 <arturo> installed aptly from buster-backports on tools-services-05 to see if that makes any difference with an issue when publishing repos [tools]
09:53 <dcaro> Removing etcd member tools-k8s-etcd-4.tools.eqiad.wmflabs (T267082) [tools]
08:55 <dcaro> adding new etcd member using the cookbook wmcs.toolforge.add_etcd_node (T267082) [tools]
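Note on the 12:59 entry ("to get an odd number"): etcd needs a majority of members to agree, so a cluster of n members has a quorum of floor(n/2) + 1. An even-sized cluster tolerates no more failures than the odd-sized cluster one member smaller. A small sketch of that arithmetic follows; the function names are illustrative and the cluster sizes are examples, not the actual Toolforge member count.

```python
def quorum(members):
    """Smallest majority of a cluster with the given number of members."""
    return members // 2 + 1


def failures_tolerated(members):
    """Members that can be lost while a majority is still reachable."""
    return members - quorum(members)


for size in (3, 4, 5, 6):
    print(size, quorum(size), failures_tolerated(size))
```

Sizes 3 and 4 both tolerate one failure, and sizes 5 and 6 both tolerate two, so the extra even member adds load without adding fault tolerance.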
2021-04-05 §
17:02 <bstorm> chowned the data volume for the docker registry to docker-registry:docker-registry [tools]
09:56 <arturo> make jhernandez (IRC joakino) projectadmin (T278975) [tools]
2021-04-01 §
20:43 <bstorm> cleared error state from the grid queues caused by unspecified job errors [tools]
15:53 <dcaro> Removed etcd member tools-k8s-etcd-5.tools.eqiad.wmflabs, adding a new member (T267082) [tools]
15:43 <dcaro> Removing etcd member tools-k8s-etcd-5.tools.eqiad.wmflabs (T267082) [tools]
15:36 <dcaro> Added new etcd member tools-k8s-etcd-9.tools.eqiad1.wikimedia.cloud (T267082) [tools]
15:18 <dcaro> adding new etcd member using the cookbook wmcs.toolforge.add_etcd_node (T267082) [tools]
2021-03-31 §
15:57 <arturo> rebooting `tools-mail-03` after enabling NFS (T267082, T278538) [tools]
15:04 <arturo> created MX record for `tools.wmcloud.org` pointing to `mail.tools.wmcloud.org` [tools]
15:03 <arturo> created DNS A record `mail.tools.wmcloud.org` pointing to [tools]
14:56 <arturo> shutoff tools-mail-02 (T278538) [tools]
14:54 <arturo> point floating IP to tools-mail-03 (T278538) [tools]
14:45 <arturo> created VM `tools-mail-03` as Debian Buster (T278538) [tools]
14:39 <arturo> relocate some of the hiera keys for email server from project-level to prefix [tools]
09:44 <dcaro> running disk performance test on etcd-4 (round2) [tools]
09:05 <dcaro> running disk performance test on etcd-8 [tools]
08:43 <dcaro> running disk performance test on etcd-4 [tools]
2021-03-30 §
16:15 <bstorm> added `labstore::traffic_shaping::egress: 800mbps` to tools-static prefix T278539 [tools]
15:44 <arturo> shutoff tools-static-12/13 (T278539) [tools]
15:41 <arturo> point horizon web proxy `tools-static.wmflabs.org` to tools-static-14 (T278539) [tools]