51-100 of 3192 results (14ms)
2021-07-20 §
13:25 <majavah> apply buster systemd security updates [tools]
2021-07-19 §
23:24 <bstorm> applied matchPolicy: equivalent to tools ingress validation controller T280360 [tools]
16:43 <bstorm> cleared queue error state caused by excessive resource use by topicmatcher T282474 [tools]
2021-07-16 §
14:04 <arturo> deployed jobs-framework-api 42b7a885a5bc1bf00c300e8d77bd92e1430a8327 (T286132) [tools]
11:57 <arturo> added toollabs-webservice_0.75_all to jessie-tools aptly repo (T286003) [tools]
11:52 <arturo> created `jessie-tools` aptly repository on tools-services-05 (T286003) [tools]
2021-07-15 §
16:12 <arturo> deploy toolforge-jobs-framework-api git version d85d93ee1c5d4be6a526cf83e806b2679dde3875 (T285944, T286107, T285979, T286485, T286107) [tools]
15:55 <arturo> added toolforge-jobs-framework-cli_2_all.deb to buster-{tools,toolsbeta} (T285944) [tools]
2021-07-14 §
23:29 <bstorm> mounted nfs on tools-services-05 and backing up aptly to NFS dir T286003 [tools]
09:17 <majavah> copying calico 3.18.4 images from docker hub to docker-registry.tools.wmflabs.org T280342 [tools]
2021-07-12 §
16:56 <bstorm> deleted job 4720371 due to LDAP failure [tools]
16:51 <bstorm> cleared the E state from two job queues [tools]
2021-07-02 §
18:46 <bstorm> cleared error state for tools-sgeexec-0940.tools.eqiad.wmflabs [tools]
2021-07-01 §
22:08 <bstorm> releasing webservice 0.75 [tools]
17:03 <andrewbogott> rebooting tools-k8s-worker-[31,33,35,44,49,51,57-58,70].tools.eqiad1.wikimedia.cloud [tools]
16:47 <bstorm> remounted scratch everywhere...but mostly tools T224747 [tools]
15:47 <arturo> rebased labs/private.git [tools]
11:04 <arturo> added toolforge-jobs-framework-cli_1_all.deb to aptly buster-tools,buster-toolsbeta [tools]
10:34 <arturo> refreshed jobs-api deployment [tools]
2021-06-29 §
21:58 <bstorm> clearing one errored queue and a stack of discarded jobs [tools]
20:11 <majavah> toolforge kubernetes upgrade complete T280299 [tools]
17:03 <majavah> starting toolforge kubernetes 1.18 upgrade - T280299 [tools]
16:17 <arturo> deployed jobs-framework-api in the k8s cluster [tools]
15:33 <majavah> remove duplicate definitions from tools-clushmaster-02 /root/.ssh/known_hosts [tools]
15:12 <arturo> livehacking puppetmaster for T283238 [tools]
10:24 <dcaro> running puppet on the buster bastions after 20000 minutes failing... might break something [tools]
2021-06-15 §
19:02 <bstorm> cleared error status from a few queues [tools]
16:15 <majavah> deleting unused shutdown nodes: tools-checker-03 tools-k8s-haproxy-1 tools-k8s-haproxy-2 [tools]
2021-06-14 §
22:21 <bstorm> push docker-registry.tools.wmflabs.org/toolforge-python37-sssd-web:testing to test staged os.execv (and other patches) using toolsbeta toollabs-webservice version 0.75 T282975 [tools]
2021-06-13 §
08:15 <majavah> clear grid error state from tools-sgeexec-0907, tools-sgeexec-0916, tools-sgeexec-0940 [tools]
2021-06-12 §
14:39 <majavah> remove nonexistent tools-prometheus-04 and add tools-prometheus-05 to hiera key "prometheus_nodes" [tools]
13:53 <majavah> create empty bullseye-{tools,toolsbeta} repositories on tools-services-05 aptly [tools]
2021-06-10 §
17:38 <majavah> clear error state from tools-sgeexec-0907, task@tools-sgeexec-0939 [tools]
2021-06-09 §
13:57 <majavah> clear error state from exec nodes tools-sgeexec-0913, tools-sgeexec-0936, task@tools-sgeexec-0940 [tools]
2021-06-07 §
18:39 <bstorm> cleaning up more error conditions on grid queues [tools]
17:42 <majavah> delete `ingress-nginx` namespace and related objects T264221 [tools]
17:37 <majavah> remove tools-k8s-ingress-[1-3] from kubernetes, follow-up to https://sal.toolforge.org/log/nd7v2HkB1jz_IcWuCX5M T264221 [tools]
2021-06-04 §
21:30 <bstorm> deleting "tools-k8s-ingress-3", "tools-k8s-ingress-2", "tools-k8s-ingress-1" T264221 [tools]
21:21 <bstorm> cleared error state from 4 grid queues [tools]
2021-06-03 §
18:26 <majavah> renew prometheus kubernetes certificate T280301 [tools]
17:06 <majavah> renew admission webhook certificates T280301 [tools]
2021-06-01 §
10:10 <majavah> properly clean up deleted vms tools-k8s-haproxy-[1,2], tools-checker-03 from puppet after using the wrong fqdn first time [tools]
09:54 <majavah> clear error state from tools-sgeexec-0913, tools-sgeexec-0950 [tools]
2021-05-30 §
18:58 <majavah> clear grid error state from 14 queues [tools]
2021-05-27 §
18:03 <bstorm> adjusted profile::wmcs::kubeadm::etcd_latency_ms from 30 back to the default (10) [tools]
16:04 <bstorm> cleared error state from several exec node queues [tools]
14:49 <andrewbogott> swapping in three new etcd nodes with local storage: tools-k8s-etcd-13,14,15 [tools]
2021-05-24 §
10:36 <arturo> rebased labs/private.git after merge conflict [tools]
06:49 <majavah> remove scfc kubernetes admin access after bd808 removed tools.admin membership to avoid maintain-kubeusers crashes when it expires [tools]
2021-05-22 §
14:47 <majavah> manually remove jeh admin certificates and from maintain-kubeusers configmap T282725 [tools]