1151-1200 of 10000 results (30ms)
2024-01-29 ยง
11:26 <wmbot~taavi@runko> START - Cookbook wmcs.toolforge.add_k8s_node for a worker-nfs role in the tools cluster [tools]
11:23 <wmbot~taavi@runko> END (PASS) - Cookbook wmcs.toolforge.add_k8s_node (exit_code=0) for a worker-nfs role in the tools cluster [tools]
11:22 <wmbot~taavi@runko> Added a new k8s worker-nfs tools-k8s-worker-nfs-4.tools.eqiad1.wikimedia.cloud to the cluster [tools]
11:14 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1224 (T355609)', diff saved to https://phabricator.wikimedia.org/P55787 and previous config saved to /var/cache/conftool/dbconfig/20240129-111434-marostegui.json [production]
11:12 <wmbot~taavi@runko> START - Cookbook wmcs.toolforge.add_k8s_node for a worker-nfs role in the tools cluster [tools]
11:12 <wmbot~taavi@runko> END (PASS) - Cookbook wmcs.toolforge.remove_k8s_node (exit_code=0) for host tools-k8s-worker-35 [tools]
11:10 <wmbot~taavi@runko> START - Cookbook wmcs.toolforge.remove_k8s_node for host tools-k8s-worker-35 [tools]
11:10 <wmbot~taavi@runko> END (PASS) - Cookbook wmcs.toolforge.remove_k8s_node (exit_code=0) for host tools-k8s-worker-34 [tools]
11:09 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db1224 (T355609)', diff saved to https://phabricator.wikimedia.org/P55786 and previous config saved to /var/cache/conftool/dbconfig/20240129-110955-marostegui.json [production]
11:09 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1224.eqiad.wmnet with reason: Maintenance [production]
11:09 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 6:00:00 on db1224.eqiad.wmnet with reason: Maintenance [production]
11:09 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1213:3316 (T355609)', diff saved to https://phabricator.wikimedia.org/P55785 and previous config saved to /var/cache/conftool/dbconfig/20240129-110933-marostegui.json [production]
11:09 <wmbot~taavi@runko> START - Cookbook wmcs.toolforge.remove_k8s_node for host tools-k8s-worker-34 [tools]
11:09 <wmbot~taavi@runko> END (PASS) - Cookbook wmcs.toolforge.remove_k8s_node (exit_code=0) for host tools-k8s-worker-33 [tools]
11:07 <wmbot~taavi@runko> START - Cookbook wmcs.toolforge.remove_k8s_node for host tools-k8s-worker-33 [tools]
11:06 <wmbot~taavi@runko> END (PASS) - Cookbook wmcs.toolforge.remove_k8s_node (exit_code=0) for host tools-k8s-worker-32 [tools]
11:05 <btullis@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-airflow1007.eqiad.wmnet with reason: host reimage [production]
11:04 <wmbot~taavi@runko> START - Cookbook wmcs.toolforge.remove_k8s_node for host tools-k8s-worker-32 [tools]
11:01 <btullis@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on an-airflow1007.eqiad.wmnet with reason: host reimage [production]
11:01 <wmbot~taavi@runko> START - Cookbook wmcs.toolforge.remove_k8s_node for host tools-k8s-worker-31 [tools]
10:59 <wmbot~taavi@runko> START - Cookbook wmcs.toolforge.remove_k8s_node for host tools-k8s-worker-30 [tools]
10:57 <wmbot~taavi@runko> END (FAIL) - Cookbook wmcs.toolforge.add_k8s_node (exit_code=99) for a worker-nfs role in the tools cluster [tools]
10:56 <wmbot~taavi@runko> START - Cookbook wmcs.toolforge.add_k8s_node for a worker-nfs role in the tools cluster [tools]
10:54 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1213:3316', diff saved to https://phabricator.wikimedia.org/P55784 and previous config saved to /var/cache/conftool/dbconfig/20240129-105427-marostegui.json [production]
10:53 <jiji@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mc1054.eqiad.wmnet [production]
10:53 <jiji@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mc2054.codfw.wmnet [production]
10:51 <wmbot~taavi@runko> END (PASS) - Cookbook wmcs.toolforge.add_k8s_node (exit_code=0) for a worker-nfs role in the tools cluster [tools]
10:51 <wmbot~taavi@runko> Added a new k8s worker-nfs tools-k8s-worker-nfs-3.tools.eqiad1.wikimedia.cloud to the cluster [tools]
10:47 <jiji@cumin1002> START - Cookbook sre.hosts.reboot-single for host mc2054.codfw.wmnet [production]
10:47 <jiji@cumin1002> START - Cookbook sre.hosts.reboot-single for host mc1054.eqiad.wmnet [production]
10:47 <btullis@cumin1002> START - Cookbook sre.hosts.reimage for host an-airflow1007.eqiad.wmnet with OS bullseye [production]
10:47 <blancadesal> increased harbor quota to 2GiB [tools.wd-shex-infer]
10:46 <btullis> upgrading an-airflow1007 to bullseye for T335261 [analytics]
10:46 <blancadesal> increased harbor quota for wd-shex-infer to 2GiB [tools]
10:45 <blancadesal> increased harbor quota to 2GiB [tools.lucaswerkmeister-test]
10:44 <blancadesal> increased harbor quota for lucaswerkmeister-test to 2GiB [tools]
10:39 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1213:3316', diff saved to https://phabricator.wikimedia.org/P55783 and previous config saved to /var/cache/conftool/dbconfig/20240129-103920-marostegui.json [production]
10:38 <arnaudb@cumin1002> END (FAIL) - Cookbook sre.mysql.clone (exit_code=99) Will create a clone of db2169.codfw.wmnet onto db2194.codfw.wmnet [production]
10:37 <arnaudb@cumin1002> START - Cookbook sre.mysql.clone Will create a clone of db2169.codfw.wmnet onto db2194.codfw.wmnet [production]
10:31 <wmbot~taavi@runko> START - Cookbook wmcs.toolforge.add_k8s_node for a worker-nfs role in the tools cluster [tools]
10:31 <taavi@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.grid.cleanup_queue_errors (exit_code=0) [tools]
10:31 <taavi@cloudcumin1001> START - Cookbook wmcs.toolforge.grid.cleanup_queue_errors [tools]
10:24 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1213:3316 (T355609)', diff saved to https://phabricator.wikimedia.org/P55782 and previous config saved to /var/cache/conftool/dbconfig/20240129-102414-marostegui.json [production]
10:17 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db1213:3316 (T355609)', diff saved to https://phabricator.wikimedia.org/P55781 and previous config saved to /var/cache/conftool/dbconfig/20240129-101757-marostegui.json [production]
10:17 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1213.eqiad.wmnet with reason: Maintenance [production]
10:17 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 6:00:00 on db1213.eqiad.wmnet with reason: Maintenance [production]
10:17 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1201 (T355609)', diff saved to https://phabricator.wikimedia.org/P55780 and previous config saved to /var/cache/conftool/dbconfig/20240129-101735-marostegui.json [production]
10:02 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1201', diff saved to https://phabricator.wikimedia.org/P55779 and previous config saved to /var/cache/conftool/dbconfig/20240129-100229-marostegui.json [production]
10:00 <moritzm> upload prometheus-ganeti-exporter 0.3+deb12u1 to apt.wikimedia.org T300152 [production]
09:56 <XioNoX> enable Puppet on all the ganeti servers for CR990968 deployment - T300152 [production]