3951-4000 of 10000 results (63ms)
2024-09-06 ยง
15:22 <kamila@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
15:22 <kamila@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Renaming mw2430 to wikikube-worker2103 - kamila@cumin1002" [production]
15:19 <kamila@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Renaming mw2430 to wikikube-worker2103 - kamila@cumin1002" [production]
15:16 <kamila@cumin1002> START - Cookbook sre.dns.netbox [production]
15:15 <kamila@cumin1002> START - Cookbook sre.hosts.rename from mw2430 to wikikube-worker2103 [production]
15:14 <topranks> disabling PyBal on lvs1017 to shift traffic to lvs1020 and allow work to fix faulty fibre link T374247 [production]
15:13 <cmooney@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on lvs1017.eqiad.wmnet with reason: Move traffic off lvs1017 to lvs1020 to troubleshooot faulty link [production]
15:13 <cmooney@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on lvs1017.eqiad.wmnet with reason: Move traffic off lvs1017 to lvs1020 to troubleshooot faulty link [production]
15:07 <kamila@cumin1002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host mw2430.codfw.wmnet [production]
15:07 <kamila@cumin1002> START - Cookbook sre.k8s.pool-depool-node depool for host mw2430.codfw.wmnet [production]
15:02 <hnowlan@cumin1002> END (PASS) - Cookbook sre.k8s.renumber-node (exit_code=0) Renumbering for host wikikube-worker2098.codfw.wmnet [production]
15:02 <hnowlan@cumin1002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker2098.codfw.wmnet [production]
15:02 <hnowlan@cumin1002> START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker2098.codfw.wmnet [production]
14:52 <hnowlan@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker2098.codfw.wmnet with OS bullseye [production]
14:51 <btullis@cumin1002> END (PASS) - Cookbook sre.wikireplicas.add-wiki (exit_code=0) for database bdrwiki (T371759) [production]
14:44 <hnowlan@cumin1002> START - Cookbook sre.hosts.reimage for host wikikube-worker2095.codfw.wmnet with OS bullseye [production]
14:42 <hnowlan@cumin1002> END (FAIL) - Cookbook sre.k8s.renumber-node (exit_code=99) Renumbering for host wikikube-worker2095.codfw.wmnet [production]
14:42 <hnowlan@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wikikube-worker2095.codfw.wmnet with OS bullseye [production]
14:42 <hnowlan@cumin1002> START - Cookbook sre.hosts.reimage for host wikikube-worker2095.codfw.wmnet with OS bullseye [production]
14:42 <hnowlan@cumin1002> START - Cookbook sre.k8s.renumber-node Renumbering for host wikikube-worker2095.codfw.wmnet [production]
14:41 <jayme@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kubestage2001.codfw.wmnet with OS bookworm [production]
14:28 <akosiaris> repool kubernetes1059 T365993 [production]
14:28 <akosiaris@cumin1002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host kubernetes1059.eqiad.wmnet [production]
14:27 <akosiaris@cumin1002> START - Cookbook sre.k8s.pool-depool-node pool for host kubernetes1059.eqiad.wmnet [production]
14:25 <btullis@cumin1002> START - Cookbook sre.wikireplicas.add-wiki for database bdrwiki (T371759) [production]
14:23 <jayme@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubestage2001.codfw.wmnet with reason: host reimage [production]
14:21 <James_F> jforrester@integration-castor05:/srv/castor$ sudo -u jenkins-deploy rm -rf /srv/castor/castor-mw-ext-and-skins/master/mwext-node18-docs-publish/ # T373937 [releng]
14:20 <jayme@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on kubestage2001.codfw.wmnet with reason: host reimage [production]
14:19 <fnegri@cloudcumin1001> END (FAIL) - Cookbook wmcs.vps.refresh_puppet_certs (exit_code=99) on liwa3.linkwatcher.eqiad1.wikimedia.cloud (T367536) [linkwatcher]
14:19 <fnegri@cloudcumin1001> START - Cookbook wmcs.vps.refresh_puppet_certs on liwa3.linkwatcher.eqiad1.wikimedia.cloud (T367536) [linkwatcher]
14:17 <jayme@cumin1002> END (PASS) - Cookbook sre.k8s.renumber-node (exit_code=0) Renumbering for host wikikube-worker2102.codfw.wmnet [production]
14:17 <jayme@cumin1002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker2102.codfw.wmnet [production]
14:17 <jayme@cumin1002> START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker2102.codfw.wmnet [production]
14:13 <jayme@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker2102.codfw.wmnet with OS bullseye [production]
14:10 <akosiaris> restart pybal on lvs1019 [production]
14:08 <fnegri@cloudcumin1001> END (FAIL) - Cookbook wmcs.vps.refresh_puppet_certs (exit_code=99) on coibot.linkwatcher.eqiad1.wikimedia.cloud (T367536) [linkwatcher]
14:07 <fnegri@cloudcumin1001> START - Cookbook wmcs.vps.refresh_puppet_certs on coibot.linkwatcher.eqiad1.wikimedia.cloud (T367536) [linkwatcher]
14:07 <akosiaris> silence alerts based on alertname=PHPFPMTooBusy,deployment=mw-wikifunctions,site=codfw T374241 [production]
13:59 <jayme@cumin1002> START - Cookbook sre.hosts.reimage for host kubestage2001.codfw.wmnet with OS bookworm [production]
13:58 <jayme@cumin1002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host kubestage2001.codfw.wmnet [production]
13:58 <jayme@cumin1002> START - Cookbook sre.k8s.pool-depool-node depool for host kubestage2001.codfw.wmnet [production]
13:56 <jayme@cumin1002> END (PASS) - Cookbook sre.k8s.renumber-node (exit_code=0) Renumbering for host kubestage2001.codfw.wmnet [production]
13:56 <jayme@cumin1002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host kubestage2001.codfw.wmnet [production]
13:56 <jayme@cumin1002> START - Cookbook sre.k8s.pool-depool-node pool for host kubestage2001.codfw.wmnet [production]
13:52 <btullis@cumin1002> END (FAIL) - Cookbook sre.wikireplicas.add-wiki (exit_code=99) for database bdrwiki (T371759) [production]
13:52 <jayme> homer lsw1-a6-codfw* commit 'T372878' [production]
13:51 <btullis@cumin1002> START - Cookbook sre.wikireplicas.add-wiki for database bdrwiki (T371759) [production]
13:51 <fnegri@cloudcumin1001> END (FAIL) - Cookbook wmcs.vps.refresh_puppet_certs (exit_code=99) on coibot.linkwatcher.eqiad1.wikimedia.cloud (T367536) [linkwatcher]
13:51 <fnegri@cloudcumin1001> START - Cookbook wmcs.vps.refresh_puppet_certs on coibot.linkwatcher.eqiad1.wikimedia.cloud (T367536) [linkwatcher]
13:48 <jayme@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker2102.codfw.wmnet with reason: host reimage [production]