2024-11-20
12:20 <sukhe@cumin2002> START - Cookbook sre.hosts.reimage for host cp7007.magru.wmnet with OS bullseye [production]
12:19 <sukhe@cumin2002> END (FAIL) - Cookbook sre.hosts.dhcp (exit_code=99) for host cp7007.magru.wmnet [production]
12:18 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker2146.codfw.wmnet with reason: host reimage [production]
12:16 <sukhe@cumin2002> START - Cookbook sre.hosts.dhcp for host cp7007.magru.wmnet [production]
12:16 <sukhe@cumin1002> END (FAIL) - Cookbook sre.hosts.dhcp (exit_code=99) for host cp7007.magru.wmnet [production]
12:15 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker2145.codfw.wmnet with reason: host reimage [production]
12:14 <sukhe@cumin1002> START - Cookbook sre.hosts.dhcp for host cp7007.magru.wmnet [production]
12:11 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker2147.codfw.wmnet with reason: host reimage [production]
12:08 <sukhe> disable puppet on cumin2002 to test cumin alias for A:installserver [production]
12:07 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker2148.codfw.wmnet with reason: host reimage [production]
12:04 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker2149.codfw.wmnet with reason: host reimage [production]
12:01 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker2144.codfw.wmnet with reason: host reimage [production]
11:59 <cgoubert@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker2149.codfw.wmnet with reason: host reimage [production]
11:59 <cgoubert@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker2148.codfw.wmnet with reason: host reimage [production]
11:58 <cgoubert@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker2147.codfw.wmnet with reason: host reimage [production]
11:57 <cgoubert@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker2146.codfw.wmnet with reason: host reimage [production]
11:57 <cgoubert@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker2145.codfw.wmnet with reason: host reimage [production]
11:56 <cgoubert@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker2143.codfw.wmnet with reason: host reimage [production]
11:56 <cgoubert@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker2144.codfw.wmnet with reason: host reimage [production]
11:40 <cgoubert@cumin1002> START - Cookbook sre.hosts.reimage for host wikikube-worker2149.codfw.wmnet with OS bookworm [production]
11:39 <cgoubert@cumin1002> START - Cookbook sre.hosts.reimage for host wikikube-worker2148.codfw.wmnet with OS bookworm [production]
11:39 <cgoubert@cumin1002> START - Cookbook sre.hosts.reimage for host wikikube-worker2147.codfw.wmnet with OS bookworm [production]
11:38 <cgoubert@cumin1002> START - Cookbook sre.hosts.reimage for host wikikube-worker2146.codfw.wmnet with OS bookworm [production]
11:38 <cgoubert@cumin1002> START - Cookbook sre.hosts.reimage for host wikikube-worker2145.codfw.wmnet with OS bookworm [production]
11:37 <cgoubert@cumin1002> START - Cookbook sre.hosts.reimage for host wikikube-worker2144.codfw.wmnet with OS bookworm [production]
11:36 <cgoubert@cumin1002> START - Cookbook sre.hosts.reimage for host wikikube-worker2143.codfw.wmnet with OS bookworm [production]
11:30 <fabfur@cumin1002> END (PASS) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=0) rolling upgrade of HAProxy on A:cp-upload_magru [production]
11:24 <fabfur@cumin1002> END (PASS) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=0) rolling upgrade of HAProxy on A:cp-text_magru [production]
11:22 <akosiaris> decommission cxserver endpoints /api/rest_v1/transform/html/from, /api/rest_v1/transform/word/from from RESTBase T375616 [production]
10:43 <btullis@cumin1002> END (PASS) - Cookbook sre.ceph.roll-restart-reboot-server (exit_code=0) rolling reboot on P{cephosd1001.eqiad.wmnet} and (A:cephosd) [production]
10:38 <fabfur@cumin1002> START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-text_magru [production]
10:38 <fabfur@cumin1002> START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-upload_magru [production]
10:37 <fabfur@cumin1002> END (PASS) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=0) rolling upgrade of HAProxy on A:cp-text_esams [production]
10:34 <fabfur@cumin1002> END (PASS) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=0) rolling upgrade of HAProxy on A:cp-upload_esams [production]
10:33 <btullis@cumin1002> START - Cookbook sre.ceph.roll-restart-reboot-server rolling reboot on P{cephosd1001.eqiad.wmnet} and (A:cephosd) [production]
10:33 <jiji@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on kafka-main[1001,1006].eqiad.wmnet with reason: Hardware refresh [production]
10:33 <jayme> re-enabled puppet on all k8s control planes for rollout of T380142 [production]
10:33 <jiji@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on kafka-main[1001,1006].eqiad.wmnet with reason: Hardware refresh [production]
10:22 <effie> removing leadership from kafka-main1001 - T363214 [production]
10:19 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-analytics-test: apply [production]
10:18 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-analytics-test: apply [production]
09:52 <aklapper@deploy2002> rebuilt and synchronized wikiversions files: group1 to 1.44.0-wmf.4 refs T375663 [production]
09:44 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-analytics-test: apply [production]
09:44 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-analytics-test: apply [production]
09:41 <kevinbazira@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
09:38 <akosiaris> decommission cxserver endpoints /api/rest_v1/list/(pair|tool|languagepairs) from RESTBase T375616 [production]
09:35 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-analytics-test: apply [production]
09:34 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-analytics-test: apply [production]
09:33 <aklapper@deploy2002> Finished scap sync-world: Backport for [[gerrit:1093172|EditionLookup: Update EntityLookup calls (T380304)]] (duration: 13m 33s) [production]
09:33 <fabfur@cumin1002> START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-text_esams [production]