1851-1900 of 10000 results (130ms)
2024-11-14 ยง
14:30 <sukhe@cumin1002> END (PASS) - Cookbook sre.dns.roll-restart (exit_code=0) rolling restart_daemons on A:dnsbox and A:magru and A:dnsbox [production]
14:27 <kartik@deploy2002> Finished scap sync-world: Backport for [[gerrit:1091227|CX3 Build 0.2.0+20241114]] (duration: 13m 23s) [production]
14:25 <sukhe@cumin1002> START - Cookbook sre.dns.roll-restart rolling restart_daemons on A:dnsbox and A:magru and A:dnsbox [production]
14:22 <kartik@deploy2002> kartik: Continuing with sync [production]
14:18 <sukhe@cumin1002> END (PASS) - Cookbook sre.dns.roll-restart-reboot-wikimedia-dns (exit_code=0) rolling restart_daemons on A:wikidough and A:wikidough [production]
14:17 <kartik@deploy2002> kartik: Backport for [[gerrit:1091227|CX3 Build 0.2.0+20241114]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
14:13 <kartik@deploy2002> Started scap sync-world: Backport for [[gerrit:1091227|CX3 Build 0.2.0+20241114]] [production]
14:05 <sukhe@cumin1002> START - Cookbook sre.dns.roll-restart-reboot-wikimedia-dns rolling restart_daemons on A:wikidough and A:wikidough [production]
13:50 <aqu@deploy2002> Finished deploy [airflow-dags/analytics@2220747]: Stage Refine parallelization improvment [airflow-dags@2220747d] (duration: 01m 08s) [production]
13:49 <aqu@deploy2002> Started deploy [airflow-dags/analytics@2220747]: Stage Refine parallelization improvment [airflow-dags@2220747d] [production]
13:38 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti7004.magru.wmnet [production]
13:36 <aqu@deploy2002> Finished deploy [airflow-dags/analytics_test@2220747]: Stage Refine parallelization improvment [airflow-dags@2220747d] (duration: 00m 15s) [production]
13:36 <aqu@deploy2002> Started deploy [airflow-dags/analytics_test@2220747]: Stage Refine parallelization improvment [airflow-dags@2220747d] [production]
13:30 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti7004.magru.wmnet [production]
13:21 <kcvelaga@deploy2002> Finished deploy [airflow-dags/analytics_product@c5ab766]: T379546 (duration: 00m 54s) [production]
13:21 <kcvelaga@deploy2002> Started deploy [airflow-dags/analytics_product@c5ab766]: T379546 [production]
13:19 <oblivian@cumin1002> END (PASS) - Cookbook sre.deploy.hiddenparma (exit_code=0) Hiddenparma deployment to the alerting hosts with reason: "Fix search button height - oblivian@cumin1002" [production]
13:18 <oblivian@cumin1002> END (PASS) - Cookbook sre.deploy.python-code (exit_code=0) hiddenparma to alert[1002,2002].wikimedia.org with reason: Fix search button height - oblivian@cumin1002 [production]
13:18 <oblivian@cumin1002> START - Cookbook sre.deploy.python-code hiddenparma to alert[1002,2002].wikimedia.org with reason: Fix search button height - oblivian@cumin1002 [production]
13:18 <oblivian@cumin1002> START - Cookbook sre.deploy.hiddenparma Hiddenparma deployment to the alerting hosts with reason: "Fix search button height - oblivian@cumin1002" [production]
13:05 <jayme@cumin2002> END (PASS) - Cookbook sre.k8s.reimage-stacked-control-plane (exit_code=0) Reimaging k8s control planes of cluster wikikube-codfw: containerd migration [production]
13:04 <jayme@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-ctrl2003.codfw.wmnet with OS bookworm [production]
12:54 <jmm@cumin2002> END (PASS) - Cookbook sre.misc-clusters.roll-restart-reboot-eventschemas (exit_code=0) rolling restart_daemons on A:schema-eqiad [production]
12:53 <jmm@cumin2002> START - Cookbook sre.misc-clusters.roll-restart-reboot-eventschemas rolling restart_daemons on A:schema-eqiad [production]
12:53 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti7004.magru.wmnet [production]
12:52 <moritzm> installing apache2 security updates [production]
12:51 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti7004.magru.wmnet [production]
12:51 <dreamyjazz@deploy2002> Finished scap sync-world: Backport for [[gerrit:1090511|Hide IP reveal tools on Special:AbuseLog and Special:GlobalBlockList (T379583)]] (duration: 09m 08s) [production]
12:49 <moritzm> failover ganeti master of magru02 to ganeti7002 [production]
12:46 <dreamyjazz@deploy2002> dreamyjazz: Continuing with sync [production]
12:45 <dreamyjazz@deploy2002> dreamyjazz: Backport for [[gerrit:1090511|Hide IP reveal tools on Special:AbuseLog and Special:GlobalBlockList (T379583)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
12:43 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti7002.magru.wmnet [production]
12:42 <jayme@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-ctrl2003.codfw.wmnet with reason: host reimage [production]
12:41 <dreamyjazz@deploy2002> Started scap sync-world: Backport for [[gerrit:1090511|Hide IP reveal tools on Special:AbuseLog and Special:GlobalBlockList (T379583)]] [production]
12:38 <jayme@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-ctrl2003.codfw.wmnet with reason: host reimage [production]
12:35 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti7002.magru.wmnet [production]
12:29 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti7002.magru.wmnet [production]
12:25 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti7002.magru.wmnet [production]
12:22 <jayme@cumin2002> START - Cookbook sre.hosts.reimage for host wikikube-ctrl2003.codfw.wmnet with OS bookworm [production]
12:19 <jmm@cumin2002> END (PASS) - Cookbook sre.misc-clusters.roll-restart-reboot-eventschemas (exit_code=0) rolling restart_daemons on A:schema-codfw [production]
12:18 <jmm@cumin2002> START - Cookbook sre.misc-clusters.roll-restart-reboot-eventschemas rolling restart_daemons on A:schema-codfw [production]
12:17 <jayme@cumin2002> START - Cookbook sre.k8s.reimage-stacked-control-plane Reimaging k8s control planes of cluster wikikube-codfw: containerd migration [production]
12:10 <jmm@cumin2002> END (PASS) - Cookbook sre.cdn.roll-restart-reboot-ncredir (exit_code=0) rolling restart_daemons on A:ncredir [production]
12:00 <jmm@cumin2002> START - Cookbook sre.cdn.roll-restart-reboot-ncredir rolling restart_daemons on A:ncredir [production]
11:57 <moritzm> restarting postfix on inbound/outbound servers to pick up openssl updates [production]
11:17 <moritzm> installing openssl security updates [production]
11:08 <jayme@cumin2002> END (PASS) - Cookbook sre.k8s.reimage-stacked-control-plane (exit_code=0) Reimaging k8s control planes of cluster wikikube-codfw: containerd migration [production]
11:08 <jayme@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-ctrl2001.codfw.wmnet with OS bookworm [production]
10:47 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s_services/services/datahub: sync on production [production]
10:45 <jayme@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-ctrl2001.codfw.wmnet with reason: host reimage [production]