2151-2200 of 10000 results (98ms)
2023-11-15 ยง
13:55 <XioNoX> disable peering/transit on cr1-esams for linecard reboot - T346779 [production]
13:52 <joal@deploy2002> Finished deploy [analytics/refinery@3e9df5d]: Regular analytics weekly train - HOTFIX [analytics/refinery@3e9df5d8] (duration: 08m 16s) [production]
13:50 <taavi> deploy https://gerrit.wikimedia.org/r/c/operations/homer/public/+/973769/ core routers [production]
13:44 <joal@deploy2002> Started deploy [analytics/refinery@3e9df5d]: Regular analytics weekly train - HOTFIX [analytics/refinery@3e9df5d8] [production]
13:42 <cmooney@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
13:41 <cmooney@cumin1001> START - Cookbook sre.dns.netbox [production]
13:40 <cmooney@cumin1001> END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) [production]
13:39 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: etcd::v3::kubernetes [production]
13:38 <cmooney@cumin1001> START - Cookbook sre.dns.netbox [production]
13:31 <sfaci@deploy2002> Finished deploy [airflow-dags/analytics_test@5a47584]: Regular analytics weekly train [airflow/analytics_test@5a475842] (duration: 00m 14s) [production]
13:31 <sfaci@deploy2002> Started deploy [airflow-dags/analytics_test@5a47584]: Regular analytics weekly train [airflow/analytics_test@5a475842] [production]
13:29 <sfaci@deploy2002> Finished deploy [airflow-dags/analytics@5a47584]: Regular analytics weekly train [airflow/analytics@5a475842] (duration: 00m 27s) [production]
13:29 <sfaci@deploy2002> Started deploy [airflow-dags/analytics@5a47584]: Regular analytics weekly train [airflow/analytics@5a475842] [production]
13:28 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-role for role: etcd::v3::kubernetes [production]
13:22 <sfaci@deploy2002> Finished deploy [airflow-dags/analytics_test@be05071]: Regular analytics weekly train [airflow/analytics_test@c203642a] (duration: 00m 06s) [production]
13:21 <sfaci@deploy2002> Started deploy [airflow-dags/analytics_test@be05071]: Regular analytics weekly train [airflow/analytics_test@c203642a] [production]
13:18 <hnowlan@deploy2002> helmfile [codfw] DONE helmfile.d/services/api-gateway: apply [production]
13:18 <hnowlan@deploy2002> helmfile [codfw] START helmfile.d/services/api-gateway: apply [production]
13:17 <topranks> resetting FPC1 card in cr1-esams which has a major error and gone offline (T351304) [production]
13:14 <cmooney@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host sretest2003.codfw.wmnet with OS bullseye [production]
13:14 <cmooney@cumin1001> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - cmooney@cumin1001" [production]
13:10 <cmooney@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - cmooney@cumin1001" [production]
13:05 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1138.eqiad.wmnet with reason: Maintenance [production]
13:05 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1138.eqiad.wmnet with reason: Maintenance [production]
12:57 <cgoubert@deploy2002> helmfile [codfw] DONE helmfile.d/services/mw-api-ext: apply [production]
12:57 <cgoubert@deploy2002> helmfile [codfw] START helmfile.d/services/mw-api-ext: apply [production]
12:57 <cgoubert@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-api-ext: apply [production]
12:57 <cgoubert@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-api-ext: apply [production]
12:57 <cgoubert@deploy2002> helmfile [codfw] DONE helmfile.d/services/mw-web: apply [production]
12:57 <cgoubert@deploy2002> helmfile [codfw] START helmfile.d/services/mw-web: apply [production]
12:56 <cgoubert@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-web: apply [production]
12:56 <hnowlan@deploy2002> helmfile [eqiad] DONE helmfile.d/services/api-gateway: apply [production]
12:56 <hnowlan@deploy2002> helmfile [eqiad] START helmfile.d/services/api-gateway: apply [production]
12:55 <cgoubert@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-web: apply [production]
12:54 <hnowlan@deploy2002> helmfile [staging] DONE helmfile.d/services/api-gateway: apply [production]
12:54 <hnowlan@deploy2002> helmfile [staging] START helmfile.d/services/api-gateway: apply [production]
12:52 <cmooney@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on sretest2003.codfw.wmnet with reason: host reimage [production]
12:49 <cmooney@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on sretest2003.codfw.wmnet with reason: host reimage [production]
12:33 <cmooney@cumin1001> START - Cookbook sre.hosts.reimage for host sretest2003.codfw.wmnet with OS bullseye [production]
11:57 <stevemunene@deploy2002> Finished deploy [airflow-dags/wmde@91810bc]: (no justification provided) (duration: 00m 10s) [production]
11:56 <stevemunene@deploy2002> Started deploy [airflow-dags/wmde@91810bc]: (no justification provided) [production]
11:52 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: insetup::unowned [production]
11:48 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-role for role: insetup::unowned [production]
11:25 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host thanos-fe2001.codfw.wmnet [production]
11:24 <taavi> update cr*-{codfw,eqiad} firewall policy via homer to update cloudcontrol1006 addressing [production]
11:24 <btullis@deploy2002> helmfile [eqiad] DONE helmfile.d/services/datahub: sync on main [production]
11:21 <btullis@deploy2002> helmfile [eqiad] START helmfile.d/services/datahub: apply on main [production]
11:20 <btullis@cumin1001> END (ERROR) - Cookbook sre.druid.roll-restart-workers (exit_code=97) for Druid analytics cluster: Roll restart of Druid jvm daemons. [production]
11:18 <btullis@cumin1001> START - Cookbook sre.druid.roll-restart-workers for Druid analytics cluster: Roll restart of Druid jvm daemons. [production]
11:17 <btullis@deploy2002> helmfile [codfw] DONE helmfile.d/services/datahub: sync on main [production]