2023-11-15 §
13:21 <sfaci@deploy2002> Started deploy [airflow-dags/analytics_test@be05071]: Regular analytics weekly train [airflow/analytics_test@c203642a] [production]
13:18 <hnowlan@deploy2002> helmfile [codfw] DONE helmfile.d/services/api-gateway: apply [production]
13:18 <hnowlan@deploy2002> helmfile [codfw] START helmfile.d/services/api-gateway: apply [production]
13:17 <topranks> resetting FPC1 card in cr1-esams which has a major error and gone offline (T351304) [production]
13:14 <cmooney@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host sretest2003.codfw.wmnet with OS bullseye [production]
13:14 <cmooney@cumin1001> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - cmooney@cumin1001" [production]
13:10 <cmooney@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - cmooney@cumin1001" [production]
13:05 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1138.eqiad.wmnet with reason: Maintenance [production]
13:05 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1138.eqiad.wmnet with reason: Maintenance [production]
12:57 <cgoubert@deploy2002> helmfile [codfw] DONE helmfile.d/services/mw-api-ext: apply [production]
12:57 <cgoubert@deploy2002> helmfile [codfw] START helmfile.d/services/mw-api-ext: apply [production]
12:57 <cgoubert@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-api-ext: apply [production]
12:57 <cgoubert@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-api-ext: apply [production]
12:57 <cgoubert@deploy2002> helmfile [codfw] DONE helmfile.d/services/mw-web: apply [production]
12:57 <cgoubert@deploy2002> helmfile [codfw] START helmfile.d/services/mw-web: apply [production]
12:56 <cgoubert@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-web: apply [production]
12:56 <hnowlan@deploy2002> helmfile [eqiad] DONE helmfile.d/services/api-gateway: apply [production]
12:56 <hnowlan@deploy2002> helmfile [eqiad] START helmfile.d/services/api-gateway: apply [production]
12:55 <cgoubert@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-web: apply [production]
12:54 <hnowlan@deploy2002> helmfile [staging] DONE helmfile.d/services/api-gateway: apply [production]
12:54 <hnowlan@deploy2002> helmfile [staging] START helmfile.d/services/api-gateway: apply [production]
12:52 <cmooney@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on sretest2003.codfw.wmnet with reason: host reimage [production]
12:49 <cmooney@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on sretest2003.codfw.wmnet with reason: host reimage [production]
12:33 <cmooney@cumin1001> START - Cookbook sre.hosts.reimage for host sretest2003.codfw.wmnet with OS bullseye [production]
11:57 <stevemunene@deploy2002> Finished deploy [airflow-dags/wmde@91810bc]: (no justification provided) (duration: 00m 10s) [production]
11:56 <stevemunene@deploy2002> Started deploy [airflow-dags/wmde@91810bc]: (no justification provided) [production]
11:52 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: insetup::unowned [production]
11:48 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-role for role: insetup::unowned [production]
11:25 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host thanos-fe2001.codfw.wmnet [production]
11:24 <taavi> update cr*-{codfw,eqiad} firewall policy via homer to update cloudcontrol1006 addressing [production]
11:24 <btullis@deploy2002> helmfile [eqiad] DONE helmfile.d/services/datahub: sync on main [production]
11:21 <btullis@deploy2002> helmfile [eqiad] START helmfile.d/services/datahub: apply on main [production]
11:20 <btullis@cumin1001> END (ERROR) - Cookbook sre.druid.roll-restart-workers (exit_code=97) for Druid analytics cluster: Roll restart of Druid jvm daemons. [production]
11:18 <btullis@cumin1001> START - Cookbook sre.druid.roll-restart-workers for Druid analytics cluster: Roll restart of Druid jvm daemons. [production]
11:17 <btullis@deploy2002> helmfile [codfw] DONE helmfile.d/services/datahub: sync on main [production]
11:15 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-host for host thanos-fe2001.codfw.wmnet [production]
11:14 <btullis@deploy2002> helmfile [codfw] START helmfile.d/services/datahub: apply on main [production]
10:46 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: miscweb [production]
10:44 <tchanders@deploy2002> helmfile [eqiad] DONE helmfile.d/services/ipoid: apply [production]
10:42 <tchanders@deploy2002> helmfile [eqiad] START helmfile.d/services/ipoid: apply [production]
10:41 <tchanders@deploy2002> helmfile [eqiad] DONE helmfile.d/services/ipoid: apply [production]
10:40 <tchanders@deploy2002> helmfile [eqiad] START helmfile.d/services/ipoid: apply [production]
10:39 <oblivian@deploy2002> helmfile [codfw] DONE helmfile.d/services/mobileapps: sync [production]
10:39 <oblivian@deploy2002> helmfile [codfw] START helmfile.d/services/mobileapps: sync [production]
10:39 <oblivian@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mobileapps: sync [production]
10:39 <oblivian@deploy2002> helmfile [eqiad] START helmfile.d/services/mobileapps: sync [production]
10:39 <_joe_> roll restart of mobileapps in codfw and eqiad [production]
10:34 <oblivian@deploy2002> helmfile [codfw] DONE helmfile.d/services/mw-api-int: apply [production]
10:31 <oblivian@deploy2002> helmfile [codfw] START helmfile.d/services/mw-api-int: apply [production]
10:31 <oblivian@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-api-int: apply [production]