701-750 of 10000 results (85ms)
2023-11-15 ยง
14:21 <awight@deploy2002> Started scap: Backport for [[gerrit:974200|prod: Enable $wgCampaignEventsEnableParticipantQuestions (T347607)]] [production]
14:20 <brouberol@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on an-druid1003.eqiad.wmnet with reason: host reimage [production]
14:18 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-host for host kubernetes2054.codfw.wmnet [production]
14:09 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host thanos-be2001.codfw.wmnet [production]
14:08 <sukhe> running authdns-update to depool esams [production]
14:03 <brouberol@cumin1001> START - Cookbook sre.hosts.reimage for host an-druid1003.eqiad.wmnet with OS bullseye [production]
14:03 <joal@deploy2002> Finished deploy [analytics/refinery@3e9df5d]: Regular analytics weekly train - HOTFIX [analytics/refinery@3e9df5d8] (duration: 00m 06s) [production]
14:03 <joal@deploy2002> Started deploy [analytics/refinery@3e9df5d]: Regular analytics weekly train - HOTFIX [analytics/refinery@3e9df5d8] [production]
14:03 <XioNoX> reboot fpc0 on cr1-esams - T346779 [production]
14:00 <eevans@cumin1001> START - Cookbook sre.hosts.reimage for host aqs1012.eqiad.wmnet with OS bullseye [production]
13:59 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-host for host thanos-be2001.codfw.wmnet [production]
13:59 <jbond@cumin1001> START - Cookbook sre.puppet.migrate-role for role: wmcs::openstack::codfw1dev::control [production]
13:55 <XioNoX> disable peering/transit on cr1-esams for linecard reboot - T346779 [production]
13:52 <joal@deploy2002> Finished deploy [analytics/refinery@3e9df5d]: Regular analytics weekly train - HOTFIX [analytics/refinery@3e9df5d8] (duration: 08m 16s) [production]
13:50 <taavi> deploy https://gerrit.wikimedia.org/r/c/operations/homer/public/+/973769/ core routers [production]
13:44 <joal@deploy2002> Started deploy [analytics/refinery@3e9df5d]: Regular analytics weekly train - HOTFIX [analytics/refinery@3e9df5d8] [production]
13:42 <cmooney@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
13:41 <cmooney@cumin1001> START - Cookbook sre.dns.netbox [production]
13:40 <cmooney@cumin1001> END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) [production]
13:39 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: etcd::v3::kubernetes [production]
13:38 <cmooney@cumin1001> START - Cookbook sre.dns.netbox [production]
13:31 <sfaci@deploy2002> Finished deploy [airflow-dags/analytics_test@5a47584]: Regular analytics weekly train [airflow/analytics_test@5a475842] (duration: 00m 14s) [production]
13:31 <sfaci@deploy2002> Started deploy [airflow-dags/analytics_test@5a47584]: Regular analytics weekly train [airflow/analytics_test@5a475842] [production]
13:29 <sfaci@deploy2002> Finished deploy [airflow-dags/analytics@5a47584]: Regular analytics weekly train [airflow/analytics@5a475842] (duration: 00m 27s) [production]
13:29 <sfaci@deploy2002> Started deploy [airflow-dags/analytics@5a47584]: Regular analytics weekly train [airflow/analytics@5a475842] [production]
13:28 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-role for role: etcd::v3::kubernetes [production]
13:22 <sfaci@deploy2002> Finished deploy [airflow-dags/analytics_test@be05071]: Regular analytics weekly train [airflow/analytics_test@c203642a] (duration: 00m 06s) [production]
13:21 <sfaci@deploy2002> Started deploy [airflow-dags/analytics_test@be05071]: Regular analytics weekly train [airflow/analytics_test@c203642a] [production]
13:18 <hnowlan@deploy2002> helmfile [codfw] DONE helmfile.d/services/api-gateway: apply [production]
13:18 <hnowlan@deploy2002> helmfile [codfw] START helmfile.d/services/api-gateway: apply [production]
13:17 <topranks> resetting FPC1 card in cr1-esams which has a major error and gone offline (T351304) [production]
13:14 <cmooney@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host sretest2003.codfw.wmnet with OS bullseye [production]
13:14 <cmooney@cumin1001> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - cmooney@cumin1001" [production]
13:10 <cmooney@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - cmooney@cumin1001" [production]
13:05 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1138.eqiad.wmnet with reason: Maintenance [production]
13:05 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1138.eqiad.wmnet with reason: Maintenance [production]
12:57 <cgoubert@deploy2002> helmfile [codfw] DONE helmfile.d/services/mw-api-ext: apply [production]
12:57 <cgoubert@deploy2002> helmfile [codfw] START helmfile.d/services/mw-api-ext: apply [production]
12:57 <cgoubert@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-api-ext: apply [production]
12:57 <cgoubert@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-api-ext: apply [production]
12:57 <cgoubert@deploy2002> helmfile [codfw] DONE helmfile.d/services/mw-web: apply [production]
12:57 <cgoubert@deploy2002> helmfile [codfw] START helmfile.d/services/mw-web: apply [production]
12:56 <cgoubert@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-web: apply [production]
12:56 <hnowlan@deploy2002> helmfile [eqiad] DONE helmfile.d/services/api-gateway: apply [production]
12:56 <hnowlan@deploy2002> helmfile [eqiad] START helmfile.d/services/api-gateway: apply [production]
12:55 <cgoubert@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-web: apply [production]
12:54 <hnowlan@deploy2002> helmfile [staging] DONE helmfile.d/services/api-gateway: apply [production]
12:54 <hnowlan@deploy2002> helmfile [staging] START helmfile.d/services/api-gateway: apply [production]
12:52 <cmooney@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on sretest2003.codfw.wmnet with reason: host reimage [production]
12:49 <cmooney@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on sretest2003.codfw.wmnet with reason: host reimage [production]