2023-11-15
16:04 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host lvs6003.drmrs.wmnet [production]
15:58 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts db1130.eqiad.wmnet [production]
15:58 <arnaudb@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
15:58 <arnaudb@cumin1001> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: db1130.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - arnaudb@cumin1001" [production]
15:57 <arnaudb@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: db1130.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - arnaudb@cumin1001" [production]
15:56 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-host for host lvs6003.drmrs.wmnet [production]
15:55 <arnaudb@cumin1001> START - Cookbook sre.dns.netbox [production]
15:49 <arnaudb@cumin1001> START - Cookbook sre.hosts.decommission for hosts db1130.eqiad.wmnet [production]
15:48 <fabfur> swapped cp1107 <-> cp1082 (T349244) [production]
15:46 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host doh6001.wikimedia.org [production]
15:46 <fabfur@cumin1001> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for cp1107.eqiad.wmnet [production]
15:46 <fabfur@cumin1001> START - Cookbook sre.hosts.remove-downtime for cp1107.eqiad.wmnet [production]
15:44 <fabfur> swapped cp1106 <-> cp1081 (T349244) [production]
15:43 <fabfur@cumin1001> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for cp1106.eqiad.wmnet [production]
15:43 <fabfur@cumin1001> START - Cookbook sre.hosts.remove-downtime for cp1106.eqiad.wmnet [production]
15:41 <godog> bounce prometheus-blackbox-exporter on prometheus4002 [production]
15:40 <godog> bounce prometheus@ops on prometheus4002 [production]
15:39 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-host for host doh6001.wikimedia.org [production]
15:33 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host durum1001.eqiad.wmnet [production]
15:28 <cmooney@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on sretest1001.eqiad.wmnet with reason: host reimage [production]
15:28 <arnaudb@cumin1001> dbctl commit (dc=all): 'depool db1127', diff saved to https://phabricator.wikimedia.org/P53485 and previous config saved to /var/cache/conftool/dbconfig/20231115-152836-arnaudb.json [production]
15:26 <cmooney@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on sretest1001.eqiad.wmnet with reason: host reimage [production]
15:25 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-host for host durum1001.eqiad.wmnet [production]
15:23 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host restbase1024.eqiad.wmnet [production]
15:22 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts db1127.eqiad.wmnet [production]
15:22 <arnaudb@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
15:22 <arnaudb@cumin1001> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: db1127.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - arnaudb@cumin1001" [production]
15:21 <arnaudb@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: db1127.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - arnaudb@cumin1001" [production]
15:19 <arnaudb@cumin1001> START - Cookbook sre.dns.netbox [production]
15:16 <eevans@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host aqs1012.eqiad.wmnet with OS bullseye [production]
15:13 <arnaudb@cumin1001> START - Cookbook sre.hosts.decommission for hosts db1127.eqiad.wmnet [production]
15:12 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-host for host restbase1024.eqiad.wmnet [production]
15:09 <cmooney@cumin1001> START - Cookbook sre.hosts.reimage for host sretest1001.eqiad.wmnet with OS bullseye [production]
15:08 <cmooney@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on sretest2004.codfw.wmnet with reason: host reimage [production]
15:05 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: vrts [production]
15:05 <cmooney@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on sretest2004.codfw.wmnet with reason: host reimage [production]
15:00 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-role for role: vrts [production]
14:50 <cmooney@cumin1001> START - Cookbook sre.hosts.reimage for host sretest2004.codfw.wmnet with OS bullseye [production]
14:47 <awight@deploy2002> Finished scap: Backport for [[gerrit:974169|GrowthExperiments: enable AddLink backend for 16,17th rounds of wikis (T308142 T308143)]] (duration: 08m 16s) [production]
14:47 <brouberol@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host an-druid1003.eqiad.wmnet with OS bullseye [production]
14:45 <jbond@cumin1001> END (FAIL) - Cookbook sre.puppet.migrate-role (exit_code=99) for role: wmcs::openstack::codfw1dev::control [production]
14:42 <awight@deploy2002> sgimeno and awight: Continuing with sync [production]
14:41 <awight@deploy2002> sgimeno and awight: Backport for [[gerrit:974169|GrowthExperiments: enable AddLink backend for 16,17th rounds of wikis (T308142 T308143)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
14:39 <awight@deploy2002> Started scap: Backport for [[gerrit:974169|GrowthExperiments: enable AddLink backend for 16,17th rounds of wikis (T308142 T308143)]] [production]
14:37 <awight@deploy2002> Finished scap: Backport for [[gerrit:974200|prod: Enable $wgCampaignEventsEnableParticipantQuestions (T347607)]] (duration: 16m 09s) [production]
14:35 <claime> Raised mw-on-k8s to 20% of external traffic, rollout will happen over the next half hour - T348122 [production]
14:31 <awight@deploy2002> daimona and awight: Continuing with sync [production]
14:31 <eevans@cumin1001> START - Cookbook sre.hosts.reimage for host aqs1012.eqiad.wmnet with OS bullseye [production]
14:30 <eevans@cumin1001> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host aqs1012.eqiad.wmnet with OS bullseye [production]
14:26 <joal@deploy2002> Finished deploy [analytics/refinery@3e9df5d] (hadoop-test): Regular analytics weekly train - TEST - HOTFIX [analytics/refinery@3e9df5d8] (duration: 03m 13s) [production]