5601-5650 of 10000 results (102ms)
2023-11-15 ยง
22:57 <bking@cumin2002> START - Cookbook sre.puppet.renew-cert for cloudelastic1008.wikimedia.org: Renew puppet certificate - bking@cumin2002 [production]
22:41 <ryankemper@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on cloudelastic[1007-1010].wikimedia.org with reason: new cloudelastic hosts TT351354 [production]
22:41 <ryankemper@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on cloudelastic[1007-1010].wikimedia.org with reason: new cloudelastic hosts TT351354 [production]
22:20 <ryankemper> T351354 Merged https://gerrit.wikimedia.org/r/c/operations/puppet/+/974693; running puppet on hosts [production]
19:39 <topranks> re-enabling puppet on DNS hosts to adjust TTL setting in BIRD (T350488) [production]
19:37 <bking@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudelastic1010.wikimedia.org with OS bullseye [production]
19:36 <bking@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudelastic1009.wikimedia.org with OS bullseye [production]
19:34 <bking@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudelastic1008.wikimedia.org with OS bullseye [production]
19:23 <jhuneidi@deploy2002> Synchronized php: group1 wikis to 1.42.0-wmf.5 refs T350081 (duration: 05m 52s) [production]
19:17 <jhuneidi@deploy2002> rebuilt and synchronized wikiversions files: group1 wikis to 1.42.0-wmf.5 refs T350081 [production]
19:15 <dzahn@cumin1001> END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: aphlict [production]
19:10 <topranks> merging patch to remove TTL restriction on Bird Anycast BGP peerings (T350488) [production]
19:09 <dzahn@cumin1001> START - Cookbook sre.puppet.migrate-role for role: aphlict [production]
19:07 <taavi@cumin1001> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host cloudlb2001-dev.codfw.wmnet [production]
19:07 <mutante> aphlict2001 - restart aphlict service after puppet 7 upgrade [production]
19:05 <jbond@cumin1001> END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: wmcs::openstack::codfw1dev::virt_ceph [production]
19:01 <taavi@cumin1001> START - Cookbook sre.puppet.migrate-host for host cloudlb2001-dev.codfw.wmnet [production]
19:00 <taavi@cumin1001> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host cloudgw2003-dev.codfw.wmnet [production]
18:59 <jbond@cumin1001> END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: wmcs::openstack::codfw1dev::services [production]
18:59 <dzahn@cumin1001> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host aphlict2001.codfw.wmnet [production]
18:59 <jbond@cumin1001> START - Cookbook sre.puppet.migrate-role for role: wmcs::openstack::codfw1dev::virt_ceph [production]
18:58 <jbond@cumin1001> END (FAIL) - Cookbook sre.puppet.migrate-role (exit_code=99) for role: wmcs::openstack::codfw1dev::virt_ceph [production]
18:56 <cmooney@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host sretest1001.eqiad.wmnet with OS bullseye [production]
18:54 <jbond@cumin1001> START - Cookbook sre.puppet.migrate-role for role: wmcs::openstack::codfw1dev::virt_ceph [production]
18:54 <jbond@cumin1001> END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: wmcs::openstack::codfw1dev::net [production]
18:54 <dzahn@cumin1001> START - Cookbook sre.puppet.migrate-host for host aphlict2001.codfw.wmnet [production]
18:54 <taavi@cumin1001> START - Cookbook sre.puppet.migrate-host for host cloudgw2003-dev.codfw.wmnet [production]
18:51 <jbond@cumin1001> START - Cookbook sre.puppet.migrate-role for role: wmcs::openstack::codfw1dev::services [production]
18:49 <taavi@cumin1001> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host cloudgw2002-dev.codfw.wmnet [production]
18:45 <jbond@cumin1001> START - Cookbook sre.puppet.migrate-role for role: wmcs::openstack::codfw1dev::net [production]
18:42 <topranks> Reset BGP to lvs4010 from cr3-ulsfo to validate new config T350488 [production]
18:41 <taavi@cumin1001> START - Cookbook sre.puppet.migrate-host for host cloudgw2002-dev.codfw.wmnet [production]
18:36 <topranks> remove TTL setting on server-facing BGP peerings on cr3-ulsfo T350488 [production]
18:25 <jbond@cumin1001> END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: wmcs::openstack::codfw1dev::db [production]
18:16 <bking@cumin2002> START - Cookbook sre.hosts.reimage for host cloudelastic1010.wikimedia.org with OS bullseye [production]
18:15 <bking@cumin2002> START - Cookbook sre.hosts.reimage for host cloudelastic1009.wikimedia.org with OS bullseye [production]
18:14 <jbond@cumin1001> START - Cookbook sre.puppet.migrate-role for role: wmcs::openstack::codfw1dev::db [production]
18:12 <bking@cumin2002> START - Cookbook sre.hosts.reimage for host cloudelastic1008.wikimedia.org with OS bullseye [production]
18:05 <arnaudb@cumin1001> dbctl commit (dc=all): 'Depooling db1141 (T348183)', diff saved to https://phabricator.wikimedia.org/P53488 and previous config saved to /var/cache/conftool/dbconfig/20231115-180503-arnaudb.json [production]
18:04 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1141.eqiad.wmnet with reason: Maintenance [production]
18:04 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1141.eqiad.wmnet with reason: Maintenance [production]
18:01 <jynus> All restart_daemons were successful [production]
18:01 <root@cumin2002> END (PASS) - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies (exit_code=0) rolling restart_daemons on A:swift-fe-codfw [production]
17:57 <bking@cumin1001> START - Cookbook sre.wdqs.data-reload [production]
17:57 <bking@cumin1001> END (ERROR) - Cookbook sre.wdqs.data-reload (exit_code=97) [production]
17:56 <bking@cumin1001> START - Cookbook sre.wdqs.data-reload [production]
17:56 <root@cumin2002> START - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies rolling restart_daemons on A:swift-fe-codfw [production]
17:52 <inflatador> bking@wdqs1024 reboot host to hopefully reduce data reload failures T349011 [production]
17:51 <bking@cumin1001> END (ERROR) - Cookbook sre.wdqs.data-reload (exit_code=97) [production]
17:29 <hnowlan@cumin1001> END (PASS) - Cookbook sre.loadbalancer.restart-pybal (exit_code=0) rolling-restart of pybal on P{lvs1019*,lvs2013*} and A:lvs (T349796) [production]