501-550 of 10000 results (77ms)
2024-02-14 ยง
18:02 <cmooney@cumin1002> START - Cookbook sre.dns.netbox [production]
18:01 <bking@cumin2002> END (FAIL) - Cookbook sre.elasticsearch.rolling-operation (exit_code=99) Operation.RESTART (1 nodes at a time) for ElasticSearch cluster cloudelastic: apply new master settings - bking@cumin2002 - T355617 [production]
17:59 <hnowlan@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on mw2282.codfw.wmnet with reason: Testing if reimage is stable T355333 [production]
17:59 <hnowlan@cumin2002> START - Cookbook sre.hosts.downtime for 1:00:00 on mw2282.codfw.wmnet with reason: Testing if reimage is stable T355333 [production]
17:58 <fnegri@cumin1002> START - Cookbook sre.hosts.reboot-single for host cloudvirtlocal1002.eqiad.wmnet [production]
17:56 <hnowlan@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mw2282.codfw.wmnet with OS bullseye [production]
17:49 <arnaudb@cumin1002> dbctl commit (dc=all): 'db2104 (re)pooling @ 25%: T355864 - Post migration repool of db2104', diff saved to https://phabricator.wikimedia.org/P56787 and previous config saved to /var/cache/conftool/dbconfig/20240214-174906-arnaudb.json [production]
17:49 <arnaudb@cumin1002> dbctl commit (dc=all): 'db2145 (re)pooling @ 100%: T355864 - Post migration repool of db2145', diff saved to https://phabricator.wikimedia.org/P56786 and previous config saved to /var/cache/conftool/dbconfig/20240214-174900-arnaudb.json [production]
17:48 <ladsgroup@deploy2002> Finished scap: Backport for [[gerrit:1003408|Enable echo conditional defaults for loginwiki since 2013 (T357072)]] (duration: 12m 08s) [production]
17:44 <bking@cumin2002> START - Cookbook sre.elasticsearch.rolling-operation Operation.RESTART (1 nodes at a time) for ElasticSearch cluster cloudelastic: apply new master settings - bking@cumin2002 - T355617 [production]
17:41 <ladsgroup@deploy2002> ladsgroup: Continuing with sync [production]
17:39 <fnegri@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudvirtlocal1001.eqiad.wmnet [production]
17:39 <ladsgroup@deploy2002> ladsgroup: Backport for [[gerrit:1003408|Enable echo conditional defaults for loginwiki since 2013 (T357072)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
17:36 <ladsgroup@deploy2002> Started scap: Backport for [[gerrit:1003408|Enable echo conditional defaults for loginwiki since 2013 (T357072)]] [production]
17:33 <arnaudb@cumin1002> dbctl commit (dc=all): 'db2145 (re)pooling @ 75%: T355864 - Post migration repool of db2145', diff saved to https://phabricator.wikimedia.org/P56785 and previous config saved to /var/cache/conftool/dbconfig/20240214-173355-arnaudb.json [production]
17:32 <fnegri@cumin1002> START - Cookbook sre.hosts.reboot-single for host cloudvirtlocal1001.eqiad.wmnet [production]
17:32 <hnowlan@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2282.codfw.wmnet with reason: host reimage [production]
17:32 <fnegri@cumin1002> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=99) for host cloudvirtlocal1001.eqiad.wmnet [production]
17:29 <hnowlan@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on mw2282.codfw.wmnet with reason: host reimage [production]
17:18 <arnaudb@cumin1002> dbctl commit (dc=all): 'db2145 (re)pooling @ 50%: T355864 - Post migration repool of db2145', diff saved to https://phabricator.wikimedia.org/P56784 and previous config saved to /var/cache/conftool/dbconfig/20240214-171850-arnaudb.json [production]
17:13 <hnowlan@cumin2002> START - Cookbook sre.hosts.reimage for host mw2282.codfw.wmnet with OS bullseye [production]
17:10 <fabfur> enabled puppet on A:cp-upload to apply https://gerrit.wikimedia.org/r/c/operations/puppet/+/1003109 selectively (T357479) [production]
17:03 <arnaudb@cumin1002> dbctl commit (dc=all): 'db2145 (re)pooling @ 25%: T355864 - Post migration repool of db2145', diff saved to https://phabricator.wikimedia.org/P56783 and previous config saved to /var/cache/conftool/dbconfig/20240214-170345-arnaudb.json [production]
17:03 <arnaudb@cumin1002> dbctl commit (dc=all): 'db2121 (re)pooling @ 100%: T355864 - Post migration repool of db2121', diff saved to https://phabricator.wikimedia.org/P56782 and previous config saved to /var/cache/conftool/dbconfig/20240214-170339-arnaudb.json [production]
16:56 <fabfur> disabled puppet on A:cp-upload to apply https://gerrit.wikimedia.org/r/c/operations/puppet/+/1003109 selectively (T357479) [production]
16:52 <hnowlan@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host mw2282.codfw.wmnet with OS bullseye [production]
16:48 <arnaudb@cumin1002> dbctl commit (dc=all): 'db2121 (re)pooling @ 75%: T355864 - Post migration repool of db2121', diff saved to https://phabricator.wikimedia.org/P56781 and previous config saved to /var/cache/conftool/dbconfig/20240214-164834-arnaudb.json [production]
16:48 <hnowlan@cumin2002> START - Cookbook sre.hosts.reimage for host mw2282.codfw.wmnet with OS bullseye [production]
16:37 <ayounsi@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host sretest2005.codfw.wmnet with OS bookworm [production]
16:33 <arnaudb@cumin1002> dbctl commit (dc=all): 'db2121 (re)pooling @ 50%: T355864 - Post migration repool of db2121', diff saved to https://phabricator.wikimedia.org/P56780 and previous config saved to /var/cache/conftool/dbconfig/20240214-163330-arnaudb.json [production]
16:20 <hnowlan@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host mw2282.codfw.wmnet with OS bullseye [production]
16:19 <hnowlan@puppetmaster1001> conftool action : set/pooled=yes; selector: name=maps2005.codfw.wmnet [production]
16:18 <arnaudb@cumin1002> dbctl commit (dc=all): 'db2121 (re)pooling @ 25%: T355864 - Post migration repool of db2121', diff saved to https://phabricator.wikimedia.org/P56779 and previous config saved to /var/cache/conftool/dbconfig/20240214-161824-arnaudb.json [production]
16:17 <cgoubert@cumin2002> conftool action : set/pooled=yes; selector: name=(mw2402|mw2403|mw2404|mw2405|mw2407|mw2408|mw2409|mw2401|mw2410|mw2411|parse2001|parse2002|parse2003).* [production]
16:16 <claime> Repooling mw2402|mw2403|mw2404|mw2405|mw2407|mw2408|mw2409|mw2401|mw2410|mw2411|parse2001|parse2002|parse2003 for T355864 [production]
16:16 <claime> Uncordoning kubernetes2019.codfw.wmnet kubernetes2018.codfw.wmnet mw2420.codfw.wmnet mw2421.codfw.wmnet mw2406.codfw.wmnet mw2422.codfw.wmnet mw2423.codfw.wmnet for T355864 [production]
16:07 <topranks> Moving server uplinks from old switch to new codfw rack A5 T355864 [production]
16:07 <ayounsi@cumin1002> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on sretest2005.codfw.wmnet with reason: host reimage [production]
16:07 <ayounsi@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on sretest2005.codfw.wmnet with reason: host reimage [production]
16:07 <cmooney@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on 38 hosts with reason: Migrating servers in codfw rack A5 to lsw1-a5-codfw [production]
16:06 <cmooney@cumin1002> START - Cookbook sre.hosts.downtime for 0:30:00 on 38 hosts with reason: Migrating servers in codfw rack A5 to lsw1-a5-codfw [production]
16:04 <bking@cumin2002> END (FAIL) - Cookbook sre.elasticsearch.rolling-operation (exit_code=99) Operation.RESTART (1 nodes at a time) for ElasticSearch cluster cloudelastic: apply new master settings - bking@cumin2002 - T355617 [production]
15:59 <topranks> disable puppet fleet-wide to allow for distruption to puppetmaster/puppetserver during network maint T355864 [production]
15:59 <bking@cumin2002> START - Cookbook sre.elasticsearch.rolling-operation Operation.RESTART (1 nodes at a time) for ElasticSearch cluster cloudelastic: apply new master settings - bking@cumin2002 - T355617 [production]
15:55 <bking@cumin2002> END (ERROR) - Cookbook sre.elasticsearch.rolling-operation (exit_code=97) Operation.RESTART (1 nodes at a time) for ElasticSearch cluster cloudelastic: apply new master settings - bking@cumin2002 - T355617 [production]
15:54 <hnowlan@cumin2002> START - Cookbook sre.hosts.reimage for host mw2282.codfw.wmnet with OS bullseye [production]
15:53 <ayounsi@cumin1002> START - Cookbook sre.hosts.reimage for host sretest2005.codfw.wmnet with OS bookworm [production]
15:53 <hnowlan@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host mw2282.mgmt.codfw.wmnet with reboot policy GRACEFUL [production]
15:53 <bking@cumin2002> START - Cookbook sre.elasticsearch.rolling-operation Operation.RESTART (1 nodes at a time) for ElasticSearch cluster cloudelastic: apply new master settings - bking@cumin2002 - T355617 [production]
15:51 <cmooney@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on asw-a-codfw,cr[1-2]-codfw,lsw1-a5-codfw.mgmt with reason: prepping for server uplink migration codfw rack a5 [production]