2024-01-30
ยง
|
17:22 |
<jforrester@deploy2002> |
Finished scap: Backport for [[gerrit:994202|Do not search for elements if no previews have been registered (T355933 T356186 T356193)]], [[gerrit:994203|Do not search for elements if no previews have been registered (T355933 T356186 T356193)]] (duration: 11m 51s) |
[production] |
17:21 |
<ayounsi@cumin1002> |
START - Cookbook sre.hosts.reimage for host sretest2005.codfw.wmnet with OS bookworm |
[production] |
17:15 |
<jforrester@deploy2002> |
jforrester: Continuing with sync |
[production] |
17:14 |
<jforrester@deploy2002> |
jforrester: Backport for [[gerrit:994202|Do not search for elements if no previews have been registered (T355933 T356186 T356193)]], [[gerrit:994203|Do not search for elements if no previews have been registered (T355933 T356186 T356193)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
17:13 |
<ayounsi@cumin1002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest2005.codfw.wmnet with OS bookworm |
[production] |
17:10 |
<jforrester@deploy2002> |
Started scap: Backport for [[gerrit:994202|Do not search for elements if no previews have been registered (T355933 T356186 T356193)]], [[gerrit:994203|Do not search for elements if no previews have been registered (T355933 T356186 T356193)]] |
[production] |
16:57 |
<bking@cumin2002> |
START - Cookbook sre.elasticsearch.rolling-operation Operation.RESTART (1 nodes at a time) for ElasticSearch cluster cloudelastic: activate new elastic config - bking@cumin2002 - T355617 |
[production] |
16:56 |
<bking@cumin2002> |
conftool action : set/weight=10; selector: name=cloudelastic1009.wikimedia.org |
[production] |
16:56 |
<bking@cumin2002> |
conftool action : set/weight=10; selector: name=cloudelastic1008.wikimedia.org |
[production] |
16:56 |
<bking@cumin2002> |
conftool action : set/weight=10; selector: name=cloudelastic1007.wikimedia.org |
[production] |
16:54 |
<claime> |
Running homer 'cr*codfw*' commit 'T351074' |
[production] |
16:54 |
<jgiannelos@deploy2002> |
helmfile [staging] DONE helmfile.d/services/mobileapps: sync |
[production] |
16:54 |
<jgiannelos@deploy2002> |
helmfile [staging] START helmfile.d/services/mobileapps: sync |
[production] |
16:49 |
<mutante> |
gitlab is back |
[production] |
16:48 |
<jgiannelos@deploy2002> |
helmfile [staging] DONE helmfile.d/services/mobileapps: apply |
[production] |
16:47 |
<jgiannelos@deploy2002> |
helmfile [staging] START helmfile.d/services/mobileapps: apply |
[production] |
16:47 |
<jgiannelos@deploy2002> |
helmfile [staging] DONE helmfile.d/services/mobileapps: apply |
[production] |
16:47 |
<jgiannelos@deploy2002> |
helmfile [staging] START helmfile.d/services/mobileapps: apply |
[production] |
16:44 |
<mutante> |
gitlab is down for maintenance for a few minutes |
[production] |
16:34 |
<bking@cumin2002> |
END (FAIL) - Cookbook sre.elasticsearch.rolling-operation (exit_code=99) Operation.RESTART (1 nodes at a time) for ElasticSearch cluster cloudelastic: activate new elastic config - bking@cumin2002 - T355617 |
[production] |
16:29 |
<dzahn@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on gitlab.wikimedia.org with reason: server move |
[production] |
16:29 |
<dzahn@cumin1002> |
START - Cookbook sre.hosts.downtime for 0:30:00 on gitlab.wikimedia.org with reason: server move |
[production] |
16:28 |
<dzahn@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on gitlab2002.wikimedia.org with reason: server move |
[production] |
16:28 |
<dzahn@cumin1002> |
START - Cookbook sre.hosts.downtime for 0:30:00 on gitlab2002.wikimedia.org with reason: server move |
[production] |
16:25 |
<hnowlan@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mw1466.eqiad.wmnet with OS bullseye |
[production] |
16:21 |
<hnowlan@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mw1457.eqiad.wmnet with OS bullseye |
[production] |
16:18 |
<cgoubert@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mw2366.codfw.wmnet with OS bullseye |
[production] |
16:14 |
<hnowlan@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mw1440.eqiad.wmnet with OS bullseye |
[production] |
16:14 |
<bking@cumin2002> |
START - Cookbook sre.elasticsearch.rolling-operation Operation.RESTART (1 nodes at a time) for ElasticSearch cluster cloudelastic: activate new elastic config - bking@cumin2002 - T355617 |
[production] |
16:13 |
<bking@cumin2002> |
conftool action : set/pooled=yes; selector: name=cloudelastic1008.wikimedia.org |
[production] |
16:13 |
<cgoubert@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mw2370.codfw.wmnet with OS bullseye |
[production] |
16:11 |
<ayounsi@cumin1002> |
START - Cookbook sre.hosts.reimage for host sretest2005.codfw.wmnet with OS bookworm |
[production] |
16:09 |
<hnowlan@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mw1482.eqiad.wmnet with OS bullseye |
[production] |
16:08 |
<cgoubert@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mw2368.codfw.wmnet with OS bullseye |
[production] |
16:06 |
<hnowlan@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1466.eqiad.wmnet with reason: host reimage |
[production] |
16:03 |
<hnowlan@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mw1459.eqiad.wmnet with OS bullseye |
[production] |
16:02 |
<hnowlan@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1457.eqiad.wmnet with reason: host reimage |
[production] |
15:59 |
<cgoubert@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2366.codfw.wmnet with reason: host reimage |
[production] |
15:58 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on cloudelastic1010.eqiad.wmnet with reason: T355617 |
[production] |
15:58 |
<bking@cumin2002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on cloudelastic1010.eqiad.wmnet with reason: T355617 |
[production] |
15:56 |
<hnowlan@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1440.eqiad.wmnet with reason: host reimage |
[production] |
15:54 |
<bking@cumin2002> |
END (FAIL) - Cookbook sre.elasticsearch.rolling-operation (exit_code=99) Operation.RESTART (1 nodes at a time) for ElasticSearch cluster cloudelastic: activate new elastic config - bking@cumin2002 - T355617 |
[production] |
15:53 |
<cgoubert@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2370.codfw.wmnet with reason: host reimage |
[production] |
15:50 |
<hnowlan@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1482.eqiad.wmnet with reason: host reimage |
[production] |
15:47 |
<cgoubert@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2368.codfw.wmnet with reason: host reimage |
[production] |
15:44 |
<hnowlan@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1459.eqiad.wmnet with reason: host reimage |
[production] |
15:42 |
<cgoubert@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2370.codfw.wmnet with reason: host reimage |
[production] |
15:42 |
<hnowlan@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw1457.eqiad.wmnet with reason: host reimage |
[production] |
15:42 |
<hnowlan@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw1466.eqiad.wmnet with reason: host reimage |
[production] |
15:42 |
<cgoubert@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2366.codfw.wmnet with reason: host reimage |
[production] |