2022-06-07
§
|
17:13 |
<dduvall@deploy1002> |
Started scap: testwikis wikis to 1.39.0-wmf.15 refs T308068 |
[production] |
16:56 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
16:50 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply |
[production] |
16:50 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply |
[production] |
16:43 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
16:32 |
<dduvall> |
scap deploy-promote testwikis failed at invocation of logstash_checker.py ("logstash_checker.py: error: argument --delay: invalid int value: '40.406498670578'") T308068 |
[production] |
16:21 |
<dduvall@deploy1002> |
scap failed: RuntimeError scap failed: average error rate on 8/8 canaries increased by 10x (rerun with --force to override this check, see https://logstash.wikimedia.org for details) (duration: 14m 27s) |
[production] |
16:21 |
<dduvall@deploy1002> |
scap failed: average error rate on 8/8 canaries increased by 10x (rerun with --force to override this check, see https://logstash.wikimedia.org for details) |
[production] |
16:18 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
16:17 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply |
[production] |
16:17 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply |
[production] |
16:16 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
16:11 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
16:08 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply |
[production] |
16:08 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply |
[production] |
16:06 |
<dduvall@deploy1002> |
Started scap: testwikis wikis to 1.39.0-wmf.15 refs T308068 |
[production] |
16:06 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
15:08 |
<volans@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
14:57 |
<jbond@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host sretest1001.eqiad.wmnet with OS buster |
[production] |
14:52 |
<volans> |
upgrading spicerack to v2.6.0 on cumin2002 |
[production] |
14:50 |
<volans> |
uploaded spicerack_2.6.0 to apt.wikimedia.org bullseye-wikimedia |
[production] |
14:48 |
<btullis@cumin1001> |
END (PASS) - Cookbook sre.kafka.roll-restart-brokers (exit_code=0) for Kafka A:kafka-jumbo-eqiad cluster: Roll restart of jvm daemons. |
[production] |
14:45 |
<moritzm> |
adding additional disk for /srv to webperf2004 T305460 |
[production] |
14:43 |
<jbond@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on sretest1001.eqiad.wmnet with reason: host reimage |
[production] |
14:41 |
<jbond@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on sretest1001.eqiad.wmnet with reason: host reimage |
[production] |
14:30 |
<jbond@cumin1001> |
START - Cookbook sre.hosts.reimage for host sretest1001.eqiad.wmnet with OS buster |
[production] |
14:02 |
<ayounsi@cumin1001> |
END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) |
[production] |
14:02 |
<ayounsi@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
14:02 |
<jbond@cumin1001> |
END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) netbox-exports.discovery.wmnet on all recursors |
[production] |
14:02 |
<jbond@cumin1001> |
START - Cookbook sre.dns.wipe-cache netbox-exports.discovery.wmnet on all recursors |
[production] |
14:00 |
<jbond@cumin1001> |
END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) netbox-exports.wikimedia.org on all recursors |
[production] |
14:00 |
<jbond@cumin1001> |
START - Cookbook sre.dns.wipe-cache netbox-exports.wikimedia.org on all recursors |
[production] |
13:58 |
<btullis@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/eventgate-main: apply |
[production] |
13:57 |
<btullis@deploy1002> |
helmfile [eqiad] START helmfile.d/services/eventgate-main: apply |
[production] |
13:55 |
<btullis@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/eventgate-main: apply |
[production] |
13:54 |
<btullis@deploy1002> |
helmfile [codfw] START helmfile.d/services/eventgate-main: apply |
[production] |
13:54 |
<btullis@deploy1002> |
helmfile [staging] DONE helmfile.d/services/eventgate-main: apply |
[production] |
13:53 |
<btullis@deploy1002> |
helmfile [staging] START helmfile.d/services/eventgate-main: apply |
[production] |
13:53 |
<jbond@cumin1001> |
END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) netbox-exports.discovery.wmnet on all recursors |
[production] |
13:53 |
<jbond@cumin1001> |
START - Cookbook sre.dns.wipe-cache netbox-exports.discovery.wmnet on all recursors |
[production] |
13:53 |
<btullis@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/eventgate-logging-external: apply |
[production] |
13:52 |
<btullis@deploy1002> |
helmfile [eqiad] START helmfile.d/services/eventgate-logging-external: apply |
[production] |
13:51 |
<btullis@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/eventgate-logging-external: apply |
[production] |
13:51 |
<btullis@deploy1002> |
helmfile [codfw] START helmfile.d/services/eventgate-logging-external: apply |
[production] |
13:50 |
<btullis@deploy1002> |
helmfile [staging] DONE helmfile.d/services/eventgate-logging-external: apply |
[production] |
13:50 |
<btullis@deploy1002> |
helmfile [staging] START helmfile.d/services/eventgate-logging-external: apply |
[production] |
13:48 |
<btullis@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/eventgate-analytics: apply |
[production] |
13:47 |
<btullis@deploy1002> |
helmfile [eqiad] START helmfile.d/services/eventgate-analytics: apply |
[production] |
13:47 |
<btullis@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/eventgate-analytics: apply |
[production] |
13:46 |
<btullis@deploy1002> |
helmfile [codfw] START helmfile.d/services/eventgate-analytics: apply |
[production] |