2025-02-20
§
|
20:52 |
<mutante> |
welcome new deployer Arthur Taylor (T386349) |
[production] |
20:18 |
<eevans@cumin1002> |
END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching aqs2001.codfw.wmnet: Upgrading to Cassandra 4.1.8 (canary) — T385819 - eevans@cumin1002 |
[production] |
20:11 |
<eevans@cumin1002> |
START - Cookbook sre.cassandra.roll-restart for nodes matching aqs2001.codfw.wmnet: Upgrading to Cassandra 4.1.8 (canary) — T385819 - eevans@cumin1002 |
[production] |
20:10 |
<jhathaway@cumin2002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on ms-be2088.codfw.wmnet with reason: T381919 |
[production] |
19:53 |
<jforrester@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/wikifunctions: apply |
[production] |
19:52 |
<jforrester@deploy2002> |
helmfile [eqiad] START helmfile.d/services/wikifunctions: apply |
[production] |
19:52 |
<jforrester@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/wikifunctions: apply |
[production] |
19:51 |
<jforrester@deploy2002> |
helmfile [codfw] START helmfile.d/services/wikifunctions: apply |
[production] |
19:50 |
<cmooney@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 18:00:00 on netflow1002.eqiad.wmnet with reason: keeping gnmic running in debug mode to observe performance change |
[production] |
19:50 |
<jforrester@deploy2002> |
helmfile [staging] DONE helmfile.d/services/wikifunctions: apply |
[production] |
19:49 |
<jforrester@deploy2002> |
helmfile [staging] START helmfile.d/services/wikifunctions: apply |
[production] |
19:12 |
<cmooney@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on netflow1002.eqiad.wmnet with reason: keeping gnmic running in debug mode to observe performance change |
[production] |
19:11 |
<dancy@deploy2002> |
rebuilt and synchronized wikiversions files: group2 to 1.44.0-wmf.17 refs T382368 |
[production] |
18:42 |
<bking@cumin2002> |
END (FAIL) - Cookbook sre.elasticsearch.rolling-operation (exit_code=99) Operation.RESTART (1 nodes at a time) for ElasticSearch cluster relforge: test operations in mixed opensearch/elasticsearch cluster - bking@cumin2002 - T380752: |
[production] |
18:42 |
<bking@cumin2002> |
START - Cookbook sre.elasticsearch.rolling-operation Operation.RESTART (1 nodes at a time) for ElasticSearch cluster relforge: test operations in mixed opensearch/elasticsearch cluster - bking@cumin2002 - T380752: |
[production] |
18:18 |
<bd808@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/developer-portal: apply |
[production] |
18:18 |
<bd808@deploy2002> |
helmfile [codfw] START helmfile.d/services/developer-portal: apply |
[production] |
18:17 |
<bd808@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/developer-portal: apply |
[production] |
18:11 |
<bd808@deploy2002> |
helmfile [eqiad] START helmfile.d/services/developer-portal: apply |
[production] |
18:10 |
<bd808@deploy2002> |
helmfile [staging] DONE helmfile.d/services/developer-portal: apply |
[production] |
18:09 |
<bd808@deploy2002> |
helmfile [staging] START helmfile.d/services/developer-portal: apply |
[production] |
17:51 |
<eevans@cumin1002> |
END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching cassandra-dev200[2-3].codfw.wmnet: Upgrading to Cassandra 4.1.8 — T385819 - eevans@cumin1002 |
[production] |
17:47 |
<eevans@cumin1002> |
END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:ml-cache-eqiad: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002 |
[production] |
17:37 |
<eevans@cumin1002> |
START - Cookbook sre.cassandra.roll-restart for nodes matching cassandra-dev200[2-3].codfw.wmnet: Upgrading to Cassandra 4.1.8 — T385819 - eevans@cumin1002 |
[production] |
17:29 |
<eevans@cumin1002> |
START - Cookbook sre.cassandra.roll-restart for nodes matching A:ml-cache-eqiad: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002 |
[production] |
17:29 |
<eevans@cumin1002> |
END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching cassandra-dev2001.codfw.wmnet: Upgrade to Cassandra 4.1.8 — T385819 - eevans@cumin1002 |
[production] |
17:22 |
<eevans@cumin1002> |
START - Cookbook sre.cassandra.roll-restart for nodes matching cassandra-dev2001.codfw.wmnet: Upgrade to Cassandra 4.1.8 — T385819 - eevans@cumin1002 |
[production] |
17:19 |
<rzl@deploy2002> |
Finished scap sync-world: T385520 (duration: 09m 01s) |
[production] |
17:13 |
<rzl@deploy2002> |
rzl: Continuing with sync |
[production] |
17:12 |
<rzl@deploy2002> |
rzl: T385520 synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
17:10 |
<rzl@deploy2002> |
Started scap sync-world: T385520 |
[production] |
17:08 |
<jhancock@cumin2002> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART |
[production] |
17:08 |
<jhancock@cumin2002> |
START - Cookbook sre.hosts.provision for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART |
[production] |
17:05 |
<arlolra@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/changeprop: apply |
[production] |
17:04 |
<arlolra@deploy2002> |
helmfile [eqiad] START helmfile.d/services/changeprop: apply |
[production] |
16:58 |
<arlolra@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/changeprop: apply |
[production] |
16:56 |
<mutante> |
phab1004 (phabricator) - systemctl stop phabricator_stats_job_mfa_check timer and service; systemctl (gerrit:1117489) |
[production] |
16:55 |
<arlolra@deploy2002> |
helmfile [codfw] START helmfile.d/services/changeprop: apply |
[production] |
16:50 |
<jhancock@cumin2002> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART |
[production] |
16:49 |
<arlolra@deploy2002> |
helmfile [staging] DONE helmfile.d/services/changeprop: apply |
[production] |
16:49 |
<arlolra@deploy2002> |
helmfile [staging] START helmfile.d/services/changeprop: apply |
[production] |
16:45 |
<jhancock@cumin2002> |
START - Cookbook sre.hosts.provision for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART |
[production] |
16:35 |
<dancy@deploy2002> |
Installation of scap version "4.137.0" completed for 204 hosts |
[production] |
16:31 |
<dancy@deploy2002> |
Installing scap version "4.137.0" for 204 host(s) |
[production] |
16:27 |
<jhancock@cumin2002> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART |
[production] |
16:27 |
<jhancock@cumin2002> |
START - Cookbook sre.hosts.provision for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART |
[production] |
16:13 |
<elukey@puppetserver1001> |
conftool action : set/pooled=inactive:weight=5; selector: name=wikikube-worker1004.eqiad.wmnet,dc=eqiad,cluster=maps,service=kartotherian-k8s-ssl |
[production] |
16:12 |
<elukey@puppetserver1001> |
conftool action : set/pooled=inactive:weight=5; selector: name=wikikube-worker2003.codfw.wmnet,dc=codfw,cluster=maps,service=kartotherian-k8s-ssl |
[production] |
16:10 |
<vgutierrez> |
updating liberica to version 0.10 in ulsfo load balancers |
[production] |
16:03 |
<vgutierrez> |
upload liberica 0.9 to apt.wm.o (bookworm-wikimedia) |
[production] |