2024-06-12
§
|
23:49 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1182 (T352010)', diff saved to https://phabricator.wikimedia.org/P64751 and previous config saved to /var/cache/conftool/dbconfig/20240612-234923-ladsgroup.json |
[production] |
22:17 |
<brett@puppetmaster1001> |
conftool action : set/pooled=yes; selector: name=cp4037.ulsfo.wmnet |
[production] |
22:13 |
<krinkle@deploy1002> |
Finished scap: Backport for [[gerrit:891733|Move etcd.php from wmf-config/ to src/ (T308932)]] (duration: 13m 42s) |
[production] |
22:10 |
<eevans@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/data-gateway: apply |
[production] |
22:08 |
<eevans@deploy1002> |
helmfile [eqiad] START helmfile.d/services/data-gateway: apply |
[production] |
22:07 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp4037.ulsfo.wmnet with OS bullseye |
[production] |
22:06 |
<eevans@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/data-gateway: apply |
[production] |
22:04 |
<krinkle@deploy1002> |
krinkle: Continuing with sync |
[production] |
22:04 |
<eevans@deploy1002> |
helmfile [codfw] START helmfile.d/services/data-gateway: apply |
[production] |
22:03 |
<krinkle@deploy1002> |
krinkle: Backport for [[gerrit:891733|Move etcd.php from wmf-config/ to src/ (T308932)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
21:59 |
<krinkle@deploy1002> |
Started scap: Backport for [[gerrit:891733|Move etcd.php from wmf-config/ to src/ (T308932)]] |
[production] |
21:44 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp4037.ulsfo.wmnet with reason: host reimage |
[production] |
21:42 |
<eevans@cumin1002> |
END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:cassandra-dev: Apply remote logging fix (r1042273) - eevans@cumin1002 |
[production] |
21:41 |
<brett@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cp4037.ulsfo.wmnet with reason: host reimage |
[production] |
21:36 |
<swfrench@deploy1002> |
helmfile [staging] DONE helmfile.d/services/device-analytics: sync |
[production] |
21:36 |
<swfrench@deploy1002> |
helmfile [staging] START helmfile.d/services/device-analytics: sync |
[production] |
21:36 |
<swfrench@deploy1002> |
helmfile [staging] DONE helmfile.d/services/device-analytics: apply |
[production] |
21:35 |
<swfrench@deploy1002> |
helmfile [staging] START helmfile.d/services/device-analytics: apply |
[production] |
21:34 |
<swfrench@deploy1002> |
helmfile [staging] DONE helmfile.d/services/edit-analytics: apply |
[production] |
21:33 |
<swfrench@deploy1002> |
helmfile [staging] START helmfile.d/services/edit-analytics: apply |
[production] |
21:33 |
<swfrench@deploy1002> |
helmfile [staging] DONE helmfile.d/services/editor-analytics: apply |
[production] |
21:32 |
<swfrench@deploy1002> |
helmfile [staging] START helmfile.d/services/editor-analytics: apply |
[production] |
21:31 |
<swfrench@deploy1002> |
helmfile [staging] DONE helmfile.d/services/geo-analytics: sync |
[production] |
21:31 |
<swfrench@deploy1002> |
helmfile [staging] START helmfile.d/services/geo-analytics: sync |
[production] |
21:30 |
<swfrench@deploy1002> |
helmfile [staging] DONE helmfile.d/services/geo-analytics: apply |
[production] |
21:30 |
<swfrench@deploy1002> |
helmfile [staging] START helmfile.d/services/geo-analytics: apply |
[production] |
21:28 |
<swfrench@deploy1002> |
helmfile [codfw] START helmfile.d/services/image-suggestion: sync |
[production] |
21:28 |
<swfrench@deploy1002> |
helmfile [staging] DONE helmfile.d/services/image-suggestion: apply |
[production] |
21:28 |
<swfrench@deploy1002> |
helmfile [staging] START helmfile.d/services/image-suggestion: apply |
[production] |
21:27 |
<swfrench@deploy1002> |
helmfile [staging] DONE helmfile.d/services/media-analytics: apply |
[production] |
21:26 |
<swfrench@deploy1002> |
helmfile [staging] START helmfile.d/services/media-analytics: apply |
[production] |
21:25 |
<swfrench@deploy1002> |
helmfile [staging] DONE helmfile.d/services/page-analytics: apply |
[production] |
21:24 |
<swfrench@deploy1002> |
helmfile [staging] START helmfile.d/services/page-analytics: apply |
[production] |
21:22 |
<swfrench@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/data-gateway: sync |
[production] |
21:22 |
<swfrench@deploy1002> |
helmfile [codfw] START helmfile.d/services/data-gateway: sync |
[production] |
21:21 |
<eevans@cumin1002> |
START - Cookbook sre.cassandra.roll-restart for nodes matching A:cassandra-dev: Apply remote logging fix (r1042273) - eevans@cumin1002 |
[production] |
21:20 |
<eevans@cumin1002> |
END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching aqs1010.eqiad.wmnet: Apply remote logging fix (r1042273) - eevans@cumin1002 |
[production] |
21:19 |
<brett@cumin2002> |
START - Cookbook sre.hosts.reimage for host cp4037.ulsfo.wmnet with OS bullseye |
[production] |
21:18 |
<brett@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp4037.ulsfo.wmnet with OS bullseye |
[production] |
21:17 |
<swfrench@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/data-gateway: apply |
[production] |
21:17 |
<swfrench@deploy1002> |
helmfile [codfw] START helmfile.d/services/data-gateway: apply |
[production] |
21:13 |
<eevans@cumin1002> |
START - Cookbook sre.cassandra.roll-restart for nodes matching aqs1010.eqiad.wmnet: Apply remote logging fix (r1042273) - eevans@cumin1002 |
[production] |
21:11 |
<ryankemper@cumin2002> |
END (PASS) - Cookbook sre.wdqs.data-reload (exit_code=0) reloading wikidata_full on wdqs2023.codfw.wmnet from DumpsSource.HDFS (hdfs:///wmf/discovery/wdqs-reload-cookbook-test-T349069/ using stat1009.eqiad.wmnet) |
[production] |
21:05 |
<ryankemper@cumin2002> |
START - Cookbook sre.hadoop.reboot-workers for Hadoop analytics cluster |
[production] |
21:05 |
<ryankemper@cumin2002> |
START - Cookbook sre.wdqs.data-reload reloading wikidata_full on wdqs2023.codfw.wmnet from DumpsSource.HDFS (hdfs:///wmf/discovery/wdqs-reload-cookbook-test-T349069/ using stat1009.eqiad.wmnet) |
[production] |
21:04 |
<brett@cumin2002> |
START - Cookbook sre.hosts.reimage for host cp4037.ulsfo.wmnet with OS bullseye |
[production] |