2024-06-12
ยง
|
21:42 |
<eevans@cumin1002> |
END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:cassandra-dev: Apply remote logging fix (r1042273) - eevans@cumin1002 |
[production] |
21:41 |
<brett@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cp4037.ulsfo.wmnet with reason: host reimage |
[production] |
21:36 |
<swfrench@deploy1002> |
helmfile [staging] DONE helmfile.d/services/device-analytics: sync |
[production] |
21:36 |
<swfrench@deploy1002> |
helmfile [staging] START helmfile.d/services/device-analytics: sync |
[production] |
21:36 |
<swfrench@deploy1002> |
helmfile [staging] DONE helmfile.d/services/device-analytics: apply |
[production] |
21:35 |
<swfrench@deploy1002> |
helmfile [staging] START helmfile.d/services/device-analytics: apply |
[production] |
21:34 |
<swfrench@deploy1002> |
helmfile [staging] DONE helmfile.d/services/edit-analytics: apply |
[production] |
21:33 |
<swfrench@deploy1002> |
helmfile [staging] START helmfile.d/services/edit-analytics: apply |
[production] |
21:33 |
<swfrench@deploy1002> |
helmfile [staging] DONE helmfile.d/services/editor-analytics: apply |
[production] |
21:32 |
<swfrench@deploy1002> |
helmfile [staging] START helmfile.d/services/editor-analytics: apply |
[production] |
21:31 |
<swfrench@deploy1002> |
helmfile [staging] DONE helmfile.d/services/geo-analytics: sync |
[production] |
21:31 |
<swfrench@deploy1002> |
helmfile [staging] START helmfile.d/services/geo-analytics: sync |
[production] |
21:30 |
<swfrench@deploy1002> |
helmfile [staging] DONE helmfile.d/services/geo-analytics: apply |
[production] |
21:30 |
<swfrench@deploy1002> |
helmfile [staging] START helmfile.d/services/geo-analytics: apply |
[production] |
21:28 |
<swfrench@deploy1002> |
helmfile [codfw] START helmfile.d/services/image-suggestion: sync |
[production] |
21:28 |
<swfrench@deploy1002> |
helmfile [staging] DONE helmfile.d/services/image-suggestion: apply |
[production] |
21:28 |
<swfrench@deploy1002> |
helmfile [staging] START helmfile.d/services/image-suggestion: apply |
[production] |
21:27 |
<swfrench@deploy1002> |
helmfile [staging] DONE helmfile.d/services/media-analytics: apply |
[production] |
21:26 |
<swfrench@deploy1002> |
helmfile [staging] START helmfile.d/services/media-analytics: apply |
[production] |
21:25 |
<swfrench@deploy1002> |
helmfile [staging] DONE helmfile.d/services/page-analytics: apply |
[production] |
21:24 |
<swfrench@deploy1002> |
helmfile [staging] START helmfile.d/services/page-analytics: apply |
[production] |
21:22 |
<swfrench@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/data-gateway: sync |
[production] |
21:22 |
<swfrench@deploy1002> |
helmfile [codfw] START helmfile.d/services/data-gateway: sync |
[production] |
21:21 |
<eevans@cumin1002> |
START - Cookbook sre.cassandra.roll-restart for nodes matching A:cassandra-dev: Apply remote logging fix (r1042273) - eevans@cumin1002 |
[production] |
21:20 |
<eevans@cumin1002> |
END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching aqs1010.eqiad.wmnet: Apply remote logging fix (r1042273) - eevans@cumin1002 |
[production] |
21:19 |
<brett@cumin2002> |
START - Cookbook sre.hosts.reimage for host cp4037.ulsfo.wmnet with OS bullseye |
[production] |
21:18 |
<brett@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp4037.ulsfo.wmnet with OS bullseye |
[production] |
21:17 |
<swfrench@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/data-gateway: apply |
[production] |
21:17 |
<swfrench@deploy1002> |
helmfile [codfw] START helmfile.d/services/data-gateway: apply |
[production] |
21:13 |
<eevans@cumin1002> |
START - Cookbook sre.cassandra.roll-restart for nodes matching aqs1010.eqiad.wmnet: Apply remote logging fix (r1042273) - eevans@cumin1002 |
[production] |
21:11 |
<ryankemper@cumin2002> |
END (PASS) - Cookbook sre.wdqs.data-reload (exit_code=0) reloading wikidata_full on wdqs2023.codfw.wmnet from DumpsSource.HDFS (hdfs:///wmf/discovery/wdqs-reload-cookbook-test-T349069/ using stat1009.eqiad.wmnet) |
[production] |
21:05 |
<ryankemper@cumin2002> |
START - Cookbook sre.hadoop.reboot-workers for Hadoop analytics cluster |
[production] |
21:05 |
<ryankemper@cumin2002> |
START - Cookbook sre.wdqs.data-reload reloading wikidata_full on wdqs2023.codfw.wmnet from DumpsSource.HDFS (hdfs:///wmf/discovery/wdqs-reload-cookbook-test-T349069/ using stat1009.eqiad.wmnet) |
[production] |
21:04 |
<brett@cumin2002> |
START - Cookbook sre.hosts.reimage for host cp4037.ulsfo.wmnet with OS bullseye |
[production] |
20:53 |
<cjming> |
end of UTC late backport window |
[production] |
20:52 |
<cjming@deploy1002> |
Finished scap: Backport for [[gerrit:1041674|Don't squish images in non-responsive skins e.g. Vector 2010 (T113101)]] (duration: 12m 52s) |
[production] |
20:47 |
<brett@puppetmaster1001> |
conftool action : set/pooled=no; selector: name=cp4037.ulsfo.wmnet |
[production] |
20:44 |
<cjming@deploy1002> |
cjming, jdlrobson: Continuing with sync |
[production] |
20:42 |
<cjming@deploy1002> |
cjming, jdlrobson: Backport for [[gerrit:1041674|Don't squish images in non-responsive skins e.g. Vector 2010 (T113101)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
20:39 |
<cjming@deploy1002> |
Started scap: Backport for [[gerrit:1041674|Don't squish images in non-responsive skins e.g. Vector 2010 (T113101)]] |
[production] |
20:29 |
<cjming@deploy1002> |
Finished scap: Backport for [[gerrit:1041748|Disable quick surveys using deprecated configuration (T367128)]] (duration: 11m 59s) |
[production] |
20:22 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2209 (T367261)', diff saved to https://phabricator.wikimedia.org/P64750 and previous config saved to /var/cache/conftool/dbconfig/20240612-202233-marostegui.json |
[production] |
20:21 |
<cjming@deploy1002> |
jdlrobson, cjming: Continuing with sync |
[production] |
20:19 |
<cjming@deploy1002> |
jdlrobson, cjming: Backport for [[gerrit:1041748|Disable quick surveys using deprecated configuration (T367128)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
20:17 |
<cjming@deploy1002> |
Started scap: Backport for [[gerrit:1041748|Disable quick surveys using deprecated configuration (T367128)]] |
[production] |
20:10 |
<fabfur@cumin1002> |
END (PASS) - Cookbook sre.cdn.roll-reboot (exit_code=0) rolling reboot on A:cp-upload_codfw |
[production] |
20:07 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2209', diff saved to https://phabricator.wikimedia.org/P64749 and previous config saved to /var/cache/conftool/dbconfig/20240612-200726-marostegui.json |
[production] |
20:00 |
<swfrench@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/data-gateway: apply |
[production] |
19:59 |
<swfrench@deploy1002> |
helmfile [codfw] START helmfile.d/services/data-gateway: apply |
[production] |
19:58 |
<brennen@deploy1002> |
rebuilt and synchronized wikiversions files: group1 wikis to 1.43.0-wmf.9 refs T361403 |
[production] |