2024-06-10
20:00 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2203 (T352010)', diff saved to https://phabricator.wikimedia.org/P64555 and previous config saved to /var/cache/conftool/dbconfig/20240610-200039-ladsgroup.json |
[production] |
19:58 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db1224 (T364069)', diff saved to https://phabricator.wikimedia.org/P64554 and previous config saved to /var/cache/conftool/dbconfig/20240610-195826-marostegui.json |
[production] |
19:58 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1224.eqiad.wmnet with reason: Maintenance |
[production] |
19:58 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1224.eqiad.wmnet with reason: Maintenance |
[production] |
19:58 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1201 (T364069)', diff saved to https://phabricator.wikimedia.org/P64553 and previous config saved to /var/cache/conftool/dbconfig/20240610-195804-marostegui.json |
[production] |
19:42 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1201', diff saved to https://phabricator.wikimedia.org/P64552 and previous config saved to /var/cache/conftool/dbconfig/20240610-194256-marostegui.json |
[production] |
19:27 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1201', diff saved to https://phabricator.wikimedia.org/P64551 and previous config saved to /var/cache/conftool/dbconfig/20240610-192749-marostegui.json |
[production] |
19:22 |
<ryankemper@cumin2002> |
END (FAIL) - Cookbook sre.wdqs.data-reload (exit_code=99) reloading wikidata_full on wdqs2023.codfw.wmnet from DumpsSource.HDFS (hdfs:///wmf/discovery/wdqs-reload-cookbook-test-T349069/ using stat1009.eqiad.wmnet) |
[production] |
19:12 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1201 (T364069)', diff saved to https://phabricator.wikimedia.org/P64550 and previous config saved to /var/cache/conftool/dbconfig/20240610-191242-marostegui.json |
[production] |
19:02 |
<ryankemper@cumin2002> |
START - Cookbook sre.wdqs.data-reload reloading wikidata_full on wdqs2023.codfw.wmnet from DumpsSource.HDFS (hdfs:///wmf/discovery/wdqs-reload-cookbook-test-T349069/ using stat1009.eqiad.wmnet) |
[production] |
19:02 |
<ryankemper@cumin2002> |
END (FAIL) - Cookbook sre.wdqs.data-reload (exit_code=99) reloading wikidata_full on wdqs2023.codfw.wmnet from DumpsSource.HDFS (hdfs:///wmf/discovery/wdqs-reload-cookbook-test-T349069/ using stat1009.eqiad.wmnet) |
[production] |
18:11 |
<ryankemper@cumin2002> |
START - Cookbook sre.wdqs.data-reload reloading wikidata_full on wdqs2023.codfw.wmnet from DumpsSource.HDFS (hdfs:///wmf/discovery/wdqs-reload-cookbook-test-T349069/ using stat1009.eqiad.wmnet) |
[production] |
17:50 |
<amastilovic@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mw-page-content-change-enrich: apply |
[production] |
17:50 |
<amastilovic@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mw-page-content-change-enrich: apply |
[production] |
17:47 |
<amastilovic@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mw-page-content-change-enrich: apply |
[production] |
17:46 |
<amastilovic@deploy1002> |
helmfile [codfw] START helmfile.d/services/mw-page-content-change-enrich: apply |
[production] |
17:43 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db1201 (T364069)', diff saved to https://phabricator.wikimedia.org/P64547 and previous config saved to /var/cache/conftool/dbconfig/20240610-174349-marostegui.json |
[production] |
17:43 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1201.eqiad.wmnet with reason: Maintenance |
[production] |
17:43 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1201.eqiad.wmnet with reason: Maintenance |
[production] |
17:43 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1187 (T364069)', diff saved to https://phabricator.wikimedia.org/P64546 and previous config saved to /var/cache/conftool/dbconfig/20240610-174327-marostegui.json |
[production] |
17:37 |
<otto@deploy1002> |
helmfile [staging] DONE helmfile.d/services/mw-page-content-change-enrich: apply |
[production] |
17:36 |
<otto@deploy1002> |
helmfile [staging] START helmfile.d/services/mw-page-content-change-enrich: apply |
[production] |
17:30 |
<dancy@deploy1002> |
Installation of scap version "4.87.0" completed for 285 hosts |
[production] |
17:29 |
<amastilovic@deploy1002> |
helmfile [staging] DONE helmfile.d/services/mw-page-content-change-enrich: apply |
[production] |
17:29 |
<amastilovic@deploy1002> |
helmfile [staging] START helmfile.d/services/mw-page-content-change-enrich: apply |
[production] |
17:28 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1187', diff saved to https://phabricator.wikimedia.org/P64545 and previous config saved to /var/cache/conftool/dbconfig/20240610-172820-marostegui.json |
[production] |
17:25 |
<dancy@deploy1002> |
Installing scap version "4.87.0" for 285 hosts |
[production] |
17:13 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1187', diff saved to https://phabricator.wikimedia.org/P64544 and previous config saved to /var/cache/conftool/dbconfig/20240610-171313-marostegui.json |
[production] |
17:01 |
<ladsgroup@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2200.codfw.wmnet with reason: Maintenance |
[production] |
17:01 |
<ladsgroup@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2200.codfw.wmnet with reason: Maintenance |
[production] |
16:58 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1187 (T364069)', diff saved to https://phabricator.wikimedia.org/P64543 and previous config saved to /var/cache/conftool/dbconfig/20240610-165806-marostegui.json |
[production] |
16:26 |
<hnowlan@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/thumbor: apply |
[production] |
16:21 |
<hnowlan@deploy1002> |
helmfile [eqiad] START helmfile.d/services/thumbor: apply |
[production] |
16:20 |
<marostegui> |
Drop flaggedpage_pending from s1 T365568 |
[production] |
16:05 |
<cdanis> |
💙cdanis@cumin1002.eqiad.wmnet ~ 🕛☕ sudo cumin -b 8 '*.codfw.wmnet and C:geoip::data::puppet%fetch_ipinfo_dbs=true' 'sha512sum /usr/share/GeoIPInfo/GeoLite2-ASN.mmdb || run-puppet-agent' |
[production] |
16:01 |
<cdanis> |
💙cdanis@puppetserver2001.codfw.wmnet ~ 🕛☕ sudo systemctl restart sync-puppet-volatile |
[production] |
16:00 |
<hnowlan@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/thumbor: apply |
[production] |
16:00 |
<eevans@cumin1002> |
END (PASS) - Cookbook sre.cassandra.roll-reboot (exit_code=0) rolling reboot on A:cassandra-dev |
[production] |
15:54 |
<hnowlan@deploy1002> |
helmfile [codfw] START helmfile.d/services/thumbor: apply |
[production] |
15:47 |
<marostegui> |
Drop flaggedpage_pending from s3 T365568 |
[production] |
15:46 |
<marostegui> |
Drop flaggedpage_pending from s5 T365568 |
[production] |
15:43 |
<marostegui> |
Drop flaggedpage_pending from s2 T365568 |
[production] |
15:42 |
<hnowlan@deploy1002> |
helmfile [staging] DONE helmfile.d/services/thumbor: apply |
[production] |
15:42 |
<hnowlan@deploy1002> |
helmfile [staging] START helmfile.d/services/thumbor: apply |
[production] |
15:41 |
<godog> |
bounce benthos@mw_accesslog_metrics.service on centrallog hosts |
[production] |
15:41 |
<marostegui> |
Drop flaggedpage_pending from s7 T365568 |
[production] |
15:40 |
<marostegui> |
Drop flaggedpage_pending from s6 T365568 |
[production] |
15:34 |
<ladsgroup@deploy1002> |
Synchronized portals: (no justification provided) (duration: 11m 20s) |
[production] |
15:31 |
<eevans@cumin1002> |
START - Cookbook sre.cassandra.roll-reboot rolling reboot on A:cassandra-dev |
[production] |
15:31 |
<swfrench@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/proton: apply |
[production] |