2024-06-10
20:00 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2203 (T352010)', diff saved to https://phabricator.wikimedia.org/P64555 and previous config saved to /var/cache/conftool/dbconfig/20240610-200039-ladsgroup.json [production]
19:58 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db1224 (T364069)', diff saved to https://phabricator.wikimedia.org/P64554 and previous config saved to /var/cache/conftool/dbconfig/20240610-195826-marostegui.json [production]
19:58 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1224.eqiad.wmnet with reason: Maintenance [production]
19:58 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1224.eqiad.wmnet with reason: Maintenance [production]
19:58 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1201 (T364069)', diff saved to https://phabricator.wikimedia.org/P64553 and previous config saved to /var/cache/conftool/dbconfig/20240610-195804-marostegui.json [production]
19:42 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1201', diff saved to https://phabricator.wikimedia.org/P64552 and previous config saved to /var/cache/conftool/dbconfig/20240610-194256-marostegui.json [production]
19:27 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1201', diff saved to https://phabricator.wikimedia.org/P64551 and previous config saved to /var/cache/conftool/dbconfig/20240610-192749-marostegui.json [production]
19:22 <ryankemper@cumin2002> END (FAIL) - Cookbook sre.wdqs.data-reload (exit_code=99) reloading wikidata_full on wdqs2023.codfw.wmnet from DumpsSource.HDFS (hdfs:///wmf/discovery/wdqs-reload-cookbook-test-T349069/ using stat1009.eqiad.wmnet) [production]
19:12 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1201 (T364069)', diff saved to https://phabricator.wikimedia.org/P64550 and previous config saved to /var/cache/conftool/dbconfig/20240610-191242-marostegui.json [production]
19:02 <ryankemper@cumin2002> START - Cookbook sre.wdqs.data-reload reloading wikidata_full on wdqs2023.codfw.wmnet from DumpsSource.HDFS (hdfs:///wmf/discovery/wdqs-reload-cookbook-test-T349069/ using stat1009.eqiad.wmnet) [production]
19:02 <ryankemper@cumin2002> END (FAIL) - Cookbook sre.wdqs.data-reload (exit_code=99) reloading wikidata_full on wdqs2023.codfw.wmnet from DumpsSource.HDFS (hdfs:///wmf/discovery/wdqs-reload-cookbook-test-T349069/ using stat1009.eqiad.wmnet) [production]
18:11 <ryankemper@cumin2002> START - Cookbook sre.wdqs.data-reload reloading wikidata_full on wdqs2023.codfw.wmnet from DumpsSource.HDFS (hdfs:///wmf/discovery/wdqs-reload-cookbook-test-T349069/ using stat1009.eqiad.wmnet) [production]
17:50 <amastilovic@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mw-page-content-change-enrich: apply [production]
17:50 <amastilovic@deploy1002> helmfile [eqiad] START helmfile.d/services/mw-page-content-change-enrich: apply [production]
17:47 <amastilovic@deploy1002> helmfile [codfw] DONE helmfile.d/services/mw-page-content-change-enrich: apply [production]
17:46 <amastilovic@deploy1002> helmfile [codfw] START helmfile.d/services/mw-page-content-change-enrich: apply [production]
17:43 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db1201 (T364069)', diff saved to https://phabricator.wikimedia.org/P64547 and previous config saved to /var/cache/conftool/dbconfig/20240610-174349-marostegui.json [production]
17:43 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1201.eqiad.wmnet with reason: Maintenance [production]
17:43 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1201.eqiad.wmnet with reason: Maintenance [production]
17:43 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1187 (T364069)', diff saved to https://phabricator.wikimedia.org/P64546 and previous config saved to /var/cache/conftool/dbconfig/20240610-174327-marostegui.json [production]
17:37 <otto@deploy1002> helmfile [staging] DONE helmfile.d/services/mw-page-content-change-enrich: apply [production]
17:36 <otto@deploy1002> helmfile [staging] START helmfile.d/services/mw-page-content-change-enrich: apply [production]
17:30 <dancy@deploy1002> Installation of scap version "4.87.0" completed for 285 hosts [production]
17:29 <amastilovic@deploy1002> helmfile [staging] DONE helmfile.d/services/mw-page-content-change-enrich: apply [production]
17:29 <amastilovic@deploy1002> helmfile [staging] START helmfile.d/services/mw-page-content-change-enrich: apply [production]
17:28 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1187', diff saved to https://phabricator.wikimedia.org/P64545 and previous config saved to /var/cache/conftool/dbconfig/20240610-172820-marostegui.json [production]
17:25 <dancy@deploy1002> Installing scap version "4.87.0" for 285 hosts [production]
17:13 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1187', diff saved to https://phabricator.wikimedia.org/P64544 and previous config saved to /var/cache/conftool/dbconfig/20240610-171313-marostegui.json [production]
17:01 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2200.codfw.wmnet with reason: Maintenance [production]
17:01 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2200.codfw.wmnet with reason: Maintenance [production]
16:58 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1187 (T364069)', diff saved to https://phabricator.wikimedia.org/P64543 and previous config saved to /var/cache/conftool/dbconfig/20240610-165806-marostegui.json [production]
16:26 <hnowlan@deploy1002> helmfile [eqiad] DONE helmfile.d/services/thumbor: apply [production]
16:21 <hnowlan@deploy1002> helmfile [eqiad] START helmfile.d/services/thumbor: apply [production]
16:20 <marostegui> Drop flaggedpage_pending from s1 T365568 [production]
16:05 <cdanis> 💙cdanis@cumin1002.eqiad.wmnet ~ 🕛☕ sudo cumin -b 8 '*.codfw.wmnet and C:geoip::data::puppet%fetch_ipinfo_dbs=true' 'sha512sum /usr/share/GeoIPInfo/GeoLite2-ASN.mmdb || run-puppet-agent' [production]
16:01 <cdanis> 💙cdanis@puppetserver2001.codfw.wmnet ~ 🕛☕ sudo systemctl restart sync-puppet-volatile [production]
16:00 <hnowlan@deploy1002> helmfile [codfw] DONE helmfile.d/services/thumbor: apply [production]
16:00 <eevans@cumin1002> END (PASS) - Cookbook sre.cassandra.roll-reboot (exit_code=0) rolling reboot on A:cassandra-dev [production]
15:54 <hnowlan@deploy1002> helmfile [codfw] START helmfile.d/services/thumbor: apply [production]
15:47 <marostegui> Drop flaggedpage_pending from s3 T365568 [production]
15:46 <marostegui> Drop flaggedpage_pending from s5 T365568 [production]
15:43 <marostegui> Drop flaggedpage_pending from s2 T365568 [production]
15:42 <hnowlan@deploy1002> helmfile [staging] DONE helmfile.d/services/thumbor: apply [production]
15:42 <hnowlan@deploy1002> helmfile [staging] START helmfile.d/services/thumbor: apply [production]
15:41 <godog> bounce benthos@mw_accesslog_metrics.service on centrallog hosts [production]
15:41 <marostegui> Drop flaggedpage_pending from s7 T365568 [production]
15:40 <marostegui> Drop flaggedpage_pending from s6 T365568 [production]
15:34 <ladsgroup@deploy1002> Synchronized portals: (no justification provided) (duration: 11m 20s) [production]
15:31 <eevans@cumin1002> START - Cookbook sre.cassandra.roll-reboot rolling reboot on A:cassandra-dev [production]
15:31 <swfrench@deploy1002> helmfile [eqiad] DONE helmfile.d/services/proton: apply [production]