4401-4450 of 10000 results (106ms)
2024-06-10 §
19:02 <ryankemper@cumin2002> START - Cookbook sre.wdqs.data-reload reloading wikidata_full on wdqs2023.codfw.wmnet from DumpsSource.HDFS (hdfs:///wmf/discovery/wdqs-reload-cookbook-test-T349069/ using stat1009.eqiad.wmnet) [production]
19:02 <ryankemper@cumin2002> END (FAIL) - Cookbook sre.wdqs.data-reload (exit_code=99) reloading wikidata_full on wdqs2023.codfw.wmnet from DumpsSource.HDFS (hdfs:///wmf/discovery/wdqs-reload-cookbook-test-T349069/ using stat1009.eqiad.wmnet) [production]
18:11 <ryankemper@cumin2002> START - Cookbook sre.wdqs.data-reload reloading wikidata_full on wdqs2023.codfw.wmnet from DumpsSource.HDFS (hdfs:///wmf/discovery/wdqs-reload-cookbook-test-T349069/ using stat1009.eqiad.wmnet) [production]
17:50 <amastilovic@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mw-page-content-change-enrich: apply [production]
17:50 <amastilovic@deploy1002> helmfile [eqiad] START helmfile.d/services/mw-page-content-change-enrich: apply [production]
17:47 <amastilovic@deploy1002> helmfile [codfw] DONE helmfile.d/services/mw-page-content-change-enrich: apply [production]
17:46 <amastilovic@deploy1002> helmfile [codfw] START helmfile.d/services/mw-page-content-change-enrich: apply [production]
17:43 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db1201 (T364069)', diff saved to https://phabricator.wikimedia.org/P64547 and previous config saved to /var/cache/conftool/dbconfig/20240610-174349-marostegui.json [production]
17:43 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1201.eqiad.wmnet with reason: Maintenance [production]
17:43 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1201.eqiad.wmnet with reason: Maintenance [production]
17:43 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1187 (T364069)', diff saved to https://phabricator.wikimedia.org/P64546 and previous config saved to /var/cache/conftool/dbconfig/20240610-174327-marostegui.json [production]
17:37 <otto@deploy1002> helmfile [staging] DONE helmfile.d/services/mw-page-content-change-enrich: apply [production]
17:36 <otto@deploy1002> helmfile [staging] START helmfile.d/services/mw-page-content-change-enrich: apply [production]
17:30 <dancy@deploy1002> Installation of scap version "4.87.0" completed for 285 hosts [production]
17:29 <amastilovic@deploy1002> helmfile [staging] DONE helmfile.d/services/mw-page-content-change-enrich: apply [production]
17:29 <amastilovic@deploy1002> helmfile [staging] START helmfile.d/services/mw-page-content-change-enrich: apply [production]
17:28 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1187', diff saved to https://phabricator.wikimedia.org/P64545 and previous config saved to /var/cache/conftool/dbconfig/20240610-172820-marostegui.json [production]
17:25 <dancy@deploy1002> Installing scap version "4.87.0" for 285 hosts [production]
17:13 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1187', diff saved to https://phabricator.wikimedia.org/P64544 and previous config saved to /var/cache/conftool/dbconfig/20240610-171313-marostegui.json [production]
17:01 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2200.codfw.wmnet with reason: Maintenance [production]
17:01 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2200.codfw.wmnet with reason: Maintenance [production]
16:58 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1187 (T364069)', diff saved to https://phabricator.wikimedia.org/P64543 and previous config saved to /var/cache/conftool/dbconfig/20240610-165806-marostegui.json [production]
16:26 <hnowlan@deploy1002> helmfile [eqiad] DONE helmfile.d/services/thumbor: apply [production]
16:21 <hnowlan@deploy1002> helmfile [eqiad] START helmfile.d/services/thumbor: apply [production]
16:20 <marostegui> Drop flaggedpage_pending from s1 T365568 [production]
16:05 <cdanis> 💙cdanis@cumin1002.eqiad.wmnet ~ 🕛☕ sudo cumin -b 8 '*.codfw.wmnet and C:geoip::data::puppet%fetch_ipinfo_dbs=true' 'sha512sum /usr/share/GeoIPInfo/GeoLite2-ASN.mmdb || run-puppet-agent' [production]
16:01 <cdanis> 💙cdanis@puppetserver2001.codfw.wmnet ~ 🕛☕ sudo systemctl restart sync-puppet-volatile [production]
16:00 <hnowlan@deploy1002> helmfile [codfw] DONE helmfile.d/services/thumbor: apply [production]
16:00 <eevans@cumin1002> END (PASS) - Cookbook sre.cassandra.roll-reboot (exit_code=0) rolling reboot on A:cassandra-dev [production]
15:54 <hnowlan@deploy1002> helmfile [codfw] START helmfile.d/services/thumbor: apply [production]
15:47 <marostegui> Drop flaggedpage_pending from s3 T365568 [production]
15:46 <marostegui> Drop flaggedpage_pending from s5 T365568 [production]
15:43 <marostegui> Drop flaggedpage_pending from s2 T365568 [production]
15:42 <hnowlan@deploy1002> helmfile [staging] DONE helmfile.d/services/thumbor: apply [production]
15:42 <hnowlan@deploy1002> helmfile [staging] START helmfile.d/services/thumbor: apply [production]
15:41 <godog> bounce benthos@mw_accesslog_metrics.service on centrallog hosts [production]
15:41 <marostegui> Drop flaggedpage_pending from s7 T365568 [production]
15:40 <marostegui> Drop flaggedpage_pending from s6 T365568 [production]
15:34 <ladsgroup@deploy1002> Synchronized portals: (no justification provided) (duration: 11m 20s) [production]
15:31 <eevans@cumin1002> START - Cookbook sre.cassandra.roll-reboot rolling reboot on A:cassandra-dev [production]
15:31 <swfrench@deploy1002> helmfile [eqiad] DONE helmfile.d/services/proton: apply [production]
15:29 <swfrench@deploy1002> helmfile [eqiad] START helmfile.d/services/proton: apply [production]
15:22 <ladsgroup@deploy1002> Synchronized portals/wikipedia.org/assets: (no justification provided) (duration: 10m 28s) [production]
15:07 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2024.codfw.wmnet [production]
15:07 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2024.codfw.wmnet [production]
15:05 <cdobbins@cumin1002> conftool action : set/pooled=yes; selector: name=4046.ulsfo.wmnet [production]
15:04 <ladsgroup@deploy1002> Finished scap: Backport for [[gerrit:1041091|errorpages: Add dark mode support]] (duration: 17m 15s) [production]
15:03 <cgoubert@deploy1002> helmfile [eqiad] DONE helmfile.d/services/shellbox-timeline: apply [production]
15:02 <cgoubert@deploy1002> helmfile [eqiad] START helmfile.d/services/shellbox-timeline: apply [production]
15:02 <cgoubert@deploy1002> helmfile [eqiad] DONE helmfile.d/services/shellbox-syntaxhighlight: apply [production]