production SAL

4401-4450 of 10000 results (113ms)

2024-06-10 §
19:02	<ryankemper@cumin2002>	START - Cookbook sre.wdqs.data-reload reloading wikidata_full on wdqs2023.codfw.wmnet from DumpsSource.HDFS (hdfs:///wmf/discovery/wdqs-reload-cookbook-test-T349069/ using stat1009.eqiad.wmnet)	[production]
19:02	<ryankemper@cumin2002>	END (FAIL) - Cookbook sre.wdqs.data-reload (exit_code=99) reloading wikidata_full on wdqs2023.codfw.wmnet from DumpsSource.HDFS (hdfs:///wmf/discovery/wdqs-reload-cookbook-test-T349069/ using stat1009.eqiad.wmnet)	[production]
18:11	<ryankemper@cumin2002>	START - Cookbook sre.wdqs.data-reload reloading wikidata_full on wdqs2023.codfw.wmnet from DumpsSource.HDFS (hdfs:///wmf/discovery/wdqs-reload-cookbook-test-T349069/ using stat1009.eqiad.wmnet)	[production]
17:50	<amastilovic@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mw-page-content-change-enrich: apply	[production]
17:50	<amastilovic@deploy1002>	helmfile [eqiad] START helmfile.d/services/mw-page-content-change-enrich: apply	[production]
17:47	<amastilovic@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mw-page-content-change-enrich: apply	[production]
17:46	<amastilovic@deploy1002>	helmfile [codfw] START helmfile.d/services/mw-page-content-change-enrich: apply	[production]
17:43	<marostegui@cumin1002>	dbctl commit (dc=all): 'Depooling db1201 (T364069)', diff saved to https://phabricator.wikimedia.org/P64547 and previous config saved to /var/cache/conftool/dbconfig/20240610-174349-marostegui.json	[production]
17:43	<marostegui@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1201.eqiad.wmnet with reason: Maintenance	[production]
17:43	<marostegui@cumin1002>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1201.eqiad.wmnet with reason: Maintenance	[production]
17:43	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1187 (T364069)', diff saved to https://phabricator.wikimedia.org/P64546 and previous config saved to /var/cache/conftool/dbconfig/20240610-174327-marostegui.json	[production]
17:37	<otto@deploy1002>	helmfile [staging] DONE helmfile.d/services/mw-page-content-change-enrich: apply	[production]
17:36	<otto@deploy1002>	helmfile [staging] START helmfile.d/services/mw-page-content-change-enrich: apply	[production]
17:30	<dancy@deploy1002>	Installation of scap version "4.87.0" completed for 285 hosts	[production]
17:29	<amastilovic@deploy1002>	helmfile [staging] DONE helmfile.d/services/mw-page-content-change-enrich: apply	[production]
17:29	<amastilovic@deploy1002>	helmfile [staging] START helmfile.d/services/mw-page-content-change-enrich: apply	[production]
17:28	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1187', diff saved to https://phabricator.wikimedia.org/P64545 and previous config saved to /var/cache/conftool/dbconfig/20240610-172820-marostegui.json	[production]
17:25	<dancy@deploy1002>	Installing scap version "4.87.0" for 285 hosts	[production]
17:13	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1187', diff saved to https://phabricator.wikimedia.org/P64544 and previous config saved to /var/cache/conftool/dbconfig/20240610-171313-marostegui.json	[production]
17:01	<ladsgroup@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2200.codfw.wmnet with reason: Maintenance	[production]
17:01	<ladsgroup@cumin1002>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2200.codfw.wmnet with reason: Maintenance	[production]
16:58	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1187 (T364069)', diff saved to https://phabricator.wikimedia.org/P64543 and previous config saved to /var/cache/conftool/dbconfig/20240610-165806-marostegui.json	[production]
16:26	<hnowlan@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/thumbor: apply	[production]
16:21	<hnowlan@deploy1002>	helmfile [eqiad] START helmfile.d/services/thumbor: apply	[production]
16:20	<marostegui>	Drop flaggedpage_pending from s1 T365568	[production]
16:05	<cdanis>	💙cdanis@cumin1002.eqiad.wmnet ~ 🕛☕ sudo cumin -b 8 '*.codfw.wmnet and C:geoip::data::puppet%fetch_ipinfo_dbs=true' 'sha512sum /usr/share/GeoIPInfo/GeoLite2-ASN.mmdb \|\| run-puppet-agent'	[production]
16:01	<cdanis>	💙cdanis@puppetserver2001.codfw.wmnet ~ 🕛☕ sudo systemctl restart sync-puppet-volatile	[production]
16:00	<hnowlan@deploy1002>	helmfile [codfw] DONE helmfile.d/services/thumbor: apply	[production]
16:00	<eevans@cumin1002>	END (PASS) - Cookbook sre.cassandra.roll-reboot (exit_code=0) rolling reboot on A:cassandra-dev	[production]
15:54	<hnowlan@deploy1002>	helmfile [codfw] START helmfile.d/services/thumbor: apply	[production]
15:47	<marostegui>	Drop flaggedpage_pending from s3 T365568	[production]
15:46	<marostegui>	Drop flaggedpage_pending from s5 T365568	[production]
15:43	<marostegui>	Drop flaggedpage_pending from s2 T365568	[production]
15:42	<hnowlan@deploy1002>	helmfile [staging] DONE helmfile.d/services/thumbor: apply	[production]
15:42	<hnowlan@deploy1002>	helmfile [staging] START helmfile.d/services/thumbor: apply	[production]
15:41	<godog>	bounce benthos@mw_accesslog_metrics.service on centrallog hosts	[production]
15:41	<marostegui>	Drop flaggedpage_pending from s7 T365568	[production]
15:40	<marostegui>	Drop flaggedpage_pending from s6 T365568	[production]
15:34	<ladsgroup@deploy1002>	Synchronized portals: (no justification provided) (duration: 11m 20s)	[production]
15:31	<eevans@cumin1002>	START - Cookbook sre.cassandra.roll-reboot rolling reboot on A:cassandra-dev	[production]
15:31	<swfrench@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/proton: apply	[production]
15:29	<swfrench@deploy1002>	helmfile [eqiad] START helmfile.d/services/proton: apply	[production]
15:22	<ladsgroup@deploy1002>	Synchronized portals/wikipedia.org/assets: (no justification provided) (duration: 10m 28s)	[production]
15:07	<jmm@cumin2002>	END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2024.codfw.wmnet	[production]
15:07	<jmm@cumin2002>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2024.codfw.wmnet	[production]
15:05	<cdobbins@cumin1002>	conftool action : set/pooled=yes; selector: name=4046.ulsfo.wmnet	[production]
15:04	<ladsgroup@deploy1002>	Finished scap: Backport for [[gerrit:1041091\|errorpages: Add dark mode support]] (duration: 17m 15s)	[production]
15:03	<cgoubert@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/shellbox-timeline: apply	[production]
15:02	<cgoubert@deploy1002>	helmfile [eqiad] START helmfile.d/services/shellbox-timeline: apply	[production]
15:02	<cgoubert@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/shellbox-syntaxhighlight: apply	[production]