2001-2050 of 10000 results (114ms)
2024-06-27 §
20:47 <vriley@cumin1002> START - Cookbook sre.dns.netbox [production]
20:44 <jhuneidi@deploy1002> kharlan, jhuneidi: Continuing with sync [production]
20:40 <otto@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mw-page-content-change-enrich: apply [production]
20:40 <otto@deploy1002> helmfile [eqiad] START helmfile.d/services/mw-page-content-change-enrich: apply [production]
20:39 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host cp5023.eqsin.wmnet with OS bullseye [production]
20:37 <jhuneidi@deploy1002> kharlan, jhuneidi: Backport for [[gerrit:1050460|testwiki: Enable QuickSurveys (T368459)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
20:37 <otto@deploy1002> helmfile [codfw] DONE helmfile.d/services/mw-page-content-change-enrich: apply [production]
20:37 <otto@deploy1002> helmfile [codfw] START helmfile.d/services/mw-page-content-change-enrich: apply [production]
20:35 <jhuneidi@deploy1002> Started scap: Backport for [[gerrit:1050460|testwiki: Enable QuickSurveys (T368459)]] [production]
20:34 <jhuneidi@deploy1002> Finished scap: Backport for [[gerrit:1050441|QuickSurveys: Add testing survey configuration (T368459)]] (duration: 14m 45s) [production]
20:29 <jhuneidi@deploy1002> kharlan, jhuneidi: Continuing with sync [production]
20:24 <otto@deploy1002> helmfile [codfw] DONE helmfile.d/services/mw-page-content-change-enrich: apply [production]
20:24 <otto@deploy1002> helmfile [codfw] START helmfile.d/services/mw-page-content-change-enrich: apply [production]
20:22 <jhuneidi@deploy1002> kharlan, jhuneidi: Backport for [[gerrit:1050441|QuickSurveys: Add testing survey configuration (T368459)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
20:21 <otto@deploy1002> helmfile [staging] DONE helmfile.d/services/mw-page-content-change-enrich: apply [production]
20:21 <otto@deploy1002> helmfile [staging] START helmfile.d/services/mw-page-content-change-enrich: apply [production]
20:20 <jhuneidi@deploy1002> Started scap: Backport for [[gerrit:1050441|QuickSurveys: Add testing survey configuration (T368459)]] [production]
20:17 <jhuneidi@deploy1002> Finished scap: Backport for [[gerrit:1050432|Enable DiscussionTools permalinks on enwiki (T365974)]] (duration: 11m 09s) [production]
20:16 <brett@puppetmaster1001> conftool action : set/pooled=no; selector: name=cp5023.eqsin.wmnet [production]
20:11 <jhuneidi@deploy1002> jhuneidi, kemayo: Continuing with sync [production]
20:08 <jhuneidi@deploy1002> jhuneidi, kemayo: Backport for [[gerrit:1050432|Enable DiscussionTools permalinks on enwiki (T365974)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
20:06 <jhuneidi@deploy1002> Started scap: Backport for [[gerrit:1050432|Enable DiscussionTools permalinks on enwiki (T365974)]] [production]
20:03 <bking@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mw-page-content-change-enrich: apply [production]
20:03 <bking@deploy1002> helmfile [eqiad] START helmfile.d/services/mw-page-content-change-enrich: apply [production]
19:55 <brett@puppetmaster1001> conftool action : set/pooled=yes; selector: name=cp5022.eqsin.wmnet [production]
19:53 <ottomata> deleted mw-page-content-change-enrich stuck jobmanager pod: kubectl -n mw-page-content-change-enrich delete pod flink-app-main-859d98c57b-zrgwk - T368667 [production]
19:51 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1059.eqiad.wmnet with OS bookworm [production]
19:48 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5022.eqsin.wmnet with OS bullseye [production]
19:33 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1064.eqiad.wmnet with OS bookworm [production]
19:27 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1059.eqiad.wmnet with reason: host reimage [production]
19:23 <andrew@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt1059.eqiad.wmnet with reason: host reimage [production]
19:14 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5022.eqsin.wmnet with reason: host reimage [production]
19:10 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1064.eqiad.wmnet with reason: host reimage [production]
19:09 <brett@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cp5022.eqsin.wmnet with reason: host reimage [production]
19:07 <andrew@cumin1002> START - Cookbook sre.hosts.reimage for host cloudvirt1059.eqiad.wmnet with OS bookworm [production]
19:07 <andrew@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt1064.eqiad.wmnet with reason: host reimage [production]
19:00 <bking@deploy1002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow: apply [production]
19:00 <bking@deploy1002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow: apply [production]
19:00 <bking@deploy1002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow: apply [production]
18:52 <andrew@cumin1002> START - Cookbook sre.hosts.reimage for host cloudvirt1064.eqiad.wmnet with OS bookworm [production]
18:36 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host cp5022.eqsin.wmnet with OS bullseye [production]
18:36 <brett@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp5022.eqsin.wmnet with OS bullseye [production]
18:19 <bking@deploy1002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow: apply [production]
18:19 <bking@deploy1002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow: apply [production]
18:19 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host cp5022.eqsin.wmnet with OS bullseye [production]
18:19 <eevans@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on aqs1013.eqiad.wmnet with reason: Server swap — T362033 [production]
18:18 <eevans@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on aqs1013.eqiad.wmnet with reason: Server swap — T362033 [production]
18:15 <bking@deploy1002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow: apply [production]
18:14 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1067.eqiad.wmnet with OS bookworm [production]
18:12 <bking@deploy1002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow: apply [production]