5501-5550 of 10000 results (114ms)
2024-06-27 §
20:21 <otto@deploy1002> helmfile [staging] START helmfile.d/services/mw-page-content-change-enrich: apply [production]
20:20 <jhuneidi@deploy1002> Started scap: Backport for [[gerrit:1050441|QuickSurveys: Add testing survey configuration (T368459)]] [production]
20:17 <jhuneidi@deploy1002> Finished scap: Backport for [[gerrit:1050432|Enable DiscussionTools permalinks on enwiki (T365974)]] (duration: 11m 09s) [production]
20:16 <brett@puppetmaster1001> conftool action : set/pooled=no; selector: name=cp5023.eqsin.wmnet [production]
20:11 <jhuneidi@deploy1002> jhuneidi, kemayo: Continuing with sync [production]
20:08 <jhuneidi@deploy1002> jhuneidi, kemayo: Backport for [[gerrit:1050432|Enable DiscussionTools permalinks on enwiki (T365974)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
20:06 <jhuneidi@deploy1002> Started scap: Backport for [[gerrit:1050432|Enable DiscussionTools permalinks on enwiki (T365974)]] [production]
20:03 <bking@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mw-page-content-change-enrich: apply [production]
20:03 <bking@deploy1002> helmfile [eqiad] START helmfile.d/services/mw-page-content-change-enrich: apply [production]
19:55 <brett@puppetmaster1001> conftool action : set/pooled=yes; selector: name=cp5022.eqsin.wmnet [production]
19:53 <ottomata> deleted mw-page-content-change-enrich stuck jobmanager pod: kubectl -n mw-page-content-change-enrich delete pod flink-app-main-859d98c57b-zrgwk - T368667 [production]
19:51 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1059.eqiad.wmnet with OS bookworm [production]
19:48 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5022.eqsin.wmnet with OS bullseye [production]
19:33 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1064.eqiad.wmnet with OS bookworm [production]
19:27 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1059.eqiad.wmnet with reason: host reimage [production]
19:23 <andrew@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt1059.eqiad.wmnet with reason: host reimage [production]
19:14 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5022.eqsin.wmnet with reason: host reimage [production]
19:10 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1064.eqiad.wmnet with reason: host reimage [production]
19:09 <brett@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cp5022.eqsin.wmnet with reason: host reimage [production]
19:07 <andrew@cumin1002> START - Cookbook sre.hosts.reimage for host cloudvirt1059.eqiad.wmnet with OS bookworm [production]
19:07 <andrew@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt1064.eqiad.wmnet with reason: host reimage [production]
19:00 <bking@deploy1002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow: apply [production]
19:00 <bking@deploy1002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow: apply [production]
19:00 <bking@deploy1002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow: apply [production]
18:52 <andrew@cumin1002> START - Cookbook sre.hosts.reimage for host cloudvirt1064.eqiad.wmnet with OS bookworm [production]
18:36 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host cp5022.eqsin.wmnet with OS bullseye [production]
18:36 <brett@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp5022.eqsin.wmnet with OS bullseye [production]
18:19 <bking@deploy1002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow: apply [production]
18:19 <bking@deploy1002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow: apply [production]
18:19 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host cp5022.eqsin.wmnet with OS bullseye [production]
18:19 <eevans@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on aqs1013.eqiad.wmnet with reason: Server swap — T362033 [production]
18:18 <eevans@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on aqs1013.eqiad.wmnet with reason: Server swap — T362033 [production]
18:15 <bking@deploy1002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow: apply [production]
18:14 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1067.eqiad.wmnet with OS bookworm [production]
18:12 <bking@deploy1002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow: apply [production]
18:12 <bking@deploy1002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow: apply [production]
18:12 <jhuneidi@deploy1002> rebuilt and synchronized wikiversions files: group2 wikis to 1.43.0-wmf.11 refs T366956 [production]
18:12 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1065.eqiad.wmnet with OS bookworm [production]
18:11 <bking@deploy1002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow: apply [production]
18:11 <bking@deploy1002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow: apply [production]
18:10 <brett@puppetmaster1001> conftool action : set/pooled=no; selector: name=cp5022.eqsin.wmnet [production]
18:08 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1066.eqiad.wmnet with OS bookworm [production]
18:08 <ejegg> fundraising civicrm upgraded from 13a13f3a to 43fc2c89 [production]
18:04 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1058.eqiad.wmnet with OS bookworm [production]
17:59 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1057.eqiad.wmnet with OS bookworm [production]
17:51 <brett@puppetmaster1001> conftool action : set/pooled=yes; selector: name=cp5021.eqsin.wmnet [production]
17:50 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1067.eqiad.wmnet with reason: host reimage [production]
17:47 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1065.eqiad.wmnet with reason: host reimage [production]
17:45 <swfrench@deploy1002> Finished scap: Deploying securityContext changes for T362978 to main release (duration: 04m 09s) [production]
17:43 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1066.eqiad.wmnet with reason: host reimage [production]