2024-06-27
§
|
20:20 |
<jhuneidi@deploy1002> |
Started scap: Backport for [[gerrit:1050441|QuickSurveys: Add testing survey configuration (T368459)]] |
[production] |
20:17 |
<jhuneidi@deploy1002> |
Finished scap: Backport for [[gerrit:1050432|Enable DiscussionTools permalinks on enwiki (T365974)]] (duration: 11m 09s) |
[production] |
20:16 |
<brett@puppetmaster1001> |
conftool action : set/pooled=no; selector: name=cp5023.eqsin.wmnet |
[production] |
20:11 |
<jhuneidi@deploy1002> |
jhuneidi, kemayo: Continuing with sync |
[production] |
20:08 |
<jhuneidi@deploy1002> |
jhuneidi, kemayo: Backport for [[gerrit:1050432|Enable DiscussionTools permalinks on enwiki (T365974)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
20:06 |
<jhuneidi@deploy1002> |
Started scap: Backport for [[gerrit:1050432|Enable DiscussionTools permalinks on enwiki (T365974)]] |
[production] |
20:03 |
<bking@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mw-page-content-change-enrich: apply |
[production] |
20:03 |
<bking@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mw-page-content-change-enrich: apply |
[production] |
19:55 |
<brett@puppetmaster1001> |
conftool action : set/pooled=yes; selector: name=cp5022.eqsin.wmnet |
[production] |
19:53 |
<ottomata> |
deleted mw-page-content-change-enrich stuck jobmanager pod: kubectl -n mw-page-content-change-enrich delete pod flink-app-main-859d98c57b-zrgwk - T368667 |
[production] |
19:51 |
<andrew@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1059.eqiad.wmnet with OS bookworm |
[production] |
19:48 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5022.eqsin.wmnet with OS bullseye |
[production] |
19:33 |
<andrew@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1064.eqiad.wmnet with OS bookworm |
[production] |
19:27 |
<andrew@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1059.eqiad.wmnet with reason: host reimage |
[production] |
19:23 |
<andrew@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt1059.eqiad.wmnet with reason: host reimage |
[production] |
19:14 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5022.eqsin.wmnet with reason: host reimage |
[production] |
19:10 |
<andrew@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1064.eqiad.wmnet with reason: host reimage |
[production] |
19:09 |
<brett@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cp5022.eqsin.wmnet with reason: host reimage |
[production] |
19:07 |
<andrew@cumin1002> |
START - Cookbook sre.hosts.reimage for host cloudvirt1059.eqiad.wmnet with OS bookworm |
[production] |
19:07 |
<andrew@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt1064.eqiad.wmnet with reason: host reimage |
[production] |
19:00 |
<bking@deploy1002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow: apply |
[production] |
19:00 |
<bking@deploy1002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow: apply |
[production] |
19:00 |
<bking@deploy1002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow: apply |
[production] |
18:52 |
<andrew@cumin1002> |
START - Cookbook sre.hosts.reimage for host cloudvirt1064.eqiad.wmnet with OS bookworm |
[production] |
18:36 |
<brett@cumin2002> |
START - Cookbook sre.hosts.reimage for host cp5022.eqsin.wmnet with OS bullseye |
[production] |
18:36 |
<brett@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp5022.eqsin.wmnet with OS bullseye |
[production] |
18:19 |
<bking@deploy1002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow: apply |
[production] |
18:19 |
<bking@deploy1002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow: apply |
[production] |
18:19 |
<brett@cumin2002> |
START - Cookbook sre.hosts.reimage for host cp5022.eqsin.wmnet with OS bullseye |
[production] |
18:19 |
<eevans@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on aqs1013.eqiad.wmnet with reason: Server swap — T362033 |
[production] |
18:18 |
<eevans@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on aqs1013.eqiad.wmnet with reason: Server swap — T362033 |
[production] |
18:15 |
<bking@deploy1002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow: apply |
[production] |
18:14 |
<andrew@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1067.eqiad.wmnet with OS bookworm |
[production] |
18:12 |
<bking@deploy1002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow: apply |
[production] |
18:12 |
<bking@deploy1002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow: apply |
[production] |
18:12 |
<jhuneidi@deploy1002> |
rebuilt and synchronized wikiversions files: group2 wikis to 1.43.0-wmf.11 refs T366956 |
[production] |
18:12 |
<andrew@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1065.eqiad.wmnet with OS bookworm |
[production] |
18:11 |
<bking@deploy1002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow: apply |
[production] |
18:11 |
<bking@deploy1002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow: apply |
[production] |
18:10 |
<brett@puppetmaster1001> |
conftool action : set/pooled=no; selector: name=cp5022.eqsin.wmnet |
[production] |
18:08 |
<andrew@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1066.eqiad.wmnet with OS bookworm |
[production] |
18:08 |
<ejegg> |
fundraising civicrm upgraded from 13a13f3a to 43fc2c89 |
[production] |
18:04 |
<andrew@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1058.eqiad.wmnet with OS bookworm |
[production] |
17:59 |
<andrew@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1057.eqiad.wmnet with OS bookworm |
[production] |
17:51 |
<brett@puppetmaster1001> |
conftool action : set/pooled=yes; selector: name=cp5021.eqsin.wmnet |
[production] |
17:50 |
<andrew@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1067.eqiad.wmnet with reason: host reimage |
[production] |
17:47 |
<andrew@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1065.eqiad.wmnet with reason: host reimage |
[production] |
17:45 |
<swfrench@deploy1002> |
Finished scap: Deploying securityContext changes for T362978 to main release (duration: 04m 09s) |
[production] |
17:43 |
<andrew@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1066.eqiad.wmnet with reason: host reimage |
[production] |
17:41 |
<swfrench@deploy1002> |
Started scap: Deploying securityContext changes for T362978 to main release |
[production] |