2025-06-26
|
23:46 |
<dwisehaupt@dns1004> |
END - running authdns-update |
[production] |
23:45 |
<dwisehaupt@dns1004> |
START - running authdns-update |
[production] |
23:33 |
<urandom> |
bootstrapping Cassandra/sessionstore2006-a — T390514 |
[production] |
23:28 |
<eevans@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host sessionstore2006.codfw.wmnet |
[production] |
23:22 |
<eevans@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host sessionstore2006.codfw.wmnet |
[production] |
23:14 |
<eevans@cumin1003> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host sessionstore2006.codfw.wmnet with OS bullseye |
[production] |
22:52 |
<eevans@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on sessionstore2006.codfw.wmnet with reason: host reimage |
[production] |
22:49 |
<eevans@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on sessionstore2006.codfw.wmnet with reason: host reimage |
[production] |
22:39 |
<joal@deploy1003> |
Finished deploy [airflow-dags/analytics_test@6d2c335]: HOTFIX - Deploy artifacts for airflow-dags/analytics_test (duration: 00m 21s) |
[production] |
22:39 |
<joal@deploy1003> |
Started deploy [airflow-dags/analytics_test@6d2c335]: HOTFIX - Deploy artifacts for airflow-dags/analytics_test |
[production] |
22:32 |
<eevans@cumin1003> |
START - Cookbook sre.hosts.reimage for host sessionstore2006.codfw.wmnet with OS bullseye |
[production] |
22:28 |
<eevans@cumin1003> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sessionstore2006.codfw.wmnet with OS bullseye |
[production] |
22:08 |
<joal@deploy1003> |
Finished deploy [airflow-dags/analytics@1e85992]: HOTFIX - Deploy artifacts for airflow-dags/analytics (duration: 00m 37s) |
[production] |
22:07 |
<joal@deploy1003> |
Started deploy [airflow-dags/analytics@1e85992]: HOTFIX - Deploy artifacts for airflow-dags/analytics |
[production] |
22:07 |
<eevans@cumin1003> |
START - Cookbook sre.hosts.reimage for host sessionstore2006.codfw.wmnet with OS bullseye |
[production] |
22:06 |
<eevans@cumin1003> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sessionstore2006.codfw.wmnet with OS bullseye |
[production] |
22:05 |
<joal@deploy1003> |
Finished deploy [airflow-dags/analytics_test@1e85992]: HOTFIX - Deploy artifacts for airflow-dags/analytics_test (duration: 01m 21s) |
[production] |
22:03 |
<joal@deploy1003> |
Started deploy [airflow-dags/analytics_test@1e85992]: HOTFIX - Deploy artifacts for airflow-dags/analytics_test |
[production] |
21:52 |
<eevans@cumin1003> |
START - Cookbook sre.hosts.reimage for host sessionstore2006.codfw.wmnet with OS bullseye |
[production] |
21:52 |
<eevans@cumin1003> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sessionstore2006.codfw.wmnet with OS bullseye |
[production] |
21:45 |
<sbassett> |
Deployed security mitigations for T389010 and T395468 (sync-world) |
[production] |
21:29 |
<eevans@cumin1003> |
START - Cookbook sre.hosts.reimage for host sessionstore2006.codfw.wmnet with OS bullseye |
[production] |
21:29 |
<eevans@cumin1003> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sessionstore2006.codfw.wmnet with OS bullseye |
[production] |
21:21 |
<jhuneidi@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1162967|Release the CampaignEvents extension to all Wikipedias (T396784)]] (duration: 30m 43s) |
[production] |
21:15 |
<jhuneidi@deploy1003> |
cmelo, jhuneidi: Continuing with sync |
[production] |
21:15 |
<eevans@cumin1003> |
START - Cookbook sre.hosts.reimage for host sessionstore2006.codfw.wmnet with OS bullseye |
[production] |
21:12 |
<jhathaway@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host sretest1003.eqiad.wmnet with OS bookworm |
[production] |
21:08 |
<urandom> |
decommissioning Cassandra/sessionstore2006-a — T390514 |
[production] |
20:56 |
<jhathaway@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on sretest1003.eqiad.wmnet with reason: host reimage |
[production] |
20:53 |
<jhathaway@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on sretest1003.eqiad.wmnet with reason: host reimage |
[production] |
20:52 |
<jhuneidi@deploy1003> |
cmelo, jhuneidi: Backport for [[gerrit:1162967|Release the CampaignEvents extension to all Wikipedias (T396784)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
20:50 |
<jhuneidi@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1162967|Release the CampaignEvents extension to all Wikipedias (T396784)]] |
[production] |
20:36 |
<jhathaway@cumin2002> |
START - Cookbook sre.hosts.reimage for host sretest1003.eqiad.wmnet with OS bookworm |
[production] |
20:35 |
<jhathaway@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest1003.eqiad.wmnet with OS bookworm |
[production] |
20:26 |
<jhathaway@cumin2002> |
START - Cookbook sre.hosts.reimage for host sretest1003.eqiad.wmnet with OS bookworm |
[production] |
19:18 |
<joal@deploy1003> |
Finished deploy [airflow-dags/analytics@c3ba96d]: Deploy artifacts for airflow-dags/main (duration: 00m 41s) |
[production] |
19:18 |
<joal@deploy1003> |
Started deploy [airflow-dags/analytics@c3ba96d]: Deploy artifacts for airflow-dags/main |
[production] |
19:07 |
<jhuneidi@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1163704|Revert^2 "Activate feature to resolve wikibase link labels in pilot wiki changelists" (T388685)]] (duration: 15m 12s) |
[production] |
19:01 |
<jhuneidi@deploy1003> |
joelyrookewmde, jhuneidi: Continuing with sync |
[production] |
18:54 |
<jasmine@cumin1002> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts wikikube-worker[1032-1033].eqiad.wmnet |
[production] |
18:54 |
<jasmine@cumin1002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
18:54 |
<jasmine@cumin1002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: wikikube-worker[1032-1033].eqiad.wmnet decommissioned, removing all IPs except the asset tag one - jasmine@cumin1002" |
[production] |
18:54 |
<jhuneidi@deploy1003> |
joelyrookewmde, jhuneidi: Backport for [[gerrit:1163704|Revert^2 "Activate feature to resolve wikibase link labels in pilot wiki changelists" (T388685)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
18:53 |
<jasmine@cumin1002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: wikikube-worker[1032-1033].eqiad.wmnet decommissioned, removing all IPs except the asset tag one - jasmine@cumin1002" |
[production] |
18:52 |
<jhuneidi@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1163704|Revert^2 "Activate feature to resolve wikibase link labels in pilot wiki changelists" (T388685)]] |
[production] |
18:37 |
<jasmine@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
18:36 |
<jhuneidi@deploy1003> |
rebuilt and synchronized wikiversions files: group2 to 1.45.0-wmf.7 refs T392177 |
[production] |
18:35 |
<mvernon@cumin1002> |
END (PASS) - Cookbook sre.swift.check-dbs (exit_code=0) Checking container DBs of wikipedia-commons-local-thumb.ff |
[production] |
18:27 |
<jasmine@cumin1002> |
START - Cookbook sre.hosts.decommission for hosts wikikube-worker[1032-1033].eqiad.wmnet |
[production] |
18:26 |
<andrew@cumin1003> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts cloudcephosd2002-dev.codfw.wmnet |
[production] |