2901-2950 of 10000 results (40ms)
2024-12-09 §
21:29 <amastilovic@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/blunderbuss: apply [production]
21:28 <amastilovic@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/blunderbuss: apply [production]
21:27 <cjming@deploy2002> cjming, ebernhardson: Continuing with sync [production]
21:27 <amastilovic@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/blunderbuss: apply [production]
21:27 <cjming@deploy2002> cjming, ebernhardson: Backport for [[gerrit:1100158|cirrus: Enable mlr-2024 for select wikis (T377128)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
21:22 <cjming@deploy2002> Started scap sync-world: Backport for [[gerrit:1100158|cirrus: Enable mlr-2024 for select wikis (T377128)]] [production]
21:21 <cjming@deploy2002> Finished scap sync-world: Backport for [[gerrit:1101541|Actually load IRS in production (T374105)]] (duration: 12m 29s) [production]
21:14 <cjming@deploy2002> cjming, mszabo: Continuing with sync [production]
21:13 <cjming@deploy2002> cjming, mszabo: Backport for [[gerrit:1101541|Actually load IRS in production (T374105)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
21:08 <cjming@deploy2002> Started scap sync-world: Backport for [[gerrit:1101541|Actually load IRS in production (T374105)]] [production]
20:25 <aqu@deploy2002> Finished deploy [airflow-dags/analytics@1d9b4b5]: Canary events generation: pooling (duration: 01m 46s) [production]
20:23 <aqu@deploy2002> Started deploy [airflow-dags/analytics@1d9b4b5]: Canary events generation: pooling [production]
20:07 <eevans@cumin1002> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching aqs1010.eqiad.wmnet: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002 [production]
19:58 <eevans@cumin1002> START - Cookbook sre.cassandra.roll-restart for nodes matching aqs1010.eqiad.wmnet: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002 [production]
18:17 <gmodena@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mw-dump-rev-content-reconcile-enrich: apply [production]
18:17 <gmodena@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mw-dump-rev-content-reconcile-enrich: apply [production]
18:06 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wdqs1025.eqiad.wmnet with OS bullseye [production]
17:52 <hnowlan@deploy1003> helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: apply [production]
17:51 <hnowlan@deploy1003> helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: apply [production]
17:47 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wdqs1025.eqiad.wmnet with reason: host reimage [production]
17:44 <cdanis> 💙cdanis@cumin1002.eqiad.wmnet ~ 🕧☕ sudo cumin 'A:cp' 'enable-puppet "cdanis testing in production I464702d8fb T381771"' [production]
17:43 <bking@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on wdqs1025.eqiad.wmnet with reason: host reimage [production]
17:36 <amastilovic@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/blunderbuss: apply [production]
17:35 <amastilovic@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/blunderbuss: apply [production]
17:22 <jelto@cumin1002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker[1072,1074-1075].eqiad.wmnet [production]
17:22 <jelto@cumin1002> START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker[1072,1074-1075].eqiad.wmnet [production]
17:20 <jelto> homer 'lsw1-e3-eqiad*' commit 'T377876' [production]
17:18 <cdanis> T381771 💙cdanis@cp1107.eqiad.wmnet ~ 🕧☕ sudo run-puppet-agent --force [production]
17:16 <jelto@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wikikube-worker1073.eqiad.wmnet with OS bookworm [production]
17:15 <cdanis> 💙cdanis@cumin1002.eqiad.wmnet ~ 🕛☕ sudo cumin 'A:cp' 'disable-puppet "cdanis testing in production I464702d8fb T381771"' [production]
17:14 <bking@cumin2002> START - Cookbook sre.hosts.reimage for host wdqs1025.eqiad.wmnet with OS bullseye [production]
16:59 <amastilovic@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/blunderbuss: apply [production]
16:58 <amastilovic@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/blunderbuss: apply [production]
16:47 <hnowlan@deploy2002> Finished scap sync-world: Rebuild and deploy to pick up new php8.1 base (duration: 21m 09s) [production]
16:26 <hnowlan@deploy2002> Started scap sync-world: Rebuild and deploy to pick up new php8.1 base [production]
16:12 <moritzm> rebalance Ganeti cluster in codfw/B following server refresh T376594 [production]
16:07 <brouberol> kubectl uncordon dse-k8s-worker1005.eqiad.wmnet [analytics]
16:06 <jayme@cumin2002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker[2089-2090].codfw.wmnet [production]
16:06 <jayme@cumin2002> START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker[2089-2090].codfw.wmnet [production]
16:05 <hnowlan@deploy2002> Finished scap sync-world: Rebuild and deploy to pick up new php8.1 base (duration: 23m 00s) [production]
15:56 <jelto@cumin1002> START - Cookbook sre.hosts.reimage for host wikikube-worker1073.eqiad.wmnet with OS bookworm [production]
15:55 <jelto@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wikikube-worker1073.eqiad.wmnet with OS bookworm [production]
15:46 <brouberol> kubectl cordon dse-k8s-worker1005.eqiad.wmnet [analytics]
15:45 <jayme@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker2090.codfw.wmnet with OS bookworm [production]
15:44 <hnowlan@deploy2002> Started scap sync-world: Rebuild and deploy to pick up new php8.1 base [production]
15:43 <jayme@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker2089.codfw.wmnet with OS bookworm [production]
15:34 <Emperor> depool/restart swift/repool ms-fe1012 [production]
15:34 <mszabo@deploy2002> Finished scap sync-world: Backport for [[gerrit:1101069|dialog: Fix wrong title on Types of unacceptable behavior step (T381529)]], [[gerrit:1101070|dialog: Fix spacing between buttons in the dialog footer (T381530)]], [[gerrit:1100101|Prep IRS config for testwiki]] (duration: 13m 39s) [production]
15:33 <Emperor> depool/restart swift/repool ms-fe1010 [production]
15:28 <mszabo@deploy2002> mszabo: Continuing with sync [production]