2025-06-23
06:01 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1160', diff saved to https://phabricator.wikimedia.org/P78558 and previous config saved to /var/cache/conftool/dbconfig/20250623-060140-marostegui.json [production]
05:58 <marostegui@cumin1002> dbctl commit (dc=all): 'db2215 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P78557 and previous config saved to /var/cache/conftool/dbconfig/20250623-055840-root.json [production]
05:55 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 7 hosts with reason: T397597 [production]
05:48 <stevemunene@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/blunderbuss: apply [production]
05:47 <stevemunene@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/blunderbuss: apply [production]
05:47 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool db2215 T397419', diff saved to https://phabricator.wikimedia.org/P78556 and previous config saved to /var/cache/conftool/dbconfig/20250623-054725-marostegui.json [production]
05:46 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1160 (T396130)', diff saved to https://phabricator.wikimedia.org/P78555 and previous config saved to /var/cache/conftool/dbconfig/20250623-054633-marostegui.json [production]
05:46 <marostegui@cumin1002> dbctl commit (dc=all): 'Promote db2196 to x1 primary T397419', diff saved to https://phabricator.wikimedia.org/P78554 and previous config saved to /var/cache/conftool/dbconfig/20250623-054616-marostegui.json [production]
05:45 <marostegui> Starting x1 codfw failover from db2215 to db2196 - T397419 [production]
05:42 <marostegui@cumin1002> dbctl commit (dc=all): 'Set db2196 with weight 0 T397419', diff saved to https://phabricator.wikimedia.org/P78553 and previous config saved to /var/cache/conftool/dbconfig/20250623-054206-root.json [production]
05:41 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 15 hosts with reason: Primary switchover x1 T397419 [production]
05:38 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db1160 (T396130)', diff saved to https://phabricator.wikimedia.org/P78552 and previous config saved to /var/cache/conftool/dbconfig/20250623-053857-marostegui.json [production]
05:38 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1160.eqiad.wmnet with reason: Maintenance [production]
05:33 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1150.eqiad.wmnet with reason: Maintenance [production]
2025-06-20
21:14 <cjming@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic-next: apply [production]
21:14 <cjming@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic-next: apply [production]
19:19 <sukhe> sudo cumin -b11 'A:cp' "run-puppet-agent --enable 'merging CR 1160381'": T390924 [production]
19:19 <sukhe> sudo cumin -b11 'A:cp' "run-puppet-agent 'merging CR 1160381'": T390924 [production]
19:16 <sukhe> enabling puppet on cp4037 to merge CR 1160381: add `ismobile=1' for mobile requests: T390924 [production]
19:10 <sukhe> sudo cumin 'A:cp' "disable-puppet 'merging CR 1160381'": T390924 [production]
18:36 <bking@cumin2002> conftool action : set/weight=10:pooled=no; selector: name=cirrussearch2113\.codfw\.wmnet [production]
18:34 <aokoth@deploy1003> helmfile [aux-k8s-codfw] DONE helmfile.d/services/miscweb: apply [production]
18:33 <aokoth@deploy1003> helmfile [aux-k8s-codfw] START helmfile.d/services/miscweb: apply [production]
18:32 <aokoth@deploy1003> helmfile [aux-k8s-eqiad] DONE helmfile.d/services/miscweb: apply [production]
18:30 <aokoth@deploy1003> helmfile [aux-k8s-eqiad] START helmfile.d/services/miscweb: apply [production]
18:19 <aokoth@deploy1003> helmfile [aux-k8s-eqiad] DONE helmfile.d/services/miscweb: apply [production]
18:08 <aokoth@deploy1003> helmfile [aux-k8s-eqiad] START helmfile.d/services/miscweb: apply [production]
18:05 <aokoth@deploy1003> helmfile [aux-k8s-eqiad] DONE helmfile.d/services/miscweb: apply [production]
18:05 <aokoth@deploy1003> helmfile [aux-k8s-eqiad] START helmfile.d/services/miscweb: apply [production]
18:00 <aokoth@deploy1003> helmfile [aux-k8s-eqiad] DONE helmfile.d/services/miscweb: apply [production]
17:55 <gmodena@deploy1003> helmfile [eqiad] DONE helmfile.d/services/mw-page-content-change-enrich: apply [production]
17:55 <gmodena@deploy1003> helmfile [eqiad] START helmfile.d/services/mw-page-content-change-enrich: apply [production]
17:53 <gmodena@deploy1003> helmfile [codfw] DONE helmfile.d/services/mw-page-content-change-enrich: apply [production]
17:53 <gmodena@deploy1003> helmfile [codfw] START helmfile.d/services/mw-page-content-change-enrich: apply [production]
17:49 <gmodena@deploy1003> helmfile [staging] DONE helmfile.d/services/mw-page-content-change-enrich: apply [production]
17:49 <gmodena@deploy1003> helmfile [staging] START helmfile.d/services/mw-page-content-change-enrich: apply [production]
17:47 <aokoth@deploy1003> helmfile [aux-k8s-eqiad] START helmfile.d/services/miscweb: apply [production]
17:47 <fceratto@cumin1002> END (PASS) - Cookbook sre.mysql.pool (exit_code=0) es2045* slowly with 10 steps - Pooling in slowly [production]
17:35 <aokoth@deploy1003> helmfile [aux-k8s-eqiad] DONE helmfile.d/services/miscweb: apply [production]
17:25 <aokoth@deploy1003> helmfile [aux-k8s-eqiad] START helmfile.d/services/miscweb: apply [production]
17:21 <aokoth@deploy1003> helmfile [aux-k8s-eqiad] DONE helmfile.d/services/miscweb: apply [production]
17:21 <aokoth@deploy1003> helmfile [aux-k8s-eqiad] START helmfile.d/services/miscweb: apply [production]
17:06 <jclark@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host aux-k8s-worker1008.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
17:01 <jclark@cumin1002> START - Cookbook sre.hosts.provision for host aux-k8s-worker1008.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
16:02 <jclark@cumin1002> START - Cookbook sre.hosts.reimage for host aux-k8s-worker1008.eqiad.wmnet with OS bookworm [production]
16:01 <jclark@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host aux-k8s-worker1008.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
15:55 <jclark@cumin1002> START - Cookbook sre.hosts.provision for host aux-k8s-worker1008.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
15:54 <jclark@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host aux-k8s-worker1008.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
15:51 <dancy@deploy1003> Installation of scap version "4.181.0" completed for 2 hosts [production]
15:49 <dancy@deploy1003> Installing scap version "4.181.0" for 2 host(s) [production]