251-300 of 10000 results (113ms)
2026-05-14 ยง
12:54 <vriley@cumin1003> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host db1280 [production]
12:53 <vriley@cumin1003> START - Cookbook sre.network.configure-switch-interfaces for host db1280 [production]
12:53 <vriley@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
12:53 <vriley@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update mgmt [db1280] - vriley@cumin1003" [production]
12:53 <vriley@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update mgmt [db1280] - vriley@cumin1003" [production]
12:50 <vriley@cumin1003> START - Cookbook sre.hosts.provision for host db1279.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
12:50 <fceratto@cumin1003> dbctl commit (dc=all): 'Set db2161 with weight 0 T426291', diff saved to https://phabricator.wikimedia.org/P92527 and previous config saved to /var/cache/conftool/dbconfig/20260514-125014-fceratto.json [production]
12:49 <vriley@cumin1003> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host db1279 [production]
12:49 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 27 hosts with reason: Primary switchover s8 T426291 [production]
12:49 <vriley@cumin1003> START - Cookbook sre.dns.netbox [production]
12:47 <vriley@cumin1003> START - Cookbook sre.network.configure-switch-interfaces for host db1279 [production]
12:47 <vriley@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
12:47 <vriley@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update mgmt [db1279] - vriley@cumin1003" [production]
12:47 <vriley@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update mgmt [db1279] - vriley@cumin1003" [production]
12:47 <kartik@deploy1003> helmfile [staging] DONE helmfile.d/services/cxserver: apply [production]
12:46 <kartik@deploy1003> helmfile [staging] START helmfile.d/services/cxserver: apply [production]
12:42 <vriley@cumin1003> START - Cookbook sre.dns.netbox [production]
12:42 <cmooney@cumin1003> END (PASS) - Cookbook sre.deploy.python-code (exit_code=0) homer to cumin2002.codfw.wmnet,cumin1003.eqiad.wmnet with reason: update bgp groups for dse-k8s-wdqs - cmooney@cumin1003 [production]
12:40 <cmooney@cumin1003> START - Cookbook sre.deploy.python-code homer to cumin2002.codfw.wmnet,cumin1003.eqiad.wmnet with reason: update bgp groups for dse-k8s-wdqs - cmooney@cumin1003 [production]
12:31 <cmooney@cumin1003> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'configure' for AS: 28458 [production]
12:27 <cmooney@cumin1003> START - Cookbook sre.network.peering with action 'configure' for AS: 28458 [production]
12:27 <marostegui@cumin1003> dbctl commit (dc=all): 'Repool pc3 with pc2023 as codfw master T418973', diff saved to https://phabricator.wikimedia.org/P92526 and previous config saved to /var/cache/conftool/dbconfig/20260514-122707-marostegui.json [production]
12:21 <jiji@deploy1003> helmfile [codfw] DONE helmfile.d/services/rest-gateway: apply [production]
12:21 <jiji@deploy1003> helmfile [codfw] START helmfile.d/services/rest-gateway: apply [production]
12:19 <marostegui@cumin1003> dbctl commit (dc=all): 'Add pc2023 to pc3 codfw master T418973', diff saved to https://phabricator.wikimedia.org/P92525 and previous config saved to /var/cache/conftool/dbconfig/20260514-121958-marostegui.json [production]
12:18 <marostegui@cumin1003> dbctl commit (dc=all): 'Add pc2023 to pc3 T418973', diff saved to https://phabricator.wikimedia.org/P92524 and previous config saved to /var/cache/conftool/dbconfig/20260514-121839-marostegui.json [production]
11:31 <jiji@deploy1003> helmfile [codfw] DONE helmfile.d/services/mw-mcrouter: apply [production]
11:31 <jiji@deploy1003> helmfile [codfw] START helmfile.d/services/mw-mcrouter: apply [production]
11:08 <jiji@deploy1003> helmfile [eqiad] DONE helmfile.d/services/mw-mcrouter: apply [production]
11:08 <jiji@deploy1003> helmfile [eqiad] START helmfile.d/services/mw-mcrouter: apply [production]
11:02 <btullis@cumin1003> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host dse-k8s-wdqs-test1001.eqiad.wmnet with OS bookworm [production]
11:01 <jiji@deploy1003> helmfile [aux-k8s-codfw] DONE helmfile.d/aux-k8s-services/redioscope: sync [production]
11:00 <jiji@deploy1003> helmfile [aux-k8s-codfw] START helmfile.d/aux-k8s-services/redioscope: sync [production]
11:00 <jiji@deploy1003> helmfile [eqiad] DONE helmfile.d/services/api-gateway: apply [production]
11:00 <jiji@deploy1003> helmfile [eqiad] START helmfile.d/services/api-gateway: apply [production]
10:53 <jiji@deploy1003> helmfile [aux-k8s-codfw] DONE helmfile.d/aux-k8s-services/redioscope: apply [production]
10:53 <jiji@deploy1003> helmfile [aux-k8s-codfw] START helmfile.d/aux-k8s-services/redioscope: apply [production]
10:53 <jiji@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mc1063.eqiad.wmnet with OS bullseye [production]
10:49 <jiji@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mc1069.eqiad.wmnet with OS bullseye [production]
10:45 <marostegui@cumin1003> dbctl commit (dc=all): 'Remove db2152 from dbctl T424344', diff saved to https://phabricator.wikimedia.org/P92523 and previous config saved to /var/cache/conftool/dbconfig/20260514-104521-marostegui.json [production]
10:41 <jiji@deploy1003> helmfile [aux-k8s-codfw] DONE helmfile.d/admin 'sync'. [production]
10:40 <jiji@deploy1003> helmfile [aux-k8s-codfw] START helmfile.d/admin 'sync'. [production]
10:38 <jiji@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc1063.eqiad.wmnet with reason: host reimage [production]
10:34 <jiji@deploy1003> helmfile [aux-k8s-codfw] DONE helmfile.d/aux-k8s-services/redioscope: apply [production]
10:34 <jiji@deploy1003> helmfile [aux-k8s-codfw] START helmfile.d/aux-k8s-services/redioscope: apply [production]
10:34 <jiji@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc1069.eqiad.wmnet with reason: host reimage [production]
10:27 <jiji@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on mc1063.eqiad.wmnet with reason: host reimage [production]
10:27 <jiji@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on mc1069.eqiad.wmnet with reason: host reimage [production]
10:25 <atsuko@deploy1003> helmfile [dse-k8s-codfw] DONE helmfile.d/dse-k8s-services/opensearch-ttmserver-test: apply [production]
10:25 <atsuko@deploy1003> helmfile [dse-k8s-codfw] START helmfile.d/dse-k8s-services/opensearch-ttmserver-test: apply [production]