|
2026-05-15
ยง
|
| 10:28 |
<elukey@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on sretest2010.codfw.wmnet with reason: host reimage |
[production] |
| 10:24 |
<mvernon@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ms-be2064.codfw.wmnet with reason: host reimage |
[production] |
| 10:23 |
<atsuko@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/opensearch-toolhub-test: apply |
[production] |
| 10:23 |
<atsuko@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/opensearch-toolhub-test: apply |
[production] |
| 10:22 |
<atsuko@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/opensearch-toolhub-test: apply |
[production] |
| 10:22 |
<atsuko@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/opensearch-toolhub-test: apply |
[production] |
| 10:20 |
<mvernon@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on ms-be2064.codfw.wmnet with reason: host reimage |
[production] |
| 10:12 |
<cmooney@cumin1003> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
| 10:12 |
<cmooney@cumin1003> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: modify entries for ulsfo router interfaces - cmooney@cumin1003" |
[production] |
| 10:12 |
<cmooney@cumin1003> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: modify entries for ulsfo router interfaces - cmooney@cumin1003" |
[production] |
| 10:10 |
<topranks> |
Migrate ulsfo cr<->cr traffic to use path via switches not direct link T424611 |
[production] |
| 10:04 |
<cmooney@cumin1003> |
START - Cookbook sre.dns.netbox |
[production] |
| 10:04 |
<mvernon@cumin2002> |
START - Cookbook sre.hosts.reimage for host ms-be2064.codfw.wmnet with OS bullseye |
[production] |
| 10:01 |
<elukey@cumin1003> |
START - Cookbook sre.hosts.reimage for host sretest2010.codfw.wmnet with OS trixie |
[production] |
| 10:01 |
<aokoth@cumin1003> |
START - Cookbook sre.gitlab.upgrade on GitLab host gitlab1004.wikimedia.org with reason: Security Release - T426298 |
[production] |
| 10:00 |
<elukey@cumin1003> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest2010.codfw.wmnet with OS trixie |
[production] |
| 09:56 |
<topranks> |
Migrate cr3-ulsfo link to asw1-22-ulsfo to tagged interface T424611 |
[production] |
| 09:49 |
<jiji@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/mw-mcrouter: apply |
[production] |
| 09:48 |
<elukey@cumin1003> |
START - Cookbook sre.hosts.reimage for host sretest2010.codfw.wmnet with OS trixie |
[production] |
| 09:48 |
<elukey@cumin1003> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest2010.codfw.wmnet with OS trixie |
[production] |
| 09:33 |
<elukey@cumin1003> |
START - Cookbook sre.hosts.reimage for host sretest2010.codfw.wmnet with OS trixie |
[production] |
| 09:32 |
<mvernon@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host ms-be2064.codfw.wmnet with OS bullseye |
[production] |
| 09:32 |
<topranks> |
Migrate cr4-ulsfo link to asw1-23-ulsfo to tagged interface T424611 |
[production] |
| 09:30 |
<jiji@deploy1003> |
helmfile [eqiad] START helmfile.d/services/mw-mcrouter: apply |
[production] |
| 09:30 |
<jiji@deploy1003> |
helmfile [codfw] DONE helmfile.d/services/mw-mcrouter: apply |
[production] |
| 09:30 |
<mvernon@cumin2002> |
END (FAIL) - Cookbook sre.swift.convert-disks (exit_code=99) for host ms-be2065 |
[production] |
| 09:30 |
<jiji@deploy1003> |
helmfile [codfw] START helmfile.d/services/mw-mcrouter: apply |
[production] |
| 09:10 |
<elukey@cumin1003> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest2010.codfw.wmnet with OS trixie |
[production] |
| 09:08 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5 days, 0:00:00 on db2218.codfw.wmnet with reason: Host crashed T426383 |
[production] |
| 09:08 |
<mvernon@cumin2002> |
END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host ms-be2064 |
[production] |
| 09:08 |
<mvernon@cumin2002> |
END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host ms-be2064 |
[production] |
| 09:06 |
<mvernon@cumin2002> |
START - Cookbook sre.network.configure-switch-interfaces for host ms-be2064 |
[production] |
| 09:06 |
<mvernon@cumin2002> |
END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) ms-be2064.codfw.wmnet 56.32.192.10.in-addr.arpa 6.5.0.0.2.3.0.0.2.9.1.0.0.1.0.0.3.0.1.0.0.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors |
[production] |
| 09:06 |
<mvernon@cumin2002> |
START - Cookbook sre.dns.wipe-cache ms-be2064.codfw.wmnet 56.32.192.10.in-addr.arpa 6.5.0.0.2.3.0.0.2.9.1.0.0.1.0.0.3.0.1.0.0.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors |
[production] |
| 09:06 |
<mvernon@cumin2002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
| 09:06 |
<mvernon@cumin2002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host ms-be2064 - mvernon@cumin2002" |
[production] |
| 09:06 |
<mvernon@cumin2002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host ms-be2064 - mvernon@cumin2002" |
[production] |
| 09:03 |
<elukey@cumin1003> |
START - Cookbook sre.hosts.reimage for host sretest2010.codfw.wmnet with OS trixie |
[production] |
| 09:02 |
<mvernon@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
| 09:02 |
<mvernon@cumin2002> |
START - Cookbook sre.hosts.move-vlan for host ms-be2064 |
[production] |
| 09:01 |
<mvernon@cumin2002> |
START - Cookbook sre.hosts.reimage for host ms-be2064.codfw.wmnet with OS bullseye |
[production] |
| 09:00 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depool db2218 T426380', diff saved to https://phabricator.wikimedia.org/P92553 and previous config saved to /var/cache/conftool/dbconfig/20260515-090000-marostegui.json |
[production] |
| 08:58 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Promote db2220 to s7 primary T426380', diff saved to https://phabricator.wikimedia.org/P92552 and previous config saved to /var/cache/conftool/dbconfig/20260515-085836-marostegui.json |
[production] |
| 08:56 |
<marostegui> |
Starting s7 codfw failover from db2218 to db2220 - T426380 |
[production] |
| 08:54 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 28 hosts with reason: Primary switchover s7 T426380 |
[production] |
| 08:54 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Set db2220 with weight 0 T426380', diff saved to https://phabricator.wikimedia.org/P92551 and previous config saved to /var/cache/conftool/dbconfig/20260515-085420-marostegui.json |
[production] |
| 08:41 |
<mvernon@cumin2002> |
START - Cookbook sre.swift.convert-disks for host ms-be2065 |
[production] |
| 08:41 |
<mvernon@cumin2002> |
END (FAIL) - Cookbook sre.swift.convert-disks (exit_code=99) for host ms-be2064 |
[production] |
| 08:28 |
<elukey@cumin1003> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host kafka-logging1006.eqiad.wmnet with OS trixie |
[production] |
| 08:17 |
<elukey@cumin1003> |
START - Cookbook sre.hosts.reimage for host kafka-logging1006.eqiad.wmnet with OS trixie |
[production] |