2024-07-02
ยง
|
11:12 |
<claime> |
pooling and uncordoning wikikube-worker2025.codfw.wmnet|wikikube-worker2026.codfw.wmnet|wikikube-worker2027.codfw.wmnet|wikikube-worker2028.codfw.wmnet|wikikube-worker2029.codfw.wmnet - T351074 |
[production] |
11:11 |
<jiji@cumin1002> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts kubemaster[2001-2002].codfw.wmnet |
[production] |
11:11 |
<jiji@cumin1002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
11:11 |
<jiji@cumin1002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: kubemaster[2001-2002].codfw.wmnet decommissioned, removing all IPs except the asset tag one - jiji@cumin1002" |
[production] |
11:07 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 27 hosts with reason: Primary switchover s6 T369021 |
[production] |
11:07 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Set db2214 with weight 0 T369021', diff saved to https://phabricator.wikimedia.org/P65649 and previous config saved to /var/cache/conftool/dbconfig/20240702-110750-root.json |
[production] |
11:07 |
<jiji@cumin1002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: kubemaster[2001-2002].codfw.wmnet decommissioned, removing all IPs except the asset tag one - jiji@cumin1002" |
[production] |
11:07 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 1:00:00 on 27 hosts with reason: Primary switchover s6 T369021 |
[production] |
11:04 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2173 (T364069)', diff saved to https://phabricator.wikimedia.org/P65648 and previous config saved to /var/cache/conftool/dbconfig/20240702-110442-marostegui.json |
[production] |
11:01 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2165 (re)pooling @ 5%: Repooling', diff saved to https://phabricator.wikimedia.org/P65647 and previous config saved to /var/cache/conftool/dbconfig/20240702-110111-root.json |
[production] |
10:56 |
<jiji@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
10:54 |
<aborrero@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.k8s.component.deploy (exit_code=0) for component builds-builder |
[tools] |
10:54 |
<aborrero@cloudcumin1001> |
START - Cookbook wmcs.toolforge.k8s.component.deploy for component builds-builder |
[tools] |
10:54 |
<aborrero@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.k8s.component.deploy (exit_code=0) for component builds-builder |
[toolsbeta] |
10:54 |
<James_F> |
jforrester@doc1003:~$ sudo -u doc-uploader rm -rf /srv/doc/cover-extensions/Listings/ # T354997 |
[releng] |
10:53 |
<aborrero@cloudcumin1001> |
START - Cookbook wmcs.toolforge.k8s.component.deploy for component builds-builder |
[toolsbeta] |
10:53 |
<aborrero@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.k8s.component.deploy (exit_code=0) for component builds-builder |
[toolsbeta] |
10:53 |
<aborrero@cloudcumin1001> |
START - Cookbook wmcs.toolforge.k8s.component.deploy for component builds-builder |
[toolsbeta] |
10:50 |
<jiji@cumin1002> |
START - Cookbook sre.hosts.decommission for hosts kubemaster[2001-2002].codfw.wmnet |
[production] |
10:47 |
<James_F> |
Zuul: [mediawiki/extensions/Listings] Mark as archived, for T354997 |
[releng] |
10:46 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2165 (re)pooling @ 1%: Repooling', diff saved to https://phabricator.wikimedia.org/P65646 and previous config saved to /var/cache/conftool/dbconfig/20240702-104605-root.json |
[production] |
10:42 |
<pfischer@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
10:42 |
<pfischer@deploy1002> |
helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
10:42 |
<pfischer@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
10:41 |
<pfischer@deploy1002> |
helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
10:35 |
<brouberol@cumin1002> |
START - Cookbook sre.druid.roll-restart-workers for Druid public cluster: Roll restart of Druid jvm daemons. |
[production] |
10:34 |
<btullis@cumin1002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-master1003.eqiad.wmnet |
[production] |
10:32 |
<brouberol@cumin1002> |
END (PASS) - Cookbook sre.k8s.reboot-nodes (exit_code=0) rolling reboot on A:dse-k8s-worker |
[production] |
10:28 |
<fabfur> |
upgrading A:cp-eqiad to haproxy 2.8.10 (T367756) |
[production] |
10:27 |
<fabfur@cumin1002> |
START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-text_eqiad |
[production] |
10:27 |
<fabfur@cumin1002> |
START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-upload_eqiad |
[production] |
10:26 |
<btullis> |
rebooting an-master1003 (current standby namenode and resourcemanager) for T366555 |
[analytics] |
10:25 |
<btullis@cumin1002> |
START - Cookbook sre.hosts.reboot-single for host an-master1003.eqiad.wmnet |
[production] |
10:06 |
<jynus@cumin1002> |
dbctl commit (dc=all): 'Repool es1025 at 100% weight T363812', diff saved to https://phabricator.wikimedia.org/P65645 and previous config saved to /var/cache/conftool/dbconfig/20240702-100636-jynus.json |
[production] |
10:04 |
<btullis> |
killing stuck gobblin jobs |
[analytics] |
10:02 |
<claime> |
homer 'cr*codfw*' commit 'T351074' |
[production] |
09:56 |
<aborrero@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.k8s.component.deploy (exit_code=0) for component wmcs-k8s-metrics |
[tools] |
09:56 |
<aborrero@cloudcumin1001> |
START - Cookbook wmcs.toolforge.k8s.component.deploy for component wmcs-k8s-metrics |
[tools] |
09:56 |
<aborrero@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.k8s.component.deploy (exit_code=0) for component wmcs-k8s-metrics |
[toolsbeta] |
09:56 |
<aborrero@cloudcumin1001> |
START - Cookbook wmcs.toolforge.k8s.component.deploy for component wmcs-k8s-metrics |
[toolsbeta] |
09:53 |
<jiji@cumin1002> |
conftool action : set/pooled=no; selector: name=kubemaster200[1-2].codfw.wmnet |
[production] |
09:52 |
<elukey> |
volatile dir on puppetserver1001 with the new point release (12.6) for Bookworm |
[production] |
09:48 |
<jiji@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on kubemaster[2001-2002].codfw.wmnet with reason: decom |
[production] |
09:47 |
<jiji@cumin1002> |
START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on kubemaster[2001-2002].codfw.wmnet with reason: decom |
[production] |
09:23 |
<aborrero@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.k8s.component.deploy (exit_code=0) for component cert-manager |
[tools] |
09:22 |
<aborrero@cloudcumin1001> |
START - Cookbook wmcs.toolforge.k8s.component.deploy for component cert-manager |
[tools] |
09:22 |
<aborrero@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.k8s.component.deploy (exit_code=0) for component cert-manager |
[toolsbeta] |
09:22 |
<aborrero@cloudcumin1001> |
START - Cookbook wmcs.toolforge.k8s.component.deploy for component cert-manager |
[toolsbeta] |
09:20 |
<brouberol@cumin1002> |
START - Cookbook sre.k8s.reboot-nodes rolling reboot on A:dse-k8s-worker |
[production] |
09:15 |
<jynus@cumin1002> |
dbctl commit (dc=all): 'Repool es1025 at 50% weight T363812', diff saved to https://phabricator.wikimedia.org/P65644 and previous config saved to /var/cache/conftool/dbconfig/20240702-091508-jynus.json |
[production] |