2024-09-11
§
|
16:31 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'db2167 (re)pooling @ 25%: T373101', diff saved to https://phabricator.wikimedia.org/P68934 and previous config saved to /var/cache/conftool/dbconfig/20240911-163152-arnaudb.json |
[production] |
16:31 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'db2127 (re)pooling @ 25%: T373101', diff saved to https://phabricator.wikimedia.org/P68933 and previous config saved to /var/cache/conftool/dbconfig/20240911-163147-arnaudb.json |
[production] |
16:31 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'db2116 (re)pooling @ 25%: T373101', diff saved to https://phabricator.wikimedia.org/P68932 and previous config saved to /var/cache/conftool/dbconfig/20240911-163142-arnaudb.json |
[production] |
16:31 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'db2115 (re)pooling @ 25%: T373101', diff saved to https://phabricator.wikimedia.org/P68931 and previous config saved to /var/cache/conftool/dbconfig/20240911-163137-arnaudb.json |
[production] |
16:28 |
<sukhe@cumin1002> |
END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for cp2038.codfw.wmnet |
[production] |
16:28 |
<sukhe@cumin1002> |
START - Cookbook sre.hosts.remove-downtime for cp2038.codfw.wmnet |
[production] |
16:28 |
<sukhe@cumin1002> |
END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for cp2037.codfw.wmnet |
[production] |
16:27 |
<sukhe@cumin1002> |
START - Cookbook sre.hosts.remove-downtime for cp2037.codfw.wmnet |
[production] |
16:25 |
<bking@cumin2002> |
START - Cookbook sre.wdqs.data-transfer (None, T373791) xfer wikidata_main from wdqs2022.codfw.wmnet -> wdqs2021.codfw.wmnet w/ force delete existing files, repooling neither afterwards |
[production] |
16:21 |
<bking@deploy1003> |
Finished deploy [wdqs/wdqs@316bf7f]: 8 (duration: 00m 12s) |
[production] |
16:21 |
<bking@deploy1003> |
Started deploy [wdqs/wdqs@316bf7f]: 8 |
[production] |
16:20 |
<cmooney@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:20:00 on 24 hosts with reason: Move server uplinks codfw racks C7 |
[production] |
16:20 |
<cmooney@cumin1002> |
START - Cookbook sre.hosts.downtime for 0:20:00 on 24 hosts with reason: Move server uplinks codfw racks C7 |
[production] |
16:18 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on db2135.codfw.wmnet with reason: network maintenance |
[production] |
16:18 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 0:30:00 on db2135.codfw.wmnet with reason: network maintenance |
[production] |
16:08 |
<topranks> |
begin server uplink moves from asw-c6-codfw to lsw1-c6-codfw T373101 |
[production] |
16:07 |
<cmooney@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:20:00 on 34 hosts with reason: Move server uplinks codfw racks C6 |
[production] |
16:07 |
<cmooney@cumin1002> |
START - Cookbook sre.hosts.downtime for 0:20:00 on 34 hosts with reason: Move server uplinks codfw racks C6 |
[production] |
16:04 |
<aborrero@cloudcumin1001> |
END (PASS) - Cookbook wmcs.openstack.tofu (exit_code=0) running tofu plan for main branch |
[admin] |
16:04 |
<aborrero@cloudcumin1001> |
START - Cookbook wmcs.openstack.tofu running tofu plan for main branch |
[admin] |
16:03 |
<aborrero@cloudcumin1001> |
END (PASS) - Cookbook wmcs.openstack.tofu (exit_code=0) running tofu plan+apply for main branch |
[admin] |
16:03 |
<aborrero@cloudcumin1001> |
START - Cookbook wmcs.openstack.tofu running tofu plan+apply for main branch |
[admin] |
16:01 |
<aborrero@cloudcumin1001> |
END (FAIL) - Cookbook wmcs.openstack.tofu (exit_code=99) running tofu plan+apply for main branch |
[admin] |
16:00 |
<aborrero@cloudcumin1001> |
START - Cookbook wmcs.openstack.tofu running tofu plan+apply for main branch |
[admin] |
15:59 |
<aborrero@cloudcumin1001> |
END (PASS) - Cookbook wmcs.openstack.tofu (exit_code=0) running tofu plan for https://gitlab.wikimedia.org/repos/cloud/cloud-vps/tofu-infra/-/merge_requests/43 |
[admin] |
15:59 |
<aborrero@cloudcumin1001> |
START - Cookbook wmcs.openstack.tofu running tofu plan for https://gitlab.wikimedia.org/repos/cloud/cloud-vps/tofu-infra/-/merge_requests/43 |
[admin] |
15:58 |
<aborrero@cloudcumin1001> |
END (PASS) - Cookbook wmcs.openstack.tofu (exit_code=0) running tofu plan for https://gitlab.wikimedia.org/repos/cloud/cloud-vps/tofu-infra/-/merge_requests/43 |
[admin] |
15:58 |
<aborrero@cloudcumin1001> |
START - Cookbook wmcs.openstack.tofu running tofu plan for https://gitlab.wikimedia.org/repos/cloud/cloud-vps/tofu-infra/-/merge_requests/43 |
[admin] |
15:56 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Depooling db2145 (T371742)', diff saved to https://phabricator.wikimedia.org/P68930 and previous config saved to /var/cache/conftool/dbconfig/20240911-155608-ladsgroup.json |
[production] |
15:56 |
<ladsgroup@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2145.codfw.wmnet with reason: Maintenance |
[production] |
15:55 |
<ladsgroup@cumin1002> |
START - Cookbook sre.hosts.downtime for 12:00:00 on db2145.codfw.wmnet with reason: Maintenance |
[production] |
15:55 |
<mutante> |
moscovium - apt-get upgrade - installing new apache2 version and more package upgrades |
[production] |
15:50 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on wdqs2021.codfw.wmnet with reason: T373791 |
[production] |
15:50 |
<bking@cumin2002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on wdqs2021.codfw.wmnet with reason: T373791 |
[production] |
15:49 |
<bking@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host wdqs2021.codfw.wmnet with OS bullseye |
[production] |
15:41 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'depool db2115 db2116 db2127 db2167 db2168 db2179 db2180 db2210 es2022 es2038 - T370852', diff saved to https://phabricator.wikimedia.org/P68929 and previous config saved to /var/cache/conftool/dbconfig/20240911-154114-arnaudb.json |
[production] |
15:37 |
<urandom> |
depooling thanos-fe2004.codfw.wmnet — T373101 |
[production] |
15:36 |
<dzahn@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on moscovium.eqiad.wmnet with reason: nftables migration |
[production] |
15:36 |
<mutante> |
moscovium - rebooting for nftables migration |
[production] |
15:36 |
<dzahn@cumin2002> |
START - Cookbook sre.hosts.downtime for 0:10:00 on moscovium.eqiad.wmnet with reason: nftables migration |
[production] |
15:35 |
<mutante> |
phab2002 - rebooting for nftables migration |
[production] |
15:35 |
<dzahn@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on phab2002.codfw.wmnet with reason: nftables migration |
[production] |
15:35 |
<dzahn@cumin2002> |
START - Cookbook sre.hosts.downtime for 0:10:00 on phab2002.codfw.wmnet with reason: nftables migration |
[production] |
15:33 |
<aborrero@cloudcumin1001> |
END (PASS) - Cookbook wmcs.openstack.tofu (exit_code=0) running tofu plan+apply for main branch |
[admin] |
15:32 |
<aborrero@cloudcumin1001> |
START - Cookbook wmcs.openstack.tofu running tofu plan+apply for main branch |
[admin] |
15:32 |
<aborrero@cloudcumin1001> |
END (FAIL) - Cookbook wmcs.openstack.tofu (exit_code=99) running tofu plan+apply for main branch |
[admin] |
15:31 |
<aborrero@cloudcumin1001> |
START - Cookbook wmcs.openstack.tofu running tofu plan+apply for main branch |
[admin] |
15:31 |
<topranks> |
push server and vlan configuration to lsw1-c6-codfw with Homer to prep physical moves T373101 |
[production] |
15:30 |
<aborrero@cloudcumin1001> |
END (PASS) - Cookbook wmcs.openstack.tofu (exit_code=0) running tofu plan for https://gitlab.wikimedia.org/repos/cloud/cloud-vps/tofu-infra/-/merge_requests/44 |
[admin] |
15:30 |
<aborrero@cloudcumin1001> |
START - Cookbook wmcs.openstack.tofu running tofu plan for https://gitlab.wikimedia.org/repos/cloud/cloud-vps/tofu-infra/-/merge_requests/44 |
[admin] |