1101-1150 of 10000 results (30ms)
2024-10-15 ยง
16:14 <dcaro@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component api-gateway [tools]
16:10 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2154 (T371742)', diff saved to https://phabricator.wikimedia.org/P70000 and previous config saved to /var/cache/conftool/dbconfig/20241015-161018-ladsgroup.json [production]
16:08 <dcaro@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component api-gateway [toolsbeta]
16:07 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2176', diff saved to https://phabricator.wikimedia.org/P69999 and previous config saved to /var/cache/conftool/dbconfig/20241015-160758-arnaudb.json [production]
16:03 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1202', diff saved to https://phabricator.wikimedia.org/P69998 and previous config saved to /var/cache/conftool/dbconfig/20241015-160351-ladsgroup.json [production]
16:01 <dcaro@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component api-gateway [toolsbeta]
16:01 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depool db2205 T377164', diff saved to https://phabricator.wikimedia.org/P69997 and previous config saved to /var/cache/conftool/dbconfig/20241015-160106-ladsgroup.json [production]
15:53 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2176 (T367781)', diff saved to https://phabricator.wikimedia.org/P69996 and previous config saved to /var/cache/conftool/dbconfig/20241015-155251-arnaudb.json [production]
15:52 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Promote db2209 to s3 primary and set section read-write T377164', diff saved to https://phabricator.wikimedia.org/P69995 and previous config saved to /var/cache/conftool/dbconfig/20241015-155240-ladsgroup.json [production]
15:50 <aborrero@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.tofu (exit_code=0) running tofu plan+apply for main branch [admin]
15:50 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1202 (T376905)', diff saved to https://phabricator.wikimedia.org/P69994 and previous config saved to /var/cache/conftool/dbconfig/20241015-154844-ladsgroup.json [production]
15:50 <aborrero@cloudcumin1001> START - Cookbook wmcs.openstack.tofu running tofu plan+apply for main branch [admin]
15:48 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Set s3 codfw as read-only for maintenance - T377164', diff saved to https://phabricator.wikimedia.org/P69993 and previous config saved to /var/cache/conftool/dbconfig/20241015-154834-ladsgroup.json [production]
15:48 <Amir1> Starting s3 codfw failover from db2205 to db2209 - T377164 [production]
15:46 <arnaudb@cumin1002> dbctl commit (dc=all): 'Depooling db2176 (T367781)', diff saved to https://phabricator.wikimedia.org/P69992 and previous config saved to /var/cache/conftool/dbconfig/20241015-154318-arnaudb.json [production]
15:46 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2176.codfw.wmnet with reason: Maintenance [production]
15:45 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 4:00:00 on db2176.codfw.wmnet with reason: Maintenance [production]
15:45 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2174 (T367781)', diff saved to https://phabricator.wikimedia.org/P69991 and previous config saved to /var/cache/conftool/dbconfig/20241015-154256-arnaudb.json [production]
15:44 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Set db2209 with weight 0 T377164', diff saved to https://phabricator.wikimedia.org/P69990 and previous config saved to /var/cache/conftool/dbconfig/20241015-154228-ladsgroup.json [production]
15:44 <aborrero@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.tofu (exit_code=0) running tofu plan for main branch [admin]
15:43 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 24 hosts with reason: Primary switchover s3 T377164 [production]
15:43 <aborrero@cloudcumin1001> START - Cookbook wmcs.openstack.tofu running tofu plan for main branch [admin]
15:42 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 1:00:00 on 24 hosts with reason: Primary switchover s3 T377164 [production]
15:42 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling db1202 (T376905)', diff saved to https://phabricator.wikimedia.org/P69989 and previous config saved to /var/cache/conftool/dbconfig/20241015-154027-ladsgroup.json [production]
15:41 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1202.eqiad.wmnet with reason: Maintenance [production]
15:40 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1202.eqiad.wmnet with reason: Maintenance [production]
15:40 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1194 (T376905)', diff saved to https://phabricator.wikimedia.org/P69988 and previous config saved to /var/cache/conftool/dbconfig/20241015-154002-ladsgroup.json [production]
15:38 <aborrero@cloudcumin1001> END (FAIL) - Cookbook wmcs.openstack.tofu (exit_code=99) running tofu plan for main branch [admin]
15:38 <aborrero@cloudcumin1001> START - Cookbook wmcs.openstack.tofu running tofu plan for main branch [admin]
15:35 <aborrero@cloudcumin1001> END (FAIL) - Cookbook wmcs.openstack.tofu (exit_code=99) running tofu plan for main branch [admin]
15:35 <aborrero@cloudcumin1001> START - Cookbook wmcs.openstack.tofu running tofu plan for main branch [admin]
15:35 <aborrero@cloudcumin1001> END (FAIL) - Cookbook wmcs.openstack.tofu (exit_code=99) running tofu plan for main branch [admin]
15:35 <aborrero@cloudcumin1001> START - Cookbook wmcs.openstack.tofu running tofu plan for main branch [admin]
15:32 <aborrero@cloudcumin1001> END (FAIL) - Cookbook wmcs.openstack.tofu (exit_code=99) running tofu plan for main branch [admin]
15:32 <aborrero@cloudcumin1001> START - Cookbook wmcs.openstack.tofu running tofu plan for main branch [admin]
15:32 <aborrero@cloudcumin1001> END (FAIL) - Cookbook wmcs.openstack.tofu (exit_code=99) running tofu plan for main branch [admin]
15:32 <aborrero@cloudcumin1001> START - Cookbook wmcs.openstack.tofu running tofu plan for main branch [admin]
15:31 <aborrero@cloudcumin1001> END (FAIL) - Cookbook wmcs.openstack.tofu (exit_code=99) running tofu plan for main branch [admin]
15:31 <aborrero@cloudcumin1001> START - Cookbook wmcs.openstack.tofu running tofu plan for main branch [admin]
15:31 <aborrero@cloudcumin1001> END (FAIL) - Cookbook wmcs.openstack.tofu (exit_code=99) running tofu plan for main branch [admin]
15:31 <aborrero@cloudcumin1001> START - Cookbook wmcs.openstack.tofu running tofu plan for main branch [admin]
15:27 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2174', diff saved to https://phabricator.wikimedia.org/P69987 and previous config saved to /var/cache/conftool/dbconfig/20241015-152749-arnaudb.json [production]
15:26 <akosiaris> run gnt-cluster verify-disks after ganeti1034 forceful reboot [production]
15:24 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1194', diff saved to https://phabricator.wikimedia.org/P69986 and previous config saved to /var/cache/conftool/dbconfig/20241015-152456-ladsgroup.json [production]
15:22 <volans> force-rebooting ganeti1034 stuck due to drbd traces via mgmt [production]
15:19 <akosiaris@cumin1002> END (FAIL) - Cookbook sre.ganeti.drain-node (exit_code=99) for draining ganeti node ganeti1034.eqiad.wmnet [production]
15:17 <akosiaris> drain ganeti1034 of VMs, hardware might be misbehaving [production]
15:16 <akosiaris@cumin1002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1034.eqiad.wmnet [production]
15:12 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2174', diff saved to https://phabricator.wikimedia.org/P69985 and previous config saved to /var/cache/conftool/dbconfig/20241015-151243-arnaudb.json [production]
15:09 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1194', diff saved to https://phabricator.wikimedia.org/P69984 and previous config saved to /var/cache/conftool/dbconfig/20241015-150948-ladsgroup.json [production]