2251-2300 of 10000 results (83ms)
2024-01-18 ยง
17:30 <topranks> Re-enabling PyBal on lvs2011 after network migration T352912 [production]
17:30 <bking@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host elastic2093.codfw.wmnet with OS bullseye [production]
17:28 <bking@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host elastic2099.codfw.wmnet with OS bullseye [production]
17:27 <bking@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host elastic2092.codfw.wmnet with OS bullseye [production]
17:25 <bking@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host elastic2091.codfw.wmnet with OS bullseye [production]
17:21 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1194', diff saved to https://phabricator.wikimedia.org/P54945 and previous config saved to /var/cache/conftool/dbconfig/20240118-172134-marostegui.json [production]
17:20 <bking@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host elastic2098.codfw.wmnet with OS bullseye [production]
17:14 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on elastic2102.codfw.wmnet with reason: host reimage [production]
17:11 <bking@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on elastic2102.codfw.wmnet with reason: host reimage [production]
17:11 <bking@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host elastic2089.codfw.wmnet with OS bullseye [production]
17:06 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1194 (T354336)', diff saved to https://phabricator.wikimedia.org/P54944 and previous config saved to /var/cache/conftool/dbconfig/20240118-170627-marostegui.json [production]
17:06 <bking@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host elastic2088.codfw.wmnet with OS bullseye [production]
17:04 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db1194 (T354336)', diff saved to https://phabricator.wikimedia.org/P54943 and previous config saved to /var/cache/conftool/dbconfig/20240118-170417-marostegui.json [production]
17:04 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1194.eqiad.wmnet with reason: Maintenance [production]
17:04 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 8:00:00 on db1194.eqiad.wmnet with reason: Maintenance [production]
17:03 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1191 (T354336)', diff saved to https://phabricator.wikimedia.org/P54942 and previous config saved to /var/cache/conftool/dbconfig/20240118-170355-marostegui.json [production]
16:54 <bking@cumin2002> START - Cookbook sre.hosts.reimage for host elastic2102.codfw.wmnet with OS bullseye [production]
16:49 <bking@cumin2002> START - Cookbook sre.hosts.reimage for host elastic2101.codfw.wmnet with OS bullseye [production]
16:48 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1191', diff saved to https://phabricator.wikimedia.org/P54941 and previous config saved to /var/cache/conftool/dbconfig/20240118-164848-marostegui.json [production]
16:42 <bking@cumin2002> START - Cookbook sre.hosts.reimage for host elastic2100.codfw.wmnet with OS bullseye [production]
16:36 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host elastic2090.codfw.wmnet with OS bullseye [production]
16:35 <bking@cumin2002> START - Cookbook sre.hosts.reimage for host elastic2099.codfw.wmnet with OS bullseye [production]
16:33 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1191', diff saved to https://phabricator.wikimedia.org/P54940 and previous config saved to /var/cache/conftool/dbconfig/20240118-163342-marostegui.json [production]
16:33 <hashar@deploy2002> Finished deploy [integration/docroot@1d9323f]: Remove Wikimedia Design Style Guide from the list - T347895 (duration: 00m 07s) [production]
16:33 <hashar@deploy2002> Started deploy [integration/docroot@1d9323f]: Remove Wikimedia Design Style Guide from the list - T347895 [production]
16:27 <bking@cumin2002> START - Cookbook sre.hosts.reimage for host elastic2098.codfw.wmnet with OS bullseye [production]
16:25 <sukhe> running authdns-update for T355308 [production]
16:22 <bking@cumin2002> START - Cookbook sre.hosts.reimage for host elastic2097.codfw.wmnet with OS bullseye [production]
16:18 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on elastic2090.codfw.wmnet with reason: host reimage [production]
16:18 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1191 (T354336)', diff saved to https://phabricator.wikimedia.org/P54939 and previous config saved to /var/cache/conftool/dbconfig/20240118-161834-marostegui.json [production]
16:18 <bking@cumin2002> START - Cookbook sre.hosts.reimage for host elastic2096.codfw.wmnet with OS bullseye [production]
16:18 <claime> Running puppet on 'P{P:kubernetes::node} and not P{F:lldp.parent ~ lsw}' - T352883 [production]
16:16 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db1191 (T354336)', diff saved to https://phabricator.wikimedia.org/P54938 and previous config saved to /var/cache/conftool/dbconfig/20240118-161624-marostegui.json [production]
16:16 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1191.eqiad.wmnet with reason: Maintenance [production]
16:16 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 8:00:00 on db1191.eqiad.wmnet with reason: Maintenance [production]
16:16 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1174 (T354336)', diff saved to https://phabricator.wikimedia.org/P54937 and previous config saved to /var/cache/conftool/dbconfig/20240118-161602-marostegui.json [production]
16:15 <bking@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on elastic2090.codfw.wmnet with reason: host reimage [production]
16:15 <bking@cumin2002> START - Cookbook sre.hosts.reimage for host elastic2095.codfw.wmnet with OS bullseye [production]
16:12 <bking@cumin2002> START - Cookbook sre.hosts.reimage for host elastic2094.codfw.wmnet with OS bullseye [production]
16:09 <bking@cumin2002> START - Cookbook sre.hosts.reimage for host elastic2093.codfw.wmnet with OS bullseye [production]
16:06 <bking@cumin2002> START - Cookbook sre.hosts.reimage for host elastic2092.codfw.wmnet with OS bullseye [production]
16:06 <cmooney@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on 6 hosts with reason: moving lvs2011 network link T352912 [production]
16:06 <cmooney@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on 6 hosts with reason: moving lvs2011 network link T352912 [production]
16:06 <cmooney@cumin1002> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on cr2-codfw,cr[1-2]-codfw IPv6,re0.cr1-codfw.mgmt,re0.cr2-codfw.mgmt cr1-codfw with reason: moving lvs2011 network link T352912 [production]
16:05 <cmooney@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on cr2-codfw,cr[1-2]-codfw IPv6,re0.cr1-codfw.mgmt,re0.cr2-codfw.mgmt cr1-codfw with reason: moving lvs2011 network link T352912 [production]
16:04 <cmooney@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on lvs2011.codfw.wmnet with reason: moving lvs2011 network link T352912 [production]
16:04 <cmooney@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on lvs2011.codfw.wmnet with reason: moving lvs2011 network link T352912 [production]
16:04 <bking@cumin2002> START - Cookbook sre.hosts.reimage for host elastic2091.codfw.wmnet with OS bullseye [production]
16:03 <claime> Running puppet on 'P{P:kubernetes::node} and P{F:lldp.parent ~ lsw}' - T352883 [production]
16:02 <topranks> disabling PyBal and puppet on lvs2011, moving traffic to lvs2014 ahead of network change T352912 [production]