2024-05-30
§
|
06:50 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'configure' for AS: 8674 |
[production] |
06:49 |
<ayounsi@cumin1002> |
START - Cookbook sre.network.peering with action 'configure' for AS: 8674 |
[production] |
06:48 |
<logmsgbot> |
@deploy1002 helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
06:48 |
<logmsgbot> |
@deploy1002 helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
06:48 |
<ayounsi@cumin1002> |
END (FAIL) - Cookbook sre.network.peering (exit_code=99) with action 'configure' for AS: 8674 |
[production] |
06:47 |
<ayounsi@cumin1002> |
START - Cookbook sre.network.peering with action 'configure' for AS: 8674 |
[production] |
06:46 |
<logmsgbot> |
@deploy1002 helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
06:46 |
<logmsgbot> |
@deploy1002 helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
06:45 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2122', diff saved to https://phabricator.wikimedia.org/P63642 and previous config saved to /var/cache/conftool/dbconfig/20240530-064519-marostegui.json |
[production] |
06:36 |
<logmsgbot> |
@deploy1002 helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
06:36 |
<logmsgbot> |
@deploy1002 helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
06:33 |
<logmsgbot> |
@deploy1002 helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
06:33 |
<logmsgbot> |
@deploy1002 helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
06:31 |
<logmsgbot> |
@deploy1002 helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
06:31 |
<logmsgbot> |
@deploy1002 helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
06:30 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2122 (T366123)', diff saved to https://phabricator.wikimedia.org/P63641 and previous config saved to /var/cache/conftool/dbconfig/20240530-063011-marostegui.json |
[production] |
06:19 |
<logmsgbot> |
@deploy1002 helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
06:19 |
<logmsgbot> |
@deploy1002 helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
06:00 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db2122 (T366123)', diff saved to https://phabricator.wikimedia.org/P63640 and previous config saved to /var/cache/conftool/dbconfig/20240530-060023-marostegui.json |
[production] |
06:00 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2122.codfw.wmnet with reason: Maintenance |
[production] |
06:00 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 8:00:00 on db2122.codfw.wmnet with reason: Maintenance |
[production] |
05:59 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2121 (T366123)', diff saved to https://phabricator.wikimedia.org/P63639 and previous config saved to /var/cache/conftool/dbconfig/20240530-055959-marostegui.json |
[production] |
05:56 |
<logmsgbot> |
@deploy1002 helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
05:56 |
<logmsgbot> |
@deploy1002 helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
05:44 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2121', diff saved to https://phabricator.wikimedia.org/P63638 and previous config saved to /var/cache/conftool/dbconfig/20240530-054451-marostegui.json |
[production] |
05:29 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2121', diff saved to https://phabricator.wikimedia.org/P63636 and previous config saved to /var/cache/conftool/dbconfig/20240530-052941-marostegui.json |
[production] |
05:20 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db1163 (T364299)', diff saved to https://phabricator.wikimedia.org/P63635 and previous config saved to /var/cache/conftool/dbconfig/20240530-052006-marostegui.json |
[production] |
05:19 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1163.eqiad.wmnet with reason: Maintenance |
[production] |
05:19 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 6:00:00 on db1163.eqiad.wmnet with reason: Maintenance |
[production] |
05:17 |
<marostegui> |
Deploy schema changes on old s8 eqiad master (db1209) dbmaint T355609 T356166 |
[production] |
05:16 |
<logmsgbot> |
@deploy1002 helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
05:16 |
<logmsgbot> |
@deploy1002 helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
05:14 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2121 (T366123)', diff saved to https://phabricator.wikimedia.org/P63634 and previous config saved to /var/cache/conftool/dbconfig/20240530-051433-marostegui.json |
[production] |
05:14 |
<logmsgbot> |
@deploy1002 helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
05:13 |
<logmsgbot> |
@deploy1002 helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
05:12 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db2121 (T366123)', diff saved to https://phabricator.wikimedia.org/P63633 and previous config saved to /var/cache/conftool/dbconfig/20240530-051220-marostegui.json |
[production] |
05:12 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2121.codfw.wmnet with reason: Maintenance |
[production] |
05:11 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 8:00:00 on db2121.codfw.wmnet with reason: Maintenance |
[production] |
05:11 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool db1209 T364541', diff saved to https://phabricator.wikimedia.org/P63632 and previous config saved to /var/cache/conftool/dbconfig/20240530-051132-root.json |
[production] |
05:10 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Promote db1192 to s8 primary and set section read-write T364541', diff saved to https://phabricator.wikimedia.org/P63631 and previous config saved to /var/cache/conftool/dbconfig/20240530-051031-marostegui.json |
[production] |
05:10 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Set s8 eqiad as read-only for maintenance - T364541', diff saved to https://phabricator.wikimedia.org/P63630 and previous config saved to /var/cache/conftool/dbconfig/20240530-051012-marostegui.json |
[production] |
05:09 |
<marostegui> |
Starting s8 eqiad failover from db1209 to db1192 - T364541 |
[production] |
05:02 |
<pt1979@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
04:56 |
<logmsgbot> |
@deploy1002 helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
04:56 |
<logmsgbot> |
@deploy1002 helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
04:43 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Remove db1192 from API/vslow/dump T364541', diff saved to https://phabricator.wikimedia.org/P63629 and previous config saved to /var/cache/conftool/dbconfig/20240530-044328-root.json |
[production] |
04:43 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 33 hosts with reason: Primary switchover s8 T364541 |
[production] |
04:42 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Set db1192 with weight 0 T364541', diff saved to https://phabricator.wikimedia.org/P63628 and previous config saved to /var/cache/conftool/dbconfig/20240530-044249-root.json |
[production] |
04:42 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 1:00:00 on 33 hosts with reason: Primary switchover s8 T364541 |
[production] |
04:42 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2098.codfw.wmnet with reason: Maintenance |
[production] |