2024-07-23
§
|
06:58 |
<kartik@deploy1002> |
helmfile [staging] START helmfile.d/services/cxserver: apply |
[production] |
05:00 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db2173 (T367856)', diff saved to https://phabricator.wikimedia.org/P66892 and previous config saved to /var/cache/conftool/dbconfig/20240723-050042-marostegui.json |
[production] |
05:00 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 12:00:00 on db2186.codfw.wmnet with reason: Maintenance |
[production] |
05:00 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 2 days, 12:00:00 on db2186.codfw.wmnet with reason: Maintenance |
[production] |
05:00 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 6:00:00 on db2173.codfw.wmnet with reason: Maintenance |
[production] |
05:00 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 6:00:00 on db2173.codfw.wmnet with reason: Maintenance |
[production] |
05:00 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2170 (T367856)', diff saved to https://phabricator.wikimedia.org/P66891 and previous config saved to /var/cache/conftool/dbconfig/20240723-050004-marostegui.json |
[production] |
04:44 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2170', diff saved to https://phabricator.wikimedia.org/P66890 and previous config saved to /var/cache/conftool/dbconfig/20240723-044457-marostegui.json |
[production] |
04:29 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2170', diff saved to https://phabricator.wikimedia.org/P66889 and previous config saved to /var/cache/conftool/dbconfig/20240723-042950-marostegui.json |
[production] |
04:14 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2170 (T367856)', diff saved to https://phabricator.wikimedia.org/P66888 and previous config saved to /var/cache/conftool/dbconfig/20240723-041442-marostegui.json |
[production] |
04:01 |
<mwpresync@deploy1002> |
Pruned MediaWiki: 1.43.0-wmf.12 (duration: 01m 00s) |
[production] |
03:54 |
<mwpresync@deploy1002> |
Finished scap: testwikis to 1.43.0-wmf.15 refs T366960 (duration: 51m 50s) |
[production] |
03:03 |
<mwpresync@deploy1002> |
Started scap sync-world: testwikis to 1.43.0-wmf.15 refs T366960 |
[production] |
01:28 |
<dani@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/miscweb: apply |
[production] |
01:27 |
<dani@deploy1002> |
helmfile [codfw] START helmfile.d/services/miscweb: apply |
[production] |
01:27 |
<dani@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/miscweb: apply |
[production] |
01:27 |
<dani@deploy1002> |
helmfile [eqiad] START helmfile.d/services/miscweb: apply |
[production] |
01:27 |
<dani@deploy1002> |
helmfile [staging] DONE helmfile.d/services/miscweb: apply |
[production] |
01:27 |
<dani@deploy1002> |
helmfile [staging] START helmfile.d/services/miscweb: apply |
[production] |
01:24 |
<dani@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/miscweb: apply |
[production] |
01:24 |
<dani@deploy1002> |
helmfile [codfw] START helmfile.d/services/miscweb: apply |
[production] |
01:24 |
<dani@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/miscweb: apply |
[production] |
01:24 |
<dani@deploy1002> |
helmfile [eqiad] START helmfile.d/services/miscweb: apply |
[production] |
01:24 |
<dani@deploy1002> |
helmfile [staging] DONE helmfile.d/services/miscweb: apply |
[production] |
01:24 |
<dani@deploy1002> |
helmfile [staging] START helmfile.d/services/miscweb: apply |
[production] |
01:24 |
<dani@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/miscweb: apply |
[production] |
01:24 |
<dani@deploy1002> |
helmfile [codfw] START helmfile.d/services/miscweb: apply |
[production] |
01:24 |
<dani@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/miscweb: apply |
[production] |
01:24 |
<dani@deploy1002> |
helmfile [eqiad] START helmfile.d/services/miscweb: apply |
[production] |
01:24 |
<dani@deploy1002> |
helmfile [staging] DONE helmfile.d/services/miscweb: apply |
[production] |
01:24 |
<dani@deploy1002> |
helmfile [staging] START helmfile.d/services/miscweb: apply |
[production] |
00:22 |
<eevans@deploy1002> |
helmfile [staging] DONE helmfile.d/services/data-gateway: apply |
[production] |
00:22 |
<eevans@deploy1002> |
helmfile [staging] START helmfile.d/services/data-gateway: apply |
[production] |
00:05 |
<cmooney@cumin1002> |
END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM netflow2003.codfw.wmnet |
[production] |
00:02 |
<cmooney@cumin1002> |
START - Cookbook sre.ganeti.reboot-vm for VM netflow2003.codfw.wmnet |
[production] |
00:00 |
<cmooney@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:15:00 on netflow2003.codfw.wmnet with reason: reboot netflow2003 |
[production] |
00:00 |
<cmooney@cumin1002> |
START - Cookbook sre.hosts.downtime for 0:15:00 on netflow2003.codfw.wmnet with reason: reboot netflow2003 |
[production] |
2024-07-22
§
|
23:08 |
<cmooney@cumin1002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "set lsw in codfw to active - cmooney@cumin1002" |
[production] |
23:07 |
<cmooney@cumin1002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "set lsw in codfw to active - cmooney@cumin1002" |
[production] |
23:05 |
<cmooney@cumin1002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
23:03 |
<cmooney@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
22:47 |
<jclark@cumin1002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudcephmon1005.eqiad.wmnet with OS bullseye |
[production] |
22:38 |
<jclark@cumin1002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudcephmon1004.eqiad.wmnet with OS bullseye |
[production] |
22:37 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.elasticsearch.ban (exit_code=0) Banning hosts: elastic110[0-2]* for T348977 - bking@cumin2002 |
[production] |
22:36 |
<bking@cumin2002> |
START - Cookbook sre.elasticsearch.ban Banning hosts: elastic110[0-2]* for T348977 - bking@cumin2002 |
[production] |
22:35 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on elastic[1100-1102].eqiad.wmnet with reason: T348977 |
[production] |
22:34 |
<bking@cumin2002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on elastic[1100-1102].eqiad.wmnet with reason: T348977 |
[production] |
22:01 |
<jclark@cumin1002> |
START - Cookbook sre.hosts.reimage for host cloudcephmon1005.eqiad.wmnet with OS bullseye |
[production] |
21:52 |
<jclark@cumin1002> |
START - Cookbook sre.hosts.reimage for host cloudcephmon1004.eqiad.wmnet with OS bullseye |
[production] |
21:30 |
<catrope@deploy1002> |
Finished scap: Backport for [[gerrit:1055941|Do not unreview pages when they are moved (T370593)]] (duration: 20m 27s) |
[production] |