2024-03-06
§
|
12:33 |
<cgoubert@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2374.codfw.wmnet with reason: host reimage |
[production] |
12:30 |
<cgoubert@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2373.codfw.wmnet with reason: host reimage |
[production] |
12:30 |
<jnuche@deploy2002> |
jnuche: Continuing with sync |
[production] |
12:30 |
<jnuche@deploy2002> |
jnuche: Backport for [[gerrit:1009231|Add missing function argument to titleWithoutPrefix call (T359290)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
12:28 |
<jnuche@deploy2002> |
Started scap: Backport for [[gerrit:1009231|Add missing function argument to titleWithoutPrefix call (T359290)]] |
[production] |
12:27 |
<cgoubert@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2376.codfw.wmnet with reason: host reimage |
[production] |
12:26 |
<jiji@deploy2002> |
helmfile [eqiad] START helmfile.d/services/mw-mcrouter: apply |
[production] |
12:24 |
<cgoubert@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2371.codfw.wmnet with reason: host reimage |
[production] |
12:22 |
<cgoubert@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2372.codfw.wmnet with reason: host reimage |
[production] |
12:20 |
<cgoubert@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2375.codfw.wmnet with reason: host reimage |
[production] |
12:18 |
<cgoubert@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2372.codfw.wmnet with reason: host reimage |
[production] |
12:18 |
<cgoubert@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2376.codfw.wmnet with reason: host reimage |
[production] |
12:18 |
<cgoubert@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2374.codfw.wmnet with reason: host reimage |
[production] |
12:18 |
<cgoubert@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2373.codfw.wmnet with reason: host reimage |
[production] |
12:18 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Depooling db2124 (T352010)', diff saved to https://phabricator.wikimedia.org/P58581 and previous config saved to /var/cache/conftool/dbconfig/20240306-121800-ladsgroup.json |
[production] |
12:18 |
<cgoubert@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2371.codfw.wmnet with reason: host reimage |
[production] |
12:17 |
<cgoubert@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2375.codfw.wmnet with reason: host reimage |
[production] |
12:17 |
<ladsgroup@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2124.codfw.wmnet with reason: Maintenance |
[production] |
12:17 |
<ladsgroup@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2124.codfw.wmnet with reason: Maintenance |
[production] |
12:10 |
<logmsgbot> |
@deploy2002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
12:10 |
<logmsgbot> |
@deploy2002 helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
12:02 |
<cgoubert@cumin2002> |
START - Cookbook sre.hosts.reimage for host mw2376.codfw.wmnet with OS bullseye |
[production] |
12:02 |
<cgoubert@cumin2002> |
START - Cookbook sre.hosts.reimage for host mw2375.codfw.wmnet with OS bullseye |
[production] |
12:02 |
<cgoubert@cumin2002> |
START - Cookbook sre.hosts.reimage for host mw2374.codfw.wmnet with OS bullseye |
[production] |
12:02 |
<cgoubert@cumin2002> |
START - Cookbook sre.hosts.reimage for host mw2373.codfw.wmnet with OS bullseye |
[production] |
12:02 |
<cgoubert@cumin2002> |
START - Cookbook sre.hosts.reimage for host mw2372.codfw.wmnet with OS bullseye |
[production] |
12:02 |
<cgoubert@cumin2002> |
START - Cookbook sre.hosts.reimage for host mw2371.codfw.wmnet with OS bullseye |
[production] |
12:01 |
<logmsgbot> |
@deploy2002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
12:01 |
<logmsgbot> |
@deploy2002 helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
11:59 |
<logmsgbot> |
@deploy2002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
11:59 |
<logmsgbot> |
@deploy2002 helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
11:54 |
<logmsgbot> |
@deploy2002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
11:54 |
<logmsgbot> |
@deploy2002 helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
11:53 |
<moritzm> |
restarting Exim on the MXes to pick up new GNU TLS |
[production] |
11:52 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ldap.roll-restart-reboot-replica (exit_code=0) rolling restart_daemons on A:ldap-replicas-eqiad |
[production] |
11:51 |
<jmm@cumin2002> |
START - Cookbook sre.ldap.roll-restart-reboot-replica rolling restart_daemons on A:ldap-replicas-eqiad |
[production] |
11:48 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ldap.roll-restart-reboot-replica (exit_code=0) rolling restart_daemons on A:ldap-replicas-codfw |
[production] |
11:47 |
<jmm@cumin2002> |
START - Cookbook sre.ldap.roll-restart-reboot-replica rolling restart_daemons on A:ldap-replicas-codfw |
[production] |
11:43 |
<logmsgbot> |
@deploy2002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
11:43 |
<logmsgbot> |
@deploy2002 helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
11:42 |
<claime> |
Depooling mw2371.codfw.wmnet,mw2372.codfw.wmnet,mw2373.codfw.wmnet,mw2374.codfw.wmnet,mw2375.codfw.wmnet,mw2376.codfw.wmnet for reimage to kubernetes - T351074 |
[production] |
11:41 |
<cgoubert@cumin2002> |
conftool action : set/weight=30; selector: cluster=api_appserver,service=canary,dc=codfw |
[production] |
11:41 |
<cgoubert@cumin2002> |
conftool action : set/pooled=yes; selector: cluster=api_appserver,service=canary,dc=codfw |
[production] |
11:40 |
<claime> |
pooling new canaries - T351074 |
[production] |
11:37 |
<claime> |
Enabling and running puppet on deployment servers - T351074 |
[production] |
11:33 |
<claime> |
Enabling and running puppet on new canaries mw2283.codfw.wmnet,mw2284.codfw.wmnet - T351074 |
[production] |
11:31 |
<logmsgbot> |
@deploy2002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
11:31 |
<logmsgbot> |
@deploy2002 helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
11:31 |
<claime> |
Disabling puppet on mw2374.codfw.wmnet,mw2376.codfw.wmnet,mw2283.codfw.wmnet,mw2284.codfw.wmnet,mw2371.codfw.wmnet,mw2372.codfw.wmnet,mw2373.codfw.wmnet,mw2375.codfw.wmnet for canary api_appserver move - T351074 |
[production] |
11:28 |
<claime> |
Disabling puppet on deployment servers for canary api_appserver move - T351074 |
[production] |