2025-02-06
ยง
|
12:45 |
<kamila@deploy2002> |
helmfile [eqiad] START helmfile.d/admin 'apply'. |
[production] |
12:44 |
<hnowlan@deploy1003> |
helmfile [codfw] START helmfile.d/services/changeprop: apply |
[production] |
12:44 |
<raymond-ndibe@cloudcumin1001> |
START - Cookbook wmcs.toolforge.component.deploy for component envvars-admission |
[toolsbeta] |
12:43 |
<raymond-ndibe@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component jobs-emailer |
[toolsbeta] |
12:43 |
<jmm@cumin2002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on ganeti1038.eqiad.wmnet with reason: remove from cluster for reimage |
[production] |
12:41 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1038.eqiad.wmnet |
[production] |
12:40 |
<kamila@deploy2002> |
helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
12:40 |
<kamila@deploy2002> |
helmfile [staging-eqiad] START helmfile.d/admin 'apply'. |
[production] |
12:40 |
<hnowlan@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/changeprop: apply |
[production] |
12:39 |
<hnowlan@deploy1003> |
helmfile [eqiad] START helmfile.d/services/changeprop: apply |
[production] |
12:37 |
<raymond-ndibe@cloudcumin1001> |
END (FAIL) - Cookbook wmcs.toolforge.component.deploy (exit_code=99) for component components-api |
[tools] |
12:37 |
<raymond-ndibe@cloudcumin1001> |
START - Cookbook wmcs.toolforge.component.deploy for component components-api |
[tools] |
12:37 |
<raymond-ndibe@cloudcumin1001> |
START - Cookbook wmcs.toolforge.component.deploy for component jobs-emailer |
[toolsbeta] |
12:35 |
<raymond-ndibe@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component ingress-admission |
[tools] |
12:35 |
<raymond-ndibe@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component components-api |
[toolsbeta] |
12:30 |
<ladsgroup@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1117521|Set categorylinks to write both everywhere except commonswiki (T385164)]] (duration: 11m 50s) |
[production] |
12:27 |
<moritzm> |
installing openjpeg2 security updates |
[production] |
12:26 |
<raymond-ndibe@cloudcumin1001> |
START - Cookbook wmcs.toolforge.component.deploy for component components-api |
[toolsbeta] |
12:26 |
<raymond-ndibe@cloudcumin1001> |
START - Cookbook wmcs.toolforge.component.deploy for component ingress-admission |
[tools] |
12:25 |
<raymond-ndibe@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component ingress-admission |
[toolsbeta] |
12:23 |
<ladsgroup@deploy2002> |
ladsgroup: Continuing with sync |
[production] |
12:22 |
<klausman@deploy2002> |
helmfile [ml-serve-codfw] DONE helmfile.d/admin 'apply'. |
[production] |
12:21 |
<klausman@deploy2002> |
helmfile [ml-serve-codfw] START helmfile.d/admin 'apply'. |
[production] |
12:21 |
<ladsgroup@deploy2002> |
ladsgroup: Backport for [[gerrit:1117521|Set categorylinks to write both everywhere except commonswiki (T385164)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
12:20 |
<andrewbogott> |
hard rebooted 6 workers for T385264 |
[puppet-diffs] |
12:18 |
<ladsgroup@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1117521|Set categorylinks to write both everywhere except commonswiki (T385164)]] |
[production] |
12:17 |
<raymond-ndibe@cloudcumin1001> |
START - Cookbook wmcs.toolforge.component.deploy for component ingress-admission |
[toolsbeta] |
12:11 |
<hnowlan@deploy1003> |
helmfile [staging] DONE helmfile.d/services/changeprop: apply |
[production] |
12:11 |
<hnowlan@deploy1003> |
helmfile [staging] START helmfile.d/services/changeprop: apply |
[production] |
12:07 |
<andrewbogott> |
rebooting all servers for T385264 |
[deployment-prep] |
12:07 |
<andrewbogott> |
rebooting all servers for T385264 |
[releng] |
12:06 |
<moritzm> |
installing bind9 security updates (client-side libs/tools only) |
[production] |
11:58 |
<jgiannelos@deploy2002> |
helmfile [staging] START helmfile.d/services/changeprop: apply |
[production] |
11:55 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2208 (re)pooling @ 100%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P73311 and previous config saved to /var/cache/conftool/dbconfig/20250206-115556-root.json |
[production] |
11:53 |
<klausman@deploy2002> |
helmfile [ml-staging-codfw] DONE helmfile.d/admin 'apply'. |
[production] |
11:53 |
<klausman@deploy2002> |
helmfile [ml-staging-codfw] START helmfile.d/admin 'apply'. |
[production] |
11:53 |
<klausman@deploy2002> |
helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
11:52 |
<klausman@deploy2002> |
helmfile [ml-serve-eqiad] START helmfile.d/admin 'apply'. |
[production] |
11:51 |
<fnegri@cumin1002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host clouddb1016.eqiad.wmnet |
[production] |
11:51 |
<jgiannelos@deploy2002> |
helmfile [staging] START helmfile.d/services/changeprop: apply |
[production] |
11:51 |
<jgiannelos@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/mobileapps: apply |
[production] |
11:50 |
<jgiannelos@deploy2002> |
helmfile [eqiad] START helmfile.d/services/mobileapps: apply |
[production] |
11:50 |
<jgiannelos@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/mobileapps: apply |
[production] |
11:49 |
<jgiannelos@deploy2002> |
helmfile [codfw] START helmfile.d/services/mobileapps: apply |
[production] |
11:49 |
<jgiannelos@deploy2002> |
helmfile [staging] DONE helmfile.d/services/mobileapps: apply |
[production] |
11:49 |
<jgiannelos@deploy2002> |
helmfile [staging] START helmfile.d/services/mobileapps: apply |
[production] |
11:48 |
<fnegri@cumin1002> |
START - Cookbook sre.hosts.reboot-single for host clouddb1016.eqiad.wmnet |
[production] |
11:40 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2208 (re)pooling @ 75%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P73310 and previous config saved to /var/cache/conftool/dbconfig/20250206-114051-root.json |
[production] |
11:40 |
<moritzm> |
installing iperf3 security updates |
[production] |
11:34 |
<fnegri@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on clouddb1016.eqiad.wmnet with reason: Rebooting clouddb1016 T384946 |
[production] |