2025-04-23
ยง
|
10:41 |
<jgiannelos@deploy1003> |
helmfile [codfw] DONE helmfile.d/services/mobileapps: apply |
[production] |
10:40 |
<jgiannelos@deploy1003> |
helmfile [codfw] START helmfile.d/services/mobileapps: apply |
[production] |
10:40 |
<jgiannelos@deploy1003> |
helmfile [staging] DONE helmfile.d/services/mobileapps: apply |
[production] |
10:40 |
<jgiannelos@deploy1003> |
helmfile [staging] START helmfile.d/services/mobileapps: apply |
[production] |
10:39 |
<hnowlan> |
migrating various minor mobileapps/PCS APIs to serve via the rest-gateway instead of restbase |
[production] |
10:27 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db1169 (T391056)', diff saved to https://phabricator.wikimedia.org/P75291 and previous config saved to /var/cache/conftool/dbconfig/20250423-102752-fceratto.json |
[production] |
10:27 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1169.eqiad.wmnet with reason: Maintenance |
[production] |
10:06 |
<aborrero@cumin1002> |
START - Cookbook sre.hosts.reboot-single for host cloudgw1004.eqiad.wmnet |
[production] |
09:57 |
<cmooney@cumin1002> |
END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) 3.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.3.0.e.f.0.0.0.a.0.8.c.e.2.0.a.2.ip6.arpa on all recursors |
[production] |
09:57 |
<cmooney@cumin1002> |
START - Cookbook sre.dns.wipe-cache 3.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.3.0.e.f.0.0.0.a.0.8.c.e.2.0.a.2.ip6.arpa on all recursors |
[production] |
09:57 |
<cmooney@cumin1002> |
END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) 2.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.3.0.e.f.0.0.0.a.0.8.c.e.2.0.a.2.ip6.arpa on all recursors |
[production] |
09:57 |
<cmooney@cumin1002> |
START - Cookbook sre.dns.wipe-cache 2.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.3.0.e.f.0.0.0.a.0.8.c.e.2.0.a.2.ip6.arpa on all recursors |
[production] |
09:57 |
<cmooney@cumin1002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
09:56 |
<cmooney@cumin1002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: correct dns record for cloudgw vip eqiad - cmooney@cumin1002" |
[production] |
09:56 |
<cmooney@cumin1002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: correct dns record for cloudgw vip eqiad - cmooney@cumin1002" |
[production] |
09:52 |
<cmooney@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
09:52 |
<cmooney@cumin1002> |
END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) |
[production] |
09:49 |
<cmooney@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
09:43 |
<taavi> |
updating security group rules to include IPv6 terms |
[metricsinfra] |
09:43 |
<wmbot~taavi@tools-bastion-12> |
bin/stashbot.sh restart |
[tools.stashbot] |
09:29 |
<cmooney@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
09:14 |
<arturo> |
enable IPv6 on cloudgw (T380174) -- includes server reboot |
[admin] |
09:10 |
<aborrero@cumin1002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudgw1004.eqiad.wmnet |
[production] |
09:04 |
<aborrero@cumin1002> |
START - Cookbook sre.hosts.reboot-single for host cloudgw1004.eqiad.wmnet |
[production] |
08:29 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mc-misc2001.codfw.wmnet |
[production] |
08:24 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host mc-misc2001.codfw.wmnet |
[production] |
08:18 |
<moritzm> |
installing openjpeg2 security updates |
[production] |
08:02 |
<taavi@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1137731|Add WMCS v6 range to relevant exclusions (T386689)]] (duration: 11m 58s) |
[production] |
07:56 |
<elukey@cumin1002> |
END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM ml-serve-ctrl1002.eqiad.wmnet |
[production] |
07:56 |
<taavi@deploy1003> |
taavi: Continuing with sync |
[production] |
07:55 |
<taavi@deploy1003> |
taavi: Backport for [[gerrit:1137731|Add WMCS v6 range to relevant exclusions (T386689)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
07:52 |
<elukey@cumin1002> |
START - Cookbook sre.ganeti.reboot-vm for VM ml-serve-ctrl1002.eqiad.wmnet |
[production] |
07:51 |
<elukey@cumin1002> |
END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM ml-serve-ctrl1001.eqiad.wmnet |
[production] |
07:50 |
<taavi@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1137731|Add WMCS v6 range to relevant exclusions (T386689)]] |
[production] |
07:46 |
<elukey@cumin1002> |
START - Cookbook sre.ganeti.reboot-vm for VM ml-serve-ctrl1001.eqiad.wmnet |
[production] |
07:41 |
<elukey@cumin1002> |
END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM ml-serve-ctrl2001.codfw.wmnet |
[production] |
07:36 |
<elukey@cumin1002> |
START - Cookbook sre.ganeti.reboot-vm for VM ml-serve-ctrl2001.codfw.wmnet |
[production] |
07:33 |
<elukey@cumin1002> |
END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM ml-serve-ctrl2002.codfw.wmnet |
[production] |
07:28 |
<elukey@cumin1002> |
START - Cookbook sre.ganeti.reboot-vm for VM ml-serve-ctrl2002.codfw.wmnet |
[production] |
07:28 |
<elukey> |
reboot ml-serve-ctrl* VMs to pick up new cpu/memory settings - T392289 |
[production] |
07:27 |
<elukey> |
elukey@ganeti1048:~$ sudo gnt-instance modify -B memory=6g,vcpus=4 ml-serve-ctrl1001.eqiad.wmnet - T392289 |
[production] |
07:27 |
<elukey> |
elukey@ganeti1048:~$ sudo gnt-instance modify -B memory=6g,vcpus=4 ml-serve-ctrl1002.eqiad.wmnet - T392289 |
[production] |
07:27 |
<elukey> |
elukey@ganeti2032:~$ sudo gnt-instance modify -B memory=6g,vcpus=4 ml-serve-ctrl2002.codfw.wmnet - T392289 |
[production] |
07:26 |
<elukey> |
elukey@ganeti2032:~$ sudo gnt-instance modify -B memory=6g,vcpus=4 ml-serve-ctrl2001.codfw.wmnet - T392289 |
[production] |
07:24 |
<kartik@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1136975|Add channel for ContentTranslation logging (T391311)]] (duration: 16m 53s) |
[production] |
07:19 |
<moritzm> |
installing libapache2-mod-auth-openidc security updates |
[production] |
07:17 |
<kartik@deploy1003> |
abi, kartik: Continuing with sync |
[production] |
07:12 |
<kartik@deploy1003> |
abi, kartik: Backport for [[gerrit:1136975|Add channel for ContentTranslation logging (T391311)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
07:07 |
<kartik@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1136975|Add channel for ContentTranslation logging (T391311)]] |
[production] |
07:02 |
<taavi> |
rebooting tools-mail-4 with stuck NFS handles |
[tools] |