2351-2400 of 10000 results (37ms)
2021-07-20 ยง
15:48 <oblivian@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
15:23 <vgutierrez> pool dns1002 - T286069 [production]
15:21 <vgutierrez> pool cp[1087-1090].eqiad.wmnet - T286069 [production]
15:19 <jmm@puppetmaster1001> conftool action : set/pooled=yes; selector: name=ldap-replica1004.wikimedia.org [production]
15:14 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=mw1297.eqiad.wmnet [production]
15:14 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=mw1290.eqiad.wmnet [production]
15:14 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=mw1289.eqiad.wmnet [production]
15:06 <kormat@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on 12 hosts with reason: Deploying schema change to s3 T281058 [production]
15:06 <kormat@cumin1001> START - Cookbook sre.hosts.downtime for 4:00:00 on 12 hosts with reason: Deploying schema change to s3 T281058 [production]
14:53 <urbanecm> Start server-side upload for 7 large PNG files (T285708) [production]
14:51 <herron> depooled and scheduled downtime for kafka-main100[45] [production]
14:51 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on lvs1016.eqiad.wmnet with reason: eqiad row D maintenance [production]
14:50 <vgutierrez@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on lvs1016.eqiad.wmnet with reason: eqiad row D maintenance [production]
14:48 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on dns1002.wikimedia.org with reason: eqiad row D maintenance [production]
14:48 <vgutierrez@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on dns1002.wikimedia.org with reason: eqiad row D maintenance [production]
14:46 <vgutierrez> depool dns1002 - T286069 [production]
14:40 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on cp[1087-1090].eqiad.wmnet with reason: eqiad row D maintenance [production]
14:40 <vgutierrez@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on cp[1087-1090].eqiad.wmnet with reason: eqiad row D maintenance [production]
14:36 <vgutierrez> depool cp[1087-1090].eqiad.wmnet - T286069 [production]
14:30 <kormat@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on 18 hosts with reason: Deploying schema change to s8 T281058 [production]
14:30 <kormat@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on 18 hosts with reason: Deploying schema change to s8 T281058 [production]
14:25 <jayme@deploy1002> helmfile [staging-eqiad] DONE helmfile.d/admin 'sync'. [production]
14:25 <jayme@deploy1002> helmfile [staging-eqiad] START helmfile.d/admin 'sync'. [production]
14:22 <jayme@deploy1002> helmfile [staging-codfw] DONE helmfile.d/admin 'sync'. [production]
14:21 <jayme@deploy1002> helmfile [staging-codfw] START helmfile.d/admin 'sync'. [production]
14:12 <kormat@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on 18 hosts with reason: Deploying schema change to s4 T281058 [production]
14:12 <kormat@cumin1001> START - Cookbook sre.hosts.downtime for 4:00:00 on 18 hosts with reason: Deploying schema change to s4 T281058 [production]
14:09 <jiji@deploy1002> helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. [production]
14:08 <jiji@deploy1002> helmfile [staging-codfw] START helmfile.d/admin 'apply'. [production]
14:03 <hnowlan@puppetmaster1001> conftool action : set/pooled=yes; selector: name=maps2009.codfw.wmnet [production]
14:00 <jgiannelos@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'tegola-vector-tiles' for release 'main' . [production]
13:56 <hnowlan@puppetmaster1001> conftool action : set/pooled=yes; selector: name=maps2008.codfw.wmnet [production]
13:50 <kormat@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 15 hosts with reason: Deploying schema change to s7 T281058 [production]
13:50 <kormat@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on 15 hosts with reason: Deploying schema change to s7 T281058 [production]
13:45 <hnowlan@puppetmaster1001> conftool action : set/pooled=false; selector: dnsdisc=kartotherian,name=eqiad [production]
13:45 <hnowlan@puppetmaster1001> conftool action : set/pooled=true; selector: dnsdisc=kartotherian,name=codfw [production]
13:43 <hnowlan@puppetmaster1001> conftool action : set/pooled=no; selector: name=maps200[89].codfw.wmnet [production]
13:30 <hnowlan@puppetmaster1001> conftool action : set/pooled=yes; selector: name=maps20(10|0[1-9]).codfw.wmnet [production]
13:25 <kormat@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 15 hosts with reason: Deploying schema change to s2 T281058 [production]
13:25 <kormat@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on 15 hosts with reason: Deploying schema change to s2 T281058 [production]
13:14 <gehel> set/pooled=inactive on elastic1039 - disk failure - T285643 [production]
13:14 <kormat@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 13 hosts with reason: Deploying schema change to s5 T281058 [production]
13:14 <kormat@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on 13 hosts with reason: Deploying schema change to s5 T281058 [production]
13:13 <gehel@puppetmaster1001> conftool action : set/pooled=inactive; selector: name=elastic1039.eqiad.wmnet [production]
12:44 <moritzm> installing systemd security updates on buster [production]
12:23 <elukey> reboot ml-serve-ctrl vms to pick up new vcores settings [production]
12:22 <elukey> bump vcpus from 2 to 4 on ml-serve-ctrl VMs on Ganeti (load/cpu usage increased steadily since we deployed kubelets on them) [production]
11:58 <Lucas_WMDE> EU config+backport window done [production]
11:58 <lucaswerkmeister-wmde@deploy1002> Synchronized wmf-config/CommonSettings-labs.php: Config: [[gerrit:705505|Avoid using User::newFrom* methods]] (3/3) (duration: 00m 56s) [production]
11:58 <hnowlan@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on maps1007.eqiad.wmnet with reason: Testing impact of tilerator [production]