2022-11-15
ยง
|
13:29 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host ganeti4008.ulsfo.wmnet |
[production] |
13:26 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1157 (T321130)', diff saved to https://phabricator.wikimedia.org/P39725 and previous config saved to /var/cache/conftool/dbconfig/20221115-132637-marostegui.json |
[production] |
13:22 |
<sukhe> |
running homer for Gerrit: 856946 in cr*-ulsfo* |
[production] |
13:21 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1170:3312', diff saved to https://phabricator.wikimedia.org/P39724 and previous config saved to /var/cache/conftool/dbconfig/20221115-132103-marostegui.json |
[production] |
13:20 |
<hnowlan@deploy1002> |
helmfile [staging] DONE helmfile.d/services/thumbor: sync |
[production] |
13:20 |
<hnowlan@deploy1002> |
helmfile [staging] START helmfile.d/services/thumbor: sync |
[production] |
13:19 |
<ayounsi@cumin1001> |
END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'configure' for AS: 13335 |
[production] |
13:17 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depooling db1157 (T321130)', diff saved to https://phabricator.wikimedia.org/P39723 and previous config saved to /var/cache/conftool/dbconfig/20221115-131710-marostegui.json |
[production] |
13:17 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on db1157.eqiad.wmnet with reason: Maintenance |
[production] |
13:16 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 5:00:00 on db1157.eqiad.wmnet with reason: Maintenance |
[production] |
13:08 |
<moritzm> |
failover ganeti master in ulsfo to ganeti4005 |
[production] |
13:05 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1170:3312', diff saved to https://phabricator.wikimedia.org/P39722 and previous config saved to /var/cache/conftool/dbconfig/20221115-130557-marostegui.json |
[production] |
13:00 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on db1145.eqiad.wmnet with reason: Maintenance |
[production] |
12:59 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 5:00:00 on db1145.eqiad.wmnet with reason: Maintenance |
[production] |
12:59 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1112 (T321130)', diff saved to https://phabricator.wikimedia.org/P39721 and previous config saved to /var/cache/conftool/dbconfig/20221115-125950-marostegui.json |
[production] |
12:59 |
<ayounsi@cumin1001> |
START - Cookbook sre.network.peering with action 'configure' for AS: 13335 |
[production] |
12:53 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti6004.drmrs.wmnet |
[production] |
12:50 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1170:3312 (T321126)', diff saved to https://phabricator.wikimedia.org/P39720 and previous config saved to /var/cache/conftool/dbconfig/20221115-125050-marostegui.json |
[production] |
12:49 |
<aborrero@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudgw2003-dev.codfw.wmnet with OS bullseye |
[production] |
12:48 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depooling db1170:3312 (T321126)', diff saved to https://phabricator.wikimedia.org/P39719 and previous config saved to /var/cache/conftool/dbconfig/20221115-124830-marostegui.json |
[production] |
12:48 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on db1170.eqiad.wmnet with reason: Maintenance |
[production] |
12:48 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 5:00:00 on db1170.eqiad.wmnet with reason: Maintenance |
[production] |
12:48 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1162 (T321126)', diff saved to https://phabricator.wikimedia.org/P39718 and previous config saved to /var/cache/conftool/dbconfig/20221115-124808-marostegui.json |
[production] |
12:47 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host ganeti6004.drmrs.wmnet |
[production] |
12:45 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4 days, 0:00:00 on idp-test1002.wikimedia.org with reason: experiment with CAS 6.6 |
[production] |
12:45 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.downtime for 4 days, 0:00:00 on idp-test1002.wikimedia.org with reason: experiment with CAS 6.6 |
[production] |
12:44 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1112', diff saved to https://phabricator.wikimedia.org/P39717 and previous config saved to /var/cache/conftool/dbconfig/20221115-124443-marostegui.json |
[production] |
12:43 |
<sukhe@puppetmaster1001> |
conftool action : set/pooled=yes; selector: name=cp4052.ulsfo.wmnet,service=varnish-fe |
[production] |
12:43 |
<sukhe@puppetmaster1001> |
conftool action : set/pooled=yes; selector: name=cp4052.ulsfo.wmnet,service=ats-be |
[production] |
12:43 |
<sukhe@puppetmaster1001> |
conftool action : set/pooled=yes; selector: name=cp4052.ulsfo.wmnet,service=ats-tls |
[production] |
12:37 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2166 (re)pooling @ 100%: After upgrade', diff saved to https://phabricator.wikimedia.org/P39716 and previous config saved to /var/cache/conftool/dbconfig/20221115-123735-root.json |
[production] |
12:36 |
<aborrero@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudgw2003-dev.codfw.wmnet with reason: host reimage |
[production] |
12:33 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2162 (re)pooling @ 100%: After upgrade', diff saved to https://phabricator.wikimedia.org/P39715 and previous config saved to /var/cache/conftool/dbconfig/20221115-123326-root.json |
[production] |
12:33 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1162', diff saved to https://phabricator.wikimedia.org/P39714 and previous config saved to /var/cache/conftool/dbconfig/20221115-123302-marostegui.json |
[production] |
12:31 |
<aborrero@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cloudgw2003-dev.codfw.wmnet with reason: host reimage |
[production] |
12:29 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1112', diff saved to https://phabricator.wikimedia.org/P39713 and previous config saved to /var/cache/conftool/dbconfig/20221115-122937-marostegui.json |
[production] |
12:25 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti6002.drmrs.wmnet |
[production] |
12:22 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2166 (re)pooling @ 75%: After upgrade', diff saved to https://phabricator.wikimedia.org/P39712 and previous config saved to /var/cache/conftool/dbconfig/20221115-122230-root.json |
[production] |
12:18 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host ganeti6002.drmrs.wmnet |
[production] |
12:18 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2162 (re)pooling @ 75%: After upgrade', diff saved to https://phabricator.wikimedia.org/P39711 and previous config saved to /var/cache/conftool/dbconfig/20221115-121821-root.json |
[production] |
12:17 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1162', diff saved to https://phabricator.wikimedia.org/P39710 and previous config saved to /var/cache/conftool/dbconfig/20221115-121755-marostegui.json |
[production] |
12:16 |
<aborrero@cumin1001> |
START - Cookbook sre.hosts.reimage for host cloudgw2003-dev.codfw.wmnet with OS bullseye |
[production] |
12:14 |
<hnowlan@deploy1002> |
helmfile [staging] DONE helmfile.d/services/thumbor: sync |
[production] |
12:14 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1112 (T321130)', diff saved to https://phabricator.wikimedia.org/P39709 and previous config saved to /var/cache/conftool/dbconfig/20221115-121431-marostegui.json |
[production] |
12:13 |
<hnowlan@deploy1002> |
helmfile [staging] START helmfile.d/services/thumbor: sync |
[production] |
12:11 |
<hnowlan> |
resyncing maps2005 replica |
[production] |
12:11 |
<hnowlan@cumin1001> |
START - Cookbook sre.postgresql.postgres-init |
[production] |
12:07 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2166 (re)pooling @ 50%: After upgrade', diff saved to https://phabricator.wikimedia.org/P39708 and previous config saved to /var/cache/conftool/dbconfig/20221115-120725-root.json |
[production] |
12:05 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depooling db1112 (T321130)', diff saved to https://phabricator.wikimedia.org/P39707 and previous config saved to /var/cache/conftool/dbconfig/20221115-120502-marostegui.json |
[production] |
12:04 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 10:00:00 on clouddb[1013,1017,1021].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance |
[production] |