2024-04-08
|
08:24 |
<brouberol@deploy1002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/superset-next: apply |
[production] |
08:23 |
<brouberol@deploy1002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/superset-next: apply |
[production] |
08:13 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Bump db2112 weight T361786', diff saved to https://phabricator.wikimedia.org/P59784 and previous config saved to /var/cache/conftool/dbconfig/20240408-081320-arnaudb.json |
[production] |
08:12 |
<kartik@deploy1002> |
kartik: Continuing with sync |
[production] |
08:12 |
<volans> |
restarted stashbot, which had died a few minutes ago |
[production] |
08:09 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Promote db2203 to s1 primary T361786', diff saved to https://phabricator.wikimedia.org/P59783 and previous config saved to /var/cache/conftool/dbconfig/20240408-080910-arnaudb.json |
[production] |
08:08 |
<arnaudb> |
Starting s1 codfw failover from db2112 to db2203 - T361786 |
[production] |
08:08 |
<kartik@deploy1002> |
Started scap: Backport for [[gerrit:1017528|Enable the unified dashboard on the test instance for all languages (T360607)]] |
[production] |
07:57 |
<filippo@deploy1002> |
helmfile [aux-k8s-eqiad] DONE helmfile.d/aus-k8s-eqiad-services/jaeger: apply |
[production] |
07:57 |
<jayme@deploy1002> |
helmfile [eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
07:56 |
<filippo@deploy1002> |
helmfile [aux-k8s-eqiad] START helmfile.d/aus-k8s-eqiad-services/jaeger: apply |
[production] |
07:56 |
<jayme@deploy1002> |
helmfile [eqiad] START helmfile.d/admin 'apply'. |
[production] |
07:56 |
<jayme@deploy1002> |
helmfile [codfw] DONE helmfile.d/admin 'apply'. |
[production] |
07:55 |
<jayme@deploy1002> |
helmfile [codfw] START helmfile.d/admin 'apply'. |
[production] |
07:48 |
<kartik@deploy1002> |
Finished scap: Backport for [[gerrit:1017268|Add Kartographer Parsoid support to hewikivoyage (T342871 T361025)]] (duration: 35m 43s) |
[production] |
07:47 |
<moritzm> |
installing util-linux security updates on bullseye/bookworm |
[production] |
07:44 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1156 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P59782 and previous config saved to /var/cache/conftool/dbconfig/20240408-074448-root.json |
[production] |
07:40 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Set db2203 with weight 0 T361786', diff saved to https://phabricator.wikimedia.org/P59781 and previous config saved to /var/cache/conftool/dbconfig/20240408-074006-arnaudb.json |
[production] |
07:39 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 37 hosts with reason: Primary switchover s1 T361786 |
[production] |
07:38 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 1:00:00 on 37 hosts with reason: Primary switchover s1 T361786 |
[production] |
07:35 |
<arnaudb@cumin1002> |
START - Cookbook sre.mysql.clone Will create a clone of db2114.codfw.wmnet onto db2214.codfw.wmnet |
[production] |
07:35 |
<kartik@deploy1002> |
kartik and ihurbain: Continuing with sync |
[production] |
07:32 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Cloning db2114 in db2214 for T355422', diff saved to https://phabricator.wikimedia.org/P59780 and previous config saved to /var/cache/conftool/dbconfig/20240408-073239-arnaudb.json |
[production] |
07:32 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2214.codfw.wmnet with reason: provisionning db2214.codfw.wmnet - T355422 |
[production] |
07:32 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2214.codfw.wmnet with reason: provisionning db2214.codfw.wmnet - T355422 |
[production] |
07:32 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2114.codfw.wmnet with reason: provisionning db2214.codfw.wmnet - T355422 |
[production] |
07:31 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2114.codfw.wmnet with reason: provisionning db2214.codfw.wmnet - T355422 |
[production] |
07:29 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1156 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P59779 and previous config saved to /var/cache/conftool/dbconfig/20240408-072942-root.json |
[production] |
07:25 |
<kartik@deploy1002> |
kartik and ihurbain: Backport for [[gerrit:1017268|Add Kartographer Parsoid support to hewikivoyage (T342871 T361025)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
07:14 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1156 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P59778 and previous config saved to /var/cache/conftool/dbconfig/20240408-071436-root.json |
[production] |
07:12 |
<kartik@deploy1002> |
Started scap: Backport for [[gerrit:1017268|Add Kartographer Parsoid support to hewikivoyage (T342871 T361025)]] |
[production] |
06:59 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1156 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P59777 and previous config saved to /var/cache/conftool/dbconfig/20240408-065931-root.json |
[production] |
06:44 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1156 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P59776 and previous config saved to /var/cache/conftool/dbconfig/20240408-064424-root.json |
[production] |
06:29 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1156 (re)pooling @ 5%: Repooling', diff saved to https://phabricator.wikimedia.org/P59775 and previous config saved to /var/cache/conftool/dbconfig/20240408-062919-root.json |
[production] |
06:14 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1156 (re)pooling @ 1%: Repooling', diff saved to https://phabricator.wikimedia.org/P59774 and previous config saved to /var/cache/conftool/dbconfig/20240408-061413-root.json |
[production] |
06:05 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool db1156', diff saved to https://phabricator.wikimedia.org/P59773 and previous config saved to /var/cache/conftool/dbconfig/20240408-060554-root.json |
[production] |
04:02 |
<denisse> |
Cleaning Prometheus and Thanos-BE log gzips older than 45 days on centrallog2002 |
[production] |
04:01 |
<denisse> |
Cleaning Prometheus and Thanos-BE log gzips older than 45 days on centrallog1002 |
[production] |
2024-04-06
|
15:33 |
<jhathaway@cumin2002> |
END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM mx-out2001.wikimedia.org |
[production] |
15:33 |
<jhathaway@cumin2002> |
START - Cookbook sre.ganeti.reboot-vm for VM mx-out2001.wikimedia.org |
[production] |
03:41 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2198.codfw.wmnet with reason: Maintenance |
[production] |
03:41 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 12:00:00 on db2198.codfw.wmnet with reason: Maintenance |
[production] |
03:41 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2195 (T360332)', diff saved to https://phabricator.wikimedia.org/P59763 and previous config saved to /var/cache/conftool/dbconfig/20240406-034152-arnaudb.json |
[production] |
03:26 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2195', diff saved to https://phabricator.wikimedia.org/P59762 and previous config saved to /var/cache/conftool/dbconfig/20240406-032644-arnaudb.json |
[production] |
03:11 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2195', diff saved to https://phabricator.wikimedia.org/P59761 and previous config saved to /var/cache/conftool/dbconfig/20240406-031136-arnaudb.json |
[production] |
02:56 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2195 (T360332)', diff saved to https://phabricator.wikimedia.org/P59760 and previous config saved to /var/cache/conftool/dbconfig/20240406-025629-arnaudb.json |
[production] |
02:54 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Depooling db2195 (T360332)', diff saved to https://phabricator.wikimedia.org/P59759 and previous config saved to /var/cache/conftool/dbconfig/20240406-025411-arnaudb.json |
[production] |
02:54 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2195.codfw.wmnet with reason: Maintenance |
[production] |
02:53 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 12:00:00 on db2195.codfw.wmnet with reason: Maintenance |
[production] |
02:53 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2181 (T360332)', diff saved to https://phabricator.wikimedia.org/P59758 and previous config saved to /var/cache/conftool/dbconfig/20240406-025348-arnaudb.json |
[production] |