|
2023-11-16
§
|
| 09:00 |
<godog> |
bounce prometheus instances on prometheus2006 to test p7 upgrade |
[production] |
| 08:59 |
<jmm@cumin2002> |
START - Cookbook sre.puppet.migrate-role for role: kubernetes::worker |
[production] |
| 08:42 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: thanos::frontend |
[production] |
| 08:37 |
<kharlan@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/ipoid: apply |
[production] |
| 08:37 |
<kharlan@deploy2002> |
helmfile [eqiad] START helmfile.d/services/ipoid: apply |
[production] |
| 08:34 |
<moritzm> |
installing ruby-rails-html-sanitizer security updates |
[production] |
| 08:30 |
<jmm@cumin2002> |
START - Cookbook sre.puppet.migrate-role for role: thanos::frontend |
[production] |
| 08:25 |
<taavi@cumin1001> |
END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host clouddumps1001.wikimedia.org |
[production] |
| 08:22 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host prometheus2006.codfw.wmnet |
[production] |
| 08:19 |
<taavi@cumin1001> |
START - Cookbook sre.puppet.migrate-host for host clouddumps1001.wikimedia.org |
[production] |
| 08:18 |
<taavi@cumin1001> |
END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host cloudcumin2001.codfw.wmnet |
[production] |
| 08:17 |
<moritzm> |
installing elfutils security updates |
[production] |
| 08:12 |
<taavi@cumin1001> |
START - Cookbook sre.puppet.migrate-host for host cloudcumin2001.codfw.wmnet |
[production] |
| 08:09 |
<moritzm> |
installing python-git security updates |
[production] |
| 08:07 |
<jmm@cumin2002> |
START - Cookbook sre.puppet.migrate-host for host prometheus2006.codfw.wmnet |
[production] |
| 08:03 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host ncredir4001.ulsfo.wmnet |
[production] |
| 07:54 |
<jmm@cumin2002> |
START - Cookbook sre.puppet.migrate-host for host ncredir4001.ulsfo.wmnet |
[production] |
| 07:42 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: prometheus::pop |
[production] |
| 07:30 |
<jmm@cumin2002> |
START - Cookbook sre.puppet.migrate-role for role: prometheus::pop |
[production] |
| 06:30 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db[2132,2160].codfw.wmnet,db[1119,1164,1217].eqiad.wmnet with reason: Switch |
[production] |
| 06:30 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 1:00:00 on db[2132,2160].codfw.wmnet,db[1119,1164,1217].eqiad.wmnet with reason: Switch |
[production] |
| 06:07 |
<cmooney@cumin1001> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest2004.codfw.wmnet with OS bullseye |
[production] |
| 05:48 |
<ryankemper@cumin1001> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudelastic1007.wikimedia.org with OS bullseye |
[production] |
| 05:36 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Depooling db1143 (T348183)', diff saved to https://phabricator.wikimedia.org/P53499 and previous config saved to /var/cache/conftool/dbconfig/20231116-053616-arnaudb.json |
[production] |
| 05:36 |
<arnaudb@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1143.eqiad.wmnet with reason: Maintenance |
[production] |
| 05:36 |
<arnaudb@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1143.eqiad.wmnet with reason: Maintenance |
[production] |
| 05:35 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1142 (T348183)', diff saved to https://phabricator.wikimedia.org/P53498 and previous config saved to /var/cache/conftool/dbconfig/20231116-053554-arnaudb.json |
[production] |
| 05:20 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1142', diff saved to https://phabricator.wikimedia.org/P53497 and previous config saved to /var/cache/conftool/dbconfig/20231116-052048-arnaudb.json |
[production] |
| 05:05 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1142', diff saved to https://phabricator.wikimedia.org/P53496 and previous config saved to /var/cache/conftool/dbconfig/20231116-050542-arnaudb.json |
[production] |
| 04:57 |
<andrew@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcontrol2001-dev.codfw.wmnet with OS bookworm |
[production] |
| 04:50 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1142 (T348183)', diff saved to https://phabricator.wikimedia.org/P53495 and previous config saved to /var/cache/conftool/dbconfig/20231116-045035-arnaudb.json |
[production] |
| 04:38 |
<cstone> |
payments-wiki upgraded from 6affb60a to eae2f35e |
[production] |
| 04:30 |
<cstone> |
payments-wiki upgraded from 084370bb to 6affb60a |
[production] |
| 04:24 |
<ryankemper@cumin1001> |
START - Cookbook sre.hosts.reimage for host cloudelastic1007.wikimedia.org with OS bullseye |
[production] |
| 03:44 |
<andrew@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcontrol2001-dev.codfw.wmnet with reason: host reimage |
[production] |
| 03:40 |
<andrew@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcontrol2001-dev.codfw.wmnet with reason: host reimage |
[production] |
| 03:40 |
<ejegg> |
fundraising civicrm upgraded from 6e53198c to 32679ea3 |
[production] |
| 03:19 |
<andrew@cumin1001> |
START - Cookbook sre.hosts.reimage for host cloudcontrol2001-dev.codfw.wmnet with OS bookworm |
[production] |
| 01:53 |
<cstone> |
payments-wiki upgraded from b4465e23 to 084370bb |
[production] |
| 01:34 |
<eileen> |
civicrm upgraded from ec6992e0 to 6e53198c |
[production] |
| 00:27 |
<ryankemper@cumin1001> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudelastic1008.wikimedia.org with OS bullseye |
[production] |
|
2023-11-15
§
|
| 23:50 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Depooling db1142 (T348183)', diff saved to https://phabricator.wikimedia.org/P53494 and previous config saved to /var/cache/conftool/dbconfig/20231115-235044-arnaudb.json |
[production] |
| 23:50 |
<arnaudb@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1142.eqiad.wmnet with reason: Maintenance |
[production] |
| 23:50 |
<arnaudb@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1142.eqiad.wmnet with reason: Maintenance |
[production] |
| 23:50 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1141 (T348183)', diff saved to https://phabricator.wikimedia.org/P53493 and previous config saved to /var/cache/conftool/dbconfig/20231115-235023-arnaudb.json |
[production] |
| 23:35 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1141', diff saved to https://phabricator.wikimedia.org/P53492 and previous config saved to /var/cache/conftool/dbconfig/20231115-233516-arnaudb.json |
[production] |
| 23:20 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1141', diff saved to https://phabricator.wikimedia.org/P53491 and previous config saved to /var/cache/conftool/dbconfig/20231115-232010-arnaudb.json |
[production] |
| 23:05 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1141 (T348183)', diff saved to https://phabricator.wikimedia.org/P53490 and previous config saved to /var/cache/conftool/dbconfig/20231115-230504-arnaudb.json |
[production] |
| 23:04 |
<ryankemper@cumin1001> |
START - Cookbook sre.hosts.reimage for host cloudelastic1008.wikimedia.org with OS bullseye |
[production] |
| 22:59 |
<bking@cumin2002> |
END (FAIL) - Cookbook sre.puppet.renew-cert (exit_code=99) for cloudelastic1007.wikimedia.org: Renew puppet certificate - bking@cumin2002 |
[production] |