2023-09-29
|
10:43 |
<jelto@cumin1001> |
START - Cookbook sre.gitlab.upgrade on GitLab host gitlab2002.wikimedia.org with reason: GitLab version upgrade |
[production] |
10:43 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2136', diff saved to https://phabricator.wikimedia.org/P52764 and previous config saved to /var/cache/conftool/dbconfig/20230929-104318-arnaudb.json |
[production] |
10:35 |
<jiji@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/push-notifications: apply |
[production] |
10:35 |
<jiji@deploy2002> |
helmfile [eqiad] START helmfile.d/services/push-notifications: apply |
[production] |
10:28 |
<arnaudb@cumin1001> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=93) for host backup1010.eqiad.wmnet with OS bookworm |
[production] |
10:28 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2136 (T343198)', diff saved to https://phabricator.wikimedia.org/P52763 and previous config saved to /var/cache/conftool/dbconfig/20230929-102812-arnaudb.json |
[production] |
10:19 |
<jiji@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/machinetranslation: apply |
[production] |
10:19 |
<jiji@deploy2002> |
helmfile [codfw] START helmfile.d/services/machinetranslation: apply |
[production] |
10:18 |
<jiji@deploy2002> |
helmfile [staging] DONE helmfile.d/services/machinetranslation: apply |
[production] |
10:18 |
<jiji@deploy2002> |
helmfile [staging] START helmfile.d/services/machinetranslation: apply |
[production] |
10:09 |
<jiji@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/push-notifications: apply |
[production] |
10:09 |
<elukey@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/mw-page-content-change-enrich: sync |
[production] |
10:09 |
<elukey@deploy2002> |
helmfile [codfw] START helmfile.d/services/mw-page-content-change-enrich: sync |
[production] |
10:09 |
<jiji@deploy2002> |
helmfile [codfw] START helmfile.d/services/push-notifications: apply |
[production] |
09:33 |
<jgiannelos@deploy2002> |
helmfile [staging] START helmfile.d/services/wikifeeds: apply |
[production] |
09:08 |
<elukey@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/mw-page-content-change-enrich: sync |
[production] |
09:08 |
<elukey@deploy2002> |
helmfile [eqiad] START helmfile.d/services/mw-page-content-change-enrich: sync |
[production] |
05:32 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Depooling db2136 (T343198)', diff saved to https://phabricator.wikimedia.org/P52760 and previous config saved to /var/cache/conftool/dbconfig/20230929-053158-arnaudb.json |
[production] |
05:31 |
<arnaudb@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2136.codfw.wmnet with reason: Maintenance |
[production] |
05:31 |
<arnaudb@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2136.codfw.wmnet with reason: Maintenance |
[production] |
05:31 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2119 (T343198)', diff saved to https://phabricator.wikimedia.org/P52759 and previous config saved to /var/cache/conftool/dbconfig/20230929-053136-arnaudb.json |
[production] |
05:16 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2119', diff saved to https://phabricator.wikimedia.org/P52758 and previous config saved to /var/cache/conftool/dbconfig/20230929-051630-arnaudb.json |
[production] |
05:01 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2119', diff saved to https://phabricator.wikimedia.org/P52757 and previous config saved to /var/cache/conftool/dbconfig/20230929-050123-arnaudb.json |
[production] |
04:46 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2119 (T343198)', diff saved to https://phabricator.wikimedia.org/P52756 and previous config saved to /var/cache/conftool/dbconfig/20230929-044617-arnaudb.json |
[production] |
02:57 |
<andrew@cumin1001> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cloudservices2005-dev.codfw.wmnet with OS bookworm |
[production] |
01:48 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2180 (T343198)', diff saved to https://phabricator.wikimedia.org/P52755 and previous config saved to /var/cache/conftool/dbconfig/20230929-014825-arnaudb.json |
[production] |
01:40 |
<ejegg> |
payments-wiki upgraded from c4c9b938 to d6ad0376 |
[production] |
01:33 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2180', diff saved to https://phabricator.wikimedia.org/P52754 and previous config saved to /var/cache/conftool/dbconfig/20230929-013319-arnaudb.json |
[production] |
01:18 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2180', diff saved to https://phabricator.wikimedia.org/P52753 and previous config saved to /var/cache/conftool/dbconfig/20230929-011813-arnaudb.json |
[production] |
01:03 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2180 (T343198)', diff saved to https://phabricator.wikimedia.org/P52752 and previous config saved to /var/cache/conftool/dbconfig/20230929-010306-arnaudb.json |
[production] |
00:26 |
<jhancock@cumin2002> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host db1227.mgmt.eqiad.wmnet with reboot policy FORCED |
[production] |
00:24 |
<jhancock@cumin2002> |
START - Cookbook sre.hosts.provision for host db1227.mgmt.eqiad.wmnet with reboot policy FORCED |
[production] |
2023-09-28
|
23:50 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Depooling db2180 (T343198)', diff saved to https://phabricator.wikimedia.org/P52751 and previous config saved to /var/cache/conftool/dbconfig/20230928-235053-arnaudb.json |
[production] |
23:50 |
<arnaudb@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2180.codfw.wmnet with reason: Maintenance |
[production] |
23:50 |
<arnaudb@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2180.codfw.wmnet with reason: Maintenance |
[production] |
23:50 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2171:3316 (T343198)', diff saved to https://phabricator.wikimedia.org/P52750 and previous config saved to /var/cache/conftool/dbconfig/20230928-235032-arnaudb.json |
[production] |
23:42 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Depooling db2119 (T343198)', diff saved to https://phabricator.wikimedia.org/P52749 and previous config saved to /var/cache/conftool/dbconfig/20230928-234246-arnaudb.json |
[production] |
23:42 |
<arnaudb@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2119.codfw.wmnet with reason: Maintenance |
[production] |
23:42 |
<arnaudb@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2119.codfw.wmnet with reason: Maintenance |
[production] |
23:42 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2110 (T343198)', diff saved to https://phabricator.wikimedia.org/P52748 and previous config saved to /var/cache/conftool/dbconfig/20230928-234224-arnaudb.json |
[production] |
23:35 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2171:3316', diff saved to https://phabricator.wikimedia.org/P52747 and previous config saved to /var/cache/conftool/dbconfig/20230928-233525-arnaudb.json |
[production] |
23:27 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2110', diff saved to https://phabricator.wikimedia.org/P52746 and previous config saved to /var/cache/conftool/dbconfig/20230928-232718-arnaudb.json |
[production] |
23:20 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2171:3316', diff saved to https://phabricator.wikimedia.org/P52745 and previous config saved to /var/cache/conftool/dbconfig/20230928-232019-arnaudb.json |
[production] |
23:12 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2110', diff saved to https://phabricator.wikimedia.org/P52744 and previous config saved to /var/cache/conftool/dbconfig/20230928-231211-arnaudb.json |
[production] |
23:05 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2171:3316 (T343198)', diff saved to https://phabricator.wikimedia.org/P52743 and previous config saved to /var/cache/conftool/dbconfig/20230928-230512-arnaudb.json |
[production] |
22:57 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2110 (T343198)', diff saved to https://phabricator.wikimedia.org/P52742 and previous config saved to /var/cache/conftool/dbconfig/20230928-225705-arnaudb.json |
[production] |
22:40 |
<wfan> |
payments-wiki change from c4c9b938 to 20828b07 |
[production] |
22:03 |
<andrew@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudservices2005-dev.codfw.wmnet with reason: host reimage |
[production] |
22:02 |
<bking@cumin1001> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cloudelastic1007.eqiad.wmnet with OS bullseye |
[production] |
22:00 |
<andrew@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cloudservices2005-dev.codfw.wmnet with reason: host reimage |
[production] |