2024-07-18
|
11:03 |
<brouberol@deploy1002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s_services/services/datahub-next: apply on staging |
[production] |
10:54 |
<cmooney@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
10:38 |
<cgoubert@cumin1002> |
START - Cookbook sre.hosts.reimage for host mw2432.codfw.wmnet with OS buster |
[production] |
10:28 |
<cgoubert@cumin1002> |
END (ERROR) - Cookbook sre.hosts.convert-disks (exit_code=97) for host mw2432 |
[production] |
10:17 |
<cgoubert@cumin1002> |
START - Cookbook sre.hosts.convert-disks for host mw2432 |
[production] |
10:08 |
<cgoubert@cumin1002> |
END (FAIL) - Cookbook sre.hosts.convert-disks (exit_code=99) for host mw2432 |
[production] |
10:04 |
<cgoubert@cumin1002> |
START - Cookbook sre.hosts.convert-disks for host mw2432 |
[production] |
09:56 |
<cgoubert@cumin1002> |
END (FAIL) - Cookbook sre.hosts.convert-disks (exit_code=99) for host mw2432 |
[production] |
09:52 |
<cgoubert@cumin1002> |
START - Cookbook sre.hosts.convert-disks for host mw2432 |
[production] |
09:46 |
<kevinbazira@deploy1002> |
helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'articletopic-outlink' for release 'main' . |
[production] |
09:46 |
<kevinbazira@deploy1002> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'articletopic-outlink' for release 'main' . |
[production] |
09:44 |
<elukey> |
upgrade spicerack to 8.8.0 on cumin2002 - testing the new release |
[production] |
09:43 |
<brouberol@deploy1002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s_services/services/datahub-next: sync on staging |
[production] |
09:26 |
<elukey> |
uploaded spicerack_8.8.0 to apt.wikimedia.org bullseye-wikimedia |
[production] |
09:26 |
<brouberol@deploy1002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s_services/services/datahub-next: apply on staging |
[production] |
09:08 |
<btullis> |
disabled check-private-data.timer on clouddb1021, pending decom. |
[production] |
09:06 |
<dcausse@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/rdf-streaming-updater: apply |
[production] |
09:06 |
<dcausse@deploy1002> |
helmfile [eqiad] START helmfile.d/services/rdf-streaming-updater: apply |
[production] |
09:02 |
<dcausse@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/rdf-streaming-updater: apply |
[production] |
09:02 |
<dcausse@deploy1002> |
helmfile [codfw] START helmfile.d/services/rdf-streaming-updater: apply |
[production] |
08:56 |
<dcausse@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/rdf-streaming-updater: apply |
[production] |
08:55 |
<dcausse@deploy1002> |
helmfile [codfw] START helmfile.d/services/rdf-streaming-updater: apply |
[production] |
08:51 |
<dcausse@deploy1002> |
helmfile [staging] DONE helmfile.d/services/rdf-streaming-updater: apply |
[production] |
08:51 |
<dcausse@deploy1002> |
helmfile [staging] START helmfile.d/services/rdf-streaming-updater: apply |
[production] |
08:47 |
<dcausse@deploy1002> |
helmfile [staging] DONE helmfile.d/services/rdf-streaming-updater: apply |
[production] |
08:47 |
<dcausse@deploy1002> |
helmfile [staging] START helmfile.d/services/rdf-streaming-updater: apply |
[production] |
08:13 |
<aklapper@deploy1002> |
rebuilt and synchronized wikiversions files: group1 wikis to 1.43.0-wmf.14 refs T366959 |
[production] |
04:38 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db2155 (T367856)', diff saved to https://phabricator.wikimedia.org/P66806 and previous config saved to /var/cache/conftool/dbconfig/20240718-043817-marostegui.json |
[production] |
04:38 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on db2187.codfw.wmnet with reason: Maintenance |
[production] |
04:37 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on db2187.codfw.wmnet with reason: Maintenance |
[production] |
04:37 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2155.codfw.wmnet with reason: Maintenance |
[production] |
04:37 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2155.codfw.wmnet with reason: Maintenance |
[production] |
04:37 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2147 (T367856)', diff saved to https://phabricator.wikimedia.org/P66805 and previous config saved to /var/cache/conftool/dbconfig/20240718-043739-marostegui.json |
[production] |
04:22 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2147', diff saved to https://phabricator.wikimedia.org/P66804 and previous config saved to /var/cache/conftool/dbconfig/20240718-042232-marostegui.json |
[production] |
04:07 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2147', diff saved to https://phabricator.wikimedia.org/P66803 and previous config saved to /var/cache/conftool/dbconfig/20240718-040725-marostegui.json |
[production] |
03:52 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2147 (T367856)', diff saved to https://phabricator.wikimedia.org/P66802 and previous config saved to /var/cache/conftool/dbconfig/20240718-035218-marostegui.json |
[production] |
00:35 |
<ryankemper@cumin2002> |
END (PASS) - Cookbook sre.elasticsearch.ban (exit_code=0) Banning hosts: elastic110[0-2]* for row maint - ryankemper@cumin2002 - T348977 |
[production] |
00:35 |
<ryankemper@cumin2002> |
START - Cookbook sre.elasticsearch.ban Banning hosts: elastic110[0-2]* for row maint - ryankemper@cumin2002 - T348977 |
[production] |
00:05 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2220 (T367781)', diff saved to https://phabricator.wikimedia.org/P66801 and previous config saved to /var/cache/conftool/dbconfig/20240718-000500-arnaudb.json |
[production] |
2024-07-17
|
23:50 |
<mutante> |
phabricator (phab1004) - deployed gerrit:1054907 ; restarted apache |
[production] |
23:49 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2220', diff saved to https://phabricator.wikimedia.org/P66800 and previous config saved to /var/cache/conftool/dbconfig/20240717-234953-arnaudb.json |
[production] |
23:34 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2220', diff saved to https://phabricator.wikimedia.org/P66799 and previous config saved to /var/cache/conftool/dbconfig/20240717-233446-arnaudb.json |
[production] |
23:19 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2220 (T367781)', diff saved to https://phabricator.wikimedia.org/P66798 and previous config saved to /var/cache/conftool/dbconfig/20240717-231939-arnaudb.json |
[production] |
23:16 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Depooling db2220 (T367781)', diff saved to https://phabricator.wikimedia.org/P66797 and previous config saved to /var/cache/conftool/dbconfig/20240717-231612-arnaudb.json |
[production] |
23:16 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2220.codfw.wmnet with reason: Maintenance |
[production] |
23:16 |
<jclark@cumin1002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
23:15 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 4:00:00 on db2220.codfw.wmnet with reason: Maintenance |
[production] |
23:15 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2208 (T367781)', diff saved to https://phabricator.wikimedia.org/P66796 and previous config saved to /var/cache/conftool/dbconfig/20240717-231550-arnaudb.json |
[production] |
23:14 |
<jclark@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
23:13 |
<jclark@cumin1002> |
END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host cloudcephmon1006 |
[production] |