2024-10-14
ยง
|
15:46 |
<aikochou@deploy2002> |
helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revision-models' for release 'main' . |
[production] |
15:16 |
<isaranto@deploy2002> |
helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'llm' for release 'main' . |
[production] |
15:15 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Depooling db1187 (T376905)', diff saved to https://phabricator.wikimedia.org/P69816 and previous config saved to /var/cache/conftool/dbconfig/20241014-151546-ladsgroup.json |
[production] |
15:15 |
<ladsgroup@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1187.eqiad.wmnet with reason: Maintenance |
[production] |
15:15 |
<isaranto@deploy2002> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'llm' for release 'main' . |
[production] |
15:15 |
<ladsgroup@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1187.eqiad.wmnet with reason: Maintenance |
[production] |
15:15 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1180 (T376905)', diff saved to https://phabricator.wikimedia.org/P69815 and previous config saved to /var/cache/conftool/dbconfig/20241014-151521-ladsgroup.json |
[production] |
15:07 |
<elukey@deploy2002> |
helmfile [aux-k8s-eqiad] DONE helmfile.d/admin 'sync'. |
[production] |
15:06 |
<elukey@deploy2002> |
helmfile [aux-k8s-eqiad] START helmfile.d/admin 'sync'. |
[production] |
15:05 |
<isaranto@deploy2002> |
helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'llm' for release 'main' . |
[production] |
15:00 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1180', diff saved to https://phabricator.wikimedia.org/P69814 and previous config saved to /var/cache/conftool/dbconfig/20241014-150014-ladsgroup.json |
[production] |
14:45 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1180', diff saved to https://phabricator.wikimedia.org/P69813 and previous config saved to /var/cache/conftool/dbconfig/20241014-144507-ladsgroup.json |
[production] |
14:43 |
<aikochou@deploy2002> |
helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revision-models' for release 'main' . |
[production] |
14:43 |
<jayme@deploy1003> |
helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
14:41 |
<jayme@deploy1003> |
helmfile [staging-eqiad] START helmfile.d/admin 'apply'. |
[production] |
14:41 |
<jayme@deploy1003> |
helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. |
[production] |
14:39 |
<jayme@deploy1003> |
helmfile [staging-codfw] START helmfile.d/admin 'apply'. |
[production] |
14:30 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1180 (T376905)', diff saved to https://phabricator.wikimedia.org/P69812 and previous config saved to /var/cache/conftool/dbconfig/20241014-143000-ladsgroup.json |
[production] |
14:16 |
<stevemunene@cumin1002> |
END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) for hosts an-worker1177.eqiad.wmnet |
[production] |
14:16 |
<stevemunene@cumin1002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
14:16 |
<stevemunene@cumin1002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: an-worker1177.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - stevemunene@cumin1002" |
[production] |
14:16 |
<stevemunene@cumin1002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: an-worker1177.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - stevemunene@cumin1002" |
[production] |
14:12 |
<stevemunene@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
14:12 |
<Lucas_WMDE> |
UTC afternoon backport+config window done |
[production] |
14:10 |
<Lucas_WMDE> |
[untruncated duration: 06m 48s] |
[production] |
14:09 |
<lucaswerkmeister-wmde@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1079923|refactor(tests): don't use per-method coverage annotation]], [[gerrit:1079894|refactor(HomepageHooks): extract method for simpler modifyability]], [[gerrit:1079915|Clear LinkRecommendation suggestions on page save (T364341 T372337)]], [[gerrit:1079925|Run fixLinkRecommendationData even when disabled in CC (T373176)]] (duration: 0 |
[production] |
14:07 |
<stevemunene@cumin1002> |
START - Cookbook sre.hosts.decommission for hosts an-worker1177.eqiad.wmnet |
[production] |
14:07 |
<stevemunene@cumin1002> |
END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) for hosts an-worker1176.eqiad.wmnet |
[production] |
14:07 |
<stevemunene@cumin1002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
14:07 |
<stevemunene@cumin1002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: an-worker1176.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - stevemunene@cumin1002" |
[production] |
14:06 |
<stevemunene@cumin1002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: an-worker1176.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - stevemunene@cumin1002" |
[production] |
14:04 |
<lucaswerkmeister-wmde@deploy2002> |
migr, lucaswerkmeister-wmde: Continuing with sync |
[production] |
14:04 |
<lucaswerkmeister-wmde@deploy2002> |
migr, lucaswerkmeister-wmde: Backport for [[gerrit:1079923|refactor(tests): don't use per-method coverage annotation]], [[gerrit:1079894|refactor(HomepageHooks): extract method for simpler modifyability]], [[gerrit:1079915|Clear LinkRecommendation suggestions on page save (T364341 T372337)]], [[gerrit:1079925|Run fixLinkRecommendationData even when disabled in CC (T373176)]] synced to |
[production] |
14:03 |
<stevemunene@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
14:02 |
<lucaswerkmeister-wmde@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1079923|refactor(tests): don't use per-method coverage annotation]], [[gerrit:1079894|refactor(HomepageHooks): extract method for simpler modifyability]], [[gerrit:1079915|Clear LinkRecommendation suggestions on page save (T364341 T372337)]], [[gerrit:1079925|Run fixLinkRecommendationData even when disabled in CC (T373176)]] |
[production] |
13:58 |
<stevemunene@cumin1002> |
START - Cookbook sre.hosts.decommission for hosts an-worker1176.eqiad.wmnet |
[production] |
13:46 |
<ladsgroup@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1079984|Update interwiki.php]] (duration: 07m 00s) |
[production] |
13:45 |
<kcvelaga@deploy2002> |
Finished deploy [airflow-dags/analytics_product@fbcf880]: T375480 (duration: 01m 07s) |
[production] |
13:44 |
<kcvelaga@deploy2002> |
Started deploy [airflow-dags/analytics_product@fbcf880]: T375480 |
[production] |
13:41 |
<ladsgroup@deploy2002> |
ladsgroup: Continuing with sync |
[production] |
13:41 |
<ladsgroup@deploy2002> |
ladsgroup: Backport for [[gerrit:1079984|Update interwiki.php]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
13:39 |
<ladsgroup@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1079984|Update interwiki.php]] |
[production] |
13:35 |
<elukey@cumin1002> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts aux-k8s-etcd1002.eqiad.wmnet |
[production] |
13:35 |
<elukey@cumin1002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
13:35 |
<elukey@cumin1002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: aux-k8s-etcd1002.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - elukey@cumin1002" |
[production] |
13:34 |
<elukey@cumin1002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: aux-k8s-etcd1002.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - elukey@cumin1002" |
[production] |
13:31 |
<elukey@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
13:29 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Depooling db1180 (T376905)', diff saved to https://phabricator.wikimedia.org/P69811 and previous config saved to /var/cache/conftool/dbconfig/20241014-132944-ladsgroup.json |
[production] |
13:29 |
<ladsgroup@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1180.eqiad.wmnet with reason: Maintenance |
[production] |
13:29 |
<ladsgroup@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1180.eqiad.wmnet with reason: Maintenance |
[production] |