1351-1400 of 10000 results (115ms)
2024-10-14 ยง
17:01 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1187 (T376905)', diff saved to https://phabricator.wikimedia.org/P69820 and previous config saved to /var/cache/conftool/dbconfig/20241014-170123-ladsgroup.json [production]
16:51 <fnegri@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 14 days, 0:00:00 on cloudvirt1063.eqiad.wmnet with reason: cloudvirt1063 needs maintenance T375223 [production]
16:50 <fnegri@cumin1002> START - Cookbook sre.hosts.downtime for 14 days, 0:00:00 on cloudvirt1063.eqiad.wmnet with reason: cloudvirt1063 needs maintenance T375223 [production]
16:46 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1187', diff saved to https://phabricator.wikimedia.org/P69819 and previous config saved to /var/cache/conftool/dbconfig/20241014-164616-ladsgroup.json [production]
16:31 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1187', diff saved to https://phabricator.wikimedia.org/P69818 and previous config saved to /var/cache/conftool/dbconfig/20241014-163109-ladsgroup.json [production]
16:16 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1187 (T376905)', diff saved to https://phabricator.wikimedia.org/P69817 and previous config saved to /var/cache/conftool/dbconfig/20241014-161602-ladsgroup.json [production]
16:03 <sergi0> Running `sgimeno@mwmaint2002:~$ foreachwiki userOptions.php --delete --old=1 growthexperiments-tour-newimpact-discovery` (T376461) [production]
15:52 <aikochou@deploy2002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revision-models' for release 'main' . [production]
15:46 <aikochou@deploy2002> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revision-models' for release 'main' . [production]
15:16 <isaranto@deploy2002> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'llm' for release 'main' . [production]
15:15 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling db1187 (T376905)', diff saved to https://phabricator.wikimedia.org/P69816 and previous config saved to /var/cache/conftool/dbconfig/20241014-151546-ladsgroup.json [production]
15:15 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1187.eqiad.wmnet with reason: Maintenance [production]
15:15 <isaranto@deploy2002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'llm' for release 'main' . [production]
15:15 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1187.eqiad.wmnet with reason: Maintenance [production]
15:15 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1180 (T376905)', diff saved to https://phabricator.wikimedia.org/P69815 and previous config saved to /var/cache/conftool/dbconfig/20241014-151521-ladsgroup.json [production]
15:07 <elukey@deploy2002> helmfile [aux-k8s-eqiad] DONE helmfile.d/admin 'sync'. [production]
15:06 <elukey@deploy2002> helmfile [aux-k8s-eqiad] START helmfile.d/admin 'sync'. [production]
15:05 <isaranto@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'llm' for release 'main' . [production]
15:00 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1180', diff saved to https://phabricator.wikimedia.org/P69814 and previous config saved to /var/cache/conftool/dbconfig/20241014-150014-ladsgroup.json [production]
14:45 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1180', diff saved to https://phabricator.wikimedia.org/P69813 and previous config saved to /var/cache/conftool/dbconfig/20241014-144507-ladsgroup.json [production]
14:43 <aikochou@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revision-models' for release 'main' . [production]
14:43 <jayme@deploy1003> helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'. [production]
14:41 <jayme@deploy1003> helmfile [staging-eqiad] START helmfile.d/admin 'apply'. [production]
14:41 <jayme@deploy1003> helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. [production]
14:39 <jayme@deploy1003> helmfile [staging-codfw] START helmfile.d/admin 'apply'. [production]
14:30 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1180 (T376905)', diff saved to https://phabricator.wikimedia.org/P69812 and previous config saved to /var/cache/conftool/dbconfig/20241014-143000-ladsgroup.json [production]
14:16 <stevemunene@cumin1002> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) for hosts an-worker1177.eqiad.wmnet [production]
14:16 <stevemunene@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
14:16 <stevemunene@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: an-worker1177.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - stevemunene@cumin1002" [production]
14:16 <stevemunene@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: an-worker1177.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - stevemunene@cumin1002" [production]
14:12 <stevemunene@cumin1002> START - Cookbook sre.dns.netbox [production]
14:12 <Lucas_WMDE> UTC afternoon backport+config window done [production]
14:10 <Lucas_WMDE> [untruncated duration: 06m 48s] [production]
14:09 <lucaswerkmeister-wmde@deploy2002> Finished scap sync-world: Backport for [[gerrit:1079923|refactor(tests): don't use per-method coverage annotation]], [[gerrit:1079894|refactor(HomepageHooks): extract method for simpler modifyability]], [[gerrit:1079915|Clear LinkRecommendation suggestions on page save (T364341 T372337)]], [[gerrit:1079925|Run fixLinkRecommendationData even when disabled in CC (T373176)]] (duration: 0 [production]
14:07 <stevemunene@cumin1002> START - Cookbook sre.hosts.decommission for hosts an-worker1177.eqiad.wmnet [production]
14:07 <stevemunene@cumin1002> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) for hosts an-worker1176.eqiad.wmnet [production]
14:07 <stevemunene@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
14:07 <stevemunene@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: an-worker1176.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - stevemunene@cumin1002" [production]
14:06 <stevemunene@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: an-worker1176.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - stevemunene@cumin1002" [production]
14:04 <lucaswerkmeister-wmde@deploy2002> migr, lucaswerkmeister-wmde: Continuing with sync [production]
14:04 <lucaswerkmeister-wmde@deploy2002> migr, lucaswerkmeister-wmde: Backport for [[gerrit:1079923|refactor(tests): don't use per-method coverage annotation]], [[gerrit:1079894|refactor(HomepageHooks): extract method for simpler modifyability]], [[gerrit:1079915|Clear LinkRecommendation suggestions on page save (T364341 T372337)]], [[gerrit:1079925|Run fixLinkRecommendationData even when disabled in CC (T373176)]] synced to [production]
14:03 <stevemunene@cumin1002> START - Cookbook sre.dns.netbox [production]
14:02 <lucaswerkmeister-wmde@deploy2002> Started scap sync-world: Backport for [[gerrit:1079923|refactor(tests): don't use per-method coverage annotation]], [[gerrit:1079894|refactor(HomepageHooks): extract method for simpler modifyability]], [[gerrit:1079915|Clear LinkRecommendation suggestions on page save (T364341 T372337)]], [[gerrit:1079925|Run fixLinkRecommendationData even when disabled in CC (T373176)]] [production]
13:58 <stevemunene@cumin1002> START - Cookbook sre.hosts.decommission for hosts an-worker1176.eqiad.wmnet [production]
13:46 <ladsgroup@deploy2002> Finished scap sync-world: Backport for [[gerrit:1079984|Update interwiki.php]] (duration: 07m 00s) [production]
13:45 <kcvelaga@deploy2002> Finished deploy [airflow-dags/analytics_product@fbcf880]: T375480 (duration: 01m 07s) [production]
13:44 <kcvelaga@deploy2002> Started deploy [airflow-dags/analytics_product@fbcf880]: T375480 [production]
13:41 <ladsgroup@deploy2002> ladsgroup: Continuing with sync [production]
13:41 <ladsgroup@deploy2002> ladsgroup: Backport for [[gerrit:1079984|Update interwiki.php]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
13:39 <ladsgroup@deploy2002> Started scap sync-world: Backport for [[gerrit:1079984|Update interwiki.php]] [production]