1601-1650 of 10000 results (99ms)
2025-08-05 ยง
15:17 <sukhe@dns1004> START - running authdns-update [production]
15:17 <dzahn@cumin2002> DONE (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 0:30:00 on phab.wmfusercontent.org with reason: version upgrade [production]
15:14 <dzahn@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on phab2002.codfw.wmnet with reason: phab deploy [production]
15:14 <dzahn@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on phab1004.eqiad.wmnet with reason: phab deploy [production]
15:11 <elukey@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp2043.codfw.wmnet with OS bookworm [production]
15:10 <jhancock@cumin1003> END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) [production]
15:07 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1227', diff saved to https://phabricator.wikimedia.org/P80823 and previous config saved to /var/cache/conftool/dbconfig/20250805-150701-fceratto.json [production]
14:57 <elukey@cumin1003> START - Cookbook sre.hosts.reimage for host cp2043.codfw.wmnet with OS bookworm [production]
14:51 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1227', diff saved to https://phabricator.wikimedia.org/P80822 and previous config saved to /var/cache/conftool/dbconfig/20250805-145153-fceratto.json [production]
14:49 <jhancock@cumin1003> START - Cookbook sre.dns.netbox [production]
14:49 <jhancock@cumin1003> END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) [production]
14:49 <jhancock@cumin1003> END (FAIL) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=99) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: adding dbprov2007 to codfw - jhancock@cumin1003" [production]
14:47 <jhancock@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: adding dbprov2007 to codfw - jhancock@cumin1003" [production]
14:44 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-ml: apply [production]
14:43 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-ml: apply [production]
14:42 <jhancock@cumin1003> START - Cookbook sre.dns.netbox [production]
14:39 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-ml: apply [production]
14:37 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-ml: apply [production]
14:36 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1227 (T399728)', diff saved to https://phabricator.wikimedia.org/P80821 and previous config saved to /var/cache/conftool/dbconfig/20250805-143646-fceratto.json [production]
14:33 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db1227 (T399728)', diff saved to https://phabricator.wikimedia.org/P80820 and previous config saved to /var/cache/conftool/dbconfig/20250805-143359-fceratto.json [production]
14:33 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1227.eqiad.wmnet with reason: Maintenance [production]
14:33 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1202 (T399728)', diff saved to https://phabricator.wikimedia.org/P80819 and previous config saved to /var/cache/conftool/dbconfig/20250805-143336-fceratto.json [production]
14:26 <btullis@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
14:24 <btullis@cumin1003> START - Cookbook sre.dns.netbox [production]
14:18 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1202', diff saved to https://phabricator.wikimedia.org/P80818 and previous config saved to /var/cache/conftool/dbconfig/20250805-141829-fceratto.json [production]
14:18 <cgoubert@deploy1003> helmfile [eqiad] DONE helmfile.d/admin 'apply'. [production]
14:17 <cgoubert@deploy1003> helmfile [eqiad] START helmfile.d/admin 'apply'. [production]
14:17 <mszabo@deploy1003> Finished scap sync-world: Backport for [[gerrit:1175899|UserInfoCard: Cap maximum count for thanks given/received (T398354)]] (duration: 36m 20s) [production]
14:17 <cgoubert@deploy1003> helmfile [codfw] DONE helmfile.d/admin 'apply'. [production]
14:16 <cgoubert@deploy1003> helmfile [codfw] START helmfile.d/admin 'apply'. [production]
14:16 <cgoubert@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
14:15 <cgoubert@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
14:15 <cgoubert@deploy1003> helmfile [aux-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
14:14 <cgoubert@deploy1003> helmfile [aux-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
14:14 <cgoubert@deploy1003> helmfile [aux-k8s-codfw] DONE helmfile.d/admin 'apply'. [production]
14:13 <cgoubert@deploy1003> helmfile [aux-k8s-codfw] START helmfile.d/admin 'apply'. [production]
14:13 <cgoubert@deploy1003> helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'apply'. [production]
14:12 <cgoubert@deploy1003> helmfile [ml-serve-eqiad] START helmfile.d/admin 'apply'. [production]
14:12 <cgoubert@deploy1003> helmfile [ml-serve-codfw] DONE helmfile.d/admin 'apply'. [production]
14:09 <cgoubert@deploy1003> helmfile [ml-serve-codfw] START helmfile.d/admin 'apply'. [production]
14:09 <cgoubert@deploy1003> helmfile [ml-staging-codfw] DONE helmfile.d/admin 'apply'. [production]
14:08 <cgoubert@deploy1003> helmfile [ml-staging-codfw] START helmfile.d/admin 'apply'. [production]
14:06 <btullis@cumin1003> START - Cookbook sre.dns.netbox [production]
14:06 <cgoubert@deploy1003> helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'. [production]
14:06 <btullis@cumin1003> START - Cookbook sre.hosts.rename from snapshot1011 to dse-k8s-worker1015 [production]
14:05 <mszabo@deploy1003> mszabo: Continuing with sync [production]
14:04 <cgoubert@deploy1003> helmfile [staging-eqiad] START helmfile.d/admin 'apply'. [production]
14:03 <cgoubert@deploy1003> helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. [production]
14:03 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1202', diff saved to https://phabricator.wikimedia.org/P80817 and previous config saved to /var/cache/conftool/dbconfig/20250805-140321-fceratto.json [production]
14:02 <mszabo@deploy1003> mszabo: Backport for [[gerrit:1175899|UserInfoCard: Cap maximum count for thanks given/received (T398354)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]