6401-6450 of 10000 results (50ms)
2025-08-05 ยง
12:26 <gkyziridis@deploy1003> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'llm' for release 'main' . [production]
12:17 <dcaro@cloudcumin1001> END (FAIL) - Cookbook wmcs.toolforge.component.deploy (exit_code=99) for component tools-webservice [toolsbeta]
12:17 <dcaro@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component tools-webservice [toolsbeta]
12:17 <dcaro@cloudcumin1001> END (FAIL) - Cookbook wmcs.toolforge.component.deploy (exit_code=99) for component tool-webservice [toolsbeta]
12:17 <dcaro@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component tool-webservice [toolsbeta]
12:16 <dcaro@cloudcumin1001> END (FAIL) - Cookbook wmcs.toolforge.component.deploy (exit_code=99) for component webservice [toolsbeta]
12:16 <dcaro@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component webservice [toolsbeta]
12:16 <dcaro@cloudcumin1001> END (FAIL) - Cookbook wmcs.toolforge.component.deploy (exit_code=99) for component webservice-cli [toolsbeta]
12:16 <dcaro@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component webservice-cli [toolsbeta]
12:16 <dcaro@cloudcumin1001> END (FAIL) - Cookbook wmcs.toolforge.component.deploy (exit_code=99) for component tools-webservice [toolsbeta]
12:15 <dcaro@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component tools-webservice [toolsbeta]
12:11 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1191 (T399728)', diff saved to https://phabricator.wikimedia.org/P80805 and previous config saved to /var/cache/conftool/dbconfig/20250805-121132-fceratto.json [production]
12:08 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db1191 (T399728)', diff saved to https://phabricator.wikimedia.org/P80803 and previous config saved to /var/cache/conftool/dbconfig/20250805-120857-fceratto.json [production]
12:08 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1191.eqiad.wmnet with reason: Maintenance [production]
12:08 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1181 (T399728)', diff saved to https://phabricator.wikimedia.org/P80802 and previous config saved to /var/cache/conftool/dbconfig/20250805-120835-fceratto.json [production]
11:53 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1181', diff saved to https://phabricator.wikimedia.org/P80801 and previous config saved to /var/cache/conftool/dbconfig/20250805-115327-fceratto.json [production]
11:38 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1181', diff saved to https://phabricator.wikimedia.org/P80800 and previous config saved to /var/cache/conftool/dbconfig/20250805-113820-fceratto.json [production]
11:25 <dcaro@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.k8s.reboot (exit_code=0) for tools-k8s-worker-nfs-82, tools-k8s-worker-nfs-24 [tools]
11:23 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1181 (T399728)', diff saved to https://phabricator.wikimedia.org/P80799 and previous config saved to /var/cache/conftool/dbconfig/20250805-112312-fceratto.json [production]
11:20 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db1181 (T399728)', diff saved to https://phabricator.wikimedia.org/P80798 and previous config saved to /var/cache/conftool/dbconfig/20250805-112036-fceratto.json [production]
11:20 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1181.eqiad.wmnet with reason: Maintenance [production]
11:20 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1174 (T399728)', diff saved to https://phabricator.wikimedia.org/P80797 and previous config saved to /var/cache/conftool/dbconfig/20250805-112014-fceratto.json [production]
11:16 <dcaro@cloudcumin1001> START - Cookbook wmcs.toolforge.k8s.reboot for tools-k8s-worker-nfs-82, tools-k8s-worker-nfs-24 [tools]
11:15 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts dumpsdata1003.eqiad.wmnet [production]
11:15 <btullis@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
11:15 <btullis@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: dumpsdata1003.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - btullis@cumin1003" [production]
11:14 <btullis@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: dumpsdata1003.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - btullis@cumin1003" [production]
11:05 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1174', diff saved to https://phabricator.wikimedia.org/P80796 and previous config saved to /var/cache/conftool/dbconfig/20250805-110506-fceratto.json [production]
10:56 <jnuche@deploy1003> Finished deploy [releng/jenkins-deploy@62138e1] (releasing): T401180 (duration: 00m 32s) [production]
10:56 <jnuche@deploy1003> Started deploy [releng/jenkins-deploy@62138e1] (releasing): T401180 [production]
10:55 <btullis@cumin1003> START - Cookbook sre.dns.netbox [production]
10:50 <btullis@cumin1003> START - Cookbook sre.hosts.decommission for hosts dumpsdata1003.eqiad.wmnet [production]
10:49 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1174', diff saved to https://phabricator.wikimedia.org/P80795 and previous config saved to /var/cache/conftool/dbconfig/20250805-104959-fceratto.json [production]
10:47 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts snapshot1010.eqiad.wmnet [production]
10:47 <btullis@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
10:47 <btullis@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: snapshot1010.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - btullis@cumin1003" [production]
10:47 <btullis@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: snapshot1010.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - btullis@cumin1003" [production]
10:39 <xSavitar> Ran fixStuckGlobalRename.php for T400862 [production]
10:36 <xSavitar> Ran fixStuckGlobalRename.php for T400974 [production]
10:34 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1174 (T399728)', diff saved to https://phabricator.wikimedia.org/P80794 and previous config saved to /var/cache/conftool/dbconfig/20250805-103451-fceratto.json [production]
10:32 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db1174 (T399728)', diff saved to https://phabricator.wikimedia.org/P80793 and previous config saved to /var/cache/conftool/dbconfig/20250805-103213-fceratto.json [production]
10:32 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1174.eqiad.wmnet with reason: Maintenance [production]
10:31 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1171.eqiad.wmnet with reason: Maintenance [production]
10:30 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1170 (T399728)', diff saved to https://phabricator.wikimedia.org/P80792 and previous config saved to /var/cache/conftool/dbconfig/20250805-103055-fceratto.json [production]
10:23 <jelto@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host gitlab2002.wikimedia.org with OS bookworm [production]
10:18 <btullis@cumin1003> START - Cookbook sre.dns.netbox [production]
10:15 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1170', diff saved to https://phabricator.wikimedia.org/P80791 and previous config saved to /var/cache/conftool/dbconfig/20250805-101548-fceratto.json [production]
10:12 <hashar@deploy1003> Finished scap sync-world: Backport for [[gerrit:1175851|In robots.txt permit access to the sitemap API (T400023 T396684)]] (duration: 08m 01s) [production]
10:09 <btullis@cumin1003> START - Cookbook sre.hosts.decommission for hosts snapshot1010.eqiad.wmnet [production]
10:06 <hashar@deploy1003> tstarling, hashar: Continuing with sync [production]