2025-07-03
19:36 <joal@deploy1003> Finished deploy [airflow-dags/analytics@7ba4a7b]: BUGFIX - Synchronize artifact for airflow_dags/analytics (duration: 01m 02s) [production]
19:35 <joal@deploy1003> Started deploy [airflow-dags/analytics@7ba4a7b]: BUGFIX - Synchronize artifact for airflow_dags/analytics [production]
19:34 <joal@deploy1003> Finished deploy [airflow-dags/analytics_test@7ba4a7b]: BUGFIX - Synchronize artifact for airflow_dags/analytics_test (duration: 00m 16s) [production]
19:34 <joal@deploy1003> Started deploy [airflow-dags/analytics_test@7ba4a7b]: BUGFIX - Synchronize artifact for airflow_dags/analytics_test [production]
17:33 <stevemunene@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host an-worker1176.eqiad.wmnet with OS bullseye [production]
17:26 <joal@deploy1003> Finished deploy [airflow-dags/analytics@9088e59]: Synchronize artifacts for airflow_dags/analytics (duration: 00m 40s) [production]
17:25 <joal@deploy1003> Started deploy [airflow-dags/analytics@9088e59]: Synchronize artifacts for airflow_dags/analytics [production]
17:24 <joal@deploy1003> Finished deploy [airflow-dags/analytics_test@9088e59]: Synchronize artifact for airflow_dags/analytics_test (duration: 00m 15s) [production]
17:24 <joal@deploy1003> Started deploy [airflow-dags/analytics_test@9088e59]: Synchronize artifact for airflow_dags/analytics_test [production]
17:18 <stevemunene@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-worker1176.eqiad.wmnet with reason: host reimage [production]
17:15 <stevemunene@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1176.eqiad.wmnet with reason: host reimage [production]
17:13 <cmooney@cumin1003> END (PASS) - Cookbook sre.netbox.update-extras (exit_code=0) rolling restart_daemons on A:netbox [production]
17:13 <cmooney@cumin1003> START - Cookbook sre.netbox.update-extras rolling restart_daemons on A:netbox [production]
17:13 <cmooney@cumin1003> END (PASS) - Cookbook sre.netbox.update-extras (exit_code=0) rolling restart_daemons on A:netbox-canary [production]
17:12 <cmooney@cumin1003> START - Cookbook sre.netbox.update-extras rolling restart_daemons on A:netbox-canary [production]
17:01 <stevemunene@cumin1002> START - Cookbook sre.hosts.reimage for host an-worker1176.eqiad.wmnet with OS bullseye [production]
16:36 <dcaro@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component components-api [tools]
16:32 <hnowlan@deploy1003> helmfile [eqiad] DONE helmfile.d/services/api-gateway: apply [production]
16:32 <hnowlan@deploy1003> helmfile [eqiad] START helmfile.d/services/api-gateway: apply [production]
16:32 <hnowlan@deploy1003> helmfile [codfw] DONE helmfile.d/services/api-gateway: apply [production]
16:31 <hnowlan@deploy1003> helmfile [codfw] START helmfile.d/services/api-gateway: apply [production]
16:30 <dcaro@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component components-api [tools]
16:11 <vgutierrez> repooling cp7006 [production]
16:09 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for cp7006.magru.wmnet [production]
16:09 <vgutierrez@cumin1002> START - Cookbook sre.hosts.remove-downtime for cp7006.magru.wmnet [production]
15:52 <hnowlan@deploy1003> helmfile [eqiad] DONE helmfile.d/services/api-gateway: apply [production]
15:52 <hnowlan@deploy1003> helmfile [eqiad] START helmfile.d/services/api-gateway: apply [production]
15:46 <hnowlan@deploy1003> helmfile [codfw] DONE helmfile.d/services/api-gateway: apply [production]
15:46 <hnowlan@deploy1003> helmfile [codfw] START helmfile.d/services/api-gateway: apply [production]
15:42 <jclark@cumin1002> START - Cookbook sre.hosts.reimage for host an-worker1176.eqiad.wmnet with OS bullseye [production]
15:38 <jclark@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-worker1176.eqiad.wmnet with OS bullseye [production]
15:34 <vgutierrez@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on cp7006.magru.wmnet with reason: testing [production]
15:33 <vgutierrez> depooling cp7006 for testing [production]
15:31 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2213 (T395241)', diff saved to https://phabricator.wikimedia.org/P78755 and previous config saved to /var/cache/conftool/dbconfig/20250703-153141-fceratto.json [production]
15:25 <jmm@cumin1003> END (PASS) - Cookbook sre.k8s.reboot-nodes (exit_code=0) rolling reboot on A:aux-worker-codfw [production]
15:23 <fceratto@deploy1003> helmfile [aux-k8s-eqiad] 'sync' command on namespace 'zarcillo' for release 'main' . [production]
15:22 <vgutierrez> lvs5006 migrated to katran - T396561 [production]
15:21 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for lvs5006.eqsin.wmnet [production]
15:21 <vgutierrez@cumin1002> START - Cookbook sre.hosts.remove-downtime for lvs5006.eqsin.wmnet [production]
15:16 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2213', diff saved to https://phabricator.wikimedia.org/P78754 and previous config saved to /var/cache/conftool/dbconfig/20250703-151633-fceratto.json [production]
15:15 <dcaro@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component components-api [toolsbeta]
15:10 <dcaro@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component components-api [toolsbeta]
15:10 <vgutierrez@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on lvs5006.eqsin.wmnet with reason: katran migration [production]
15:04 <jmm@cumin1003> START - Cookbook sre.k8s.reboot-nodes rolling reboot on A:aux-worker-codfw [production]
15:04 <jmm@cumin1003> END (PASS) - Cookbook sre.k8s.reboot-nodes (exit_code=0) rolling reboot on A:aux-worker-eqiad [production]
15:01 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2213', diff saved to https://phabricator.wikimedia.org/P78753 and previous config saved to /var/cache/conftool/dbconfig/20250703-150126-fceratto.json [production]
14:56 <jynus@cumin1002> DONE (PASS) - Cookbook sre.puppet.renew-cert (exit_code=0) for backup1007.eqiad.wmnet: Renew puppet certificate - jynus@cumin1002 [production]
14:55 <cgoubert@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host registry1005.eqiad.wmnet [production]
14:51 <cgoubert@cumin1003> START - Cookbook sre.hosts.reboot-single for host registry1005.eqiad.wmnet [production]
14:50 <jynus@cumin1002> DONE (PASS) - Cookbook sre.puppet.renew-cert (exit_code=0) for backup1006.eqiad.wmnet: Renew puppet certificate - jynus@cumin1002 [production]