2101-2150 of 10000 results (62ms)
2024-08-12 ยง
14:38 <jgiannelos@deploy1003> helmfile [eqiad] START helmfile.d/services/mobileapps: apply [production]
14:38 <jgiannelos@deploy1003> helmfile [eqiad] START helmfile.d/services/mobileapps: apply [production]
14:34 <jgiannelos@deploy1003> helmfile [eqiad] START helmfile.d/services/mobileapps: apply [production]
14:23 <zabe@deploy1003> Finished scap: Backport for [[gerrit:1061152|Further configuration for bdrwiki (T371760)]] (duration: 21m 07s) [production]
14:20 <dhinus> "./bin/stashbot.sh restart #stashbot had quit IRC" [tools.stashbot]
14:01 <zabe@deploy1003> Started scap sync-world: Backport for [[gerrit:1061152|Further configuration for bdrwiki (T371760)]] [production]
13:46 <hnowlan@deploy1003> helmfile [eqiad] DONE helmfile.d/services/shellbox-video: apply [production]
13:46 <hnowlan@deploy1003> helmfile [eqiad] START helmfile.d/services/shellbox-video: apply [production]
13:33 <klausman@deploy1003> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'llm' for release 'main' . [production]
13:33 <klausman@deploy1003> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'llm' for release 'main' . [production]
13:25 <jgiannelos@deploy1003> helmfile [staging] DONE helmfile.d/services/mobileapps: apply [production]
13:24 <jgiannelos@deploy1003> helmfile [staging] START helmfile.d/services/mobileapps: apply [production]
13:24 <jgiannelos@deploy1003> helmfile [staging] START helmfile.d/services/mobileapps: apply [production]
13:05 <stevemunene> Bump airflow version on `an-test-client1002` T365449 [analytics]
12:37 <elukey> restart exim4 on list2001 to pick up the new TLS material [production]
12:35 <elukey> restart exim4 on list1004 to pick up the new TLS material [production]
12:32 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 6:00:00 on db1240.eqiad.wmnet with reason: Maintenance [production]
12:32 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 6:00:00 on db1240.eqiad.wmnet with reason: Maintenance [production]
12:31 <dcaro@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component jobs-api [tools]
12:25 <dcaro@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component jobs-api [tools]
12:16 <dcaro@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component jobs-api [toolsbeta]
12:11 <dcaro@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component jobs-api [toolsbeta]
12:11 <elukey@cumin1002> START - Cookbook sre.cassandra.roll-restart for nodes matching A:restbase-codfw: Openjdk upgrade - elukey@cumin1002 [production]
12:05 <dcaro@cloudcumin1001> END (FAIL) - Cookbook wmcs.toolforge.component.deploy (exit_code=99) for component jobs-api [toolsbeta]
12:04 <kevinbazira@deploy1003> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'llm' for release 'main' . [production]
12:03 <kevinbazira@deploy1003> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'llm' for release 'main' . [production]
12:03 <dcaro@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component jobs-api [toolsbeta]
12:00 <dcaro@cloudcumin1001> END (FAIL) - Cookbook wmcs.toolforge.component.deploy (exit_code=99) for component components-api [toolsbeta]
12:00 <dcaro@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component components-api [toolsbeta]
11:59 <kevinbazira@deploy1003> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'llm' for release 'main' . [production]
11:52 <wmbot~dcaro@urcuchillay> END (FAIL) - Cookbook wmcs.ceph.wait_for_rebalance (exit_code=99) [admin]
11:51 <wmbot~dcaro@urcuchillay> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component tools-webservice [tools]
11:51 <dcaro@cloudcumin1001> START - Cookbook wmcs.ceph.osd.undrain_node (T363344) [admin]
11:51 <dcaro@cloudcumin1001> END (ERROR) - Cookbook wmcs.ceph.osd.undrain_node (exit_code=97) (T363344) [admin]
11:51 <dcaro@cloudcumin1001> START - Cookbook wmcs.ceph.osd.undrain_node (T363344) [admin]
11:51 <dcaro@cloudcumin1001> END (ERROR) - Cookbook wmcs.ceph.osd.undrain_node (exit_code=97) (T363344) [admin]
11:51 <dcaro@cloudcumin1001> START - Cookbook wmcs.ceph.osd.undrain_node (T363344) [admin]
11:46 <wmbot~dcaro@urcuchillay> START - Cookbook wmcs.toolforge.component.deploy for component tools-webservice [tools]
11:42 <wmbot~dcaro@urcuchillay> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component tools-webservice [toolsbeta]
11:37 <wmbot~dcaro@urcuchillay> START - Cookbook wmcs.toolforge.component.deploy for component tools-webservice [toolsbeta]
11:36 <wmbot~dcaro@urcuchillay> END (FAIL) - Cookbook wmcs.toolforge.component.deploy (exit_code=99) for component tools-webservice [toolsbeta]
11:35 <wmbot~dcaro@urcuchillay> START - Cookbook wmcs.toolforge.component.deploy for component tools-webservice [toolsbeta]
11:26 <hnowlan> rebuilding php7.4-fpm and php7.4-fpm-multiversion-base to pick up healthz worker awareness change (r/1060867) [production]
11:22 <ladsgroup@cumin1002> conftool action : set/pooled=no; selector: name=clouddb1017.eqiad.wmnet,service=s1 [production]
11:10 <kevinbazira@deploy1003> helmfile [ml-staging-codfw] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . [production]
11:06 <isaranto@deploy1003> helmfile [ml-staging-codfw] 'sync' command on namespace 'ores-legacy' for release 'main' . [production]
11:04 <isaranto@deploy1003> helmfile [ml-serve-eqiad] 'sync' command on namespace 'ores-legacy' for release 'main' . [production]
11:03 <isaranto@deploy1003> helmfile [ml-serve-codfw] 'sync' command on namespace 'ores-legacy' for release 'main' . [production]
10:37 <dcaro@cloudcumin1001> END (FAIL) - Cookbook wmcs.toolforge.component.deploy (exit_code=99) for component tools-webservice [toolsbeta]
10:31 <dcaro@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component tools-webservice [toolsbeta]