2701-2750 of 10000 results (82ms)
2023-06-27 ยง
17:20 <brennen@deploy1002> Finished scap: testwikis wikis to 1.41.0-wmf.15 refs T340243 (duration: 42m 56s) [production]
16:49 <mutante> webperf1003/2003 restarted apache after deploying gerrit:932441 [production]
16:37 <brennen@deploy1002> Started scap: testwikis wikis to 1.41.0-wmf.15 refs T340243 [production]
16:36 <brennen> train 1.41.0-wmf.15: re-running scap stage-train (T340243) [production]
16:03 <btullis@deploy1002> helmfile [staging] DONE helmfile.d/services/datahub: sync on main [production]
15:51 <btullis@deploy1002> helmfile [staging] START helmfile.d/services/datahub: apply on main [production]
15:36 <jbond> puppet-merge fixed again [production]
15:35 <root@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti-test2001.codfw.wmnet [production]
15:34 <root@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti-test2001.codfw.wmnet [production]
15:34 <jmm@cumin2002> END (FAIL) - Cookbook sre.ganeti.drain-node (exit_code=99) for draining ganeti node ganeti-test2002.codfw.wmnet [production]
15:34 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti-test2002.codfw.wmnet [production]
15:33 <root@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti-test2001.codfw.wmnet [production]
15:33 <root@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti-test2001.codfw.wmnet [production]
15:32 <root@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti-test2001.codfw.wmnet [production]
15:32 <root@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti-test2001.codfw.wmnet [production]
15:24 <root@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti-test2001.codfw.wmnet [production]
15:24 <root@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti-test2001.codfw.wmnet [production]
15:24 <jbond> puppet-merge temprrarily broken [production]
15:23 <jbond> hi all fyi i have temporarily broken puppet-merge, fix is being done [production]
15:23 <root@cumin2002> END (FAIL) - Cookbook sre.ganeti.drain-node (exit_code=99) for draining ganeti node ganeti-test2001.codfw.wmnet [production]
15:23 <root@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti-test2001.codfw.wmnet [production]
15:21 <root@cumin2002> END (FAIL) - Cookbook sre.ganeti.drain-node (exit_code=99) for draining ganeti node ganeti-test2001.codfw.wmnet [production]
15:20 <root@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti-test2001.codfw.wmnet [production]
15:01 <btullis@deploy1002> helmfile [staging] START helmfile.d/services/datahub: apply on main [production]
14:53 <mforns@deploy1002> Finished deploy [airflow-dags/analytics@5e77b01]: (no justification provided) (duration: 00m 10s) [production]
14:52 <mforns@deploy1002> Started deploy [airflow-dags/analytics@5e77b01]: (no justification provided) [production]
14:47 <root@cumin2002> END (FAIL) - Cookbook sre.ganeti.drain-node (exit_code=99) for draining ganeti node ganeti-test2001.codfw.wmnet [production]
14:46 <root@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti-test2001.codfw.wmnet [production]
14:41 <elukey@cumin1001> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:ml-cache-eqiad: Roll restart to pick up new certs and openjdk version - elukey@cumin1001 [production]
14:27 <btullis@deploy1002> helmfile [staging] START helmfile.d/services/datahub: apply on main [production]
14:24 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti-test2002.codfw.wmnet [production]
14:24 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti-test2002.codfw.wmnet [production]
14:23 <elukey@cumin1001> START - Cookbook sre.cassandra.roll-restart for nodes matching A:ml-cache-eqiad: Roll restart to pick up new certs and openjdk version - elukey@cumin1001 [production]
14:21 <elukey@cumin1001> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:ml-cache-codfw: Roll restart to pick up new certs and openjdk version - elukey@cumin1001 [production]
14:17 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti-test2002.codfw.wmnet [production]
14:16 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti-test2002.codfw.wmnet [production]
14:04 <elukey@cumin1001> START - Cookbook sre.cassandra.roll-restart for nodes matching A:ml-cache-codfw: Roll restart to pick up new certs and openjdk version - elukey@cumin1001 [production]
13:32 <elukey> expand ml-staging200[12] kubelet partitions - T339231 [production]
13:27 <stevemunene@cumin1001> START - Cookbook sre.hosts.reimage for host an-test-worker1003.eqiad.wmnet with OS bullseye [production]
13:26 <joal@deploy1002> Finished deploy [airflow-dags/analytics@9eca77f]: Regular analytics weekly train [airflow-dags/analytics@9eca77f7] (duration: 00m 09s) [production]
13:26 <stevemunene@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-test-worker1003.eqiad.wmnet with OS bullseye [production]
13:26 <joal@deploy1002> Started deploy [airflow-dags/analytics@9eca77f]: Regular analytics weekly train [airflow-dags/analytics@9eca77f7] [production]
13:18 <btullis@deploy1002> helmfile [staging] DONE helmfile.d/services/datahub: sync on main [production]
13:06 <btullis@deploy1002> helmfile [staging] START helmfile.d/services/datahub: apply on main [production]
12:58 <elukey@deploy1002> helmfile [ml-staging-codfw] DONE helmfile.d/admin 'sync'. [production]
12:57 <elukey@deploy1002> helmfile [ml-staging-codfw] START helmfile.d/admin 'sync'. [production]
12:57 <marostegui> Failover m3-master to dbproxy1026 T337812 [production]
11:55 <daniel@deploy1002> Finished scap: Backport for [[gerrit:933437|Parsoid: Disable PC writes on enwiki (T339867)]] (duration: 12m 06s) [production]
11:51 <jgiannelos@deploy1002> helmfile [staging] DONE helmfile.d/services/wikifeeds: apply [production]
11:50 <jgiannelos@deploy1002> helmfile [staging] START helmfile.d/services/wikifeeds: apply [production]