1951-2000 of 10000 results (95ms)
2023-12-07 ยง
11:48 <btullis@deploy2002> Started deploy [analytics/refinery@b6499b1] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@b6499b17] [production]
11:33 <kamila@deploy2002> helmfile [staging] DONE helmfile.d/services/mobileapps: apply [production]
11:33 <kamila@deploy2002> helmfile [staging] START helmfile.d/services/mobileapps: apply [production]
11:30 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2118.codfw.wmnet with reason: Maintenance [production]
11:30 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2118.codfw.wmnet with reason: Maintenance [production]
11:30 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1181.eqiad.wmnet with reason: Maintenance [production]
11:30 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1181.eqiad.wmnet with reason: Maintenance [production]
11:17 <klausman@deploy2002> helmfile [eqiad] DONE helmfile.d/services/api-gateway: apply [production]
11:17 <klausman@deploy2002> helmfile [eqiad] START helmfile.d/services/api-gateway: apply [production]
11:14 <aikochou@deploy2002> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revertrisk' for release 'main' . [production]
11:14 <klausman@deploy2002> helmfile [staging] DONE helmfile.d/services/api-gateway: apply [production]
11:13 <klausman@deploy2002> helmfile [staging] START helmfile.d/services/api-gateway: apply [production]
11:13 <klausman@deploy2002> helmfile [codfw] DONE helmfile.d/services/api-gateway: apply [production]
11:12 <klausman@deploy2002> helmfile [codfw] START helmfile.d/services/api-gateway: apply [production]
11:10 <aikochou@deploy2002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revertrisk' for release 'main' . [production]
11:10 <brouberol@cumin1001> END (PASS) - Cookbook sre.hadoop.roll-restart-masters (exit_code=0) restart masters for Hadoop test cluster: Restart of jvm daemons. [production]
11:01 <brouberol@cumin1001> START - Cookbook sre.hadoop.roll-restart-masters restart masters for Hadoop test cluster: Restart of jvm daemons. [production]
10:58 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1181.eqiad.wmnet with reason: Maintenance [production]
10:58 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1181.eqiad.wmnet with reason: Maintenance [production]
10:54 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: cluster::management [production]
10:53 <brouberol@cumin1001> END (PASS) - Cookbook sre.hadoop.roll-restart-masters (exit_code=0) restart masters for Hadoop test cluster: Restart of jvm daemons. [production]
10:51 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2118.codfw.wmnet with reason: Maintenance [production]
10:51 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2118.codfw.wmnet with reason: Maintenance [production]
10:51 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1181.eqiad.wmnet with reason: Maintenance [production]
10:50 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1181.eqiad.wmnet with reason: Maintenance [production]
10:45 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-role for role: cluster::management [production]
10:38 <kamila@deploy2002> helmfile [staging] DONE helmfile.d/services/mobileapps: apply [production]
10:38 <kamila@deploy2002> helmfile [staging] START helmfile.d/services/mobileapps: apply [production]
10:35 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2113.codfw.wmnet with reason: Maintenance [production]
10:34 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2113.codfw.wmnet with reason: Maintenance [production]
10:34 <kamila@deploy2002> helmfile [codfw] DONE helmfile.d/services/mw-api-int: apply [production]
10:34 <kamila@deploy2002> helmfile [codfw] START helmfile.d/services/mw-api-int: apply [production]
10:33 <kamila@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-api-int: apply [production]
10:33 <kamila@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-api-int: apply [production]
10:33 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1183.eqiad.wmnet with reason: Maintenance [production]
10:32 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1183.eqiad.wmnet with reason: Maintenance [production]
10:27 <brouberol@cumin1001> START - Cookbook sre.hadoop.roll-restart-masters restart masters for Hadoop test cluster: Restart of jvm daemons. [production]
10:23 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2114.codfw.wmnet with reason: Maintenance [production]
10:22 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2114.codfw.wmnet with reason: Maintenance [production]
10:22 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1173.eqiad.wmnet with reason: Maintenance [production]
10:22 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1173.eqiad.wmnet with reason: Maintenance [production]
09:42 <akosiaris@deploy2002> helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: apply [production]
09:42 <akosiaris@deploy2002> helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: apply [production]
09:41 <akosiaris@deploy2002> helmfile [eqiad] DONE helmfile.d/services/changeprop-jobqueue: apply [production]
09:40 <akosiaris@deploy2002> helmfile [eqiad] START helmfile.d/services/changeprop-jobqueue: apply [production]
09:40 <akosiaris@deploy2002> helmfile [staging] DONE helmfile.d/services/changeprop-jobqueue: apply [production]
09:39 <akosiaris@deploy2002> helmfile [staging] START helmfile.d/services/changeprop-jobqueue: apply [production]
08:52 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 31 days, 0:00:00 on sretest1001.eqiad.wmnet with reason: WIP nftables [production]
08:52 <jmm@cumin2002> START - Cookbook sre.hosts.downtime for 31 days, 0:00:00 on sretest1001.eqiad.wmnet with reason: WIP nftables [production]
08:32 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host bast4005.wikimedia.org [production]