401-450 of 10000 results (29ms)
2021-08-06 ยง
16:34 <hnowlan@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on maps1005.eqiad.wmnet with reason: Awaiting reimaging, depooled. [production]
16:34 <hnowlan@cumin1001> START - Cookbook sre.hosts.downtime for 3 days, 0:00:00 on maps1005.eqiad.wmnet with reason: Awaiting reimaging, depooled. [production]
16:30 <mwdebug-deploy@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
16:30 <dzahn@cumin1001> START - Cookbook sre.hosts.decommission for hosts peek2001.codfw.wmnet [production]
16:29 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8 days, 4:00:00 on peek2001.codfw.wmnet with reason: decom [production]
16:29 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 8 days, 4:00:00 on peek2001.codfw.wmnet with reason: decom [production]
16:03 <mwdebug-deploy@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
16:02 <mwdebug-deploy@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
15:14 <hnowlan> removing maps1005 from old maps cassandra cluster before reimaging [production]
14:35 <hnowlan@puppetmaster1001> conftool action : set/pooled=no; selector: name=maps1005.eqiad.wmnet [production]
14:29 <hnowlan@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on maps2005.codfw.wmnet with reason: Reimaging [production]
14:29 <hnowlan@cumin1001> START - Cookbook sre.hosts.downtime for 3:00:00 on maps2005.codfw.wmnet with reason: Reimaging [production]
14:26 <hnowlan@cumin2002> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on maps2005.codfw.wmnet with reason: REIMAGE [production]
14:24 <hnowlan@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on maps2005.codfw.wmnet with reason: REIMAGE [production]
13:35 <hnowlan@puppetmaster1001> conftool action : set/pooled=yes; selector: name=maps1006.eqiad.wmnet [production]
13:07 <zpapierski@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'rdf-streaming-updater' for release 'main' . [production]
12:56 <godog> test thanos 0.22 on thanos-fe2001 - T288326 [production]
12:48 <oblivian@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
12:34 <oblivian@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
12:26 <oblivian@deploy1002> helmfile [codfw] DONE helmfile.d/admin 'sync'. [production]
12:25 <oblivian@deploy1002> helmfile [codfw] START helmfile.d/admin 'sync'. [production]
12:25 <oblivian@deploy1002> helmfile [codfw] DONE helmfile.d/admin 'apply'. [production]
12:25 <oblivian@deploy1002> helmfile [codfw] START helmfile.d/admin 'apply'. [production]
12:23 <oblivian@deploy1002> helmfile [eqiad] DONE helmfile.d/admin 'sync'. [production]
12:22 <oblivian@deploy1002> helmfile [eqiad] START helmfile.d/admin 'sync'. [production]
12:22 <oblivian@deploy1002> helmfile [eqiad] DONE helmfile.d/admin 'sync'. [production]
12:22 <oblivian@deploy1002> helmfile [eqiad] START helmfile.d/admin 'sync'. [production]
12:21 <oblivian@deploy1002> helmfile [eqiad] DONE helmfile.d/admin 'sync'. [production]
12:21 <oblivian@deploy1002> helmfile [eqiad] START helmfile.d/admin 'sync'. [production]
12:20 <oblivian@deploy1002> helmfile [eqiad] DONE helmfile.d/admin 'apply'. [production]
12:20 <oblivian@deploy1002> helmfile [eqiad] START helmfile.d/admin 'apply'. [production]
11:45 <jayme> enabling dragonfly dfdaemon on kubernetes200* [production]
11:16 <hnowlan@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on maps1006.eqiad.wmnet with reason: REIMAGE [production]
11:14 <hnowlan@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on maps1006.eqiad.wmnet with reason: REIMAGE [production]
10:16 <kormat@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1181.eqiad.wmnet with reason: REIMAGE [production]
10:14 <kormat@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on db1181.eqiad.wmnet with reason: REIMAGE [production]
09:58 <kormat> reimaging db1181 (s7) to buster T288244 [production]
09:15 <hnowlan@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on maps2005.codfw.wmnet with reason: Rebuilding as buster replica of maps1009 [production]
09:15 <hnowlan@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on maps2005.codfw.wmnet with reason: Rebuilding as buster replica of maps1009 [production]
09:15 <hnowlan@puppetmaster1001> conftool action : set/pooled=no; selector: name=maps2005.codfw.wmnet [production]
09:14 <dcausse@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'rdf-streaming-updater' for release 'main' . [production]
08:38 <elukey@deploy1002> helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. [production]
08:38 <elukey@deploy1002> helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. [production]
08:31 <elukey@deploy1002> helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. [production]
08:30 <elukey@deploy1002> helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. [production]
08:10 <elukey@deploy1002> helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. [production]
08:09 <elukey@deploy1002> helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. [production]
07:58 <godog> test thanos 0.21 on thanos-fe2001 - T288326 [production]
07:48 <elukey@deploy1002> helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. [production]
07:48 <elukey@deploy1002> helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. [production]