851-900 of 10000 results (34ms)
2021-02-19 ยง
18:35 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=mw1367.eqiad.wmnet [production]
18:32 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mw2272.codfw.wmnet [production]
18:30 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=mw1341.eqiad.wmnet [production]
18:30 <mutante> mw1367 - powercycled - stuck in reboot [production]
18:29 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=mw2272.codfw.wmnet [production]
18:16 <wm-bot> <lucaswerkmeister> deployed f66f631598 (auth improvements) [tools.lexeme-forms]
18:07 <Urbanecm> Password reset for User:Kolyma (T274737) [production]
17:36 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1341.eqiad.wmnet with reason: REIMAGE [production]
17:34 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw1341.eqiad.wmnet with reason: REIMAGE [production]
17:33 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2272.codfw.wmnet with reason: REIMAGE [production]
17:31 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw2272.codfw.wmnet with reason: REIMAGE [production]
17:29 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1367.eqiad.wmnet with reason: REIMAGE [production]
17:27 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw1367.eqiad.wmnet with reason: REIMAGE [production]
16:57 <robh@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-worker1141.eqiad.wmnet with reason: REIMAGE [production]
16:55 <robh@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-worker1140.eqiad.wmnet with reason: REIMAGE [production]
16:55 <robh@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1141.eqiad.wmnet with reason: REIMAGE [production]
16:53 <robh@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-worker1134.eqiad.wmnet with reason: REIMAGE [production]
16:53 <robh@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1140.eqiad.wmnet with reason: REIMAGE [production]
16:51 <robh@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1134.eqiad.wmnet with reason: REIMAGE [production]
15:56 <wm-bot> <bd808> Restarted to regain irc nick [tools.bridgebot]
15:53 <elukey> restart oozie again to test another setting for role/admins [analytics]
15:43 <ottomata> installing spark 2.4.4 without hadoop jars on analytics test cluster - T274384 [analytics]
15:31 <elukey> restart oozie to apply https://gerrit.wikimedia.org/r/c/operations/puppet/+/665352 [analytics]
14:34 <joal> rerun mobile_apps-uniques-daily-wf-2021-2-18 [analytics]
14:29 <mbsantos@deploy1001> Finished deploy [tilerator/deploy@937deb5]: (no justification provided) (duration: 00m 15s) [production]
14:28 <mbsantos@deploy1001> Started deploy [tilerator/deploy@937deb5]: (no justification provided) [production]
14:00 <akosiaris@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'echostore' for release 'production' . [production]
14:00 <akosiaris@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'echostore' for release 'staging' . [production]
13:51 <hashar> Reupdating tox jobs since https://gerrit.wikimedia.org/r/c/integration/config/+/664897 did not get merged [releng]
13:49 <hashar> Updating Jenkins jobs for "Remove dependency on Maven binaries and wrapper script." | https://gerrit.wikimedia.org/r/c/integration/config/+/651791/ [releng]
13:43 <akosiaris@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'echostore' for release 'staging' . [production]
13:43 <akosiaris@deploy1001> helmfile [eqiad] Ran 'sync' command on namespace 'echostore' for release 'production' . [production]
13:43 <akosiaris@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'echostore' for release 'production' . [production]
13:43 <akosiaris@deploy1001> helmfile [codfw] Ran 'sync' command on namespace 'echostore' for release 'staging' . [production]
13:43 <akosiaris@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'echostore' for release 'production' . [production]
13:43 <akosiaris@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'echostore' for release 'staging' . [production]
13:43 <akosiaris@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'echostore' for release 'staging' . [production]
13:43 <akosiaris@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'echostore' for release 'production' . [production]
13:42 <akosiaris@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'echostore' for release 'staging' . [production]
13:42 <akosiaris@deploy1001> helmfile [staging] Ran 'sync' command on namespace 'echostore' for release 'production' . [production]
13:41 <godog> reset-failed ifup@ens13 on prometheus5001 - T273026 [production]
13:39 <filippo@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host prometheus5001.eqsin.wmnet [production]
13:31 <gehel@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wdqs1010.eqiad.wmnet with reason: REIMAGE [production]
13:29 <gehel@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on wdqs1010.eqiad.wmnet with reason: REIMAGE [production]
13:22 <filippo@cumin1001> START - Cookbook sre.hosts.reboot-single for host prometheus5001.eqsin.wmnet [production]
12:42 <arturo> deploying new version of the ingress admission controller [toolsbeta]
12:31 <arturo> deploying new version of toolforge ingress admission controller [tools]
11:46 <arturo> merged https://gerrit.wikimedia.org/r/c/operations/puppet/+/662941 (T274139) which should only affect toolsbeta [toolsbeta]
10:27 <arturo> create DNS record `jobs.svc.toolsbeta.eqiad1.wikimedia.cloud` with CNAME to `k8s.toolsbeta.eqiad1.wikimedia.cloud` (T274139) [toolsbeta]
10:25 <arturo> create DNS zone `svc.toolsbeta.eqiad1.wikimedia.cloud` (T274139) [toolsbeta]