2021-07-19
17:42 <ryankemper> [Elastic] Noted `Jul 16 18:31:20 elastic2038 elasticsearch[957]: 2021-07-16 18:31:20,657 main ERROR Unknown GELF server hostname:udp:logstash.svc.eqiad.wmnet` in the elasticsearch service logs (the unit had been running for 2 days), hence the restart of the elasticsearch service [production]
17:41 <ryankemper> [Elastic] Restarted elasticsearch services on `elastic2038`; afterwards restarted prometheus exporters; no units failed any longer [production]
17:30 <volans> running puppet on elastic2038 after network was restored [production]
17:26 <mbsantos@deploy1002> Finished deploy [kartotherian/deploy@978b674]: (no justification provided) (duration: 00m 14s) [production]
17:26 <mbsantos@deploy1002> Started deploy [kartotherian/deploy@978b674]: (no justification provided) [production]
17:26 <mbsantos@deploy1002> Finished deploy [kartotherian/deploy@978b674]: (no justification provided) (duration: 00m 16s) [production]
17:25 <mbsantos@deploy1002> Started deploy [kartotherian/deploy@978b674]: (no justification provided) [production]
17:25 <mbsantos@deploy1002> Finished deploy [kartotherian/deploy@978b674]: (no justification provided) (duration: 00m 21s) [production]
17:25 <mbsantos@deploy1002> Started deploy [kartotherian/deploy@978b674]: (no justification provided) [production]
17:24 <mbsantos@deploy1002> Finished deploy [kartotherian/deploy@978b674]: (no justification provided) (duration: 00m 21s) [production]
17:24 <mbsantos@deploy1002> Started deploy [kartotherian/deploy@978b674]: (no justification provided) [production]
17:23 <mbsantos@deploy1002> Finished deploy [kartotherian/deploy@978b674]: (no justification provided) (duration: 00m 21s) [production]
17:23 <volans> running authdns-update to force-update authdns2001 [production]
17:23 <mbsantos@deploy1002> Started deploy [kartotherian/deploy@978b674]: (no justification provided) [production]
17:23 <volans@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
17:21 <XioNoX> remove ns1 redirect - T286787 [production]
17:19 <volans@cumin2002> START - Cookbook sre.dns.netbox [production]
17:17 <volans@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
17:14 <volans@cumin2002> START - Cookbook sre.dns.netbox [production]
17:13 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts mw[1286-1287].eqiad.wmnet [production]
17:10 <XioNoX> enable asw-a2-codfw access ports - T286787 [production]
17:04 <XioNoX> enable cr1-codfw / et-0/0/0 - T286787 [production]
16:54 <brennen> gerrit up and running with manual configuration edit to use ipv4 address [production]
16:51 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=logstash2021.codfw.wmnet [production]
16:51 <dzahn@cumin1001> START - Cookbook sre.hosts.decommission for hosts mw[1286-1287].eqiad.wmnet [production]
16:46 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts mw1284.eqiad.wmnet [production]
16:40 <dancy@deploy1002> Finished deploy [gerrit/gerrit@4f29981]: Gerrit to 3.2.11 on gerrit1001 (duration: 00m 08s) [production]
16:40 <hashar> Upgrading gerrit1001 with dancy & brennen [production]
16:40 <dancy@deploy1002> Started deploy [gerrit/gerrit@4f29981]: Gerrit to 3.2.11 on gerrit1001 [production]
16:40 <XioNoX> update asw-a2-codfw serial number - T286787 [production]
16:39 <dcausse@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'rdf-streaming-updater' for release 'main' . [production]
16:33 <dzahn@cumin1001> START - Cookbook sre.hosts.decommission for hosts mw1284.eqiad.wmnet [production]
16:31 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on gerrit1001.wikimedia.org with reason: maintenance [production]
16:31 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on gerrit1001.wikimedia.org with reason: maintenance [production]
16:31 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on gerrit2001.wikimedia.org with reason: maintenance [production]
16:31 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on gerrit2001.wikimedia.org with reason: maintenance [production]
16:24 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on logstash2021.codfw.wmnet with reason: maintenace [production]
16:24 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on logstash2021.codfw.wmnet with reason: maintenace [production]
16:21 <jgiannelos@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'tegola-vector-tiles' for release 'main' . [production]
16:21 <hashar> upgrading gerrit replica on gerrit2001 and restarting [production]
16:21 <mutante> depooled logstash2021 for dcops maintenance work [production]
16:20 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=logstash2021.codfw.wmnet [production]
16:19 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=mw128[6-7].eqiad.wmnet [production]
16:19 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=mw1284.eqiad.wmnet [production]
16:18 <dancy@deploy1002> Finished deploy [gerrit/gerrit@4f29981]: Gerrit to 3.2.11 on gerrit2001 (duration: 00m 10s) [production]
16:18 <dancy@deploy1002> Started deploy [gerrit/gerrit@4f29981]: Gerrit to 3.2.11 on gerrit2001 [production]
16:15 <krinkle@deploy1002> Synchronized wmf-config/InitialiseSettings.php: 310be45f7 (duration: 00m 57s) [production]
16:12 <mutante> mw1434, mw1435, mw1436 - new API appservers in production, pooled first time [production]
16:11 <dzahn@cumin1001> conftool action : set/weight=30; selector: name=mw143[4-6].eqiad.wmnet [production]
16:08 <dzahn@cumin1001> conftool action : set/pooled=yes; selector: name=mw143[5-6].eqiad.wmnet [production]