2019-12-04
ยง
|
19:43 |
<twentyafterfour@deploy1001> |
Finished deploy [phabricator/deployment@e4e2b22]: deploy phabricator to phab2001.codfw.wmnet (duration: 00m 31s) |
[production] |
19:43 |
<twentyafterfour@deploy1001> |
Started deploy [phabricator/deployment@e4e2b22]: deploy phabricator to phab2001.codfw.wmnet |
[production] |
19:38 |
<milimetric@deploy1001> |
deploy aborted: Weekly train deploy (duration: 00m 21s) |
[production] |
19:38 |
<milimetric@deploy1001> |
Started deploy [analytics/refinery@c8de2ab]: Weekly train deploy |
[production] |
19:21 |
<Amir1> |
morning SWAT is done |
[production] |
19:19 |
<rzl@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
19:17 |
<ladsgroup@deploy1001> |
Synchronized php-1.35.0-wmf.8/extensions/Wikibase/repo/includes/ParserOutput/FullEntityParserOutputGenerator.php: SWAT: [[gerrit:554330|Remove no-op 'jquery.ui.core.styles' from FullEntityParserOutputGenerator]] (T219604 T239594) (duration: 01m 06s) |
[production] |
19:16 |
<rzl@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
18:55 |
<bblack> |
dns1001: back to normal again |
[production] |
18:54 |
<bblack> |
dns1001: stop bird.service again, briefly |
[production] |
18:52 |
<otto@deploy1001> |
helmfile [EQIAD] Ran 'apply' command on namespace 'eventgate-logging-external' for release 'logging-external' . |
[production] |
18:50 |
<otto@deploy1001> |
helmfile [CODFW] Ran 'apply' command on namespace 'eventgate-logging-external' for release 'logging-external' . |
[production] |
18:49 |
<otto@deploy1001> |
helmfile [STAGING] Ran 'apply' command on namespace 'eventgate-logging-external' for release 'logging-external' . |
[production] |
18:46 |
<bblack> |
dns1001: restart bird.service |
[production] |
18:45 |
<arlolra> |
Updated Parsoid to b81bbf4 (T239643, T239830, T238456, T239841) |
[production] |
18:41 |
<bblack> |
dns1001: stopping just bird |
[production] |
18:32 |
<arlolra@deploy1001> |
Finished deploy [parsoid/deploy@0910e18]: Updating Parsoid to b81bbf4 (duration: 08m 11s) |
[production] |
18:24 |
<arlolra@deploy1001> |
Started deploy [parsoid/deploy@0910e18]: Updating Parsoid to b81bbf4 |
[production] |
18:08 |
<bblack> |
dns1002: back to normal state |
[production] |
18:05 |
<bblack> |
dns1002: stopping recursive dns to test failure theory (same method as prere-imaging earlier, intended to not cause impact) |
[production] |
17:54 |
<bblack> |
dns1001: back to normal state |
[production] |
17:51 |
<bblack> |
dns1001: stopping recursive dns to test failure theory (same method as prere-imaging earlier, intended to not cause impact) |
[production] |
17:50 |
<ladsgroup@deploy1001> |
Synchronized php-1.35.0-wmf.5/extensions/Wikibase/repo/includes/ParserOutput/FullEntityParserOutputGenerator.php: T229407, part III (duration: 01m 01s) |
[production] |
17:25 |
<bblack@cumin1001> |
conftool action : set/pooled=yes; selector: name=dns[12]001.wikimedia.org |
[production] |
17:25 |
<_joe_> |
repooling mw1348 |
[production] |
17:21 |
<_joe_> |
depooling mw1348 for debugging |
[production] |
17:15 |
<jynus> |
killing dump threads on db1118 T143870 |
[production] |
17:13 |
<bblack@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
17:11 |
<bblack@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
17:09 |
<bblack@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
17:07 |
<bblack@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
16:49 |
<bblack@cumin1001> |
conftool action : set/pooled=no; selector: name=dns[12]001.wikimedia.org |
[production] |
16:48 |
<bblack> |
dns[12]001 - reimaging to buster |
[production] |
16:48 |
<rzl@cumin1001> |
conftool action : set/pooled=yes; selector: dc=codfw,service=nginx,name=mw2267.codfw.wmnet,cluster=jobrunner |
[production] |
16:48 |
<rzl@cumin1001> |
conftool action : set/pooled=yes; selector: dc=codfw,service=apache2,name=mw2267.codfw.wmnet,cluster=jobrunner |
[production] |
16:48 |
<rzl@cumin1001> |
conftool action : set/pooled=yes; selector: dc=codfw,service=nginx,name=mw2267.codfw.wmnet,cluster=videoscaler |
[production] |
16:48 |
<rzl@cumin1001> |
conftool action : set/pooled=yes; selector: dc=codfw,service=apache2,name=mw2267.codfw.wmnet,cluster=videoscaler |
[production] |
16:48 |
<rzl@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2272.codfw.wmnet,dc=codfw,service=nginx,cluster=appserver |
[production] |
16:48 |
<rzl@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2272.codfw.wmnet,dc=codfw,service=apache2,cluster=appserver |
[production] |
16:48 |
<rzl@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2273.codfw.wmnet,dc=codfw,cluster=appserver,service=nginx |
[production] |
16:48 |
<rzl@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2273.codfw.wmnet,dc=codfw,cluster=appserver,service=apache2 |
[production] |
16:33 |
<ejegg> |
updated fundraising CiviCRM from 970b7b214b to 6812488f3a |
[production] |
16:32 |
<effie> |
enagle puppet on mwdebug1001 |
[production] |
16:32 |
<effie> |
enagle puppet on mw1348 |
[production] |
16:30 |
<rzl@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
16:28 |
<rzl@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
16:25 |
<effie> |
disable puppet on mw1348 |
[production] |
15:57 |
<papaul> |
rebooting ms-fe2007 for HW maintenance |
[production] |
15:49 |
<rzl@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
15:47 |
<rzl@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |