2017-07-18
|
23:53 |
<mutante> |
netmon1002 - copied Letsencrypt cert/key for librenms from netmon1001 for migration after netmon1002 has been reinstalled and now has RAID. (T159756) |
[production] |
23:40 |
<thcipriani@tin> |
Synchronized wmf-config/InterwikiSortOrders.php: SWAT: [[gerrit:365451|Add din to InterwikiSortOrders]] T168518 (duration: 00m 46s) |
[production] |
23:35 |
<thcipriani@tin> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:365942|Add Welsh mobile logo (just changes 'k' to 'c']] PART II (duration: 00m 46s) |
[production] |
23:34 |
<thcipriani@tin> |
Synchronized static/images/mobile/copyright/wikipedia-wordmark-cy.svg: SWAT: [[gerrit:365942|Add Welsh mobile logo (just changes 'k' to 'c']] PART I (duration: 00m 47s) |
[production] |
23:27 |
<thcipriani@tin> |
Synchronized php-1.30.0-wmf.9/extensions/Thanks/extension.json: SWAT: [[gerrit:366168|Add missing jQueryMsg dependency for mobile diff view]] T170917 (duration: 00m 47s) |
[production] |
23:22 |
<thcipriani@tin> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:360371|Enable OOjs UI EditPage buttons on all Wikipedias]] T162849 (duration: 00m 47s) |
[production] |
23:13 |
<thcipriani@tin> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:365884|Enable CodeMirror on simplewiki for better testing and more exposure]] (duration: 00m 48s) |
[production] |
22:58 |
<thcipriani> |
restarted jobrunner on mw1299.eqiad.wmnet mw1168.eqiad.wmnet mw1164.eqiad.wmnet mw1305.eqiad.wmnet mw1304.eqiad.wmnet mw1301.eqiad.wmnet mw1259.eqiad.wmnet mw1166.eqiad.wmnet mw1300.eqiad.wmnet |
[production] |
22:42 |
<krinkle@tin> |
Finished deploy [jobrunner/jobrunner@5f6099f]: (no justification provided) (duration: 08m 18s) |
[production] |
22:34 |
<krinkle@tin> |
Started deploy [jobrunner/jobrunner@5f6099f]: (no justification provided) |
[production] |
22:02 |
<krinkle@tin> |
Finished deploy [jobrunner/jobrunner@5f6099f]: (no justification provided) (duration: 07m 58s) |
[production] |
21:54 |
<krinkle@tin> |
Started deploy [jobrunner/jobrunner@5f6099f]: (no justification provided) |
[production] |
21:43 |
<Krinkle> |
Attempt to deploy mediawiki/services/jobrunner – https://gerrit.wikimedia.org/r/#/c/349364/ - failed. |
[production] |
19:56 |
<dzahn@neodymium> |
conftool action : set/pooled=yes; selector: name=mw2202.codfw.wmnet |
[production] |
19:48 |
<robh> |
starting wipe on cp400[1-4] per T169020 |
[production] |
19:15 |
<demon@tin> |
rebuilt wikiversions.php and synchronized wikiversions files: group0 to wmf.10 |
[production] |
18:59 |
<demon@tin> |
Synchronized php-1.30.0-wmf.9/extensions/MobileFrontend/extension.json: One (more) last thing (duration: 02m 49s) |
[production] |
18:51 |
<demon@tin> |
Synchronized php-1.30.0-wmf.9/extensions/MobileFrontend/extension.json: One last thing (duration: 02m 55s) |
[production] |
18:42 |
<mutante> |
netmon1002 - reinstall OS - didn't use the right partman recipe - didn't have md0 - revoke old puppet cert, salt-key, scheduled downtime, services over at netmon2001 |
[production] |
18:36 |
<mutante> |
mw2202 - scheduled downtime - mainboard replacement |
[production] |
18:36 |
<ejegg> |
updated payments-wiki from bdc52265d78c55cfc6a732f14519f5f79c9d2d94 to c3be2bfd8f2b9f9eac4c80b45096713c7fdcceff |
[production] |
18:29 |
<demon@tin> |
Finished scap: mobilefrontend wmf.9 + forced l10n rebuild (duration: 20m 53s) |
[production] |
18:26 |
<mutante> |
mw2202 - remove /etc/udev/rules.d/70-persistent-net.rules for mainboard replacement - to detect new NICs with new MACs (T170307) |
[production] |
18:24 |
<dzahn@neodymium> |
conftool action : set/pooled=no; selector: name=mw2202.codfw.wmnet |
[production] |
18:08 |
<demon@tin> |
Started scap: mobilefrontend wmf.9 + forced l10n rebuild |
[production] |
18:02 |
<ottomata> |
stopping kafka on kafka1012 again, I think we swapped the wrong disk T168927 |
[production] |
17:55 |
<awight@tin> |
Finished deploy [ores/deploy@1d35aa5]: T170485 (duration: 35m 06s) |
[production] |
17:47 |
<mutante> |
smokeping - switched to netmon2001 - ping times to codfw hosts went down - ping times to eqiad hosts went up - since service is on both but data has been synced over |
[production] |
17:41 |
<demon@tin> |
Synchronized wmf-config/InitialiseSettings.php: labtest typofix for tgr (duration: 00m 46s) |
[production] |
17:21 |
<mobrovac@tin> |
Finished deploy [parsoid/deploy@1eaa07e]: Bring wtp2019 up to date and repool it - T146113 (duration: 01m 02s) |
[production] |
17:20 |
<mobrovac@tin> |
Started deploy [parsoid/deploy@1eaa07e]: Bring wtp2019 up to date and repool it - T146113 |
[production] |
17:20 |
<awight@tin> |
Started deploy [ores/deploy@1d35aa5]: T170485 |
[production] |
17:18 |
<demon@tin> |
Finished scap: testwiki to wmf.10 + l10n cache build (duration: 24m 23s) |
[production] |
17:16 |
<ottomata> |
stopping kafka broker on kafka1012 to replace disk T168927 |
[production] |
16:53 |
<demon@tin> |
Started scap: testwiki to wmf.10 + l10n cache build |
[production] |
16:45 |
<oblivian@tin> |
Started deploy [search/MjoLniR@0140aed]: init |
[production] |
16:44 |
<oblivian@tin> |
Started deploy [search/MjoLniR@0140aed]: (no justification provided) |
[production] |
16:40 |
<demon@tin> |
Pruned MediaWiki: 1.30.0-wmf.7 [keeping static files] (duration: 06m 06s) |
[production] |
16:31 |
<godog> |
finish rollout of thumbor 1.1 in eqiad - T170677 |
[production] |
16:00 |
<marostegui> |
Deploy alter table on s1 - labsdb1003 - T166204 |
[production] |
15:59 |
<ema> |
power-cycle cp2017, stuck rebooting |
[production] |
15:45 |
<tgr@tin> |
Synchronized wmf-config/InitialiseSettings.php: T170863 deploy TemplateStyles to some non-content wikis (all target wikis) (duration: 00m 45s) |
[production] |
15:37 |
<tgr@tin> |
Finished scap: T170863 deploy TemplateStyles to some non-content wikis (first step: testwiki/labstestwiki only) (forcing; canary errors are unrelated) (duration: 10m 19s) |
[production] |
15:26 |
<tgr@tin> |
Started scap: T170863 deploy TemplateStyles to some non-content wikis (first step: testwiki/labstestwiki only) (forcing; canary errors are unrelated) |
[production] |
15:14 |
<marostegui> |
Stop MySQL and shutdown pc2006 for mainboard replacement - T170520 |
[production] |
15:08 |
<tgr@tin> |
scap failed: RuntimeError scap failed: average error rate on 1/11 canaries increased by 10x (rerun with --force to override this check, see https://logstash.wikimedia.org/goto/3888cca979647b9381a7739b0bdbc88e for details) (duration: 09m 42s) |
[production] |
15:07 |
<tgr@tin> |
scap failed: average error rate on 1/11 canaries increased by 10x (rerun with --force to override this check, see https://logstash.wikimedia.org/goto/3888cca979647b9381a7739b0bdbc88e for details) |
[production] |
14:58 |
<tgr@tin> |
Started scap: T170863 deploy TemplateStyles to some non-content wikis (first step: testwiki/labstestwiki only) |
[production] |
14:55 |
<godog> |
upload and roll-upgrade thumbor to 1.1 - T170677 |
[production] |
14:44 |
<zeljkof> |
EU SWAT finished! |
[production] |