2020-02-06
ยง
|
15:52 |
<bblack> |
lvs5003 - restart pybal for dual bgp session config - T180069 |
[production] |
15:50 |
<moritzm> |
installing python-ecdsa security updates |
[production] |
15:50 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
15:41 |
<moritzm> |
installing jsoup security updates |
[production] |
15:30 |
<vgutierrez> |
depool & reimage ncredir4001 as buster - T243391 |
[production] |
15:29 |
<vgutierrez> |
depool & reimage cp4024 as buster - T242093 |
[production] |
15:28 |
<vgutierrez> |
pooling ncredir4002 running buster - T243391 |
[production] |
15:27 |
<moritzm> |
installing sudo security updates on jessie |
[production] |
15:23 |
<vgutierrez> |
pooling cp4025 with buster - T242093 |
[production] |
15:14 |
<ema> |
A:mw-api: force puppet run to increase keepalive_requests from 100 to 200 https://gerrit.wikimedia.org/r/#/c/operations/puppet/+/570670/ T241145 |
[production] |
15:09 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
15:07 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
14:59 |
<godog> |
extend graphite1004 / graphite2003 fs +200G |
[production] |
14:56 |
<vgutierrez> |
depool and reimage ncredir4002 as buster - T243391 |
[production] |
14:46 |
<vgutierrez> |
depool & reimage cp4025 as buster - T242093 |
[production] |
14:16 |
<akosiaris> |
20mins in with eventgate-analytics/eqiad depooled from discovery, no issues yet. |
[production] |
14:14 |
<ema> |
run puppet on mw-api-canary to revert nginx keepalive_requests bump T241145 |
[production] |
13:55 |
<marostegui> |
Stop MySQL on es1019, upgrade and poweroff for on-site maintenance - T243963 |
[production] |
13:54 |
<akosiaris@cumin1001> |
conftool action : set/pooled=false; selector: name=eqiad,dnsdisc=eventgate-analytics |
[production] |
13:53 |
<akosiaris> |
depool eqiad eventgate-analytics for testing purposes. Requests will flow to codfw, monitoring https://grafana.wikimedia.org/d/RIA1lzDZk/application-servers-red-dashboard?orgId=1&from=now-30m&to=now for issues. |
[production] |
13:51 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool es1019 for onsite maintenance T243963', diff saved to https://phabricator.wikimedia.org/P10321 and previous config saved to /var/cache/conftool/dbconfig/20200206-135157-marostegui.json |
[production] |
13:45 |
<XioNoX> |
rollback deactivate BGP transits on cr3-knams |
[production] |
13:34 |
<elukey> |
repool mw1347 with mcrouter running with 10 proxy threads (was: 5) |
[production] |
13:31 |
<XioNoX> |
reboot cr3-knams |
[production] |
13:30 |
<elukey> |
depool mw1347 to test some mcrouter settings |
[production] |
13:27 |
<XioNoX> |
deactivate BGP transits on cr3-knams |
[production] |
13:22 |
<vgutierrez> |
Enable server session sharing on ats-tls in cp4031 - T244464 |
[production] |
13:10 |
<XioNoX> |
rollback: deactivate BGP transits on cr2-eqsin |
[production] |
13:00 |
<XioNoX> |
reboot cr2-eqsin for sw upgrade |
[production] |
13:00 |
<addshore> |
SWAT done |
[production] |
13:00 |
<addshore@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: resync REVERT Enable EntitySourceBasedFederation for group1 (duration: 01m 07s) |
[production] |
12:59 |
<XioNoX> |
deactivate BGP transits on cr2-eqsin |
[production] |
12:58 |
<addshore@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: REVERT Enable EntitySourceBasedFederation for group1 T243395, due to T244479 (duration: 01m 07s) |
[production] |
12:52 |
<addshore@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Enable EntitySourceBasedFederation for group1 T243395 (duration: 01m 06s) |
[production] |
12:46 |
<addshore@deploy1001> |
Synchronized php-1.35.0-wmf.18/extensions/Babel: REVERT Fetch central babel information over SQL query, not API (T243726) (duration: 01m 07s) |
[production] |
12:44 |
<addshore@deploy1001> |
sync-file aborted: Fetch central babel information over SQL query, not API (T243726) (duration: 01m 04s) |
[production] |
12:40 |
<vgutierrez> |
pooling cp3065 - T242093 |
[production] |
12:39 |
<addshore@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Enable EntitySourceBasedFederation for group0 T243395 (duration: 01m 07s) |
[production] |
12:34 |
<cparle@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Re-enable delayed new upload jobs for MachineVision extension (duration: 01m 08s) |
[production] |
12:26 |
<cparle@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Remove handler deleted from the MachineVision extension (duration: 01m 05s) |
[production] |
12:25 |
<XioNoX> |
remove full-duplex statement from eqsin Tata link (not supported on Junos 18, as 10G is full duplex anyway) |
[production] |
12:24 |
<cparle@deploy1001> |
Synchronized php-1.35.0-wmf.18/extensions/MachineVision: Use the wbsetclaim API to add depicts statements (duration: 01m 09s) |
[production] |
12:07 |
<urbanecm@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: 5e1cbb2: Enable CX in te, kn, gu, mr and pawiki as a default tool (T243271, T243272, T243273, T243274, T243275) (duration: 01m 09s) |
[production] |
11:41 |
<akosiaris> |
upgrade etherpad-lite on etherpad1002 to 1.8.0-1 |
[production] |
11:38 |
<kart_> |
Updated cxserver to 2020-02-05-051751-production (T244230, T234323) |
[production] |
11:35 |
<kartik@deploy1001> |
helmfile [EQIAD] Ran 'apply' command on namespace 'cxserver' for release 'production' . |
[production] |
11:33 |
<akosiaris> |
upload etherpad-lite_1.8.0-1 to apt.wikimedia.org buster-wikimedia/main |
[production] |
11:31 |
<kartik@deploy1001> |
helmfile [CODFW] Ran 'apply' command on namespace 'cxserver' for release 'production' . |
[production] |
11:28 |
<kartik@deploy1001> |
helmfile [STAGING] Ran 'apply' command on namespace 'cxserver' for release 'staging' . |
[production] |
11:14 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |