5651-5700 of 10000 results (75ms)
2020-02-06 ยง
15:30 <vgutierrez> depool & reimage ncredir4001 as buster - T243391 [production]
15:29 <vgutierrez> depool & reimage cp4024 as buster - T242093 [production]
15:28 <vgutierrez> pooling ncredir4002 running buster - T243391 [production]
15:27 <moritzm> installing sudo security updates on jessie [production]
15:23 <vgutierrez> pooling cp4025 with buster - T242093 [production]
15:14 <ema> A:mw-api: force puppet run to increase keepalive_requests from 100 to 200 https://gerrit.wikimedia.org/r/#/c/operations/puppet/+/570670/ T241145 [production]
15:09 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
15:07 <vgutierrez@cumin1001> START - Cookbook sre.hosts.downtime [production]
14:59 <godog> extend graphite1004 / graphite2003 fs +200G [production]
14:56 <vgutierrez> depool and reimage ncredir4002 as buster - T243391 [production]
14:46 <vgutierrez> depool & reimage cp4025 as buster - T242093 [production]
14:16 <akosiaris> 20mins in with eventgate-analytics/eqiad depooled from discovery, no issues yet. [production]
14:14 <ema> run puppet on mw-api-canary to revert nginx keepalive_requests bump T241145 [production]
13:55 <marostegui> Stop MySQL on es1019, upgrade and poweroff for on-site maintenance - T243963 [production]
13:54 <akosiaris@cumin1001> conftool action : set/pooled=false; selector: name=eqiad,dnsdisc=eventgate-analytics [production]
13:53 <akosiaris> depool eqiad eventgate-analytics for testing purposes. Requests will flow to codfw, monitoring https://grafana.wikimedia.org/d/RIA1lzDZk/application-servers-red-dashboard?orgId=1&from=now-30m&to=now for issues. [production]
13:51 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool es1019 for onsite maintenance T243963', diff saved to https://phabricator.wikimedia.org/P10321 and previous config saved to /var/cache/conftool/dbconfig/20200206-135157-marostegui.json [production]
13:45 <XioNoX> rollback deactivate BGP transits on cr3-knams [production]
13:34 <elukey> repool mw1347 with mcrouter running with 10 proxy threads (was: 5) [production]
13:31 <XioNoX> reboot cr3-knams [production]
13:30 <elukey> depool mw1347 to test some mcrouter settings [production]
13:27 <XioNoX> deactivate BGP transits on cr3-knams [production]
13:22 <vgutierrez> Enable server session sharing on ats-tls in cp4031 - T244464 [production]
13:10 <XioNoX> rollback: deactivate BGP transits on cr2-eqsin [production]
13:00 <XioNoX> reboot cr2-eqsin for sw upgrade [production]
13:00 <addshore> SWAT done [production]
13:00 <addshore@deploy1001> Synchronized wmf-config/InitialiseSettings.php: resync REVERT Enable EntitySourceBasedFederation for group1 (duration: 01m 07s) [production]
12:59 <XioNoX> deactivate BGP transits on cr2-eqsin [production]
12:58 <addshore@deploy1001> Synchronized wmf-config/InitialiseSettings.php: REVERT Enable EntitySourceBasedFederation for group1 T243395, due to T244479 (duration: 01m 07s) [production]
12:52 <addshore@deploy1001> Synchronized wmf-config/InitialiseSettings.php: Enable EntitySourceBasedFederation for group1 T243395 (duration: 01m 06s) [production]
12:46 <addshore@deploy1001> Synchronized php-1.35.0-wmf.18/extensions/Babel: REVERT Fetch central babel information over SQL query, not API (T243726) (duration: 01m 07s) [production]
12:44 <addshore@deploy1001> sync-file aborted: Fetch central babel information over SQL query, not API (T243726) (duration: 01m 04s) [production]
12:40 <vgutierrez> pooling cp3065 - T242093 [production]
12:39 <addshore@deploy1001> Synchronized wmf-config/InitialiseSettings.php: Enable EntitySourceBasedFederation for group0 T243395 (duration: 01m 07s) [production]
12:34 <cparle@deploy1001> Synchronized wmf-config/InitialiseSettings.php: Re-enable delayed new upload jobs for MachineVision extension (duration: 01m 08s) [production]
12:26 <cparle@deploy1001> Synchronized wmf-config/InitialiseSettings.php: Remove handler deleted from the MachineVision extension (duration: 01m 05s) [production]
12:25 <XioNoX> remove full-duplex statement from eqsin Tata link (not supported on Junos 18, as 10G is full duplex anyway) [production]
12:24 <cparle@deploy1001> Synchronized php-1.35.0-wmf.18/extensions/MachineVision: Use the wbsetclaim API to add depicts statements (duration: 01m 09s) [production]
12:07 <urbanecm@deploy1001> Synchronized wmf-config/InitialiseSettings.php: SWAT: 5e1cbb2: Enable CX in te, kn, gu, mr and pawiki as a default tool (T243271, T243272, T243273, T243274, T243275) (duration: 01m 09s) [production]
11:41 <akosiaris> upgrade etherpad-lite on etherpad1002 to 1.8.0-1 [production]
11:38 <kart_> Updated cxserver to 2020-02-05-051751-production (T244230, T234323) [production]
11:35 <kartik@deploy1001> helmfile [EQIAD] Ran 'apply' command on namespace 'cxserver' for release 'production' . [production]
11:33 <akosiaris> upload etherpad-lite_1.8.0-1 to apt.wikimedia.org buster-wikimedia/main [production]
11:31 <kartik@deploy1001> helmfile [CODFW] Ran 'apply' command on namespace 'cxserver' for release 'production' . [production]
11:28 <kartik@deploy1001> helmfile [STAGING] Ran 'apply' command on namespace 'cxserver' for release 'staging' . [production]
11:14 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
11:11 <vgutierrez@cumin1001> START - Cookbook sre.hosts.downtime [production]
10:21 <akosiaris> undo "switchover selectively eventgate-analytics.discovery.wmnet to codfw for mw1331 and mw1348". no effect observed [production]
10:20 <akosiaris> undo "switchover selectively eventgate-analytics.discovery.wmnet to codfw for mw1331 and mw1348" [production]
10:19 <vgutierrez> Enabling HTTP keepalive between ats-tls and varnish-frontend on cp4031 - T244464 [production]