2020-02-21
|
12:29 |
<mark> |
cr3-esams: Shutdown GRE tunnels over Telia |
[production] |
12:27 |
<akosiaris> |
repool mathoid at eqiad, test complete |
[production] |
12:27 |
<akosiaris@cumin1001> |
conftool action : set/pooled=true; selector: name=eqiad,dnsdisc=mathoid |
[production] |
12:20 |
<moritzm> |
rebooting boron |
[production] |
12:20 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
12:20 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.downtime |
[production] |
12:17 |
<moritzm> |
bumped memory for boron.eqiad.wmnet to 16G |
[production] |
12:04 |
<mark> |
cr3-esams: request chassis fpc offline slot 1 |
[production] |
11:57 |
<mark> |
Disabled Telia transit on cr3-esams |
[production] |
11:57 |
<mark> |
Set VRRP prio cost to 50 on cr3-esams to make it backup VRRP |
[production] |
11:48 |
<elukey> |
restart varnishkafka-webrequest on cp3052 (stuck in timeouts to kafka, analytics alarms raised) |
[production] |
11:47 |
<elukey> |
restart varnishkafka-webrequest on cp3056/cp3058/cp3054/cp3064 (stuck in timeouts to kafka, analytics alarms raised) |
[production] |
11:39 |
<elukey> |
restart varnishkafka on cp3057 (stuck in timeouts to kafka, analytics alarms raised) |
[production] |
11:21 |
<godog> |
bounce logstash on logstash1023 - see if can catch up with elastic7 kafka lag |
[production] |
11:14 |
<elukey> |
reboot stat1005 - GPU blocked at 100% after issue with tensorflow |
[production] |
09:18 |
<akosiaris> |
depool mathoid in eqiad for a test |
[production] |
09:18 |
<akosiaris@puppetmaster1001> |
conftool action : set/pooled=false; selector: name=eqiad,dnsdisc=mathoid |
[production] |
08:54 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1107 after 10.4 testing - T242702', diff saved to https://phabricator.wikimedia.org/P10473 and previous config saved to /var/cache/conftool/dbconfig/20200221-085405-marostegui.json |
[production] |
08:34 |
<fdans@deploy1001> |
Finished deploy [analytics/refinery@4d56021]: deploying refinery (duration: 14m 55s) |
[production] |
08:19 |
<fdans@deploy1001> |
Started deploy [analytics/refinery@4d56021]: deploying refinery |
[production] |
08:02 |
<akosiaris> |
disable mod_remoteip on otrs host, following merge of https://gerrit.wikimedia.org/r/573877 |
[production] |
06:58 |
<marostegui> |
Stop MySQL on labsdb1012 to clone labsdb1011 - T245797 |
[production] |
06:34 |
<marostegui> |
Stop mysql on es1024 to clone es1025 - T243052 |
[production] |
05:57 |
<marostegui> |
Start MySQL on labsdb1011 without replication - T245797 |
[production] |
05:44 |
<marostegui> |
Reload haproxy on dbproxy1010, dbproxy1011, dbproxy18 - T245797 |
[production] |
02:53 |
<bstorm_> |
depooled labsdb1011 and set weight 10 on labsdb1009 vs 3 on labsdb1010 T245797 |
[production] |
02:43 |
<ejegg> |
updated Fundraising CiviCRM from a6b222c19f to c086fd4e0b |
[production] |
02:27 |
<bstorm_> |
stopped mariadb on labsdb1011 because it keeps crashing anyway |
[production] |
01:05 |
<jforrester@deploy1001> |
Synchronized wmf-config/CommonSettings.php: Sync Beta-Cluster-only change to CommonSettings now we're sure we won't revert (duration: 00m 56s) |
[production] |
01:04 |
<andrew@deploy1001> |
Finished deploy [horizon/deploy@13ca90a]: Remove guided puppet config mode; this gets us back to working with latest puppet packages. (duration: 03m 32s) |
[production] |
01:01 |
<andrew@deploy1001> |
Started deploy [horizon/deploy@13ca90a]: Remove guided puppet config mode; this gets us back to working with latest puppet packages. |
[production] |
2020-02-20
|
23:50 |
<jforrester@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: T245787 [nlwiki] Add noindex for NS_USER and NS_USER_TALK (duration: 00m 56s) |
[production] |
23:46 |
<jforrester@deploy1001> |
Synchronized wmf-config/CommonSettings.php: Stop setting wgVectorPrintLogo for back-compat., not read since wmf.19 (duration: 00m 56s) |
[production] |
23:45 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw232[0-4].codfw.wmnet |
[production] |
23:45 |
<mutante> |
gerrit1002 - test VM - rebooting for new disk |
[production] |
23:33 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw231[7-9].codfw.wmnet |
[production] |
23:33 |
<dzahn@cumin1001> |
conftool action : set/weight=15; selector: name=mw232[0-4].codfw.wmnet |
[production] |
23:32 |
<dzahn@cumin1001> |
conftool action : set/weight=15; selector: name=mw231[7-9].codfw.wmnet |
[production] |
23:32 |
<dzahn@cumin1001> |
conftool action : set/weight=15; selector: name=mw2381[7-9].codfw.wmnet |
[production] |
23:25 |
<mutante> |
ganeti1003 - adding another virtual 20G disk to gerrit1002 (T243808) |
[production] |
23:14 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
23:12 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
23:04 |
<jforrester@deploy1001> |
Synchronized php-1.35.0-wmf.20/includes/pager/IndexPager.php: IndexPager: Limit offset params to the max of the indices available (duration: 00m 56s) |
[production] |
23:01 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
22:59 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
22:28 |
<ebernhardson> |
restart mjolnir-kafka-bulk-daemon across eqiad |
[production] |
22:28 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
22:28 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
22:28 |
<ebernhardson@deploy1001> |
Finished deploy [search/mjolnir/deploy@8908dd1]: daemons: Install stack printing signal handler on SIGUSR1 (duration: 05m 05s) |
[production] |