2019-11-27
§
|
16:22 |
<ejegg> |
shifted daily silverpop export start time one hour earlier |
[production] |
16:15 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1080 for schema change', diff saved to https://phabricator.wikimedia.org/P9768 and previous config saved to /var/cache/conftool/dbconfig/20191127-161525-marostegui.json |
[production] |
16:14 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repool db1089 after schema change', diff saved to https://phabricator.wikimedia.org/P9767 and previous config saved to /var/cache/conftool/dbconfig/20191127-161450-marostegui.json |
[production] |
16:06 |
<ema> |
cp3050: set proxy.config.http.server_session_sharing.match to "ip" T238494 |
[production] |
15:57 |
<_joe_> |
restarting pybal on lvs1015 |
[production] |
15:56 |
<jiji@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
15:56 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
15:55 |
<_joe_> |
restarting pybal on lvs1016 |
[production] |
15:52 |
<jynus> |
disabling puppet on dbprov1001 to test bacula restore T238048 |
[production] |
15:47 |
<papaul> |
testing redundancy power on scs-a1-codfw |
[production] |
15:47 |
<_joe_> |
restarting pybal on lvs2003 |
[production] |
15:44 |
<_joe_> |
restarting pybal again on lvs2006 |
[production] |
15:42 |
<jynus> |
migrate db entries of archive Media to backup1001 T238048 |
[production] |
15:37 |
<marostegui> |
Logging retroactively for the record: drop user 'nova'@'%' from m5 - T239170 |
[production] |
15:30 |
<jiji@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
15:30 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
15:29 |
<jiji@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
15:29 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
15:29 |
<marostegui> |
Add grants for dump (10.192.0.114,10.192.16.96) for nova_cell0_eqiad database on db1117:3325 and db2078:3325 - T239170 |
[production] |
15:27 |
<marostegui> |
Add grants for dump (10.64.0.95,10.64.16.31) for nova_cell0_eqiad database on db1117:3325 and db2078:3325 - T239170 |
[production] |
15:25 |
<_joe_> |
restarting lvs2006 for addition of eventgate-logging-external,blubberoid-https |
[production] |
15:24 |
<moritzm> |
installing freetype bugfix updates from Buster 10.2 point release |
[production] |
15:21 |
<oblivian@cumin1001> |
conftool action : set/weight=10:pooled=yes; selector: service=eventgate-logging-external |
[production] |
15:14 |
<jiji@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
15:14 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
15:11 |
<moritzm> |
downgrading trapperkeeper-webserver-jetty9-clojure packages on puppetdb hosts to the version shipped in Buster 10.2 |
[production] |
15:06 |
<jiji@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
15:06 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
15:04 |
<ema> |
cp-ats: rolling ats-{tls,backend} restart to enable lua reload T233274 |
[production] |
15:02 |
<moritzm> |
remove trapperkeeper-webserver-jetty9-clojure debs from apt.wikimedia.org/buster-wikimedia (these were needed to unbreak TLS on Puppetdb in Buster, but an update landed in Buster 10.2, which replaces our custom hotfix) |
[production] |
14:56 |
<marostegui> |
Add new grants for nova_cell0 database on m5 - T239170 |
[production] |
14:50 |
<marostegui> |
Create nova_cell0 database on m5 master - T239170 |
[production] |
14:43 |
<effie> |
reimage mw1346, mw1336, mw1326 |
[production] |
14:35 |
<jiji@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
14:33 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
14:15 |
<effie> |
reimage mw2285, mw2284, mw2283 |
[production] |
14:14 |
<effie> |
reimage mw2285, mw2286, mw2283 |
[production] |
14:01 |
<moritzm> |
temporarily stop cas on idp1001 for some failover tests |
[production] |
14:00 |
<jmm@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
14:00 |
<jmm@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
13:57 |
<ladsgroup@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Set all of testwikidatawiki to read from the new term store for items (T225057) (duration: 00m 56s) |
[production] |
13:44 |
<jiji@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
13:44 |
<jiji@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
13:42 |
<jiji@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
13:42 |
<jiji@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
13:42 |
<jiji@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
13:42 |
<ema> |
cp1075: repool with tslua reloads enabled T233274 |
[production] |
13:42 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
13:42 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
13:41 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |