2019-11-27
ยง
|
19:23 |
<ebernhardson@deploy1001> |
Started deploy [search/airflow@f3bad9d]: revert adding mysqlclient python package |
[production] |
19:08 |
<ebernhardson@deploy1001> |
Finished deploy [search/airflow@57f4caa]: Install mysqlclient to airflow instance (duration: 00m 40s) |
[production] |
19:08 |
<ebernhardson@deploy1001> |
Started deploy [search/airflow@57f4caa]: Install mysqlclient to airflow instance |
[production] |
19:00 |
<mutante> |
an-airflow1001: cd /etc/ ; chown airflow airflow; systemctl start airflow-webserver to let airflow write unittests.cfg (it tries to write this on first start and did not have permissions to do so) T236180 |
[production] |
18:58 |
<mutante> |
an-airflow1001: cd /etc/ ; chown airflow airflow; systemctl start airflow-webserver to let airflow write unittests.cfg |
[production] |
18:57 |
<eileen> |
process-control config revision is b95355c0c0 - repair omnirecipient job off |
[production] |
16:57 |
<andrewbogott> |
disabling puppet on clouvirt* and cloudcontrol* while merging https://gerrit.wikimedia.org/r/#/c/operations/puppet/+/552894/ |
[production] |
16:50 |
<oblivian@puppetmaster1001> |
conftool action : set/pooled=true; selector: dnsdisc=eventgate-logging-external |
[production] |
16:32 |
<cdanis@deploy1001> |
Synchronized wmf-config/PoolCounterSettings.php: dd4c76d3d SpecialContributions: max concurrency 3 (instead of 10) T234450 (duration: 01m 17s) |
[production] |
16:22 |
<ejegg> |
shifted daily silverpop export start time one hour earlier |
[production] |
16:15 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1080 for schema change', diff saved to https://phabricator.wikimedia.org/P9768 and previous config saved to /var/cache/conftool/dbconfig/20191127-161525-marostegui.json |
[production] |
16:14 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repool db1089 after schema change', diff saved to https://phabricator.wikimedia.org/P9767 and previous config saved to /var/cache/conftool/dbconfig/20191127-161450-marostegui.json |
[production] |
16:06 |
<ema> |
cp3050: set proxy.config.http.server_session_sharing.match to "ip" T238494 |
[production] |
15:57 |
<_joe_> |
restarting pybal on lvs1015 |
[production] |
15:56 |
<jiji@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
15:56 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
15:55 |
<_joe_> |
restarting pybal on lvs1016 |
[production] |
15:52 |
<jynus> |
disabling puppet on dbprov1001 to test bacula restore T238048 |
[production] |
15:47 |
<papaul> |
testing redundancy power on scs-a1-codfw |
[production] |
15:47 |
<_joe_> |
restarting pybal on lvs2003 |
[production] |
15:44 |
<_joe_> |
restarting pybal again on lvs2006 |
[production] |
15:42 |
<jynus> |
migrate db entries of archive Media to backup1001 T238048 |
[production] |
15:37 |
<marostegui> |
Logging retroactively for the record: drop user 'nova'@'%' from m5 - T239170 |
[production] |
15:30 |
<jiji@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
15:30 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
15:29 |
<jiji@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
15:29 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
15:29 |
<marostegui> |
Add grants for dump (10.192.0.114,10.192.16.96) for nova_cell0_eqiad database on db1117:3325 and db2078:3325 - T239170 |
[production] |
15:27 |
<marostegui> |
Add grants for dump (10.64.0.95,10.64.16.31) for nova_cell0_eqiad database on db1117:3325 and db2078:3325 - T239170 |
[production] |
15:25 |
<_joe_> |
restarting lvs2006 for addition of eventgate-logging-external,blubberoid-https |
[production] |
15:24 |
<moritzm> |
installing freetype bugfix updates from Buster 10.2 point release |
[production] |
15:21 |
<oblivian@cumin1001> |
conftool action : set/weight=10:pooled=yes; selector: service=eventgate-logging-external |
[production] |
15:14 |
<jiji@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
15:14 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
15:11 |
<moritzm> |
downgrading trapperkeeper-webserver-jetty9-clojure packages on puppetdb hosts to the version shipped in Buster 10.2 |
[production] |
15:06 |
<jiji@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
15:06 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
15:04 |
<ema> |
cp-ats: rolling ats-{tls,backend} restart to enable lua reload T233274 |
[production] |
15:02 |
<moritzm> |
remove trapperkeeper-webserver-jetty9-clojure debs from apt.wikimedia.org/buster-wikimedia (these were needed to unbreak TLS on Puppetdb in Buster, but an update landed in Buster 10.2, which replaces our custom hotfix) |
[production] |
14:56 |
<marostegui> |
Add new grants for nova_cell0 database on m5 - T239170 |
[production] |
14:50 |
<marostegui> |
Create nova_cell0 database on m5 master - T239170 |
[production] |
14:43 |
<effie> |
reimage mw1346, mw1336, mw1326 |
[production] |
14:35 |
<jiji@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
14:33 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
14:15 |
<effie> |
reimage mw2285, mw2284, mw2283 |
[production] |
14:14 |
<effie> |
reimage mw2285, mw2286, mw2283 |
[production] |
14:01 |
<moritzm> |
temporarily stop cas on idp1001 for some failover tests |
[production] |
14:00 |
<jmm@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
14:00 |
<jmm@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
13:57 |
<ladsgroup@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Set all of testwikidatawiki to read from the new term store for items (T225057) (duration: 00m 56s) |
[production] |