901-950 of 10000 results (63ms)
2019-11-27 ยง
18:58 <mutante> an-airflow1001: cd /etc/ ; chown airflow airflow; systemctl start airflow-webserver to let airflow write unittests.cfg [production]
18:57 <eileen> process-control config revision is b95355c0c0 - repair omnirecipient job off [production]
16:57 <andrewbogott> disabling puppet on clouvirt* and cloudcontrol* while merging https://gerrit.wikimedia.org/r/#/c/operations/puppet/+/552894/ [production]
16:50 <oblivian@puppetmaster1001> conftool action : set/pooled=true; selector: dnsdisc=eventgate-logging-external [production]
16:32 <cdanis@deploy1001> Synchronized wmf-config/PoolCounterSettings.php: dd4c76d3d SpecialContributions: max concurrency 3 (instead of 10) T234450 (duration: 01m 17s) [production]
16:22 <ejegg> shifted daily silverpop export start time one hour earlier [production]
16:15 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1080 for schema change', diff saved to https://phabricator.wikimedia.org/P9768 and previous config saved to /var/cache/conftool/dbconfig/20191127-161525-marostegui.json [production]
16:14 <marostegui@cumin1001> dbctl commit (dc=all): 'Repool db1089 after schema change', diff saved to https://phabricator.wikimedia.org/P9767 and previous config saved to /var/cache/conftool/dbconfig/20191127-161450-marostegui.json [production]
16:06 <ema> cp3050: set proxy.config.http.server_session_sharing.match to "ip" T238494 [production]
15:57 <_joe_> restarting pybal on lvs1015 [production]
15:56 <jiji@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
15:56 <jiji@cumin1001> START - Cookbook sre.hosts.downtime [production]
15:55 <_joe_> restarting pybal on lvs1016 [production]
15:52 <jynus> disabling puppet on dbprov1001 to test bacula restore T238048 [production]
15:47 <papaul> testing redundancy power on scs-a1-codfw [production]
15:47 <_joe_> restarting pybal on lvs2003 [production]
15:44 <_joe_> restarting pybal again on lvs2006 [production]
15:42 <jynus> migrate db entries of archive Media to backup1001 T238048 [production]
15:37 <marostegui> Logging retroactively for the record: drop user 'nova'@'%' from m5 - T239170 [production]
15:30 <jiji@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
15:30 <jiji@cumin1001> START - Cookbook sre.hosts.downtime [production]
15:29 <jiji@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
15:29 <jiji@cumin1001> START - Cookbook sre.hosts.downtime [production]
15:29 <marostegui> Add grants for dump (10.192.0.114,10.192.16.96) for nova_cell0_eqiad database on db1117:3325 and db2078:3325 - T239170 [production]
15:27 <marostegui> Add grants for dump (10.64.0.95,10.64.16.31) for nova_cell0_eqiad database on db1117:3325 and db2078:3325 - T239170 [production]
15:25 <_joe_> restarting lvs2006 for addition of eventgate-logging-external,blubberoid-https [production]
15:24 <moritzm> installing freetype bugfix updates from Buster 10.2 point release [production]
15:21 <oblivian@cumin1001> conftool action : set/weight=10:pooled=yes; selector: service=eventgate-logging-external [production]
15:14 <jiji@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
15:14 <jiji@cumin1001> START - Cookbook sre.hosts.downtime [production]
15:11 <moritzm> downgrading trapperkeeper-webserver-jetty9-clojure packages on puppetdb hosts to the version shipped in Buster 10.2 [production]
15:06 <jiji@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
15:06 <jiji@cumin1001> START - Cookbook sre.hosts.downtime [production]
15:04 <ema> cp-ats: rolling ats-{tls,backend} restart to enable lua reload T233274 [production]
15:02 <moritzm> remove trapperkeeper-webserver-jetty9-clojure debs from apt.wikimedia.org/buster-wikimedia (these were needed to unbreak TLS on Puppetdb in Buster, but an update landed in Buster 10.2, which replaces our custom hotfix) [production]
14:56 <marostegui> Add new grants for nova_cell0 database on m5 - T239170 [production]
14:50 <marostegui> Create nova_cell0 database on m5 master - T239170 [production]
14:43 <effie> reimage mw1346, mw1336, mw1326 [production]
14:35 <jiji@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
14:33 <jiji@cumin1001> START - Cookbook sre.hosts.downtime [production]
14:15 <effie> reimage mw2285, mw2284, mw2283 [production]
14:14 <effie> reimage mw2285, mw2286, mw2283 [production]
14:01 <moritzm> temporarily stop cas on idp1001 for some failover tests [production]
14:00 <jmm@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
14:00 <jmm@cumin1001> START - Cookbook sre.hosts.downtime [production]
13:57 <ladsgroup@deploy1001> Synchronized wmf-config/InitialiseSettings.php: Set all of testwikidatawiki to read from the new term store for items (T225057) (duration: 00m 56s) [production]
13:44 <jiji@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
13:44 <jiji@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
13:42 <jiji@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
13:42 <jiji@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]