2018-11-20
ยง
|
14:32 |
<jynus> |
stop and upgrade db2033 |
[production] |
14:27 |
<hashar> |
deployment-mediawiki-07 : sudo rm -fR /srv/mediawiki/.~tmp~/ |
[releng] |
14:18 |
<gtirloni> |
Created Puppet prefixes 'tools-clushmaster' & 'tools-mail' |
[tools] |
14:02 |
<elukey> |
restart hive-server2 to pick up new settings - T209536 |
[analytics] |
13:55 |
<andrewbogott> |
deleting deployment-redis05 and deployment-redis06 as per Giuseppe, "we're not using the old jobqueue, we should remove those vms" |
[releng] |
13:55 |
<andrewbogott> |
deleting deployment-redis05 and deployment-redis06 as per Giuseppe, "we're not using the old jobqueue, we should remove those vms" |
[deployment-prep] |
13:24 |
<gtirloni> |
shutdown tools-clushmaster-01 (use tools-clushmaster-02) |
[tools] |
13:19 |
<jynus> |
stop and upgrade db2082 |
[production] |
13:08 |
<twentyafterfour> |
removed local unix user mwdeploy from deployment-mediawiki-07 because it was shadowing the real mwdeploy user in ldap |
[releng] |
13:06 |
<banyek> |
depooling labsdb1011 (T209517) |
[production] |
13:03 |
<zeljkof> |
EU SWAT finished |
[production] |
13:03 |
<zfilipin@deploy1001> |
Synchronized wmf-config/InitialiseSettings-labs.php: SWAT: [[gerrit:474899|deployment-prep: Update parsoid09 IP (T208101)]] (duration: 00m 47s) |
[production] |
13:01 |
<twentyafterfour> |
PHP Startup: Unable to load dynamic library '/usr/lib/php/20151012/luasandbox.so' - /usr/lib/php/20151012/luasandbox.so: cannot open shared object file: No such file or directory T208101 |
[releng] |
12:55 |
<zfilipin@deploy1001> |
Synchronized wmf-config/db-labs.php: SWAT: [[gerrit:474892|deployment-prep: Update deployment-db* IPs (T208101)]] (duration: 00m 47s) |
[production] |
12:55 |
<banyek> |
setting innodb_flush_log_at_trx_commit to 2 on dbstore2002 (s3 instance only!) (T208320) |
[production] |
12:53 |
<banyek> |
setting innodb_flush_log_at_trx_commit to 2 on dbstore2002 (T208320) |
[production] |
12:49 |
<zfilipin@deploy1001> |
scap failed: average error rate on 4/11 canaries increased by 10x (rerun with --force to override this check, see https://logstash.wikimedia.org/goto/db09a36be5ed3e81155041f7d46ad040 for details) |
[production] |
12:45 |
<zfilipin@deploy1001> |
Synchronized wmf-config/reverse-proxy-staging.php: SWAT: [[gerrit:474890|deployment-prep: Update cache-upload private IP (T208101)]] (duration: 00m 45s) |
[production] |
12:30 |
<zfilipin@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:474897|Use HD logos in InitialiseSettings.php for multiple projects (T150618)]] (duration: 00m 48s) |
[production] |
12:25 |
<zfilipin@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:474458|Add tboverride permission to extendedmover group on enwiki (T209753)]] (duration: 00m 47s) |
[production] |
12:19 |
<jynus> |
powercycling db2087, stuck on reboot |
[production] |
12:13 |
<zfilipin@deploy1001> |
Synchronized static/images/project-logos/: SWAT: [[gerrit:472791|Upload HD logos for multiple projects (T150618)]] (duration: 00m 48s) |
[production] |
12:11 |
<twentyafterfour> |
scap failures on deployment-mediawiki-07 are related to uid/gid mismatch of the mwdeploy user, specifically the owner of that user's home dir is uid 603 but /etc/passwd|group have a different uid/gid for the same username. T208101 |
[releng] |
11:55 |
<moritzm> |
rolling reboot of proton hosts for kernel security update |
[production] |
11:47 |
<hashar> |
Armed keyholder on deployment-deploy01 Got shutdown while being migrated a new cloud region # T208101 |
[releng] |
11:44 |
<elukey> |
re-run pageview-hourly-wf-2018-11-20-9 |
[analytics] |
11:27 |
<jynus> |
stop and upgrade db2087 |
[production] |
11:16 |
<banyek@deploy1001> |
Synchronized wmf-config/db-eqiad.php: T85757: (now really) repool db1093 (duration: 00m 47s) |
[production] |
11:11 |
<banyek> |
repooling db1093 (T85757) |
[production] |
11:05 |
<banyek> |
executing schema change on db1093 (T85757) |
[production] |
11:00 |
<hashar> |
deployment-deploy01 got migrated to a new region but the Jenkins configuration had not been updated. Adjusting IP address from 10.68.23.38 to 172.16.4.18 | T208101 |
[releng] |
11:00 |
<jynus> |
stop and upgrade db2086 |
[production] |
10:59 |
<banyek> |
db1093 was depooled wrong message sent |
[production] |
10:58 |
<hashar> |
Clearing out deployment-deploy01 disk space. Went offline due to disk space consumption |
[releng] |
10:52 |
<arturo> |
T208579 distributing now misctools and jobutils 1.33 in all aptly repos |
[tools] |
10:51 |
<banyek@deploy1001> |
Synchronized wmf-config/db-eqiad.php: T85757: repool db1093 (duration: 00m 47s) |
[production] |
10:48 |
<banyek> |
depooling db1093 (T85757) |
[production] |
10:48 |
<banyek> |
depooling db1093 |
[production] |
10:47 |
<jynus@deploy1001> |
Synchronized wmf-config/db-codfw.php: Repool es2018 (duration: 00m 46s) |
[production] |
10:17 |
<jynus> |
upgrade and reboot es2018 |
[production] |
10:13 |
<jynus@deploy1001> |
Synchronized wmf-config/db-codfw.php: Repool es2014, depool es2018 (duration: 00m 46s) |
[production] |
10:04 |
<Krenair> |
manually fixed deployment-mediawiki-09:/srv/mediawiki/wmf-config/db-labs.php to match deployment copy, not sure why it didn't deploy properly yet |
[releng] |
09:43 |
<godog> |
restart prometheus@tools on prometheus-01 |
[tools] |
09:34 |
<marostegui> |
Deploy schema change on s2 hosts: dbstore1002, db1090:3312 and db1095:3312 - T86339 |
[production] |
09:26 |
<marostegui> |
Deploy schema change on s2 codfw master (db2035) with replication - T86339 |
[production] |
09:25 |
<jynus> |
upgrade and reboot es2014 |
[production] |
09:23 |
<jynus@deploy1001> |
Synchronized wmf-config/db-codfw.php: Repool es2011, depool es2014 (duration: 00m 46s) |
[production] |
09:23 |
<godog> |
stress-test new ms-be hardware - T209395 |
[production] |
09:12 |
<marostegui> |
Stop MySQL on pc2004, pc2005 and pc2006 for decommission - T209858 |
[production] |
09:05 |
<gehel> |
powercycle elastic2021 |
[production] |