2017-07-04
|
21:40 |
<volans> |
ACK'ed puppet not running on stat100[2-3],snapshot100[1,5-7] due to NFS overloaded on dataset1001 - T169680 |
[production] |
16:54 |
<jynus> |
dropping ukwikimedia from several labsdb hosts |
[production] |
16:10 |
<moritzm> |
rebooting radium for kernel update |
[production] |
15:09 |
<mobrovac@tin> |
Finished deploy [citoid/deploy@9d22567]: Fallback to crossRef (T165105) and use MarcXML (T165105) (duration: 02m 52s) |
[production] |
15:06 |
<mobrovac@tin> |
Started deploy [citoid/deploy@9d22567]: Fallback to crossRef (T165105) and use MarcXML (T165105) |
[production] |
15:02 |
<godog> |
set operations/debs/nginx as hidden and update description |
[production] |
14:57 |
<ema> |
pybal 1.13.7 uploaded to apt.w.o, testing it on pybal-test2001 T82747 T154759 |
[production] |
14:31 |
<godog> |
copy nginx from jessie-wikimedia to stretch-wikimedia |
[production] |
14:15 |
<paravoid> |
reset db2038's iLO |
[production] |
13:06 |
<filippo@puppetmaster1001> |
conftool action : set/pooled=yes; selector: name=ms-fe2005.codfw.wmnet |
[production] |
11:47 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Remove comments from db1039 status - T166208 (duration: 02m 50s) |
[production] |
11:25 |
<joal@tin> |
Finished deploy [analytics/refinery@88cbb9e]: Regular weekly deploy (2) - Bug patch (duration: 03m 38s) |
[production] |
11:21 |
<joal@tin> |
Started deploy [analytics/refinery@88cbb9e]: Regular weekly deploy (2) - Bug patch |
[production] |
11:15 |
<elukey> |
powercycle elastic1018, host unreachable |
[production] |
11:02 |
<joal@tin> |
Finished deploy [analytics/refinery@12c5f57]: Regular weekly deploy (duration: 04m 47s) |
[production] |
11:00 |
<moritzm> |
rebooting kubernetes workers for kernel update |
[production] |
10:58 |
<godog> |
copy wikimedia-lvs-realserver from jessie-wikimedia to stretch-wikimedia |
[production] |
10:57 |
<joal@tin> |
Started deploy [analytics/refinery@12c5f57]: Regular weekly deploy |
[production] |
10:53 |
<gehel> |
killing stuck wmf-reimage on puppetmaster1001 for maps-test2001 |
[production] |
10:40 |
<marostegui> |
Stop replication on db1102 (sanitarium3) on s2 shard for maintenance - T153743 |
[production] |
10:33 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Repool db1060 - T153743 (duration: 02m 49s) |
[production] |
10:23 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Repool db1035 - T168661 (duration: 02m 49s) |
[production] |
10:14 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Depool db1035 - T168661 (duration: 02m 50s) |
[production] |
09:58 |
<marostegui> |
Move labsdb1009 main general replication thread to a named replication thread called db1095 - T153743 |
[production] |
09:54 |
<marostegui> |
Stop replication on db1095 for maintenance - T153743 |
[production] |
09:38 |
<moritzm> |
rebooting restbase2002-restbase2004 for kernel updates |
[production] |
09:27 |
<moritzm> |
rebooting thumbor1001/1002 for kernel updates |
[production] |
08:54 |
<marostegui> |
Run redact_sanitarium on db1102 (sanitarium3) - T153743 |
[production] |
08:39 |
<moritzm> |
rebooting sca2* for kernel update |
[production] |
08:25 |
<elukey> |
restart redis 6380 (slave) jobqueue instance on rdb1004/2003 to force resync with master |
[production] |
08:12 |
<moritzm> |
powercycling mw1260, stuck in reboot |
[production] |
07:56 |
<moritzm> |
powercycling mw1259, stuck in reboot |
[production] |
07:52 |
<gehel> |
restart of relforge for kernel upgrade |
[production] |
07:42 |
<moritzm> |
rebooting video scalers in eqiad for kernel update |
[production] |
07:15 |
<marostegui> |
Deploy alter table on s3 hosts (eqiad) - T168661 |
[production] |
06:05 |
<marostegui> |
Stop MySQL on db1060 for maintenance - T153743 |
[production] |
05:47 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Depool db1060 - T153743 (duration: 02m 51s) |
[production] |
05:26 |
<marostegui> |
Deploy alter table on s5 directly on s5 master (db1063) - T168661 |
[production] |
05:20 |
<marostegui> |
Deploy alter table on s6 directly on s6 master (db1061) - T168661 |
[production] |
05:08 |
<marostegui> |
Deploy alter table on s2 directly on s2 master (db1054) - T168661 |
[production] |
02:27 |
<l10nupdate@tin> |
scap sync-l10n completed (1.30.0-wmf.7) (duration: 10m 14s) |
[production] |
01:30 |
<mutante> |
releases1001: switching GID of reprepro and prometheus-node-exporter group (1000 vs 1001), changing reprepro UID to 13927. using find -exec to fix all the permissions and make it identical to bromine. prevent permissions snafu when rsyncing (T164030) |
[production] |