8251-8300 of 10000 results (57ms)
2017-07-04 §
21:40 <volans> ACK'ed puppet not running on stat100[2-3],snapshot100[1,5-7] due to NFS overloaded on dataset1001 - T169680 [production]
16:54 <jynus> dropping ukwikimedia from several labsdbhosts [production]
16:10 <moritzm> rebooting radium for kernel update [production]
15:09 <mobrovac@tin> Finished deploy [citoid/deploy@9d22567]: Fallback to crossRef (T165105) and use MarcXML (T165105) (duration: 02m 52s) [production]
15:06 <mobrovac@tin> Started deploy [citoid/deploy@9d22567]: Fallback to crossRef (T165105) and use MarcXML (T165105) [production]
15:02 <godog> set operations/debs/nginx as hidden and update description [production]
14:57 <ema> pybal 1.13.7 uploaded to apt.w.o, testing it on pybal-test2001 T82747 T154759 [production]
14:31 <godog> copy nginx from jessie-wikimedia to stretch-wikimedia [production]
14:15 <paravoid> reset db2038's iLO [production]
13:06 <filippo@puppetmaster1001> conftool action : set/pooled=yes; selector: name=ms-fe2005.codfw.wmnet [production]
11:47 <marostegui@tin> Synchronized wmf-config/db-eqiad.php: Remove comments from db1039 status - T166208 (duration: 02m 50s) [production]
11:25 <joal@tin> Finished deploy [analytics/refinery@88cbb9e]: Regular weekly deploy (2) - Bug patch (duration: 03m 38s) [production]
11:21 <joal@tin> Started deploy [analytics/refinery@88cbb9e]: Regular weekly deploy (2) - Bug patch [production]
11:15 <elukey> powercycle elastic1018, host unreachable [production]
11:02 <joal@tin> Finished deploy [analytics/refinery@12c5f57]: Regular weekly deploy (duration: 04m 47s) [production]
11:00 <moritzm> rebooting kubernetes workers for kernel update [production]
10:58 <godog> copy wikimedia-lvs-realserver from jessie-wikimedia to stretch-wikimedia [production]
10:57 <joal@tin> Started deploy [analytics/refinery@12c5f57]: Regular weekly deploy [production]
10:53 <gehel> killing stuck wmf-reimage on puppetmaster1001 for maps-test2001 [production]
10:40 <marostegui> Stop replication on db1102 (sanitarium3) on s2 shard for maintenance - T153743 [production]
10:33 <marostegui@tin> Synchronized wmf-config/db-eqiad.php: Repool db1060 - T153743 (duration: 02m 49s) [production]
10:23 <marostegui@tin> Synchronized wmf-config/db-eqiad.php: Repool db1035 - T168661 (duration: 02m 49s) [production]
10:14 <marostegui@tin> Synchronized wmf-config/db-eqiad.php: Depool db1035 - T168661 (duration: 02m 50s) [production]
09:58 <marostegui> Move labsdb1009 main general replication thread to a named replication thread called db1095 - T153743 [production]
09:54 <marostegui> Stop replication on db1095 for maintenance - T153743 [production]
09:38 <moritzm> rebooting restbase2002-restbase2004 for kernel updates [production]
09:27 <moritzm> rebooting thumbor1001/1002 for kernel updates [production]
08:54 <marostegui> Run redact_sanitarium on db1102 (sanitarium3) - T153743 [production]
08:39 <moritzm> rebooting sca2* for kernel update [production]
08:25 <elukey> restart redis 6380 (slave) jobqueue instance on rdb1004/2003 to force resync with master [production]
08:12 <moritzm> powercycling mw1260, stuck in reboot [production]
07:56 <moritzm> powercycling mw1259, stuck in reboot [production]
07:52 <gehel> restart of relforge for kernel upgrade [production]
07:42 <moritzm> rebooting video scalers in eqiad for kernel update [production]
07:15 <marostegui> Deploy alter table on s3 hosts (eqiad) - T168661 [production]
06:05 <marostegui> Stop MySQL on db1060 for maintenance - T153743 [production]
05:47 <marostegui@tin> Synchronized wmf-config/db-eqiad.php: Depool db1060 - T153743 (duration: 02m 51s) [production]
05:26 <marostegui> Deploy alter table on s5 directly on s5 master (db1063) - T168661 [production]
05:20 <marostegui> Deploy alter table on s6 directly on s6 master (db1061) - T168661 [production]
05:08 <marostegui> Deploy alter table on s2 directly on s2 master (db1054) - T168661 [production]
02:27 <l10nupdate@tin> scap sync-l10n completed (1.30.0-wmf.7) (duration: 10m 14s) [production]
01:30 <mutante> releases1001: switching GID of reprepro and promemetheus-node-exporter group (1000 vs 1001), changing reprepro UID to 13927. using find -exec to fix all the permissions and make it identical to bromine. prevent permissions snafu when rsyncing (T164030) [production]
2017-07-03 §
20:46 <gehel> unbanning elastic1018 from elasticsearch eqiad cluster [production]
20:24 <gehel> banning elastic1018 from elasticsearch eqiad clsuter [production]
19:29 <hashar> restarting jenkins [production]
19:10 <nuria@tin> Finished deploy [eventlogging/analytics@328dea6]: (no justification provided) (duration: 00m 03s) [production]
19:09 <nuria@tin> Started deploy [eventlogging/analytics@328dea6]: (no justification provided) [production]
17:35 <chasemp> labvirt1003:~# service nova-compute restart [production]
16:55 <bd808> Running maintain-views --all-databases --clean --replace-all --debug on labsdb1001 [production]
16:51 <mobrovac@tin> Finished deploy [mobileapps/deploy@58a5b19]: (no justification provided) (duration: 00m 41s) [production]