2351-2400 of 10000 results (67ms)
2019-10-18 §
05:14 <vgutierrez> switch cp3039 from nginx to ats-tls - T231433 [production]
05:13 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1129 for schema change', diff saved to https://phabricator.wikimedia.org/P9386 and previous config saved to /var/cache/conftool/dbconfig/20191018-051355-marostegui.json [production]
05:08 <marostegui@cumin1001> dbctl commit (dc=all): 'Repool db1099:3311 and db2086:3318 after table compression', diff saved to https://phabricator.wikimedia.org/P9385 and previous config saved to /var/cache/conftool/dbconfig/20191018-050831-marostegui.json [production]
04:57 <vgutierrez> switch cp4025 from nginx to ats-tls - T231433 [production]
04:34 <vgutierrez> switch cp5005 from nginx to ats-tls - T231433 [production]
04:31 <vgutierrez> restarting nagios-nrpe-server on stat1007 [production]
2019-10-17 §
21:42 <mholloway-shell@deploy1001> Finished deploy [mobileapps/deploy@d663006]: Update mobileapps to f345673 (duration: 05m 38s) [production]
21:37 <mholloway-shell@deploy1001> Started deploy [mobileapps/deploy@d663006]: Update mobileapps to f345673 [production]
19:31 <eileen> civicrm revision changed from 4eac801762 to ff69d64ad4, config revision is dc3a88889d [production]
18:26 <mutante> wtp1025 - cd /srv/deployment/parsoid/deploy/src ; sudo -u deploy-service ln -s ../vendor (for benchmarking test) [production]
18:01 <_joe_> depooled wtp1025 from parsoid, parsoid-php to allow running benchmarks there [production]
18:01 <elukey> update librdkafka on eventlog1002 and restart eventlogging [production]
15:19 <marostegui@cumin1001> dbctl commit (dc=all): 'Repool db1090:3317 and remove db1136 from its temporary vslow,dump role', diff saved to https://phabricator.wikimedia.org/P9382 and previous config saved to /var/cache/conftool/dbconfig/20191017-151952-marostegui.json [production]
15:07 <dcausse> unbanning elastic1050:psi [production]
15:01 <dcausse> dumping jvm heap on elastic1050:psi to investigate gc issues [production]
14:46 <moritzm> installing 4.9.189 Linux update on jessie hosts (no reboots, deploying the package only at this point) [production]
14:37 <dcausse> banning elastic1050:psi to investigate gc issues [production]
14:32 <moritzm> uploaded linux-meta 1.22 for jessie-wikimedia [production]
14:32 <bblack> disable puppet on cache fleet (cp*) ahead of cert deployment refactoring - T234803 [production]
14:09 <cdanis> ✔️ cdanis@install1002.wikimedia.org ~ 🕙☕ sudo -E reprepro --restrict grafana update buster-wikimedia [production]
13:41 <marostegui@cumin1001> dbctl commit (dc=all): 'Fully repool db1129 after PDU maintenance', diff saved to https://phabricator.wikimedia.org/P9381 and previous config saved to /var/cache/conftool/dbconfig/20191017-134112-marostegui.json [production]
13:30 <marostegui@cumin1001> dbctl commit (dc=all): 'More traffic to db1129 after PDU maintenance', diff saved to https://phabricator.wikimedia.org/P9380 and previous config saved to /var/cache/conftool/dbconfig/20191017-133047-marostegui.json [production]
13:06 <XioNoX> rollback failover vrrp from cr2-eqiad to cr1-eqiad - T227133 [production]
12:56 <XioNoX> restart mr1-eqiad [production]
12:54 <XioNoX> downtiming all mgmt host for 30min (mr1-eqiad needs to be rebooted) [production]
12:52 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db2088:3312 for compression T235599', diff saved to https://phabricator.wikimedia.org/P9379 and previous config saved to /var/cache/conftool/dbconfig/20191017-125248-marostegui.json [production]
12:51 <marostegui@cumin1001> dbctl commit (dc=all): 'More traffic to db1129 after PDU maintenance', diff saved to https://phabricator.wikimedia.org/P9378 and previous config saved to /var/cache/conftool/dbconfig/20191017-125154-marostegui.json [production]
12:50 <marostegui> Compress tables on db2088:3312 - T235599 [production]
12:45 <marostegui@cumin1001> dbctl commit (dc=all): 'Slowly repool db1129 after PDU maintenance', diff saved to https://phabricator.wikimedia.org/P9377 and previous config saved to /var/cache/conftool/dbconfig/20191017-124503-marostegui.json [production]
12:13 <marostegui@cumin1001> dbctl commit (dc=all): 'Restore db1090:3312 original weight', diff saved to https://phabricator.wikimedia.org/P9376 and previous config saved to /var/cache/conftool/dbconfig/20191017-121330-marostegui.json [production]
12:11 <marostegui@cumin1001> dbctl commit (dc=all): 'Repool db1105:3312 after schema change', diff saved to https://phabricator.wikimedia.org/P9375 and previous config saved to /var/cache/conftool/dbconfig/20191017-121106-marostegui.json [production]
11:39 <ema> pool cp4027 with ATS backend T227432 [production]
11:36 <vgutierrez> upgrading ATS on eqiad nodes to 8.0.5-1wm9 - T234011 [production]
11:27 <vgutierrez> upgrading ATS on codfw nodes to 8.0.5-1wm9 - T234011 [production]
11:27 <ema@puppetmaster1001> conftool action : set/weight=100; selector: name=cp4027.ulsfo.wmnet,service=ats-be [production]
11:16 <vgutierrez> upgrading ATS on esams nodes to 8.0.5-1wm9 - T234011 [production]
11:11 <Urbanecm> EU SWAT done [production]
11:11 <XioNoX> failover vrrp from cr2-eqiad to cr1-eqiad - T227133 [production]
11:11 <urbanecm@deploy1001> Synchronized wmf-config/InitialiseSettings.php: SWAT: 36d4612: Allow sysops to add transwiki on nnwiki, and add import sources (T231761) (duration: 00m 59s) [production]
11:09 <vgutierrez> upgrading ATS on ulsfo nodes to 8.0.5-1wm9 - T234011 [production]
11:08 <urbanecm@deploy1001> Synchronized php-1.35.0-wmf.2/extensions/WikibaseMediaInfo: SWAT: 5a67011: Keep track of assigned nodes in both old & new DOM (T235236) (duration: 01m 03s) [production]
10:58 <ema@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
10:56 <ema@cumin1001> START - Cookbook sre.hosts.downtime [production]
10:32 <ema> depool cp4027 and reimage as text_ats T227432 [production]
10:31 <effie> depool mw1333 [production]
10:25 <elukey> rollback eventlogging back to Python 2, some errors (unseen in tests) logged by the processors [production]
10:24 <elukey@deploy1001> Finished deploy [eventlogging/analytics@0f0a1aa]: Rollback move codebase to Python3 (duration: 00m 03s) [production]
10:24 <elukey@deploy1001> Started deploy [eventlogging/analytics@0f0a1aa]: Rollback move codebase to Python3 [production]
10:19 <elukey> Move eventlogging on eventlog1002 to Python3 [production]
10:17 <elukey@deploy1001> Finished deploy [eventlogging/analytics@0f0a1aa]: Move codebase to Python3 (duration: 00m 05s) [production]