2019-10-17
ยง
|
18:01 |
<elukey> |
update librdkafka on eventlog1002 and restart eventlogging |
[analytics] |
18:01 |
<elukey> |
update librdkafka on eventlog1002 and restart eventlogging |
[production] |
17:10 |
<James_F> |
Zuul: OOUI] Drop generic-node10-rundoc-docker and โฆ-npmaudit-docker experiments |
[releng] |
16:33 |
<James_F> |
Zuul: Moving OOUI from node 6 to node 10 job T235570 |
[releng] |
16:29 |
<jeh> |
cleanup old fullstackd puppet certs |
[cloudinfra] |
16:11 |
<James_F> |
Docker: Pushing node10-test-browser-php72-composer:0.1.1 |
[releng] |
15:27 |
<jeh> |
update eqiad1's endpoint catalog with the new wikimediacloud.org domain T223907 |
[openstack] |
15:19 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repool db1090:3317 and remove db1136 from its temporary vslow,dump role', diff saved to https://phabricator.wikimedia.org/P9382 and previous config saved to /var/cache/conftool/dbconfig/20191017-151952-marostegui.json |
[production] |
15:07 |
<dcausse> |
unbanning elastic1050:psi |
[production] |
15:01 |
<dcausse> |
dumping jvm heap on elastic1050:psi to investigate gc issues |
[production] |
14:56 |
<jeh> |
added icingia downtime for cloudcontrol100[34] and checker.tools.wmflabs.org for service restarts T223907 |
[openstack] |
14:54 |
<jeh> |
update eqiad1's hiera keystone_host to new wikimediacloud.org domain T223907 |
[openstack] |
14:46 |
<moritzm> |
installing 4.9.189 Linux update on jessie hosts (no reboots, deploying the package only at this point) |
[production] |
14:41 |
<jeh> |
deleting failed stresstest VMs that have multiple designate records stresstest1024-16-[16,17,64] left over from newton upgrade T212302 |
[testlabs] |
14:37 |
<dcausse> |
banning elastic1050:psi to investigate gc issues |
[production] |
14:32 |
<moritzm> |
uploaded linux-meta 1.22 for jessie-wikimedia |
[production] |
14:32 |
<bblack> |
disable puppet on cache fleet (cp*) ahead of cert deployment refactoring - T234803 |
[production] |
14:30 |
<jeh> |
cleaning up failed nova fullstack vms related to puppet ca T234332 |
[admin-monitoring] |
14:09 |
<cdanis> |
โ๏ธ cdanis@install1002.wikimedia.org ~ ๐โ sudo -E reprepro --restrict grafana update buster-wikimedia |
[production] |
13:41 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Fully repool db1129 after PDU maintenance', diff saved to https://phabricator.wikimedia.org/P9381 and previous config saved to /var/cache/conftool/dbconfig/20191017-134112-marostegui.json |
[production] |
13:30 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'More traffic to db1129 after PDU maintenance', diff saved to https://phabricator.wikimedia.org/P9380 and previous config saved to /var/cache/conftool/dbconfig/20191017-133047-marostegui.json |
[production] |
13:06 |
<XioNoX> |
rollback failover vrrp from cr2-eqiad to cr1-eqiad - T227133 |
[production] |
12:56 |
<XioNoX> |
restart mr1-eqiad |
[production] |
12:54 |
<XioNoX> |
downtiming all mgmt host for 30min (mr1-eqiad needs to be rebooted) |
[production] |
12:52 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db2088:3312 for compression T235599', diff saved to https://phabricator.wikimedia.org/P9379 and previous config saved to /var/cache/conftool/dbconfig/20191017-125248-marostegui.json |
[production] |
12:52 |
<arturo> |
add phamhi as user and projectadmin |
[cloudinfra] |
12:51 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'More traffic to db1129 after PDU maintenance', diff saved to https://phabricator.wikimedia.org/P9378 and previous config saved to /var/cache/conftool/dbconfig/20191017-125154-marostegui.json |
[production] |
12:50 |
<marostegui> |
Compress tables on db2088:3312 - T235599 |
[production] |
12:45 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Slowly repool db1129 after PDU maintenance', diff saved to https://phabricator.wikimedia.org/P9377 and previous config saved to /var/cache/conftool/dbconfig/20191017-124503-marostegui.json |
[production] |
12:30 |
<Lucas_WMDE> |
$ npm run build-storybook && tar -C storybook-static -c . | ssh toolforge "sudo -i -u tools.wikibase-databridge-storybook sh -c 'rm -rf www/static/*; tar -C www/static/ -x'" # deploy locally built storybook (T235055) |
[tools.wikibase-databridge-storybook] |
12:13 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Restore db1090:3312 original weight', diff saved to https://phabricator.wikimedia.org/P9376 and previous config saved to /var/cache/conftool/dbconfig/20191017-121330-marostegui.json |
[production] |
12:11 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repool db1105:3312 after schema change', diff saved to https://phabricator.wikimedia.org/P9375 and previous config saved to /var/cache/conftool/dbconfig/20191017-121106-marostegui.json |
[production] |
11:39 |
<ema> |
pool cp4027 with ATS backend T227432 |
[production] |
11:36 |
<vgutierrez> |
upgrading ATS on eqiad nodes to 8.0.5-1wm9 - T234011 |
[production] |
11:27 |
<vgutierrez> |
upgrading ATS on codfw nodes to 8.0.5-1wm9 - T234011 |
[production] |
11:27 |
<ema@puppetmaster1001> |
conftool action : set/weight=100; selector: name=cp4027.ulsfo.wmnet,service=ats-be |
[production] |
11:16 |
<vgutierrez> |
upgrading ATS on esams nodes to 8.0.5-1wm9 - T234011 |
[production] |
11:11 |
<Urbanecm> |
EU SWAT done |
[production] |
11:11 |
<XioNoX> |
failover vrrp from cr2-eqiad to cr1-eqiad - T227133 |
[production] |
11:11 |
<urbanecm@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: 36d4612: Allow sysops to add transwiki on nnwiki, and add import sources (T231761) (duration: 00m 59s) |
[production] |
11:09 |
<vgutierrez> |
upgrading ATS on ulsfo nodes to 8.0.5-1wm9 - T234011 |
[production] |
11:08 |
<urbanecm@deploy1001> |
Synchronized php-1.35.0-wmf.2/extensions/WikibaseMediaInfo: SWAT: 5a67011: Keep track of assigned nodes in both old & new DOM (T235236) (duration: 01m 03s) |
[production] |
10:58 |
<ema@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
10:56 |
<ema@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
10:32 |
<ema> |
depool cp4027 and reimage as text_ats T227432 |
[production] |
10:31 |
<effie> |
depool mw1333 |
[production] |
10:26 |
<elukey> |
rollback eventlogging back to Python 2, some errors (unseen in tests) logged by the processors |
[analytics] |
10:25 |
<elukey> |
rollback eventlogging back to Python 2, some errors (unseen in tests) logged by the processors |
[production] |
10:24 |
<elukey@deploy1001> |
Finished deploy [eventlogging/analytics@0f0a1aa]: Rollback move codebase to Python3 (duration: 00m 03s) |
[production] |
10:24 |
<elukey@deploy1001> |
Started deploy [eventlogging/analytics@0f0a1aa]: Rollback move codebase to Python3 |
[production] |