2021-11-17
ยง
|
12:17 |
<topranks> |
Re-pooling ulsfo after completing routing changes on cr3-ulsfo and cr4-ulsfo (T295672) |
[production] |
12:12 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
12:11 |
<btullis@cumin1001> |
START - Cookbook sre.presto.roll-restart-workers for Presto analytics cluster: Roll restart of all Presto's jvm daemons. |
[production] |
12:11 |
<moritzm> |
failover ganeti master in test cluster to ganeti-test2003 |
[production] |
12:09 |
<lucaswerkmeister-wmde@deploy1002> |
Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:739391|Enable more languages for Section Translation in testwiki (T294223)]] (duration: 01m 52s) |
[production] |
12:08 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
11:09 |
<moritzm> |
installing testvm2002 |
[production] |
10:51 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Remove recentchangeslinked from s5 eqiad T263127', diff saved to https://phabricator.wikimedia.org/P17753 and previous config saved to /var/cache/conftool/dbconfig/20211117-105120-marostegui.json |
[production] |
10:45 |
<dcausse> |
restarting blazegraph on wdqs1013 (jvm stuck) |
[production] |
10:45 |
<topranks> |
Commencing manual config on cr3-ulsfo and cr4-ulsfo (site depooled) to reconfigure iBGP (T295672) |
[production] |
10:42 |
<hnowlan> |
replaced all references to deploy1001 with deploy1002 in all .git/DEPLOY_HEAD directories on deploy1002:/srv/deployment |
[production] |
10:41 |
<ema> |
A:cp re-enable puppet after testing https://gerrit.wikimedia.org/r/c/operations/puppet/+/738949/ T293879 |
[production] |
10:37 |
<jayme> |
imported wmf-certificates 0~20211110-1 to stretch-wikimedia,buster-wikimedia,bullseye-wikimedia |
[production] |
10:31 |
<ema> |
A:cp disable-puppet to merge and test https://gerrit.wikimedia.org/r/c/operations/puppet/+/738949/ T293879 |
[production] |
10:28 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host testvm2002.codfw.wmnet |
[production] |
10:18 |
<mmandere@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp6016.drmrs.wmnet with OS buster |
[production] |
10:18 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.makevm for new host testvm2002.codfw.wmnet |
[production] |
10:14 |
<topranks> |
De-pool ulsfo in DNS to allow safe reconfiguration / test of changes to CR routers iBGP (T295672) |
[production] |
10:01 |
<hnowlan@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'api-gateway' for release 'production' . |
[production] |
10:01 |
<hnowlan@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'api-gateway' for release 'staging' . |
[production] |
10:00 |
<moritzm> |
running "gnt-cluster upgrade --to 2.16" on ganeti test cluster |
[production] |
09:59 |
<hnowlan@deploy1002> |
helmfile [codfw] Ran 'sync' command on namespace 'api-gateway' for release 'production' . |
[production] |
09:59 |
<hnowlan@deploy1002> |
helmfile [codfw] Ran 'sync' command on namespace 'api-gateway' for release 'staging' . |
[production] |
09:53 |
<mmandere@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp6015.drmrs.wmnet with OS buster |
[production] |
09:48 |
<moritzm> |
running "gnt-cluster renew-crypto --new-cluster-certificate" on ganeti test cluster |
[production] |
09:39 |
<mmandere@cumin1001> |
START - Cookbook sre.hosts.reimage for host cp6016.drmrs.wmnet with OS buster |
[production] |
09:35 |
<mmandere@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp6014.drmrs.wmnet with OS buster |
[production] |
09:19 |
<_joe_> |
removing php 7.3 images from docker-registry.wikimedia.org |
[production] |
09:13 |
<mmandere@cumin1001> |
START - Cookbook sre.hosts.reimage for host cp6015.drmrs.wmnet with OS buster |
[production] |
09:11 |
<mmandere@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp6013.drmrs.wmnet with OS buster |
[production] |
09:03 |
<moritzm> |
installing ffmpeg security updates on stretch |
[production] |
09:01 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1180 (re)pooling @ 100%: Repool after upgrade', diff saved to https://phabricator.wikimedia.org/P17752 and previous config saved to /var/cache/conftool/dbconfig/20211117-090124-root.json |
[production] |
08:56 |
<mmandere@cumin1001> |
START - Cookbook sre.hosts.reimage for host cp6014.drmrs.wmnet with OS buster |
[production] |
08:54 |
<mmandere@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp6012.drmrs.wmnet with OS buster |
[production] |
08:46 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1180 (re)pooling @ 75%: Repool after upgrade', diff saved to https://phabricator.wikimedia.org/P17751 and previous config saved to /var/cache/conftool/dbconfig/20211117-084621-root.json |
[production] |
08:31 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1180 (re)pooling @ 50%: Repool after upgrade', diff saved to https://phabricator.wikimedia.org/P17750 and previous config saved to /var/cache/conftool/dbconfig/20211117-083117-root.json |
[production] |
08:30 |
<mmandere@cumin1001> |
START - Cookbook sre.hosts.reimage for host cp6013.drmrs.wmnet with OS buster |
[production] |
08:24 |
<mmandere@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp6011.drmrs.wmnet with OS buster |
[production] |
08:16 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1180 (re)pooling @ 40%: Repool after upgrade', diff saved to https://phabricator.wikimedia.org/P17749 and previous config saved to /var/cache/conftool/dbconfig/20211117-081613-root.json |
[production] |
08:14 |
<mmandere@cumin1001> |
START - Cookbook sre.hosts.reimage for host cp6012.drmrs.wmnet with OS buster |
[production] |
08:11 |
<mmandere@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp6010.drmrs.wmnet with OS buster |
[production] |
08:01 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1180 (re)pooling @ 25%: Repool after upgrade', diff saved to https://phabricator.wikimedia.org/P17748 and previous config saved to /var/cache/conftool/dbconfig/20211117-080110-root.json |
[production] |
07:49 |
<elukey> |
restart coal, navtiming, statsv (refreshed by puppet) after https://gerrit.wikimedia.org/r/737970 |
[production] |
07:46 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1180 (re)pooling @ 20%: Repool after upgrade', diff saved to https://phabricator.wikimedia.org/P17747 and previous config saved to /var/cache/conftool/dbconfig/20211117-074606-root.json |
[production] |
07:44 |
<mmandere@cumin1001> |
START - Cookbook sre.hosts.reimage for host cp6011.drmrs.wmnet with OS buster |
[production] |
07:34 |
<mmandere@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp6009.drmrs.wmnet with OS buster |
[production] |
07:31 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1180 (re)pooling @ 10%: Repool after upgrade', diff saved to https://phabricator.wikimedia.org/P17746 and previous config saved to /var/cache/conftool/dbconfig/20211117-073102-root.json |
[production] |
07:31 |
<mmandere@cumin1001> |
START - Cookbook sre.hosts.reimage for host cp6010.drmrs.wmnet with OS buster |
[production] |
07:29 |
<elukey> |
`apt-get clean` on an-tool1005 to free space in the root partition |
[production] |
07:28 |
<elukey> |
`sudo pkill -U jmixter` on stat100[5,8] to allow puppet to run and remove the offboarded user |
[production] |