2021-11-25
§
|
04:27 |
<ryankemper> |
[WDQS Deploy] Tests passing following deploy of `0.3.93` on canary `wdqs1003`; proceeding to rest of fleet |
[production] |
04:25 |
<ryankemper@deploy1002> |
Started deploy [wdqs/wdqs@29c5cd7]: 0.3.93 |
[production] |
04:25 |
<ryankemper> |
[WDQS Deploy] Gearing up for deploy of wdqs `0.3.93`. Pre-deploy tests passing on canary `wdqs1003` |
[production] |
03:12 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host elastic2072.codfw.wmnet with OS buster |
[production] |
02:42 |
<pt1979@cumin2002> |
START - Cookbook sre.hosts.reimage for host elastic2072.codfw.wmnet with OS buster |
[production] |
02:34 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host elastic2071.codfw.wmnet with OS buster |
[production] |
02:23 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host elastic2070.codfw.wmnet with OS buster |
[production] |
02:04 |
<pt1979@cumin2002> |
START - Cookbook sre.hosts.reimage for host elastic2071.codfw.wmnet with OS buster |
[production] |
01:54 |
<pt1979@cumin2002> |
START - Cookbook sre.hosts.reimage for host elastic2070.codfw.wmnet with OS buster |
[production] |
01:49 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host elastic2068.codfw.wmnet with OS buster |
[production] |
01:34 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host elastic2067.codfw.wmnet with OS buster |
[production] |
01:19 |
<pt1979@cumin2002> |
START - Cookbook sre.hosts.reimage for host elastic2068.codfw.wmnet with OS buster |
[production] |
01:04 |
<pt1979@cumin2002> |
START - Cookbook sre.hosts.reimage for host elastic2067.codfw.wmnet with OS buster |
[production] |
00:37 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host elastic2066.codfw.wmnet with OS buster |
[production] |
2021-11-24
§
|
23:59 |
<pt1979@cumin2002> |
START - Cookbook sre.hosts.reimage for host elastic2066.codfw.wmnet with OS buster |
[production] |
23:52 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host elastic2065.codfw.wmnet with OS buster |
[production] |
23:44 |
<mutante> |
puppetmaster1001:~] $ sudo puppet cert sign gitlab-runner1001.eqiad.wmnet | sudo install_console gitlab-runner1001.eqiad.wmnet (T295481) |
[production] |
23:26 |
<mutante> |
ganeti - bringing up new VM - sudo gnt-instance start gitlab-runner1001.eqiad.wmnet ; ran puppet on install1003; installing OS T295481 |
[production] |
23:22 |
<pt1979@cumin2002> |
START - Cookbook sre.hosts.reimage for host elastic2065.codfw.wmnet with OS buster |
[production] |
23:11 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host elastic2064.codfw.wmnet with OS buster |
[production] |
23:09 |
<mutante> |
mwmaint1002 - sudo /usr/bin/find /var/lib/puppet/clientbucket/ -type f -size 1M -delete - to fix Icinga alert about large files in client bucket |
[production] |
23:08 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host gitlab-runner1001.eqiad.wmnet |
[production] |
23:03 |
<mutante> |
wcqs1001 - sudo systemctl restart wcqs-blazegraph - after <+jinxer-wm> (BlazegraphFreeAllocatorsDecreasingRapidly) firing: Blazegraph instance wcqs1001:9195 is burning free allocators |
[production] |
22:52 |
<dzahn@cumin1001> |
START - Cookbook sre.ganeti.makevm for new host gitlab-runner1001.eqiad.wmnet |
[production] |
22:50 |
<mutante> |
Creating a new Ganeti VM and wondering which row to put it? [ganeti1009:~] $ for row in A B C D; do echo "row ${row}: $(sudo gnt-instance list -o name -F "pnode.group == 'row_${row}'" | wc -l) VMs"; done |
[production] |
22:43 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts gitlab-runner1001.wikimedia.org |
[production] |
22:41 |
<pt1979@cumin2002> |
START - Cookbook sre.hosts.reimage for host elastic2064.codfw.wmnet with OS buster |
[production] |
22:39 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host elastic2063.codfw.wmnet with OS buster |
[production] |
22:38 |
<mutante> |
running decom cookbook on gitlab-runner1001.wikimedia.org VM which was in state "ADMIN_down" and not used yet. to make room to recreate it as gitlab-runner1001.eqiad.wmnet T295481 |
[production] |
22:36 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.decommission for hosts gitlab-runner1001.wikimedia.org |
[production] |
22:08 |
<pt1979@cumin2002> |
START - Cookbook sre.hosts.reimage for host elastic2063.codfw.wmnet with OS buster |
[production] |
22:03 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host elastic2062.codfw.wmnet with OS buster |
[production] |
21:40 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
21:37 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
21:35 |
<legoktm@deploy1002> |
Synchronized wmf-config/: Improve docs on $wmgUseGlobalAbuseFilters and sort list of wikis (duration: 00m 57s) |
[production] |
21:33 |
<pt1979@cumin2002> |
START - Cookbook sre.hosts.reimage for host elastic2062.codfw.wmnet with OS buster |
[production] |
21:21 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host elastic2061.codfw.wmnet with OS buster |
[production] |
21:00 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
20:58 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . |
[production] |
20:54 |
<legoktm@deploy1002> |
Synchronized wmf-config/: Update configuration related to disabling Score functionality (duration: 00m 57s) |
[production] |
20:51 |
<pt1979@cumin2002> |
START - Cookbook sre.hosts.reimage for host elastic2061.codfw.wmnet with OS buster |
[production] |
19:48 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'After maintenance db1144:3314 (T296143)', diff saved to https://phabricator.wikimedia.org/P17834 and previous config saved to /var/cache/conftool/dbconfig/20211124-194857-ladsgroup.json |
[production] |
19:38 |
<razzi> |
`sudo maintain-views --all-databases --replace-all` on clouddb1018 for T292594 |
[production] |
19:33 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'After maintenance db1144:3314 (T296143)', diff saved to https://phabricator.wikimedia.org/P17833 and previous config saved to /var/cache/conftool/dbconfig/20211124-193352-ladsgroup.json |
[production] |
19:19 |
<razzi> |
run `maintain-views --all-databases --replace-all` on clouddb1013 for T292594 |
[production] |
19:18 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'After maintenance db1144:3314 (T296143)', diff saved to https://phabricator.wikimedia.org/P17832 and previous config saved to /var/cache/conftool/dbconfig/20211124-191847-ladsgroup.json |
[production] |
19:03 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'After maintenance db1144:3314 (T296143)', diff saved to https://phabricator.wikimedia.org/P17831 and previous config saved to /var/cache/conftool/dbconfig/20211124-190343-ladsgroup.json |
[production] |
18:57 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM ncredir2002.codfw.wmnet |
[production] |
18:51 |
<vgutierrez@cumin1001> |
START - Cookbook sre.ganeti.reboot-vm for VM ncredir2002.codfw.wmnet |
[production] |
18:48 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM ncredir2001.codfw.wmnet |
[production] |