2301-2350 of 10000 results (35ms)
2022-01-26 ยง
09:33 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host ganeti1005.eqiad.wmnet with OS buster [production]
09:32 <jayme> updated scap to 4.2.0 on A:restbase-canary - T300058 [production]
09:28 <godog> begin rsync prometheus2004 -> 2005 - T296199 [production]
09:26 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1135', diff saved to https://phabricator.wikimedia.org/P19252 and previous config saved to /var/cache/conftool/dbconfig/20220126-092626-marostegui.json [production]
09:25 <jmm@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host ganeti1005.eqiad.wmnet with OS buster [production]
09:25 <jayme> updated scap to 4.2.0 on A:mw-canary, A:parsoid-canary, A:mw-jobrunner-canary - T300058 [production]
09:24 <jayme> uploaded scap 4.2.0 to apt.wikimedia.org - T300058 [production]
09:21 <marostegui@cumin1001> dbctl commit (dc=all): 'db1120 (re)pooling @ 1%: repooling after reimage', diff saved to https://phabricator.wikimedia.org/P19251 and previous config saved to /var/cache/conftool/dbconfig/20220126-092158-root.json [production]
09:21 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db1120.eqiad.wmnet with OS bullseye [production]
09:11 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1135', diff saved to https://phabricator.wikimedia.org/P19250 and previous config saved to /var/cache/conftool/dbconfig/20220126-091121-marostegui.json [production]
09:06 <jayme> uploaded scap 4.2.0 to apt.wikimedia.org [production]
09:00 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host ganeti1005.eqiad.wmnet with OS buster [production]
09:00 <marostegui@cumin1001> START - Cookbook sre.hosts.reimage for host db1120.eqiad.wmnet with OS bullseye [production]
08:57 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1120 T300099', diff saved to https://phabricator.wikimedia.org/P19249 and previous config saved to /var/cache/conftool/dbconfig/20220126-085733-marostegui.json [production]
08:56 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ganeti1014.eqiad.wmnet with OS buster [production]
08:56 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1135 (T285149)', diff saved to https://phabricator.wikimedia.org/P19248 and previous config saved to /var/cache/conftool/dbconfig/20220126-085616-marostegui.json [production]
08:55 <marostegui@cumin1001> dbctl commit (dc=all): 'Depooling db1135 (T285149)', diff saved to https://phabricator.wikimedia.org/P19247 and previous config saved to /var/cache/conftool/dbconfig/20220126-085510-marostegui.json [production]
08:55 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1135.eqiad.wmnet with reason: Maintenance [production]
08:55 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1135.eqiad.wmnet with reason: Maintenance [production]
08:55 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1134 (T285149)', diff saved to https://phabricator.wikimedia.org/P19246 and previous config saved to /var/cache/conftool/dbconfig/20220126-085503-marostegui.json [production]
08:41 <moritzm> draining instances off ganeti1015 for reimage [production]
08:39 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1134', diff saved to https://phabricator.wikimedia.org/P19245 and previous config saved to /var/cache/conftool/dbconfig/20220126-083958-marostegui.json [production]
08:31 <jelto> sign puppet cert for gitlab-runner1001.eqiad.wmnet [production]
08:29 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host ganeti1014.eqiad.wmnet with OS buster [production]
08:24 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1134', diff saved to https://phabricator.wikimedia.org/P19244 and previous config saved to /var/cache/conftool/dbconfig/20220126-082453-marostegui.json [production]
08:20 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.addnode (exit_code=0) for new host ganeti1013.eqiad.wmnet to ganeti01.svc.eqiad.wmnet [production]
08:18 <jmm@cumin2002> START - Cookbook sre.ganeti.addnode for new host ganeti1013.eqiad.wmnet to ganeti01.svc.eqiad.wmnet [production]
08:09 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1134 (T285149)', diff saved to https://phabricator.wikimedia.org/P19243 and previous config saved to /var/cache/conftool/dbconfig/20220126-080948-marostegui.json [production]
08:08 <marostegui@cumin1001> dbctl commit (dc=all): 'Depooling db1134 (T285149)', diff saved to https://phabricator.wikimedia.org/P19242 and previous config saved to /var/cache/conftool/dbconfig/20220126-080842-marostegui.json [production]
08:08 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1134.eqiad.wmnet with reason: Maintenance [production]
08:08 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1134.eqiad.wmnet with reason: Maintenance [production]
08:08 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1133.eqiad.wmnet with reason: Maintenance [production]
08:08 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1133.eqiad.wmnet with reason: Maintenance [production]
08:08 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1163 (T285149)', diff saved to https://phabricator.wikimedia.org/P19241 and previous config saved to /var/cache/conftool/dbconfig/20220126-080831-marostegui.json [production]
07:53 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1163', diff saved to https://phabricator.wikimedia.org/P19240 and previous config saved to /var/cache/conftool/dbconfig/20220126-075326-marostegui.json [production]
07:51 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: sync on pinkunicorn [production]
07:50 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db2131.codfw.wmnet with OS bullseye [production]
07:50 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply on pinkunicorn [production]
07:50 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: sync on pinkunicorn [production]
07:49 <marostegui@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host es1020.eqiad.wmnet with OS bullseye [production]
07:49 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply on pinkunicorn [production]
07:45 <taavi@deploy1002> Synchronized wmf-config/interwiki.php: Config: [[gerrit:757377|Update interwiki cache]] (duration: 00m 52s) [production]
07:43 <marostegui@cumin1001> START - Cookbook sre.hosts.reimage for host es1020.eqiad.wmnet with OS bullseye [production]
07:38 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1163', diff saved to https://phabricator.wikimedia.org/P19239 and previous config saved to /var/cache/conftool/dbconfig/20220126-073822-marostegui.json [production]
07:23 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1163 (T285149)', diff saved to https://phabricator.wikimedia.org/P19238 and previous config saved to /var/cache/conftool/dbconfig/20220126-072317-marostegui.json [production]
07:22 <marostegui@cumin1001> dbctl commit (dc=all): 'Depooling db1163 (T285149)', diff saved to https://phabricator.wikimedia.org/P19237 and previous config saved to /var/cache/conftool/dbconfig/20220126-072211-marostegui.json [production]
07:22 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1163.eqiad.wmnet with reason: Maintenance [production]
07:22 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1163.eqiad.wmnet with reason: Maintenance [production]
07:22 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1140.eqiad.wmnet with reason: Maintenance [production]
07:22 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1140.eqiad.wmnet with reason: Maintenance [production]