751-800 of 10000 results (49ms)
2022-04-20 ยง
19:19 <mutante> puppetmaster - cleaning cert for gitlab-runner2001, signing new request [production]
19:19 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1161 (T298565)', diff saved to https://phabricator.wikimedia.org/P25803 and previous config saved to /var/cache/conftool/dbconfig/20220420-191934-ladsgroup.json [production]
19:08 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1157', diff saved to https://phabricator.wikimedia.org/P25802 and previous config saved to /var/cache/conftool/dbconfig/20220420-190846-ladsgroup.json [production]
19:04 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1161', diff saved to https://phabricator.wikimedia.org/P25801 and previous config saved to /var/cache/conftool/dbconfig/20220420-190429-ladsgroup.json [production]
18:53 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1157 (T298565)', diff saved to https://phabricator.wikimedia.org/P25800 and previous config saved to /var/cache/conftool/dbconfig/20220420-185341-ladsgroup.json [production]
18:49 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1161', diff saved to https://phabricator.wikimedia.org/P25799 and previous config saved to /var/cache/conftool/dbconfig/20220420-184925-ladsgroup.json [production]
18:39 <mutante> reimaging gitlab-runner2021.codfw.wmnet [production]
18:36 <dzahn@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on gitlab-runner2001.codfw.wmnet with reason: reimage [production]
18:36 <dzahn@cumin2002> START - Cookbook sre.hosts.downtime for 1:00:00 on gitlab-runner2001.codfw.wmnet with reason: reimage [production]
18:34 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1161 (T298565)', diff saved to https://phabricator.wikimedia.org/P25798 and previous config saved to /var/cache/conftool/dbconfig/20220420-183419-ladsgroup.json [production]
18:17 <kormat@cumin1001> dbctl commit (dc=all): 'es1025 (re)pooling @ 100%: Reboot T303174', diff saved to https://phabricator.wikimedia.org/P25797 and previous config saved to /var/cache/conftool/dbconfig/20220420-181720-kormat.json [production]
18:15 <kormat@cumin1001> dbctl commit (dc=all): 'es1028 (re)pooling @ 100%: Reboot T303174', diff saved to https://phabricator.wikimedia.org/P25796 and previous config saved to /var/cache/conftool/dbconfig/20220420-181515-kormat.json [production]
18:10 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
18:10 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
18:10 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
18:10 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
18:05 <jhuneidi@deploy1002> Synchronized php: group1 wikis to 1.39.0-wmf.8 refs T305214 (duration: 00m 51s) [production]
18:04 <jhuneidi@deploy1002> rebuilt and synchronized wikiversions files: group1 wikis to 1.39.0-wmf.8 refs T305214 [production]
18:02 <cmjohnson@cumin1001> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host parse1024.mgmt.eqiad.wmnet with reboot policy FORCED [production]
18:02 <kormat@cumin1001> dbctl commit (dc=all): 'es1025 (re)pooling @ 75%: Reboot T303174', diff saved to https://phabricator.wikimedia.org/P25795 and previous config saved to /var/cache/conftool/dbconfig/20220420-180215-kormat.json [production]
18:00 <kormat@cumin1001> dbctl commit (dc=all): 'es1028 (re)pooling @ 75%: Reboot T303174', diff saved to https://phabricator.wikimedia.org/P25794 and previous config saved to /var/cache/conftool/dbconfig/20220420-180012-kormat.json [production]
17:53 <cmjohnson@cumin1001> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host parse1023.mgmt.eqiad.wmnet with reboot policy FORCED [production]
17:53 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1157 (T298565)', diff saved to https://phabricator.wikimedia.org/P25793 and previous config saved to /var/cache/conftool/dbconfig/20220420-175327-ladsgroup.json [production]
17:53 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1157.eqiad.wmnet with reason: Maintenance [production]
17:53 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1157.eqiad.wmnet with reason: Maintenance [production]
17:53 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1175 (T298565)', diff saved to https://phabricator.wikimedia.org/P25792 and previous config saved to /var/cache/conftool/dbconfig/20220420-175319-ladsgroup.json [production]
17:50 <cmjohnson@cumin1001> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host parse1019.mgmt.eqiad.wmnet with reboot policy FORCED [production]
17:49 <cmjohnson@cumin1001> START - Cookbook sre.hosts.provision for host parse1024.mgmt.eqiad.wmnet with reboot policy FORCED [production]
17:47 <cmjohnson@cumin1001> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host parse1018.mgmt.eqiad.wmnet with reboot policy FORCED [production]
17:47 <kormat@cumin1001> dbctl commit (dc=all): 'es1025 (re)pooling @ 50%: Reboot T303174', diff saved to https://phabricator.wikimedia.org/P25791 and previous config saved to /var/cache/conftool/dbconfig/20220420-174711-kormat.json [production]
17:46 <cmjohnson@cumin1001> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host parse1017.mgmt.eqiad.wmnet with reboot policy FORCED [production]
17:46 <cmjohnson@cumin1001> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host parse1014.mgmt.eqiad.wmnet with reboot policy FORCED [production]
17:46 <cmjohnson@cumin1001> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host parse1021.mgmt.eqiad.wmnet with reboot policy FORCED [production]
17:46 <cmjohnson@cumin1001> START - Cookbook sre.hosts.provision for host parse1019.mgmt.eqiad.wmnet with reboot policy FORCED [production]
17:46 <cmjohnson@cumin1001> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host parse1013.mgmt.eqiad.wmnet with reboot policy FORCED [production]
17:46 <cmjohnson@cumin1001> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host parse1022.mgmt.eqiad.wmnet with reboot policy FORCED [production]
17:46 <cmjohnson@cumin1001> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host parse1016.mgmt.eqiad.wmnet with reboot policy FORCED [production]
17:45 <cmjohnson@cumin1001> START - Cookbook sre.hosts.provision for host parse1015.mgmt.eqiad.wmnet with reboot policy FORCED [production]
17:45 <kormat@cumin1001> dbctl commit (dc=all): 'es1028 (re)pooling @ 50%: Reboot T303174', diff saved to https://phabricator.wikimedia.org/P25790 and previous config saved to /var/cache/conftool/dbconfig/20220420-174508-kormat.json [production]
17:40 <cmjohnson@cumin1001> START - Cookbook sre.hosts.provision for host parse1023.mgmt.eqiad.wmnet with reboot policy FORCED [production]
17:40 <cmjohnson@cumin1001> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host parse1012.mgmt.eqiad.wmnet with reboot policy FORCED [production]
17:39 <cmjohnson@cumin1001> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host parse1020.mgmt.eqiad.wmnet with reboot policy FORCED [production]
17:39 <cmjohnson@cumin1001> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host parse1015.mgmt.eqiad.wmnet with reboot policy FORCED [production]
17:39 <cmjohnson@cumin1001> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host parse1019.mgmt.eqiad.wmnet with reboot policy FORCED [production]
17:38 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1175', diff saved to https://phabricator.wikimedia.org/P25789 and previous config saved to /var/cache/conftool/dbconfig/20220420-173814-ladsgroup.json [production]
17:34 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1161 (T298565)', diff saved to https://phabricator.wikimedia.org/P25788 and previous config saved to /var/cache/conftool/dbconfig/20220420-173405-ladsgroup.json [production]
17:34 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on clouddb[1016,1020-1021].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance [production]
17:34 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 12:00:00 on clouddb[1016,1020-1021].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance [production]
17:34 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1161.eqiad.wmnet with reason: Maintenance [production]
17:34 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1161.eqiad.wmnet with reason: Maintenance [production]