2022-11-28
§
|
22:06 |
<sukhe@cumin2002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: cp[5004,5009].eqsin.wmnet decommissioned, removing all IPs except the asset tag one - sukhe@cumin2002" |
[production] |
22:03 |
<sukhe@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
22:00 |
<dzahn@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 14 days, 0:00:00 on phab1001.eqiad.wmnet with reason: T322250 |
[production] |
22:00 |
<dzahn@cumin2002> |
START - Cookbook sre.hosts.downtime for 14 days, 0:00:00 on phab1001.eqiad.wmnet with reason: T322250 |
[production] |
22:00 |
<brennen> |
phabricator: phab1001 -> phab1004 migration starting soon; downtime expected (T280597) |
[production] |
21:57 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1188', diff saved to https://phabricator.wikimedia.org/P41486 and previous config saved to /var/cache/conftool/dbconfig/20221128-215715-ladsgroup.json |
[production] |
21:55 |
<sukhe@cumin2002> |
START - Cookbook sre.hosts.decommission for hosts cp[5004,5009].eqsin.wmnet |
[production] |
21:54 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2147 (T321126)', diff saved to https://phabricator.wikimedia.org/P41485 and previous config saved to /var/cache/conftool/dbconfig/20221128-215435-marostegui.json |
[production] |
21:52 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depooling db2147 (T321126)', diff saved to https://phabricator.wikimedia.org/P41484 and previous config saved to /var/cache/conftool/dbconfig/20221128-215223-marostegui.json |
[production] |
21:52 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on db2147.codfw.wmnet with reason: Maintenance |
[production] |
21:52 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 5:00:00 on db2147.codfw.wmnet with reason: Maintenance |
[production] |
21:52 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on db2139.codfw.wmnet with reason: Maintenance |
[production] |
21:51 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 5:00:00 on db2139.codfw.wmnet with reason: Maintenance |
[production] |
21:51 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2138:3314 (T321126)', diff saved to https://phabricator.wikimedia.org/P41483 and previous config saved to /var/cache/conftool/dbconfig/20221128-215151-marostegui.json |
[production] |
21:46 |
<sukhe@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on cp[5004,5009].eqsin.wmnet with reason: downtimed, to be depooled |
[production] |
21:46 |
<sukhe@cumin2002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on cp[5004,5009].eqsin.wmnet with reason: downtimed, to be depooled |
[production] |
21:44 |
<sukhe@puppetmaster1001> |
conftool action : set/pooled=no; selector: name=cp5009.eqsin.wmnet,service=varnish-fe |
[production] |
21:44 |
<sukhe@puppetmaster1001> |
conftool action : set/pooled=no; selector: name=cp5009.eqsin.wmnet,service=ats-be |
[production] |
21:44 |
<sukhe@puppetmaster1001> |
conftool action : set/pooled=no; selector: name=cp5009.eqsin.wmnet,service=ats-tls |
[production] |
21:44 |
<sukhe@puppetmaster1001> |
conftool action : set/pooled=no; selector: name=cp5004.eqsin.wmnet,service=varnish-fe |
[production] |
21:44 |
<sukhe@puppetmaster1001> |
conftool action : set/pooled=no; selector: name=cp5004.eqsin.wmnet,service=ats-be |
[production] |
21:44 |
<sukhe@puppetmaster1001> |
conftool action : set/pooled=no; selector: name=cp5004.eqsin.wmnet,service=ats-tls |
[production] |
21:42 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1188', diff saved to https://phabricator.wikimedia.org/P41482 and previous config saved to /var/cache/conftool/dbconfig/20221128-214208-ladsgroup.json |
[production] |
21:36 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2138:3314', diff saved to https://phabricator.wikimedia.org/P41481 and previous config saved to /var/cache/conftool/dbconfig/20221128-213645-marostegui.json |
[production] |
21:33 |
<cjming> |
end of UTC late backport window |
[production] |
21:27 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1188 (T323827)', diff saved to https://phabricator.wikimedia.org/P41480 and previous config saved to /var/cache/conftool/dbconfig/20221128-212702-ladsgroup.json |
[production] |
21:23 |
<sukhe@cumin2002> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts cp[5003,5008].eqsin.wmnet |
[production] |
21:23 |
<sukhe@cumin2002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
21:23 |
<sukhe@cumin2002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: cp[5003,5008].eqsin.wmnet decommissioned, removing all IPs except the asset tag one - sukhe@cumin2002" |
[production] |
21:21 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2138:3314', diff saved to https://phabricator.wikimedia.org/P41479 and previous config saved to /var/cache/conftool/dbconfig/20221128-212138-marostegui.json |
[production] |
21:20 |
<sukhe@cumin2002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: cp[5003,5008].eqsin.wmnet decommissioned, removing all IPs except the asset tag one - sukhe@cumin2002" |
[production] |
21:18 |
<sukhe@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
21:16 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mw-debug: apply |
[production] |
21:15 |
<cjming@deploy1002> |
Finished scap: Backport for [[gerrit:861397|Enable shared Reading Lists landing page on all wikis. (T313269)]] (duration: 06m 22s) |
[production] |
21:13 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mw-debug: apply |
[production] |
21:13 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mw-debug: apply |
[production] |
21:12 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mw-debug: apply |
[production] |
21:12 |
<sukhe@cumin2002> |
START - Cookbook sre.hosts.decommission for hosts cp[5003,5008].eqsin.wmnet |
[production] |
21:10 |
<cjming@deploy1002> |
cjming and dbrant: Backport for [[gerrit:861397|Enable shared Reading Lists landing page on all wikis. (T313269)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet |
[production] |
21:09 |
<cjming@deploy1002> |
Started scap: Backport for [[gerrit:861397|Enable shared Reading Lists landing page on all wikis. (T313269)]] |
[production] |
21:06 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2138:3314 (T321126)', diff saved to https://phabricator.wikimedia.org/P41478 and previous config saved to /var/cache/conftool/dbconfig/20221128-210632-marostegui.json |
[production] |
21:06 |
<pt1979@cumin2002> |
START - Cookbook sre.hosts.reimage for host arclamp1001.eqiad.wmnet with OS bullseye |
[production] |
21:04 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depooling db2138:3314 (T321126)', diff saved to https://phabricator.wikimedia.org/P41477 and previous config saved to /var/cache/conftool/dbconfig/20221128-210419-marostegui.json |
[production] |
21:04 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on db2138.codfw.wmnet with reason: Maintenance |
[production] |
21:04 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 5:00:00 on db2138.codfw.wmnet with reason: Maintenance |
[production] |
21:04 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2137:3314 (T321126)', diff saved to https://phabricator.wikimedia.org/P41476 and previous config saved to /var/cache/conftool/dbconfig/20221128-210408-marostegui.json |
[production] |
21:02 |
<sukhe@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on cp5008.eqsin.wmnet with reason: downtimed, to be depooled |
[production] |
21:02 |
<sukhe@cumin2002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on cp5008.eqsin.wmnet with reason: downtimed, to be depooled |
[production] |
21:02 |
<sukhe@puppetmaster1001> |
conftool action : set/pooled=no; selector: name=cp5008.eqsin.wmnet,service=varnish-fe |
[production] |
21:02 |
<sukhe@puppetmaster1001> |
conftool action : set/pooled=no; selector: name=cp5008.eqsin.wmnet,service=ats-be |
[production] |