2025-01-25
§
|
01:42 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2175', diff saved to https://phabricator.wikimedia.org/P72409 and previous config saved to /var/cache/conftool/dbconfig/20250125-014201-marostegui.json |
[production] |
01:26 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2175', diff saved to https://phabricator.wikimedia.org/P72408 and previous config saved to /var/cache/conftool/dbconfig/20250125-012654-marostegui.json |
[production] |
01:11 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2175 (T384592)', diff saved to https://phabricator.wikimedia.org/P72407 and previous config saved to /var/cache/conftool/dbconfig/20250125-011147-marostegui.json |
[production] |
00:19 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db2175 (T384592)', diff saved to https://phabricator.wikimedia.org/P72406 and previous config saved to /var/cache/conftool/dbconfig/20250125-001950-marostegui.json |
[production] |
00:19 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on db2175.codfw.wmnet with reason: Maintenance |
[production] |
00:19 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2148 (T384592)', diff saved to https://phabricator.wikimedia.org/P72405 and previous config saved to /var/cache/conftool/dbconfig/20250125-001929-marostegui.json |
[production] |
00:08 |
<vriley@cumin1002> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host cloudgw1004.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
00:04 |
<vriley@cumin1002> |
START - Cookbook sre.hosts.provision for host cloudgw1004.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
00:04 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2148', diff saved to https://phabricator.wikimedia.org/P72404 and previous config saved to /var/cache/conftool/dbconfig/20250125-000422-marostegui.json |
[production] |
00:04 |
<vriley@cumin1002> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host cloudgw1004.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
2025-01-24
§
|
23:49 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2148', diff saved to https://phabricator.wikimedia.org/P72403 and previous config saved to /var/cache/conftool/dbconfig/20250124-234914-marostegui.json |
[production] |
23:39 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for cp7004.magru.wmnet |
[production] |
23:39 |
<brett@cumin2002> |
START - Cookbook sre.hosts.remove-downtime for cp7004.magru.wmnet |
[production] |
23:39 |
<vriley@cumin1002> |
START - Cookbook sre.hosts.provision for host cloudgw1004.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
23:36 |
<brett@puppetserver1001> |
conftool action : set/pooled=yes; selector: name=cp7004.magru.wmnet |
[production] |
23:34 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2148 (T384592)', diff saved to https://phabricator.wikimedia.org/P72402 and previous config saved to /var/cache/conftool/dbconfig/20250124-233407-marostegui.json |
[production] |
23:30 |
<vriley@cumin1002> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host cloudgw1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
23:26 |
<vriley@cumin1002> |
START - Cookbook sre.hosts.provision for host cloudgw1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
22:56 |
<vriley@cumin1002> |
END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host cloudgw1004 |
[production] |
22:55 |
<vriley@cumin1002> |
START - Cookbook sre.network.configure-switch-interfaces for host cloudgw1004 |
[production] |
22:54 |
<vriley@cumin1002> |
END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host cloudgw1003 |
[production] |
22:52 |
<vriley@cumin1002> |
START - Cookbook sre.network.configure-switch-interfaces for host cloudgw1003 |
[production] |
22:43 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db2148 (T384592)', diff saved to https://phabricator.wikimedia.org/P72401 and previous config saved to /var/cache/conftool/dbconfig/20250124-224303-marostegui.json |
[production] |
22:42 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on db2148.codfw.wmnet with reason: Maintenance |
[production] |
22:18 |
<brett@puppetserver1001> |
conftool action : set/pooled=no; selector: name=cp7004.magru.wmnet |
[production] |
22:18 |
<brett@puppetserver1001> |
conftool action : set/pooled=no; selector: name=cp7004.magru.wmnet,service=cdn |
[production] |
22:11 |
<sukhe> |
pool bunch of cp7x in magru for ats-be that were depooled |
[production] |
22:11 |
<sukhe@puppetserver1001> |
conftool action : set/pooled=yes; selector: name=cp7015.magru.wmnet,service=(cdn|ats-be) |
[production] |
22:11 |
<sukhe@puppetserver1001> |
conftool action : set/pooled=yes; selector: name=cp7010.magru.wmnet,service=(cdn|ats-be) |
[production] |
22:11 |
<sukhe@puppetserver1001> |
conftool action : set/pooled=yes; selector: name=cp700[2-4].magru.wmnet,service=(cdn|ats-be) |
[production] |
22:10 |
<sukhe@puppetserver1001> |
conftool action : set/pooled=yes; selector: name=cp7006.magru.wmnet,service=(cdn|ats-be) |
[production] |
22:10 |
<sukhe@puppetserver1001> |
conftool action : set/pooled=yes; selector: name=cp7008.magru.wmnet,service=(cdn|ats-be) |
[production] |
22:10 |
<sukhe@puppetserver1001> |
conftool action : set/pooled=yes; selector: name=cp7003.magru.wmnet,service=(cdn|ats-be) |
[production] |
22:10 |
<sukhe@puppetserver1001> |
conftool action : set/pooled=yes; selector: name=cp7001.magru.wmnet,service=(cdn|ats-be) |
[production] |
22:08 |
<vriley@cumin1002> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host cloudgw1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
22:08 |
<vriley@cumin1002> |
START - Cookbook sre.hosts.provision for host cloudgw1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
22:07 |
<vriley@cumin1002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
22:07 |
<vriley@cumin1002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update mgmt cloudgw1003 - vriley@cumin1002" |
[production] |
22:05 |
<vriley@cumin1002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update mgmt cloudgw1003 - vriley@cumin1002" |
[production] |
22:02 |
<vriley@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
21:51 |
<brett@cumin2002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp7004.magru.wmnet with reason: Thermal settings testing (T373993) |
[production] |
21:50 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on dbstore1007.eqiad.wmnet with reason: Maintenance |
[production] |
21:50 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1246 (T384592)', diff saved to https://phabricator.wikimedia.org/P72399 and previous config saved to /var/cache/conftool/dbconfig/20250124-215037-marostegui.json |
[production] |
21:49 |
<brett@puppetserver1001> |
conftool action : set/pooled=no; selector: name=cp7004.magru.wmnet,service=cdn |
[production] |
21:47 |
<brett> |
Testing thermal settings on cp7004 (T373993) |
[production] |
21:43 |
<amastilovic@deploy2002> |
Finished deploy [airflow-dags/platform_eng@ebb3680]: (no justification provided) (duration: 00m 31s) |
[production] |
21:42 |
<amastilovic@deploy2002> |
Started deploy [airflow-dags/platform_eng@ebb3680]: (no justification provided) |
[production] |
21:35 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1246', diff saved to https://phabricator.wikimedia.org/P72398 and previous config saved to /var/cache/conftool/dbconfig/20250124-213530-marostegui.json |
[production] |
21:20 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1246', diff saved to https://phabricator.wikimedia.org/P72397 and previous config saved to /var/cache/conftool/dbconfig/20250124-212023-marostegui.json |
[production] |
21:15 |
<amastilovic@deploy2002> |
Finished deploy [airflow-dags/platform_eng@3907ed7]: (no justification provided) (duration: 00m 10s) |
[production] |