101-150 of 10000 results (18ms)
2025-01-25 §
01:57 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db2189 (T384592)', diff saved to https://phabricator.wikimedia.org/P72411 and previous config saved to /var/cache/conftool/dbconfig/20250125-015731-marostegui.json [production]
01:57 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on db2189.codfw.wmnet with reason: Maintenance [production]
01:57 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2175 (T384592)', diff saved to https://phabricator.wikimedia.org/P72410 and previous config saved to /var/cache/conftool/dbconfig/20250125-015709-marostegui.json [production]
01:42 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2175', diff saved to https://phabricator.wikimedia.org/P72409 and previous config saved to /var/cache/conftool/dbconfig/20250125-014201-marostegui.json [production]
01:26 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2175', diff saved to https://phabricator.wikimedia.org/P72408 and previous config saved to /var/cache/conftool/dbconfig/20250125-012654-marostegui.json [production]
01:11 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2175 (T384592)', diff saved to https://phabricator.wikimedia.org/P72407 and previous config saved to /var/cache/conftool/dbconfig/20250125-011147-marostegui.json [production]
00:19 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db2175 (T384592)', diff saved to https://phabricator.wikimedia.org/P72406 and previous config saved to /var/cache/conftool/dbconfig/20250125-001950-marostegui.json [production]
00:19 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on db2175.codfw.wmnet with reason: Maintenance [production]
00:19 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2148 (T384592)', diff saved to https://phabricator.wikimedia.org/P72405 and previous config saved to /var/cache/conftool/dbconfig/20250125-001929-marostegui.json [production]
00:08 <vriley@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host cloudgw1004.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
00:04 <vriley@cumin1002> START - Cookbook sre.hosts.provision for host cloudgw1004.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
00:04 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2148', diff saved to https://phabricator.wikimedia.org/P72404 and previous config saved to /var/cache/conftool/dbconfig/20250125-000422-marostegui.json [production]
00:04 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host cloudgw1004.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
2025-01-24 §
23:49 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2148', diff saved to https://phabricator.wikimedia.org/P72403 and previous config saved to /var/cache/conftool/dbconfig/20250124-234914-marostegui.json [production]
23:39 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for cp7004.magru.wmnet [production]
23:39 <brett@cumin2002> START - Cookbook sre.hosts.remove-downtime for cp7004.magru.wmnet [production]
23:39 <vriley@cumin1002> START - Cookbook sre.hosts.provision for host cloudgw1004.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
23:36 <brett@puppetserver1001> conftool action : set/pooled=yes; selector: name=cp7004.magru.wmnet [production]
23:34 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2148 (T384592)', diff saved to https://phabricator.wikimedia.org/P72402 and previous config saved to /var/cache/conftool/dbconfig/20250124-233407-marostegui.json [production]
23:30 <vriley@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host cloudgw1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
23:26 <vriley@cumin1002> START - Cookbook sre.hosts.provision for host cloudgw1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
22:56 <vriley@cumin1002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host cloudgw1004 [production]
22:55 <vriley@cumin1002> START - Cookbook sre.network.configure-switch-interfaces for host cloudgw1004 [production]
22:54 <vriley@cumin1002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host cloudgw1003 [production]
22:52 <vriley@cumin1002> START - Cookbook sre.network.configure-switch-interfaces for host cloudgw1003 [production]
22:43 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db2148 (T384592)', diff saved to https://phabricator.wikimedia.org/P72401 and previous config saved to /var/cache/conftool/dbconfig/20250124-224303-marostegui.json [production]
22:42 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on db2148.codfw.wmnet with reason: Maintenance [production]
22:18 <brett@puppetserver1001> conftool action : set/pooled=no; selector: name=cp7004.magru.wmnet [production]
22:18 <brett@puppetserver1001> conftool action : set/pooled=no; selector: name=cp7004.magru.wmnet,service=cdn [production]
22:11 <sukhe> pool bunch of cp7x in magru for ats-be that were depooled [production]
22:11 <sukhe@puppetserver1001> conftool action : set/pooled=yes; selector: name=cp7015.magru.wmnet,service=(cdn|ats-be) [production]
22:11 <sukhe@puppetserver1001> conftool action : set/pooled=yes; selector: name=cp7010.magru.wmnet,service=(cdn|ats-be) [production]
22:11 <sukhe@puppetserver1001> conftool action : set/pooled=yes; selector: name=cp700[2-4].magru.wmnet,service=(cdn|ats-be) [production]
22:10 <sukhe@puppetserver1001> conftool action : set/pooled=yes; selector: name=cp7006.magru.wmnet,service=(cdn|ats-be) [production]
22:10 <sukhe@puppetserver1001> conftool action : set/pooled=yes; selector: name=cp7008.magru.wmnet,service=(cdn|ats-be) [production]
22:10 <sukhe@puppetserver1001> conftool action : set/pooled=yes; selector: name=cp7003.magru.wmnet,service=(cdn|ats-be) [production]
22:10 <sukhe@puppetserver1001> conftool action : set/pooled=yes; selector: name=cp7001.magru.wmnet,service=(cdn|ats-be) [production]
22:08 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host cloudgw1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
22:08 <vriley@cumin1002> START - Cookbook sre.hosts.provision for host cloudgw1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
22:07 <vriley@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
22:07 <vriley@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update mgmt cloudgw1003 - vriley@cumin1002" [production]
22:05 <vriley@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update mgmt cloudgw1003 - vriley@cumin1002" [production]
22:02 <vriley@cumin1002> START - Cookbook sre.dns.netbox [production]
21:51 <brett@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp7004.magru.wmnet with reason: Thermal settings testing (T373993) [production]
21:50 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on dbstore1007.eqiad.wmnet with reason: Maintenance [production]
21:50 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1246 (T384592)', diff saved to https://phabricator.wikimedia.org/P72399 and previous config saved to /var/cache/conftool/dbconfig/20250124-215037-marostegui.json [production]
21:49 <brett@puppetserver1001> conftool action : set/pooled=no; selector: name=cp7004.magru.wmnet,service=cdn [production]
21:47 <brett> Testing thermal settings on cp7004 (T373993) [production]
21:44 <James_F> Revert "Zuul: Switch Fundraising jobs to REL1_43" [releng]
21:43 <amastilovic@deploy2002> Finished deploy [airflow-dags/platform_eng@ebb3680]: (no justification provided) (duration: 00m 31s) [production]