2025-06-26
16:57 |
<fabfur> |
repooled cp7006 |
[production] |
16:56 |
<fabfur@cumin1002> |
conftool action : set/pooled=yes; selector: name=cp7006.magru.wmnet |
[production] |
16:55 |
<eevans@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on sessionstore2005.codfw.wmnet with reason: host reimage |
[production] |
16:55 |
<jhathaway@cumin2002> |
START - Cookbook sre.hosts.reimage for host sretest2001.codfw.wmnet with OS bookworm |
[production] |
16:55 |
<jhancock@cumin1003> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host sretest2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
16:52 |
<jhancock@cumin1003> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host cp2043.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
16:52 |
<mvernon@cumin1002> |
START - Cookbook sre.swift.check-dbs Checking container DBs of wikipedia-commons-local-thumb.f9 |
[production] |
16:52 |
<mvernon@cumin1002> |
END (PASS) - Cookbook sre.swift.check-dbs (exit_code=0) Checking container DBs of wikipedia-commons-local-thumb.f8 |
[production] |
16:44 |
<jhancock@cumin1003> |
START - Cookbook sre.hosts.provision for host sretest2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
16:44 |
<mvernon@cumin1002> |
START - Cookbook sre.swift.check-dbs Checking container DBs of wikipedia-commons-local-thumb.f8 |
[production] |
16:44 |
<mvernon@cumin1002> |
END (PASS) - Cookbook sre.swift.check-dbs (exit_code=0) Checking container DBs of wikipedia-commons-local-thumb.f7 |
[production] |
16:43 |
<mnz@deploy1003> |
Finished deploy [airflow-dags/research@19c55cd]: (no justification provided) (duration: 00m 48s) |
[production] |
16:43 |
<mnz@deploy1003> |
Started deploy [airflow-dags/research@19c55cd]: (no justification provided) |
[production] |
16:42 |
<jhancock@cumin1003> |
START - Cookbook sre.hosts.provision for host cp2043.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
16:39 |
<jhathaway@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest2001.codfw.wmnet with OS bookworm |
[production] |
16:35 |
<eevans@cumin1003> |
START - Cookbook sre.hosts.reimage for host sessionstore2005.codfw.wmnet with OS bullseye |
[production] |
16:34 |
<fabfur@cumin1002> |
conftool action : set/pooled=no; selector: name=cp7006.magru.wmnet |
[production] |
16:33 |
<urandom> |
decommissioning Cassandra/sessionstore2005-a — T390514 |
[production] |
16:33 |
<fabfur@cumin1002> |
conftool action : set/pooled=yes; selector: name=cp7006.magru.wmnet |
[production] |
16:32 |
<dcaro@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component components-api |
[tools] |
16:31 |
<fabfur@cumin1002> |
conftool action : set/pooled=no; selector: name=cp7006.magru.wmnet |
[production] |
16:30 |
<mvernon@cumin1002> |
START - Cookbook sre.swift.check-dbs Checking container DBs of wikipedia-commons-local-thumb.f7 |
[production] |
16:30 |
<mvernon@cumin1002> |
END (PASS) - Cookbook sre.swift.check-dbs (exit_code=0) Checking container DBs of wikipedia-commons-local-thumb.f6 |
[production] |
16:30 |
<fabfur> |
depool cp7006 for a quick test (T397917) |
[production] |
16:29 |
<dcaro@cloudcumin1001> |
START - Cookbook wmcs.toolforge.component.deploy for component components-api |
[tools] |
16:24 |
<dcaro@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component components-api |
[toolsbeta] |
16:20 |
<dcaro@cloudcumin1001> |
START - Cookbook wmcs.toolforge.component.deploy for component components-api |
[toolsbeta] |
16:19 |
<dcaro@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component jobs-api |
[tools] |
16:14 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.ceph.osd.bootstrap_and_add (exit_code=0) |
[admin] |
16:13 |
<mvernon@cumin1002> |
START - Cookbook sre.swift.check-dbs Checking container DBs of wikipedia-commons-local-thumb.f6 |
[production] |
16:13 |
<mvernon@cumin1002> |
END (PASS) - Cookbook sre.swift.check-dbs (exit_code=0) Checking container DBs of wikipedia-commons-local-thumb.f5 |
[production] |
16:11 |
<oblivian@cumin1003> |
END (PASS) - Cookbook sre.deploy.hiddenparma (exit_code=0) Hiddenparma deployment to the alerting hosts with reason: "Bugfixes; code refactoring - oblivian@cumin1003" |
[production] |
16:11 |
<oblivian@cumin1003> |
END (PASS) - Cookbook sre.deploy.python-code (exit_code=0) hiddenparma to alert[1002,2002].wikimedia.org with reason: Bugfixes; code refactoring - oblivian@cumin1003 |
[production] |
16:11 |
<dcaro@cloudcumin1001> |
START - Cookbook wmcs.toolforge.component.deploy for component jobs-api |
[tools] |
16:11 |
<oblivian@cumin1003> |
START - Cookbook sre.deploy.python-code hiddenparma to alert[1002,2002].wikimedia.org with reason: Bugfixes; code refactoring - oblivian@cumin1003 |
[production] |
16:11 |
<oblivian@cumin1003> |
START - Cookbook sre.deploy.hiddenparma Hiddenparma deployment to the alerting hosts with reason: "Bugfixes; code refactoring - oblivian@cumin1003" |
[production] |
16:10 |
<dcaro@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component jobs-api |
[toolsbeta] |
16:02 |
<dcaro@cloudcumin1001> |
START - Cookbook wmcs.toolforge.component.deploy for component jobs-api |
[toolsbeta] |
16:02 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.ceph.osd.bootstrap_and_add |
[admin] |
16:01 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.ceph.osd.undrain_node (exit_code=0) |
[admin] |
16:01 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.ceph.osd.undrain_node |
[admin] |
16:00 |
<mvernon@cumin1002> |
START - Cookbook sre.swift.check-dbs Checking container DBs of wikipedia-commons-local-thumb.f5 |
[production] |
16:00 |
<mvernon@cumin1002> |
END (PASS) - Cookbook sre.swift.check-dbs (exit_code=0) Checking container DBs of wikipedia-commons-local-thumb.f4 |
[production] |
15:59 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.ceph.osd.bootstrap_and_add (exit_code=0) |
[admin] |
15:52 |
<sukhe> |
sudo cumin -b11 'A:cp' "run-puppet-agent --enable 'merging CR 1163843'" |
[production] |
15:51 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.ceph.osd.bootstrap_and_add |
[admin] |
15:46 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirtlocal1003.eqiad.wmnet}' |
[admin] |
15:44 |
<mvernon@cumin1002> |
START - Cookbook sre.swift.check-dbs Checking container DBs of wikipedia-commons-local-thumb.f4 |
[production] |
15:43 |
<mvernon@cumin1002> |
END (PASS) - Cookbook sre.swift.check-dbs (exit_code=0) Checking container DBs of wikipedia-commons-local-thumb.f3 |
[production] |
15:41 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirtlocal1003.eqiad.wmnet}' |
[admin] |