2025-06-18
§
|
20:07 |
<jhancock@cumin1003> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host aux-k8s-worker2009.codfw.wmnet with OS bookworm |
[production] |
20:07 |
<jhancock@cumin1003> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jhancock@cumin1003" |
[production] |
20:06 |
<ebernhardson@deploy1003> |
ebernhardson, ksarabia: Backport for [[gerrit:1160858|Revert "Enable new mobile search experience everywhere (not including empty search recommendations)"]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
20:06 |
<jhancock@cumin1003> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jhancock@cumin1003" |
[production] |
20:04 |
<ebernhardson@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1160858|Revert "Enable new mobile search experience everywhere (not including empty search recommendations)"]] |
[production] |
20:03 |
<jhancock@cumin1003> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host aux-k8s-worker2006.codfw.wmnet with OS bookworm |
[production] |
20:03 |
<jhancock@cumin1003> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jhancock@cumin1003" |
[production] |
20:03 |
<jhancock@cumin1003> |
END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host cloudcephosd2007-dev |
[production] |
20:03 |
<jhancock@cumin1003> |
START - Cookbook sre.network.configure-switch-interfaces for host cloudcephosd2007-dev |
[production] |
20:03 |
<jhancock@cumin1003> |
END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host cloudcephosd2006-dev |
[production] |
20:03 |
<jhancock@cumin1003> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jhancock@cumin1003" |
[production] |
20:03 |
<jhancock@cumin1003> |
START - Cookbook sre.network.configure-switch-interfaces for host cloudcephosd2006-dev |
[production] |
20:03 |
<jhancock@cumin1003> |
END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host cloudcephosd2005-dev |
[production] |
20:02 |
<jhancock@cumin1003> |
START - Cookbook sre.network.configure-switch-interfaces for host cloudcephosd2005-dev |
[production] |
20:02 |
<jhancock@cumin1003> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
19:59 |
<jhancock@cumin1003> |
START - Cookbook sre.dns.netbox |
[production] |
19:56 |
<dancy@deploy1003> |
Installation of scap version "4.179.0" completed for 2 hosts |
[production] |
19:55 |
<jhancock@cumin1003> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on aux-k8s-worker2008.codfw.wmnet with reason: host reimage |
[production] |
19:54 |
<jhancock@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on aux-k8s-worker2007.codfw.wmnet with reason: host reimage |
[production] |
19:54 |
<dancy@deploy1003> |
Installing scap version "4.179.0" for 2 host(s) |
[production] |
19:50 |
<jhancock@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on aux-k8s-worker2009.codfw.wmnet with reason: host reimage |
[production] |
19:47 |
<jhancock@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on aux-k8s-worker2006.codfw.wmnet with reason: host reimage |
[production] |
19:44 |
<jhancock@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on aux-k8s-worker2009.codfw.wmnet with reason: host reimage |
[production] |
19:43 |
<jhancock@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on aux-k8s-worker2008.codfw.wmnet with reason: host reimage |
[production] |
19:43 |
<jhancock@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on aux-k8s-worker2007.codfw.wmnet with reason: host reimage |
[production] |
19:43 |
<jhancock@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on aux-k8s-worker2006.codfw.wmnet with reason: host reimage |
[production] |
19:32 |
<ryankemper> |
T393966 Ran puppet on `titan1001` following merge of https://gerrit.wikimedia.org/r/c/operations/puppet/+/1155335. Puppet looks happy and I see the new recording rules getting created |
[production] |
19:31 |
<jhancock@cumin1003> |
START - Cookbook sre.hosts.reimage for host aux-k8s-worker2009.codfw.wmnet with OS bookworm |
[production] |
19:31 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on dbstore1008.eqiad.wmnet with reason: Maintenance |
[production] |
19:31 |
<jhancock@cumin1003> |
START - Cookbook sre.hosts.reimage for host aux-k8s-worker2008.codfw.wmnet with OS bookworm |
[production] |
19:31 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1251 (T396130)', diff saved to https://phabricator.wikimedia.org/P78387 and previous config saved to /var/cache/conftool/dbconfig/20250618-193101-marostegui.json |
[production] |
19:31 |
<jhancock@cumin1003> |
START - Cookbook sre.hosts.reimage for host aux-k8s-worker2007.codfw.wmnet with OS bookworm |
[production] |
19:30 |
<jhancock@cumin1003> |
START - Cookbook sre.hosts.reimage for host aux-k8s-worker2006.codfw.wmnet with OS bookworm |
[production] |
19:15 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1251', diff saved to https://phabricator.wikimedia.org/P78386 and previous config saved to /var/cache/conftool/dbconfig/20250618-191553-marostegui.json |
[production] |
19:14 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Testing T395696', diff saved to https://phabricator.wikimedia.org/P78385 and previous config saved to /var/cache/conftool/dbconfig/20250618-191440-ladsgroup.json |
[production] |
19:09 |
<ladsgroup@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1160990|etcd: Check for array key (T395696)]] (duration: 12m 39s) |
[production] |
19:07 |
<ejegg> |
civicrm upgraded from 63302c18 to 670b3f6b |
[production] |
19:05 |
<cdobbins@cumin2002> |
START - Cookbook sre.cdn.roll-upgrade-ats Rolling upgrade of ATS on A:cp-codfw and A:cp - 9.2.10 upgrade (T390912) |
[production] |
19:05 |
<ChrisDobbins901_> |
cdobbins@cumin2002:~$ sudo -i cookbook sre.cdn.roll-upgrade-ats --query 'A:cp-codfw' --task-id T390912 --reason '9.2.10 upgrade' |
[production] |
19:03 |
<ladsgroup@deploy1003> |
ladsgroup: Continuing with sync |
[production] |
19:00 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1251', diff saved to https://phabricator.wikimedia.org/P78384 and previous config saved to /var/cache/conftool/dbconfig/20250618-190045-marostegui.json |
[production] |
19:00 |
<wfan> |
payments-wiki upgraded from aa102260 to f56db8e6 |
[production] |
18:59 |
<ladsgroup@deploy1003> |
ladsgroup: Backport for [[gerrit:1160990|etcd: Check for array key (T395696)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
18:57 |
<ladsgroup@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1160990|etcd: Check for array key (T395696)]] |
[production] |
18:56 |
<ryankemper@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5 days, 0:00:00 on 6 hosts with reason: T395772 hosts not serving production traffic |
[production] |
18:55 |
<ladsgroup@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1152853|etcd: Remove ES clusters from "write clusters" if section is RO (T395696)]] (duration: 26m 55s) |
[production] |
18:49 |
<ladsgroup@deploy1003> |
ladsgroup: Continuing with sync |
[production] |
18:47 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.cdn.roll-upgrade-ats (exit_code=0) Rolling upgrade of ATS on A:cp-eqiad and A:cp - 9.2.10 upgrade (T390912) |
[production] |
18:45 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1251 (T396130)', diff saved to https://phabricator.wikimedia.org/P78383 and previous config saved to /var/cache/conftool/dbconfig/20250618-184538-marostegui.json |
[production] |
18:43 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Testing T395696', diff saved to https://phabricator.wikimedia.org/P78382 and previous config saved to /var/cache/conftool/dbconfig/20250618-184325-ladsgroup.json |
[production] |