1-50 of 10000 results (100ms)
2026-04-25 §
02:07 <mwpresync@deploy1003> Finished scap build-images: Publishing wmf/next image (duration: 06m 11s) [production]
02:05 <ladsgroup@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on db2198.codfw.wmnet with reason: Maintenance [production]
02:05 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2195 (T410589)', diff saved to https://phabricator.wikimedia.org/P91523 and previous config saved to /var/cache/conftool/dbconfig/20260425-020544-ladsgroup.json [production]
02:00 <mwpresync@deploy1003> Started scap build-images: Publishing wmf/next image [production]
01:55 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2195', diff saved to https://phabricator.wikimedia.org/P91522 and previous config saved to /var/cache/conftool/dbconfig/20260425-015535-ladsgroup.json [production]
01:45 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2195', diff saved to https://phabricator.wikimedia.org/P91521 and previous config saved to /var/cache/conftool/dbconfig/20260425-014528-ladsgroup.json [production]
01:35 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2195 (T410589)', diff saved to https://phabricator.wikimedia.org/P91520 and previous config saved to /var/cache/conftool/dbconfig/20260425-013520-ladsgroup.json [production]
2026-04-24 §
20:28 <jclark@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host druid-internal1005.eqiad.wmnet with OS trixie [production]
20:28 <jclark@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1003" [production]
20:27 <jclark@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1003" [production]
20:23 <jclark@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host druid-internal1002.eqiad.wmnet with OS trixie [production]
20:23 <jclark@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1003" [production]
20:19 <jclark@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1003" [production]
20:15 <jclark@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host druid-internal1004.eqiad.wmnet with OS trixie [production]
20:15 <jclark@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1003" [production]
20:14 <jclark@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1003" [production]
20:12 <jclark@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host druid-internal1006.eqiad.wmnet with OS trixie [production]
20:12 <jclark@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1003" [production]
20:09 <jclark@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1003" [production]
20:09 <jclark@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on druid-internal1005.eqiad.wmnet with reason: host reimage [production]
20:07 <jclark@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host druid-internal1001.eqiad.wmnet with OS trixie [production]
20:07 <jclark@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1003" [production]
20:06 <jclark@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1003" [production]
20:04 <jclark@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on druid-internal1002.eqiad.wmnet with reason: host reimage [production]
20:04 <jclark@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host druid-internal1003.eqiad.wmnet with OS trixie [production]
20:04 <jclark@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1003" [production]
20:03 <jclark@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on druid-internal1005.eqiad.wmnet with reason: host reimage [production]
20:01 <jclark@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1003" [production]
19:59 <jclark@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on druid-internal1004.eqiad.wmnet with reason: host reimage [production]
19:54 <jclark@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on druid-internal1006.eqiad.wmnet with reason: host reimage [production]
19:52 <eevans@deploy1003> helmfile [staging] DONE helmfile.d/services/linked-artifacts: apply [production]
19:51 <eevans@deploy1003> helmfile [staging] START helmfile.d/services/linked-artifacts: apply [production]
19:51 <eevans@deploy1003> helmfile [staging] DONE helmfile.d/services/linked-artifacts: apply [production]
19:51 <eevans@deploy1003> helmfile [staging] START helmfile.d/services/linked-artifacts: apply [production]
19:51 <jclark@cumin1003> START - Cookbook sre.hosts.reimage for host druid-internal1005.eqiad.wmnet with OS trixie [production]
19:50 <jclark@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on druid-internal1001.eqiad.wmnet with reason: host reimage [production]
19:49 <jclark@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on druid-internal1006.eqiad.wmnet with reason: host reimage [production]
19:49 <jclark@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host druid-internal1005.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
19:45 <jclark@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on druid-internal1003.eqiad.wmnet with reason: host reimage [production]
19:41 <jclark@cumin1003> START - Cookbook sre.hosts.provision for host druid-internal1005.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
19:40 <jclark@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host druid-internal1005.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
19:40 <jclark@cumin1003> START - Cookbook sre.hosts.provision for host druid-internal1005.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
19:38 <jclark@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on druid-internal1002.eqiad.wmnet with reason: host reimage [production]
19:38 <jclark@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on druid-internal1004.eqiad.wmnet with reason: host reimage [production]
19:38 <jclark@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on druid-internal1001.eqiad.wmnet with reason: host reimage [production]
19:38 <jclark@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on druid-internal1003.eqiad.wmnet with reason: host reimage [production]
19:37 <jclark@cumin1003> START - Cookbook sre.hosts.reimage for host druid-internal1006.eqiad.wmnet with OS trixie [production]
19:37 <eevans@deploy1003> helmfile [staging] DONE helmfile.d/services/linked-artifacts: apply [production]
19:37 <eevans@deploy1003> helmfile [staging] START helmfile.d/services/linked-artifacts: apply [production]
19:36 <jclark@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host druid-internal1006.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]