51-100 of 10000 results (92ms)
2025-07-03 ยง
15:46 <hnowlan@deploy1003> helmfile [codfw] START helmfile.d/services/api-gateway: apply [production]
15:42 <jclark@cumin1002> START - Cookbook sre.hosts.reimage for host an-worker1176.eqiad.wmnet with OS bullseye [production]
15:38 <jclark@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-worker1176.eqiad.wmnet with OS bullseye [production]
15:34 <vgutierrez@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on cp7006.magru.wmnet with reason: testing [production]
15:33 <vgutierrez> depooling cp7006 for testing [production]
15:31 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2213 (T395241)', diff saved to https://phabricator.wikimedia.org/P78755 and previous config saved to /var/cache/conftool/dbconfig/20250703-153141-fceratto.json [production]
15:25 <jmm@cumin1003> END (PASS) - Cookbook sre.k8s.reboot-nodes (exit_code=0) rolling reboot on A:aux-worker-codfw [production]
15:23 <fceratto@deploy1003> helmfile [aux-k8s-eqiad] 'sync' command on namespace 'zarcillo' for release 'main' . [production]
15:22 <vgutierrez> lvs5006 migrated to katran - T396561 [production]
15:21 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for lvs5006.eqsin.wmnet [production]
15:21 <vgutierrez@cumin1002> START - Cookbook sre.hosts.remove-downtime for lvs5006.eqsin.wmnet [production]
15:16 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2213', diff saved to https://phabricator.wikimedia.org/P78754 and previous config saved to /var/cache/conftool/dbconfig/20250703-151633-fceratto.json [production]
15:10 <vgutierrez@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on lvs5006.eqsin.wmnet with reason: katran migration [production]
15:04 <jmm@cumin1003> START - Cookbook sre.k8s.reboot-nodes rolling reboot on A:aux-worker-codfw [production]
15:04 <jmm@cumin1003> END (PASS) - Cookbook sre.k8s.reboot-nodes (exit_code=0) rolling reboot on A:aux-worker-eqiad [production]
15:01 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2213', diff saved to https://phabricator.wikimedia.org/P78753 and previous config saved to /var/cache/conftool/dbconfig/20250703-150126-fceratto.json [production]
14:56 <jynus@cumin1002> DONE (PASS) - Cookbook sre.puppet.renew-cert (exit_code=0) for backup1007.eqiad.wmnet: Renew puppet certificate - jynus@cumin1002 [production]
14:55 <cgoubert@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host registry1005.eqiad.wmnet [production]
14:51 <cgoubert@cumin1003> START - Cookbook sre.hosts.reboot-single for host registry1005.eqiad.wmnet [production]
14:50 <jynus@cumin1002> DONE (PASS) - Cookbook sre.puppet.renew-cert (exit_code=0) for backup1006.eqiad.wmnet: Renew puppet certificate - jynus@cumin1002 [production]
14:50 <volans> uploaded debmonitor-server,python3-debmonitor_0.6.6 to apt.wikimedia.org bookworm-wikimedia [production]
14:49 <cgoubert@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host registry1004.eqiad.wmnet [production]
14:48 <vgutierrez> repooling cp7006 [production]
14:46 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2213 (T395241)', diff saved to https://phabricator.wikimedia.org/P78752 and previous config saved to /var/cache/conftool/dbconfig/20250703-144619-fceratto.json [production]
14:45 <cgoubert@cumin1003> START - Cookbook sre.hosts.reboot-single for host registry1004.eqiad.wmnet [production]
14:45 <jmm@dns1004> END - running authdns-update [production]
14:44 <jmm@dns1004> START - running authdns-update [production]
14:43 <jmm@cumin1003> START - Cookbook sre.k8s.reboot-nodes rolling reboot on A:aux-worker-eqiad [production]
14:38 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db2213 (T395241)', diff saved to https://phabricator.wikimedia.org/P78751 and previous config saved to /var/cache/conftool/dbconfig/20250703-143854-fceratto.json [production]
14:38 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2213.codfw.wmnet with reason: Maintenance [production]
14:32 <moritzm> installing bootstrap4 security updates [production]
14:23 <jclark@cumin1002> START - Cookbook sre.hosts.reimage for host an-worker1176.eqiad.wmnet with OS bullseye [production]
14:17 <vgutierrez> depooling cp7006 for testing [production]
14:09 <jynus@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on backup1007.eqiad.wmnet with reason: Maintenance and reboot [production]
14:08 <jynus@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on backup1006.eqiad.wmnet with reason: Maintenance and reboot [production]
14:05 <moritzm> restarting clamav to pick up libxml security updates [production]
14:03 <jmm@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host zookeeper-test1002.eqiad.wmnet [production]
13:59 <jmm@cumin1003> START - Cookbook sre.hosts.reboot-single for host zookeeper-test1002.eqiad.wmnet [production]
13:47 <jmm@cumin1003> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1047.eqiad.wmnet [production]
13:46 <jmm@cumin1003> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1047.eqiad.wmnet [production]
13:46 <sukhe> sudo cumin 'A:wikidough' "disable-puppet 'merging CR 1163859'" [production]
13:45 <cgoubert@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host registry2005.codfw.wmnet [production]
13:40 <moritzm> installing libxml2 security updates on bookworm [production]
13:40 <cgoubert@cumin1003> START - Cookbook sre.hosts.reboot-single for host registry2005.codfw.wmnet [production]
13:40 <cgoubert@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host registry2004.codfw.wmnet [production]
13:39 <andrew@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephmon2005-dev.codfw.wmnet with OS bullseye [production]
13:38 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of durum6001.drmrs.wmnet to drbd [production]
13:35 <cgoubert@cumin1003> START - Cookbook sre.hosts.reboot-single for host registry2004.codfw.wmnet [production]
13:28 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of durum6001.drmrs.wmnet to drbd [production]
13:26 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of doh6001.wikimedia.org to drbd [production]