101-150 of 10000 results (26ms)
2026-04-30 ยง
10:20 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2174 (T419961)', diff saved to https://phabricator.wikimedia.org/P92025 and previous config saved to /var/cache/conftool/dbconfig/20260430-102026-fceratto.json [production]
10:14 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host hcaptcha-proxy5004.wikimedia.org [production]
10:14 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host hcaptcha-proxy5004.wikimedia.org with OS bookworm [production]
10:10 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2174', diff saved to https://phabricator.wikimedia.org/P92024 and previous config saved to /var/cache/conftool/dbconfig/20260430-101017-fceratto.json [production]
10:00 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2174', diff saved to https://phabricator.wikimedia.org/P92023 and previous config saved to /var/cache/conftool/dbconfig/20260430-100009-fceratto.json [production]
09:54 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on hcaptcha-proxy5004.wikimedia.org with reason: host reimage [production]
09:50 <jmm@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on hcaptcha-proxy5004.wikimedia.org with reason: host reimage [production]
09:50 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2174 (T419961)', diff saved to https://phabricator.wikimedia.org/P92022 and previous config saved to /var/cache/conftool/dbconfig/20260430-095000-fceratto.json [production]
09:42 <fceratto@cumin1003> dbctl commit (dc=all): 'Depooling db2174 (T419961)', diff saved to https://phabricator.wikimedia.org/P92021 and previous config saved to /var/cache/conftool/dbconfig/20260430-094239-fceratto.json [production]
09:42 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2174.codfw.wmnet with reason: Maintenance [production]
09:42 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2173 (T419961)', diff saved to https://phabricator.wikimedia.org/P92020 and previous config saved to /var/cache/conftool/dbconfig/20260430-094210-fceratto.json [production]
09:32 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2173', diff saved to https://phabricator.wikimedia.org/P92019 and previous config saved to /var/cache/conftool/dbconfig/20260430-093202-fceratto.json [production]
09:27 <moritzm> failover Ganeti master in ulsfo02 to ganeti4005 in preparation of forthcoming switch maintenance in ulsfo T424686 [production]
09:24 <moritzm> temporarily remove ganeti4006 from the ulsfo02 Ganeti cluster in preparation of forthcoming switch maintenance in ulsfo T424686 [production]
09:21 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2173', diff saved to https://phabricator.wikimedia.org/P92018 and previous config saved to /var/cache/conftool/dbconfig/20260430-092154-fceratto.json [production]
09:17 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti4006.ulsfo.wmnet [production]
09:15 <brouberol@cumin1003> END (ERROR) - Cookbook sre.kafka.roll-restart-reboot-brokers (exit_code=97) rolling restart_daemons on A:kafka-jumbo-eqiad [production]
09:12 <brouberol@cumin1003> START - Cookbook sre.kafka.roll-restart-reboot-brokers rolling restart_daemons on A:kafka-jumbo-eqiad [production]
09:11 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2173 (T419961)', diff saved to https://phabricator.wikimedia.org/P92017 and previous config saved to /var/cache/conftool/dbconfig/20260430-091147-fceratto.json [production]
09:04 <fceratto@cumin1003> dbctl commit (dc=all): 'Depooling db2173 (T419961)', diff saved to https://phabricator.wikimedia.org/P92016 and previous config saved to /var/cache/conftool/dbconfig/20260430-090408-fceratto.json [production]
09:04 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2173.codfw.wmnet with reason: Maintenance [production]
09:03 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2170 (T419961)', diff saved to https://phabricator.wikimedia.org/P92015 and previous config saved to /var/cache/conftool/dbconfig/20260430-090337-fceratto.json [production]
09:01 <dcausse@deploy1003> helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
09:01 <dcausse@deploy1003> helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply [production]
08:57 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host hcaptcha-proxy5004.wikimedia.org with OS bookworm [production]
08:56 <dcausse@deploy1003> helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
08:56 <dcausse@deploy1003> helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply [production]
08:55 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM hcaptcha-proxy5004.wikimedia.org - jmm@cumin2002" [production]
08:55 <jmm@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM hcaptcha-proxy5004.wikimedia.org - jmm@cumin2002" [production]
08:55 <jmm@cumin2002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) hcaptcha-proxy5004.wikimedia.org on all recursors [production]
08:55 <jmm@cumin2002> START - Cookbook sre.dns.wipe-cache hcaptcha-proxy5004.wikimedia.org on all recursors [production]
08:55 <jmm@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
08:55 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM hcaptcha-proxy5004.wikimedia.org - jmm@cumin2002" [production]
08:54 <jmm@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM hcaptcha-proxy5004.wikimedia.org - jmm@cumin2002" [production]
08:54 <bwojtowicz@deploy1003> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'articletopic-outlink' for release 'main' . [production]
08:54 <dpogorzelski@deploy1003> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'articletopic-outlink' for release 'main' . [production]
08:54 <dpogorzelski@deploy1003> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'articletopic-outlink' for release 'main' . [production]
08:53 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2170', diff saved to https://phabricator.wikimedia.org/P92014 and previous config saved to /var/cache/conftool/dbconfig/20260430-085329-fceratto.json [production]
08:51 <jmm@cumin2002> START - Cookbook sre.dns.netbox [production]
08:51 <jmm@cumin2002> START - Cookbook sre.ganeti.makevm for new host hcaptcha-proxy5004.wikimedia.org [production]
08:50 <jmm@cumin2002> END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) for new host hcaptcha-proxy5004.wikimedia.org [production]
08:50 <jmm@cumin2002> START - Cookbook sre.ganeti.makevm for new host hcaptcha-proxy5004.wikimedia.org [production]
08:49 <bwojtowicz@deploy1003> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'articletopic-outlink' for release 'main' . [production]
08:48 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host hcaptcha-proxy5003.wikimedia.org [production]
08:48 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host hcaptcha-proxy5003.wikimedia.org with OS bookworm [production]
08:43 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2170', diff saved to https://phabricator.wikimedia.org/P92013 and previous config saved to /var/cache/conftool/dbconfig/20260430-084321-fceratto.json [production]
08:33 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2170 (T419961)', diff saved to https://phabricator.wikimedia.org/P92012 and previous config saved to /var/cache/conftool/dbconfig/20260430-083313-fceratto.json [production]
08:29 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on hcaptcha-proxy5003.wikimedia.org with reason: host reimage [production]
08:27 <hashar@deploy1003> Finished deploy [gerrit/gerrit@83b886a]: wm-checks-api: add tag for PostgreSQL jobs (duration: 00m 14s) [production]
08:27 <hashar@deploy1003> Started deploy [gerrit/gerrit@83b886a]: wm-checks-api: add tag for PostgreSQL jobs [production]