601-650 of 10000 results (96ms)
2024-07-17 ยง
16:34 <klausman@deploy1002> helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'apply'. [production]
16:34 <klausman@deploy1002> helmfile [ml-serve-eqiad] START helmfile.d/admin 'apply'. [production]
16:32 <klausman@deploy1002> helmfile [ml-serve-codfw] DONE helmfile.d/admin 'apply'. [production]
16:31 <klausman@deploy1002> helmfile [ml-serve-codfw] START helmfile.d/admin 'apply'. [production]
16:31 <klausman@deploy1002> helmfile [ml-staging-codfw] DONE helmfile.d/admin 'apply'. [production]
16:31 <klausman@deploy1002> helmfile [ml-staging-codfw] START helmfile.d/admin 'apply'. [production]
16:30 <klausman@deploy1002> helmfile [ml-staging-codfw] DONE helmfile.d/admin 'apply'. [production]
16:30 <otto@deploy1002> Finished deploy [analytics/refinery@8f00c85] (thin): THIN [analytics/refinery@8f00c859] (duration: 04m 08s) [production]
16:29 <klausman@deploy1002> helmfile [ml-staging-codfw] START helmfile.d/admin 'apply'. [production]
16:29 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1202', diff saved to https://phabricator.wikimedia.org/P66755 and previous config saved to /var/cache/conftool/dbconfig/20240717-162952-arnaudb.json [production]
16:26 <otto@deploy1002> Started deploy [analytics/refinery@8f00c85] (thin): THIN [analytics/refinery@8f00c859] [production]
16:21 <otto@deploy1002> Finished deploy [analytics/refinery@8f00c85]: [analytics/refinery@8f00c859] (duration: 07m 59s) [production]
16:14 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1202', diff saved to https://phabricator.wikimedia.org/P66754 and previous config saved to /var/cache/conftool/dbconfig/20240717-161445-arnaudb.json [production]
16:13 <otto@deploy1002> Started deploy [analytics/refinery@8f00c85]: [analytics/refinery@8f00c859] [production]
16:08 <inflatador> bking@kafka-main1005 `kafka topics --create --topic ${TOPIC} --partitions 1 --replication-factor 3; kafka configs --entity-type topics --entity-name ${TOPIC} --alter --add-config retention.ms=2592000000` T367510 [production]
15:59 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1202 (T367781)', diff saved to https://phabricator.wikimedia.org/P66752 and previous config saved to /var/cache/conftool/dbconfig/20240717-155937-arnaudb.json [production]
15:56 <arnaudb@cumin1002> dbctl commit (dc=all): 'Depooling db1202 (T367781)', diff saved to https://phabricator.wikimedia.org/P66751 and previous config saved to /var/cache/conftool/dbconfig/20240717-155628-arnaudb.json [production]
15:56 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1202.eqiad.wmnet with reason: Maintenance [production]
15:56 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 4:00:00 on db1202.eqiad.wmnet with reason: Maintenance [production]
15:56 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1194 (T367781)', diff saved to https://phabricator.wikimedia.org/P66750 and previous config saved to /var/cache/conftool/dbconfig/20240717-155606-arnaudb.json [production]
15:53 <otto@deploy1002> Finished deploy [analytics/refinery@8f00c85] (hadoop-test): - take 2 - TEST [analytics/refinery@8f00c859] (duration: 03m 33s) [production]
15:50 <otto@deploy1002> Started deploy [analytics/refinery@8f00c85] (hadoop-test): - take 2 - TEST [analytics/refinery@8f00c859] [production]
15:46 <otto@deploy1002> Finished deploy [analytics/refinery@0b53772] (hadoop-test): TEST [analytics/refinery@0b53772e] (duration: 03m 27s) [production]
15:42 <otto@deploy1002> Started deploy [analytics/refinery@0b53772] (hadoop-test): TEST [analytics/refinery@0b53772e] [production]
15:40 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1194', diff saved to https://phabricator.wikimedia.org/P66748 and previous config saved to /var/cache/conftool/dbconfig/20240717-154059-arnaudb.json [production]
15:38 <sukhe@cumin1002> END (PASS) - Cookbook sre.loadbalancer.restart-pybal (exit_code=0) rolling-restart of pybal on A:lvs-low-traffic-eqiad and A:lvs [production]
15:37 <sukhe@cumin1002> START - Cookbook sre.loadbalancer.restart-pybal rolling-restart of pybal on A:lvs-low-traffic-eqiad and A:lvs [production]
15:35 <sukhe@cumin1002> END (PASS) - Cookbook sre.loadbalancer.restart-pybal (exit_code=0) rolling-restart of pybal on A:lvs-secondary-eqiad and A:lvs [production]
15:35 <sukhe@cumin1002> START - Cookbook sre.loadbalancer.restart-pybal rolling-restart of pybal on A:lvs-secondary-eqiad and A:lvs [production]
15:33 <sukhe@cumin1002> END (PASS) - Cookbook sre.loadbalancer.restart-pybal (exit_code=0) rolling-restart of pybal on A:lvs-low-traffic-codfw and A:lvs [production]
15:32 <sukhe@cumin1002> START - Cookbook sre.loadbalancer.restart-pybal rolling-restart of pybal on A:lvs-low-traffic-codfw and A:lvs [production]
15:32 <topranks> Adjust anycast route policy at Chicago Network POP cr2-eqord to announce anycast ranges T367439 [production]
15:30 <sukhe> sudo cumin "A:lvs" "run-puppet-agent" to pick up apus change [production]
15:29 <sukhe@cumin1002> END (PASS) - Cookbook sre.loadbalancer.restart-pybal (exit_code=0) rolling-restart of pybal on A:lvs-secondary-codfw and A:lvs [production]
15:28 <sukhe@cumin1002> START - Cookbook sre.loadbalancer.restart-pybal rolling-restart of pybal on A:lvs-secondary-codfw and A:lvs [production]
15:25 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1194', diff saved to https://phabricator.wikimedia.org/P66747 and previous config saved to /var/cache/conftool/dbconfig/20240717-152552-arnaudb.json [production]
15:24 <jforrester@deploy1002> helmfile [eqiad] DONE helmfile.d/services/wikifunctions: apply [production]
15:23 <jforrester@deploy1002> helmfile [eqiad] START helmfile.d/services/wikifunctions: apply [production]
15:23 <jforrester@deploy1002> helmfile [codfw] DONE helmfile.d/services/wikifunctions: apply [production]
15:22 <jforrester@deploy1002> helmfile [codfw] START helmfile.d/services/wikifunctions: apply [production]
15:21 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host dbproxy2007.codfw.wmnet with OS bookworm [production]
15:21 <pt1979@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - pt1979@cumin2002" [production]
15:21 <jforrester@deploy1002> helmfile [staging] DONE helmfile.d/services/wikifunctions: apply [production]
15:21 <sukhe@cumin1002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) apus.discovery.wmnet on all recursors [production]
15:20 <sukhe@cumin1002> START - Cookbook sre.dns.wipe-cache apus.discovery.wmnet on all recursors [production]
15:20 <jforrester@deploy1002> helmfile [staging] START helmfile.d/services/wikifunctions: apply [production]
15:19 <jgiannelos@deploy1002> helmfile [eqiad] DONE helmfile.d/services/changeprop: apply [production]
15:18 <pt1979@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - pt1979@cumin2002" [production]
15:18 <sukhe> running authdns-update for CR 1054346 [production]
15:16 <jgiannelos@deploy1002> helmfile [eqiad] START helmfile.d/services/changeprop: apply [production]