5351-5400 of 10000 results (139ms)
2024-04-11 ยง
13:17 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 0:30:00 on db[2135,2160].codfw.wmnet with reason: reboot [production]
13:16 <arnaudb@cumin1002> END (PASS) - Cookbook sre.mysql.upgrade (exit_code=0) for db2134.codfw.wmnet [production]
13:13 <marostegui@cumin1002> dbctl commit (dc=all): 'db2177 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P60403 and previous config saved to /var/cache/conftool/dbconfig/20240411-131327-root.json [production]
13:13 <arnaudb@cumin1002> dbctl commit (dc=all): 'db2129 (re)pooling @ 4%: Post upgrade', diff saved to https://phabricator.wikimedia.org/P60402 and previous config saved to /var/cache/conftool/dbconfig/20240411-131301-arnaudb.json [production]
13:12 <btullis@cumin1002> START - Cookbook sre.hosts.reimage for host matomo1003.eqiad.wmnet with OS bookworm [production]
13:12 <arnaudb@cumin1002> START - Cookbook sre.mysql.upgrade for db2134.codfw.wmnet [production]
13:12 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on db[2134,2160].codfw.wmnet with reason: reboot [production]
13:11 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 0:30:00 on db[2134,2160].codfw.wmnet with reason: reboot [production]
13:00 <btullis@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host matomo1003.eqiad.wmnet with OS bookworm [production]
12:58 <arnaudb@cumin1002> END (PASS) - Cookbook sre.mysql.upgrade (exit_code=0) for db2132.codfw.wmnet [production]
12:58 <marostegui@cumin1002> dbctl commit (dc=all): 'db2177 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P60401 and previous config saved to /var/cache/conftool/dbconfig/20240411-125821-root.json [production]
12:57 <arnaudb@cumin1002> dbctl commit (dc=all): 'db2129 (re)pooling @ 2%: Post upgrade', diff saved to https://phabricator.wikimedia.org/P60400 and previous config saved to /var/cache/conftool/dbconfig/20240411-125755-arnaudb.json [production]
12:54 <akosiaris> lower weight of mw1437 back to 10 from the 30 I had upped it to yesterday. The backlog of videoscaling is apparently now served and CPU usage has reached "normal" levels [production]
12:54 <arnaudb@cumin1002> START - Cookbook sre.mysql.upgrade for db2132.codfw.wmnet [production]
12:54 <jayme@deploy1002> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
12:53 <akosiaris@cumin1002> conftool action : set/weight=10; selector: name=mw1437.*.wmnet,dc=eqiad [production]
12:53 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db[2132,2160].codfw.wmnet with reason: reboot [production]
12:53 <jayme@deploy1002> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
12:53 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 4:00:00 on db[2132,2160].codfw.wmnet with reason: reboot [production]
12:52 <jayme@deploy1002> helmfile [aux-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
12:52 <jayme@deploy1002> helmfile [aux-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
12:51 <jayme@deploy1002> helmfile [ml-serve-codfw] DONE helmfile.d/admin 'apply'. [production]
12:50 <jayme@deploy1002> helmfile [ml-serve-codfw] START helmfile.d/admin 'apply'. [production]
12:49 <jayme@deploy1002> helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'apply'. [production]
12:49 <jayme@deploy1002> helmfile [ml-serve-eqiad] START helmfile.d/admin 'apply'. [production]
12:45 <ayounsi@cumin1002> START - Cookbook sre.dns.netbox [production]
12:24 <btullis@deploy1002> helmfile [eqiad] DONE helmfile.d/services/editor-analytics: apply [production]
12:24 <btullis@deploy1002> helmfile [eqiad] START helmfile.d/services/editor-analytics: apply [production]
12:23 <btullis@deploy1002> helmfile [codfw] DONE helmfile.d/services/editor-analytics: apply [production]
12:22 <btullis@deploy1002> helmfile [codfw] START helmfile.d/services/editor-analytics: apply [production]
12:21 <ayounsi@cumin1002> START - Cookbook sre.hosts.reimage for host testvm2008.wikimedia.org with OS bookworm [production]
12:21 <ayounsi@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM testvm2008.wikimedia.org - ayounsi@cumin1002" [production]
12:20 <btullis@deploy1002> helmfile [staging] DONE helmfile.d/services/editor-analytics: apply [production]
12:20 <ayounsi@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM testvm2008.wikimedia.org - ayounsi@cumin1002" [production]
12:20 <ayounsi@cumin1002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) testvm2008.wikimedia.org on all recursors [production]
12:20 <btullis@deploy1002> helmfile [staging] START helmfile.d/services/editor-analytics: apply [production]
12:19 <ayounsi@cumin1002> START - Cookbook sre.dns.wipe-cache testvm2008.wikimedia.org on all recursors [production]
12:19 <ayounsi@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
12:19 <ayounsi@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM testvm2008.wikimedia.org - ayounsi@cumin1002" [production]
12:18 <ayounsi@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM testvm2008.wikimedia.org - ayounsi@cumin1002" [production]
12:16 <hnowlan@deploy1002> helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: apply [production]
12:16 <ayounsi@cumin1002> START - Cookbook sre.dns.netbox [production]
12:16 <ayounsi@cumin1002> START - Cookbook sre.ganeti.makevm for new host testvm2008.wikimedia.org [production]
12:16 <btullis@cumin1002> START - Cookbook sre.hosts.reimage for host matomo1003.eqiad.wmnet with OS bookworm [production]
12:16 <ayounsi@cumin1002> END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) for new host testvm2008.wikimedia.org [production]
12:16 <ayounsi@cumin1002> END (FAIL) - Cookbook sre.dns.netbox (exit_code=97) [production]
12:16 <ayounsi@cumin1002> END (ERROR) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=97) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM testvm2008.wikimedia.org - ayounsi@cumin1002" [production]
12:16 <ayounsi@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM testvm2008.wikimedia.org - ayounsi@cumin1002" [production]
12:15 <hnowlan@deploy1002> helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: apply [production]
12:15 <moritzm> installing gnutls28 security updates [production]