2023-03-15
14:20 <jbond@cumin1001> START - Cookbook sre.dns.wipe-cache pki.discovery.wmnet on all recursors [production]
14:19 <jbond> update pki to use discovery record [production]
14:16 <jbond@cumin1001> conftool action : set/pooled=true; selector: name=codfw,dnsdisc=pki [production]
14:15 <daniel@deploy2002> daniel: Backport for [[gerrit:898795|Always write parsoid output to parser cache. (T320534)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet [production]
14:14 <sukhe@cumin2002> START - Cookbook sre.ganeti.reimage for host doh4002.wikimedia.org with OS bullseye [production]
14:14 <daniel@deploy2002> Started scap: Backport for [[gerrit:898795|Always write parsoid output to parser cache. (T320534)]] [production]
14:12 <sukhe> [correction] depool _doh4002_ for reimaging to bullseye: T321309 [production]
14:12 <sukhe> depool dns4002 for reimaging to bullseye: T321309 [production]
14:00 <moritzm> nodejs security updates on buster [production]
13:51 <herron@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kafka-logging1003.eqiad.wmnet with OS bullseye [production]
13:50 <sukhe> reprepro -C component/pdns-recursor include bullseye-wikimedia pdns-recursor_4.6.2-1+wmf11u1_amd64.changes: T321309 [production]
13:49 <moritzm> installing graphite-web security updates [production]
13:32 <jayme@deploy2002> helmfile [eqiad] DONE helmfile.d/admin 'apply'. [production]
13:32 <herron@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kafka-logging1003.eqiad.wmnet with reason: host reimage [production]
13:30 <jayme@deploy2002> helmfile [eqiad] START helmfile.d/admin 'apply'. [production]
13:30 <jayme@deploy2002> helmfile [codfw] DONE helmfile.d/admin 'apply'. [production]
13:28 <jayme@deploy2002> helmfile [codfw] START helmfile.d/admin 'apply'. [production]
13:28 <jayme@deploy2002> helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'apply'. [production]
13:28 <hnowlan@deploy2002> helmfile [eqiad] DONE helmfile.d/services/thumbor: apply [production]
13:27 <jayme@deploy2002> helmfile [ml-serve-eqiad] START helmfile.d/admin 'apply'. [production]
13:27 <jayme@deploy2002> helmfile [ml-serve-codfw] DONE helmfile.d/admin 'apply'. [production]
13:27 <herron@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on kafka-logging1003.eqiad.wmnet with reason: host reimage [production]
13:26 <jayme@deploy2002> helmfile [ml-serve-codfw] START helmfile.d/admin 'apply'. [production]
13:25 <jayme@deploy2002> helmfile [ml-staging-codfw] DONE helmfile.d/admin 'apply'. [production]
13:25 <jayme@deploy2002> helmfile [ml-staging-codfw] START helmfile.d/admin 'apply'. [production]
13:25 <jayme@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
13:25 <jayme@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
13:25 <jayme@deploy2002> helmfile [aux-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
13:24 <jayme@deploy2002> helmfile [aux-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
13:22 <jayme@deploy2002> helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'. [production]
13:22 <jayme@deploy2002> helmfile [staging-eqiad] START helmfile.d/admin 'apply'. [production]
13:21 <jayme@deploy2002> helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. [production]
13:20 <jayme@deploy2002> helmfile [staging-codfw] START helmfile.d/admin 'apply'. [production]
13:18 <hnowlan@deploy2002> helmfile [eqiad] START helmfile.d/services/thumbor: apply [production]
13:17 <taavi@deploy2002> Finished scap: Backport for [[gerrit:898843|Enable new Vector (2022) "Add topic" button at cswiki, huwiki (T331313)]], [[gerrit:898844|Enable DiscussionTools usability improvements at cswiki, huwiki (T329407)]], [[gerrit:897912|Disable visual enhancements on newsectionlink pages initially (T331635)]] (duration: 09m 01s) [production]
13:12 <herron@cumin1001> START - Cookbook sre.hosts.reimage for host kafka-logging1003.eqiad.wmnet with OS bullseye [production]
13:10 <taavi@deploy2002> matmarex and taavi and esanders: Backport for [[gerrit:898843|Enable new Vector (2022) "Add topic" button at cswiki, huwiki (T331313)]], [[gerrit:898844|Enable DiscussionTools usability improvements at cswiki, huwiki (T329407)]], [[gerrit:897912|Disable visual enhancements on newsectionlink pages initially (T331635)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet, mwdebu [production]
13:08 <taavi@deploy2002> Started scap: Backport for [[gerrit:898843|Enable new Vector (2022) "Add topic" button at cswiki, huwiki (T331313)]], [[gerrit:898844|Enable DiscussionTools usability improvements at cswiki, huwiki (T329407)]], [[gerrit:897912|Disable visual enhancements on newsectionlink pages initially (T331635)]] [production]
13:08 <hnowlan@deploy2002> helmfile [staging] DONE helmfile.d/services/thumbor: apply [production]
13:07 <hnowlan@deploy2002> helmfile [staging] START helmfile.d/services/thumbor: apply [production]
12:27 <gmodena@deploy1002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mediawiki-page-content-change-enrichment: apply [production]
12:27 <gmodena@deploy1002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mediawiki-page-content-change-enrichment: apply [production]
12:24 <jmm@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest1002.eqiad.wmnet with OS bookworm [production]
12:18 <marostegui> Failover m5 from db1176 to db1106 - T331877 [production]
12:17 <gmodena@deploy1002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mediawiki-page-content-change-enrichment: apply [production]
12:17 <gmodena@deploy1002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mediawiki-page-content-change-enrichment: apply [production]
12:12 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 6 hosts with reason: m5 master switch T331877 [production]
12:11 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on 6 hosts with reason: m5 master switch T331877 [production]
12:08 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host sretest1002.eqiad.wmnet with OS bookworm [production]
11:36 <derick@deploy2002> helmfile [eqiad] DONE helmfile.d/services/proton: apply [production]