501-550 of 10000 results (102ms)
2023-11-16 ยง
12:29 <cmooney@cumin1001> END (PASS) - Cookbook sre.netbox.update-extras (exit_code=0) rolling restart_daemons on A:netbox-canary [production]
12:27 <jbond@cumin1001> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host db1124.eqiad.wmnet [production]
12:27 <cmooney@cumin1001> START - Cookbook sre.netbox.update-extras rolling restart_daemons on A:netbox-canary [production]
12:23 <jmm@cumin1001> START - Cookbook sre.puppet.migrate-host for host cumin2002.codfw.wmnet [production]
12:16 <jbond@cumin1001> START - Cookbook sre.puppet.migrate-host for host db1124.eqiad.wmnet [production]
12:07 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host ms-fe1014.eqiad.wmnet [production]
11:55 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-host for host ms-fe1014.eqiad.wmnet [production]
11:55 <taavi@cumin1001> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host clouddb1021.eqiad.wmnet [production]
11:51 <hnowlan@deploy2002> helmfile [eqiad] DONE helmfile.d/services/device-analytics: apply [production]
11:50 <hnowlan@deploy2002> helmfile [eqiad] START helmfile.d/services/device-analytics: apply [production]
11:50 <hnowlan@deploy2002> helmfile [codfw] DONE helmfile.d/services/device-analytics: apply [production]
11:49 <hnowlan@deploy2002> helmfile [codfw] START helmfile.d/services/device-analytics: apply [production]
11:49 <taavi@cumin1001> START - Cookbook sre.puppet.migrate-host for host clouddb1021.eqiad.wmnet [production]
11:45 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: insetup::serviceops [production]
11:45 <arnaudb@cumin1001> dbctl commit (dc=all): 'Depooling db1144:3314 (T348183)', diff saved to https://phabricator.wikimedia.org/P53514 and previous config saved to /var/cache/conftool/dbconfig/20231116-114511-arnaudb.json [production]
11:45 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1144.eqiad.wmnet with reason: Maintenance [production]
11:44 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1144.eqiad.wmnet with reason: Maintenance [production]
11:44 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1143 (T348183)', diff saved to https://phabricator.wikimedia.org/P53513 and previous config saved to /var/cache/conftool/dbconfig/20231116-114450-arnaudb.json [production]
11:34 <hnowlan@deploy2002> helmfile [staging] DONE helmfile.d/services/device-analytics: apply [production]
11:34 <ayounsi@cumin1001> END (PASS) - Cookbook sre.hosts.dhcp (exit_code=0) for host sretest1004.eqiad.wmnet [production]
11:34 <hnowlan@deploy2002> helmfile [staging] START helmfile.d/services/device-analytics: apply [production]
11:33 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-role for role: insetup::serviceops [production]
11:29 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1143', diff saved to https://phabricator.wikimedia.org/P53512 and previous config saved to /var/cache/conftool/dbconfig/20231116-112942-arnaudb.json [production]
09:40 <arnaudb@cumin1001> dbctl commit (dc=all): 'db1238 (re)pooling @ 15%: Post warmup repooling', diff saved to https://phabricator.wikimedia.org/P53502 and previous config saved to /var/cache/conftool/dbconfig/20231116-094005-arnaudb.json [production]
09:25 <arnaudb@cumin1001> dbctl commit (dc=all): 'db1238 (re)pooling @ 10%: Post warmup repooling', diff saved to https://phabricator.wikimedia.org/P53501 and previous config saved to /var/cache/conftool/dbconfig/20231116-092500-arnaudb.json [production]
09:09 <arnaudb@cumin1001> dbctl commit (dc=all): 'db1238 (re)pooling @ 5%: Post warmup repooling', diff saved to https://phabricator.wikimedia.org/P53500 and previous config saved to /var/cache/conftool/dbconfig/20231116-090955-arnaudb.json [production]
09:00 <godog> bounce prometheus instances on prometheus2006 to test p7 upgrade [production]
08:59 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-role for role: kubernetes::worker [production]
08:42 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: thanos::frontend [production]
08:37 <kharlan@deploy2002> helmfile [eqiad] DONE helmfile.d/services/ipoid: apply [production]
08:37 <kharlan@deploy2002> helmfile [eqiad] START helmfile.d/services/ipoid: apply [production]
08:34 <moritzm> installing ruby-rails-html-sanitizer security updates [production]
08:30 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-role for role: thanos::frontend [production]
08:25 <taavi@cumin1001> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host clouddumps1001.wikimedia.org [production]
08:22 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host prometheus2006.codfw.wmnet [production]
08:19 <taavi@cumin1001> START - Cookbook sre.puppet.migrate-host for host clouddumps1001.wikimedia.org [production]
08:18 <taavi@cumin1001> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host cloudcumin2001.codfw.wmnet [production]
08:17 <moritzm> installing elfutils security updates [production]
08:12 <taavi@cumin1001> START - Cookbook sre.puppet.migrate-host for host cloudcumin2001.codfw.wmnet [production]
08:09 <moritzm> installing python-git security updates [production]
08:07 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-host for host prometheus2006.codfw.wmnet [production]
08:03 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host ncredir4001.ulsfo.wmnet [production]
07:54 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-host for host ncredir4001.ulsfo.wmnet [production]
07:42 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: prometheus::pop [production]
07:30 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-role for role: prometheus::pop [production]
06:30 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db[2132,2160].codfw.wmnet,db[1119,1164,1217].eqiad.wmnet with reason: Switch [production]
06:30 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on db[2132,2160].codfw.wmnet,db[1119,1164,1217].eqiad.wmnet with reason: Switch [production]
06:07 <cmooney@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest2004.codfw.wmnet with OS bullseye [production]
05:48 <ryankemper@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudelastic1007.wikimedia.org with OS bullseye [production]
05:36 <arnaudb@cumin1001> dbctl commit (dc=all): 'Depooling db1143 (T348183)', diff saved to https://phabricator.wikimedia.org/P53499 and previous config saved to /var/cache/conftool/dbconfig/20231116-053616-arnaudb.json [production]