2024-04-23
13:15 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on db2147.codfw.wmnet with reason: host reimage [production]
13:11 <arnaudb@cumin1002> dbctl commit (dc=all): 'db2155 (re)pooling @ 75%: Sanitarium master', diff saved to https://phabricator.wikimedia.org/P61102 and previous config saved to /var/cache/conftool/dbconfig/20240423-131128-arnaudb.json [production]
12:58 <arnaudb@cumin1002> START - Cookbook sre.hosts.reimage for host db2147.codfw.wmnet with OS bookworm [production]
12:57 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2147.codfw.wmnet with reason: T362746 [production]
12:57 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on db2147.codfw.wmnet with reason: T362746 [production]
12:57 <arnaudb@cumin1002> dbctl commit (dc=all): 'Depool db2147', diff saved to https://phabricator.wikimedia.org/P61101 and previous config saved to /var/cache/conftool/dbconfig/20240423-125703-arnaudb.json [production]
12:56 <arnaudb@cumin1002> dbctl commit (dc=all): 'db2155 (re)pooling @ 50%: Sanitarium master', diff saved to https://phabricator.wikimedia.org/P61100 and previous config saved to /var/cache/conftool/dbconfig/20240423-125622-arnaudb.json [production]
12:55 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on db2155.codfw.wmnet with reason: Reimage db2155 [production]
12:55 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 3:00:00 on db2155.codfw.wmnet with reason: Reimage db2155 [production]
12:54 <arnaudb@cumin1002> dbctl commit (dc=all): 'db2155 depool', diff saved to https://phabricator.wikimedia.org/P61099 and previous config saved to /var/cache/conftool/dbconfig/20240423-125430-arnaudb.json [production]
12:45 <hashar@deploy1002> Finished deploy [gerrit/gerrit@ff51759]: Remove registerStyleModule() for Gerrit 3.8 - T354886 (duration: 00m 07s) [production]
12:17 <taavi@deploy1002> taavi: Continuing with sync [production]
12:17 <taavi@deploy1002> taavi: Backport for [[gerrit:1023046|Add cawiki 750k logo (T363057)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
12:13 <taavi@deploy1002> Started scap: Backport for [[gerrit:1023046|Add cawiki 750k logo (T363057)]] [production]
11:47 <cgoubert@cumin1002> conftool action : set/weight=10:pooled=yes; selector: name=(mw1414.eqiad.wmnet|mw1415.eqiad.wmnet|mw1416.eqiad.wmnet|mw1448.eqiad.wmnet|mw1449.eqiad.wmnet),cluster=kubernetes,service=kubesvc [production]
11:47 <claime> Pooling and uncordoning mw1414.eqiad.wmnet,mw1415.eqiad.wmnet,mw1416.eqiad.wmnet,mw1448.eqiad.wmnet,mw1449.eqiad.wmnet - T351074 [production]
11:39 <claime> Running homer 'cr*eqiad*' commit 'T351074' [production]
11:39 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on db1246.eqiad.wmnet with reason: Host has hardware issues [production]
11:38 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 7 days, 0:00:00 on db1246.eqiad.wmnet with reason: Host has hardware issues [production]
11:38 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mw1415.eqiad.wmnet with OS bullseye [production]
11:35 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mw1448.eqiad.wmnet with OS bullseye [production]
11:33 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mw1416.eqiad.wmnet with OS bullseye [production]
11:30 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mw1449.eqiad.wmnet with OS bullseye [production]
11:28 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mw1414.eqiad.wmnet with OS bullseye [production]
11:21 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1415.eqiad.wmnet with reason: host reimage [production]
11:17 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1448.eqiad.wmnet with reason: host reimage [production]
11:15 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1416.eqiad.wmnet with reason: host reimage [production]
11:12 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1449.eqiad.wmnet with reason: host reimage [production]
11:10 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1414.eqiad.wmnet with reason: host reimage [production]
11:08 <cgoubert@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on mw1449.eqiad.wmnet with reason: host reimage [production]
11:07 <cgoubert@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on mw1448.eqiad.wmnet with reason: host reimage [production]
11:07 <cgoubert@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on mw1416.eqiad.wmnet with reason: host reimage [production]
11:06 <cgoubert@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on mw1415.eqiad.wmnet with reason: host reimage [production]
11:06 <cgoubert@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on mw1414.eqiad.wmnet with reason: host reimage [production]
10:58 <arnaudb@cumin1002> dbctl commit (dc=all): 'db2172 (re)pooling @ 100%: post upgrade repool', diff saved to https://phabricator.wikimedia.org/P61098 and previous config saved to /var/cache/conftool/dbconfig/20240423-105812-arnaudb.json [production]
10:55 <cgoubert@cumin1002> START - Cookbook sre.hosts.reimage for host mw1449.eqiad.wmnet with OS bullseye [production]
10:54 <cgoubert@cumin1002> START - Cookbook sre.hosts.reimage for host mw1448.eqiad.wmnet with OS bullseye [production]
10:54 <cgoubert@cumin1002> START - Cookbook sre.hosts.reimage for host mw1416.eqiad.wmnet with OS bullseye [production]
10:53 <cgoubert@cumin1002> START - Cookbook sre.hosts.reimage for host mw1415.eqiad.wmnet with OS bullseye [production]
10:53 <cgoubert@cumin1002> START - Cookbook sre.hosts.reimage for host mw1414.eqiad.wmnet with OS bullseye [production]
10:45 <claime> Depooling mw1414.eqiad.wmnet,mw1415.eqiad.wmnet,mw1416.eqiad.wmnet,mw1448.eqiad.wmnet,mw1449.eqiad.wmnet for reimage to kubernetes - T351074 [production]
10:43 <jayme> kubectl cordon parse1002.eqiad.wmnet - T363086 [production]
10:43 <arnaudb@cumin1002> dbctl commit (dc=all): 'db2172 (re)pooling @ 75%: post upgrade repool', diff saved to https://phabricator.wikimedia.org/P61097 and previous config saved to /var/cache/conftool/dbconfig/20240423-104306-arnaudb.json [production]
10:34 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host db2196.codfw.wmnet [production]
10:28 <arnaudb@cumin1002> dbctl commit (dc=all): 'db2172 (re)pooling @ 50%: post upgrade repool', diff saved to https://phabricator.wikimedia.org/P61094 and previous config saved to /var/cache/conftool/dbconfig/20240423-102801-arnaudb.json [production]
10:26 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1245.eqiad.wmnet with reason: T360116 [production]
10:25 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on db1245.eqiad.wmnet with reason: T360116 [production]
10:22 <btullis@deploy1002> Finished deploy [analytics/hdfs-tools/deploy@3618aab]: (no justification provided) (duration: 00m 11s) [production]
10:22 <btullis@deploy1002> Started deploy [analytics/hdfs-tools/deploy@3618aab]: (no justification provided) [production]
10:17 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-host for host db2196.codfw.wmnet [production]