201-250 of 10000 results (95ms)
2024-10-10 ยง
13:17 <sukhe@cumin1002> START - Cookbook sre.dns.roll-reboot rolling reboot on A:dnsbox and A:ulsfo and A:dnsbox [production]
13:15 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1247', diff saved to https://phabricator.wikimedia.org/P69610 and previous config saved to /var/cache/conftool/dbconfig/20241010-131542-arnaudb.json [production]
13:12 <dreamyjazz@deploy2002> dreamyjazz, kharlan: Continuing with sync [production]
13:11 <jayme@cumin1002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host kubestage1004.eqiad.wmnet [production]
13:11 <jayme@cumin1002> START - Cookbook sre.k8s.pool-depool-node pool for host kubestage1004.eqiad.wmnet [production]
13:10 <dreamyjazz@deploy2002> dreamyjazz, kharlan: Backport for [[gerrit:1079217|QuickSurvey.vue: Support using HTML in thank you message (T376517)]], [[gerrit:1079278|extension.json: Add mediawiki.jqueryMsg to dependencies for ext.quicksurveys.lib (T376517)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
13:10 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2034.codfw.wmnet [production]
13:10 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2034.codfw.wmnet [production]
13:08 <dreamyjazz@deploy2002> Started scap sync-world: Backport for [[gerrit:1079217|QuickSurvey.vue: Support using HTML in thank you message (T376517)]], [[gerrit:1079278|extension.json: Add mediawiki.jqueryMsg to dependencies for ext.quicksurveys.lib (T376517)]] [production]
13:02 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti2034.codfw.wmnet [production]
13:01 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2034.codfw.wmnet [production]
13:01 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host bast4005.wikimedia.org [production]
13:01 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netflow1002.eqiad.wmnet [production]
13:00 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2034.codfw.wmnet [production]
13:00 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1247', diff saved to https://phabricator.wikimedia.org/P69609 and previous config saved to /var/cache/conftool/dbconfig/20241010-130035-arnaudb.json [production]
12:57 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host netflow1002.eqiad.wmnet [production]
12:56 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netflow2003.codfw.wmnet [production]
12:55 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2034.codfw.wmnet [production]
12:55 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host bast4005.wikimedia.org [production]
12:52 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host netflow2003.codfw.wmnet [production]
12:51 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netflow3003.esams.wmnet [production]
12:45 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host netflow3003.esams.wmnet [production]
12:45 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1247 (T367781)', diff saved to https://phabricator.wikimedia.org/P69608 and previous config saved to /var/cache/conftool/dbconfig/20241010-124528-arnaudb.json [production]
12:43 <arnaudb@cumin1002> dbctl commit (dc=all): 'Depooling db1247 (T367781)', diff saved to https://phabricator.wikimedia.org/P69607 and previous config saved to /var/cache/conftool/dbconfig/20241010-124319-arnaudb.json [production]
12:43 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1247.eqiad.wmnet with reason: Maintenance [production]
12:43 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 4:00:00 on db1247.eqiad.wmnet with reason: Maintenance [production]
12:42 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1245.eqiad.wmnet with reason: Maintenance [production]
12:42 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 4:00:00 on db1245.eqiad.wmnet with reason: Maintenance [production]
12:42 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1244 (T367781)', diff saved to https://phabricator.wikimedia.org/P69606 and previous config saved to /var/cache/conftool/dbconfig/20241010-124241-arnaudb.json [production]
12:38 <jayme@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kubestage1004.eqiad.wmnet with OS bookworm [production]
12:35 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netflow4002.ulsfo.wmnet [production]
12:29 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host netflow4002.ulsfo.wmnet [production]
12:27 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1244', diff saved to https://phabricator.wikimedia.org/P69605 and previous config saved to /var/cache/conftool/dbconfig/20241010-122734-arnaudb.json [production]
12:26 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netflow5002.eqsin.wmnet [production]
12:21 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host netflow5002.eqsin.wmnet [production]
12:19 <jayme@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubestage1004.eqiad.wmnet with reason: host reimage [production]
12:16 <jayme@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on kubestage1004.eqiad.wmnet with reason: host reimage [production]
12:12 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1244', diff saved to https://phabricator.wikimedia.org/P69604 and previous config saved to /var/cache/conftool/dbconfig/20241010-121227-arnaudb.json [production]
12:00 <jayme@cumin1002> START - Cookbook sre.hosts.reimage for host kubestage1004.eqiad.wmnet with OS bookworm [production]
11:57 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1244 (T367781)', diff saved to https://phabricator.wikimedia.org/P69603 and previous config saved to /var/cache/conftool/dbconfig/20241010-115720-arnaudb.json [production]
11:40 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1243', diff saved to https://phabricator.wikimedia.org/P69599 and previous config saved to /var/cache/conftool/dbconfig/20241010-114042-arnaudb.json [production]
11:34 <zabe@deploy2002> Finished scap sync-world: Backport for [[gerrit:1079233|s2: Reduce revision-slots cache expiry to 60 seconds (T183490)]] (duration: 06m 58s) [production]
11:29 <zabe@deploy2002> zabe: Continuing with sync [production]
11:29 <zabe@deploy2002> zabe: Backport for [[gerrit:1079233|s2: Reduce revision-slots cache expiry to 60 seconds (T183490)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
11:27 <zabe@deploy2002> Started scap sync-world: Backport for [[gerrit:1079233|s2: Reduce revision-slots cache expiry to 60 seconds (T183490)]] [production]
11:26 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netflow6001.drmrs.wmnet [production]
11:25 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1243', diff saved to https://phabricator.wikimedia.org/P69598 and previous config saved to /var/cache/conftool/dbconfig/20241010-112535-arnaudb.json [production]
11:22 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host netflow6001.drmrs.wmnet [production]
11:20 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netflow7001.magru.wmnet [production]
11:16 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host netflow7001.magru.wmnet [production]