2024-10-31
10:13 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1179', diff saved to https://phabricator.wikimedia.org/P70738 and previous config saved to /var/cache/conftool/dbconfig/20241031-101328-ladsgroup.json [production]
10:06 <elukey@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host aux-k8s-ctrl1003.eqiad.wmnet with OS bookworm [production]
10:04 <arnaudb@cumin1002> START - Cookbook sre.mysql.clone of db1232.eqiad.wmnet onto db1234.eqiad.wmnet [production]
10:03 <arnaudb@cumin1002> dbctl commit (dc=all): 'Cloning db1232 in db1234 for T378267', diff saved to https://phabricator.wikimedia.org/P70737 and previous config saved to /var/cache/conftool/dbconfig/20241031-100301-arnaudb.json [production]
09:58 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1179', diff saved to https://phabricator.wikimedia.org/P70736 and previous config saved to /var/cache/conftool/dbconfig/20241031-095821-ladsgroup.json [production]
09:49 <elukey@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on aux-k8s-ctrl1003.eqiad.wmnet with reason: host reimage [production]
09:47 <elukey@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on aux-k8s-ctrl1003.eqiad.wmnet with reason: host reimage [production]
09:43 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1179 (T376905)', diff saved to https://phabricator.wikimedia.org/P70735 and previous config saved to /var/cache/conftool/dbconfig/20241031-094314-ladsgroup.json [production]
09:35 <elukey@cumin1002> START - Cookbook sre.hosts.reimage for host aux-k8s-ctrl1003.eqiad.wmnet with OS bookworm [production]
09:35 <elukey@cumin1002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host aux-k8s-ctrl1003.eqiad.wmnet [production]
09:35 <elukey@cumin1002> START - Cookbook sre.k8s.pool-depool-node pool for host aux-k8s-ctrl1003.eqiad.wmnet [production]
09:34 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling db1179 (T376905)', diff saved to https://phabricator.wikimedia.org/P70734 and previous config saved to /var/cache/conftool/dbconfig/20241031-093446-ladsgroup.json [production]
09:34 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1179.eqiad.wmnet with reason: Maintenance [production]
09:34 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1179.eqiad.wmnet with reason: Maintenance [production]
09:34 <elukey@puppetserver1001> conftool action : set/pooled=yes; selector: name=aux-k8s-worker1003.eqiad.wmnet [production]
09:32 <elukey@puppetserver1001> conftool action : set/weight=10; selector: name=aux-k8s-ctrl1003.eqiad.wmnet [production]
09:07 <fabfur> importing haproxykafka 0.3 package into apt repository (T377613) [production]
08:23 <stevemunene@cumin1002> START - Cookbook sre.hosts.reimage for host an-presto1016.eqiad.wmnet with OS bullseye [production]
08:23 <stevemunene@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-presto1016.eqiad.wmnet with OS bullseye [production]
08:21 <stevemunene@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-presto1019.eqiad.wmnet with OS bullseye [production]
08:13 <ayounsi@cumin1002> END (FAIL) - Cookbook sre.network.peering (exit_code=99) with action 'configure' for AS: 56258 [production]
08:12 <ayounsi@cumin1002> START - Cookbook sre.network.peering with action 'configure' for AS: 56258 [production]
08:01 <stevemunene@cumin1002> START - Cookbook sre.hosts.reimage for host an-presto1020.eqiad.wmnet with OS bullseye [production]
04:54 <eileen> civicrm upgraded from 0eb881ca to 31f5cbdb [production]
01:45 <krinkle@deploy2002> Finished deploy [integration/docroot@0b03488]: (no justification provided) (duration: 00m 10s) [production]
01:45 <krinkle@deploy2002> Started deploy [integration/docroot@0b03488]: (no justification provided) [production]
01:42 <Krinkle> krinkle@mwmaint2001$ Purge https://doc.wikimedia.org/lib/wmui-page.css via `mwscript extensions/WikimediaMaintenance/purgeUrls.php`, T257188 T378542 [production]
01:38 <krinkle@deploy2002> Finished deploy [integration/docroot@a2c044c]: T378542 (duration: 00m 23s) [production]
01:38 <krinkle@deploy2002> Started deploy [integration/docroot@a2c044c]: T378542 [production]
00:30 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2215 (T376905)', diff saved to https://phabricator.wikimedia.org/P70733 and previous config saved to /var/cache/conftool/dbconfig/20241031-003014-ladsgroup.json [production]
00:15 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2215', diff saved to https://phabricator.wikimedia.org/P70732 and previous config saved to /var/cache/conftool/dbconfig/20241031-001507-ladsgroup.json [production]
00:00 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2215', diff saved to https://phabricator.wikimedia.org/P70731 and previous config saved to /var/cache/conftool/dbconfig/20241031-000000-ladsgroup.json [production]
2024-10-30
23:53 <pt1979@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host ms-be2081.codfw.wmnet with OS bullseye [production]
23:44 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2215 (T376905)', diff saved to https://phabricator.wikimedia.org/P70730 and previous config saved to /var/cache/conftool/dbconfig/20241030-234453-ladsgroup.json [production]
23:44 <pt1979@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host ms-be2083.codfw.wmnet with OS bullseye [production]
22:55 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling db2215 (T376905)', diff saved to https://phabricator.wikimedia.org/P70729 and previous config saved to /var/cache/conftool/dbconfig/20241030-225520-ladsgroup.json [production]
22:55 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2215.codfw.wmnet with reason: Maintenance [production]
22:54 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2215.codfw.wmnet with reason: Maintenance [production]
22:54 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2191 (T376905)', diff saved to https://phabricator.wikimedia.org/P70728 and previous config saved to /var/cache/conftool/dbconfig/20241030-225449-ladsgroup.json [production]
22:39 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2191', diff saved to https://phabricator.wikimedia.org/P70727 and previous config saved to /var/cache/conftool/dbconfig/20241030-223942-ladsgroup.json [production]
22:39 <pt1979@cumin2002> START - Cookbook sre.hosts.reimage for host ms-be2081.codfw.wmnet with OS bullseye [production]
22:29 <pt1979@cumin2002> START - Cookbook sre.hosts.reimage for host ms-be2083.codfw.wmnet with OS bullseye [production]
22:24 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2191', diff saved to https://phabricator.wikimedia.org/P70726 and previous config saved to /var/cache/conftool/dbconfig/20241030-222435-ladsgroup.json [production]
22:09 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2191 (T376905)', diff saved to https://phabricator.wikimedia.org/P70725 and previous config saved to /var/cache/conftool/dbconfig/20241030-220928-ladsgroup.json [production]
22:03 <brett> Running ./redis-check-aof --fix on rdb1014 tcp_6379 instance - T376961 [production]
21:25 <dreamyjazz@deploy2002> Finished scap sync-world: Backport for [[gerrit:1084834|Fix bug in BlockManager::getUniqueBlocks (T378563)]] (duration: 07m 22s) [production]
21:21 <dreamyjazz@deploy2002> dreamyjazz: Continuing with sync [production]
21:21 <dreamyjazz@deploy2002> dreamyjazz: Backport for [[gerrit:1084834|Fix bug in BlockManager::getUniqueBlocks (T378563)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
21:18 <dreamyjazz@deploy2002> Started scap sync-world: Backport for [[gerrit:1084834|Fix bug in BlockManager::getUniqueBlocks (T378563)]] [production]
21:17 <tgr@deploy2002> Finished scap sync-world: Backport for [[gerrit:1081104|GrowthExperiments: enable community updates module in pilot wikis (T374664)]] (duration: 10m 10s) [production]