2025-01-30
|
15:23 |
<hashar@deploy2002> |
matmarex, hashar: Continuing with sync |
[production] |
15:18 |
<hashar@deploy2002> |
matmarex, hashar: Backport for [[gerrit:1115103|Define new 'auth' docroot with custom files for the auth domain (T383952 T384137)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
15:15 |
<hashar@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1115103|Define new 'auth' docroot with custom files for the auth domain (T383952 T384137)]] |
[production] |
15:11 |
<jmm@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host maps-test2001.codfw.wmnet with OS bookworm |
[production] |
15:09 |
<ladsgroup@dns1004> |
END - running authdns-update |
[production] |
15:09 |
<hashar@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1113476|Use full URLs for wgUploadNavigationUrl (T383916)]] (duration: 11m 02s) |
[production] |
15:07 |
<ladsgroup@dns1004> |
START - running authdns-update |
[production] |
15:04 |
<jmm@cumin2002> |
START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ganeti2029.codfw.wmnet |
[production] |
15:03 |
<hashar@deploy2002> |
hashar, matmarex: Continuing with sync |
[production] |
15:03 |
<hashar@deploy2002> |
hashar, matmarex: Backport for [[gerrit:1113476|Use full URLs for wgUploadNavigationUrl (T383916)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
15:02 |
<jayme> |
enabled puppet on all kubernetes hosts |
[production] |
15:01 |
<jmm@cumin2002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on ganeti2029.codfw.wmnet with reason: remove from cluster for reimage |
[production] |
14:58 |
<hashar@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1113476|Use full URLs for wgUploadNavigationUrl (T383916)]] |
[production] |
14:56 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2172 (re)pooling @ 100%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P72882 and previous config saved to /var/cache/conftool/dbconfig/20250130-145620-root.json |
[production] |
14:55 |
<elukey@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/kartotherian: sync |
[production] |
14:54 |
<elukey@deploy2002> |
helmfile [eqiad] START helmfile.d/services/kartotherian: sync |
[production] |
14:51 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db2205 (T384592)', diff saved to https://phabricator.wikimedia.org/P72881 and previous config saved to /var/cache/conftool/dbconfig/20250130-145136-marostegui.json |
[production] |
14:51 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2205.codfw.wmnet with reason: Maintenance |
[production] |
14:51 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2194 (T384592)', diff saved to https://phabricator.wikimedia.org/P72880 and previous config saved to /var/cache/conftool/dbconfig/20250130-145114-marostegui.json |
[production] |
14:51 |
<hashar@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1115383|SuggestedEditSession: remove incorrect cast to integer (T385117)]], [[gerrit:1115384|SuggestedEditSession: remove incorrect cast to integer (T385117)]] (duration: 13m 41s) |
[production] |
14:44 |
<hashar@deploy2002> |
hashar, sgimeno: Continuing with sync |
[production] |
14:44 |
<hashar@deploy2002> |
hashar, sgimeno: Backport for [[gerrit:1115383|SuggestedEditSession: remove incorrect cast to integer (T385117)]], [[gerrit:1115384|SuggestedEditSession: remove incorrect cast to integer (T385117)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
14:43 |
<aokoth@deploy2002> |
helmfile [staging] DONE helmfile.d/services/miscweb: apply |
[production] |
14:43 |
<aokoth@deploy2002> |
helmfile [staging] START helmfile.d/services/miscweb: apply |
[production] |
14:41 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2172 (re)pooling @ 75%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P72877 and previous config saved to /var/cache/conftool/dbconfig/20250130-144115-root.json |
[production] |
14:37 |
<hashar@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1115383|SuggestedEditSession: remove incorrect cast to integer (T385117)]], [[gerrit:1115384|SuggestedEditSession: remove incorrect cast to integer (T385117)]] |
[production] |
14:36 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2194', diff saved to https://phabricator.wikimedia.org/P72876 and previous config saved to /var/cache/conftool/dbconfig/20250130-143607-marostegui.json |
[production] |
14:32 |
<aokoth@deploy2002> |
helmfile [staging] DONE helmfile.d/services/miscweb: apply |
[production] |
14:26 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2172 (re)pooling @ 50%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P72875 and previous config saved to /var/cache/conftool/dbconfig/20250130-142609-root.json |
[production] |
14:22 |
<aokoth@deploy2002> |
helmfile [staging] START helmfile.d/services/miscweb: apply |
[production] |
14:21 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2194', diff saved to https://phabricator.wikimedia.org/P72873 and previous config saved to /var/cache/conftool/dbconfig/20250130-142100-marostegui.json |
[production] |
14:20 |
<aokoth@deploy2002> |
helmfile [staging] DONE helmfile.d/services/miscweb: apply |
[production] |
14:19 |
<jayme> |
stopped puppet on all kubernetes hosts |
[production] |
14:18 |
<hashar@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1115062|migrateConfigToCommunity: Handle false BabelMainCategory (T384941)]], [[gerrit:1115059|migrateConfigToCommunity: Handle false BabelMainCategory (T384941)]], [[gerrit:1115336|migrateConfigToCommunity: Include an edit summary (T385024)]], [[gerrit:1115337|migrateConfigToCommunity: Include an edit summary (T385024)]] (duration: 11m 19s) |
[production] |
14:11 |
<hashar@deploy2002> |
urbanecm, hashar: Continuing with sync |
[production] |
14:11 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2172 (re)pooling @ 25%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P72872 and previous config saved to /var/cache/conftool/dbconfig/20250130-141104-root.json |
[production] |
14:10 |
<aokoth@deploy2002> |
helmfile [staging] START helmfile.d/services/miscweb: apply |
[production] |
14:10 |
<hashar@deploy2002> |
urbanecm, hashar: Backport for [[gerrit:1115062|migrateConfigToCommunity: Handle false BabelMainCategory (T384941)]], [[gerrit:1115059|migrateConfigToCommunity: Handle false BabelMainCategory (T384941)]], [[gerrit:1115336|migrateConfigToCommunity: Include an edit summary (T385024)]], [[gerrit:1115337|migrateConfigToCommunity: Include an edit summary (T385024)]] synced to the testservers (https://wik |
[production] |
14:06 |
<hashar@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1115062|migrateConfigToCommunity: Handle false BabelMainCategory (T384941)]], [[gerrit:1115059|migrateConfigToCommunity: Handle false BabelMainCategory (T384941)]], [[gerrit:1115336|migrateConfigToCommunity: Include an edit summary (T385024)]], [[gerrit:1115337|migrateConfigToCommunity: Include an edit summary (T385024)]] |
[production] |
14:05 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2194 (T384592)', diff saved to https://phabricator.wikimedia.org/P72871 and previous config saved to /var/cache/conftool/dbconfig/20250130-140553-marostegui.json |
[production] |
14:04 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2029.codfw.wmnet |
[production] |
13:55 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2172 (re)pooling @ 10%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P72870 and previous config saved to /var/cache/conftool/dbconfig/20250130-135559-root.json |
[production] |
13:50 |
<jayme@cumin1002> |
START - Cookbook sre.k8s.wipe-cluster Wipe the K8s cluster staging-codfw: Kubernetes upgrade |
[production] |
13:50 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2224 (re)pooling @ 100%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P72869 and previous config saved to /var/cache/conftool/dbconfig/20250130-135024-root.json |
[production] |
13:45 |
<elukey@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/kartotherian: sync |
[production] |
13:44 |
<elukey@deploy2002> |
helmfile [eqiad] START helmfile.d/services/kartotherian: sync |
[production] |
13:36 |
<hashar@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1115344|Fix response error handling in FlickrBlacklist (T385143)]] (duration: 11m 54s) |
[production] |
13:35 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2224 (re)pooling @ 75%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P72868 and previous config saved to /var/cache/conftool/dbconfig/20250130-133519-root.json |
[production] |
13:30 |
<hashar@deploy2002> |
hashar: Continuing with sync |
[production] |
13:27 |
<hashar@deploy2002> |
hashar: Backport for [[gerrit:1115344|Fix response error handling in FlickrBlacklist (T385143)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |