2025-01-30
|
17:58 |
<ladsgroup@deploy2002> |
ladsgroup: Backport for [[gerrit:1115466|file: Remove from filerevision when only one row exists (T384481)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
17:55 |
<ladsgroup@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1115466|file: Remove from filerevision when only one row exists (T384481)]] |
[production] |
17:20 |
<jayme> |
staging-codfw k8s cluster is currently being updated to k8s 1.31 and in an unusable state - T384450 |
[production] |
17:19 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db2227 (T384592)', diff saved to https://phabricator.wikimedia.org/P72890 and previous config saved to /var/cache/conftool/dbconfig/20250130-171903-marostegui.json |
[production] |
17:18 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2227.codfw.wmnet with reason: Maintenance |
[production] |
17:18 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2205 (T384592)', diff saved to https://phabricator.wikimedia.org/P72889 and previous config saved to /var/cache/conftool/dbconfig/20250130-171841-marostegui.json |
[production] |
17:06 |
<cdanis@deploy2002> |
helmfile [codfw] DONE helmfile.d/admin 'apply'. |
[production] |
17:06 |
<cdanis@deploy2002> |
helmfile [codfw] START helmfile.d/admin 'apply'. |
[production] |
17:04 |
<cdanis@deploy2002> |
helmfile [eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
17:03 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2205', diff saved to https://phabricator.wikimedia.org/P72888 and previous config saved to /var/cache/conftool/dbconfig/20250130-170334-marostegui.json |
[production] |
17:02 |
<cdanis@deploy2002> |
helmfile [eqiad] START helmfile.d/admin 'apply'. |
[production] |
16:48 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2205', diff saved to https://phabricator.wikimedia.org/P72887 and previous config saved to /var/cache/conftool/dbconfig/20250130-164828-marostegui.json |
[production] |
16:33 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2205 (T384592)', diff saved to https://phabricator.wikimedia.org/P72885 and previous config saved to /var/cache/conftool/dbconfig/20250130-163321-marostegui.json |
[production] |
16:22 |
<Emperor> |
repool ms-fe1014 T384317 |
[production] |
16:21 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host maps-test2001.codfw.wmnet with OS bookworm |
[production] |
16:17 |
<cdanis@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1115436|Trace only on k8s. (T321211 T340552 T385037)]] (duration: 11m 55s) |
[production] |
16:10 |
<cdanis@deploy2002> |
cdanis: Continuing with sync |
[production] |
16:09 |
<cdanis@deploy2002> |
cdanis: Backport for [[gerrit:1115436|Trace only on k8s. (T321211 T340552 T385037)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
16:06 |
<cdanis@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1115436|Trace only on k8s. (T321211 T340552 T385037)]] |
[production] |
15:58 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on maps-test2001.codfw.wmnet with reason: host reimage |
[production] |
15:54 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on maps-test2001.codfw.wmnet with reason: host reimage |
[production] |
15:53 |
<mforns@deploy2002> |
Finished deploy [airflow-dags/analytics@c85b504]: pin confluent kafka to avoid certificate errors (duration: 00m 52s) |
[production] |
15:53 |
<mforns@deploy2002> |
Started deploy [airflow-dags/analytics@c85b504]: pin confluent kafka to avoid certificate errors |
[production] |
15:52 |
<reedy@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1115396|FancyCaptcha: Return early in passCaptcha in numerous cases (T384858)]], [[gerrit:1115395|FancyCaptcha: Return early in passCaptcha in numerous cases (T384858)]], [[gerrit:1115429|MultiUsernameFilter: Don't try to split ids if they're not a string (T385169)]], [[gerrit:1115430|MultiUsernameFilter: Don't try to split ids if they're not a string ( |
[production] |
15:47 |
<moritzm> |
installing git security updates |
[production] |
15:46 |
<reedy@deploy2002> |
reedy: Continuing with sync |
[production] |
15:46 |
<reedy@deploy2002> |
reedy: Backport for [[gerrit:1115396|FancyCaptcha: Return early in passCaptcha in numerous cases (T384858)]], [[gerrit:1115395|FancyCaptcha: Return early in passCaptcha in numerous cases (T384858)]], [[gerrit:1115429|MultiUsernameFilter: Don't try to split ids if they're not a string (T385169)]], [[gerrit:1115430|MultiUsernameFilter: Don't try to split ids if they're not a string (T385169)]], [[gerri |
[production] |
15:43 |
<reedy@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1115396|FancyCaptcha: Return early in passCaptcha in numerous cases (T384858)]], [[gerrit:1115395|FancyCaptcha: Return early in passCaptcha in numerous cases (T384858)]], [[gerrit:1115429|MultiUsernameFilter: Don't try to split ids if they're not a string (T385169)]], [[gerrit:1115430|MultiUsernameFilter: Don't try to split ids if they're not a string (T |
[production] |
15:41 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ganeti2029.codfw.wmnet |
[production] |
15:41 |
<jmm@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=1) for host ganeti2029.codfw.wmnet |
[production] |
15:33 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reimage for host maps-test2001.codfw.wmnet with OS bookworm |
[production] |
15:30 |
<hashar@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1115103|Define new 'auth' docroot with custom files for the auth domain (T383952 T384137)]] (duration: 14m 55s) |
[production] |
15:25 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host ganeti2029.codfw.wmnet |
[production] |
15:23 |
<hashar@deploy2002> |
matmarex, hashar: Continuing with sync |
[production] |
15:18 |
<hashar@deploy2002> |
matmarex, hashar: Backport for [[gerrit:1115103|Define new 'auth' docroot with custom files for the auth domain (T383952 T384137)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
15:15 |
<hashar@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1115103|Define new 'auth' docroot with custom files for the auth domain (T383952 T384137)]] |
[production] |
15:11 |
<jmm@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host maps-test2001.codfw.wmnet with OS bookworm |
[production] |
15:09 |
<ladsgroup@dns1004> |
END - running authdns-update |
[production] |
15:09 |
<hashar@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1113476|Use full URLs for wgUploadNavigationUrl (T383916)]] (duration: 11m 02s) |
[production] |
15:07 |
<ladsgroup@dns1004> |
START - running authdns-update |
[production] |
15:04 |
<jmm@cumin2002> |
START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ganeti2029.codfw.wmnet |
[production] |
15:03 |
<hashar@deploy2002> |
hashar, matmarex: Continuing with sync |
[production] |
15:03 |
<hashar@deploy2002> |
hashar, matmarex: Backport for [[gerrit:1113476|Use full URLs for wgUploadNavigationUrl (T383916)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
15:02 |
<jayme> |
enabled puppet on all kubernetes hosts |
[production] |
15:01 |
<jmm@cumin2002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on ganeti2029.codfw.wmnet with reason: remove from cluster for reimage |
[production] |
14:58 |
<hashar@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1113476|Use full URLs for wgUploadNavigationUrl (T383916)]] |
[production] |
14:56 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2172 (re)pooling @ 100%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P72882 and previous config saved to /var/cache/conftool/dbconfig/20250130-145620-root.json |
[production] |
14:55 |
<elukey@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/kartotherian: sync |
[production] |
14:54 |
<elukey@deploy2002> |
helmfile [eqiad] START helmfile.d/services/kartotherian: sync |
[production] |
14:51 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db2205 (T384592)', diff saved to https://phabricator.wikimedia.org/P72881 and previous config saved to /var/cache/conftool/dbconfig/20250130-145136-marostegui.json |
[production] |