2025-04-16
§
|
14:57 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1229.eqiad.wmnet with reason: Maintenance |
[production] |
14:53 |
<kamila@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/mw-cron: apply |
[production] |
14:52 |
<kamila@deploy1003> |
helmfile [eqiad] START helmfile.d/services/mw-cron: apply |
[production] |
14:48 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1225.eqiad.wmnet with reason: Maintenance |
[production] |
14:47 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1222 (T391056)', diff saved to https://phabricator.wikimedia.org/P75122 and previous config saved to /var/cache/conftool/dbconfig/20250416-144750-fceratto.json |
[production] |
14:47 |
<dcaro@cloudcumin1001> |
START - Cookbook wmcs.toolforge.component.deploy for component registry-admission |
[tools] |
14:45 |
<fceratto@deploy1003> |
helmfile [aux-k8s-eqiad] 'sync' command on namespace 'zarcillo' for release 'main' . |
[production] |
14:44 |
<fceratto@deploy1003> |
helmfile [aux-k8s-eqiad] 'sync' command on namespace 'zarcillo' for release 'main' . |
[production] |
14:42 |
<dcaro@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component registry-admission |
[toolsbeta] |
14:32 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1222', diff saved to https://phabricator.wikimedia.org/P75121 and previous config saved to /var/cache/conftool/dbconfig/20250416-143242-fceratto.json |
[production] |
14:31 |
<dcaro@cloudcumin1001> |
START - Cookbook wmcs.toolforge.component.deploy for component registry-admission |
[toolsbeta] |
14:29 |
<sukhe@dns1004> |
END - running authdns-update |
[production] |
14:27 |
<sukhe@dns1004> |
START - running authdns-update |
[production] |
14:26 |
<bking@cumin2002> |
START - Cookbook sre.elasticsearch.rolling-operation Operation.REIMAGE (3 nodes at a time) for ElasticSearch cluster search_codfw: reimage row B - bking@cumin2002 - T388610 |
[production] |
14:26 |
<bking@cumin2002> |
END (ERROR) - Cookbook sre.elasticsearch.rolling-operation (exit_code=97) Operation.REIMAGE (3 nodes at a time) for ElasticSearch cluster search_codfw: reimage row B - bking@cumin2002 - T388610 |
[production] |
14:22 |
<sukhe> |
reprepro -C component/nginx-ech include bookworm-wikimedia nginx_1.22.1-9+deb12u1+ech3_amd64.changes: T205378 |
[production] |
14:18 |
<bking@cumin2002> |
START - Cookbook sre.elasticsearch.rolling-operation Operation.REIMAGE (3 nodes at a time) for ElasticSearch cluster search_codfw: reimage row B - bking@cumin2002 - T388610 |
[production] |
14:17 |
<brouberol@cumin2002> |
END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) cirrussearch2101.codfw.wmnet on all recursors |
[production] |
14:17 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1222', diff saved to https://phabricator.wikimedia.org/P75120 and previous config saved to /var/cache/conftool/dbconfig/20250416-141735-fceratto.json |
[production] |
14:17 |
<brouberol@cumin2002> |
START - Cookbook sre.dns.wipe-cache cirrussearch2101.codfw.wmnet on all recursors |
[production] |
14:17 |
<brouberol@cumin2002> |
END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) cirrussearch2099.codfw.wmnet on all recursors |
[production] |
14:17 |
<brouberol@cumin2002> |
START - Cookbook sre.dns.wipe-cache cirrussearch2099.codfw.wmnet on all recursors |
[production] |
14:17 |
<brouberol@cumin2002> |
END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) cirrussearch2071.codfw.wmnet on all recursors |
[production] |
14:17 |
<brouberol@cumin2002> |
START - Cookbook sre.dns.wipe-cache cirrussearch2071.codfw.wmnet on all recursors |
[production] |
14:16 |
<brouberol@cumin2002> |
START - Cookbook sre.elasticsearch.rolling-operation Operation.REIMAGE (3 nodes at a time) for ElasticSearch cluster search_codfw: reimage row B - brouberol@cumin2002 - T388610 |
[production] |
14:02 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1222 (T391056)', diff saved to https://phabricator.wikimedia.org/P75119 and previous config saved to /var/cache/conftool/dbconfig/20250416-140228-fceratto.json |
[production] |
13:58 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.openstack.restart_openstack (exit_code=0) on deployment eqiad1 for service: project,cinder |
[admin] |
13:58 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.restart_openstack on deployment eqiad1 for service: project,cinder |
[admin] |
13:55 |
<Lucas_WMDE> |
UTC afternoon backport+config window done |
[production] |
13:55 |
<eevans@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 30 days, 0:00:00 on restbase1045.eqiad.wmnet with reason: Bootstrapping — T389423 |
[production] |
13:54 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mw-content-history-reconcile-enrich-next: apply |
[production] |
13:54 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mw-content-history-reconcile-enrich-next: apply |
[production] |
13:53 |
<lucaswerkmeister-wmde@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1136754|Release campaignEvents extension to azwiki (T390805)]] (duration: 19m 09s) |
[production] |
13:52 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mw-content-history-reconcile-enrich: apply |
[production] |
13:52 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mw-content-history-reconcile-enrich: apply |
[production] |
13:51 |
<jelto> |
"Imported helm311 3.11.3-4 to bullseye-wikimedia and bookworm-wikimedia - T387548" |
[production] |
13:51 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db1222 (T391056)', diff saved to https://phabricator.wikimedia.org/P75118 and previous config saved to /var/cache/conftool/dbconfig/20250416-135121-fceratto.json |
[production] |
13:51 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1222.eqiad.wmnet with reason: Maintenance |
[production] |
13:50 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1197 (T391056)', diff saved to https://phabricator.wikimedia.org/P75117 and previous config saved to /var/cache/conftool/dbconfig/20250416-135059-fceratto.json |
[production] |
13:47 |
<lucaswerkmeister-wmde@deploy1003> |
mhorsey, lucaswerkmeister-wmde: Continuing with sync |
[production] |
13:44 |
<lucaswerkmeister-wmde@deploy1003> |
mhorsey, lucaswerkmeister-wmde: Backport for [[gerrit:1136754|Release campaignEvents extension to azwiki (T390805)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
13:35 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1197', diff saved to https://phabricator.wikimedia.org/P75116 and previous config saved to /var/cache/conftool/dbconfig/20250416-133552-fceratto.json |
[production] |
13:34 |
<lucaswerkmeister-wmde@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1136754|Release campaignEvents extension to azwiki (T390805)]] |
[production] |
13:28 |
<lucaswerkmeister-wmde@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1134984|search-redirect: fix case-sensitivity of project name (T391297)]] (duration: 22m 55s) |
[production] |
13:24 |
<godog> |
finish rollout of thanos 0.38 to prometheus* - T383966 |
[production] |
13:20 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1197', diff saved to https://phabricator.wikimedia.org/P75115 and previous config saved to /var/cache/conftool/dbconfig/20250416-132043-fceratto.json |
[production] |
13:20 |
<lucaswerkmeister-wmde@deploy1003> |
wargo, lucaswerkmeister-wmde: Continuing with sync |
[production] |
13:18 |
<godog> |
bounce thanos on titan100* - overload |
[production] |
13:16 |
<lucaswerkmeister-wmde@deploy1003> |
wargo, lucaswerkmeister-wmde: Backport for [[gerrit:1134984|search-redirect: fix case-sensitivity of project name (T391297)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
13:06 |
<lucaswerkmeister-wmde@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1134984|search-redirect: fix case-sensitivity of project name (T391297)]] |
[production] |