2025-04-16
§
|
15:56 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1233 (T391056)', diff saved to https://phabricator.wikimedia.org/P75129 and previous config saved to /var/cache/conftool/dbconfig/20250416-155655-fceratto.json |
[production] |
15:51 |
<bking@cumin2002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Renaming elastic2070 to cirrussearch2070 - bking@cumin2002" |
[production] |
15:46 |
<bking@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
15:46 |
<bking@cumin2002> |
START - Cookbook sre.hosts.rename from elastic2070 to cirrussearch2070 |
[production] |
15:45 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db1233 (T391056)', diff saved to https://phabricator.wikimedia.org/P75128 and previous config saved to /var/cache/conftool/dbconfig/20250416-154515-fceratto.json |
[production] |
15:45 |
<bking@cumin2002> |
START - Cookbook sre.elasticsearch.rolling-operation Operation.REIMAGE (3 nodes at a time) for ElasticSearch cluster search_codfw: reimage row B - bking@cumin2002 - T388610 |
[production] |
15:45 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1233.eqiad.wmnet with reason: Maintenance |
[production] |
15:44 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1229 (T391056)', diff saved to https://phabricator.wikimedia.org/P75127 and previous config saved to /var/cache/conftool/dbconfig/20250416-154452-fceratto.json |
[production] |
15:41 |
<arturo> |
add member srv-networktests |
[testlabs] |
15:32 |
<bking@cumin2002> |
END (FAIL) - Cookbook sre.elasticsearch.rolling-operation (exit_code=99) Operation.REIMAGE (3 nodes at a time) for ElasticSearch cluster search_codfw: reimage row B - bking@cumin2002 - T388610 |
[production] |
15:29 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1229', diff saved to https://phabricator.wikimedia.org/P75126 and previous config saved to /var/cache/conftool/dbconfig/20250416-152945-fceratto.json |
[production] |
15:17 |
<sukhe@dns1004> |
END - running authdns-update |
[production] |
15:14 |
<sukhe@dns1004> |
START - running authdns-update |
[production] |
15:14 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1229', diff saved to https://phabricator.wikimedia.org/P75125 and previous config saved to /var/cache/conftool/dbconfig/20250416-151438-fceratto.json |
[production] |
15:00 |
<dcaro@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component registry-admission |
[tools] |
14:59 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1229 (T391056)', diff saved to https://phabricator.wikimedia.org/P75124 and previous config saved to /var/cache/conftool/dbconfig/20250416-145928-fceratto.json |
[production] |
14:57 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db1229 (T391056)', diff saved to https://phabricator.wikimedia.org/P75123 and previous config saved to /var/cache/conftool/dbconfig/20250416-145718-fceratto.json |
[production] |
14:57 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1229.eqiad.wmnet with reason: Maintenance |
[production] |
14:53 |
<kamila@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/mw-cron: apply |
[production] |
14:52 |
<kamila@deploy1003> |
helmfile [eqiad] START helmfile.d/services/mw-cron: apply |
[production] |
14:48 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1225.eqiad.wmnet with reason: Maintenance |
[production] |
14:47 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1222 (T391056)', diff saved to https://phabricator.wikimedia.org/P75122 and previous config saved to /var/cache/conftool/dbconfig/20250416-144750-fceratto.json |
[production] |
14:47 |
<dcaro@cloudcumin1001> |
START - Cookbook wmcs.toolforge.component.deploy for component registry-admission |
[tools] |
14:45 |
<fceratto@deploy1003> |
helmfile [aux-k8s-eqiad] 'sync' command on namespace 'zarcillo' for release 'main' . |
[production] |
14:44 |
<fceratto@deploy1003> |
helmfile [aux-k8s-eqiad] 'sync' command on namespace 'zarcillo' for release 'main' . |
[production] |
14:42 |
<dcaro@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component registry-admission |
[toolsbeta] |
14:32 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1222', diff saved to https://phabricator.wikimedia.org/P75121 and previous config saved to /var/cache/conftool/dbconfig/20250416-143242-fceratto.json |
[production] |
14:31 |
<dcaro@cloudcumin1001> |
START - Cookbook wmcs.toolforge.component.deploy for component registry-admission |
[toolsbeta] |
14:29 |
<sukhe@dns1004> |
END - running authdns-update |
[production] |
14:27 |
<sukhe@dns1004> |
START - running authdns-update |
[production] |
14:26 |
<bking@cumin2002> |
START - Cookbook sre.elasticsearch.rolling-operation Operation.REIMAGE (3 nodes at a time) for ElasticSearch cluster search_codfw: reimage row B - bking@cumin2002 - T388610 |
[production] |
14:26 |
<bking@cumin2002> |
END (ERROR) - Cookbook sre.elasticsearch.rolling-operation (exit_code=97) Operation.REIMAGE (3 nodes at a time) for ElasticSearch cluster search_codfw: reimage row B - bking@cumin2002 - T388610 |
[production] |
14:22 |
<sukhe> |
reprepro -C component/nginx-ech include bookworm-wikimedia nginx_1.22.1-9+deb12u1+ech3_amd64.changes: T205378 |
[production] |
14:18 |
<bking@cumin2002> |
START - Cookbook sre.elasticsearch.rolling-operation Operation.REIMAGE (3 nodes at a time) for ElasticSearch cluster search_codfw: reimage row B - bking@cumin2002 - T388610 |
[production] |
14:17 |
<brouberol@cumin2002> |
END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) cirrussearch2101.codfw.wmnet on all recursors |
[production] |
14:17 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1222', diff saved to https://phabricator.wikimedia.org/P75120 and previous config saved to /var/cache/conftool/dbconfig/20250416-141735-fceratto.json |
[production] |
14:17 |
<brouberol@cumin2002> |
START - Cookbook sre.dns.wipe-cache cirrussearch2101.codfw.wmnet on all recursors |
[production] |
14:17 |
<brouberol@cumin2002> |
END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) cirrussearch2099.codfw.wmnet on all recursors |
[production] |
14:17 |
<brouberol@cumin2002> |
START - Cookbook sre.dns.wipe-cache cirrussearch2099.codfw.wmnet on all recursors |
[production] |
14:17 |
<brouberol@cumin2002> |
END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) cirrussearch2071.codfw.wmnet on all recursors |
[production] |
14:17 |
<brouberol@cumin2002> |
START - Cookbook sre.dns.wipe-cache cirrussearch2071.codfw.wmnet on all recursors |
[production] |
14:16 |
<brouberol@cumin2002> |
START - Cookbook sre.elasticsearch.rolling-operation Operation.REIMAGE (3 nodes at a time) for ElasticSearch cluster search_codfw: reimage row B - brouberol@cumin2002 - T388610 |
[production] |
14:02 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1222 (T391056)', diff saved to https://phabricator.wikimedia.org/P75119 and previous config saved to /var/cache/conftool/dbconfig/20250416-140228-fceratto.json |
[production] |
13:58 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.openstack.restart_openstack (exit_code=0) on deployment eqiad1 for service: project,cinder |
[admin] |
13:58 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.restart_openstack on deployment eqiad1 for service: project,cinder |
[admin] |
13:55 |
<Lucas_WMDE> |
UTC afternoon backport+config window done |
[production] |
13:55 |
<eevans@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 30 days, 0:00:00 on restbase1045.eqiad.wmnet with reason: Bootstrapping — T389423 |
[production] |
13:54 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mw-content-history-reconcile-enrich-next: apply |
[production] |
13:54 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mw-content-history-reconcile-enrich-next: apply |
[production] |
13:53 |
<lucaswerkmeister-wmde@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1136754|Release campaignEvents extension to azwiki (T390805)]] (duration: 19m 09s) |
[production] |