2025-08-01
ยง
|
13:33 |
<ayounsi@cumin1003> |
START - Cookbook sre.network.provision for device lsw1-e2-codfw.mgmt.codfw.wmnet |
[production] |
13:27 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1200 (T400854)', diff saved to https://phabricator.wikimedia.org/P80430 and previous config saved to /var/cache/conftool/dbconfig/20250801-132745-ladsgroup.json |
[production] |
13:25 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Depooling db1200 (T400854)', diff saved to https://phabricator.wikimedia.org/P80429 and previous config saved to /var/cache/conftool/dbconfig/20250801-132514-ladsgroup.json |
[production] |
13:25 |
<ladsgroup@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1200.eqiad.wmnet with reason: Maintenance |
[production] |
13:24 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1185 (T400854)', diff saved to https://phabricator.wikimedia.org/P80428 and previous config saved to /var/cache/conftool/dbconfig/20250801-132451-ladsgroup.json |
[production] |
13:09 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1185', diff saved to https://phabricator.wikimedia.org/P80427 and previous config saved to /var/cache/conftool/dbconfig/20250801-130943-ladsgroup.json |
[production] |
13:06 |
<wmbot~lucaswerkmeister@tools-bastion-13> |
kubectl rollout restart deployment pagepile # T400995; just as in https://sal.toolforge.org/log/VweuxZYBffdvpiTrxEE7, `webservice restart` seemed confused about whether the webservice was running or not |
[tools.pagepile] |
12:58 |
<ayounsi@cumin1003> |
END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 274685 |
[production] |
12:57 |
<ayounsi@cumin1003> |
START - Cookbook sre.network.peering with action 'email' for AS: 274685 |
[production] |
12:57 |
<Amir1> |
re-running recountCategories.php on all wikis except s4 and s1 (T400987) |
[production] |
12:57 |
<ayounsi@cumin1003> |
END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 263252 |
[production] |
12:57 |
<ayounsi@cumin1003> |
START - Cookbook sre.network.peering with action 'email' for AS: 263252 |
[production] |
12:57 |
<ayounsi@cumin1003> |
END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 37662 |
[production] |
12:56 |
<jiji@deploy1003> |
helmfile [staging] DONE helmfile.d/services/thumbor: apply |
[production] |
12:56 |
<jiji@deploy1003> |
helmfile [staging] START helmfile.d/services/thumbor: apply |
[production] |
12:56 |
<ayounsi@cumin1003> |
START - Cookbook sre.network.peering with action 'email' for AS: 37662 |
[production] |
12:55 |
<ayounsi@cumin1003> |
END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 5400 |
[production] |
12:54 |
<ladsgroup@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1175095|recountCategories: Avoid escpaing column name (T400987)]] (duration: 08m 36s) |
[production] |
12:54 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1185', diff saved to https://phabricator.wikimedia.org/P80426 and previous config saved to /var/cache/conftool/dbconfig/20250801-125436-ladsgroup.json |
[production] |
12:53 |
<ayounsi@cumin1003> |
START - Cookbook sre.network.peering with action 'email' for AS: 5400 |
[production] |
12:49 |
<ladsgroup@deploy1003> |
ladsgroup: Continuing with sync |
[production] |
12:48 |
<ladsgroup@deploy1003> |
ladsgroup: Backport for [[gerrit:1175095|recountCategories: Avoid escpaing column name (T400987)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
12:47 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
12:46 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. |
[production] |
12:46 |
<ladsgroup@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1175095|recountCategories: Avoid escpaing column name (T400987)]] |
[production] |
12:39 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1185 (T400854)', diff saved to https://phabricator.wikimedia.org/P80424 and previous config saved to /var/cache/conftool/dbconfig/20250801-123928-ladsgroup.json |
[production] |
12:30 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Depooling db1185 (T400854)', diff saved to https://phabricator.wikimedia.org/P80423 and previous config saved to /var/cache/conftool/dbconfig/20250801-123057-ladsgroup.json |
[production] |
12:30 |
<ladsgroup@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1185.eqiad.wmnet with reason: Maintenance |
[production] |
12:30 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1161 (T400854)', diff saved to https://phabricator.wikimedia.org/P80422 and previous config saved to /var/cache/conftool/dbconfig/20250801-123034-ladsgroup.json |
[production] |
12:15 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1161', diff saved to https://phabricator.wikimedia.org/P80421 and previous config saved to /var/cache/conftool/dbconfig/20250801-121526-ladsgroup.json |
[production] |
12:00 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1161', diff saved to https://phabricator.wikimedia.org/P80420 and previous config saved to /var/cache/conftool/dbconfig/20250801-120019-ladsgroup.json |
[production] |
11:45 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1161 (T400854)', diff saved to https://phabricator.wikimedia.org/P80419 and previous config saved to /var/cache/conftool/dbconfig/20250801-114511-ladsgroup.json |
[production] |
11:42 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Depooling db1161 (T400854)', diff saved to https://phabricator.wikimedia.org/P80418 and previous config saved to /var/cache/conftool/dbconfig/20250801-114238-ladsgroup.json |
[production] |
11:42 |
<ladsgroup@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1016,1020].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance |
[production] |
11:42 |
<ladsgroup@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1161.eqiad.wmnet with reason: Maintenance |
[production] |
11:41 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1159 (T400854)', diff saved to https://phabricator.wikimedia.org/P80417 and previous config saved to /var/cache/conftool/dbconfig/20250801-114155-ladsgroup.json |
[production] |
11:26 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1159', diff saved to https://phabricator.wikimedia.org/P80415 and previous config saved to /var/cache/conftool/dbconfig/20250801-112647-ladsgroup.json |
[production] |
11:24 |
<hnowlan@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/thumbor: sync |
[production] |
11:18 |
<hnowlan@deploy1003> |
helmfile [eqiad] START helmfile.d/services/thumbor: sync |
[production] |
11:11 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1159', diff saved to https://phabricator.wikimedia.org/P80414 and previous config saved to /var/cache/conftool/dbconfig/20250801-111139-ladsgroup.json |
[production] |
10:56 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1159 (T400854)', diff saved to https://phabricator.wikimedia.org/P80413 and previous config saved to /var/cache/conftool/dbconfig/20250801-105631-ladsgroup.json |
[production] |
10:54 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Depooling db1159 (T400854)', diff saved to https://phabricator.wikimedia.org/P80412 and previous config saved to /var/cache/conftool/dbconfig/20250801-105400-ladsgroup.json |
[production] |
10:53 |
<ladsgroup@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1159.eqiad.wmnet with reason: Maintenance |
[production] |
10:18 |
<elukey@cumin1003> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp2043.codfw.wmnet with OS bookworm |
[production] |
10:14 |
<elukey@cumin1003> |
START - Cookbook sre.hosts.reimage for host cp2043.codfw.wmnet with OS bookworm |
[production] |
10:01 |
<elukey@cumin1003> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp2043.codfw.wmnet with OS bookworm |
[production] |
09:52 |
<elukey@cumin1003> |
START - Cookbook sre.hosts.reimage for host cp2043.codfw.wmnet with OS bookworm |
[production] |
09:44 |
<elukey@cumin1003> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp2043.codfw.wmnet with OS bookworm |
[production] |
09:38 |
<elukey@cumin1003> |
START - Cookbook sre.hosts.reimage for host cp2043.codfw.wmnet with OS bookworm |
[production] |
09:38 |
<elukey@cumin1003> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp2043.codfw.wmnet with OS bullseye |
[production] |