101-150 of 10000 results (17ms)
2025-08-01 ยง
13:33 <ayounsi@cumin1003> START - Cookbook sre.network.provision for device lsw1-e2-codfw.mgmt.codfw.wmnet [production]
13:27 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1200 (T400854)', diff saved to https://phabricator.wikimedia.org/P80430 and previous config saved to /var/cache/conftool/dbconfig/20250801-132745-ladsgroup.json [production]
13:25 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling db1200 (T400854)', diff saved to https://phabricator.wikimedia.org/P80429 and previous config saved to /var/cache/conftool/dbconfig/20250801-132514-ladsgroup.json [production]
13:25 <ladsgroup@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1200.eqiad.wmnet with reason: Maintenance [production]
13:24 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1185 (T400854)', diff saved to https://phabricator.wikimedia.org/P80428 and previous config saved to /var/cache/conftool/dbconfig/20250801-132451-ladsgroup.json [production]
13:09 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1185', diff saved to https://phabricator.wikimedia.org/P80427 and previous config saved to /var/cache/conftool/dbconfig/20250801-130943-ladsgroup.json [production]
13:06 <wmbot~lucaswerkmeister@tools-bastion-13> kubectl rollout restart deployment pagepile # T400995; just as in https://sal.toolforge.org/log/VweuxZYBffdvpiTrxEE7, `webservice restart` seemed confused about whether the webservice was running or not [tools.pagepile]
12:58 <ayounsi@cumin1003> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 274685 [production]
12:57 <ayounsi@cumin1003> START - Cookbook sre.network.peering with action 'email' for AS: 274685 [production]
12:57 <Amir1> re-running recountCategories.php on all wikis except s4 and s1 (T400987) [production]
12:57 <ayounsi@cumin1003> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 263252 [production]
12:57 <ayounsi@cumin1003> START - Cookbook sre.network.peering with action 'email' for AS: 263252 [production]
12:57 <ayounsi@cumin1003> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 37662 [production]
12:56 <jiji@deploy1003> helmfile [staging] DONE helmfile.d/services/thumbor: apply [production]
12:56 <jiji@deploy1003> helmfile [staging] START helmfile.d/services/thumbor: apply [production]
12:56 <ayounsi@cumin1003> START - Cookbook sre.network.peering with action 'email' for AS: 37662 [production]
12:55 <ayounsi@cumin1003> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 5400 [production]
12:54 <ladsgroup@deploy1003> Finished scap sync-world: Backport for [[gerrit:1175095|recountCategories: Avoid escpaing column name (T400987)]] (duration: 08m 36s) [production]
12:54 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1185', diff saved to https://phabricator.wikimedia.org/P80426 and previous config saved to /var/cache/conftool/dbconfig/20250801-125436-ladsgroup.json [production]
12:53 <ayounsi@cumin1003> START - Cookbook sre.network.peering with action 'email' for AS: 5400 [production]
12:49 <ladsgroup@deploy1003> ladsgroup: Continuing with sync [production]
12:48 <ladsgroup@deploy1003> ladsgroup: Backport for [[gerrit:1175095|recountCategories: Avoid escpaing column name (T400987)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
12:47 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
12:46 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
12:46 <ladsgroup@deploy1003> Started scap sync-world: Backport for [[gerrit:1175095|recountCategories: Avoid escpaing column name (T400987)]] [production]
12:39 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1185 (T400854)', diff saved to https://phabricator.wikimedia.org/P80424 and previous config saved to /var/cache/conftool/dbconfig/20250801-123928-ladsgroup.json [production]
12:30 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling db1185 (T400854)', diff saved to https://phabricator.wikimedia.org/P80423 and previous config saved to /var/cache/conftool/dbconfig/20250801-123057-ladsgroup.json [production]
12:30 <ladsgroup@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1185.eqiad.wmnet with reason: Maintenance [production]
12:30 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1161 (T400854)', diff saved to https://phabricator.wikimedia.org/P80422 and previous config saved to /var/cache/conftool/dbconfig/20250801-123034-ladsgroup.json [production]
12:15 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1161', diff saved to https://phabricator.wikimedia.org/P80421 and previous config saved to /var/cache/conftool/dbconfig/20250801-121526-ladsgroup.json [production]
12:00 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1161', diff saved to https://phabricator.wikimedia.org/P80420 and previous config saved to /var/cache/conftool/dbconfig/20250801-120019-ladsgroup.json [production]
11:45 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1161 (T400854)', diff saved to https://phabricator.wikimedia.org/P80419 and previous config saved to /var/cache/conftool/dbconfig/20250801-114511-ladsgroup.json [production]
11:42 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling db1161 (T400854)', diff saved to https://phabricator.wikimedia.org/P80418 and previous config saved to /var/cache/conftool/dbconfig/20250801-114238-ladsgroup.json [production]
11:42 <ladsgroup@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1016,1020].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance [production]
11:42 <ladsgroup@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1161.eqiad.wmnet with reason: Maintenance [production]
11:41 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1159 (T400854)', diff saved to https://phabricator.wikimedia.org/P80417 and previous config saved to /var/cache/conftool/dbconfig/20250801-114155-ladsgroup.json [production]
11:26 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1159', diff saved to https://phabricator.wikimedia.org/P80415 and previous config saved to /var/cache/conftool/dbconfig/20250801-112647-ladsgroup.json [production]
11:24 <hnowlan@deploy1003> helmfile [eqiad] DONE helmfile.d/services/thumbor: sync [production]
11:18 <hnowlan@deploy1003> helmfile [eqiad] START helmfile.d/services/thumbor: sync [production]
11:11 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1159', diff saved to https://phabricator.wikimedia.org/P80414 and previous config saved to /var/cache/conftool/dbconfig/20250801-111139-ladsgroup.json [production]
10:56 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1159 (T400854)', diff saved to https://phabricator.wikimedia.org/P80413 and previous config saved to /var/cache/conftool/dbconfig/20250801-105631-ladsgroup.json [production]
10:54 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling db1159 (T400854)', diff saved to https://phabricator.wikimedia.org/P80412 and previous config saved to /var/cache/conftool/dbconfig/20250801-105400-ladsgroup.json [production]
10:53 <ladsgroup@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1159.eqiad.wmnet with reason: Maintenance [production]
10:18 <elukey@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp2043.codfw.wmnet with OS bookworm [production]
10:14 <elukey@cumin1003> START - Cookbook sre.hosts.reimage for host cp2043.codfw.wmnet with OS bookworm [production]
10:01 <elukey@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp2043.codfw.wmnet with OS bookworm [production]
09:52 <elukey@cumin1003> START - Cookbook sre.hosts.reimage for host cp2043.codfw.wmnet with OS bookworm [production]
09:44 <elukey@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp2043.codfw.wmnet with OS bookworm [production]
09:38 <elukey@cumin1003> START - Cookbook sre.hosts.reimage for host cp2043.codfw.wmnet with OS bookworm [production]
09:38 <elukey@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp2043.codfw.wmnet with OS bullseye [production]