651-700 of 10000 results (90ms)
2025-08-19 ยง
18:24 <dancy@deploy1003> Installing scap version "4.206.0" for 2 host(s) [production]
18:24 <ladsgroup@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2193.codfw.wmnet with reason: Maintenance [production]
18:23 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2180 (T402010)', diff saved to https://phabricator.wikimedia.org/P81555 and previous config saved to /var/cache/conftool/dbconfig/20250819-182356-ladsgroup.json [production]
18:22 <mutante> gerrit - deactivated user Keccake256 for spam-like comments and edits on commons [production]
18:18 <jhathaway@cumin1002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host an-test-coord1002.eqiad.wmnet with OS bookworm [production]
18:11 <jhathaway@cumin1002> START - Cookbook sre.hosts.reimage for host an-test-coord1002.eqiad.wmnet with OS bookworm [production]
18:08 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2180', diff saved to https://phabricator.wikimedia.org/P81554 and previous config saved to /var/cache/conftool/dbconfig/20250819-180848-ladsgroup.json [production]
18:04 <rzl@deploy1003> Finished scap sync-world: https://gerrit.wikimedia.org/r/1174872 (duration: 07m 51s) [production]
18:00 <jhathaway@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on an-test-coord1002.eqiad.wmnet with reason: supermicro [production]
17:59 <rzl@deploy1003> rzl: Continuing with sync [production]
17:58 <rzl@deploy1003> rzl: https://gerrit.wikimedia.org/r/1174872 synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
17:57 <rzl@deploy1003> Started scap sync-world: https://gerrit.wikimedia.org/r/1174872 [production]
17:53 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2180', diff saved to https://phabricator.wikimedia.org/P81553 and previous config saved to /var/cache/conftool/dbconfig/20250819-175340-ladsgroup.json [production]
17:49 <jgleeson> process-control config revision changed from 80aab41e to 45c6fa38 [production]
17:39 <btullis@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/blunderbuss: apply [production]
17:38 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2180 (T402010)', diff saved to https://phabricator.wikimedia.org/P81552 and previous config saved to /var/cache/conftool/dbconfig/20250819-173833-ladsgroup.json [production]
17:38 <zoe@deploy1003> helmfile [eqiad] DONE helmfile.d/services/mw-experimental: apply [production]
17:38 <btullis@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/blunderbuss: apply [production]
17:37 <zoe@deploy1003> helmfile [eqiad] START helmfile.d/services/mw-experimental: apply [production]
17:37 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling db2180 (T402010)', diff saved to https://phabricator.wikimedia.org/P81551 and previous config saved to /var/cache/conftool/dbconfig/20250819-173709-ladsgroup.json [production]
17:37 <ladsgroup@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2180.codfw.wmnet with reason: Maintenance [production]
17:36 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2169 (T402010)', diff saved to https://phabricator.wikimedia.org/P81550 and previous config saved to /var/cache/conftool/dbconfig/20250819-173646-ladsgroup.json [production]
17:21 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2169', diff saved to https://phabricator.wikimedia.org/P81548 and previous config saved to /var/cache/conftool/dbconfig/20250819-172139-ladsgroup.json [production]
17:17 <swfrench@deploy1003> Finished scap sync-world: No-op deployment to introduce new build report metadata - T401721 (duration: 02m 52s) [production]
17:15 <swfrench@deploy1003> Started scap sync-world: No-op deployment to introduce new build report metadata - T401721 [production]
17:12 <mszabo@deploy1003> Finished scap sync-world: Backport for [[gerrit:1180167|AbuseFilterHooks: Handle IP user performers without actor records (T402298)]] (duration: 07m 38s) [production]
17:10 <mutante> phab2002/phab1004 - systemctl restart php7.4-fpm after we increased APCu shared memory segment size (T401157) [production]
17:07 <mszabo@deploy1003> kharlan, mszabo: Continuing with sync [production]
17:07 <mszabo@deploy1003> kharlan, mszabo: Backport for [[gerrit:1180167|AbuseFilterHooks: Handle IP user performers without actor records (T402298)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
17:06 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2169', diff saved to https://phabricator.wikimedia.org/P81546 and previous config saved to /var/cache/conftool/dbconfig/20250819-170632-ladsgroup.json [production]
17:05 <mszabo@deploy1003> Started scap sync-world: Backport for [[gerrit:1180167|AbuseFilterHooks: Handle IP user performers without actor records (T402298)]] [production]
16:51 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2169 (T402010)', diff saved to https://phabricator.wikimedia.org/P81545 and previous config saved to /var/cache/conftool/dbconfig/20250819-165124-ladsgroup.json [production]
16:50 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling db2169 (T402010)', diff saved to https://phabricator.wikimedia.org/P81544 and previous config saved to /var/cache/conftool/dbconfig/20250819-165015-ladsgroup.json [production]
16:50 <ladsgroup@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2169.codfw.wmnet with reason: Maintenance [production]
16:50 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2158 (T402010)', diff saved to https://phabricator.wikimedia.org/P81543 and previous config saved to /var/cache/conftool/dbconfig/20250819-165003-ladsgroup.json [production]
16:48 <vriley@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1042.eqiad.wmnet with OS bookworm [production]
16:48 <vriley@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - vriley@cumin1003" [production]
16:45 <vriley@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - vriley@cumin1003" [production]
16:39 <mszabo@deploy1003> Sync cancelled. [production]
16:34 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2158', diff saved to https://phabricator.wikimedia.org/P81542 and previous config saved to /var/cache/conftool/dbconfig/20250819-163455-ladsgroup.json [production]
16:32 <mszabo@deploy1003> mszabo, kharlan: Backport for [[gerrit:1180167|AbuseFilterHooks: Handle IP user performers without actor records (T402298)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
16:30 <mszabo@deploy1003> Started scap sync-world: Backport for [[gerrit:1180167|AbuseFilterHooks: Handle IP user performers without actor records (T402298)]] [production]
16:23 <vriley@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1042.eqiad.wmnet with reason: host reimage [production]
16:20 <vriley@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1042.eqiad.wmnet with reason: host reimage [production]
16:19 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2158', diff saved to https://phabricator.wikimedia.org/P81541 and previous config saved to /var/cache/conftool/dbconfig/20250819-161948-ladsgroup.json [production]
16:05 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1150.eqiad.wmnet with reason: Maintenance [production]
16:04 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2158 (T402010)', diff saved to https://phabricator.wikimedia.org/P81540 and previous config saved to /var/cache/conftool/dbconfig/20250819-160439-ladsgroup.json [production]
16:02 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling db2158 (T402010)', diff saved to https://phabricator.wikimedia.org/P81539 and previous config saved to /var/cache/conftool/dbconfig/20250819-160230-ladsgroup.json [production]
16:02 <ladsgroup@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2158.codfw.wmnet with reason: Maintenance [production]
16:02 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2151 (T402010)', diff saved to https://phabricator.wikimedia.org/P81538 and previous config saved to /var/cache/conftool/dbconfig/20250819-160218-ladsgroup.json [production]