2025-07-23
20:07 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2238.codfw.wmnet with reason: Maintenance [production]
20:07 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2226 (T399728)', diff saved to https://phabricator.wikimedia.org/P79778 and previous config saved to /var/cache/conftool/dbconfig/20250723-200659-fceratto.json [production]
20:02 <cwhite@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on logstash1023.eqiad.wmnet with reason: host reimage [production]
19:57 <cwhite@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on logstash1023.eqiad.wmnet with reason: host reimage [production]
19:57 <jhathaway@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on ml-serve1012.eqiad.wmnet with reason: redfish-test [production]
19:53 <jhathaway@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on sretest2001.codfw.wmnet with reason: redfish-test [production]
19:51 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2226', diff saved to https://phabricator.wikimedia.org/P79777 and previous config saved to /var/cache/conftool/dbconfig/20250723-195152-fceratto.json [production]
19:41 <kharlan@deploy1003> Finished scap sync-world: Backport for [[gerrit:1172082|AuthManager: Move temp account login to continueAuthentication (T398270)]] (duration: 11m 39s) [production]
19:41 <cwhite@cumin2002> START - Cookbook sre.hosts.reimage for host logstash1023.eqiad.wmnet with OS bookworm [production]
19:36 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2226', diff saved to https://phabricator.wikimedia.org/P79776 and previous config saved to /var/cache/conftool/dbconfig/20250723-193644-fceratto.json [production]
19:36 <kharlan@deploy1003> kharlan: Continuing with sync [production]
19:32 <kharlan@deploy1003> kharlan: Backport for [[gerrit:1172082|AuthManager: Move temp account login to continueAuthentication (T398270)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
19:30 <kharlan@deploy1003> Started scap sync-world: Backport for [[gerrit:1172082|AuthManager: Move temp account login to continueAuthentication (T398270)]] [production]
19:29 <mutante> gitlab-runner* - apt-get upgrade - upgrading gitlab-runner, libgnutls30, ca-certificates [production]
19:28 <bking@cumin1002> START - Cookbook sre.elasticsearch.rolling-operation Operation.RESTART (3 nodes at a time) for ElasticSearch cluster search_eqiad: activate new plugins packages - bking@cumin1002 - T397227 [production]
19:26 <dzahn@cumin2002> START - Cookbook sre.gitlab.upgrade on GitLab host gitlab2002.wikimedia.org with reason: security release 20250723 [production]
19:24 <dzahn@cumin2002> END (PASS) - Cookbook sre.gitlab.upgrade (exit_code=0) on GitLab host gitlab1004.wikimedia.org with reason: security release 20250723 [production]
19:21 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2226 (T399728)', diff saved to https://phabricator.wikimedia.org/P79775 and previous config saved to /var/cache/conftool/dbconfig/20250723-192136-fceratto.json [production]
19:20 <bking@cumin1002> END (ERROR) - Cookbook sre.elasticsearch.rolling-operation (exit_code=97) Operation.RESTART (3 nodes at a time) for ElasticSearch cluster search_eqiad: activate new plugins packages - bking@cumin1002 - T397227 [production]
19:18 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db2226 (T399728)', diff saved to https://phabricator.wikimedia.org/P79774 and previous config saved to /var/cache/conftool/dbconfig/20250723-191841-fceratto.json [production]
19:18 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2226.codfw.wmnet with reason: Maintenance [production]
19:18 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2225 (T399728)', diff saved to https://phabricator.wikimedia.org/P79773 and previous config saved to /var/cache/conftool/dbconfig/20250723-191817-fceratto.json [production]
19:16 <dzahn@cumin2002> START - Cookbook sre.gitlab.upgrade on GitLab host gitlab1004.wikimedia.org with reason: security release 20250723 [production]
19:14 <bking@cumin1002> START - Cookbook sre.elasticsearch.rolling-operation Operation.RESTART (3 nodes at a time) for ElasticSearch cluster search_eqiad: activate new plugins packages - bking@cumin1002 - T397227 [production]
19:14 <bking@cumin1002> END (ERROR) - Cookbook sre.elasticsearch.rolling-operation (exit_code=97) Operation.RESTART (3 nodes at a time) for ElasticSearch cluster search_eqiad: activate new plugins packages - bking@cumin1002 - T397227 [production]
19:11 <dzahn@cumin2002> END (PASS) - Cookbook sre.gitlab.upgrade (exit_code=0) on GitLab host gitlab1003.wikimedia.org with reason: security release 20250723 [production]
19:06 <otto@deploy1003> helmfile [eqiad] DONE helmfile.d/services/eventgate-analytics-external: apply [production]
19:04 <otto@deploy1003> helmfile [eqiad] START helmfile.d/services/eventgate-analytics-external: apply [production]
19:03 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2225', diff saved to https://phabricator.wikimedia.org/P79772 and previous config saved to /var/cache/conftool/dbconfig/20250723-190309-fceratto.json [production]
19:03 <otto@deploy1003> helmfile [codfw] DONE helmfile.d/services/eventgate-analytics-external: apply [production]
19:02 <otto@deploy1003> helmfile [codfw] START helmfile.d/services/eventgate-analytics-external: apply [production]
19:02 <dzahn@cumin2002> START - Cookbook sre.gitlab.upgrade on GitLab host gitlab1003.wikimedia.org with reason: security release 20250723 [production]
19:01 <bking@cumin1002> START - Cookbook sre.elasticsearch.rolling-operation Operation.RESTART (3 nodes at a time) for ElasticSearch cluster search_eqiad: activate new plugins packages - bking@cumin1002 - T397227 [production]
19:01 <otto@deploy1003> helmfile [eqiad] DONE helmfile.d/services/eventgate-logging-external: apply [production]
19:00 <ottomata> deploying eventgate-analytics-external and eventgate-logging-external to get meta.dt logic change - T376026 [production]
18:59 <otto@deploy1003> helmfile [eqiad] START helmfile.d/services/eventgate-logging-external: apply [production]
18:59 <otto@deploy1003> helmfile [codfw] DONE helmfile.d/services/eventgate-logging-external: apply [production]
18:58 <otto@deploy1003> helmfile [codfw] START helmfile.d/services/eventgate-logging-external: apply [production]
18:52 <inflatador> depool eqiad in preparation for rolling restart T399162 [production]
18:51 <bking@cumin2002> conftool action : set/pooled=false; selector: dnsdisc=search,name=eqiad [production]
18:50 <dduvall@deploy1003> rebuilt and synchronized wikiversions files: group1 to 1.45.0-wmf.11 refs T396372 [production]
18:48 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2225', diff saved to https://phabricator.wikimedia.org/P79771 and previous config saved to /var/cache/conftool/dbconfig/20250723-184801-fceratto.json [production]
18:47 <otto@deploy1003> helmfile [staging] DONE helmfile.d/services/eventgate-logging-external: apply [production]
18:47 <otto@deploy1003> helmfile [staging] START helmfile.d/services/eventgate-logging-external: apply [production]
18:43 <otto@deploy1003> helmfile [staging] DONE helmfile.d/services/eventgate-analytics-external: apply [production]
18:42 <otto@deploy1003> helmfile [staging] START helmfile.d/services/eventgate-analytics-external: apply [production]
18:32 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2225 (T399728)', diff saved to https://phabricator.wikimedia.org/P79770 and previous config saved to /var/cache/conftool/dbconfig/20250723-183254-fceratto.json [production]
18:29 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db2225 (T399728)', diff saved to https://phabricator.wikimedia.org/P79769 and previous config saved to /var/cache/conftool/dbconfig/20250723-182951-fceratto.json [production]
18:29 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2225.codfw.wmnet with reason: Maintenance [production]
18:29 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2207 (T399728)', diff saved to https://phabricator.wikimedia.org/P79768 and previous config saved to /var/cache/conftool/dbconfig/20250723-182928-fceratto.json [production]