1-50 of 10000 results (97ms)
2025-07-23 ยง
23:54 <dzahn@cumin2002> END (PASS) - Cookbook sre.gitlab.upgrade (exit_code=0) on GitLab host gitlab2002.wikimedia.org with reason: security release 20250723 [production]
23:48 <ryankemper@cumin1002> START - Cookbook sre.elasticsearch.rolling-operation Operation.RESTART (3 nodes at a time) for ElasticSearch cluster search_codfw: activate new plugins packages - ryankemper@cumin1002 - T397227 [production]
23:46 <ryankemper> [Cirrus] Depooled codfw in anticipation of rolling restart. Hopefully minimal noise on this one :) [production]
23:46 <ryankemper@cumin1002> conftool action : set/pooled=false; selector: dnsdisc=search,name=codfw [production]
23:15 <inflatador> pool cirrussearch eqiad, will resume investigations tomorrow T400160 [production]
23:14 <bking@cumin2002> conftool action : set/pooled=true; selector: dnsdisc=search,name=eqiad [production]
23:08 <bking@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 55 hosts with reason: testing cluster quorum [production]
22:53 <bking@cumin1002> END (PASS) - Cookbook sre.elasticsearch.rolling-operation (exit_code=0) Operation.RESTART (3 nodes at a time) for ElasticSearch cluster search_eqiad: activate new plugins packages - bking@cumin1002 - T397227 [production]
22:17 <vriley@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host clouddb1022.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
22:05 <vriley@cumin1002> START - Cookbook sre.hosts.provision for host clouddb1022.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
21:57 <vriley@cumin1002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host clouddb1022 [production]
21:56 <vriley@cumin1002> START - Cookbook sre.network.configure-switch-interfaces for host clouddb1022 [production]
21:55 <vriley@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
21:55 <vriley@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update mgmt clouddb1022 - vriley@cumin1002" [production]
21:55 <vriley@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update mgmt clouddb1022 - vriley@cumin1002" [production]
21:52 <vriley@cumin1002> START - Cookbook sre.dns.netbox [production]
21:15 <cscott@deploy1003> Finished scap sync-world: Backport for [[gerrit:1172108|Create "report visual bug" dialog (T365371)]], [[gerrit:1165094|Disable ParserMigration indicator and user notice (T363484 T363472)]] (duration: 40m 57s) [production]
21:02 <cscott@deploy1003> cscott: Continuing with sync [production]
20:58 <cscott@deploy1003> cscott: Backport for [[gerrit:1172108|Create "report visual bug" dialog (T365371)]], [[gerrit:1165094|Disable ParserMigration indicator and user notice (T363484 T363472)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
20:55 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2238 (T399728)', diff saved to https://phabricator.wikimedia.org/P79784 and previous config saved to /var/cache/conftool/dbconfig/20250723-205548-fceratto.json [production]
20:40 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2238', diff saved to https://phabricator.wikimedia.org/P79783 and previous config saved to /var/cache/conftool/dbconfig/20250723-204041-fceratto.json [production]
20:38 <eileen> * civicrm upgraded from 3c23a5c0 to fccd9ef9 [production]
20:37 <cwhite@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host logstash1023.eqiad.wmnet with OS bookworm [production]
20:34 <cscott@deploy1003> Started scap sync-world: Backport for [[gerrit:1172108|Create "report visual bug" dialog (T365371)]], [[gerrit:1165094|Disable ParserMigration indicator and user notice (T363484 T363472)]] [production]
20:32 <cscott@deploy1003> Finished scap sync-world: Backport for [[gerrit:1170549|Enable the "Report Visual Bug" feature of Extension:ParserMigration (T365371)]] (duration: 10m 32s) [production]
20:30 <dani@deploy1003> helmfile [codfw] DONE helmfile.d/services/miscweb: apply [production]
20:29 <dani@deploy1003> helmfile [codfw] START helmfile.d/services/miscweb: apply [production]
20:29 <dani@deploy1003> helmfile [eqiad] DONE helmfile.d/services/miscweb: apply [production]
20:29 <dani@deploy1003> helmfile [eqiad] START helmfile.d/services/miscweb: apply [production]
20:29 <dani@deploy1003> helmfile [staging] DONE helmfile.d/services/miscweb: apply [production]
20:28 <dani@deploy1003> helmfile [staging] START helmfile.d/services/miscweb: apply [production]
20:26 <cscott@deploy1003> cscott: Continuing with sync [production]
20:25 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2238', diff saved to https://phabricator.wikimedia.org/P79781 and previous config saved to /var/cache/conftool/dbconfig/20250723-202533-fceratto.json [production]
20:23 <cscott@deploy1003> cscott: Backport for [[gerrit:1170549|Enable the "Report Visual Bug" feature of Extension:ParserMigration (T365371)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
20:21 <cscott@deploy1003> Started scap sync-world: Backport for [[gerrit:1170549|Enable the "Report Visual Bug" feature of Extension:ParserMigration (T365371)]] [production]
20:18 <jhathaway@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on sretest1003.eqiad.wmnet with reason: redfish-test [production]
20:10 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2238 (T399728)', diff saved to https://phabricator.wikimedia.org/P79780 and previous config saved to /var/cache/conftool/dbconfig/20250723-201025-fceratto.json [production]
20:07 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db2238 (T399728)', diff saved to https://phabricator.wikimedia.org/P79779 and previous config saved to /var/cache/conftool/dbconfig/20250723-200722-fceratto.json [production]
20:07 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2238.codfw.wmnet with reason: Maintenance [production]
20:07 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2226 (T399728)', diff saved to https://phabricator.wikimedia.org/P79778 and previous config saved to /var/cache/conftool/dbconfig/20250723-200659-fceratto.json [production]
20:02 <cwhite@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on logstash1023.eqiad.wmnet with reason: host reimage [production]
19:57 <cwhite@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on logstash1023.eqiad.wmnet with reason: host reimage [production]
19:57 <jhathaway@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on ml-serve1012.eqiad.wmnet with reason: redfish-test [production]
19:53 <jhathaway@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on sretest2001.codfw.wmnet with reason: redfish-test [production]
19:51 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2226', diff saved to https://phabricator.wikimedia.org/P79777 and previous config saved to /var/cache/conftool/dbconfig/20250723-195152-fceratto.json [production]
19:41 <kharlan@deploy1003> Finished scap sync-world: Backport for [[gerrit:1172082|AuthManager: Move temp account login to continueAuthentication (T398270)]] (duration: 11m 39s) [production]
19:41 <cwhite@cumin2002> START - Cookbook sre.hosts.reimage for host logstash1023.eqiad.wmnet with OS bookworm [production]
19:36 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2226', diff saved to https://phabricator.wikimedia.org/P79776 and previous config saved to /var/cache/conftool/dbconfig/20250723-193644-fceratto.json [production]
19:36 <kharlan@deploy1003> kharlan: Continuing with sync [production]
19:32 <kharlan@deploy1003> kharlan: Backport for [[gerrit:1172082|AuthManager: Move temp account login to continueAuthentication (T398270)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]