251-300 of 10000 results (84ms)
2025-08-21 ยง
21:22 <ryankemper> T386098 Depooled eqiad `wdqs-internal-scholarly` in preparation for data transfer [production]
21:21 <ryankemper@cumin2002> conftool action : set/pooled=false; selector: dnsdisc=wdqs-internal-scholarly,name=eqiad [production]
21:21 <reedy@deploy1003> Finished scap sync-world: Backport for [[gerrit:1180970|CommonSettings: Add hcaptcha.wikimedia.org to $wgCrossSiteAJAXdomains (T382148)]] (duration: 11m 39s) [production]
21:18 <dzahn@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM people1005.eqiad.wmnet - dzahn@cumin1002" [production]
21:18 <jhathaway@cumin1002> START - Cookbook sre.hosts.reimage for host an-test-coord1002.eqiad.wmnet with OS bookworm [production]
21:16 <reedy@deploy1003> reedy: Continuing with sync [production]
21:15 <reedy@deploy1003> reedy: Backport for [[gerrit:1180970|CommonSettings: Add hcaptcha.wikimedia.org to $wgCrossSiteAJAXdomains (T382148)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
21:14 <dzahn@cumin1002> START - Cookbook sre.dns.netbox [production]
21:14 <dzahn@cumin1002> START - Cookbook sre.ganeti.makevm for new host people1005.eqiad.wmnet [production]
21:09 <reedy@deploy1003> Started scap sync-world: Backport for [[gerrit:1180970|CommonSettings: Add hcaptcha.wikimedia.org to $wgCrossSiteAJAXdomains (T382148)]] [production]
20:55 <ejegg> payments-wiki upgraded from 1235f11f to cb76e2b7 [production]
20:54 <ejegg> donorwiki upgraded from 5dcb98fd to cb76e2b7 [production]
20:53 <zabe@deploy1003> Finished scap sync-world: Backport for [[gerrit:1180609|Stop writing to cl_to and cl_collation on large s7 and s8 wikis (T399579)]] (duration: 11m 46s) [production]
20:48 <ryankemper@cumin2002> START - Cookbook sre.wdqs.data-transfer (T386098, transfer newly-reloaded data) xfer wikidata_main from wdqs2024.codfw.wmnet -> wdqs2027.codfw.wmnet w/ force delete existing files, repooling both afterwards [production]
20:47 <zabe@deploy1003> zabe: Continuing with sync [production]
20:47 <zabe@deploy1003> zabe: Backport for [[gerrit:1180609|Stop writing to cl_to and cl_collation on large s7 and s8 wikis (T399579)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
20:41 <zabe@deploy1003> Started scap sync-world: Backport for [[gerrit:1180609|Stop writing to cl_to and cl_collation on large s7 and s8 wikis (T399579)]] [production]
20:40 <zabe@deploy1003> Finished scap sync-world: Backport for [[gerrit:1180917|Update redirected link]] (duration: 11m 06s) [production]
20:39 <jhathaway@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-test-coord1002.eqiad.wmnet with reason: supermicro [production]
20:38 <mutante> deleted a bunch of old bounce messages in the exim queue on lists1004 [production]
20:37 <jhathaway@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host sretest2001.codfw.wmnet with OS bookworm [production]
20:35 <zabe@deploy1003> zabe, meno25: Continuing with sync [production]
20:35 <zabe@deploy1003> zabe, meno25: Backport for [[gerrit:1180917|Update redirected link]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
20:32 <ryankemper@cumin2002> END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) (T386098, transfer newly-reloaded data) xfer wikidata_main from wdqs2024.codfw.wmnet -> wdqs2027.codfw.wmnet w/ force delete existing files, repooling both afterwards [production]
20:32 <ryankemper@cumin2002> START - Cookbook sre.wdqs.data-transfer (T386098, transfer newly-reloaded data) xfer wikidata_main from wdqs2024.codfw.wmnet -> wdqs2027.codfw.wmnet w/ force delete existing files, repooling both afterwards [production]
20:29 <zabe@deploy1003> Started scap sync-world: Backport for [[gerrit:1180917|Update redirected link]] [production]
20:29 <zabe@deploy1003> Finished scap sync-world: Backport for [[gerrit:1180895|Set categorylinks to read new on enwiki (T397912)]] (duration: 11m 58s) [production]
20:23 <zabe@deploy1003> zabe: Continuing with sync [production]
20:23 <jhathaway@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on sretest2001.codfw.wmnet with reason: host reimage [production]
20:22 <zabe@deploy1003> zabe: Backport for [[gerrit:1180895|Set categorylinks to read new on enwiki (T397912)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
20:19 <jhathaway@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on sretest2001.codfw.wmnet with reason: host reimage [production]
20:19 <mutante> lists1004 - sudo exim4 -qf - forced delivery attempt as reaction to alerting about large mail queue [production]
20:17 <zabe@deploy1003> Started scap sync-world: Backport for [[gerrit:1180895|Set categorylinks to read new on enwiki (T397912)]] [production]
20:07 <jhathaway@cumin1002> START - Cookbook sre.hosts.reimage for host sretest2001.codfw.wmnet with OS bookworm [production]
20:00 <jhathaway@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host sretest2001.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART [production]
19:54 <ejegg> payments-wiki rolled back from 49bef1cf to 1235f11f [production]
19:53 <ejegg> payments-wiki upgraded from 1235f11f to 49bef1cf [production]
19:50 <jhathaway@cumin1002> START - Cookbook sre.hosts.provision for host sretest2001.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART [production]
19:48 <jhathaway@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on sretest2001.codfw.wmnet with reason: supermicro [production]
19:48 <jhathaway@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host sretest2001.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART [production]
19:44 <jhathaway@cumin1002> START - Cookbook sre.hosts.provision for host sretest2001.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART [production]
19:43 <jhathaway@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host sretest2001.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART [production]
19:38 <jhathaway@cumin1002> START - Cookbook sre.hosts.provision for host sretest2001.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART [production]
19:38 <jhathaway@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host sretest2001.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART [production]
19:37 <brett@dns1004> END - running authdns-update [production]
19:36 <brett@dns1004> START - running authdns-update [production]
19:35 <jhathaway@cumin1002> START - Cookbook sre.hosts.provision for host sretest2001.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART [production]
19:35 <jhathaway@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host sretest2001.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART [production]
19:13 <jhathaway@cumin1002> START - Cookbook sre.hosts.provision for host sretest2001.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART [production]
19:11 <jhathaway@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on sretest2001.codfw.wmnet with reason: supermicro [production]