301-350 of 10000 results (102ms)
2026-01-21 ยง
11:46 <btullis@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host an-druid1007.eqiad.wmnet with OS bookworm [production]
11:45 <btullis@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on an-druid1005.eqiad.wmnet with reason: host reimage [production]
11:45 <dreamyjazz@deploy2002> dreamyjazz, tstarling: Backport for [[gerrit:1229252|Remove unused LoginNotify config (T412939)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
11:44 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-druid1006.eqiad.wmnet with reason: host reimage [production]
11:44 <mvernon@cumin2002> END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host ms-be2074 [production]
11:44 <mvernon@cumin2002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host ms-be2074 [production]
11:43 <dreamyjazz@deploy2002> Started scap sync-world: Backport for [[gerrit:1229252|Remove unused LoginNotify config (T412939)]] [production]
11:40 <btullis@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on an-druid1006.eqiad.wmnet with reason: host reimage [production]
11:34 <mvernon@cumin2002> START - Cookbook sre.network.configure-switch-interfaces for host ms-be2074 [production]
11:34 <mvernon@cumin2002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) ms-be2074.codfw.wmnet 137.0.192.10.in-addr.arpa 7.3.1.0.0.0.0.0.2.9.1.0.0.1.0.0.1.0.1.0.0.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors [production]
11:34 <mvernon@cumin2002> START - Cookbook sre.dns.wipe-cache ms-be2074.codfw.wmnet 137.0.192.10.in-addr.arpa 7.3.1.0.0.0.0.0.2.9.1.0.0.1.0.0.1.0.1.0.0.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors [production]
11:34 <mvernon@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
11:34 <mvernon@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host ms-be2074 - mvernon@cumin2002" [production]
11:34 <mvernon@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host ms-be2074 - mvernon@cumin2002" [production]
11:30 <mvernon@cumin2002> START - Cookbook sre.dns.netbox [production]
11:29 <mvernon@cumin2002> START - Cookbook sre.hosts.move-vlan for host ms-be2074 [production]
11:29 <mvernon@cumin2002> START - Cookbook sre.hosts.reimage for host ms-be2074.codfw.wmnet with OS bullseye [production]
11:28 <btullis@cumin1003> START - Cookbook sre.hosts.reimage for host an-druid1007.eqiad.wmnet with OS bookworm [production]
11:28 <btullis@cumin1003> START - Cookbook sre.hosts.reimage for host an-druid1006.eqiad.wmnet with OS bookworm [production]
11:27 <btullis@cumin1003> START - Cookbook sre.hosts.reimage for host an-druid1005.eqiad.wmnet with OS bookworm [production]
11:27 <btullis@cumin1003> START - Cookbook sre.hosts.reimage for host an-druid1004.eqiad.wmnet with OS bookworm [production]
11:26 <btullis@cumin1003> START - Cookbook sre.hosts.reimage for host an-druid1003.eqiad.wmnet with OS bookworm [production]
11:14 <moritzm> installing curl bugfix updates from Bookworm point release [production]
10:49 <elukey@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host wdqs1032.eqiad.wmnet with OS trixie [production]
10:43 <elukey@cumin1003> START - Cookbook sre.hosts.reimage for host wdqs1032.eqiad.wmnet with OS trixie [production]
10:34 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on dbstore1008.eqiad.wmnet with reason: long schema change [production]
10:18 <moritzm> installing setuptools security updates [production]
10:13 <elukey@deploy2002> helmfile [staging] DONE helmfile.d/services/kartotherian: sync [production]
10:12 <elukey@deploy2002> helmfile [staging] START helmfile.d/services/kartotherian: sync [production]
09:09 <aklapper@deploy2002> rebuilt and synchronized wikiversions files: group1 to 1.46.0-wmf.12 refs T413803 [production]
08:19 <a-pizzata@deploy2002> Finished deploy [analytics/refinery@4f6560f] (thin): Regular analytics weekly train THIN [analytics/refinery@4f6560f9] (duration: 01m 16s) [production]
08:18 <a-pizzata@deploy2002> Started deploy [analytics/refinery@4f6560f] (thin): Regular analytics weekly train THIN [analytics/refinery@4f6560f9] [production]
08:11 <a-pizzata@deploy2002> Finished deploy [analytics/refinery@4f6560f]: Regular analytics weekly train [analytics/refinery@4f6560f9] (duration: 02m 43s) [production]
08:08 <a-pizzata@deploy2002> Started deploy [analytics/refinery@4f6560f]: Regular analytics weekly train [analytics/refinery@4f6560f9] [production]
08:08 <a-pizzata@deploy2002> Finished deploy [analytics/refinery@4f6560f] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@4f6560f9] (duration: 01m 05s) [production]
08:07 <a-pizzata@deploy2002> Started deploy [analytics/refinery@4f6560f] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@4f6560f9] [production]
07:24 <marostegui@cumin1003> dbctl commit (dc=all): 'Depooling db2216 (T411163 T411164)', diff saved to https://phabricator.wikimedia.org/P87830 and previous config saved to /var/cache/conftool/dbconfig/20260121-072446-marostegui.json [production]
07:24 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on db2216.codfw.wmnet with reason: Maintenance [production]
07:24 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2203 (T411163 T411164)', diff saved to https://phabricator.wikimedia.org/P87829 and previous config saved to /var/cache/conftool/dbconfig/20260121-072422-marostegui.json [production]
07:14 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2203', diff saved to https://phabricator.wikimedia.org/P87828 and previous config saved to /var/cache/conftool/dbconfig/20260121-071414-marostegui.json [production]
07:04 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2203', diff saved to https://phabricator.wikimedia.org/P87827 and previous config saved to /var/cache/conftool/dbconfig/20260121-070405-marostegui.json [production]
06:53 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2203 (T411163 T411164)', diff saved to https://phabricator.wikimedia.org/P87826 and previous config saved to /var/cache/conftool/dbconfig/20260121-065357-marostegui.json [production]
06:48 <ladsgroup@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on db2197.codfw.wmnet with reason: Maintenance [production]
06:48 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2189 (T410589)', diff saved to https://phabricator.wikimedia.org/P87825 and previous config saved to /var/cache/conftool/dbconfig/20260121-064817-ladsgroup.json [production]
06:47 <marostegui@cumin1003> END (PASS) - Cookbook sre.mysql.newpool (exit_code=0) pool db2179: After schema change [production]
06:47 <marostegui@cumin1003> END (PASS) - Cookbook sre.mysql.newpool (exit_code=0) pool db1160: After schema change [production]
06:44 <samwilson@deploy2002> Finished scap sync-world: Backport for [[gerrit:1229445|Revert "jquery.wikiEditor: enable resizing drag bar without RTP"]] (duration: 09m 37s) [production]
06:40 <samwilson@deploy2002> samwilson: Continuing with sync [production]
06:38 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2189', diff saved to https://phabricator.wikimedia.org/P87822 and previous config saved to /var/cache/conftool/dbconfig/20260121-063809-ladsgroup.json [production]
06:37 <samwilson@deploy2002> samwilson: Backport for [[gerrit:1229445|Revert "jquery.wikiEditor: enable resizing drag bar without RTP"]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]