2023-03-09
20:12 <sukhe@cumin2002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) dns1003.wikimedia.org on all recursors [production]
20:12 <sukhe@cumin2002> START - Cookbook sre.dns.wipe-cache dns1003.wikimedia.org on all recursors [production]
20:09 <sukhe@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
20:09 <sukhe@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add dns1003 (renamed from authdns1001) - sukhe@cumin2002" [production]
20:07 <sukhe@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add dns1003 (renamed from authdns1001) - sukhe@cumin2002" [production]
20:06 <sukhe@cumin2002> START - Cookbook sre.dns.netbox [production]
19:51 <ryankemper@cumin1001> START - Cookbook sre.elasticsearch.rolling-operation Operation.RESTART (3 nodes at a time) for ElasticSearch cluster search_eqiad: eqiad cluster restart to enable incr shard recovery throughput - ryankemper@cumin1001 - T317816 [production]
19:46 <btullis@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 12:00:00 on an-worker1078.eqiad.wmnet with reason: Replacing RAID BBU [production]
19:46 <btullis@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 12:00:00 on an-worker1078.eqiad.wmnet with reason: Replacing RAID BBU [production]
19:15 <sukhe@cumin2002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host dns1003 [production]
19:15 <sukhe@cumin2002> START - Cookbook sre.network.configure-switch-interfaces for host dns1003 [production]
19:14 <sukhe@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
19:14 <sukhe@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add dns1003 (renamed from authdns1001) - sukhe@cumin2002" [production]
19:12 <sukhe@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add dns1003 (renamed from authdns1001) - sukhe@cumin2002" [production]
19:10 <jhuneidi@deploy2002> rebuilt and synchronized wikiversions files: all wikis to 1.40.0-wmf.26 refs T330204 [production]
19:06 <sukhe@cumin2002> START - Cookbook sre.dns.netbox [production]
18:53 <sukhe> enable puppet on A:dns-rec and force puppet run: T330670 [production]
18:50 <mforns@deploy2002> Finished deploy [airflow-dags/analytics@3419b7d]: (no justification provided) (duration: 00m 10s) [production]
18:50 <mforns@deploy2002> Started deploy [airflow-dags/analytics@3419b7d]: (no justification provided) [production]
18:47 <sukhe> enable puppet on dns4003 to merge 895894 [production]
18:44 <sukhe> disable puppet on A:dns-rec to merge CR 895894 [production]
18:38 <jhathaway@deploy1002> helmfile [aux-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
18:38 <jhathaway@deploy1002> helmfile [aux-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
18:34 <sukhe> [correction] homer "cr*-codfw*" commit "Remove authdns2001 from homer, T330670" [production]
18:34 <sukhe> homer "cr*-codfw*" commit "Remove authdns1001 from homer, T330670" [production]
18:31 <sukhe> homer "cr*-eqiad*" commit "Remove authdns1001 from homer, T330670" [production]
18:26 <cmooney@cumin1001> END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) [production]
18:26 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts authdns[1001,2001].wikimedia.org [production]
18:25 <sukhe@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
18:25 <sukhe@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: authdns[1001,2001].wikimedia.org decommissioned, removing all IPs except the asset tag one - sukhe@cumin2002" [production]
18:24 <sukhe@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: authdns[1001,2001].wikimedia.org decommissioned, removing all IPs except the asset tag one - sukhe@cumin2002" [production]
18:22 <sukhe> running puppet-agent on A:dns-auth to remove deprecated authdns[12]001 [production]
18:22 <sukhe@cumin2002> START - Cookbook sre.dns.netbox [production]
18:21 <cmooney@cumin1001> START - Cookbook sre.dns.netbox [production]
18:15 <sukhe@cumin2002> START - Cookbook sre.hosts.decommission for hosts authdns[1001,2001].wikimedia.org [production]
18:11 <bd808@deploy2002> helmfile [codfw] DONE helmfile.d/services/developer-portal: apply [production]
18:10 <bd808@deploy2002> helmfile [codfw] START helmfile.d/services/developer-portal: apply [production]
18:10 <bd808@deploy2002> helmfile [eqiad] DONE helmfile.d/services/developer-portal: apply [production]
18:10 <cmooney@cumin1001> END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) [production]
18:09 <bd808@deploy2002> helmfile [eqiad] START helmfile.d/services/developer-portal: apply [production]
18:09 <bd808@deploy2002> helmfile [staging] DONE helmfile.d/services/developer-portal: apply [production]
18:09 <bd808@deploy2002> helmfile [staging] START helmfile.d/services/developer-portal: apply [production]
18:08 <cmooney@cumin1001> START - Cookbook sre.dns.netbox [production]
18:08 <cmooney@cumin1001> END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) [production]
18:01 <cmooney@cumin1001> START - Cookbook sre.dns.netbox [production]
18:00 <sukhe> cr*-codfw [ns0]: set routing-options static route 208.80.154.238/32 next-hop 208.80.153.77: T330670 [production]
17:53 <sukhe> cr*-codfw [ns1]: set routing-options static route 208.80.153.231/32 next-hop 208.80.153.77: T330670 [production]
17:50 <zabe@deploy2002> Finished scap: Backport for [[gerrit:896030|Revert "TransformHandler: Load stashed page bundle based on ETag." (T331629)]] (duration: 11m 57s) [production]
17:47 <btullis@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
17:47 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2179 (T329260)', diff saved to https://phabricator.wikimedia.org/P45725 and previous config saved to /var/cache/conftool/dbconfig/20230309-174723-marostegui.json [production]