4051-4100 of 10000 results (52ms)
2024-12-04 ยง
18:58 <amastilovic@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/blunderbuss: apply [production]
18:58 <amastilovic@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/blunderbuss: apply [production]
18:57 <sukhe@cumin1002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) wdqs-internal-main.svc.eqiad.wmnet on all recursors [production]
18:57 <sukhe@cumin1002> START - Cookbook sre.dns.wipe-cache wdqs-internal-main.svc.eqiad.wmnet on all recursors [production]
18:55 <ryankemper> T379334 Successfully ran `sudo authdns-update` on `dns1004` [production]
18:52 <ryankemper> T379334 Creating A and PTR records for `wdqs-internal-main` and `wdqs-internal-scholarly` VIPs [merging https://gerrit.wikimedia.org/r/c/operations/dns/+/1100010/ & running authdns update after] [production]
18:48 <cjming@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic: apply [production]
18:47 <cjming@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic: apply [production]
18:47 <ryankemper> T379330 `wdqs-internal-main` and `wdqs-internal-scholarly` pools created [production]
18:46 <ryankemper@cumin2002> conftool action : set/pooled=yes:weight=10; selector: cluster=wdqs-internal-main,service=wdqs-main [production]
18:46 <ryankemper@cumin2002> conftool action : set/pooled=yes:weight=10; selector: cluster=wdqs-internal-scholarly,service=wdqs-scholarly [production]
18:35 <dbrant@deploy2002> helmfile [codfw] DONE helmfile.d/services/push-notifications: apply [production]
18:35 <dbrant@deploy2002> helmfile [codfw] START helmfile.d/services/push-notifications: apply [production]
18:34 <dbrant@deploy2002> helmfile [eqiad] DONE helmfile.d/services/push-notifications: apply [production]
18:33 <dbrant@deploy2002> helmfile [eqiad] START helmfile.d/services/push-notifications: apply [production]
18:30 <dbrant@deploy2002> helmfile [staging] DONE helmfile.d/services/push-notifications: apply [production]
18:30 <dbrant@deploy2002> helmfile [staging] START helmfile.d/services/push-notifications: apply [production]
18:13 <swfrench@deploy2002> Finished scap sync-world: Deployment to clear noop chart diff from 1081449 - T377040 (duration: 02m 07s) [production]
18:11 <swfrench@deploy2002> Started scap sync-world: Deployment to clear noop chart diff from 1081449 - T377040 [production]
18:04 <cjming@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic-next: apply [production]
18:04 <cjming@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic-next: apply [production]
18:01 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling db1178 (T371742)', diff saved to https://phabricator.wikimedia.org/P71556 and previous config saved to /var/cache/conftool/dbconfig/20241204-180114-ladsgroup.json [production]
18:01 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1178.eqiad.wmnet with reason: Maintenance [production]
18:00 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 12:00:00 on db1178.eqiad.wmnet with reason: Maintenance [production]
18:00 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1177 (T371742)', diff saved to https://phabricator.wikimedia.org/P71555 and previous config saved to /var/cache/conftool/dbconfig/20241204-180052-ladsgroup.json [production]
17:59 <joal> Rerun cassandra_load_pageview_top_articles_monthly after refinery patch deployed [analytics]
17:56 <joal> Deploying refinery onto HDFS [analytics]
17:55 <joal@deploy2002> Finished deploy [analytics/refinery@6e3ee14] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@6e3ee14b] (duration: 00m 31s) [production]
17:54 <joal@deploy2002> Started deploy [analytics/refinery@6e3ee14] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@6e3ee14b] [production]
17:54 <joal@deploy2002> Finished deploy [analytics/refinery@6e3ee14] (thin): Regular analytics weekly train THIN [analytics/refinery@6e3ee14b] (duration: 00m 37s) [production]
17:54 <joal@deploy2002> Started deploy [analytics/refinery@6e3ee14] (thin): Regular analytics weekly train THIN [analytics/refinery@6e3ee14b] [production]
17:52 <joal@deploy2002> Finished deploy [analytics/refinery@6e3ee14]: Regular analytics weekly train [analytics/refinery@6e3ee14b] (duration: 02m 05s) [production]
17:50 <bd808> Moved SAL fediverse posts to https://wikimedia.social/@sal. Many thanks to botsin.space for providing hosting for so long. [production]
17:50 <joal> Deploying refinery with scap [analytics]
17:50 <joal@deploy2002> Started deploy [analytics/refinery@6e3ee14]: Regular analytics weekly train [analytics/refinery@6e3ee14b] [production]
17:48 <wmbot~bd808@tools-bastion-12> Moved fediverse posting to https://wikimedia.social/@sal (T378571) [tools.stashbot]
17:46 <andrewbogott> rebooting tools-legacy-redirector-2, many probes failing [tools]
17:45 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1177', diff saved to https://phabricator.wikimedia.org/P71554 and previous config saved to /var/cache/conftool/dbconfig/20241204-174544-ladsgroup.json [production]
17:38 <sstefanova@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component envvars-admission [tools]
17:30 <sstefanova@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component envvars-admission [tools]
17:30 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1177', diff saved to https://phabricator.wikimedia.org/P71553 and previous config saved to /var/cache/conftool/dbconfig/20241204-173037-ladsgroup.json [production]
17:30 <sstefanova@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component envvars-admission [toolsbeta]
17:23 <sstefanova@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component envvars-admission [toolsbeta]
17:18 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.drain (exit_code=0) on host 'cloudvirt1035.eqiad.wmnet' [admin]
17:16 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.drain on host 'cloudvirt1035.eqiad.wmnet' [admin]
17:16 <andrew@cloudcumin1001> END (ERROR) - Cookbook wmcs.openstack.cloudvirt.drain (exit_code=97) on host 'cloudvirt1035.eqiad.wmnet' (T380893) [admin]
17:16 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.drain on host 'cloudvirt1035.eqiad.wmnet' (T380893) [admin]
17:15 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1177 (T371742)', diff saved to https://phabricator.wikimedia.org/P71551 and previous config saved to /var/cache/conftool/dbconfig/20241204-171530-ladsgroup.json [production]
17:11 <sstefanova@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component ingress-admission [tools]
17:10 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts cloudcephmon1003.eqiad.wmnet [production]