2024-12-04
ยง
|
19:02 |
<ryankemper> |
T380555 Merging https://gerrit.wikimedia.org/r/c/operations/puppet/+/1094061 to establish initial service definitions for `wdqs-internal-main` and `wdqs-internal-scholarly` |
[production] |
18:58 |
<amastilovic@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/blunderbuss: apply |
[production] |
18:58 |
<amastilovic@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/blunderbuss: apply |
[production] |
18:58 |
<amastilovic@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/blunderbuss: apply |
[production] |
18:58 |
<amastilovic@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/blunderbuss: apply |
[production] |
18:57 |
<sukhe@cumin1002> |
END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) wdqs-internal-main.svc.eqiad.wmnet on all recursors |
[production] |
18:57 |
<sukhe@cumin1002> |
START - Cookbook sre.dns.wipe-cache wdqs-internal-main.svc.eqiad.wmnet on all recursors |
[production] |
18:55 |
<ryankemper> |
T379334 Successfully ran `sudo authdns-update` on `dns1004` |
[production] |
18:52 |
<ryankemper> |
T379334 Creating A and PTR records for `wdqs-internal-main` and `wdqs-internal-scholarly` VIPs [merging https://gerrit.wikimedia.org/r/c/operations/dns/+/1100010/ & running authdns update after] |
[production] |
18:48 |
<cjming@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic: apply |
[production] |
18:47 |
<cjming@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic: apply |
[production] |
18:47 |
<ryankemper> |
T379330 `wdqs-internal-main` and `wdqs-internal-scholarly` pools created |
[production] |
18:46 |
<ryankemper@cumin2002> |
conftool action : set/pooled=yes:weight=10; selector: cluster=wdqs-internal-main,service=wdqs-main |
[production] |
18:46 |
<ryankemper@cumin2002> |
conftool action : set/pooled=yes:weight=10; selector: cluster=wdqs-internal-scholarly,service=wdqs-scholarly |
[production] |
18:35 |
<dbrant@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/push-notifications: apply |
[production] |
18:35 |
<dbrant@deploy2002> |
helmfile [codfw] START helmfile.d/services/push-notifications: apply |
[production] |
18:34 |
<dbrant@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/push-notifications: apply |
[production] |
18:33 |
<dbrant@deploy2002> |
helmfile [eqiad] START helmfile.d/services/push-notifications: apply |
[production] |
18:30 |
<dbrant@deploy2002> |
helmfile [staging] DONE helmfile.d/services/push-notifications: apply |
[production] |
18:30 |
<dbrant@deploy2002> |
helmfile [staging] START helmfile.d/services/push-notifications: apply |
[production] |
18:13 |
<swfrench@deploy2002> |
Finished scap sync-world: Deployment to clear noop chart diff from 1081449 - T377040 (duration: 02m 07s) |
[production] |
18:11 |
<swfrench@deploy2002> |
Started scap sync-world: Deployment to clear noop chart diff from 1081449 - T377040 |
[production] |
18:04 |
<cjming@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic-next: apply |
[production] |
18:04 |
<cjming@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic-next: apply |
[production] |
18:01 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Depooling db1178 (T371742)', diff saved to https://phabricator.wikimedia.org/P71556 and previous config saved to /var/cache/conftool/dbconfig/20241204-180114-ladsgroup.json |
[production] |
18:01 |
<ladsgroup@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1178.eqiad.wmnet with reason: Maintenance |
[production] |
18:00 |
<ladsgroup@cumin1002> |
START - Cookbook sre.hosts.downtime for 12:00:00 on db1178.eqiad.wmnet with reason: Maintenance |
[production] |
18:00 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1177 (T371742)', diff saved to https://phabricator.wikimedia.org/P71555 and previous config saved to /var/cache/conftool/dbconfig/20241204-180052-ladsgroup.json |
[production] |
17:59 |
<joal> |
Rerun cassandra_load_pageview_top_articles_monthly after refinery patch deployed |
[analytics] |
17:56 |
<joal> |
Deploying refinery onto HDFS |
[analytics] |
17:55 |
<joal@deploy2002> |
Finished deploy [analytics/refinery@6e3ee14] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@6e3ee14b] (duration: 00m 31s) |
[production] |
17:54 |
<joal@deploy2002> |
Started deploy [analytics/refinery@6e3ee14] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@6e3ee14b] |
[production] |
17:54 |
<joal@deploy2002> |
Finished deploy [analytics/refinery@6e3ee14] (thin): Regular analytics weekly train THIN [analytics/refinery@6e3ee14b] (duration: 00m 37s) |
[production] |
17:54 |
<joal@deploy2002> |
Started deploy [analytics/refinery@6e3ee14] (thin): Regular analytics weekly train THIN [analytics/refinery@6e3ee14b] |
[production] |
17:52 |
<joal@deploy2002> |
Finished deploy [analytics/refinery@6e3ee14]: Regular analytics weekly train [analytics/refinery@6e3ee14b] (duration: 02m 05s) |
[production] |
17:50 |
<bd808> |
Moved SAL fediverse posts to https://wikimedia.social/@sal. Many thanks to botsin.space for providing hosting for so long. |
[production] |
17:50 |
<joal> |
Deploying refinery with scap |
[analytics] |
17:50 |
<joal@deploy2002> |
Started deploy [analytics/refinery@6e3ee14]: Regular analytics weekly train [analytics/refinery@6e3ee14b] |
[production] |
17:48 |
<wmbot~bd808@tools-bastion-12> |
Moved fediverse posting to https://wikimedia.social/@sal (T378571) |
[tools.stashbot] |
17:46 |
<andrewbogott> |
rebooting tools-legacy-redirector-2, many probes failing |
[tools] |
17:45 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1177', diff saved to https://phabricator.wikimedia.org/P71554 and previous config saved to /var/cache/conftool/dbconfig/20241204-174544-ladsgroup.json |
[production] |
17:38 |
<sstefanova@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component envvars-admission |
[tools] |
17:30 |
<sstefanova@cloudcumin1001> |
START - Cookbook wmcs.toolforge.component.deploy for component envvars-admission |
[tools] |
17:30 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1177', diff saved to https://phabricator.wikimedia.org/P71553 and previous config saved to /var/cache/conftool/dbconfig/20241204-173037-ladsgroup.json |
[production] |
17:30 |
<sstefanova@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component envvars-admission |
[toolsbeta] |
17:23 |
<sstefanova@cloudcumin1001> |
START - Cookbook wmcs.toolforge.component.deploy for component envvars-admission |
[toolsbeta] |
17:18 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.openstack.cloudvirt.drain (exit_code=0) on host 'cloudvirt1035.eqiad.wmnet' |
[admin] |
17:16 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.cloudvirt.drain on host 'cloudvirt1035.eqiad.wmnet' |
[admin] |
17:16 |
<andrew@cloudcumin1001> |
END (ERROR) - Cookbook wmcs.openstack.cloudvirt.drain (exit_code=97) on host 'cloudvirt1035.eqiad.wmnet' (T380893) |
[admin] |
17:16 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.openstack.cloudvirt.drain on host 'cloudvirt1035.eqiad.wmnet' (T380893) |
[admin] |