2024-03-26
|
12:48 |
<TheresNoTime> |
noting that `host='mwdebug2001.codfw.wmnet', port=443): Read timed out.` during scap `check_testservers_baremetal`, retry worked P58919 |
[production] |
12:47 |
<samtar@deploy1002> |
musikanimal and samtar: Continuing with sync |
[production] |
12:46 |
<samtar@deploy1002> |
musikanimal and samtar: Backport for [[gerrit:1014113|[officewiki, testwiki]: enable CodeMirrorV6 (T357795)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
12:41 |
<samtar@deploy1002> |
Started scap: Backport for [[gerrit:1014113|[officewiki, testwiki]: enable CodeMirrorV6 (T357795)]] |
[production] |
12:06 |
<btullis@deploy1002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
12:05 |
<btullis@deploy1002> |
helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. |
[production] |
11:24 |
<claime> |
enabling and running puppet on restbase1035.eqiad.wmnet - T358213 |
[production] |
11:19 |
<claime> |
enabling and running puppet on restbase2021.codfw.wmnet - T358213 |
[production] |
11:15 |
<claime> |
Stopping puppet on P:restbase to deploy 1005756 - T358213 |
[production] |
11:11 |
<brouberol@deploy1002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/spark-history: apply |
[production] |
11:10 |
<brouberol@deploy1002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/spark-history: apply |
[production] |
11:06 |
<ladsgroup@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on 16 hosts with reason: Maint T343718 |
[production] |
11:06 |
<ladsgroup@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on 16 hosts with reason: Maint T343718 |
[production] |
10:46 |
<urbanecm@deploy1002> |
Finished scap: Backport for [[gerrit:1013610|[beta] eswiki: Enable CommunityConfiguration extension (T357766)]], [[gerrit:1013611|[beta] eswiki: Use CommunityConfiguration extension for GrowthExperiments (T357766)]] (duration: 13m 41s) |
[production] |
10:43 |
<jayme@deploy1002> |
helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. |
[production] |
10:39 |
<jayme@deploy1002> |
helmfile [staging-codfw] START helmfile.d/admin 'apply'. |
[production] |
10:38 |
<brouberol@deploy1002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/spark-history: apply |
[production] |
10:37 |
<brouberol@deploy1002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/spark-history: apply |
[production] |
10:35 |
<urbanecm@deploy1002> |
urbanecm: Continuing with sync |
[production] |
10:35 |
<urbanecm@deploy1002> |
urbanecm: Backport for [[gerrit:1013610|[beta] eswiki: Enable CommunityConfiguration extension (T357766)]], [[gerrit:1013611|[beta] eswiki: Use CommunityConfiguration extension for GrowthExperiments (T357766)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
10:33 |
<brouberol@deploy1002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/superset: apply |
[production] |
10:33 |
<urbanecm@deploy1002> |
Started scap: Backport for [[gerrit:1013610|[beta] eswiki: Enable CommunityConfiguration extension (T357766)]], [[gerrit:1013611|[beta] eswiki: Use CommunityConfiguration extension for GrowthExperiments (T357766)]] |
[production] |
10:33 |
<brouberol@deploy1002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/superset: apply |
[production] |
10:31 |
<brouberol@deploy1002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/superset-next: apply |
[production] |
10:31 |
<urbanecm@deploy1002> |
Finished scap: Backport for [[gerrit:1013608|Add CommunityConfiguration extension (T357766)]], [[gerrit:1013609|Add wmgUseCommunityConfiguration (T357766)]] (duration: 48m 57s) |
[production] |
10:30 |
<brouberol@deploy1002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/superset-next: apply |
[production] |
10:18 |
<dcausse> |
stopping blazegraph on wdqs1013, (wdqs->wikidata maxlag propagation not working as expected) |
[production] |
10:18 |
<jayme@deploy1002> |
helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. |
[production] |
10:18 |
<jayme@deploy1002> |
helmfile [staging-codfw] START helmfile.d/admin 'apply'. |
[production] |
10:13 |
<urbanecm@deploy1002> |
urbanecm: Continuing with sync |
[production] |
10:13 |
<urbanecm@deploy1002> |
urbanecm: Backport for [[gerrit:1013608|Add CommunityConfiguration extension (T357766)]], [[gerrit:1013609|Add wmgUseCommunityConfiguration (T357766)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
10:08 |
<brouberol@cumin1002> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts an-tool1010.eqiad.wmnet |
[production] |
10:08 |
<brouberol@cumin1002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
10:08 |
<brouberol@cumin1002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: an-tool1010.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - brouberol@cumin1002" |
[production] |
10:01 |
<brouberol@cumin1002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: an-tool1010.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - brouberol@cumin1002" |
[production] |
09:42 |
<urbanecm@deploy1002> |
Started scap: Backport for [[gerrit:1013608|Add CommunityConfiguration extension (T357766)]], [[gerrit:1013609|Add wmgUseCommunityConfiguration (T357766)]] |
[production] |
09:38 |
<brouberol@deploy1002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
09:38 |
<brouberol@deploy1002> |
helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. |
[production] |
09:37 |
<brouberol@deploy1002> |
helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. |
[production] |
09:37 |
<brouberol@deploy1002> |
helmfile [staging-codfw] START helmfile.d/admin 'apply'. |
[production] |
09:12 |
<brouberol@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
09:04 |
<brouberol@cumin1002> |
START - Cookbook sre.hosts.decommission for hosts an-tool1010.eqiad.wmnet |
[production] |
09:02 |
<hashar@deploy1002> |
Finished scap: Backport for [[gerrit:1013632|zhwikivoyage: Enable NewUserMessage extension (T360175)]] (duration: 18m 29s) |
[production] |
08:51 |
<hashar@deploy1002> |
hashar and s8321414: Continuing with sync |
[production] |
08:46 |
<hashar@deploy1002> |
hashar and s8321414: Backport for [[gerrit:1013632|zhwikivoyage: Enable NewUserMessage extension (T360175)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
08:44 |
<hashar@deploy1002> |
Started scap: Backport for [[gerrit:1013632|zhwikivoyage: Enable NewUserMessage extension (T360175)]] |
[production] |
08:41 |
<dcausse> |
depooling and restarting blazegraph on wdqs1013 (stuck for 2 days) |
[production] |
08:41 |
<kart_> |
Updated cxserver to 2024-03-21-114859-production (T353510) |
[production] |
08:31 |
<brouberol> |
I'm going to apply kafka log compaction for {eqiad,codfw}.mediawiki.cirrussearch.page_rerender.v1 on kafka-main-codfw only (current replica) - T354794 |
[production] |
08:28 |
<kartik@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/cxserver: apply |
[production] |