2024-09-18
§
|
15:08 |
<denisse> |
Resolve alerts DNS queries to alert1002 - T372418 |
[production] |
15:03 |
<_joe_> |
uploading conftool 3.2.4 to apt T375059 |
[production] |
15:02 |
<sukhe> |
sudo cumin "A:cp" 'disable-puppet "merging CR 1073798"': T365327 |
[production] |
15:01 |
<denisse> |
Make alert1002 the active host - T372418 |
[production] |
15:00 |
<denisse> |
Disable meta-monitoring for the alert hosts - T372418 |
[production] |
14:55 |
<elukey> |
restart poolcounter on poolcounter100[4,5] (depooled nodes) to clear old/stale TCP conns for port 7531 |
[production] |
14:54 |
<dcausse@deploy1003> |
helmfile [staging] DONE helmfile.d/services/rdf-streaming-updater: apply |
[production] |
14:54 |
<dcausse@deploy1003> |
helmfile [staging] START helmfile.d/services/rdf-streaming-updater: apply |
[production] |
14:54 |
<dcausse@deploy1003> |
helmfile [staging] DONE helmfile.d/services/rdf-streaming-updater: apply |
[production] |
14:54 |
<dcausse@deploy1003> |
helmfile [staging] START helmfile.d/services/rdf-streaming-updater: apply |
[production] |
14:53 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'configure' for AS: 55655 |
[production] |
14:52 |
<ayounsi@cumin1002> |
START - Cookbook sre.network.peering with action 'configure' for AS: 55655 |
[production] |
14:50 |
<dcausse@deploy1003> |
helmfile [staging] DONE helmfile.d/services/rdf-streaming-updater: apply |
[production] |
14:49 |
<dcausse@deploy1003> |
helmfile [staging] START helmfile.d/services/rdf-streaming-updater: apply |
[production] |
14:47 |
<dcausse@deploy1003> |
helmfile [staging] DONE helmfile.d/services/rdf-streaming-updater: apply |
[production] |
14:46 |
<dcausse@deploy1003> |
helmfile [staging] START helmfile.d/services/rdf-streaming-updater: apply |
[production] |
14:45 |
<dcausse@deploy1003> |
helmfile [staging] DONE helmfile.d/services/rdf-streaming-updater: apply |
[production] |
14:45 |
<dcausse@deploy1003> |
helmfile [staging] START helmfile.d/services/rdf-streaming-updater: apply |
[production] |
14:42 |
<elukey@cumin1002> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ganeti1052.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
14:40 |
<sukhe@cumin1002> |
END (PASS) - Cookbook sre.dns.roll-restart-reboot-wikimedia-dns (exit_code=0) rolling restart_daemons on A:wikidough and A:wikidough |
[production] |
14:36 |
<elukey@cumin1002> |
START - Cookbook sre.hosts.provision for host ganeti1052.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
14:26 |
<sukhe@cumin1002> |
START - Cookbook sre.dns.roll-restart-reboot-wikimedia-dns rolling restart_daemons on A:wikidough and A:wikidough |
[production] |
14:25 |
<bking@deploy1003> |
helmfile [staging] DONE helmfile.d/services/rdf-streaming-updater: apply |
[production] |
14:24 |
<sukhe> |
run puppet agent on A:wikidough |
[production] |
14:23 |
<bking@deploy1003> |
helmfile [staging] START helmfile.d/services/rdf-streaming-updater: apply |
[production] |
14:19 |
<bking@deploy1003> |
helmfile [staging] DONE helmfile.d/services/rdf-streaming-updater: apply |
[production] |
14:19 |
<bking@deploy1003> |
helmfile [staging] START helmfile.d/services/rdf-streaming-updater: apply |
[production] |
14:07 |
<bking@deploy1003> |
helmfile [staging] DONE helmfile.d/services/rdf-streaming-updater: apply |
[production] |
14:07 |
<bking@deploy1003> |
helmfile [staging] START helmfile.d/services/rdf-streaming-updater: apply |
[production] |
13:53 |
<elukey@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1073503|Swap poolcounter1005 with poolcounter1007 (T332015)]] (duration: 07m 23s) |
[production] |
13:49 |
<elukey@deploy1003> |
elukey: Continuing with sync |
[production] |
13:48 |
<elukey@deploy1003> |
elukey: Backport for [[gerrit:1073503|Swap poolcounter1005 with poolcounter1007 (T332015)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
13:46 |
<elukey@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1073503|Swap poolcounter1005 with poolcounter1007 (T332015)]] |
[production] |
13:38 |
<elukey@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1073502|Swap poolcounter1004 with poolcounter1006 (T332015)]] (duration: 07m 15s) |
[production] |
13:34 |
<elukey@deploy1003> |
elukey: Continuing with sync |
[production] |
13:33 |
<elukey@deploy1003> |
elukey: Backport for [[gerrit:1073502|Swap poolcounter1004 with poolcounter1006 (T332015)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
13:31 |
<elukey@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1073502|Swap poolcounter1004 with poolcounter1006 (T332015)]] |
[production] |
13:25 |
<Dreamy_Jazz> |
Afternoon UTC backport window done |
[production] |
13:20 |
<dreamyjazz@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1073739|GrowthExperiments: enable Community Updates module in testwiki (T374577)]], [[gerrit:1073487|Check that throttling exceptions use valid public IP addresses (T374980)]], [[gerrit:1073790|Hide temp account IP address viewing right from non-temp account wikis (T369187)]], [[gerrit:1073586|Lift IP cap on 2024-10-07/08 for edit-a-thon (T374964)] |
[production] |
13:18 |
<elukey> |
restart puppetserver on puppetserver1002 - thrashing - T373527 |
[production] |
13:15 |
<dreamyjazz@deploy1003> |
sgimeno, anzx, lucaswerkmeister-wmde, cscott, hnowlan, dreamyjazz: Continuing with sync |
[production] |
13:11 |
<dreamyjazz@deploy1003> |
sgimeno, anzx, lucaswerkmeister-wmde, cscott, hnowlan, dreamyjazz: Backport for [[gerrit:1073739|GrowthExperiments: enable Community Updates module in testwiki (T374577)]], [[gerrit:1073487|Check that throttling exceptions use valid public IP addresses (T374980)]], [[gerrit:1073790|Hide temp account IP address viewing right from non-temp account wikis (T369187)]], [[gerrit:1073586|Lift IP cap on |
[production] |
13:09 |
<dreamyjazz@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1073739|GrowthExperiments: enable Community Updates module in testwiki (T374577)]], [[gerrit:1073487|Check that throttling exceptions use valid public IP addresses (T374980)]], [[gerrit:1073790|Hide temp account IP address viewing right from non-temp account wikis (T369187)]], [[gerrit:1073586|Lift IP cap on 2024-10-07/08 for edit-a-thon (T374964)]] |
[production] |
12:46 |
<vgutierrez> |
rolling upgrade to purged 0.23 in A:cp-ulsfo - T334078 |
[production] |
12:44 |
<vgutierrez> |
uploaded purged 0.23 to bullseye-wikimedia (apt.wm.o) - T334078 |
[production] |
12:33 |
<moritzm> |
uploaded cas 7.0.4.1+wmf12u3 T367487 |
[production] |
12:28 |
<jnuche@deploy1003> |
rebuilt and synchronized wikiversions files: group1 to 1.43.0-wmf.23 refs T373642 |
[production] |
12:21 |
<tchin@deploy1003> |
Finished deploy [airflow-dags/analytics_test@e6cc31a]: Regular analytics weekly train (duration: 00m 20s) |
[production] |
12:21 |
<tchin@deploy1003> |
Started deploy [airflow-dags/analytics_test@e6cc31a]: Regular analytics weekly train |
[production] |
12:18 |
<tchin@deploy1003> |
Finished deploy [airflow-dags/analytics@e6cc31a]: Regular analytics weekly train (duration: 01m 18s) |
[production] |