2025-04-28
ยง
|
11:52 |
<hnowlan@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/mw-cron: apply |
[production] |
11:52 |
<hnowlan@deploy1003> |
helmfile [eqiad] START helmfile.d/services/mw-cron: apply |
[production] |
11:52 |
<moritzm> |
installing avahi security updates |
[production] |
11:47 |
<XioNoX> |
push pfw policies - T392617 |
[production] |
11:45 |
<arnaudb@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on gerrit2003.wikimedia.org with reason: T392804 |
[production] |
11:42 |
<slyngshede@cumin1002> |
START - Cookbook sre.ganeti.reboot-vm for VM idp1004.wikimedia.org |
[production] |
11:41 |
<ladsgroup@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1139439|EventStore: Add caching for per-page event lookups (T392784)]] (duration: 13m 15s) |
[production] |
11:35 |
<ladsgroup@deploy1003> |
ladsgroup: Continuing with sync |
[production] |
11:33 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 274607 |
[production] |
11:33 |
<ladsgroup@deploy1003> |
ladsgroup: Backport for [[gerrit:1139439|EventStore: Add caching for per-page event lookups (T392784)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
11:32 |
<ayounsi@cumin1002> |
START - Cookbook sre.network.peering with action 'email' for AS: 274607 |
[production] |
11:32 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 270589 |
[production] |
11:32 |
<ayounsi@cumin1002> |
START - Cookbook sre.network.peering with action 'email' for AS: 270589 |
[production] |
11:32 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 17072 |
[production] |
11:31 |
<ayounsi@cumin1002> |
START - Cookbook sre.network.peering with action 'email' for AS: 17072 |
[production] |
11:31 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 264544 |
[production] |
11:31 |
<ayounsi@cumin1002> |
START - Cookbook sre.network.peering with action 'email' for AS: 264544 |
[production] |
11:31 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 61622 |
[production] |
11:31 |
<ayounsi@cumin1002> |
START - Cookbook sre.network.peering with action 'email' for AS: 61622 |
[production] |
11:30 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 264195 |
[production] |
11:30 |
<ayounsi@cumin1002> |
START - Cookbook sre.network.peering with action 'email' for AS: 264195 |
[production] |
11:30 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 267372 |
[production] |
11:30 |
<ozge@deploy1003> |
helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'experimental' for release 'main' . |
[production] |
11:30 |
<ayounsi@cumin1002> |
START - Cookbook sre.network.peering with action 'email' for AS: 267372 |
[production] |
11:28 |
<ladsgroup@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1139439|EventStore: Add caching for per-page event lookups (T392784)]] |
[production] |
10:55 |
<samtar@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1134771|InitialiseSettings: wgTemplateDataEnableDiscovery on testwiki (T377975)]] (duration: 12m 18s) |
[production] |
10:48 |
<samtar@deploy1003> |
samtar: Continuing with sync |
[production] |
10:47 |
<samtar@deploy1003> |
samtar: Backport for [[gerrit:1134771|InitialiseSettings: wgTemplateDataEnableDiscovery on testwiki (T377975)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
10:42 |
<samtar@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1134771|InitialiseSettings: wgTemplateDataEnableDiscovery on testwiki (T377975)]] |
[production] |
10:40 |
<fceratto@cumin1002> |
END (FAIL) - Cookbook sre.mysql.sanitize-wiki (exit_code=99) Managing sanitization for wikis nupwiki in section s5 |
[production] |
10:32 |
<fceratto@cumin1002> |
START - Cookbook sre.mysql.sanitize-wiki Managing sanitization for wikis nupwiki in section s5 |
[production] |
10:10 |
<fceratto@cumin1002> |
END (FAIL) - Cookbook sre.mysql.sanitize-wiki (exit_code=99) Managing sanitization for wikis nupwiki in section s5 |
[production] |
10:06 |
<elukey@cumin1002> |
END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM ml-staging-ctrl2001.codfw.wmnet |
[production] |
10:04 |
<fceratto@cumin1002> |
START - Cookbook sre.mysql.sanitize-wiki Managing sanitization for wikis nupwiki in section s5 |
[production] |
10:03 |
<fceratto@cumin1002> |
END (FAIL) - Cookbook sre.mysql.sanitize-wiki (exit_code=99) Managing sanitization for wikis nupwiki in section s5 |
[production] |
10:01 |
<elukey@cumin1002> |
START - Cookbook sre.ganeti.reboot-vm for VM ml-staging-ctrl2001.codfw.wmnet |
[production] |
09:58 |
<elukey@cumin1002> |
END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM ml-staging-ctrl2002.codfw.wmnet |
[production] |
09:55 |
<fceratto@cumin1002> |
START - Cookbook sre.mysql.sanitize-wiki Managing sanitization for wikis nupwiki in section s5 |
[production] |
09:54 |
<elukey> |
increase vcores and memory available for ml-staging-ctrl2* - T392289#10771944 |
[production] |
09:53 |
<elukey@cumin1002> |
START - Cookbook sre.ganeti.reboot-vm for VM ml-staging-ctrl2002.codfw.wmnet |
[production] |
09:51 |
<fceratto@cumin1002> |
END (FAIL) - Cookbook sre.mysql.sanitize-wiki (exit_code=99) Checking sanitization for wikis nupwiki in section s5 |
[production] |
09:46 |
<fceratto@cumin1002> |
START - Cookbook sre.mysql.sanitize-wiki Checking sanitization for wikis nupwiki in section s5 |
[production] |
09:28 |
<hnowlan@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/mw-cron: apply |
[production] |
09:28 |
<hnowlan@deploy1003> |
helmfile [eqiad] START helmfile.d/services/mw-cron: apply |
[production] |
09:02 |
<hnowlan@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/mw-cron: apply |
[production] |
09:02 |
<hnowlan@deploy1003> |
helmfile [eqiad] START helmfile.d/services/mw-cron: apply |
[production] |
08:58 |
<dcausse> |
restarting blazegraph on wdqs1013 (deadlocked) |
[production] |
08:55 |
<taavi> |
update cr-cloud firewall policy for new gerrit ip address T392793 |
[production] |
08:49 |
<taavi@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1137732|Add WMCS ranges to wgAutoblockExemptions (T386689)]] (duration: 25m 46s) |
[production] |
08:48 |
<jmm@cumin2002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5 days, 0:00:00 on krb1002.eqiad.wmnet with reason: work in progress, not yet active |
[production] |