2025-04-28
11:52 <hnowlan@deploy1003> helmfile [eqiad] DONE helmfile.d/services/mw-cron: apply [production]
11:52 <hnowlan@deploy1003> helmfile [eqiad] START helmfile.d/services/mw-cron: apply [production]
11:52 <moritzm> installing avahi security updates [production]
11:47 <XioNoX> push pfw policies - T392617 [production]
11:45 <arnaudb@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on gerrit2003.wikimedia.org with reason: T392804 [production]
11:42 <slyngshede@cumin1002> START - Cookbook sre.ganeti.reboot-vm for VM idp1004.wikimedia.org [production]
11:41 <ladsgroup@deploy1003> Finished scap sync-world: Backport for [[gerrit:1139439|EventStore: Add caching for per-page event lookups (T392784)]] (duration: 13m 15s) [production]
11:35 <ladsgroup@deploy1003> ladsgroup: Continuing with sync [production]
11:33 <ayounsi@cumin1002> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 274607 [production]
11:33 <ladsgroup@deploy1003> ladsgroup: Backport for [[gerrit:1139439|EventStore: Add caching for per-page event lookups (T392784)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
11:32 <ayounsi@cumin1002> START - Cookbook sre.network.peering with action 'email' for AS: 274607 [production]
11:32 <ayounsi@cumin1002> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 270589 [production]
11:32 <ayounsi@cumin1002> START - Cookbook sre.network.peering with action 'email' for AS: 270589 [production]
11:32 <ayounsi@cumin1002> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 17072 [production]
11:31 <ayounsi@cumin1002> START - Cookbook sre.network.peering with action 'email' for AS: 17072 [production]
11:31 <ayounsi@cumin1002> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 264544 [production]
11:31 <ayounsi@cumin1002> START - Cookbook sre.network.peering with action 'email' for AS: 264544 [production]
11:31 <ayounsi@cumin1002> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 61622 [production]
11:31 <ayounsi@cumin1002> START - Cookbook sre.network.peering with action 'email' for AS: 61622 [production]
11:30 <ayounsi@cumin1002> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 264195 [production]
11:30 <ayounsi@cumin1002> START - Cookbook sre.network.peering with action 'email' for AS: 264195 [production]
11:30 <ayounsi@cumin1002> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 267372 [production]
11:30 <ozge@deploy1003> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
11:30 <ayounsi@cumin1002> START - Cookbook sre.network.peering with action 'email' for AS: 267372 [production]
11:28 <ladsgroup@deploy1003> Started scap sync-world: Backport for [[gerrit:1139439|EventStore: Add caching for per-page event lookups (T392784)]] [production]
10:55 <samtar@deploy1003> Finished scap sync-world: Backport for [[gerrit:1134771|InitialiseSettings: wgTemplateDataEnableDiscovery on testwiki (T377975)]] (duration: 12m 18s) [production]
10:48 <samtar@deploy1003> samtar: Continuing with sync [production]
10:47 <samtar@deploy1003> samtar: Backport for [[gerrit:1134771|InitialiseSettings: wgTemplateDataEnableDiscovery on testwiki (T377975)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
10:42 <samtar@deploy1003> Started scap sync-world: Backport for [[gerrit:1134771|InitialiseSettings: wgTemplateDataEnableDiscovery on testwiki (T377975)]] [production]
10:40 <fceratto@cumin1002> END (FAIL) - Cookbook sre.mysql.sanitize-wiki (exit_code=99) Managing sanitization for wikis nupwiki in section s5 [production]
10:32 <fceratto@cumin1002> START - Cookbook sre.mysql.sanitize-wiki Managing sanitization for wikis nupwiki in section s5 [production]
10:10 <fceratto@cumin1002> END (FAIL) - Cookbook sre.mysql.sanitize-wiki (exit_code=99) Managing sanitization for wikis nupwiki in section s5 [production]
10:06 <elukey@cumin1002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM ml-staging-ctrl2001.codfw.wmnet [production]
10:04 <fceratto@cumin1002> START - Cookbook sre.mysql.sanitize-wiki Managing sanitization for wikis nupwiki in section s5 [production]
10:03 <fceratto@cumin1002> END (FAIL) - Cookbook sre.mysql.sanitize-wiki (exit_code=99) Managing sanitization for wikis nupwiki in section s5 [production]
10:01 <elukey@cumin1002> START - Cookbook sre.ganeti.reboot-vm for VM ml-staging-ctrl2001.codfw.wmnet [production]
09:58 <elukey@cumin1002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM ml-staging-ctrl2002.codfw.wmnet [production]
09:55 <fceratto@cumin1002> START - Cookbook sre.mysql.sanitize-wiki Managing sanitization for wikis nupwiki in section s5 [production]
09:54 <elukey> increase vcores and memory available for ml-staging-ctrl2* - T392289#10771944 [production]
09:53 <elukey@cumin1002> START - Cookbook sre.ganeti.reboot-vm for VM ml-staging-ctrl2002.codfw.wmnet [production]
09:51 <fceratto@cumin1002> END (FAIL) - Cookbook sre.mysql.sanitize-wiki (exit_code=99) Checking sanitization for wikis nupwiki in section s5 [production]
09:46 <fceratto@cumin1002> START - Cookbook sre.mysql.sanitize-wiki Checking sanitization for wikis nupwiki in section s5 [production]
09:28 <hnowlan@deploy1003> helmfile [eqiad] DONE helmfile.d/services/mw-cron: apply [production]
09:28 <hnowlan@deploy1003> helmfile [eqiad] START helmfile.d/services/mw-cron: apply [production]
09:02 <hnowlan@deploy1003> helmfile [eqiad] DONE helmfile.d/services/mw-cron: apply [production]
09:02 <hnowlan@deploy1003> helmfile [eqiad] START helmfile.d/services/mw-cron: apply [production]
08:58 <dcausse> restarting blazegraph on wdqs1013 (deadlocked) [production]
08:55 <taavi> update cr-cloud firewall policy for new gerrit ip address T392793 [production]
08:49 <taavi@deploy1003> Finished scap sync-world: Backport for [[gerrit:1137732|Add WMCS ranges to wgAutoblockExemptions (T386689)]] (duration: 25m 46s) [production]
08:48 <jmm@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5 days, 0:00:00 on krb1002.eqiad.wmnet with reason: work in progress, not yet active [production]