201-250 of 10000 results (87ms)
2025-04-14 ยง
12:56 <jforrester@deploy1003> Finished scap sync-world: Backport for [[gerrit:1136049|Special pages: Don't just set userCanExecute() but actually run it (T391594)]], [[gerrit:1136050|Client mode: Provide WikiLambdaClientModeOffline for SRE to disable]], [[gerrit:1136051|Wikifunctions VE: Add loading and abort state to content editable (T391441)]], [[gerrit:1136126|logging: Allow through WikiLambdaClient logs at info level an [production]
12:56 <vgutierrez@cumin1002> START - Cookbook sre.cdn.roll-upgrade-varnish rolling upgrade of Varnish on A:cp-upload_ulsfo and not P{cp4047.ulsfo.wmnet} and not P{cp4045.ulsfo.wmnet} and A:cp [production]
12:55 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool pc5 T391454', diff saved to https://phabricator.wikimedia.org/P74950 and previous config saved to /var/cache/conftool/dbconfig/20250414-125511-marostegui.json [production]
12:53 <moritzm> remove ganeti01.svc.codfw.wmnet cert (replaced by cfssl cert) T357750 [production]
12:51 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1189', diff saved to https://phabricator.wikimedia.org/P74949 and previous config saved to /var/cache/conftool/dbconfig/20250414-125156-fceratto.json [production]
12:51 <godog> upgrade prometheus2007 to thanos 0.38.0 - T383966 [production]
12:50 <godog> upgrade prometheus2005 to thanos 0.38.0 - T383966 [production]
12:49 <moritzm> remove ganeti01.svc.esams.wmnet cert (replaced by cfssl cert) T357750 [production]
12:46 <jforrester@deploy1003> jforrester: Continuing with sync [production]
12:46 <moritzm> remove ganeti01.svc.ulsfo.wmnet cert (replaced by cfssl cert) T357750 [production]
12:44 <jforrester@deploy1003> jforrester: Backport for [[gerrit:1136049|Special pages: Don't just set userCanExecute() but actually run it (T391594)]], [[gerrit:1136050|Client mode: Provide WikiLambdaClientModeOffline for SRE to disable]], [[gerrit:1136051|Wikifunctions VE: Add loading and abort state to content editable (T391441)]], [[gerrit:1136126|logging: Allow through WikiLambdaClient logs at info level and above]] sync [production]
12:43 <moritzm> remove ganeti01.svc.eqsin.wmnet cert (replaced by cfssl cert) T357750 [production]
12:36 <jforrester@deploy1003> Started scap sync-world: Backport for [[gerrit:1136049|Special pages: Don't just set userCanExecute() but actually run it (T391594)]], [[gerrit:1136050|Client mode: Provide WikiLambdaClientModeOffline for SRE to disable]], [[gerrit:1136051|Wikifunctions VE: Add loading and abort state to content editable (T391441)]], [[gerrit:1136126|logging: Allow through WikiLambdaClient logs at info level and [production]
12:36 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1189 (T391056)', diff saved to https://phabricator.wikimedia.org/P74948 and previous config saved to /var/cache/conftool/dbconfig/20250414-123649-fceratto.json [production]
12:32 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db1189 (T391056)', diff saved to https://phabricator.wikimedia.org/P74947 and previous config saved to /var/cache/conftool/dbconfig/20250414-123255-fceratto.json [production]
12:32 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1189.eqiad.wmnet with reason: Maintenance [production]
12:32 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1175 (T391056)', diff saved to https://phabricator.wikimedia.org/P74946 and previous config saved to /var/cache/conftool/dbconfig/20250414-123234-fceratto.json [production]
12:25 <cgoubert@deploy1003> helmfile [ml-serve-eqiad] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . [production]
12:24 <cgoubert@deploy1003> helmfile [ml-serve-codfw] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . [production]
12:24 <cgoubert@deploy1003> helmfile [ml-staging-codfw] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . [production]
12:23 <cgoubert@deploy1003> helmfile [ml-serve-eqiad] 'sync' command on namespace 'ores-legacy' for release 'main' . [production]
12:22 <cgoubert@deploy1003> helmfile [ml-serve-codfw] 'sync' command on namespace 'ores-legacy' for release 'main' . [production]
12:22 <cgoubert@deploy1003> helmfile [ml-staging-codfw] 'sync' command on namespace 'ores-legacy' for release 'main' . [production]
12:17 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1175', diff saved to https://phabricator.wikimedia.org/P74945 and previous config saved to /var/cache/conftool/dbconfig/20250414-121726-fceratto.json [production]
12:06 <jforrester@deploy1003> Started scap sync-world: Backport for [[gerrit:1136049|Special pages: Don't just set userCanExecute() but actually run it (T391594)]], [[gerrit:1136050|Client mode: Provide WikiLambdaClientModeOffline for SRE to disable]], [[gerrit:1136051|Wikifunctions VE: Add loading and abort state to content editable (T391441)]], [[gerrit:1136126|logging: Allow through WikiLambdaClient logs at info level and [production]
12:02 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1175', diff saved to https://phabricator.wikimedia.org/P74944 and previous config saved to /var/cache/conftool/dbconfig/20250414-120219-fceratto.json [production]
11:47 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1175 (T391056)', diff saved to https://phabricator.wikimedia.org/P74943 and previous config saved to /var/cache/conftool/dbconfig/20250414-114711-fceratto.json [production]
11:43 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db1175 (T391056)', diff saved to https://phabricator.wikimedia.org/P74942 and previous config saved to /var/cache/conftool/dbconfig/20250414-114323-fceratto.json [production]
11:43 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1175.eqiad.wmnet with reason: Maintenance [production]
11:43 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1166 (T391056)', diff saved to https://phabricator.wikimedia.org/P74941 and previous config saved to /var/cache/conftool/dbconfig/20250414-114300-fceratto.json [production]
11:40 <sukhe@cumin1002> END (PASS) - Cookbook sre.dns.roll-restart (exit_code=0) rolling restart_daemons on A:dnsbox [production]
11:30 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.cdn.roll-upgrade-varnish (exit_code=0) rolling upgrade of Varnish on P{cp4045.ulsfo.wmnet} and A:cp [production]
11:30 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.cdn.roll-upgrade-varnish (exit_code=0) rolling upgrade of Varnish on P{cp4037.ulsfo.wmnet} and A:cp [production]
11:29 <fceratto@deploy1003> helmfile [aux-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
11:29 <fceratto@deploy1003> helmfile [aux-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
11:28 <fceratto@deploy1003> helmfile [aux-k8s-codfw] DONE helmfile.d/admin 'apply'. [production]
11:27 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1166', diff saved to https://phabricator.wikimedia.org/P74940 and previous config saved to /var/cache/conftool/dbconfig/20250414-112754-fceratto.json [production]
11:27 <fceratto@deploy1003> helmfile [aux-k8s-codfw] START helmfile.d/admin 'apply'. [production]
11:26 <hnowlan@deploy1003> helmfile [codfw] DONE helmfile.d/services/rest-gateway: apply [production]
11:26 <hnowlan@deploy1003> helmfile [codfw] START helmfile.d/services/rest-gateway: apply [production]
11:25 <vgutierrez@cumin1002> START - Cookbook sre.cdn.roll-upgrade-varnish rolling upgrade of Varnish on P{cp4045.ulsfo.wmnet} and A:cp [production]
11:25 <hnowlan@deploy1003> helmfile [eqiad] DONE helmfile.d/services/rest-gateway: apply [production]
11:25 <vgutierrez@cumin1002> START - Cookbook sre.cdn.roll-upgrade-varnish rolling upgrade of Varnish on P{cp4037.ulsfo.wmnet} and A:cp [production]
11:25 <hnowlan@deploy1003> helmfile [eqiad] START helmfile.d/services/rest-gateway: apply [production]
11:24 <vgutierrez> upload varnishkafka 1.2.0-3 to apt.wm.o (bullseye-wikimedia) - T391334 [production]
11:20 <hnowlan@deploy1003> helmfile [staging] DONE helmfile.d/services/rest-gateway: apply [production]
11:20 <hnowlan@deploy1003> helmfile [staging] START helmfile.d/services/rest-gateway: apply [production]
11:19 <fceratto@deploy1003> helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. [production]
11:19 <fceratto@deploy1003> helmfile [staging-codfw] START helmfile.d/admin 'apply'. [production]
11:12 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1166', diff saved to https://phabricator.wikimedia.org/P74939 and previous config saved to /var/cache/conftool/dbconfig/20250414-111247-fceratto.json [production]