251-300 of 10000 results (36ms)
2023-10-12 ยง
13:43 <sukhe> remove old ns2 IP 91.198.174.239/32 from /e/n/i on A:dns-rec: T329219 [production]
13:38 <ayounsi@cumin1001> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 54994 [production]
13:37 <ayounsi@cumin1001> START - Cookbook sre.network.peering with action 'email' for AS: 54994 [production]
13:35 <sukhe> remove redundant 208.80.153.231/32 from /e/n/i on A:dns-rec and A:codfw (superseded by label lo:anycast): T348041 [production]
13:34 <kartik@deploy2002> Finished scap: Backport for [[gerrit:955007|Add Akan language (T333765)]] (duration: 09m 39s) [production]
13:32 <ayounsi@cumin1001> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 139901 [production]
13:32 <ayounsi@cumin1001> START - Cookbook sre.network.peering with action 'email' for AS: 139901 [production]
13:28 <kartik@deploy2002> kartik and srishakatux: Continuing with sync [production]
13:26 <btullis@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host archiva1002.wikimedia.org [production]
13:26 <ayounsi@cumin1001> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 15133 [production]
13:25 <kartik@deploy2002> kartik and srishakatux: Backport for [[gerrit:955007|Add Akan language (T333765)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
13:25 <kartik@deploy2002> Started scap: Backport for [[gerrit:955007|Add Akan language (T333765)]] [production]
13:24 <sukhe@cumin2002> START - Cookbook sre.dns.roll-restart-reboot-wikimedia-dns rolling restart_daemons on A:wikidough and A:wikidough [production]
13:24 <ayounsi@cumin1001> START - Cookbook sre.network.peering with action 'email' for AS: 15133 [production]
13:23 <btullis@cumin1001> START - Cookbook sre.hosts.reboot-single for host archiva1002.wikimedia.org [production]
13:22 <btullis> rebooting archiva1002.wikimedia.org for T344671 [analytics]
13:19 <ayounsi@cumin1001> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'configure' for AS: 40317 [production]
13:19 <ayounsi@cumin1001> START - Cookbook sre.network.peering with action 'configure' for AS: 40317 [production]
13:18 <hashar@deploy2002> Finished scap: Backport for [[gerrit:965217|LinkRecommendationUpdater: Update $linkRecommendationTaskType declaration (T348719)]] (duration: 06m 51s) [production]
13:13 <hashar@deploy2002> phuedx and hashar: Continuing with sync [production]
13:13 <hashar@deploy2002> phuedx and hashar: Backport for [[gerrit:965217|LinkRecommendationUpdater: Update $linkRecommendationTaskType declaration (T348719)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
13:11 <hashar@deploy2002> Started scap: Backport for [[gerrit:965217|LinkRecommendationUpdater: Update $linkRecommendationTaskType declaration (T348719)]] [production]
13:10 <taavi@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.apt.copy_to_main_repo (exit_code=0) for package 'toolforge-jobs-framework-cli' version '15' [tools]
13:10 <taavi@cloudcumin1001> START - Cookbook wmcs.toolforge.apt.copy_to_main_repo for package 'toolforge-jobs-framework-cli' version '15' [tools]
13:06 <hashar> Updating docker-pkg files for https://gerrit.wikimedia.org/r/c/integration/config/+/965502 # T348724 [releng]
12:34 <wm-bot> <root> Restart because of duplicate messages [tools.bridgebot]
12:30 <taavi> restart bnc pod to get tool to reconnect [tools.bridgebot]
12:26 <jayme> re-enable puppet on A:cp - T347544 [production]
12:21 <dcaro> rebooting sgeexec-10-17 [tools]
12:18 <jayme> disable puppet on A:cp - T347544 [production]
12:16 <jayme> disable puppet on A:cp-text - T347544 [production]
12:02 <taavi> also reboot tools-sgeweblight-10-30 [tools]
12:00 <btullis> pushing out presto version 0.283 to the test cluster. [analytics]
12:00 <taavi@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.grid.cleanup_queue_errors (exit_code=0) [tools]
12:00 <taavi@cloudcumin1001> START - Cookbook wmcs.toolforge.grid.cleanup_queue_errors [tools]
11:59 <taavi@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.grid.cleanup_queue_errors (exit_code=0) [toolsbeta]
11:59 <taavi@cloudcumin1001> START - Cookbook wmcs.toolforge.grid.cleanup_queue_errors [toolsbeta]
11:52 <taavi> reboot tools-sgeweblight-10-22, 28 [tools]
11:49 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2107.codfw.wmnet with reason: Maintenance [production]
11:49 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2107.codfw.wmnet with reason: Maintenance [production]
11:49 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1162.eqiad.wmnet with reason: Maintenance [production]
11:49 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1162.eqiad.wmnet with reason: Maintenance [production]
11:41 <taavi> configure keepalived ip for main project-proxy service T316982 [project-proxy]
11:37 <jayme@deploy2002> helmfile [eqiad] DONE helmfile.d/services/wikifunctions: apply [production]
11:36 <jayme@deploy2002> helmfile [eqiad] START helmfile.d/services/wikifunctions: apply [production]
11:34 <jayme@deploy2002> helmfile [codfw] DONE helmfile.d/services/wikifunctions: apply [production]
11:33 <jayme@deploy2002> helmfile [codfw] START helmfile.d/services/wikifunctions: apply [production]
11:30 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on sretest1001.eqiad.wmnet with reason: testing [production]
11:30 <jmm@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on sretest1001.eqiad.wmnet with reason: testing [production]
11:27 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2102.codfw.wmnet with reason: Maintenance [production]