2501-2550 of 10000 results (26ms)
2023-01-23 §
07:08 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db2107.codfw.wmnet with reason: Maintenance [production]
06:58 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2113.codfw.wmnet with reason: Maintenance [production]
06:58 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2113.codfw.wmnet with reason: Maintenance [production]
06:23 <kart_> Updated cxserver to 2023-01-20-051603-production (T323840, T326236) [production]
06:19 <kartik@deploy1002> helmfile [eqiad] DONE helmfile.d/services/cxserver: apply [production]
06:18 <kartik@deploy1002> helmfile [eqiad] START helmfile.d/services/cxserver: apply [production]
06:18 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2107.codfw.wmnet with reason: Maintenance [production]
06:18 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2107.codfw.wmnet with reason: Maintenance [production]
06:17 <kartik@deploy1002> helmfile [codfw] DONE helmfile.d/services/cxserver: apply [production]
06:17 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2107.codfw.wmnet with reason: Maintenance [production]
06:17 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db2107.codfw.wmnet with reason: Maintenance [production]
06:16 <kartik@deploy1002> helmfile [codfw] START helmfile.d/services/cxserver: apply [production]
06:12 <kartik@deploy1002> helmfile [staging] DONE helmfile.d/services/cxserver: apply [production]
06:12 <kartik@deploy1002> helmfile [staging] START helmfile.d/services/cxserver: apply [production]
05:07 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2113.codfw.wmnet with reason: Maintenance [production]
05:07 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 12:00:00 on db2113.codfw.wmnet with reason: Maintenance [production]
05:01 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2113.codfw.wmnet with reason: Maintenance [production]
05:01 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2113.codfw.wmnet with reason: Maintenance [production]
04:59 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depool db2113 T327611', diff saved to https://phabricator.wikimedia.org/P43210 and previous config saved to /var/cache/conftool/dbconfig/20230123-045939-ladsgroup.json [production]
04:57 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Promote db2123 to s5 primary T327611', diff saved to https://phabricator.wikimedia.org/P43209 and previous config saved to /var/cache/conftool/dbconfig/20230123-045740-ladsgroup.json [production]
04:57 <Amir1> Starting s5 codfw failover from db2113 to db2123 - T327611 [production]
04:53 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2107.codfw.wmnet with reason: Maintenance [production]
04:53 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2107.codfw.wmnet with reason: Maintenance [production]
04:33 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Set db2123 with weight 0 T327611', diff saved to https://phabricator.wikimedia.org/P43208 and previous config saved to /var/cache/conftool/dbconfig/20230123-043324-ladsgroup.json [production]
04:33 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 25 hosts with reason: Primary switchover s5 T327611 [production]
04:32 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on 25 hosts with reason: Primary switchover s5 T327611 [production]
04:02 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2107.codfw.wmnet with reason: Maintenance [production]
04:02 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 12:00:00 on db2107.codfw.wmnet with reason: Maintenance [production]
03:56 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2107.codfw.wmnet with reason: Maintenance [production]
03:56 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2107.codfw.wmnet with reason: Maintenance [production]
03:54 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depool db2107 T327609', diff saved to https://phabricator.wikimedia.org/P43207 and previous config saved to /var/cache/conftool/dbconfig/20230123-035458-ladsgroup.json [production]
03:52 <Amir1> Starting s2 codfw failover from db2107 to db2104 - T327609 [production]
2023-01-22 §
21:40 <AntiComposite> start cvndb-CVNBot14-publish on app10 [cvn]
21:07 <AntiComposite> Deploy 1acdb8e to cvn-app10, starting bots (T306066) [cvn]
20:56 <AntiComposite> disable cvndb-CVNBot14-publish on app8 [cvn]
20:51 <AntiComposite> Deploy 1acdb8e to cvn-app8, stopping bots (T306066) [cvn]
19:53 <AntiComposite> Deploy 80ea1f5 to cvn-app10 (T306066) [cvn]
15:43 <AntiComposite> restart all CVNBots on app9 [cvn]
15:42 <AntiComposite> restart all CVNBots on app8 [cvn]
09:48 <wm-bot> <lucaswerkmeister> Double IRC messages to other bridges [tools.bridgebot]
03:42 <andrewbogott> reset eqiad1 rabbitmq in an attempt to resolve some mild instability [admin]
2023-01-21 §
20:48 <wm-bot> <urbanecm> Replace crontab with toolforge-jobs entry (T320134) [tools.watch-translations]
20:28 <wm-bot> <urbanecm> Starting python3.9 webservice [tools.watch-translations]
20:19 <wm-bot> <urbanecm> Stopping webservice for maintenance and upgrade to Python 3.9 [tools.watch-translations]
2023-01-20 §
23:24 <andrewbogott> truncating logfiles with find . -name '*.err' -size +1G -exec truncate --size=100M {} \; [tools]
21:24 <andrewbogott> truncating logfiles with find . -name '*.out' -size +1G -exec truncate --size=100M {} \; [tools]
18:22 <jynus> deploying new grants for backups on m1 T327155 [production]
16:15 <isaranto@deploy1002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revscoring-editquality-reverted' for release 'main' . [production]
16:15 <isaranto@deploy1002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revscoring-editquality-goodfaith' for release 'main' . [production]
16:15 <isaranto@deploy1002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revscoring-editquality-damaging' for release 'main' . [production]