2251-2300 of 10000 results (112ms)
2024-01-08 ยง
16:25 <vgutierrez@cumin1002> START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-upload_eqsin and not P{cp[5030,5032].eqsin.wmnet} and A:cp [production]
16:25 <ladsgroup@deploy2002> Finished scap: Backport for [[gerrit:988658|Undeploy Listings extension part III (T253216)]] (duration: 24m 06s) [production]
16:24 <taavi> lvs1018: sudo ipvsadm --delete-service --tcp-service 208.80.154.243:3311 (and all the way to :3318) - T346947 [production]
16:23 <taavi> lvs1018: sudo ipvsadm --delete-service --tcp-service 208.80.154.242:3311 (and all the way to :3318) - T346947 [production]
16:21 <taavi> lvs1020: sudo ipvsadm --delete-service --tcp-service 208.80.154.243:3311 (and all the way to :3318) - T346947 [production]
16:20 <taavi> lvs1020: sudo ipvsadm --delete-service --tcp-service 208.80.154.242:3311 (and all the way to :3318) - T346947 [production]
16:18 <pt1979@cumin2002> START - Cookbook sre.hosts.decommission for hosts ganeti2033.codfw.wmnet [production]
16:15 <taavi> restart pybal on lvs1018 - T346947 [production]
16:14 <ladsgroup@deploy2002> ladsgroup: Continuing with sync [production]
16:14 <ladsgroup@deploy2002> ladsgroup: Backport for [[gerrit:988658|Undeploy Listings extension part III (T253216)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
16:09 <taavi> restart pybal on lvs1020 - T346947 [production]
16:01 <ladsgroup@deploy2002> Started scap: Backport for [[gerrit:988658|Undeploy Listings extension part III (T253216)]] [production]
15:59 <sfaci@deploy2002> helmfile [eqiad] DONE helmfile.d/services/edit-analytics: apply [production]
15:59 <sfaci@deploy2002> helmfile [eqiad] START helmfile.d/services/edit-analytics: apply [production]
15:58 <ladsgroup@deploy2002> Finished scap: Backport for [[gerrit:988655|Undeploy listing extension part II (T253216)]] (duration: 08m 40s) [production]
15:57 <sfaci@deploy2002> helmfile [codfw] DONE helmfile.d/services/edit-analytics: apply [production]
15:57 <sfaci@deploy2002> helmfile [codfw] START helmfile.d/services/edit-analytics: apply [production]
15:52 <ladsgroup@deploy2002> ladsgroup: Continuing with sync [production]
15:51 <ladsgroup@deploy2002> ladsgroup: Backport for [[gerrit:988655|Undeploy listing extension part II (T253216)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
15:49 <ladsgroup@deploy2002> Started scap: Backport for [[gerrit:988655|Undeploy listing extension part II (T253216)]] [production]
15:48 <cgoubert@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on mw1377.eqiad.wmnet with reason: reboot debugging [production]
15:48 <cgoubert@cumin2002> START - Cookbook sre.hosts.downtime for 7 days, 0:00:00 on mw1377.eqiad.wmnet with reason: reboot debugging [production]
15:47 <ladsgroup@deploy2002> Finished scap: Backport for [[gerrit:988654|Undeploy Listings extension, part I (T253216)]] (duration: 08m 22s) [production]
15:46 <jelto@deploy2002> helmfile [staging] DONE helmfile.d/services/miscweb: apply [production]
15:46 <jelto@deploy2002> helmfile [staging] START helmfile.d/services/miscweb: apply [production]
15:45 <jelto@deploy2002> helmfile [staging] DONE helmfile.d/services/miscweb: apply [production]
15:41 <ladsgroup@deploy2002> ladsgroup: Continuing with sync [production]
15:40 <ladsgroup@deploy2002> ladsgroup: Backport for [[gerrit:988654|Undeploy Listings extension, part I (T253216)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
15:40 <jelto@deploy2002> helmfile [staging] START helmfile.d/services/miscweb: apply [production]
15:38 <ladsgroup@deploy2002> Started scap: Backport for [[gerrit:988654|Undeploy Listings extension, part I (T253216)]] [production]
15:35 <claime> Draining and cordoning kubestage2002.codfw.wmnet - T352883 [production]
15:32 <krinkle@deploy2002> Finished scap: Backport for [[gerrit:987999|Fix parsing logic when comments or hidden characters are present (T354385)]] (duration: 07m 52s) [production]
15:26 <krinkle@deploy2002> krinkle: Continuing with sync [production]
15:26 <krinkle@deploy2002> krinkle: Backport for [[gerrit:987999|Fix parsing logic when comments or hidden characters are present (T354385)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
15:24 <krinkle@deploy2002> Started scap: Backport for [[gerrit:987999|Fix parsing logic when comments or hidden characters are present (T354385)]] [production]
14:46 <urbanecm@deploy2002> Finished scap: Backport for [[gerrit:987159|Add agent.app_install_id to android.product_metrics.* streams (T353680)]], [[gerrit:982467|Remove partial migration of EditAttemptStep instrument (T351335)]], [[gerrit:982903|Add new stream names to the config variable (T353297)]], [[gerrit:988504|agent.app_ -> agent_app_ in android.product_metrics.* streams (T353680)]] (duration: 10m 22s) [production]
14:40 <urbanecm@deploy2002> urbanecm and phuedx and ksarabia and sfaci: Continuing with sync [production]
14:37 <urbanecm@deploy2002> urbanecm and phuedx and ksarabia and sfaci: Backport for [[gerrit:987159|Add agent.app_install_id to android.product_metrics.* streams (T353680)]], [[gerrit:982467|Remove partial migration of EditAttemptStep instrument (T351335)]], [[gerrit:982903|Add new stream names to the config variable (T353297)]], [[gerrit:988504|agent.app_ -> agent_app_ in android.product_metrics.* streams (T353680)]] synce [production]
14:35 <urbanecm@deploy2002> Started scap: Backport for [[gerrit:987159|Add agent.app_install_id to android.product_metrics.* streams (T353680)]], [[gerrit:982467|Remove partial migration of EditAttemptStep instrument (T351335)]], [[gerrit:982903|Add new stream names to the config variable (T353297)]], [[gerrit:988504|agent.app_ -> agent_app_ in android.product_metrics.* streams (T353680)]] [production]
14:34 <urbanecm@deploy2002> Sync cancelled. [production]
14:27 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 10 days, 0:00:00 on debmonitor2003.codfw.wmnet with reason: WIP [production]
14:27 <jmm@cumin2002> START - Cookbook sre.hosts.downtime for 10 days, 0:00:00 on debmonitor2003.codfw.wmnet with reason: WIP [production]
14:17 <marostegui@cumin1001> dbctl commit (dc=all): 'db1224 (re)pooling @ 100%: Upgrade to 10.6.16 and bookworm', diff saved to https://phabricator.wikimedia.org/P54548 and previous config saved to /var/cache/conftool/dbconfig/20240108-141717-root.json [production]
14:14 <urbanecm@deploy2002> urbanecm and phuedx and ksarabia and sfaci: Backport for [[gerrit:987159|Add agent.app_install_id to android.product_metrics.* streams (T353680)]], [[gerrit:982467|Remove partial migration of EditAttemptStep instrument (T351335)]], [[gerrit:982903|Add new stream names to the config variable (T353297)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
14:12 <urbanecm@deploy2002> Started scap: Backport for [[gerrit:987159|Add agent.app_install_id to android.product_metrics.* streams (T353680)]], [[gerrit:982467|Remove partial migration of EditAttemptStep instrument (T351335)]], [[gerrit:982903|Add new stream names to the config variable (T353297)]] [production]
14:12 <urbanecm@deploy2002> Finished scap: Backport for [[gerrit:988449|enable page_rerender for 3rd batch of wikis (T351503)]] (duration: 09m 35s) [production]
14:06 <urbanecm@deploy2002> pfischer and urbanecm: Continuing with sync [production]
14:04 <jelto@deploy2002> helmfile [staging] DONE helmfile.d/services/miscweb: apply [production]
14:04 <jelto@deploy2002> helmfile [staging] START helmfile.d/services/miscweb: apply [production]
14:04 <urbanecm@deploy2002> pfischer and urbanecm: Backport for [[gerrit:988449|enable page_rerender for 3rd batch of wikis (T351503)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]