2026-02-17
15:16 <sfaci@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/test-kitchen: apply [production]
15:14 <jayme@cumin1003> START - Cookbook sre.loadbalancer.restart-pybal rolling-restart of pybal on (A:lvs-low-traffic-codfw or A:lvs-secondary-codfw) and A:bullseye and A:lvs [production]
15:13 <ladsgroup@deploy2002> Finished scap sync-world: Backport for [[gerrit:1239329|sqwiki: remove editor usergroup (T415196)]] (duration: 07m 45s) [production]
15:11 <jayme@cumin1003> START - Cookbook sre.loadbalancer.migrate-service-ipip for alias: wikikube-staging-worker-codfw@codfw [production]
15:11 <jayme@cumin1003> END (PASS) - Cookbook sre.loadbalancer.migrate-service-ipip (exit_code=0) for alias: wikikube-staging-worker-eqiad@eqiad [production]
15:11 <jayme@cumin1003> END (PASS) - Cookbook sre.loadbalancer.restart-pybal (exit_code=0) rolling-restart of pybal on (A:lvs-low-traffic-eqiad or A:lvs-secondary-eqiad) and A:bullseye and A:lvs [production]
15:09 <jayme@cumin1003> START - Cookbook sre.loadbalancer.restart-pybal rolling-restart of pybal on (A:lvs-low-traffic-eqiad or A:lvs-secondary-eqiad) and A:bullseye and A:lvs [production]
15:09 <ladsgroup@deploy2002> ladsgroup, anzx: Continuing with sync [production]
15:08 <ladsgroup@deploy2002> ladsgroup, anzx: Backport for [[gerrit:1239329|sqwiki: remove editor usergroup (T415196)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
15:07 <sukhe@cumin1003> END (PASS) - Cookbook sre.dns.admin (exit_code=0) DNS admin: pool site magru [reason: XioNoX: maint work done, T416442] [production]
15:07 <sukhe@cumin1003> START - Cookbook sre.dns.admin DNS admin: pool site magru [reason: XioNoX: maint work done, T416442] [production]
15:06 <sukhe@cumin1003> END (FAIL) - Cookbook sre.dns.admin (exit_code=99) DNS admin: pool site magru [reason: Xionix maint work done, T416442] [production]
15:06 <sukhe@cumin1003> START - Cookbook sre.dns.admin DNS admin: pool site magru [reason: Xionix maint work done, T416442] [production]
15:06 <jayme@cumin1003> START - Cookbook sre.loadbalancer.migrate-service-ipip for alias: wikikube-staging-worker-eqiad@eqiad [production]
15:05 <ladsgroup@deploy2002> Started scap sync-world: Backport for [[gerrit:1239329|sqwiki: remove editor usergroup (T415196)]] [production]
15:02 <phuedx@deploy2002> Finished scap sync-world: Backport for [[gerrit:1239672|Test Kitchen: Set event intake service name]] (duration: 11m 56s) [production]
15:01 <sukhe@cumin1003> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for 40 hosts [production]
15:01 <sukhe@cumin1003> START - Cookbook sre.hosts.remove-downtime for 40 hosts [production]
14:59 <sukhe@dns1004> END - running authdns-update [production]
14:58 <sukhe> running authdns-update after magru depool [production]
14:58 <sukhe@dns1004> START - running authdns-update [production]
14:58 <sukhe@puppetserver1001> conftool action : set/pooled=yes; selector: cluster=dnsbox,dc=magru [reason: magru maintenance done] [production]
14:58 <vgutierrez> upload golang-github-mmatczuk-anyflag-dev 0.0~git20240709.eb9e24c-1 to trixie-wikimedia (apt.wm.o) - T401832 [production]
14:57 <phuedx@deploy2002> phuedx: Continuing with sync [production]
14:55 <brouberol@deploy2002> helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'. [production]
14:54 <brouberol@deploy2002> helmfile [staging-eqiad] START helmfile.d/admin 'apply'. [production]
14:52 <phuedx@deploy2002> phuedx: Backport for [[gerrit:1239672|Test Kitchen: Set event intake service name]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
14:50 <phuedx@deploy2002> Started scap sync-world: Backport for [[gerrit:1239672|Test Kitchen: Set event intake service name]] [production]
14:47 <jclark@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ml-serve1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
14:44 <XioNoX> mr1-magru> request system reboot - T416442 [production]
14:36 <jclark@cumin1003> START - Cookbook sre.hosts.provision for host ml-serve1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
14:35 <jclark@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ml-serve1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
14:34 <ladsgroup@deploy2002> Finished scap sync-world: Backport for [[gerrit:1239935|lift IP cap for event at Tshwane University of Technology (T417578)]] (duration: 06m 45s) [production]
14:30 <ladsgroup@deploy2002> anzx, ladsgroup: Continuing with sync [production]
14:29 <ladsgroup@deploy2002> anzx, ladsgroup: Backport for [[gerrit:1239935|lift IP cap for event at Tshwane University of Technology (T417578)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
14:27 <ladsgroup@deploy2002> Started scap sync-world: Backport for [[gerrit:1239935|lift IP cap for event at Tshwane University of Technology (T417578)]] [production]
14:26 <brouberol@deploy2002> helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'. [production]
14:26 <jclark@cumin1003> START - Cookbook sre.hosts.provision for host ml-serve1014.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
14:23 <XioNoX> asw1-b4-magru> request system reboot - T416442 [production]
14:23 <brouberol@deploy2002> helmfile [staging-eqiad] START helmfile.d/admin 'apply'. [production]
14:16 <ladsgroup@deploy2002> Finished scap sync-world: Backport for [[gerrit:1239855|Add ParserOutputFlags::PREVENT_SELECTIVE_UPDATE (T348236)]] (duration: 08m 13s) [production]
14:15 <marostegui@cumin1003> dbctl commit (dc=all): 'Depooling db2172 (T415786)', diff saved to https://phabricator.wikimedia.org/P88846 and previous config saved to /var/cache/conftool/dbconfig/20260217-141510-marostegui.json [production]
14:15 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2172.codfw.wmnet with reason: Maintenance [production]
14:14 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2155 (T415786)', diff saved to https://phabricator.wikimedia.org/P88845 and previous config saved to /var/cache/conftool/dbconfig/20260217-141457-marostegui.json [production]
14:12 <ayounsi@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on asw1-b4-magru,asw1-b4-magru IPv6,asw1-b4-magru.mgmt with reason: router upgrade [production]
14:12 <ladsgroup@deploy2002> ladsgroup, cscott: Continuing with sync [production]
14:10 <ladsgroup@deploy2002> ladsgroup, cscott: Backport for [[gerrit:1239855|Add ParserOutputFlags::PREVENT_SELECTIVE_UPDATE (T348236)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
14:08 <ayounsi@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on asw1-b3-magru,asw1-b3-magru IPv6,asw1-b3-magru.mgmt with reason: router upgrade [production]
14:08 <ladsgroup@deploy2002> Started scap sync-world: Backport for [[gerrit:1239855|Add ParserOutputFlags::PREVENT_SELECTIVE_UPDATE (T348236)]] [production]
14:07 <vgutierrez> upload golang-github-florianl-go-tc_0.4.7 to trixie-wikimedia (apt.wm.o) - T401832 [production]