2025-04-15
11:58 <sukhe@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on durum2002.codfw.wmnet with reason: host reimage [production]
11:51 <raymond-ndibe@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component jobs-api [tools]
11:45 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1171.eqiad.wmnet with reason: Maintenance [production]
11:45 <sukhe> sudo cumin 'A:durum and not P{durum2002*}' 'run-puppet-agent --enable "rolling out CR 1132669"' [production]
11:45 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1170 (T391056)', diff saved to https://phabricator.wikimedia.org/P75014 and previous config saved to /var/cache/conftool/dbconfig/20250415-114501-fceratto.json [production]
11:42 <sukhe@cumin1002> START - Cookbook sre.hosts.reimage for host durum2002.codfw.wmnet with OS bookworm [production]
11:34 <raymond-ndibe@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component jobs-api [tools]
11:33 <raymond-ndibe@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component jobs-api [toolsbeta]
11:29 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1170', diff saved to https://phabricator.wikimedia.org/P75013 and previous config saved to /var/cache/conftool/dbconfig/20250415-112955-fceratto.json [production]
11:25 <jelto@deploy1003> helmfile [eqiad] DONE helmfile.d/services/wikidata-query-gui: apply [production]
11:25 <jelto@deploy1003> helmfile [eqiad] START helmfile.d/services/wikidata-query-gui: apply [production]
11:25 <jelto@deploy1003> helmfile [codfw] DONE helmfile.d/services/wikidata-query-gui: apply [production]
11:25 <jelto@deploy1003> helmfile [codfw] START helmfile.d/services/wikidata-query-gui: apply [production]
11:24 <jelto@deploy1003> helmfile [staging] DONE helmfile.d/services/wikidata-query-gui: apply [production]
11:24 <jelto@deploy1003> helmfile [staging] START helmfile.d/services/wikidata-query-gui: apply [production]
11:22 <raymond-ndibe@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component jobs-api [toolsbeta]
11:14 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1170', diff saved to https://phabricator.wikimedia.org/P75012 and previous config saved to /var/cache/conftool/dbconfig/20250415-111447-fceratto.json [production]
11:08 <vgutierrez@cumin1002> START - Cookbook sre.cdn.roll-upgrade-varnish rolling upgrade of Varnish on A:cp-upload_esams and not P{cp3081.esams.wmnet} and A:cp [production]
11:08 <vgutierrez@cumin1002> START - Cookbook sre.cdn.roll-upgrade-varnish rolling upgrade of Varnish on A:cp-text_esams and not P{cp3073.esams.wmnet} and A:cp [production]
11:07 <vgutierrez> rolling upgrade to varnish 7.1.1-1.1~bpo11+wmf3 in esams - T391334 [production]
11:07 <cgoubert@deploy1003> Started scap sync-world: test rebuild to look at logs [production]
11:07 <sukhe@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on durum2002.codfw.wmnet with reason: testing [production]
11:05 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.cdn.roll-upgrade-varnish (exit_code=0) rolling upgrade of Varnish on P{cp[5023-5024].eqsin.wmnet} and A:cp [production]
10:59 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1170 (T391056)', diff saved to https://phabricator.wikimedia.org/P75011 and previous config saved to /var/cache/conftool/dbconfig/20250415-105941-fceratto.json [production]
10:58 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.cdn.roll-upgrade-varnish (exit_code=0) rolling upgrade of Varnish on A:cp-upload_eqsin [production]
10:52 <vgutierrez@cumin1002> START - Cookbook sre.cdn.roll-upgrade-varnish rolling upgrade of Varnish on A:cp-upload_drmrs [production]
10:52 <vgutierrez@cumin1002> START - Cookbook sre.cdn.roll-upgrade-varnish rolling upgrade of Varnish on A:cp-text_drmrs [production]
10:52 <vgutierrez> rolling upgrade to varnish 7.1.1-1.1~bpo11+wmf3 in drmrs - T391334 [production]
10:42 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db1170 (T391056)', diff saved to https://phabricator.wikimedia.org/P75010 and previous config saved to /var/cache/conftool/dbconfig/20250415-104235-fceratto.json [production]
10:42 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1170.eqiad.wmnet with reason: Maintenance [production]
10:42 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1158 (T391056)', diff saved to https://phabricator.wikimedia.org/P75009 and previous config saved to /var/cache/conftool/dbconfig/20250415-104212-fceratto.json [production]
10:41 <sukhe> enable puppet on durum2002 [production]
10:40 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.cdn.roll-upgrade-varnish (exit_code=0) rolling upgrade of Varnish on A:cp-text_codfw [production]
10:39 <ladsgroup@deploy1003> sync-world aborted: Backport for [[gerrit:1136670|Bump thumbnail steps to 95% (T360589)]] (duration: 05m 08s) [production]
10:38 <sukhe> sudo cumin 'A:durum' 'disable-puppet "rolling out CR 1132669"' [production]
10:37 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.cdn.roll-upgrade-varnish (exit_code=0) rolling upgrade of Varnish on A:cp-upload_codfw [production]
10:34 <ladsgroup@deploy1003> Started scap sync-world: Backport for [[gerrit:1136670|Bump thumbnail steps to 95% (T360589)]] [production]
10:33 <ladsgroup@deploy1003> sync-world aborted: Backport for [[gerrit:1136670|Bump thumbnail steps to 95% (T360589)]] (duration: 14m 11s) [production]
10:27 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1158', diff saved to https://phabricator.wikimedia.org/P75008 and previous config saved to /var/cache/conftool/dbconfig/20250415-102705-fceratto.json [production]
10:26 <vgutierrez@cumin1002> START - Cookbook sre.cdn.roll-upgrade-varnish rolling upgrade of Varnish on P{cp[5023-5024].eqsin.wmnet} and A:cp [production]
10:24 <vgutierrez@cumin1002> END (FAIL) - Cookbook sre.cdn.roll-upgrade-varnish (exit_code=1) rolling upgrade of Varnish on A:cp-text_eqsin [production]
10:19 <ladsgroup@deploy1003> Started scap sync-world: Backport for [[gerrit:1136670|Bump thumbnail steps to 95% (T360589)]] [production]
10:11 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1158', diff saved to https://phabricator.wikimedia.org/P75007 and previous config saved to /var/cache/conftool/dbconfig/20250415-101158-fceratto.json [production]
10:00 <dcausse@deploy1003> Finished deploy [wdqs/wdqs@fe88851] (wcqs): version 0.3.156 (duration: 02m 25s) [production]
09:58 <dcausse@deploy1003> Started deploy [wdqs/wdqs@fe88851] (wcqs): version 0.3.156 [production]
09:57 <dcausse@deploy1003> Finished deploy [wdqs/wdqs@fe88851]: version 0.3.156 (T326311) (duration: 14m 31s) [production]
09:56 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1158 (T391056)', diff saved to https://phabricator.wikimedia.org/P75006 and previous config saved to /var/cache/conftool/dbconfig/20250415-095650-fceratto.json [production]
09:54 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db1158 (T391056)', diff saved to https://phabricator.wikimedia.org/P75005 and previous config saved to /var/cache/conftool/dbconfig/20250415-095442-fceratto.json [production]
09:54 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1014,1018].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance [production]
09:54 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1158.eqiad.wmnet with reason: Maintenance [production]