2024-11-27
19:18 <brett@cumin2002> START - Cookbook sre.dns.admin DNS admin: pool site magru [reason: repool magru, T376737] [production]
19:17 <mforns@deploy2002> Finished deploy [airflow-dags/analytics@99032bf]: regular weekly train (duration: 03m 10s) [production]
19:14 <mforns@deploy2002> Started deploy [airflow-dags/analytics@99032bf]: regular weekly train [production]
19:13 <mutante> disabled puppet on R:scap::target (180 hosts) for a short time - deploying gerrit:1092841 [production]
19:09 <brett@puppetserver1001> conftool action : set/pooled=yes; selector: dc=magru,service=cdn [production]
19:04 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1209', diff saved to https://phabricator.wikimedia.org/P71235 and previous config saved to /var/cache/conftool/dbconfig/20241127-190453-ladsgroup.json [production]
19:02 <bking@cumin2002> START - Cookbook sre.hosts.reimage for host wdqs1025.eqiad.wmnet with OS bullseye [production]
18:56 <bking@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host wdqs1025.eqiad.wmnet with OS bullseye [production]
18:49 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1209 (T370903)', diff saved to https://phabricator.wikimedia.org/P71233 and previous config saved to /var/cache/conftool/dbconfig/20241127-184946-ladsgroup.json [production]
18:47 <fabfur@cumin1002> conftool action : set/pooled=yes; selector: cluster=dnsbox,dc=magru [production]
18:38 <fabfur@cumin1002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for 16 hosts [production]
18:38 <fabfur@cumin1002> START - Cookbook sre.hosts.remove-downtime for 16 hosts [production]
18:38 <fabfur@cumin1002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for lvs7003.magru.wmnet [production]
18:38 <fabfur@cumin1002> START - Cookbook sre.hosts.remove-downtime for lvs7003.magru.wmnet [production]
18:38 <fabfur@cumin1002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for lvs7002.magru.wmnet [production]
18:38 <fabfur@cumin1002> START - Cookbook sre.hosts.remove-downtime for lvs7002.magru.wmnet [production]
18:38 <fabfur@cumin1002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for lvs7001.magru.wmnet [production]
18:38 <fabfur@cumin1002> START - Cookbook sre.hosts.remove-downtime for lvs7001.magru.wmnet [production]
18:38 <fabfur@cumin1002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for dns7002.wikimedia.org [production]
18:38 <fabfur@cumin1002> START - Cookbook sre.hosts.remove-downtime for dns7002.wikimedia.org [production]
18:37 <fabfur@cumin1002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for dns7001.wikimedia.org [production]
18:37 <fabfur@cumin1002> START - Cookbook sre.hosts.remove-downtime for dns7001.wikimedia.org [production]
18:37 <fabfur@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on dns7001.wikimedia.org with reason: T380307 [production]
18:37 <fabfur@cumin1002> START - Cookbook sre.hosts.downtime for 4:00:00 on dns7001.wikimedia.org with reason: T380307 [production]
18:36 <bking@cumin2002> START - Cookbook sre.hosts.reimage for host wdqs1025.eqiad.wmnet with OS bullseye [production]
18:34 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling db1209 (T370903)', diff saved to https://phabricator.wikimedia.org/P71232 and previous config saved to /var/cache/conftool/dbconfig/20241127-183455-ladsgroup.json [production]
18:34 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1209.eqiad.wmnet with reason: Maintenance [production]
18:34 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 8:00:00 on db1209.eqiad.wmnet with reason: Maintenance [production]
18:34 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1203 (T370903)', diff saved to https://phabricator.wikimedia.org/P71231 and previous config saved to /var/cache/conftool/dbconfig/20241127-183432-ladsgroup.json [production]
18:19 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1203', diff saved to https://phabricator.wikimedia.org/P71230 and previous config saved to /var/cache/conftool/dbconfig/20241127-181925-ladsgroup.json [production]
18:05 <jclark@cumin1002> START - Cookbook sre.hosts.reimage for host wdqs1027.eqiad.wmnet with OS bullseye [production]
18:04 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1203', diff saved to https://phabricator.wikimedia.org/P71229 and previous config saved to /var/cache/conftool/dbconfig/20241127-180418-ladsgroup.json [production]
17:49 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1203 (T370903)', diff saved to https://phabricator.wikimedia.org/P71228 and previous config saved to /var/cache/conftool/dbconfig/20241127-174911-ladsgroup.json [production]
17:34 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling db1203 (T370903)', diff saved to https://phabricator.wikimedia.org/P71227 and previous config saved to /var/cache/conftool/dbconfig/20241127-173426-ladsgroup.json [production]
17:34 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1203.eqiad.wmnet with reason: Maintenance [production]
17:34 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 8:00:00 on db1203.eqiad.wmnet with reason: Maintenance [production]
17:34 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1192 (T370903)', diff saved to https://phabricator.wikimedia.org/P71226 and previous config saved to /var/cache/conftool/dbconfig/20241127-173403-ladsgroup.json [production]
17:33 <jiji@deploy2002> helmfile [eqiad] DONE helmfile.d/services/eventgate-main: apply [production]
17:33 <jiji@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-page-content-change-enrich: apply [production]
17:33 <jiji@deploy2002> helmfile [eqiad] DONE helmfile.d/services/eventstreams: apply [production]
17:32 <jiji@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-page-content-change-enrich: apply [production]
17:32 <jiji@deploy2002> helmfile [eqiad] START helmfile.d/services/eventgate-main: apply [production]
17:32 <jiji@deploy2002> helmfile [staging] DONE helmfile.d/services/eventgate-main: apply [production]
17:32 <jiji@deploy2002> helmfile [eqiad] START helmfile.d/services/eventstreams: apply [production]
17:31 <jiji@deploy2002> helmfile [staging] DONE helmfile.d/services/eventstreams: apply [production]
17:31 <jiji@deploy2002> helmfile [staging] START helmfile.d/services/eventgate-main: apply [production]
17:31 <jiji@deploy2002> helmfile [staging] START helmfile.d/services/eventstreams: apply [production]
17:28 <jiji@deploy2002> helmfile [eqiad] DONE helmfile.d/services/changeprop: apply [production]
17:27 <jiji@deploy2002> helmfile [eqiad] START helmfile.d/services/changeprop: apply [production]
17:27 <jiji@deploy2002> helmfile [staging] DONE helmfile.d/services/changeprop: apply [production]