2301-2350 of 10000 results (87ms)
2023-03-17 §
14:54 <bking@cumin1001> END (FAIL) - Cookbook sre.wdqs.restart (exit_code=99) [production]
14:54 <bking@cumin1001> START - Cookbook sre.wdqs.restart [production]
14:35 <bking@cumin1001> END (PASS) - Cookbook sre.wdqs.restart (exit_code=0) [production]
14:13 <bking@cumin1001> START - Cookbook sre.wdqs.restart [production]
14:05 <bking@cumin1001> END (PASS) - Cookbook sre.wdqs.restart (exit_code=0) [production]
13:59 <cmjohnson@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host ms-fe1013.eqiad.wmnet with OS bullseye [production]
13:59 <cmjohnson@cumin1001> START - Cookbook sre.hosts.reimage for host ms-fe1013.eqiad.wmnet with OS bullseye [production]
13:57 <bking@cumin1001> START - Cookbook sre.wdqs.restart [production]
13:57 <bking@cumin1001> END (FAIL) - Cookbook sre.wdqs.restart (exit_code=99) [production]
13:57 <bking@cumin1001> START - Cookbook sre.wdqs.restart [production]
13:55 <bking@cumin1001> END (PASS) - Cookbook sre.wdqs.restart (exit_code=0) [production]
13:51 <bking@cumin1001> START - Cookbook sre.wdqs.restart [production]
13:51 <bking@cumin1001> END (FAIL) - Cookbook sre.wdqs.restart (exit_code=99) [production]
13:51 <bking@cumin1001> START - Cookbook sre.wdqs.restart [production]
13:51 <bking@cumin1001> END (FAIL) - Cookbook sre.wdqs.restart (exit_code=99) [production]
13:51 <bking@cumin1001> START - Cookbook sre.wdqs.restart [production]
13:21 <cgoubert@cumin1001> conftool action : set/pooled=inactive; selector: name=parse2004.codfw.wmnet [production]
13:21 <claime> Depooling parse2004.codfw.wmnet for broken PSU - T332119 [production]
12:06 <mutante> systemct-reset failed on gitlab-runner* [production]
11:16 <akosiaris@deploy1002> helmfile [staging-eqiad] DONE helmfile.d/admin 'sync'. [production]
11:16 <akosiaris@deploy1002> helmfile [staging-eqiad] START helmfile.d/admin 'sync'. [production]
11:03 <akosiaris@deploy1002> helmfile [staging-codfw] DONE helmfile.d/admin 'sync'. [production]
11:02 <akosiaris@deploy1002> helmfile [staging-codfw] START helmfile.d/admin 'sync'. [production]
09:45 <gmodena@deploy1002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mediawiki-page-content-change-enrichment: apply [production]
09:45 <gmodena@deploy1002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mediawiki-page-content-change-enrichment: apply [production]
09:38 <gmodena@deploy1002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mediawiki-page-content-change-enrichment: apply [production]
09:38 <gmodena@deploy1002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mediawiki-page-content-change-enrichment: apply [production]
07:57 <gmodena@deploy1002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mediawiki-page-content-change-enrichment: apply [production]
07:57 <gmodena@deploy1002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mediawiki-page-content-change-enrichment: apply [production]
07:28 <gmodena@deploy1002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mediawiki-page-content-change-enrichment: apply [production]
07:28 <gmodena@deploy1002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mediawiki-page-content-change-enrichment: apply [production]
05:56 <marostegui@cumin1001> dbctl commit (dc=all): 'Add db1106 to dbctl', diff saved to https://phabricator.wikimedia.org/P45887 and previous config saved to /var/cache/conftool/dbconfig/20230317-055643-marostegui.json [production]
02:10 <ejegg> civicrm upgraded from 672950d9 to 5dd37c9c [production]
01:05 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for lvs2010.codfw.wmnet [production]
01:05 <sukhe@cumin2002> START - Cookbook sre.hosts.remove-downtime for lvs2010.codfw.wmnet [production]
00:35 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on lvs1020.eqiad.wmnet with reason: rebooting for kernel updates [production]
00:35 <sukhe@cumin2002> START - Cookbook sre.hosts.downtime for 0:10:00 on lvs1020.eqiad.wmnet with reason: rebooting for kernel updates [production]
00:26 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on lvs2010.codfw.wmnet with reason: rebooting for kernel updates [production]
00:26 <sukhe@cumin2002> START - Cookbook sre.hosts.downtime for 0:10:00 on lvs2010.codfw.wmnet with reason: rebooting for kernel updates [production]
00:13 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on lvs5006.eqsin.wmnet with reason: rebooting for kernel updates [production]
00:13 <sukhe@cumin2002> START - Cookbook sre.hosts.downtime for 0:10:00 on lvs5006.eqsin.wmnet with reason: rebooting for kernel updates [production]
2023-03-16 §
23:41 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on lvs6003.drmrs.wmnet with reason: rebooting for kernel updates [production]
23:40 <sukhe@cumin2002> START - Cookbook sre.hosts.downtime for 0:10:00 on lvs6003.drmrs.wmnet with reason: rebooting for kernel updates [production]
23:33 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:25:00 on lvs3007.esams.wmnet with reason: rebooting for kernel updates [production]
23:33 <sukhe@cumin2002> START - Cookbook sre.hosts.downtime for 0:25:00 on lvs3007.esams.wmnet with reason: rebooting for kernel updates [production]
23:31 <dzahn@cumin2002> END (PASS) - Cookbook sre.ganeti.reimage (exit_code=0) for host miscweb2003.codfw.wmnet with OS bullseye [production]
23:28 <dzahn@cumin1001> END (PASS) - Cookbook sre.ganeti.reimage (exit_code=0) for host miscweb1003.eqiad.wmnet with OS bullseye [production]
23:20 <ebernhardson@deploy2002> Finished deploy [airflow-dags/search@e6f0142]: bump discolytics env to 0.7.0 (duration: 00m 19s) [production]
23:20 <ebernhardson@deploy2002> Started deploy [airflow-dags/search@e6f0142]: bump discolytics env to 0.7.0 [production]
23:18 <dzahn@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on miscweb2003.codfw.wmnet with reason: host reimage [production]