1301-1350 of 10000 results (70ms)
2024-03-01 §
13:59 <cgoubert@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mw1393.eqiad.wmnet with OS bullseye [production]
13:57 <elukey@deploy2002> helmfile [ml-staging-codfw] DONE helmfile.d/admin 'sync'. [production]
13:57 <elukey@deploy2002> helmfile [ml-staging-codfw] START helmfile.d/admin 'sync'. [production]
13:56 <cgoubert@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mw1397.eqiad.wmnet with OS bullseye [production]
13:53 <cgoubert@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mw1389.eqiad.wmnet with OS bullseye [production]
13:51 <cgoubert@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mw1391.eqiad.wmnet with OS bullseye [production]
13:48 <cgoubert@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mw1395.eqiad.wmnet with OS bullseye [production]
13:46 <cgoubert@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mw1387.eqiad.wmnet with OS bullseye [production]
13:41 <cgoubert@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1393.eqiad.wmnet with reason: host reimage [production]
13:40 <elukey@deploy2002> helmfile [ml-staging-codfw] DONE helmfile.d/admin 'sync'. [production]
13:40 <elukey@deploy2002> helmfile [ml-staging-codfw] START helmfile.d/admin 'sync'. [production]
13:38 <cgoubert@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1397.eqiad.wmnet with reason: host reimage [production]
13:35 <cgoubert@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1389.eqiad.wmnet with reason: host reimage [production]
13:33 <cgoubert@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1391.eqiad.wmnet with reason: host reimage [production]
13:30 <cgoubert@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1395.eqiad.wmnet with reason: host reimage [production]
13:28 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1169.eqiad.wmnet with reason: Maintenance [production]
13:28 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 12:00:00 on db1169.eqiad.wmnet with reason: Maintenance [production]
13:28 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1163 (T354015)', diff saved to https://phabricator.wikimedia.org/P58285 and previous config saved to /var/cache/conftool/dbconfig/20240301-132824-marostegui.json [production]
13:28 <cgoubert@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1387.eqiad.wmnet with reason: host reimage [production]
13:26 <cgoubert@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on mw1397.eqiad.wmnet with reason: host reimage [production]
13:26 <cgoubert@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on mw1395.eqiad.wmnet with reason: host reimage [production]
13:26 <cgoubert@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on mw1393.eqiad.wmnet with reason: host reimage [production]
13:26 <cgoubert@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on mw1391.eqiad.wmnet with reason: host reimage [production]
13:25 <cgoubert@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on mw1389.eqiad.wmnet with reason: host reimage [production]
13:25 <cgoubert@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on mw1387.eqiad.wmnet with reason: host reimage [production]
13:13 <cgoubert@cumin2002> START - Cookbook sre.hosts.reimage for host mw1397.eqiad.wmnet with OS bullseye [production]
13:13 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1163', diff saved to https://phabricator.wikimedia.org/P58284 and previous config saved to /var/cache/conftool/dbconfig/20240301-131318-marostegui.json [production]
13:13 <cgoubert@cumin2002> START - Cookbook sre.hosts.reimage for host mw1395.eqiad.wmnet with OS bullseye [production]
13:12 <cgoubert@cumin2002> START - Cookbook sre.hosts.reimage for host mw1393.eqiad.wmnet with OS bullseye [production]
13:12 <cgoubert@cumin2002> START - Cookbook sre.hosts.reimage for host mw1391.eqiad.wmnet with OS bullseye [production]
13:12 <cgoubert@cumin2002> START - Cookbook sre.hosts.reimage for host mw1389.eqiad.wmnet with OS bullseye [production]
13:11 <cgoubert@cumin2002> START - Cookbook sre.hosts.reimage for host mw1387.eqiad.wmnet with OS bullseye [production]
13:03 <jynus> refreshing image metadata of commons Алтарна_частина.jpg [production]
13:02 <claime> Depooling mw1387.eqiad.wmnet,mw1389.eqiad.wmnet,mw1391.eqiad.wmnet,mw1393.eqiad.wmnet,mw1395.eqiad.wmnet,mw1397.eqiad.wmnet for reimage to k8s nodes - T351074 [production]
12:58 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1163', diff saved to https://phabricator.wikimedia.org/P58283 and previous config saved to /var/cache/conftool/dbconfig/20240301-125812-marostegui.json [production]
12:43 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1163 (T354015)', diff saved to https://phabricator.wikimedia.org/P58282 and previous config saved to /var/cache/conftool/dbconfig/20240301-124306-marostegui.json [production]
11:58 <cgoubert@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-api-int: apply [production]
11:58 <cgoubert@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-api-int: apply [production]
11:56 <cgoubert@deploy2002> helmfile [codfw] DONE helmfile.d/services/mw-api-int: apply [production]
11:55 <cgoubert@deploy2002> helmfile [codfw] START helmfile.d/services/mw-api-int: apply [production]
11:54 <btullis@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-worker1173.eqiad.wmnet [production]
11:48 <cgoubert@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-api-int: apply [production]
11:48 <cgoubert@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-api-int: apply [production]
11:47 <btullis@cumin1002> START - Cookbook sre.hosts.reboot-single for host an-worker1173.eqiad.wmnet [production]
11:33 <mfossati@deploy2002> Finished deploy [airflow-dags/platform_eng@241457d]: (no justification provided) (duration: 00m 28s) [production]
11:32 <mfossati@deploy2002> Started deploy [airflow-dags/platform_eng@241457d]: (no justification provided) [production]
11:16 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling db2194 (T352010)', diff saved to https://phabricator.wikimedia.org/P58281 and previous config saved to /var/cache/conftool/dbconfig/20240301-111610-ladsgroup.json [production]
11:16 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2194.codfw.wmnet with reason: Maintenance [production]
11:15 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2194.codfw.wmnet with reason: Maintenance [production]
10:34 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2117.codfw.wmnet with reason: Silence for maintenance [production]