2023-11-14
18:50 <eevans@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on aqs1011.eqiad.wmnet with reason: host reimage [production]
18:46 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2122 (T348183)', diff saved to https://phabricator.wikimedia.org/P53453 and previous config saved to /var/cache/conftool/dbconfig/20231114-184637-arnaudb.json [production]
18:42 <arnaudb@cumin1001> dbctl commit (dc=all): 'Depooling db2122 (T348183)', diff saved to https://phabricator.wikimedia.org/P53452 and previous config saved to /var/cache/conftool/dbconfig/20231114-184204-arnaudb.json [production]
18:41 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2122.codfw.wmnet with reason: Maintenance [production]
18:41 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2122.codfw.wmnet with reason: Maintenance [production]
18:41 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2121 (T348183)', diff saved to https://phabricator.wikimedia.org/P53451 and previous config saved to /var/cache/conftool/dbconfig/20231114-184142-arnaudb.json [production]
18:36 <eevans@cumin1001> START - Cookbook sre.hosts.reimage for host aqs1011.eqiad.wmnet with OS bullseye [production]
18:33 <andrew@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudvirt1046.eqiad.wmnet with OS bookworm [production]
18:32 <andrew@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1048.eqiad.wmnet with reason: host reimage [production]
18:27 <andrew@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt1048.eqiad.wmnet with reason: host reimage [production]
18:26 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2121', diff saved to https://phabricator.wikimedia.org/P53450 and previous config saved to /var/cache/conftool/dbconfig/20231114-182636-arnaudb.json [production]
18:22 <andrew@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1047.eqiad.wmnet with reason: host reimage [production]
18:19 <andrew@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt1047.eqiad.wmnet with reason: host reimage [production]
18:11 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2121', diff saved to https://phabricator.wikimedia.org/P53449 and previous config saved to /var/cache/conftool/dbconfig/20231114-181130-arnaudb.json [production]
18:11 <andrew@cumin1001> START - Cookbook sre.hosts.reimage for host cloudvirt1048.eqiad.wmnet with OS bookworm [production]
18:04 <andrew@cumin1001> START - Cookbook sre.hosts.reimage for host cloudvirt1047.eqiad.wmnet with OS bookworm [production]
17:56 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2121 (T348183)', diff saved to https://phabricator.wikimedia.org/P53448 and previous config saved to /var/cache/conftool/dbconfig/20231114-175623-arnaudb.json [production]
17:55 <hnowlan@deploy2002> helmfile [staging] DONE helmfile.d/services/api-gateway: apply [production]
17:54 <jbond@cumin1001> END (FAIL) - Cookbook sre.puppet.migrate-role (exit_code=99) for role: wmcs::openstack::codfw1dev::control [production]
17:52 <arnaudb@cumin1001> dbctl commit (dc=all): 'Depooling db2121 (T348183)', diff saved to https://phabricator.wikimedia.org/P53447 and previous config saved to /var/cache/conftool/dbconfig/20231114-175202-arnaudb.json [production]
17:51 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2121.codfw.wmnet with reason: Maintenance [production]
17:51 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2121.codfw.wmnet with reason: Maintenance [production]
17:51 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2120 (T348183)', diff saved to https://phabricator.wikimedia.org/P53446 and previous config saved to /var/cache/conftool/dbconfig/20231114-175140-arnaudb.json [production]
17:45 <hnowlan@deploy2002> helmfile [staging] START helmfile.d/services/api-gateway: apply [production]
17:43 <jbond@cumin1001> START - Cookbook sre.puppet.migrate-role for role: wmcs::openstack::codfw1dev::control [production]
17:36 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2120', diff saved to https://phabricator.wikimedia.org/P53445 and previous config saved to /var/cache/conftool/dbconfig/20231114-173634-arnaudb.json [production]
17:21 <andrew@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudvirt1043.eqiad.wmnet with OS bookworm [production]
17:21 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2120', diff saved to https://phabricator.wikimedia.org/P53444 and previous config saved to /var/cache/conftool/dbconfig/20231114-172127-arnaudb.json [production]
17:12 <andrew@cumin1001> START - Cookbook sre.hosts.reimage for host cloudvirt1046.eqiad.wmnet with OS bookworm [production]
17:12 <andrew@cumin1001> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=93) for host cloudvirt1046.eqiad.wmnet with OS bookworm [production]
17:06 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2120 (T348183)', diff saved to https://phabricator.wikimedia.org/P53442 and previous config saved to /var/cache/conftool/dbconfig/20231114-170621-arnaudb.json [production]
17:03 <elukey@deploy2002> helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: sync [production]
17:02 <elukey@deploy2002> helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: sync [production]
17:02 <arnaudb@cumin1001> dbctl commit (dc=all): 'Depooling db2120 (T348183)', diff saved to https://phabricator.wikimedia.org/P53441 and previous config saved to /var/cache/conftool/dbconfig/20231114-170158-arnaudb.json [production]
17:02 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2120.codfw.wmnet with reason: Maintenance [production]
17:01 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2120.codfw.wmnet with reason: Maintenance [production]
17:01 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2108 (T348183)', diff saved to https://phabricator.wikimedia.org/P53440 and previous config saved to /var/cache/conftool/dbconfig/20231114-170136-arnaudb.json [production]
16:50 <ebernhardson@deploy2002> Finished deploy [airflow-dags/search@0ae1184]: make cirrus index imports world readable in hdfs (duration: 00m 28s) [production]
16:50 <ebernhardson@deploy2002> Started deploy [airflow-dags/search@0ae1184]: make cirrus index imports world readable in hdfs [production]
16:47 <elukey@deploy2002> helmfile [codfw] DONE helmfile.d/services/changeprop: sync [production]
16:47 <elukey@deploy2002> helmfile [codfw] START helmfile.d/services/changeprop: sync [production]
16:46 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2108', diff saved to https://phabricator.wikimedia.org/P53438 and previous config saved to /var/cache/conftool/dbconfig/20231114-164630-arnaudb.json [production]
16:44 <andrew@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1044.eqiad.wmnet with OS bookworm [production]
16:37 <andrew@cumin1001> START - Cookbook sre.hosts.reimage for host cloudvirt1046.eqiad.wmnet with OS bookworm [production]
16:35 <ebernhardson@deploy2002> Finished deploy [airflow-dags/search@017fbf1]: search: clean wcqs revision map (duration: 00m 29s) [production]
16:34 <ebernhardson@deploy2002> Started deploy [airflow-dags/search@017fbf1]: search: clean wcqs revision map [production]
16:31 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2108', diff saved to https://phabricator.wikimedia.org/P53437 and previous config saved to /var/cache/conftool/dbconfig/20231114-163123-arnaudb.json [production]
16:30 <aokoth@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host vrts1002.eqiad.wmnet [production]
16:26 <aokoth@cumin1001> START - Cookbook sre.hosts.reboot-single for host vrts1002.eqiad.wmnet [production]
16:17 <andrew@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1044.eqiad.wmnet with reason: host reimage [production]