1551-1600 of 10000 results (50ms)
2022-02-28 ยง
14:48 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kubernetes2019.codfw.wmnet with OS bullseye [production]
14:44 <cmooney@cumin1001> START - Cookbook sre.dns.netbox [production]
14:43 <klausman@cumin2001> END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) for new host ml-etcd-staging2001.codfw.wmnet [production]
14:37 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubernetes2019.codfw.wmnet with reason: host reimage [production]
14:35 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on kubernetes2019.codfw.wmnet with reason: host reimage [production]
14:33 <klausman@cumin2001> START - Cookbook sre.ganeti.makevm for new host ml-etcd-staging2001.codfw.wmnet [production]
14:20 <elukey@cumin1001> START - Cookbook sre.hosts.reimage for host kubernetes2019.codfw.wmnet with OS bullseye [production]
14:18 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kubernetes2018.codfw.wmnet with OS bullseye [production]
14:09 <kharlan@deploy1002> helmfile [staging] DONE helmfile.d/services/linkrecommendation: apply [production]
14:09 <kharlan@deploy1002> helmfile [staging] START helmfile.d/services/linkrecommendation: apply [production]
14:07 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubernetes2018.codfw.wmnet with reason: host reimage [production]
14:05 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on kubernetes2018.codfw.wmnet with reason: host reimage [production]
14:03 <jelto> update gitlab-ce to 14.7.4 on all GitLab hosts [production]
14:00 <ebysans@deploy1002> Finished deploy [airflow-dags/analytics@75e8eb7]: (no justification provided) (duration: 00m 14s) [production]
14:00 <kharlan@deploy1002> helmfile [staging] START helmfile.d/services/linkrecommendation: apply [production]
14:00 <ebysans@deploy1002> Started deploy [airflow-dags/analytics@75e8eb7]: (no justification provided) [production]
13:51 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1111 (T302185)', diff saved to https://phabricator.wikimedia.org/P21600 and previous config saved to /var/cache/conftool/dbconfig/20220228-135158-ladsgroup.json [production]
13:50 <elukey@cumin1001> START - Cookbook sre.hosts.reimage for host kubernetes2018.codfw.wmnet with OS bullseye [production]
13:36 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1111', diff saved to https://phabricator.wikimedia.org/P21599 and previous config saved to /var/cache/conftool/dbconfig/20220228-133653-ladsgroup.json [production]
13:21 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1111', diff saved to https://phabricator.wikimedia.org/P21598 and previous config saved to /var/cache/conftool/dbconfig/20220228-132148-ladsgroup.json [production]
13:14 <moritzm> restarting apache on puppet masters to pick up expat security update [production]
13:06 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1111 (T302185)', diff saved to https://phabricator.wikimedia.org/P21597 and previous config saved to /var/cache/conftool/dbconfig/20220228-130644-ladsgroup.json [production]
13:01 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db1111.eqiad.wmnet with OS bullseye [production]
12:46 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1111.eqiad.wmnet with reason: host reimage [production]
12:44 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Your commit message', diff saved to https://phabricator.wikimedia.org/P21596 and previous config saved to /var/cache/conftool/dbconfig/20220228-124454-ladsgroup.json [production]
12:44 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on db1111.eqiad.wmnet with reason: host reimage [production]
12:35 <ladsgroup@cumin1001> START - Cookbook sre.hosts.reimage for host db1111.eqiad.wmnet with OS bullseye [production]
12:30 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1111 (T302185)', diff saved to https://phabricator.wikimedia.org/P21594 and previous config saved to /var/cache/conftool/dbconfig/20220228-123008-ladsgroup.json [production]
12:30 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1111.eqiad.wmnet with reason: Maintenance [production]
12:30 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1111.eqiad.wmnet with reason: Maintenance [production]
12:25 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5011.eqsin.wmnet with OS buster [production]
12:24 <vgutierrez> pool cp5011 running HAProxy as TLS termination layer - T290005 T271421 [production]
12:22 <vgutierrez> vgutierrez@apt1001:~$ sudo -i reprepro --component thirdparty/haproxy24 update buster-wikimedia - T290005 [production]
12:20 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1175 (T300992)', diff saved to https://phabricator.wikimedia.org/P21593 and previous config saved to /var/cache/conftool/dbconfig/20220228-122039-ladsgroup.json [production]
12:05 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1175', diff saved to https://phabricator.wikimedia.org/P21592 and previous config saved to /var/cache/conftool/dbconfig/20220228-120535-ladsgroup.json [production]
11:58 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5011.eqsin.wmnet with reason: host reimage [production]
11:55 <vgutierrez@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on cp5011.eqsin.wmnet with reason: host reimage [production]
11:50 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1175', diff saved to https://phabricator.wikimedia.org/P21591 and previous config saved to /var/cache/conftool/dbconfig/20220228-115030-ladsgroup.json [production]
11:42 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1114 (T302185)', diff saved to https://phabricator.wikimedia.org/P21590 and previous config saved to /var/cache/conftool/dbconfig/20220228-114230-ladsgroup.json [production]
11:35 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1175 (T300992)', diff saved to https://phabricator.wikimedia.org/P21589 and previous config saved to /var/cache/conftool/dbconfig/20220228-113525-ladsgroup.json [production]
11:29 <vgutierrez@cumin1001> START - Cookbook sre.hosts.reimage for host cp5011.eqsin.wmnet with OS buster [production]
11:27 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1114', diff saved to https://phabricator.wikimedia.org/P21588 and previous config saved to /var/cache/conftool/dbconfig/20220228-112726-ladsgroup.json [production]
11:17 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1175 (T300992)', diff saved to https://phabricator.wikimedia.org/P21587 and previous config saved to /var/cache/conftool/dbconfig/20220228-111700-ladsgroup.json [production]
11:17 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1175.eqiad.wmnet with reason: Maintenance [production]
11:16 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1175.eqiad.wmnet with reason: Maintenance [production]
11:12 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp1088.eqiad.wmnet with OS buster [production]
11:12 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1114', diff saved to https://phabricator.wikimedia.org/P21586 and previous config saved to /var/cache/conftool/dbconfig/20220228-111221-ladsgroup.json [production]
11:09 <vgutierrez> pool cp1088 running HAProxy as TLS termination layer - T290005 T271421 [production]
10:57 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1114 (T302185)', diff saved to https://phabricator.wikimedia.org/P21585 and previous config saved to /var/cache/conftool/dbconfig/20220228-105716-ladsgroup.json [production]
10:54 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on dbstore1007.eqiad.wmnet with reason: Maintenance [production]