2025-07-15
ยง
|
19:46 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2157 (T399249)', diff saved to https://phabricator.wikimedia.org/P79124 and previous config saved to /var/cache/conftool/dbconfig/20250715-194642-marostegui.json |
[production] |
19:42 |
<eevans@cumin1003> |
START - Cookbook sre.hosts.reimage for host aqs1012.eqiad.wmnet with OS bullseye |
[production] |
19:41 |
<eevans@cumin1003> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host aqs1012.eqiad.wmnet with OS bullseye |
[production] |
19:33 |
<eevans@cumin1003> |
START - Cookbook sre.hosts.reimage for host aqs1012.eqiad.wmnet with OS bullseye |
[production] |
19:31 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2157', diff saved to https://phabricator.wikimedia.org/P79123 and previous config saved to /var/cache/conftool/dbconfig/20250715-193134-marostegui.json |
[production] |
19:20 |
<eevans@cumin1003> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host aqs1012.eqiad.wmnet with OS bullseye |
[production] |
19:16 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2157', diff saved to https://phabricator.wikimedia.org/P79122 and previous config saved to /var/cache/conftool/dbconfig/20250715-191627-marostegui.json |
[production] |
19:01 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2157 (T399249)', diff saved to https://phabricator.wikimedia.org/P79121 and previous config saved to /var/cache/conftool/dbconfig/20250715-190120-marostegui.json |
[production] |
18:52 |
<eevans@cumin1003> |
START - Cookbook sre.hosts.reimage for host aqs1012.eqiad.wmnet with OS bullseye |
[production] |
18:30 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db2157 (T399249)', diff saved to https://phabricator.wikimedia.org/P79120 and previous config saved to /var/cache/conftool/dbconfig/20250715-183047-marostegui.json |
[production] |
18:30 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on db2157.codfw.wmnet with reason: Maintenance |
[production] |
18:19 |
<dancy@deploy1003> |
rebuilt and synchronized wikiversions files: group0 to 1.45.0-wmf.10 refs T392180 |
[production] |
18:11 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host lvs1017.eqiad.wmnet with OS bookworm |
[production] |
18:10 |
<inflatador> |
bking@build2001 /srv/deployment/docker-pkg/venv/bin/docker-pkg -c /etc/production-images/config.yaml build images/ --select '*flink*' T398159 |
[production] |
18:01 |
<swfrench@deploy1003> |
Finished scap sync-world: Stop building buster-based webserver flavour images - T378128 (duration: 02m 21s) |
[production] |
17:58 |
<swfrench@deploy1003> |
Started scap sync-world: Stop building buster-based webserver flavour images - T378128 |
[production] |
17:55 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on lvs1017.eqiad.wmnet with reason: host reimage |
[production] |
17:51 |
<brett@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on lvs1017.eqiad.wmnet with reason: host reimage |
[production] |
17:49 |
<brett@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host lvs1017.eqiad.wmnet with OS bookworm |
[production] |
17:34 |
<swfrench@deploy1003> |
Finished scap sync-world: Rebuild to pick up new php8.1 production image (duration: 34m 16s) |
[production] |
17:34 |
<brett@cumin2002> |
START - Cookbook sre.hosts.reimage for host lvs1017.eqiad.wmnet with OS bookworm |
[production] |
17:24 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host lvs1017.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL |
[production] |
17:14 |
<brett@cumin2002> |
START - Cookbook sre.hosts.provision for host lvs1017.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL |
[production] |
17:09 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host lvs1017 |
[production] |
17:09 |
<brett@cumin2002> |
START - Cookbook sre.network.configure-switch-interfaces for host lvs1017 |
[production] |
17:09 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Set es1032 back as master', diff saved to https://phabricator.wikimedia.org/P79119 and previous config saved to /var/cache/conftool/dbconfig/20250715-170919-fceratto.json |
[production] |
17:07 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
17:07 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Pooling in after update es1032', diff saved to https://phabricator.wikimedia.org/P79118 and previous config saved to /var/cache/conftool/dbconfig/20250715-170724-fceratto.json |
[production] |
17:04 |
<brett@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
17:04 |
<andrew@cumin2002> |
START - Cookbook sre.hosts.reimage for host cloudcephosd2007-dev.codfw.wmnet with OS bookworm |
[production] |
17:04 |
<andrew@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudcephosd2007-dev.codfw.wmnet with OS bullseye |
[production] |
17:01 |
<swfrench@deploy1003> |
Started scap sync-world: Rebuild to pick up new php8.1 production image |
[production] |
16:59 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'update es1032', diff saved to https://phabricator.wikimedia.org/P79117 and previous config saved to /var/cache/conftool/dbconfig/20250715-165930-fceratto.json |
[production] |
16:58 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host lvs1017 |
[production] |
16:57 |
<brett@cumin2002> |
START - Cookbook sre.network.configure-switch-interfaces for host lvs1017 |
[production] |
16:40 |
<andrew@cumin2002> |
START - Cookbook sre.hosts.reimage for host cloudcephosd2007-dev.codfw.wmnet with OS bullseye |
[production] |
16:36 |
<mutante> |
downtiming es1032 for 3 days - expired downtime for T391921? |
[production] |
16:36 |
<dzahn@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on es1032.eqiad.wmnet with reason: T391921 |
[production] |
16:33 |
<brett@cumin2002> |
START - Cookbook sre.hosts.reimage for host lvs1017.eqiad.wmnet with OS bookworm |
[production] |
16:21 |
<andrew@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudcephosd2007-dev.codfw.wmnet with OS bullseye |
[production] |
16:10 |
<jynus@cumin1003> |
END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for db[2160,2234].codfw.wmnet,db[1217,1250].eqiad.wmnet |
[production] |
16:10 |
<jynus@cumin1003> |
START - Cookbook sre.hosts.remove-downtime for db[2160,2234].codfw.wmnet,db[1217,1250].eqiad.wmnet |
[production] |
15:52 |
<btullis@dns1004> |
END - running authdns-update |
[production] |
15:52 |
<btullis@dns1004> |
START - running authdns-update |
[production] |
15:46 |
<jynus> |
start replica @ db1217:m3, db2160:m3 T370266 |
[production] |
15:42 |
<mutante> |
phabricator version upgrade finished |
[production] |
15:29 |
<andrew@cumin2002> |
START - Cookbook sre.hosts.reimage for host cloudcephosd2007-dev.codfw.wmnet with OS bullseye |
[production] |
15:28 |
<andrew@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudcephosd2007-dev.codfw.wmnet with OS bullseye |
[production] |
15:14 |
<btullis@dns1004> |
END - running authdns-update |
[production] |
15:13 |
<btullis@dns1004> |
START - running authdns-update |
[production] |