501-550 of 10000 results (88ms)
2025-07-15 ยง
19:46 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2157 (T399249)', diff saved to https://phabricator.wikimedia.org/P79124 and previous config saved to /var/cache/conftool/dbconfig/20250715-194642-marostegui.json [production]
19:42 <eevans@cumin1003> START - Cookbook sre.hosts.reimage for host aqs1012.eqiad.wmnet with OS bullseye [production]
19:41 <eevans@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host aqs1012.eqiad.wmnet with OS bullseye [production]
19:33 <eevans@cumin1003> START - Cookbook sre.hosts.reimage for host aqs1012.eqiad.wmnet with OS bullseye [production]
19:31 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2157', diff saved to https://phabricator.wikimedia.org/P79123 and previous config saved to /var/cache/conftool/dbconfig/20250715-193134-marostegui.json [production]
19:20 <eevans@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host aqs1012.eqiad.wmnet with OS bullseye [production]
19:16 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2157', diff saved to https://phabricator.wikimedia.org/P79122 and previous config saved to /var/cache/conftool/dbconfig/20250715-191627-marostegui.json [production]
19:01 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2157 (T399249)', diff saved to https://phabricator.wikimedia.org/P79121 and previous config saved to /var/cache/conftool/dbconfig/20250715-190120-marostegui.json [production]
18:52 <eevans@cumin1003> START - Cookbook sre.hosts.reimage for host aqs1012.eqiad.wmnet with OS bullseye [production]
18:30 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db2157 (T399249)', diff saved to https://phabricator.wikimedia.org/P79120 and previous config saved to /var/cache/conftool/dbconfig/20250715-183047-marostegui.json [production]
18:30 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on db2157.codfw.wmnet with reason: Maintenance [production]
18:19 <dancy@deploy1003> rebuilt and synchronized wikiversions files: group0 to 1.45.0-wmf.10 refs T392180 [production]
18:11 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host lvs1017.eqiad.wmnet with OS bookworm [production]
18:10 <inflatador> bking@build2001 /srv/deployment/docker-pkg/venv/bin/docker-pkg -c /etc/production-images/config.yaml build images/ --select '*flink*' T398159 [production]
18:01 <swfrench@deploy1003> Finished scap sync-world: Stop building buster-based webserver flavour images - T378128 (duration: 02m 21s) [production]
17:58 <swfrench@deploy1003> Started scap sync-world: Stop building buster-based webserver flavour images - T378128 [production]
17:55 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on lvs1017.eqiad.wmnet with reason: host reimage [production]
17:51 <brett@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on lvs1017.eqiad.wmnet with reason: host reimage [production]
17:49 <brett@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host lvs1017.eqiad.wmnet with OS bookworm [production]
17:34 <swfrench@deploy1003> Finished scap sync-world: Rebuild to pick up new php8.1 production image (duration: 34m 16s) [production]
17:34 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host lvs1017.eqiad.wmnet with OS bookworm [production]
17:24 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host lvs1017.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL [production]
17:14 <brett@cumin2002> START - Cookbook sre.hosts.provision for host lvs1017.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL [production]
17:09 <brett@cumin2002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host lvs1017 [production]
17:09 <brett@cumin2002> START - Cookbook sre.network.configure-switch-interfaces for host lvs1017 [production]
17:09 <fceratto@cumin1002> dbctl commit (dc=all): 'Set es1032 back as master', diff saved to https://phabricator.wikimedia.org/P79119 and previous config saved to /var/cache/conftool/dbconfig/20250715-170919-fceratto.json [production]
17:07 <brett@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
17:07 <fceratto@cumin1002> dbctl commit (dc=all): 'Pooling in after update es1032', diff saved to https://phabricator.wikimedia.org/P79118 and previous config saved to /var/cache/conftool/dbconfig/20250715-170724-fceratto.json [production]
17:04 <brett@cumin2002> START - Cookbook sre.dns.netbox [production]
17:04 <andrew@cumin2002> START - Cookbook sre.hosts.reimage for host cloudcephosd2007-dev.codfw.wmnet with OS bookworm [production]
17:04 <andrew@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudcephosd2007-dev.codfw.wmnet with OS bullseye [production]
17:01 <swfrench@deploy1003> Started scap sync-world: Rebuild to pick up new php8.1 production image [production]
16:59 <fceratto@cumin1002> dbctl commit (dc=all): 'update es1032', diff saved to https://phabricator.wikimedia.org/P79117 and previous config saved to /var/cache/conftool/dbconfig/20250715-165930-fceratto.json [production]
16:58 <brett@cumin2002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host lvs1017 [production]
16:57 <brett@cumin2002> START - Cookbook sre.network.configure-switch-interfaces for host lvs1017 [production]
16:40 <andrew@cumin2002> START - Cookbook sre.hosts.reimage for host cloudcephosd2007-dev.codfw.wmnet with OS bullseye [production]
16:36 <mutante> downtiming es1032 for 3 days - expired downtime for T391921? [production]
16:36 <dzahn@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on es1032.eqiad.wmnet with reason: T391921 [production]
16:33 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host lvs1017.eqiad.wmnet with OS bookworm [production]
16:21 <andrew@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudcephosd2007-dev.codfw.wmnet with OS bullseye [production]
16:10 <jynus@cumin1003> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for db[2160,2234].codfw.wmnet,db[1217,1250].eqiad.wmnet [production]
16:10 <jynus@cumin1003> START - Cookbook sre.hosts.remove-downtime for db[2160,2234].codfw.wmnet,db[1217,1250].eqiad.wmnet [production]
15:52 <btullis@dns1004> END - running authdns-update [production]
15:52 <btullis@dns1004> START - running authdns-update [production]
15:46 <jynus> start replica @ db1217:m3, db2160:m3 T370266 [production]
15:42 <mutante> phabricator version upgrade finished [production]
15:29 <andrew@cumin2002> START - Cookbook sre.hosts.reimage for host cloudcephosd2007-dev.codfw.wmnet with OS bullseye [production]
15:28 <andrew@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudcephosd2007-dev.codfw.wmnet with OS bullseye [production]
15:14 <btullis@dns1004> END - running authdns-update [production]
15:13 <btullis@dns1004> START - running authdns-update [production]