1201-1250 of 10000 results (24ms)
2024-09-26 ยง
13:04 <dcaro@cumin1002> START - Cookbook sre.hosts.reboot-single for host cloudcephosd1041.eqiad.wmnet [production]
13:04 <aokoth@cumin1002> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts vrts1001.eqiad.wmnet [production]
13:03 <aokoth@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
13:03 <aokoth@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: vrts1001.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - aokoth@cumin1002" [production]
12:58 <aokoth@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: vrts1001.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - aokoth@cumin1002" [production]
12:54 <aokoth@cumin1002> START - Cookbook sre.dns.netbox [production]
12:47 <aokoth@cumin1002> START - Cookbook sre.hosts.decommission for hosts vrts1001.eqiad.wmnet [production]
12:46 <aokoth@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on vrts1001.eqiad.wmnet with reason: Decom [production]
12:46 <aokoth@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on vrts1001.eqiad.wmnet with reason: Decom [production]
12:37 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-host for host cloudcephosd1002.eqiad.wmnet [production]
12:32 <moritzm> installing glib2.0 bugfix updates from Bookworm point release [production]
12:31 <kart_> Updated cxserver to 2024-09-18-104433-production (T375017, T374815, T374644) [production]
12:30 <kartik@deploy1003> helmfile [codfw] DONE helmfile.d/services/cxserver: apply [production]
12:29 <kartik@deploy1003> helmfile [codfw] START helmfile.d/services/cxserver: apply [production]
12:28 <kartik@deploy1003> helmfile [eqiad] DONE helmfile.d/services/cxserver: apply [production]
12:27 <kartik@deploy1003> helmfile [eqiad] START helmfile.d/services/cxserver: apply [production]
12:27 <jynus@cumin1002> dbctl commit (dc=all): 's8 weight tuning T375732', diff saved to https://phabricator.wikimedia.org/P69420 and previous config saved to /var/cache/conftool/dbconfig/20240926-122715-jynus.json [production]
12:26 <dcaro@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudcephosd1040.eqiad.wmnet [production]
12:22 <jynus@cumin1002> dbctl commit (dc=all): 's8 weight tuning T375732', diff saved to https://phabricator.wikimedia.org/P69419 and previous config saved to /var/cache/conftool/dbconfig/20240926-122237-jynus.json [production]
12:21 <kartik@deploy1003> helmfile [staging] DONE helmfile.d/services/cxserver: apply [production]
12:21 <kartik@deploy1003> helmfile [staging] START helmfile.d/services/cxserver: apply [production]
12:18 <dcaro@cumin1002> START - Cookbook sre.hosts.reboot-single for host cloudcephosd1040.eqiad.wmnet [production]
12:11 <jynus@cumin1002> dbctl commit (dc=all): 's4 weight tuning T375732', diff saved to https://phabricator.wikimedia.org/P69418 and previous config saved to /var/cache/conftool/dbconfig/20240926-121105-jynus.json [production]
12:06 <mvolz@deploy1003> helmfile [codfw] DONE helmfile.d/services/zotero: apply [production]
12:06 <mvolz@deploy1003> helmfile [codfw] START helmfile.d/services/zotero: apply [production]
12:02 <mvolz@deploy1003> helmfile [eqiad] DONE helmfile.d/services/zotero: apply [production]
12:01 <mvolz@deploy1003> helmfile [eqiad] START helmfile.d/services/zotero: apply [production]
12:00 <jynus@cumin1002> dbctl commit (dc=all): 's4 weight tuning T375732', diff saved to https://phabricator.wikimedia.org/P69417 and previous config saved to /var/cache/conftool/dbconfig/20240926-120013-jynus.json [production]
12:00 <akosiaris@deploy1003> helmfile [staging] DONE helmfile.d/services/zotero: apply [production]
11:59 <akosiaris@deploy1003> helmfile [staging] START helmfile.d/services/zotero: apply [production]
11:52 <jynus@cumin1002> dbctl commit (dc=all): 's4 weight tuning T375732', diff saved to https://phabricator.wikimedia.org/P69416 and previous config saved to /var/cache/conftool/dbconfig/20240926-115208-jynus.json [production]
11:09 <mvolz@deploy1003> helmfile [staging] DONE helmfile.d/services/zotero: apply [production]
11:09 <mvolz@deploy1003> helmfile [staging] START helmfile.d/services/zotero: apply [production]
10:58 <moritzm> prune now obsolete nginx packages from docker-registry hosts T329529 [production]
10:33 <dcaro@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1041.eqiad.wmnet with OS bullseye [production]
10:26 <wmbot~raymondndibe@wmf3402> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component jobs-cli [tools]
10:26 <wmbot~dcaro@urcuchillay> START - Cookbook wmcs.ceph.osd.undrain_node (T372814) [admin]
10:25 <wmbot~dcaro@urcuchillay> END (PASS) - Cookbook wmcs.ceph.osd.bootstrap_and_add (exit_code=0) (T372814) [admin]
10:25 <wmbot~dcaro@urcuchillay> START - Cookbook wmcs.ceph.osd.bootstrap_and_add (T372814) [admin]
10:21 <elukey> start dry run of docker distribution GC on registry1004 (info in https://phabricator.wikimedia.org/T375645#10176397, you can find a root tmux session named as the task on the host to stop) [production]
10:20 <wmbot~raymondndibe@wmf3402> START - Cookbook wmcs.toolforge.component.deploy for component jobs-cli [tools]
10:20 <wmbot~raymondndibe@wmf3402> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component jobs-cli [toolsbeta]
10:17 <dcaro@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudcephosd1038.eqiad.wmnet [production]
10:15 <dcaro@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1041.eqiad.wmnet with reason: host reimage [production]
10:12 <wmbot~raymondndibe@wmf3402> START - Cookbook wmcs.toolforge.component.deploy for component jobs-cli [toolsbeta]
10:12 <wmbot~raymondndibe@wmf3402> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component envvars-cli [tools]
10:11 <wmbot~dcaro@urcuchillay> END (FAIL) - Cookbook wmcs.ceph.osd.bootstrap_and_add (exit_code=99) (T372814) [admin]
10:11 <dcaro@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1041.eqiad.wmnet with reason: host reimage [production]
10:09 <dcaro@cumin1002> START - Cookbook sre.hosts.reboot-single for host cloudcephosd1038.eqiad.wmnet [production]
10:05 <wmbot~raymondndibe@wmf3402> START - Cookbook wmcs.toolforge.component.deploy for component envvars-cli [tools]