1951-2000 of 10000 results (39ms)
2025-06-16 §
06:47 <jmm@cumin1003> START - Cookbook sre.hosts.decommission for hosts install7001.wikimedia.org [production]
06:41 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1172 (T396130)', diff saved to https://phabricator.wikimedia.org/P77968 and previous config saved to /var/cache/conftool/dbconfig/20250616-064117-marostegui.json [production]
06:25 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db1172 (T396130)', diff saved to https://phabricator.wikimedia.org/P77967 and previous config saved to /var/cache/conftool/dbconfig/20250616-062536-marostegui.json [production]
06:25 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1172.eqiad.wmnet with reason: Maintenance [production]
06:11 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1171.eqiad.wmnet with reason: Maintenance [production]
06:10 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1167 (T396130)', diff saved to https://phabricator.wikimedia.org/P77966 and previous config saved to /var/cache/conftool/dbconfig/20250616-061053-marostegui.json [production]
05:55 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1167', diff saved to https://phabricator.wikimedia.org/P77965 and previous config saved to /var/cache/conftool/dbconfig/20250616-055545-marostegui.json [production]
05:49 <ayounsi@cumin1002> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 151326 [production]
05:48 <ayounsi@cumin1002> START - Cookbook sre.network.peering with action 'email' for AS: 151326 [production]
05:46 <marostegui@cumin1002> dbctl commit (dc=all): 'db2204 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P77964 and previous config saved to /var/cache/conftool/dbconfig/20250616-054656-root.json [production]
05:42 <stevemunene@cumin1002> END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) for hosts an-worker1161.eqiad.wmnet [production]
05:42 <stevemunene@cumin1002> START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1161.eqiad.wmnet [production]
05:41 <stevemunene@cumin1002> END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) for hosts an-worker1161.eqiad.wmnet [production]
05:40 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1167', diff saved to https://phabricator.wikimedia.org/P77963 and previous config saved to /var/cache/conftool/dbconfig/20250616-054037-marostegui.json [production]
05:38 <stevemunene@cumin1002> START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1161.eqiad.wmnet [production]
05:37 <stevemunene@cumin1002> END (FAIL) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=99) for hosts an-worker1162.eqiad.wmnet [production]
05:35 <stevemunene@cumin1002> START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1162.eqiad.wmnet [production]
05:35 <stevemunene@cumin1002> END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) for hosts an-worker1160.eqiad.wmnet [production]
05:33 <stevemunene@cumin1002> START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1160.eqiad.wmnet [production]
05:31 <marostegui@cumin1002> dbctl commit (dc=all): 'db2204 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P77962 and previous config saved to /var/cache/conftool/dbconfig/20250616-053150-root.json [production]
05:25 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1167 (T396130)', diff saved to https://phabricator.wikimedia.org/P77961 and previous config saved to /var/cache/conftool/dbconfig/20250616-052530-marostegui.json [production]
05:16 <marostegui@cumin1002> dbctl commit (dc=all): 'db2204 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P77960 and previous config saved to /var/cache/conftool/dbconfig/20250616-051644-root.json [production]
05:06 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db1167 (T396130)', diff saved to https://phabricator.wikimedia.org/P77959 and previous config saved to /var/cache/conftool/dbconfig/20250616-050637-marostegui.json [production]
05:06 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1016,1020].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance [production]
05:06 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1167.eqiad.wmnet with reason: Maintenance [production]
05:01 <marostegui@cumin1002> dbctl commit (dc=all): 'db2204 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P77958 and previous config saved to /var/cache/conftool/dbconfig/20250616-050139-root.json [production]
04:58 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db2204.codfw.wmnet with reason: Maintenance [production]
04:57 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool db2204 T396549', diff saved to https://phabricator.wikimedia.org/P77957 and previous config saved to /var/cache/conftool/dbconfig/20250616-045738-marostegui.json [production]
04:52 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2161.codfw.wmnet with reason: Maintenance [production]
2025-06-15 §
18:09 <aokoth@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 10 days, 0:00:00 on doc1003.eqiad.wmnet with reason: Bookworm Migration [production]
2025-06-14 §
22:38 <andrew@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
22:38 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.ceph.osd.bootstrap_and_add (exit_code=0) [admin]
22:35 <andrew@cumin1002> START - Cookbook sre.dns.netbox [production]
22:29 <andrew@cloudcumin1001> START - Cookbook wmcs.ceph.osd.bootstrap_and_add [admin]
22:24 <andrew@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
22:24 <andrew@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: T396940 - andrew@cumin1002" [production]
22:23 <andrew@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: T396940 - andrew@cumin1002" [production]
22:18 <andrew@cumin1002> START - Cookbook sre.dns.netbox [production]
21:51 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1024.eqiad.wmnet with OS bullseye [production]
21:35 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1024.eqiad.wmnet with reason: host reimage [production]
21:31 <andrew@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1024.eqiad.wmnet with reason: host reimage [production]
21:16 <andrew@cumin1002> START - Cookbook sre.hosts.reimage for host cloudcephosd1024.eqiad.wmnet with OS bullseye [production]
21:15 <andrew@cumin1002> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['cloudcephosd1024.eqiad.wmnet'] [production]
21:08 <andrew@cumin1002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['cloudcephosd1024.eqiad.wmnet'] [production]
21:08 <andrew@cumin1002> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts cloudcephosd1024.eqiad.wmnet [production]
21:08 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudcephosd1024.eqiad.wmnet [production]
20:58 <andrew@cumin1002> START - Cookbook sre.hosts.reboot-single for host cloudcephosd1024.eqiad.wmnet [production]
20:58 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.ceph.osd.bootstrap_and_add (exit_code=0) [admin]
20:48 <andrew@cloudcumin1001> START - Cookbook wmcs.ceph.osd.bootstrap_and_add [admin]
20:46 <andrew@cumin1002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts cloudcephosd1024.eqiad.wmnet [production]