2025-04-23
|
17:42 |
<vriley@cumin1002> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host an-worker1184.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
17:39 |
<bking@cumin2002> |
conftool action : set/pooled=yes:weight=10; selector: name=cirrussearch2071.codfw.wmnet|cirrussearch2098.codfw.wmnet|cirrussearch2099.codfw.wmnet|cirrussearch2101.codfw.wmnet|cirrussearch2102.codfw.wmnet|cirrussearch2113.codfw.wmnet |
[production] |
17:38 |
<vriley@cumin1002> |
START - Cookbook sre.hosts.provision for host an-worker1184.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
17:29 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1207', diff saved to https://phabricator.wikimedia.org/P75343 and previous config saved to /var/cache/conftool/dbconfig/20250423-172912-fceratto.json |
[production] |
17:21 |
<brett> |
Remove libvarnishapi-dev from bookworm-wikimedia |
[production] |
17:14 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1207 (T391056)', diff saved to https://phabricator.wikimedia.org/P75342 and previous config saved to /var/cache/conftool/dbconfig/20250423-171404-fceratto.json |
[production] |
17:04 |
<brett> |
Remove varnish libvmod-re2 libvmod-netmapper libvmod-querysort libvarnishapi2 varnish-modules varnishkafka from bookworm-wikimedia |
[production] |
16:57 |
<brennen> |
Updating development images on contint primary for https://gitlab.wikimedia.org/repos/releng/dev-images/-/commit/a80e5211100f1cc42e4ae020d4266ea22938eb5a (T383097) |
[releng] |
16:56 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db1207 (T391056)', diff saved to https://phabricator.wikimedia.org/P75341 and previous config saved to /var/cache/conftool/dbconfig/20250423-165634-fceratto.json |
[production] |
16:56 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1207.eqiad.wmnet with reason: Maintenance |
[production] |
16:56 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1206 (T391056)', diff saved to https://phabricator.wikimedia.org/P75340 and previous config saved to /var/cache/conftool/dbconfig/20250423-165611-fceratto.json |
[production] |
16:52 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host cirrussearch2113 |
[production] |
16:52 |
<bking@cumin2002> |
START - Cookbook sre.hosts.move-vlan for host cirrussearch2113 |
[production] |
16:52 |
<bking@cumin2002> |
START - Cookbook sre.hosts.reimage for host cirrussearch2113.codfw.wmnet with OS bullseye |
[production] |
16:51 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.hosts.rename (exit_code=0) from elastic2113 to cirrussearch2113 |
[production] |
16:50 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host cirrussearch2113 |
[production] |
16:50 |
<bking@cumin2002> |
START - Cookbook sre.network.configure-switch-interfaces for host cirrussearch2113 |
[production] |
16:50 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
16:50 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Renaming elastic2113 to cirrussearch2113 - bking@cumin2002" |
[production] |
16:47 |
<bking@cumin2002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Renaming elastic2113 to cirrussearch2113 - bking@cumin2002" |
[production] |
16:43 |
<bking@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
16:42 |
<bking@cumin2002> |
START - Cookbook sre.hosts.rename from elastic2113 to cirrussearch2113 |
[production] |
16:41 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1206', diff saved to https://phabricator.wikimedia.org/P75338 and previous config saved to /var/cache/conftool/dbconfig/20250423-164105-fceratto.json |
[production] |
16:32 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.ceph.osd.undrain_node |
[admin] |
16:32 |
<andrew@cloudcumin1001> |
END (ERROR) - Cookbook wmcs.ceph.osd.drain_node (exit_code=97) |
[admin] |
16:32 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.ceph.osd.drain_node |
[admin] |
16:31 |
<andrew@cloudcumin1001> |
END (ERROR) - Cookbook wmcs.ceph.osd.drain_node (exit_code=97) |
[admin] |
16:30 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cirrussearch2102.codfw.wmnet with OS bullseye |
[production] |
16:29 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.ceph.osd.drain_node |
[admin] |
16:29 |
<andrew@cloudcumin1001> |
END (ERROR) - Cookbook wmcs.ceph.osd.drain_node (exit_code=97) |
[admin] |
16:29 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.ceph.osd.drain_node |
[admin] |
16:29 |
<andrew@cloudcumin1001> |
END (ERROR) - Cookbook wmcs.ceph.osd.undrain_node (exit_code=97) |
[admin] |
16:28 |
<vriley@cumin1002> |
START - Cookbook sre.hosts.provision for host an-worker1184.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
16:28 |
<vriley@cumin1002> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host an-worker1184.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
16:27 |
<dreamyjazz@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1138403|Enable temporary-account-viewer group on all WMF production wikis (T390942 T387205)]] (duration: 11m 11s) |
[production] |
16:27 |
<vriley@cumin1002> |
START - Cookbook sre.hosts.provision for host an-worker1184.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
16:25 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1206', diff saved to https://phabricator.wikimedia.org/P75337 and previous config saved to /var/cache/conftool/dbconfig/20250423-162558-fceratto.json |
[production] |
16:25 |
<vriley@cumin1002> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host an-worker1184.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL |
[production] |
16:24 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es1032 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P75336 and previous config saved to /var/cache/conftool/dbconfig/20250423-162434-root.json |
[production] |
16:24 |
<vriley@cumin1002> |
START - Cookbook sre.hosts.provision for host an-worker1184.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL |
[production] |
16:23 |
<vriley@cumin1002> |
END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host an-worker1184 |
[production] |
16:23 |
<vriley@cumin1002> |
START - Cookbook sre.network.configure-switch-interfaces for host an-worker1184 |
[production] |
16:21 |
<vriley@cumin1002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
16:21 |
<dreamyjazz@deploy1003> |
dreamyjazz: Continuing with sync |
[production] |
16:21 |
<dreamyjazz@deploy1003> |
dreamyjazz: Backport for [[gerrit:1138403|Enable temporary-account-viewer group on all WMF production wikis (T390942 T387205)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
16:19 |
<vriley@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
16:18 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.gitlab.upgrade (exit_code=0) on GitLab host gitlab1004.wikimedia.org with reason: Upgrade Replica to GitLab 17.9 |
[production] |
16:18 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.ceph.osd.undrain_node |
[admin] |
16:17 |
<andrew@cloudcumin1001> |
END (ERROR) - Cookbook wmcs.ceph.osd.undrain_node (exit_code=97) |
[admin] |
16:17 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.ceph.osd.undrain_node |
[admin] |