951-1000 of 10000 results (116ms)
2025-08-29 ยง
09:14 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM failoid2003.codfw.wmnet - jmm@cumin2002" [production]
09:13 <jmm@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM failoid2003.codfw.wmnet - jmm@cumin2002" [production]
09:13 <jmm@cumin2002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) failoid2003.codfw.wmnet on all recursors [production]
09:13 <jmm@cumin2002> START - Cookbook sre.dns.wipe-cache failoid2003.codfw.wmnet on all recursors [production]
09:13 <jmm@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
09:13 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM failoid2003.codfw.wmnet - jmm@cumin2002" [production]
09:13 <jmm@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM failoid2003.codfw.wmnet - jmm@cumin2002" [production]
09:11 <kevinbazira@deploy1003> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-drafttopic' for release 'main' . [production]
09:11 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2194 (T402925)', diff saved to https://phabricator.wikimedia.org/P82093 and previous config saved to /var/cache/conftool/dbconfig/20250829-091108-ladsgroup.json [production]
09:08 <jmm@cumin2002> START - Cookbook sre.dns.netbox [production]
09:08 <jmm@cumin2002> START - Cookbook sre.ganeti.makevm for new host failoid2003.codfw.wmnet [production]
08:53 <vriley@cumin1003> START - Cookbook sre.hosts.provision for host maps1012.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
08:51 <kevinbazira@deploy1003> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-draftquality' for release 'main' . [production]
08:51 <vriley@cumin1003> START - Cookbook sre.hosts.provision for host maps1011.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
08:50 <vriley@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
08:50 <vriley@cumin1003> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host maps1012 [production]
08:49 <kevinbazira@deploy1003> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-draftquality' for release 'main' . [production]
08:49 <vriley@cumin1003> START - Cookbook sre.network.configure-switch-interfaces for host maps1012 [production]
08:48 <vriley@cumin1003> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host maps1011 [production]
08:48 <vriley@cumin1003> START - Cookbook sre.dns.netbox [production]
08:47 <vriley@cumin1003> START - Cookbook sre.network.configure-switch-interfaces for host maps1011 [production]
08:47 <vriley@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
08:44 <vriley@cumin1003> START - Cookbook sre.dns.netbox [production]
08:44 <vriley@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
08:44 <vriley@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update mgmt maps1012 - vriley@cumin1003" [production]
08:44 <vriley@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update mgmt maps1012 - vriley@cumin1003" [production]
08:40 <vriley@cumin1003> START - Cookbook sre.dns.netbox [production]
08:40 <vriley@cumin1003> END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) [production]
08:38 <vriley@cumin1003> START - Cookbook sre.dns.netbox [production]
08:27 <kevinbazira@deploy1003> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-articletopic' for release 'main' . [production]
08:22 <kevinbazira@deploy1003> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-articletopic' for release 'main' . [production]
08:19 <jmm@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4 days, 0:00:00 on install2004.wikimedia.org with reason: being replaced by install2005 [production]
08:02 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Depooling db2194 (T402925)', diff saved to https://phabricator.wikimedia.org/P82092 and previous config saved to /var/cache/conftool/dbconfig/20250829-080216-ladsgroup.json [production]
08:02 <ladsgroup@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2194.codfw.wmnet with reason: Maintenance [production]
08:01 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2190 (T402925)', diff saved to https://phabricator.wikimedia.org/P82091 and previous config saved to /var/cache/conftool/dbconfig/20250829-080153-ladsgroup.json [production]
07:49 <kevinbazira@deploy1003> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-articlequality' for release 'main' . [production]
07:46 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2190', diff saved to https://phabricator.wikimedia.org/P82090 and previous config saved to /var/cache/conftool/dbconfig/20250829-074645-ladsgroup.json [production]
07:46 <kevinbazira@deploy1003> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-articlequality' for release 'main' . [production]
07:31 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2190', diff saved to https://phabricator.wikimedia.org/P82089 and previous config saved to /var/cache/conftool/dbconfig/20250829-073138-ladsgroup.json [production]
07:16 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2190 (T402925)', diff saved to https://phabricator.wikimedia.org/P82088 and previous config saved to /var/cache/conftool/dbconfig/20250829-071630-ladsgroup.json [production]
06:13 <arnaudb@cumin1003> START - Cookbook sre.gitlab.upgrade on GitLab host gitlab1004.wikimedia.org with reason: Security Update [production]
06:06 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Depooling db2190 (T402925)', diff saved to https://phabricator.wikimedia.org/P82087 and previous config saved to /var/cache/conftool/dbconfig/20250829-060644-ladsgroup.json [production]
06:06 <ladsgroup@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2190.codfw.wmnet with reason: Maintenance [production]
06:06 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2177 (T402925)', diff saved to https://phabricator.wikimedia.org/P82086 and previous config saved to /var/cache/conftool/dbconfig/20250829-060621-ladsgroup.json [production]
05:51 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2177', diff saved to https://phabricator.wikimedia.org/P82085 and previous config saved to /var/cache/conftool/dbconfig/20250829-055113-ladsgroup.json [production]
05:36 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2177', diff saved to https://phabricator.wikimedia.org/P82084 and previous config saved to /var/cache/conftool/dbconfig/20250829-053606-ladsgroup.json [production]
05:20 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2177 (T402925)', diff saved to https://phabricator.wikimedia.org/P82083 and previous config saved to /var/cache/conftool/dbconfig/20250829-052059-ladsgroup.json [production]
04:08 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Depooling db2177 (T402925)', diff saved to https://phabricator.wikimedia.org/P82082 and previous config saved to /var/cache/conftool/dbconfig/20250829-040849-ladsgroup.json [production]
04:08 <ladsgroup@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2177.codfw.wmnet with reason: Maintenance [production]
04:08 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2156 (T402925)', diff saved to https://phabricator.wikimedia.org/P82081 and previous config saved to /var/cache/conftool/dbconfig/20250829-040826-ladsgroup.json [production]