2026-05-19
22:39 <brett> disabling pybal/puppet on lvs2012 due to hardware misconfiguration/failure - T425890 [production]
22:18 <brett@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 14 days, 0:00:00 on lvs2012.codfw.wmnet with reason: MD RAID failure [production]
22:16 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for lvs2012.codfw.wmnet [production]
22:16 <brett@cumin2002> START - Cookbook sre.hosts.remove-downtime for lvs2012.codfw.wmnet [production]
21:48 <jclark@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host kafka-logging1006.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
21:46 <jiji@cumin1003> END (FAIL) - Cookbook sre.k8s.reboot-nodes (exit_code=1) rolling reboot on P{wikikube-worker[1006-1007,1015-1016,1021,1034-1057,1064-1081,1084-1087,1093-1095,1113-1165,1240-1289,1291-1327,1375-1384].eqiad.wmnet} and (A:wikikube-master-eqiad or A:wikikube-worker-eqiad) [production]
21:42 <jclark@cumin1003> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host wdqs1037 [production]
21:41 <jclark@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host wdqs1037.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
21:41 <jclark@cumin1003> START - Cookbook sre.network.configure-switch-interfaces for host wdqs1037 [production]
21:34 <jclark@cumin1003> START - Cookbook sre.hosts.provision for host wdqs1037.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
21:33 <jclark@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host wdqs1036.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
21:27 <jclark@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host wdqs1037.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
21:23 <jclark@cumin1003> START - Cookbook sre.hosts.provision for host wdqs1037.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
21:22 <jclark@cumin1003> START - Cookbook sre.hosts.provision for host wdqs1036.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
21:20 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host lvs2014.codfw.wmnet [production]
21:19 <jclark@cumin1003> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host wdqs1037 [production]
21:18 <jclark@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host wdqs1038.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
21:18 <jclark@cumin1003> START - Cookbook sre.network.configure-switch-interfaces for host wdqs1037 [production]
21:18 <jclark@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host wdqs1037.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
21:16 <jclark@cumin1003> END (FAIL) - Cookbook sre.network.configure-switch-interfaces (exit_code=99) for host wdqs1036 [production]
21:16 <jclark@cumin1003> START - Cookbook sre.network.configure-switch-interfaces for host wdqs1036 [production]
21:15 <brett@cumin2002> START - Cookbook sre.hosts.reboot-single for host lvs2014.codfw.wmnet [production]
21:13 <jclark@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host wdqs1036.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
21:12 <jclark@cumin1003> START - Cookbook sre.hosts.provision for host wdqs1036.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
21:11 <jclark@cumin1003> START - Cookbook sre.hosts.provision for host wdqs1038.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
21:10 <cdanis> 💔cdanis@apt1002.wikimedia.org ~ 🕔🍺 sudo -i reprepro -C main --ignore=wrongdistribution copy bookworm-wikimedia trixie-wikimedia cidergrinder [production]
21:10 <jclark@cumin1003> START - Cookbook sre.hosts.provision for host wdqs1037.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
21:09 <jclark@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host wdqs1036.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
21:09 <jclark@cumin1003> START - Cookbook sre.hosts.provision for host wdqs1036.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
21:09 <jclark@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host wdqs1036.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
21:09 <jclark@cumin1003> START - Cookbook sre.hosts.provision for host wdqs1036.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
21:08 <jclark@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
21:08 <jclark@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: adding wdqs1036 to eqiad - jclark@cumin1003" [production]
21:08 <jclark@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: adding wdqs1036 to eqiad - jclark@cumin1003" [production]
21:04 <jclark@cumin1003> START - Cookbook sre.dns.netbox [production]
20:55 <sbassett@deploy1003> Finished scap sync-world: Backport for [[gerrit:1288999|Explicitly set wgCSPUseReportURIDirective and not wmgCSPUseReportURIDirective to true (T424058)]] (duration: 06m 40s) [production]
20:51 <sbassett@deploy1003> sbassett: Continuing with deployment [production]
20:51 <brett@cumin2002> END (PASS) - Cookbook sre.loadbalancer.admin (exit_code=0) rebooting P{lvs7003.magru.wmnet} and A:liberica [production]
20:51 <sbassett@deploy1003> sbassett: Backport for [[gerrit:1288999|Explicitly set wgCSPUseReportURIDirective and not wmgCSPUseReportURIDirective to true (T424058)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
20:49 <sbassett@deploy1003> Started scap sync-world: Backport for [[gerrit:1288999|Explicitly set wgCSPUseReportURIDirective and not wmgCSPUseReportURIDirective to true (T424058)]] [production]
20:47 <brett@cumin2002> START - Cookbook sre.loadbalancer.admin rebooting P{lvs7003.magru.wmnet} and A:liberica [production]
20:40 <ebernhardson@deploy1003> Finished scap sync-world: Backport for [[gerrit:1288983|Revert^2 "Include xff in search logs"]] (duration: 08m 12s) [production]
20:36 <ebernhardson@deploy1003> ebernhardson: Continuing with deployment [production]
20:35 <brett@cumin2002> END (PASS) - Cookbook sre.cdn.roll-reboot (exit_code=0) rolling reboot on P{cp701[1-2].magru.wmnet} and A:cp [production]
20:35 <brett@cumin2002> cookbooks.sre.cdn.roll-reboot finished rebooting cp7012.magru.wmnet [production]
20:35 <brett@cumin2002> END (PASS) - Cookbook sre.cdn.roll-reboot (exit_code=0) rolling reboot on P{cp700[3-4].magru.wmnet} and A:cp [production]
20:35 <brett@cumin2002> cookbooks.sre.cdn.roll-reboot finished rebooting cp7004.magru.wmnet [production]
20:34 <ebernhardson@deploy1003> ebernhardson: Backport for [[gerrit:1288983|Revert^2 "Include xff in search logs"]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
20:32 <ebernhardson@deploy1003> Started scap sync-world: Backport for [[gerrit:1288983|Revert^2 "Include xff in search logs"]] [production]
20:29 <brett@cumin2002> END (PASS) - Cookbook sre.cdn.roll-restart-reboot-ncredir (exit_code=0) rolling reboot on A:ncredir and not A:ncredir-magru and A:ncredir [production]