3051-3100 of 10000 results (90ms)
2023-05-12 §
00:50 <denisse@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: prometheus5001.eqsin.wmnet decommissioned, removing all IPs except the asset tag one - denisse@cumin1001" [production]
00:48 <denisse@cumin1001> START - Cookbook sre.dns.netbox [production]
00:44 <denisse@cumin1001> START - Cookbook sre.hosts.decommission for hosts prometheus5001.eqsin.wmnet [production]
00:32 <denisse> manually removing prometheus4001.ulsfo.wmnet from the Ganeti master after a failed step in the decommission cookbook - T335585 [production]
00:22 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on prometheus3001.esams.wmnet with reason: maintenance [production]
00:22 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on prometheus3001.esams.wmnet with reason: maintenance [production]
2023-05-11 §
23:39 <mutante> LDAP - added uid lorenjohnson to groups wmde nda T335858 [production]
23:39 <mutante> LDAP - added uid roti to groups wmde and nda T336435 [production]
23:24 <rzl@cumin2002> conftool action : set/pooled=no; selector: cluster=videoscaler,name=mw14(3[789]|4[056]|57)\.eqiad\.wmnet [production]
23:22 <rzl@cumin2002> conftool action : set/pooled=no; selector: cluster=jobrunner,name=mw14(5[89]|6[016789]|9[45])\.eqiad\.wmnet [production]
23:22 <rzl@cumin2002> conftool action : set/pooled=no; selector: cluster=videoscaler,name=mw14(3[789]|4[056]57)\.eqiad\.wmnet [production]
23:07 <pt1979@cumin2002> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['cloudswift1002'] [production]
22:46 <pt1979@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['cloudswift1002'] [production]
22:41 <pt1979@cumin2002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['cloudswift1001'] [production]
22:23 <pt1979@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['cloudswift1001'] [production]
21:56 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host cloudswift1002.mgmt.eqiad.wmnet with reboot policy FORCED [production]
21:45 <pt1979@cumin2002> START - Cookbook sre.hosts.provision for host cloudswift1002.mgmt.eqiad.wmnet with reboot policy FORCED [production]
21:45 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host cloudswift1001.mgmt.eqiad.wmnet with reboot policy FORCED [production]
21:10 <eevans@cumin1001> END (PASS) - Cookbook sre.swift.roll-restart-reboot-swift-thanos-proxies (exit_code=0) rolling restart_daemons on A:thanos-fe-eqiad [production]
21:07 <eevans@cumin1001> START - Cookbook sre.swift.roll-restart-reboot-swift-thanos-proxies rolling restart_daemons on A:thanos-fe-eqiad [production]
21:07 <jclark@cumin1001> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts db1225.eqiad.wmnet [production]
21:07 <eevans@cumin1001> END (PASS) - Cookbook sre.swift.roll-restart-reboot-swift-thanos-proxies (exit_code=0) rolling restart_daemons on A:thanos-fe-codfw [production]
21:06 <jclark@cumin1001> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts db1225.eqiad.wmnet [production]
21:05 <eevans@cumin1001> START - Cookbook sre.swift.roll-restart-reboot-swift-thanos-proxies rolling restart_daemons on A:thanos-fe-codfw [production]
20:58 <urbanecm@deploy1002> Finished scap: Backport for [[gerrit:919175|Personalized praise: Do not suggest users with Homepage disabled (T336300)]], [[gerrit:919176|Personalized praise: Do not suggest users with Homepage disabled (T336300)]] (duration: 07m 30s) [production]
20:52 <urbanecm@deploy1002> urbanecm: Backport for [[gerrit:919175|Personalized praise: Do not suggest users with Homepage disabled (T336300)]], [[gerrit:919176|Personalized praise: Do not suggest users with Homepage disabled (T336300)]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet [production]
20:51 <urbanecm@deploy1002> Started scap: Backport for [[gerrit:919175|Personalized praise: Do not suggest users with Homepage disabled (T336300)]], [[gerrit:919176|Personalized praise: Do not suggest users with Homepage disabled (T336300)]] [production]
20:50 <urbanecm@deploy1002> Finished scap: Backport for [[gerrit:912310|[Growth] Remove config variables provided by extension]] (duration: 20m 04s) [production]
20:37 <denisse@cumin1001> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) for hosts prometheus4001.ulsfo.wment [production]
20:37 <denisse@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
20:36 <denisse@cumin1001> START - Cookbook sre.dns.netbox [production]
20:32 <denisse@cumin1001> START - Cookbook sre.hosts.decommission for hosts prometheus4001.ulsfo.wment [production]
20:31 <urbanecm@deploy1002> urbanecm: Backport for [[gerrit:912310|[Growth] Remove config variables provided by extension]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet [production]
20:30 <urbanecm@deploy1002> Started scap: Backport for [[gerrit:912310|[Growth] Remove config variables provided by extension]] [production]
20:24 <pt1979@cumin2002> START - Cookbook sre.hosts.provision for host cloudswift1001.mgmt.eqiad.wmnet with reboot policy FORCED [production]
20:22 <thcipriani@deploy1002> Finished scap: Backport for [[gerrit:919168|Allow http://localhost callback URL (T299737)]] (duration: 09m 37s) [production]
20:22 <denisse@cumin1001> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) for hosts prometheus3001.esams.wment [production]
20:22 <denisse@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
20:21 <denisse@cumin1001> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: prometheus3001.esams.wment decommissioned, removing all IPs except the asset tag one - denisse@cumin1001" [production]
20:20 <denisse@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: prometheus3001.esams.wment decommissioned, removing all IPs except the asset tag one - denisse@cumin1001" [production]
20:18 <denisse@cumin1001> START - Cookbook sre.dns.netbox [production]
20:17 <denisse> manually remove prometheus3001.esams.wmnet from the ganeti master after a failed step in the decommission cookbook. [production]
20:14 <denisse@cumin1001> START - Cookbook sre.hosts.decommission for hosts prometheus3001.esams.wment [production]
20:14 <thcipriani@deploy1002> bd808 and thcipriani: Backport for [[gerrit:919168|Allow http://localhost callback URL (T299737)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet [production]
20:12 <thcipriani@deploy1002> Started scap: Backport for [[gerrit:919168|Allow http://localhost callback URL (T299737)]] [production]
19:56 <denisse@cumin1001> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) for hosts prometheus3001.esams.wment [production]
19:56 <denisse@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
19:55 <denisse@cumin1001> START - Cookbook sre.dns.netbox [production]
19:51 <denisse@cumin1001> START - Cookbook sre.hosts.decommission for hosts prometheus3001.esams.wment [production]
19:06 <bking@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 14 days, 0:00:00 on wdqs2021.codfw.wmnet with reason: attempting WDQS stack on bullseye [production]