551-600 of 10000 results (77ms)
2023-05-11 ยง
23:22 <rzl@cumin2002> conftool action : set/pooled=no; selector: cluster=videoscaler,name=mw14(3[789]|4[056]57)\.eqiad\.wmnet [production]
23:07 <pt1979@cumin2002> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['cloudswift1002'] [production]
22:46 <pt1979@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['cloudswift1002'] [production]
22:41 <pt1979@cumin2002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['cloudswift1001'] [production]
22:23 <pt1979@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['cloudswift1001'] [production]
21:56 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host cloudswift1002.mgmt.eqiad.wmnet with reboot policy FORCED [production]
21:45 <pt1979@cumin2002> START - Cookbook sre.hosts.provision for host cloudswift1002.mgmt.eqiad.wmnet with reboot policy FORCED [production]
21:45 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host cloudswift1001.mgmt.eqiad.wmnet with reboot policy FORCED [production]
21:10 <eevans@cumin1001> END (PASS) - Cookbook sre.swift.roll-restart-reboot-swift-thanos-proxies (exit_code=0) rolling restart_daemons on A:thanos-fe-eqiad [production]
21:07 <eevans@cumin1001> START - Cookbook sre.swift.roll-restart-reboot-swift-thanos-proxies rolling restart_daemons on A:thanos-fe-eqiad [production]
21:07 <jclark@cumin1001> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts db1225.eqiad.wmnet [production]
21:07 <eevans@cumin1001> END (PASS) - Cookbook sre.swift.roll-restart-reboot-swift-thanos-proxies (exit_code=0) rolling restart_daemons on A:thanos-fe-codfw [production]
21:06 <jclark@cumin1001> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts db1225.eqiad.wmnet [production]
21:05 <eevans@cumin1001> START - Cookbook sre.swift.roll-restart-reboot-swift-thanos-proxies rolling restart_daemons on A:thanos-fe-codfw [production]
20:58 <urbanecm@deploy1002> Finished scap: Backport for [[gerrit:919175|Personalized praise: Do not suggest users with Homepage disabled (T336300)]], [[gerrit:919176|Personalized praise: Do not suggest users with Homepage disabled (T336300)]] (duration: 07m 30s) [production]
20:52 <urbanecm@deploy1002> urbanecm: Backport for [[gerrit:919175|Personalized praise: Do not suggest users with Homepage disabled (T336300)]], [[gerrit:919176|Personalized praise: Do not suggest users with Homepage disabled (T336300)]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet [production]
20:51 <urbanecm@deploy1002> Started scap: Backport for [[gerrit:919175|Personalized praise: Do not suggest users with Homepage disabled (T336300)]], [[gerrit:919176|Personalized praise: Do not suggest users with Homepage disabled (T336300)]] [production]
20:50 <urbanecm@deploy1002> Finished scap: Backport for [[gerrit:912310|[Growth] Remove config variables provided by extension]] (duration: 20m 04s) [production]
20:37 <denisse@cumin1001> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) for hosts prometheus4001.ulsfo.wment [production]
20:37 <denisse@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
20:36 <denisse@cumin1001> START - Cookbook sre.dns.netbox [production]
20:32 <denisse@cumin1001> START - Cookbook sre.hosts.decommission for hosts prometheus4001.ulsfo.wment [production]
20:31 <urbanecm@deploy1002> urbanecm: Backport for [[gerrit:912310|[Growth] Remove config variables provided by extension]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet [production]
20:30 <urbanecm@deploy1002> Started scap: Backport for [[gerrit:912310|[Growth] Remove config variables provided by extension]] [production]
20:24 <pt1979@cumin2002> START - Cookbook sre.hosts.provision for host cloudswift1001.mgmt.eqiad.wmnet with reboot policy FORCED [production]
20:22 <thcipriani@deploy1002> Finished scap: Backport for [[gerrit:919168|Allow http://localhost callback URL (T299737)]] (duration: 09m 37s) [production]
20:22 <denisse@cumin1001> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) for hosts prometheus3001.esams.wment [production]
20:22 <denisse@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
20:21 <denisse@cumin1001> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: prometheus3001.esams.wment decommissioned, removing all IPs except the asset tag one - denisse@cumin1001" [production]
20:20 <denisse@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: prometheus3001.esams.wment decommissioned, removing all IPs except the asset tag one - denisse@cumin1001" [production]
20:18 <denisse@cumin1001> START - Cookbook sre.dns.netbox [production]
20:17 <denisse> manually remove prometheus3001.esams.wmnet from the ganeti master after a failed step in the decommission cookbook. [production]
20:14 <denisse@cumin1001> START - Cookbook sre.hosts.decommission for hosts prometheus3001.esams.wment [production]
20:14 <thcipriani@deploy1002> bd808 and thcipriani: Backport for [[gerrit:919168|Allow http://localhost callback URL (T299737)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet [production]
20:12 <thcipriani@deploy1002> Started scap: Backport for [[gerrit:919168|Allow http://localhost callback URL (T299737)]] [production]
19:56 <denisse@cumin1001> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) for hosts prometheus3001.esams.wment [production]
19:56 <denisse@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
19:55 <denisse@cumin1001> START - Cookbook sre.dns.netbox [production]
19:51 <denisse@cumin1001> START - Cookbook sre.hosts.decommission for hosts prometheus3001.esams.wment [production]
19:06 <bking@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 14 days, 0:00:00 on wdqs2021.codfw.wmnet with reason: attempting WDQS stack on bullseye [production]
19:06 <bking@cumin1001> START - Cookbook sre.hosts.downtime for 14 days, 0:00:00 on wdqs2021.codfw.wmnet with reason: attempting WDQS stack on bullseye [production]
18:46 <ejegg> civicrm upgraded from d8a1a562 to db6e8d69 [production]
17:46 <stevemunene@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on an-airflow1006.eqiad.wmnet with reason: Silence error notifications/alerts during setup [production]
17:46 <stevemunene@cumin1001> START - Cookbook sre.hosts.downtime for 7 days, 0:00:00 on an-airflow1006.eqiad.wmnet with reason: Silence error notifications/alerts during setup [production]
17:24 <hnowlan@deploy1002> helmfile [eqiad] DONE helmfile.d/services/thumbor: sync [production]
17:12 <brennen@deploy1002> Synchronized php: group1 wikis to 1.41.0-wmf.8 refs T330214 (duration: 06m 14s) [production]
17:12 <hnowlan@deploy1002> helmfile [eqiad] START helmfile.d/services/thumbor: sync [production]
17:11 <bking@deploy1002> helmfile [eqiad] DONE helmfile.d/services/rdf-streaming-updater: apply [production]
17:10 <hnowlan@deploy1002> helmfile [codfw] DONE helmfile.d/services/thumbor: apply [production]
17:08 <bking@deploy1002> helmfile [eqiad] START helmfile.d/services/rdf-streaming-updater: apply [production]