2023-05-11
§
|
23:39 |
<mutante> |
LDAP - added uid lorenjohnson to groups wmde nda T335858 |
[production] |
23:39 |
<mutante> |
LDAP - added uid roti to groups wmde and nda T336435 |
[production] |
23:24 |
<rzl@cumin2002> |
conftool action : set/pooled=no; selector: cluster=videoscaler,name=mw14(3[789]|4[056]|57)\.eqiad\.wmnet |
[production] |
23:22 |
<rzl@cumin2002> |
conftool action : set/pooled=no; selector: cluster=jobrunner,name=mw14(5[89]|6[016789]|9[45])\.eqiad\.wmnet |
[production] |
23:22 |
<rzl@cumin2002> |
conftool action : set/pooled=no; selector: cluster=videoscaler,name=mw14(3[789]|4[056]57)\.eqiad\.wmnet |
[production] |
23:07 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['cloudswift1002'] |
[production] |
22:46 |
<pt1979@cumin2002> |
START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['cloudswift1002'] |
[production] |
22:41 |
<pt1979@cumin2002> |
END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['cloudswift1001'] |
[production] |
22:23 |
<pt1979@cumin2002> |
START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['cloudswift1001'] |
[production] |
21:56 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host cloudswift1002.mgmt.eqiad.wmnet with reboot policy FORCED |
[production] |
21:45 |
<pt1979@cumin2002> |
START - Cookbook sre.hosts.provision for host cloudswift1002.mgmt.eqiad.wmnet with reboot policy FORCED |
[production] |
21:45 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host cloudswift1001.mgmt.eqiad.wmnet with reboot policy FORCED |
[production] |
21:10 |
<eevans@cumin1001> |
END (PASS) - Cookbook sre.swift.roll-restart-reboot-swift-thanos-proxies (exit_code=0) rolling restart_daemons on A:thanos-fe-eqiad |
[production] |
21:07 |
<eevans@cumin1001> |
START - Cookbook sre.swift.roll-restart-reboot-swift-thanos-proxies rolling restart_daemons on A:thanos-fe-eqiad |
[production] |
21:07 |
<jclark@cumin1001> |
END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts db1225.eqiad.wmnet |
[production] |
21:07 |
<eevans@cumin1001> |
END (PASS) - Cookbook sre.swift.roll-restart-reboot-swift-thanos-proxies (exit_code=0) rolling restart_daemons on A:thanos-fe-codfw |
[production] |
21:06 |
<jclark@cumin1001> |
START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts db1225.eqiad.wmnet |
[production] |
21:05 |
<eevans@cumin1001> |
START - Cookbook sre.swift.roll-restart-reboot-swift-thanos-proxies rolling restart_daemons on A:thanos-fe-codfw |
[production] |
20:58 |
<urbanecm@deploy1002> |
Finished scap: Backport for [[gerrit:919175|Personalized praise: Do not suggest users with Homepage disabled (T336300)]], [[gerrit:919176|Personalized praise: Do not suggest users with Homepage disabled (T336300)]] (duration: 07m 30s) |
[production] |
20:52 |
<urbanecm@deploy1002> |
urbanecm: Backport for [[gerrit:919175|Personalized praise: Do not suggest users with Homepage disabled (T336300)]], [[gerrit:919176|Personalized praise: Do not suggest users with Homepage disabled (T336300)]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet |
[production] |
20:51 |
<urbanecm@deploy1002> |
Started scap: Backport for [[gerrit:919175|Personalized praise: Do not suggest users with Homepage disabled (T336300)]], [[gerrit:919176|Personalized praise: Do not suggest users with Homepage disabled (T336300)]] |
[production] |
20:50 |
<urbanecm@deploy1002> |
Finished scap: Backport for [[gerrit:912310|[Growth] Remove config variables provided by extension]] (duration: 20m 04s) |
[production] |
20:37 |
<denisse@cumin1001> |
END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) for hosts prometheus4001.ulsfo.wment |
[production] |
20:37 |
<denisse@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
20:36 |
<denisse@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
20:32 |
<denisse@cumin1001> |
START - Cookbook sre.hosts.decommission for hosts prometheus4001.ulsfo.wment |
[production] |
20:31 |
<urbanecm@deploy1002> |
urbanecm: Backport for [[gerrit:912310|[Growth] Remove config variables provided by extension]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet |
[production] |
20:30 |
<urbanecm@deploy1002> |
Started scap: Backport for [[gerrit:912310|[Growth] Remove config variables provided by extension]] |
[production] |
20:24 |
<pt1979@cumin2002> |
START - Cookbook sre.hosts.provision for host cloudswift1001.mgmt.eqiad.wmnet with reboot policy FORCED |
[production] |
20:22 |
<thcipriani@deploy1002> |
Finished scap: Backport for [[gerrit:919168|Allow http://localhost callback URL (T299737)]] (duration: 09m 37s) |
[production] |
20:22 |
<denisse@cumin1001> |
END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) for hosts prometheus3001.esams.wment |
[production] |
20:22 |
<denisse@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
20:21 |
<denisse@cumin1001> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: prometheus3001.esams.wment decommissioned, removing all IPs except the asset tag one - denisse@cumin1001" |
[production] |
20:20 |
<denisse@cumin1001> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: prometheus3001.esams.wment decommissioned, removing all IPs except the asset tag one - denisse@cumin1001" |
[production] |
20:18 |
<denisse@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
20:17 |
<denisse> |
manually remove prometheus3001.esams.wmnet from the ganeti master after a failed step in the decommission cookbook. |
[production] |
20:14 |
<denisse@cumin1001> |
START - Cookbook sre.hosts.decommission for hosts prometheus3001.esams.wment |
[production] |
20:14 |
<thcipriani@deploy1002> |
bd808 and thcipriani: Backport for [[gerrit:919168|Allow http://localhost callback URL (T299737)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet |
[production] |
20:12 |
<thcipriani@deploy1002> |
Started scap: Backport for [[gerrit:919168|Allow http://localhost callback URL (T299737)]] |
[production] |
19:56 |
<denisse@cumin1001> |
END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) for hosts prometheus3001.esams.wment |
[production] |
19:56 |
<denisse@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
19:55 |
<denisse@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
19:51 |
<denisse@cumin1001> |
START - Cookbook sre.hosts.decommission for hosts prometheus3001.esams.wment |
[production] |
19:06 |
<bking@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 14 days, 0:00:00 on wdqs2021.codfw.wmnet with reason: attempting WDQS stack on bullseye |
[production] |
19:06 |
<bking@cumin1001> |
START - Cookbook sre.hosts.downtime for 14 days, 0:00:00 on wdqs2021.codfw.wmnet with reason: attempting WDQS stack on bullseye |
[production] |
18:46 |
<ejegg> |
civicrm upgraded from d8a1a562 to db6e8d69 |
[production] |
17:46 |
<stevemunene@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on an-airflow1006.eqiad.wmnet with reason: Silence error notifications/alerts during setup |
[production] |
17:46 |
<stevemunene@cumin1001> |
START - Cookbook sre.hosts.downtime for 7 days, 0:00:00 on an-airflow1006.eqiad.wmnet with reason: Silence error notifications/alerts during setup |
[production] |
17:24 |
<hnowlan@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/thumbor: sync |
[production] |