2024-08-28
|
15:23 |
<elukey@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/toolhub: sync |
[production] |
15:23 |
<elukey@deploy1003> |
helmfile [eqiad] START helmfile.d/services/toolhub: sync |
[production] |
15:22 |
<elukey@deploy1003> |
helmfile [codfw] DONE helmfile.d/services/toolhub: sync |
[production] |
15:22 |
<elukey@deploy1003> |
helmfile [codfw] START helmfile.d/services/toolhub: sync |
[production] |
13:45 |
<elukey@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/thumbor: sync |
[production] |
13:40 |
<elukey@deploy1003> |
helmfile [eqiad] START helmfile.d/services/thumbor: sync |
[production] |
13:36 |
<elukey@deploy1003> |
helmfile [codfw] DONE helmfile.d/services/thumbor: sync |
[production] |
13:31 |
<elukey@deploy1003> |
helmfile [codfw] START helmfile.d/services/thumbor: sync |
[production] |
13:10 |
<elukey@deploy1003> |
helmfile [staging] DONE helmfile.d/services/thumbor: sync |
[production] |
13:10 |
<elukey@deploy1003> |
helmfile [staging] START helmfile.d/services/thumbor: sync |
[production] |
2024-08-27
|
15:11 |
<elukey> |
restart httpd and librenms-syslog.service on netmon1003 for libaom upgrades |
[production] |
15:11 |
<elukey> |
restart httpd on crm2001 for libaom upgrades |
[production] |
15:02 |
<elukey@puppetserver1001> |
conftool action : set/pooled=yes; selector: name=wikikube-ctrl2003.codfw.wmnet |
[production] |
15:01 |
<elukey@cumin1002> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host wikikube-ctrl2003.mgmt.codfw.wmnet with reboot policy GRACEFUL |
[production] |
14:44 |
<elukey@cumin1002> |
START - Cookbook sre.hosts.provision for host wikikube-ctrl2003.mgmt.codfw.wmnet with reboot policy GRACEFUL |
[production] |
14:41 |
<elukey@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on wikikube-ctrl2003.codfw.wmnet with reason: running provision again |
[production] |
14:41 |
<elukey@cumin1002> |
START - Cookbook sre.hosts.downtime for 0:30:00 on wikikube-ctrl2003.codfw.wmnet with reason: running provision again |
[production] |
14:40 |
<elukey@puppetserver1001> |
conftool action : set/pooled=no; selector: name=wikikube-ctrl2003.codfw.wmnet |
[production] |
2024-08-08
|
16:29 |
<elukey> |
debmonitor-client 0.4.0 rolled out to all bullseye nodes |
[production] |
16:07 |
<elukey> |
on cumin1002 "sudo cumin -b 20 -p 95 'P{F:lsbdistcodename="bullseye"} and A:codfw' 'run-puppet-agent -q --failed-only'" |
[production] |
09:38 |
<elukey@cumin1002> |
END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:aqs-codfw: Openjdk upgrade - elukey@cumin1002 |
[production] |
09:24 |
<elukey> |
powercycle ml-serve2004 - host frozen, no ssh access, get sel shows "Multi-bit memory errors detected on a memory device at location(s) DIMM_A2." |
[production] |
08:19 |
<elukey> |
restart dump_ip_reputation.service on puppetserver1001 |
[production] |
08:13 |
<elukey> |
restart tomcat on idp[1,2]003 to pick up the new openjdk |
[production] |
08:09 |
<elukey@cumin1002> |
START - Cookbook sre.cassandra.roll-restart for nodes matching A:aqs-codfw: Openjdk upgrade - elukey@cumin1002 |
[production] |
2024-08-07
|
16:01 |
<elukey@cumin1002> |
END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:aqs-eqiad: Openjdk upgrade - elukey@cumin1002 |
[production] |
14:33 |
<elukey@cumin1002> |
START - Cookbook sre.cassandra.roll-restart for nodes matching A:aqs-eqiad: Openjdk upgrade - elukey@cumin1002 |
[production] |
14:01 |
<elukey> |
import Jenkins 2.462.1 on bullseye-wikimedia:thirdparty/ci |
[production] |
13:24 |
<elukey@cumin1002> |
END (PASS) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=0) for Zookeeper A:zookeeper-flink-codfw cluster: Roll restart of jvm daemons. |
[production] |
13:17 |
<elukey@cumin1002> |
START - Cookbook sre.zookeeper.roll-restart-zookeeper for Zookeeper A:zookeeper-flink-codfw cluster: Roll restart of jvm daemons. |
[production] |
13:15 |
<elukey@cumin1002> |
END (PASS) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=0) for Zookeeper A:zookeeper-flink-eqiad cluster: Roll restart of jvm daemons. |
[production] |
08:31 |
<elukey> |
openjdk-11 upgrades for bullseye rolled out to prod |
[production] |