2021-09-27
§
|
09:29 |
<moritzm> |
systemctl reset-failed networking T273026 |
[production] |
09:29 |
<filippo@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host thanos-fe1001.eqiad.wmnet |
[production] |
09:27 |
<jbond@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on ldap-replica1004.wikimedia.org with reason: reboot - T291813 |
[production] |
09:27 |
<jbond@cumin1001> |
START - Cookbook sre.hosts.downtime for 0:10:00 on ldap-replica1004.wikimedia.org with reason: reboot - T291813 |
[production] |
09:27 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mx1001.wikimedia.org |
[production] |
09:24 |
<arturo> |
rebooting cloudcontrol1004 for T291446 |
[admin] |
09:24 |
<aborrero@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host cloudcontrol1004.wikimedia.org |
[production] |
09:23 |
<filippo@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host thanos-fe1001.eqiad.wmnet |
[production] |
09:22 |
<filippo@cumin1001> |
END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=99) for host thanos-fe1001.eqiad.wmnet |
[production] |
09:22 |
<filippo@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host thanos-fe1001.eqiad.wmnet |
[production] |
09:18 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host mx1001.wikimedia.org |
[production] |
09:17 |
<jbond@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on ldap-replica1003.wikimedia.org with reason: reboot - T291813 |
[production] |
09:17 |
<jbond@cumin1001> |
START - Cookbook sre.hosts.downtime for 0:10:00 on ldap-replica1003.wikimedia.org with reason: reboot - T291813 |
[production] |
09:13 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host failoid1002.eqiad.wmnet |
[production] |
09:10 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host failoid1002.eqiad.wmnet |
[production] |
09:09 |
<jbond@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on people1003.eqiad.wmnet with reason: reboot - T291813 |
[production] |
09:09 |
<jbond@cumin1001> |
START - Cookbook sre.hosts.downtime for 0:10:00 on people1003.eqiad.wmnet with reason: reboot - T291813 |
[production] |
09:07 |
<jbond@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on people2002.codfw.wmnet with reason: reboot - T291813 |
[production] |
09:07 |
<jbond@cumin1001> |
START - Cookbook sre.hosts.downtime for 0:10:00 on people2002.codfw.wmnet with reason: reboot - T291813 |
[production] |
09:07 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host failoid2002.codfw.wmnet |
[production] |
09:04 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host failoid2002.codfw.wmnet |
[production] |
08:35 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host copernicium.wikimedia.org |
[production] |
08:35 |
<dcausse@deploy1002> |
helmfile [staging] Ran 'sync' command on namespace 'rdf-streaming-updater' for release 'main' . |
[production] |
08:30 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host copernicium.wikimedia.org |
[production] |
08:27 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host theemin.codfw.wmnet |
[production] |
08:24 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host theemin.codfw.wmnet |
[production] |
08:14 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host sretest1002.eqiad.wmnet |
[production] |
08:07 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host sretest1002.eqiad.wmnet |
[production] |
07:18 |
<godog> |
swift eqiad-prod: add weight to ms-be10[64-67] - T290546 |
[production] |
07:11 |
<hashar> |
Switching python based jobs to docker-registry.wikimedia.org/releng/tox-buster:0.5.0 # T291292 |
[releng] |
07:07 |
<marostegui> |
Remove flaggedimages from s3 T290340 |
[production] |
06:13 |
<effie> |
rolling restart php-fpm in eqiad - T291052 |
[production] |
06:07 |
<effie> |
upgrade php7.2 in eqiad - T291052 |
[production] |
05:56 |
<marostegui> |
Drop labswiki from m5 T167973 |
[production] |
05:28 |
<marostegui> |
Remove flaggedimages from s2 T290340 |
[production] |
2021-09-25
§
|
17:13 |
<wm-bot> |
<lucaswerkmeister> cleaned up old replicasets: kubectl delete rs $(kubectl get rs -o json | jq -r ".items |.[] | select(.status.replicas == 0) | .metadata.name") |
[tools.notwikilambda] |
16:45 |
<wm-bot> |
<lucaswerkmeister> updated pygments-server to Python 3.9 (cef7948b15) |
[tools.notwikilambda] |
16:37 |
<wm-bot> |
<lucaswerkmeister> switched pygments-server from web to base image (25f8a10fa0) |
[tools.notwikilambda] |
16:36 |
<wm-bot> |
<lucaswerkmeister> switched function evaluator and orchestrator from web to base image (8424307b5d) |
[tools.notwikilambda] |
16:29 |
<wm-bot> |
<lucaswerkmeister> shortened periodSeconds of JS container probes from 15 to 5 seconds (3d658c10ba) |
[tools.notwikilambda] |
16:18 |
<wm-bot> |
<lucaswerkmeister> added --no-save to npm install in function evaluator and orchestrator (266c550663) and manually reset the package-lock.json in both repos |
[tools.notwikilambda] |
14:49 |
<wm-bot> |
<lucaswerkmeister> removed old venv-3.7 |
[tools.translate-link] |
14:48 |
<wm-bot> |
<lucaswerkmeister> removed old venv-3.7 |
[tools.ranker] |