2023-09-13
|
16:40 |
<fnegri@cloudcumin1001> |
START - Cookbook wmcs.openstack.cloudnet.reboot_node (T345811) |
[admin] |
16:40 |
<wm-bot2> |
dcaro@urcuchillay START - Cookbook wmcs.toolforge.k8s.component.deploy for component maintain-kubeusersNone (T341084) |
[toolsbeta] |
16:34 |
<denisse@deploy1002> |
Finished deploy [librenms/librenms@f049593]: Upgrade LibreNMS to 23.8.2 - T344136 (duration: 00m 16s) |
[production] |
16:34 |
<denisse@deploy1002> |
Started deploy [librenms/librenms@f049593]: Upgrade LibreNMS to 23.8.2 - T344136 |
[production] |
16:04 |
<aokoth@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 30 days, 0:00:00 on vrts1002.eqiad.wmnet with reason: Testing |
[production] |
16:04 |
<aokoth@cumin1001> |
START - Cookbook sre.hosts.downtime for 30 days, 0:00:00 on vrts1002.eqiad.wmnet with reason: Testing |
[production] |
16:04 |
<denisse@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host netmon2002.wikimedia.org with OS bookworm |
[production] |
15:53 |
<wm-bot> |
<samtar> webservice restart |
[tools.refill-api] |
15:41 |
<denisse@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on netmon2002.wikimedia.org with reason: host reimage |
[production] |
15:38 |
<denisse@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on netmon2002.wikimedia.org with reason: host reimage |
[production] |
15:34 |
<hnowlan@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/rest-gateway: apply |
[production] |
15:34 |
<hnowlan@deploy1002> |
helmfile [codfw] START helmfile.d/services/rest-gateway: apply |
[production] |
15:31 |
<hnowlan@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/rest-gateway: apply |
[production] |
15:29 |
<hnowlan@deploy1002> |
helmfile [eqiad] START helmfile.d/services/rest-gateway: apply |
[production] |
15:26 |
<jayme> |
re-enabled puppet on all k8s control planes |
[production] |
15:26 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.aqs.roll-restart-reboot (exit_code=0) rolling reboot on A:aqs-codfw |
[production] |
15:19 |
<denisse@cumin1001> |
START - Cookbook sre.hosts.reimage for host netmon2002.wikimedia.org with OS bookworm |
[production] |
15:19 |
<denisse> |
Start reimage of netmon2002 |
[production] |
15:17 |
<denisse> |
Starting LibreNMS upgrade in codfw. |
[production] |
15:14 |
<jgiannelos@deploy1002> |
helmfile [eqiad] START helmfile.d/services/wikifeeds: apply |
[production] |
15:04 |
<jayme> |
stopped puppet on all k8s control planes for 956842 rollout |
[production] |
15:01 |
<akosiaris@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/machinetranslation: apply |
[production] |
15:01 |
<hnowlan> |
repooling cp2037 and enabling puppet on A:cp |
[production] |
14:56 |
<akosiaris@deploy1002> |
helmfile [codfw] START helmfile.d/services/machinetranslation: apply |
[production] |
14:55 |
<akosiaris@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/machinetranslation: apply |
[production] |
14:52 |
<hnowlan> |
disable puppet on A:cp |
[production] |
14:51 |
<hnowlan> |
depooled service=ats-be,name=cp2037.codfw.wmnet |
[production] |
14:51 |
<jayme> |
updated kubernetes-* packages fleet wide to 1.23.14-3 - T329826 |
[production] |
14:50 |
<akosiaris@deploy1002> |
helmfile [eqiad] START helmfile.d/services/machinetranslation: apply |
[production] |
14:41 |
<akosiaris@deploy1002> |
helmfile [staging] DONE helmfile.d/services/machinetranslation: apply |
[production] |
14:39 |
<akosiaris@deploy1002> |
helmfile [staging] START helmfile.d/services/machinetranslation: apply |
[production] |
14:36 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on sretest1001.eqiad.wmnet with reason: WIP towards puppetised nftables firewall |
[production] |
14:36 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on sretest1001.eqiad.wmnet with reason: WIP towards puppetised nftables firewall |
[production] |
14:31 |
<ayounsi@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host sretest1001.eqiad.wmnet with OS bullseye |
[production] |
14:29 |
<bking@deploy1002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/rdf-streaming-updater: apply |
[production] |
14:29 |
<bking@deploy1002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/rdf-streaming-updater: apply |
[production] |
14:26 |
<bking@deploy1002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/rdf-streaming-updater: apply |
[production] |
14:25 |
<bking@deploy1002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/rdf-streaming-updater: apply |
[production] |
14:17 |
<bking@deploy1002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/rdf-streaming-updater: apply |
[production] |
14:17 |
<bking@deploy1002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/rdf-streaming-updater: apply |
[production] |
14:10 |
<bking@deploy1002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/rdf-streaming-updater: apply |
[production] |
14:10 |
<bking@deploy1002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/rdf-streaming-updater: apply |
[production] |
14:08 |
<hnowlan> |
stopping cassandra on restbase1030-c |
[production] |
13:52 |
<jmm@cumin2002> |
START - Cookbook sre.aqs.roll-restart-reboot rolling reboot on A:aqs-codfw |
[production] |
13:34 |
<Lucas_WMDE> |
UTC afternoon backport+config window done |
[production] |
13:34 |
<lucaswerkmeister-wmde@deploy1002> |
Finished scap: Backport for [[gerrit:956818|rdbms: Use `debugSql` instead of `debugDumpSql` which is unuset (T318272)]] (duration: 15m 42s) |
[production] |
13:27 |
<lucaswerkmeister-wmde@deploy1002> |
lucaswerkmeister-wmde and d3r1ck01: Continuing with sync |
[production] |
13:20 |
<lucaswerkmeister-wmde@deploy1002> |
lucaswerkmeister-wmde and d3r1ck01: Backport for [[gerrit:956818|rdbms: Use `debugSql` instead of `debugDumpSql` which is unuset (T318272)]] synced to the testservers mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, and mw-debug kubernetes deployment (accessible via k8s-experimental XWD option) |
[production] |
13:18 |
<lucaswerkmeister-wmde@deploy1002> |
Started scap: Backport for [[gerrit:956818|rdbms: Use `debugSql` instead of `debugDumpSql` which is unuset (T318272)]] |
[production] |
12:51 |
<wm-bot2> |
dcaro@urcuchillay END (FAIL) - Cookbook wmcs.toolforge.k8s.component.deploy (exit_code=99) for component maintain-kubeusersNone (T341084) |
[tools] |