4401-4450 of 10000 results (90ms)
2023-07-10 §
08:31 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1030.eqiad.wmnet [production]
08:27 <kartik@deploy1002> helmfile [staging] DONE helmfile.d/services/cxserver: apply [production]
08:27 <kartik@deploy1002> helmfile [staging] START helmfile.d/services/cxserver: apply [production]
08:25 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti1030.eqiad.wmnet [production]
08:24 <claime> Running puppet on cp-text hosts - T337489 [production]
08:11 <hashar> UTC morning backport window completed. [production]
08:11 <hashar@deploy1002> Finished scap: Backport for [[gerrit:934614|Deploy action blocks on bnwiki (T340904)]] (duration: 08m 15s) [production]
08:04 <moritzm> installing c-ares security updates on buster [production]
08:04 <hashar@deploy1002> hashar and mdsshakil: Backport for [[gerrit:934614|Deploy action blocks on bnwiki (T340904)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet [production]
08:02 <hashar@deploy1002> Started scap: Backport for [[gerrit:934614|Deploy action blocks on bnwiki (T340904)]] [production]
08:02 <hashar@deploy1002> Finished scap: Backport for [[gerrit:935876|thwiki: Update logos from commons (T341407)]] (duration: 25m 32s) [production]
08:00 <moritzm> installing flask security updates on bullseye [production]
07:58 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1030.eqiad.wmnet [production]
07:54 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1029.eqiad.wmnet [production]
07:54 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1029.eqiad.wmnet [production]
07:47 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti1029.eqiad.wmnet [production]
07:45 <hashar@deploy1002> func and hashar: Backport for [[gerrit:935876|thwiki: Update logos from commons (T341407)]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet [production]
07:36 <hashar@deploy1002> Started scap: Backport for [[gerrit:935876|thwiki: Update logos from commons (T341407)]] [production]
07:30 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1029.eqiad.wmnet [production]
07:30 <elukey@deploy1002> helmfile [eqiad] DONE helmfile.d/services/changeprop-jobqueue: sync [production]
07:30 <moritzm> installing libgstreamer-plugins-base1.0-0 security updates [production]
07:29 <elukey@deploy1002> helmfile [eqiad] START helmfile.d/services/changeprop-jobqueue: sync [production]
07:29 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1027.eqiad.wmnet [production]
07:29 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1027.eqiad.wmnet [production]
07:22 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti1027.eqiad.wmnet [production]
07:22 <elukey@deploy1002> helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: sync [production]
07:21 <elukey@deploy1002> helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: sync [production]
07:21 <hashar> deploy1002: removed empty untracked directory from MediaWiki staging area: `rmdir /srv/mediawiki-staging/wmf-config/scap/log/ && rmdir /srv/mediawiki-staging/wmf-config/scap/` | T341292 [production]
07:20 <elukey@deploy1002> helmfile [staging] DONE helmfile.d/services/changeprop-jobqueue: sync [production]
07:20 <elukey@deploy1002> helmfile [staging] START helmfile.d/services/changeprop-jobqueue: sync [production]
07:02 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1027.eqiad.wmnet [production]
07:01 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1026.eqiad.wmnet [production]
07:01 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1026.eqiad.wmnet [production]
06:55 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti1026.eqiad.wmnet [production]
06:45 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1026.eqiad.wmnet [production]
06:43 <godog> add 100G to prometheus/k8s in codfw [production]
01:06 <rzl@deploy1002> helmfile [staging] DONE helmfile.d/services/opentelemetry-collector: apply [production]
01:06 <rzl@deploy1002> helmfile [staging] START helmfile.d/services/opentelemetry-collector: apply [production]
2023-07-09 §
14:51 <apergos> swapped dumpsdata1003 in as the new nfs share for misc dumps; dumpsdata1002 is now a spare, to be decommissioned. 1003 is running bullseye. [production]
04:04 <apergos> rsync misc dumps output files from dumpsdata1002 to 1003, in ariel screen session on 1003, bwlimit to 1G [production]
2023-07-08 §
03:21 <rzl@deploy1002> helmfile [staging] START helmfile.d/services/opentelemetry-collector: apply [production]
2023-07-07 §
22:55 <rzl@deploy1002> helmfile [staging] DONE helmfile.d/services/opentelemetry-collector: apply [production]
22:55 <rzl@deploy1002> helmfile [staging] START helmfile.d/services/opentelemetry-collector: apply [production]
22:41 <rzl@deploy1002> helmfile [staging] DONE helmfile.d/services/opentelemetry-collector: apply [production]
22:21 <rzl@deploy1002> helmfile [staging] START helmfile.d/services/opentelemetry-collector: apply [production]
22:04 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-worker1156.eqiad.wmnet with OS bullseye [production]
21:59 <rzl@deploy1002> helmfile [staging] START helmfile.d/services/opentelemetry-collector: apply [production]
21:24 <bking@deploy1002> Finished deploy [wdqs/wdqs@dff41b7]: 0.3.124 (duration: 00m 57s) [production]
21:23 <bking@deploy1002> Started deploy [wdqs/wdqs@dff41b7]: 0.3.124 [production]
21:23 <bking@cumin1001> END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) [production]