2351-2400 of 10000 results (138ms)
2024-11-06 ยง
14:20 <sukhe@puppetserver1001> conftool action : set/pooled=no; selector: name=cp2031.codfw.wmnet [production]
14:19 <sukhe> depool cp2031 [production]
14:19 <lucaswerkmeister-wmde@deploy2002> hamishz, lucaswerkmeister-wmde: Backport for [[gerrit:1085572|Cleanup for logo related file]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
14:19 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti1045.eqiad.wmnet [production]
14:16 <lucaswerkmeister-wmde@deploy2002> Started scap sync-world: Backport for [[gerrit:1085572|Cleanup for logo related file]] [production]
14:16 <jmm@cumin2002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host ganeti1045 [production]
14:14 <jmm@cumin2002> START - Cookbook sre.network.configure-switch-interfaces for host ganeti1045 [production]
14:02 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.dns.admin (exit_code=0) DNS admin: depool site eqiad for service: ncredir-addrs [reason: no reason specified, T378453] [production]
14:02 <vgutierrez@cumin1002> START - Cookbook sre.dns.admin DNS admin: depool site eqiad for service: ncredir-addrs [reason: no reason specified, T378453] [production]
13:52 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1014.eqiad.wmnet [production]
13:52 <jmm@cumin2002> END (FAIL) - Cookbook sre.ganeti.addnode (exit_code=99) for new host ganeti1044.eqiad.wmnet to cluster eqiad and group B [production]
13:47 <jmm@cumin2002> START - Cookbook sre.ganeti.addnode for new host ganeti1044.eqiad.wmnet to cluster eqiad and group B [production]
13:44 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of dse-k8s-etcd1002.eqiad.wmnet to plain [production]
13:43 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-test-k8s: apply [production]
13:42 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-test-k8s: apply [production]
13:41 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of dse-k8s-etcd1002.eqiad.wmnet to plain [production]
13:28 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1014.eqiad.wmnet [production]
13:27 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1014.eqiad.wmnet [production]
13:27 <jmm@cumin2002> END (FAIL) - Cookbook sre.ganeti.drain-node (exit_code=99) for draining ganeti node ganeti1041.eqiad.wmnet [production]
13:27 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1041.eqiad.wmnet [production]
13:08 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of dse-k8s-etcd1002.eqiad.wmnet to drbd [production]
13:02 <arnaudb@cumin1002> START - Cookbook sre.mysql.clone of db2136.codfw.wmnet onto db2236.codfw.wmnet [production]
12:58 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of dse-k8s-etcd1002.eqiad.wmnet to drbd [production]
12:56 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of ml-etcd1001.eqiad.wmnet to plain [production]
12:56 <arnaudb@cumin1002> dbctl commit (dc=all): 'Cloning db2136 in db2236 for T373579', diff saved to https://phabricator.wikimedia.org/P70964 and previous config saved to /var/cache/conftool/dbconfig/20241106-125648-arnaudb.json [production]
12:55 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of ml-etcd1001.eqiad.wmnet to plain [production]
12:55 <arnaudb@cumin1002> END (PASS) - Cookbook sre.mysql.depool (exit_code=0) db2136 - depooling db2136 to clone on db2236 [production]
12:55 <arnaudb@cumin1002> START - Cookbook sre.mysql.depool db2136 - depooling db2136 to clone on db2236 [production]
12:55 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2236.codfw.wmnet with reason: provisionning db2236.codfw.wmnet - T373579 [production]
12:54 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2236.codfw.wmnet with reason: provisionning db2236.codfw.wmnet - T373579 [production]
12:54 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2136.codfw.wmnet with reason: provisionning db2236.codfw.wmnet - T373579 [production]
12:54 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2136.codfw.wmnet with reason: provisionning db2236.codfw.wmnet - T373579 [production]
12:52 <slyngs> IDP/CAS-SSO Enable Redis TGT backend [production]
12:52 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1014.eqiad.wmnet [production]
12:52 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1014.eqiad.wmnet [production]
12:50 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of ml-etcd1001.eqiad.wmnet to drbd [production]
12:41 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of ml-etcd1001.eqiad.wmnet to drbd [production]
12:40 <arnaudb@cumin1002> END (PASS) - Cookbook sre.mysql.pool (exit_code=0) db1206 quickly with 2 steps - test 1087895 [production]
12:25 <arnaudb@cumin1002> START - Cookbook sre.mysql.pool db1206 quickly with 2 steps - test 1087895 [production]
12:23 <arnaudb@cumin1002> dbctl commit (dc=all): 'db1206 depool to test cookbook hotfix on CR 1087895', diff saved to https://phabricator.wikimedia.org/P70960 and previous config saved to /var/cache/conftool/dbconfig/20241106-122348-arnaudb.json [production]
12:23 <marostegui> Migrate db1125 to MariaDB 10.6.20 T378940 [production]
12:23 <arnaudb@cumin1002> dbctl commit (dc=all): '"db1206 pending"', diff saved to https://phabricator.wikimedia.org/P70959 and previous config saved to /var/cache/conftool/dbconfig/20241106-122318-arnaudb.json [production]
12:21 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 6:00:00 on db2230.codfw.wmnet with reason: testing [production]
12:21 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 6:00:00 on db2230.codfw.wmnet with reason: testing [production]
12:21 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 6:00:00 on db1125.eqiad.wmnet with reason: testing [production]
12:21 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 6:00:00 on db1125.eqiad.wmnet with reason: testing [production]
12:09 <arnaudb@cumin1002> END (FAIL) - Cookbook sre.mysql.pool (exit_code=99) db1206 quickly with 2 steps - repool [production]
12:09 <arnaudb@cumin1002> START - Cookbook sre.mysql.pool db1206 quickly with 2 steps - repool [production]
12:06 <mvolz@deploy2002> helmfile [eqiad] DONE helmfile.d/services/citoid: apply [production]
12:06 <mvolz@deploy2002> helmfile [eqiad] START helmfile.d/services/citoid: apply [production]