951-1000 of 10000 results (56ms)
2024-07-17 ยง
09:07 <sstefanova@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.k8s.component.deploy (exit_code=0) for component jobs-api [tools]
09:06 <sstefanova@cloudcumin1001> START - Cookbook wmcs.toolforge.k8s.component.deploy for component jobs-api [tools]
09:02 <elukey@puppetserver1001> conftool action : set/pooled=yes; selector: name=cp4037.ulsfo.wmnet [production]
08:58 <marostegui@cumin1002> dbctl commit (dc=all): 'db1181 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P66705 and previous config saved to /var/cache/conftool/dbconfig/20240717-085857-root.json [production]
08:57 <elukey@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cp4037.ulsfo.wmnet [production]
08:54 <sstefanova@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.k8s.component.deploy (exit_code=0) for component jobs-api [toolsbeta]
08:54 <sstefanova@cloudcumin1001> START - Cookbook wmcs.toolforge.k8s.component.deploy for component jobs-api [toolsbeta]
08:54 <sstefanova@cloudcumin1001> END (FAIL) - Cookbook wmcs.toolforge.k8s.component.deploy (exit_code=99) for component jobs-api [toolsbeta]
08:54 <sstefanova@cloudcumin1001> START - Cookbook wmcs.toolforge.k8s.component.deploy for component jobs-api [toolsbeta]
08:48 <elukey@cumin1002> START - Cookbook sre.hosts.reboot-single for host cp4037.ulsfo.wmnet [production]
08:47 <elukey@puppetserver1001> conftool action : set/pooled=no; selector: name=cp4037.ulsfo.wmnet [production]
08:45 <btullis> stopping mariadb section 1-8 on clouddb1021 for T368518 [analytics]
08:43 <marostegui@cumin1002> dbctl commit (dc=all): 'db1181 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P66704 and previous config saved to /var/cache/conftool/dbconfig/20240717-084351-root.json [production]
08:26 <aborrero@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.k8s.component.deploy (exit_code=0) for component ingress-nginx [tools]
08:26 <aborrero@cloudcumin1001> START - Cookbook wmcs.toolforge.k8s.component.deploy for component ingress-nginx [tools]
08:23 <aborrero@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.k8s.component.deploy (exit_code=0) for component ingress-nginx [toolsbeta]
08:22 <aborrero@cloudcumin1001> START - Cookbook wmcs.toolforge.k8s.component.deploy for component ingress-nginx [toolsbeta]
08:20 <aborrero@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.k8s.component.deploy (exit_code=0) for component ingress-nginx [toolsbeta]
08:20 <aborrero@cloudcumin1001> START - Cookbook wmcs.toolforge.k8s.component.deploy for component ingress-nginx [toolsbeta]
08:14 <aborrero@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.k8s.component.deploy (exit_code=0) for component ingress-nginx [toolsbeta]
08:14 <aborrero@cloudcumin1001> START - Cookbook wmcs.toolforge.k8s.component.deploy for component ingress-nginx [toolsbeta]
08:10 <wmbot~dcaro@urcuchillay> START - Cookbook wmcs.ceph.osd.bootstrap_and_add [admin]
08:09 <wmbot~dcaro@urcuchillay> END (PASS) - Cookbook wmcs.ceph.osd.depool_and_destroy (exit_code=0) [admin]
08:06 <kartik@deploy1002> Finished scap: Backport for [[gerrit:1054699|TranslatablePageState: Check if banner namespaces are configured (T370219)]] (duration: 14m 26s) [production]
08:00 <kartik@deploy1002> abi, kartik: Continuing with sync [production]
07:54 <kartik@deploy1002> abi, kartik: Backport for [[gerrit:1054699|TranslatablePageState: Check if banner namespaces are configured (T370219)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
07:51 <kartik@deploy1002> Started scap sync-world: Backport for [[gerrit:1054699|TranslatablePageState: Check if banner namespaces are configured (T370219)]] [production]
07:50 <jayme@deploy1002> helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'. [production]
07:50 <jayme@deploy1002> helmfile [staging-eqiad] START helmfile.d/admin 'apply'. [production]
07:50 <jayme@deploy1002> helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. [production]
07:49 <elukey> restart hadoop-mapreduce-historyserver.service on an-master1003 - failed for Java OOM [production]
07:49 <jayme@deploy1002> helmfile [staging-codfw] START helmfile.d/admin 'apply'. [production]
07:39 <wmbot~dcaro@urcuchillay> START - Cookbook wmcs.ceph.osd.depool_and_destroy [admin]
07:39 <wmbot~dcaro@urcuchillay> END (PASS) - Cookbook wmcs.ceph.osd.bootstrap_and_add (exit_code=0) [admin]
07:38 <elukey@cumin1002> END (PASS) - Cookbook sre.network.tls (exit_code=0) for network device lsw1-d1-codfw [production]
07:37 <jayme> imported helm3 3.11.3 to bullseye-wikimedia and buster-wikimedia [production]
07:36 <elukey@cumin1002> START - Cookbook sre.network.tls for network device lsw1-d1-codfw [production]
06:48 <ayounsi@cumin1002> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'clear' for AS: 17072 [production]
06:48 <ayounsi@cumin1002> START - Cookbook sre.network.peering with action 'clear' for AS: 17072 [production]
05:40 <sstefanova@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.quota_increase (exit_code=0) [catalyst]
05:40 <sstefanova@cloudcumin1001> START - Cookbook wmcs.openstack.quota_increase [catalyst]
05:36 <marostegui> Deploy schema change on s7 eqiad db1181 dbmaint T367856 [production]
05:35 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1181.eqiad.wmnet with reason: Long schema change [production]
05:35 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 12:00:00 on db1181.eqiad.wmnet with reason: Long schema change [production]
05:34 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool db1181 T370121', diff saved to https://phabricator.wikimedia.org/P66703 and previous config saved to /var/cache/conftool/dbconfig/20240717-053359-marostegui.json [production]
05:33 <marostegui@cumin1002> dbctl commit (dc=all): 'Promote db1236 to s7 primary and set section read-write T370121', diff saved to https://phabricator.wikimedia.org/P66702 and previous config saved to /var/cache/conftool/dbconfig/20240717-053302-root.json [production]
05:32 <marostegui@cumin1002> dbctl commit (dc=all): 'Set s7 eqiad as read-only for maintenance - T370121', diff saved to https://phabricator.wikimedia.org/P66701 and previous config saved to /var/cache/conftool/dbconfig/20240717-053230-root.json [production]
05:32 <marostegui> Starting s7 eqiad failover from db1181 to db1236 - T370121 [production]
05:14 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 27 hosts with reason: Primary switchover s7 T370121 [production]
05:14 <marostegui@cumin1002> dbctl commit (dc=all): 'Set db1236 with weight 0 T370121', diff saved to https://phabricator.wikimedia.org/P66700 and previous config saved to /var/cache/conftool/dbconfig/20240717-051419-root.json [production]