| 2024-08-26
      
      ยง | 
    
  | 17:52 | <swfrench@deploy1003> | helmfile [codfw] DONE helmfile.d/services/eventstreams: apply | [production] | 
            
  | 17:51 | <swfrench@deploy1003> | helmfile [codfw] START helmfile.d/services/eventstreams: apply | [production] | 
            
  | 17:49 | <wmbot~dcaro@urcuchillay> | END (PASS) - Cookbook wmcs.toolforge.add_k8s_node (exit_code=0) for a worker-nfs role in the tools cluster | [tools] | 
            
  | 17:49 | <wmbot~dcaro@urcuchillay> | Added a new k8s worker-nfs tools-k8s-worker-nfs-62.tools.eqiad1.wikimedia.cloud to the cluster | [tools] | 
            
  | 17:43 | <ryankemper@cumin2002> | conftool action : set/pooled=yes:weight=10; selector: cluster=wdqs-main | [production] | 
            
  | 17:43 | <ryankemper@cumin2002> | conftool action : set/pooled=yes:weight=10; selector: cluster=wdqs-scholarly | [production] | 
            
  | 17:41 | <swfrench@deploy1003> | helmfile [staging] DONE helmfile.d/services/eventstreams-internal: apply | [production] | 
            
  | 17:41 | <swfrench@deploy1003> | helmfile [staging] START helmfile.d/services/eventstreams-internal: apply | [production] | 
            
  | 17:40 | <kamila@cumin1002> | END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host kubernetes2018.codfw.wmnet | [production] | 
            
  | 17:40 | <swfrench@deploy1003> | helmfile [staging] DONE helmfile.d/services/eventstreams: apply | [production] | 
            
  | 17:39 | <kamila@cumin1002> | START - Cookbook sre.k8s.pool-depool-node depool for host kubernetes2018.codfw.wmnet | [production] | 
            
  | 17:39 | <swfrench@deploy1003> | helmfile [staging] START helmfile.d/services/eventstreams: apply | [production] | 
            
  | 17:39 | <ryankemper> | T364364 Created PTR & A records for new graph split services `wdqs-main` and `wdqs-scholarly` (merged https://gerrit.wikimedia.org/r/c/operations/dns/+/1051446 and ran `sudo authdns-update` on `dns1004.wikimedia.org`) | [production] | 
            
  | 17:38 | <wmbot~dcaro@urcuchillay> | START - Cookbook wmcs.toolforge.add_k8s_node for a worker-nfs role in the tools cluster | [tools] | 
            
  | 17:38 | <wmbot~dcaro@urcuchillay> | END (PASS) - Cookbook wmcs.openstack.quota_increase (exit_code=0) | [tools] | 
            
  | 17:38 | <wmbot~dcaro@urcuchillay> | START - Cookbook wmcs.openstack.quota_increase | [tools] | 
            
  | 17:33 | <wmbot~dcaro@urcuchillay> | END (FAIL) - Cookbook wmcs.toolforge.add_k8s_node (exit_code=99) for a worker-nfs role in the tools cluster | [tools] | 
            
  | 17:33 | <wmbot~dcaro@urcuchillay> | START - Cookbook wmcs.toolforge.add_k8s_node for a worker-nfs role in the tools cluster | [tools] | 
            
  | 17:33 | <wmbot~dcaro@urcuchillay> | END (PASS) - Cookbook wmcs.openstack.quota_increase (exit_code=0) | [tools] | 
            
  | 17:33 | <wmbot~dcaro@urcuchillay> | START - Cookbook wmcs.openstack.quota_increase | [tools] | 
            
  | 17:30 | <wmbot~dcaro@urcuchillay> | END (FAIL) - Cookbook wmcs.toolforge.add_k8s_node (exit_code=99) for a worker-nfs role in the tools cluster | [tools] | 
            
  | 17:29 | <wmbot~dcaro@urcuchillay> | START - Cookbook wmcs.toolforge.add_k8s_node for a worker-nfs role in the tools cluster | [tools] | 
            
  | 17:23 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 16:00:00 on 11 hosts with reason: Maintenance | [production] | 
            
  | 17:23 | <ladsgroup@cumin1002> | START - Cookbook sre.hosts.downtime for 16:00:00 on 11 hosts with reason: Maintenance | [production] | 
            
  | 17:23 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2129.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 17:22 | <ladsgroup@cumin1002> | START - Cookbook sre.hosts.downtime for 8:00:00 on db2129.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 17:22 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2124 (T370903)', diff saved to https://phabricator.wikimedia.org/P67809 and previous config saved to /var/cache/conftool/dbconfig/20240826-172250-ladsgroup.json | [production] | 
            
  | 17:07 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2124', diff saved to https://phabricator.wikimedia.org/P67808 and previous config saved to /var/cache/conftool/dbconfig/20240826-170742-ladsgroup.json | [production] | 
            
  | 17:04 | <wmbot~dcaro@urcuchillay> | END (PASS) - Cookbook wmcs.toolforge.add_k8s_node (exit_code=0) for a worker-nfs role in the tools cluster | [tools] | 
            
  | 17:04 | <wmbot~dcaro@urcuchillay> | Added a new k8s worker-nfs tools-k8s-worker-nfs-61.tools.eqiad1.wikimedia.cloud to the cluster | [tools] | 
            
  | 16:54 | <cgoubert@cumin1002> | END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker2035.codfw.wmnet | [production] | 
            
  | 16:54 | <cgoubert@cumin1002> | START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker2035.codfw.wmnet | [production] | 
            
  | 16:54 | <wmbot~dcaro@urcuchillay> | START - Cookbook wmcs.toolforge.add_k8s_node for a worker-nfs role in the tools cluster | [tools] | 
            
  | 16:54 | <wmbot~dcaro@urcuchillay> | END (PASS) - Cookbook wmcs.toolforge.add_k8s_node (exit_code=0) for a worker-nfs role in the tools cluster | [tools] | 
            
  | 16:54 | <wmbot~dcaro@urcuchillay> | Added a new k8s worker-nfs tools-k8s-worker-nfs-60.tools.eqiad1.wikimedia.cloud to the cluster | [tools] | 
            
  | 16:53 | <claime> | homer 'lsw1-b8-codfw*' commit T372878 | [production] | 
            
  | 16:52 | <cgoubert@cumin1002> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker2035.codfw.wmnet with OS bullseye | [production] | 
            
  | 16:52 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2124', diff saved to https://phabricator.wikimedia.org/P67807 and previous config saved to /var/cache/conftool/dbconfig/20240826-165235-ladsgroup.json | [production] | 
            
  | 16:42 | <wmbot~dcaro@urcuchillay> | START - Cookbook wmcs.toolforge.add_k8s_node for a worker-nfs role in the tools cluster | [tools] | 
            
  | 16:37 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2124 (T370903)', diff saved to https://phabricator.wikimedia.org/P67806 and previous config saved to /var/cache/conftool/dbconfig/20240826-163728-ladsgroup.json | [production] | 
            
  | 16:32 | <cgoubert@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker2035.codfw.wmnet with reason: host reimage | [production] | 
            
  | 16:30 | <wmbot~dcaro@urcuchillay> | END (FAIL) - Cookbook wmcs.toolforge.add_k8s_node (exit_code=99) for a worker-nfs role in the tools cluster | [tools] | 
            
  | 16:30 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Depooling db2124 (T370903)', diff saved to https://phabricator.wikimedia.org/P67805 and previous config saved to /var/cache/conftool/dbconfig/20240826-163032-ladsgroup.json | [production] | 
            
  | 16:30 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2124.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 16:30 | <ladsgroup@cumin1002> | START - Cookbook sre.hosts.downtime for 8:00:00 on db2124.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 16:30 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2114.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 16:29 | <ladsgroup@cumin1002> | START - Cookbook sre.hosts.downtime for 8:00:00 on db2114.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 16:29 | <cgoubert@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker2035.codfw.wmnet with reason: host reimage | [production] | 
            
  | 16:28 | <claime> | homer 'cr*codfw*' commit 'T372878' | [production] | 
            
  | 16:26 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on dbstore1009.eqiad.wmnet with reason: Maintenance | [production] |