2023-09-26
14:15 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1131 (T343198)', diff saved to https://phabricator.wikimedia.org/P52638 and previous config saved to /var/cache/conftool/dbconfig/20230926-141508-arnaudb.json [production]
14:13 <jbond@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on puppetserver2002.codfw.wmnet with reason: host reimage [production]
14:10 <jbond@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on puppetserver2002.codfw.wmnet with reason: host reimage [production]
14:08 <jclark@cumin1001> START - Cookbook sre.hosts.reimage for host wdqs1017.eqiad.wmnet with OS bullseye [production]
14:08 <jclark@cumin1001> START - Cookbook sre.hosts.reimage for host wdqs1021.eqiad.wmnet with OS bullseye [production]
14:05 <jbond@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on puppetserver1003.eqiad.wmnet with reason: host reimage [production]
14:02 <Lucas_WMDE> UTC afternoon backport+config window done [production]
14:02 <lucaswerkmeister-wmde@deploy2002> Finished scap: Backport for [[gerrit:961038|Enable Minerva site notice for wikifunctions wiki (T345463)]] (duration: 09m 51s) [production]
14:02 <jbond@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on puppetserver1003.eqiad.wmnet with reason: host reimage [production]
14:01 <eevans@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on restbase2021.codfw.wmnet with reason: host reimage [production]
13:57 <eevans@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on restbase2021.codfw.wmnet with reason: host reimage [production]
13:55 <lucaswerkmeister-wmde@deploy2002> ammarpad and lucaswerkmeister-wmde: Continuing with sync [production]
13:54 <lucaswerkmeister-wmde@deploy2002> ammarpad and lucaswerkmeister-wmde: Backport for [[gerrit:961038|Enable Minerva site notice for wikifunctions wiki (T345463)]] synced to the testservers mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet, and mw-debug kubernetes deployment (accessible via k8s-experimental XWD option) [production]
13:52 <lucaswerkmeister-wmde@deploy2002> Started scap: Backport for [[gerrit:961038|Enable Minerva site notice for wikifunctions wiki (T345463)]] [production]
13:51 <lucaswerkmeister-wmde@deploy2002> Finished scap: Backport for [[gerrit:961030|arwikisource: Increase autoconfirm edit count to 10 (T347264)]] (duration: 11m 27s) [production]
13:47 <Lucas_WMDE> lucaswerkmeister-wmde@deploy2002 ammarpad and lucaswerkmeister-wmde: Continuing with sync [originally 13:44 UTC] [production]
13:43 <eevans@cumin1001> START - Cookbook sre.hosts.reimage for host restbase2021.codfw.wmnet with OS bullseye [production]
13:43 <lucaswerkmeister-wmde@deploy2002> ammarpad and lucaswerkmeister-wmde: Backport for [[gerrit:961030|arwikisource: Increase autoconfirm edit count to 10 (T347264)]] synced to the testservers mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet, and mw-debug kubernetes deployment (accessible via k8s-experimental XWD option) [production]
13:43 <eevans@cumin1001> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for restbase2019.codfw.wmnet [production]
13:43 <eevans@cumin1001> START - Cookbook sre.hosts.remove-downtime for restbase2019.codfw.wmnet [production]
13:39 <lucaswerkmeister-wmde@deploy2002> Started scap: Backport for [[gerrit:961030|arwikisource: Increase autoconfirm edit count to 10 (T347264)]] [production]
13:37 <lucaswerkmeister-wmde@deploy2002> Finished scap: Backport for [[gerrit:960616|add search update pipeline streams (update + fetch_error) (T317609)]] (duration: 11m 54s) [production]
13:36 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2023.codfw.wmnet [production]
13:36 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2023.codfw.wmnet [production]
13:36 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2023.codfw.wmnet [production]
13:35 <jayme@deploy1002> helmfile [codfw] DONE helmfile.d/services/wikifunctions: sync [production]
13:35 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2023.codfw.wmnet [production]
13:34 <jayme@deploy1002> helmfile [codfw] START helmfile.d/services/wikifunctions: sync [production]
13:31 <eevans@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host restbase2019.codfw.wmnet with OS bullseye [production]
13:31 <lucaswerkmeister-wmde@deploy2002> pfischer and lucaswerkmeister-wmde: Continuing with sync [production]
13:29 <jbond@cumin1001> START - Cookbook sre.hosts.reimage for host puppetserver1003.eqiad.wmnet with OS bookworm [production]
13:27 <lucaswerkmeister-wmde@deploy2002> pfischer and lucaswerkmeister-wmde: Backport for [[gerrit:960616|add search update pipeline streams (update + fetch_error) (T317609)]] synced to the testservers mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, and mw-debug kubernetes deployment (accessible via k8s-experimental XWD option) [production]
13:25 <lucaswerkmeister-wmde@deploy2002> Started scap: Backport for [[gerrit:960616|add search update pipeline streams (update + fetch_error) (T317609)]] [production]
13:25 <jbond@cumin2002> START - Cookbook sre.hosts.reimage for host puppetserver2002.codfw.wmnet with OS bookworm [production]
13:25 <jbond@cumin1001> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) puppetserver2002.codfw.wmnet on all recursors [production]
13:25 <jbond@cumin1001> START - Cookbook sre.dns.wipe-cache puppetserver2002.codfw.wmnet on all recursors [production]
13:25 <jbond@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
13:25 <jbond@cumin1001> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: rename puppetmaster[12]004 - jbond@cumin1001" [production]
13:24 <jbond@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: rename puppetmaster[12]004 - jbond@cumin1001" [production]
13:23 <arnaudb@cumin1001> dbctl commit (dc=all): 'Depooling db1131 (T343198)', diff saved to https://phabricator.wikimedia.org/P52637 and previous config saved to /var/cache/conftool/dbconfig/20230926-132357-arnaudb.json [production]
13:23 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1131.eqiad.wmnet with reason: Maintenance [production]
13:23 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1131.eqiad.wmnet with reason: Maintenance [production]
13:22 <jbond@cumin1001> START - Cookbook sre.dns.netbox [production]
13:21 <lucaswerkmeister-wmde@deploy2002> Finished scap: Backport for [[gerrit:960066|Make wikifunctionswiki a multilingual Wikidata client (T342857)]] (duration: 09m 44s) [production]
13:18 <aokoth@cumin1001> START - Cookbook sre.gitlab.failover Failover of gitlab from gitlab2002.wikimedia.org to gitlab1003.wikimedia.org [production]
13:15 <jayme@deploy1002> helmfile [eqiad] DONE helmfile.d/services/wikifunctions: sync [production]
13:15 <lucaswerkmeister-wmde@deploy2002> lucaswerkmeister-wmde: Continuing with sync [production]
13:14 <jayme@deploy1002> helmfile [eqiad] START helmfile.d/services/wikifunctions: sync [production]
13:13 <lucaswerkmeister-wmde@deploy2002> lucaswerkmeister-wmde: Backport for [[gerrit:960066|Make wikifunctionswiki a multilingual Wikidata client (T342857)]] synced to the testservers mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, and mw-debug kubernetes deployment (accessible via k8s-experimental XWD option) [production]
13:11 <lucaswerkmeister-wmde@deploy2002> Started scap: Backport for [[gerrit:960066|Make wikifunctionswiki a multilingual Wikidata client (T342857)]] [production]