2024-10-09 §
14:10 <jforrester@deploy2002> helmfile [eqiad] START helmfile.d/services/wikifunctions: apply [production]
14:09 <jforrester@deploy2002> helmfile [staging] START helmfile.d/services/wikifunctions: apply [production]
14:09 <jforrester@deploy2002> helmfile [staging] DONE helmfile.d/services/wikifunctions: apply [production]
14:09 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2033.codfw.wmnet [production]
14:08 <jhancock@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
14:08 <jhancock@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: adding cloudlb2004-dev to codfw - jhancock@cumin2002" [production]
14:08 <jhancock@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: adding cloudlb2004-dev to codfw - jhancock@cumin2002" [production]
14:08 <jforrester@deploy2002> helmfile [staging] START helmfile.d/services/wikifunctions: apply [production]
14:07 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host debmonitor1003.eqiad.wmnet [production]
14:07 <slyngshede@cumin1002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM idp2004.wikimedia.org [production]
14:06 <sukhe@cumin1002> cookbooks.sre.dns.roll-reboot finished rebooting dns1004.wikimedia.org [production]
14:05 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host debmonitor2003.codfw.wmnet [production]
14:05 <elukey@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host backup1012.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
14:04 <jclark@cumin1002> START - Cookbook sre.hosts.reimage for host backup1012.eqiad.wmnet with OS bookworm [production]
14:03 <slyngshede@cumin1002> START - Cookbook sre.ganeti.reboot-vm for VM idp2004.wikimedia.org [production]
14:02 <elukey@cumin1002> START - Cookbook sre.hosts.provision for host backup1012.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
14:02 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host debmonitor2003.codfw.wmnet [production]
14:01 <jhancock@cumin2002> START - Cookbook sre.dns.netbox [production]
14:01 <brouberol@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host flink-zk1002.eqiad.wmnet [production]
13:58 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1203', diff saved to https://phabricator.wikimedia.org/P69517 and previous config saved to /var/cache/conftool/dbconfig/20241009-135812-ladsgroup.json [production]
13:57 <brouberol@cumin1002> START - Cookbook sre.hosts.reboot-single for host flink-zk1002.eqiad.wmnet [production]
13:56 <brouberol@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host zookeeper-test1002.eqiad.wmnet [production]
13:55 <slyngshede@cumin1002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM idp-test2005.wikimedia.org [production]
13:54 <brouberol@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host flink-zk1003.eqiad.wmnet [production]
13:53 <jayme@deploy1003> helmfile [eqiad] START helmfile.d/admin 'apply'. [production]
13:53 <jayme@deploy1003> helmfile [codfw] DONE helmfile.d/admin 'apply'. [production]
13:53 <sukhe@cumin1002> cookbooks.sre.dns.roll-reboot begin reboot of dns1004.wikimedia.org [production]
13:52 <sukhe@cumin1002> START - Cookbook sre.dns.roll-reboot rolling reboot on A:dnsbox [production]
13:52 <brouberol@cumin1002> START - Cookbook sre.hosts.reboot-single for host zookeeper-test1002.eqiad.wmnet [production]
13:51 <jayme@deploy1003> helmfile [codfw] START helmfile.d/admin 'apply'. [production]
13:51 <slyngshede@cumin1002> START - Cookbook sre.ganeti.reboot-vm for VM idp-test2005.wikimedia.org [production]
13:51 <slyngshede@cumin1002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM idp-test2004.wikimedia.org [production]
13:50 <brouberol@cumin1002> START - Cookbook sre.hosts.reboot-single for host flink-zk1003.eqiad.wmnet [production]
13:50 <jynus@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on backup[1010-1011].eqiad.wmnet with reason: T376800 [production]
13:50 <jynus@cumin1002> START - Cookbook sre.hosts.downtime for 1:00:00 on backup[1010-1011].eqiad.wmnet with reason: T376800 [production]
13:49 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host cloudcephosd1028.eqiad.wmnet [production]
13:49 <brouberol@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host flink-zk1001.eqiad.wmnet [production]
13:48 <Lucas_WMDE> UTC afternoon backport+config window done [production]
13:48 <slyngshede@cumin1002> START - Cookbook sre.ganeti.reboot-vm for VM idp-test2004.wikimedia.org [production]
13:48 <lucaswerkmeister-wmde@deploy2002> Finished scap sync-world: Backport for [[gerrit:1078774|[brwikimedia] Enable the CampaignEvents extension (T376747)]] (duration: 07m 04s) [production]
13:48 <slyngshede@cumin1002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM idp1004.wikimedia.org [production]
13:45 <brouberol@cumin1002> START - Cookbook sre.hosts.reboot-single for host flink-zk1001.eqiad.wmnet [production]
13:45 <brouberol@cumin1002> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=99) for host flink-zk1001.eqiad.wmnet [production]
13:44 <brouberol@cumin1002> START - Cookbook sre.hosts.reboot-single for host flink-zk1001.eqiad.wmnet [production]
13:44 <lucaswerkmeister-wmde@deploy2002> albertoleoncio, lucaswerkmeister-wmde: Continuing with sync [production]
13:44 <slyngshede@cumin1002> START - Cookbook sre.ganeti.reboot-vm for VM idp1004.wikimedia.org [production]
13:43 <lucaswerkmeister-wmde@deploy2002> albertoleoncio, lucaswerkmeister-wmde: Backport for [[gerrit:1078774|[brwikimedia] Enable the CampaignEvents extension (T376747)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
13:43 <slyngshede@cumin1002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM idp-test1004.wikimedia.org [production]
13:43 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1203 (T367856)', diff saved to https://phabricator.wikimedia.org/P69516 and previous config saved to /var/cache/conftool/dbconfig/20241009-134305-ladsgroup.json [production]
13:42 <brouberol@cumin1002> END (ERROR) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=97) for Zookeeper A:zookeeper-flink-eqiad cluster: Roll restart of jvm daemons. [production]