2024-11-18
10:14 <fabfur@cumin1002> START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-ulsfo [production]
10:13 <elukey@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
10:13 <elukey@cumin2002> START - Cookbook sre.hosts.provision for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
09:47 <dcausse@deploy2002> helmfile [staging] DONE helmfile.d/services/rdf-streaming-updater: apply [production]
09:47 <dcausse@deploy2002> helmfile [staging] START helmfile.d/services/rdf-streaming-updater: apply [production]
09:42 <moritzm> restarting nginx on acmechief hosts to pick up openssl updates [production]
09:24 <moritzm> installing openssl security updates [production]
09:18 <elukey@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
09:17 <elukey@cumin1002> START - Cookbook sre.hosts.provision for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
08:57 <kartik@deploy2002> Finished scap sync-world: Backport for [[gerrit:1091932|Enable the Contribute menu in 2nd group of Wikis (T375300)]] (duration: 11m 45s) [production]
08:55 <ayounsi@cumin1002> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 40850 [production]
08:55 <ayounsi@cumin1002> START - Cookbook sre.network.peering with action 'email' for AS: 40850 [production]
08:53 <kartik@deploy2002> kartik: Continuing with sync [production]
08:49 <kartik@deploy2002> kartik: Backport for [[gerrit:1091932|Enable the Contribute menu in 2nd group of Wikis (T375300)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
08:45 <kartik@deploy2002> Started scap sync-world: Backport for [[gerrit:1091932|Enable the Contribute menu in 2nd group of Wikis (T375300)]] [production]
08:44 <elukey@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on registry1004.eqiad.wmnet with reason: testing [production]
08:44 <elukey@cumin1002> START - Cookbook sre.hosts.downtime for 0:30:00 on registry1004.eqiad.wmnet with reason: testing [production]
08:43 <kartik@deploy2002> Finished scap sync-world: Backport for [[gerrit:1091912|bjnwikiquote: Add local logo (T375054)]] (duration: 22m 55s) [production]
08:31 <kartik@deploy2002> kartik, hamishz: Continuing with sync [production]
08:30 <kartik@deploy2002> kartik, hamishz: Backport for [[gerrit:1091912|bjnwikiquote: Add local logo (T375054)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
08:20 <kartik@deploy2002> Started scap sync-world: Backport for [[gerrit:1091912|bjnwikiquote: Add local logo (T375054)]] [production]
08:07 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1021.eqiad.wmnet [production]
08:07 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1021.eqiad.wmnet [production]
08:05 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1021.eqiad.wmnet [production]
08:03 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1021.eqiad.wmnet [production]
08:01 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1021.eqiad.wmnet [production]
08:01 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1021.eqiad.wmnet [production]
07:56 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1021.eqiad.wmnet [production]
07:54 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1020.eqiad.wmnet [production]
07:52 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1020.eqiad.wmnet [production]
07:51 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1020.eqiad.wmnet [production]
07:47 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1020.eqiad.wmnet [production]
07:46 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5 days, 0:00:00 on pc1017.eqiad.wmnet with reason: T378068, host is not pooled [production]
07:46 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 5 days, 0:00:00 on pc1017.eqiad.wmnet with reason: T378068, host is not pooled [production]
07:46 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on pc1013.eqiad.wmnet with reason: T373037, host is not pooled [production]
07:46 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 7 days, 0:00:00 on pc1013.eqiad.wmnet with reason: T373037, host is not pooled [production]
06:31 <kart_> Updated MinT to 2024-10-16-065051-production on eqiad [production]
06:28 <kartik@deploy2002> helmfile [eqiad] DONE helmfile.d/services/machinetranslation: apply [production]
06:19 <kartik@deploy2002> helmfile [eqiad] START helmfile.d/services/machinetranslation: apply [production]
2024-11-17
16:41 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on db2216.codfw.wmnet with reason: Sad [production]
16:40 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 3 days, 0:00:00 on db2216.codfw.wmnet with reason: Sad [production]
16:35 <ladsgroup@cumin1002> dbctl commit (dc=all): 'db2216 sad', diff saved to https://phabricator.wikimedia.org/P71059 and previous config saved to /var/cache/conftool/dbconfig/20241117-163522-ladsgroup.json [production]
2024-11-16
20:30 <jclark@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host kafka-jumbo1017.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
20:29 <jclark@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host kafka-jumbo1016.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
20:29 <jclark@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host kafka-jumbo1018.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
18:09 <jclark@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
18:09 <jclark@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: added mgmt for wikikube-worker - jclark@cumin1002" [production]
18:08 <jclark@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: added mgmt for wikikube-worker - jclark@cumin1002" [production]
18:06 <jclark@cumin1002> START - Cookbook sre.hosts.provision for host an-worker1183.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
18:05 <jclark@cumin1002> START - Cookbook sre.dns.netbox [production]