6101-6150 of 10000 results (119ms)
2023-11-03 §
11:13 <jynus@cumin1001> START - Cookbook sre.hosts.reimage for host backup1010.eqiad.wmnet with OS bookworm [production]
11:08 <fnegri@cumin1001> START - Cookbook sre.hosts.reimage for host cloudnet1006.eqiad.wmnet with OS bookworm [production]
11:07 <jynus@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host backup1011.eqiad.wmnet with OS bookworm [production]
10:53 <jynus@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on backup1011.eqiad.wmnet with reason: host reimage [production]
10:50 <jynus@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on backup1011.eqiad.wmnet with reason: host reimage [production]
10:34 <jynus@cumin1001> START - Cookbook sre.hosts.reimage for host backup1011.eqiad.wmnet with OS bookworm [production]
10:08 <mvernon@cumin1001> END (PASS) - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies (exit_code=0) rolling restart_daemons on A:swift-fe [production]
09:59 <mvernon@cumin1001> START - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies rolling restart_daemons on A:swift-fe [production]
09:59 <Emperor> roll-restart swift frontends [production]
04:01 <eileen> civicrm upgraded from 84ec2957 to 5be02f1b [production]
01:23 <thcipriani@deploy2002> Finished scap: Backport for [[gerrit:971287|Disable namespaceDupes.php for now (T350443)]] (duration: 10m 29s) [production]
01:18 <thcipriani@deploy2002> thcipriani: Continuing with sync [production]
01:14 <thcipriani@deploy2002> thcipriani: Backport for [[gerrit:971287|Disable namespaceDupes.php for now (T350443)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
01:13 <thcipriani@deploy2002> Started scap: Backport for [[gerrit:971287|Disable namespaceDupes.php for now (T350443)]] [production]
2023-11-02 §
22:31 <Amir1> killed update collation on s5 [production]
22:13 <brett> import acme-chief 0.36-2 into bookworm-wikimedia repo [production]
21:22 <inflatador> bking@cumin2002 enabling elastic snapshots on eqiad clusters T348686 [production]
20:32 <mabualruz@deploy2002> Finished scap: Backport for [[gerrit:971244|Enable native math rendering mode on testwiki (T311620)]] (duration: 14m 06s) [production]
20:27 <mabualruz@deploy2002> mabualruz and physikerwelt: Continuing with sync [production]
20:20 <mabualruz@deploy2002> mabualruz and physikerwelt: Backport for [[gerrit:971244|Enable native math rendering mode on testwiki (T311620)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
20:18 <mabualruz@deploy2002> Started scap: Backport for [[gerrit:971244|Enable native math rendering mode on testwiki (T311620)]] [production]
20:04 <cmooney@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
20:04 <cmooney@cumin1001> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update DNS entries for codfw CR IPs moved to new interfaces. - cmooney@cumin1001" [production]
20:01 <cmooney@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update DNS entries for codfw CR IPs moved to new interfaces. - cmooney@cumin1001" [production]
19:59 <cmooney@cumin1001> START - Cookbook sre.dns.netbox [production]
18:52 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host doh1001.wikimedia.org with OS bookworm [production]
18:46 <topranks> shutting down uplink from asw-b-codfw et-2/0/51 to cr1-codfw in advance of cable move (T347191) [production]
18:44 <topranks> Making cr2-codfw VRRP Master for row B traffic over new link from ssw1-a8-codfw (T347191) [production]
18:35 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on doh1001.wikimedia.org with reason: host reimage [production]
18:32 <sukhe@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on doh1001.wikimedia.org with reason: host reimage [production]
18:22 <dduvall@deploy2002> rebuilt and synchronized wikiversions files: group2 wikis to 1.42.0-wmf.3 refs T348356 [production]
18:22 <sukhe@cumin2002> START - Cookbook sre.hosts.reimage for host doh1001.wikimedia.org with OS bookworm [production]
18:21 <topranks> Shutting asw-b-codfw uplink to cr2-codfw down in advance of cable move (T347191) [production]
18:09 <ebernhardson@deploy2002> helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
18:09 <ebernhardson@deploy2002> helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply [production]
18:07 <topranks> Making cr1-codfw VRRP Master for row A traffic again on ssw1-a1-codfw interface (T347191) [production]
17:50 <topranks> Shutting asw-a-codfw uplink to cr1-codfw down in advance of cable move (T347191) [production]
17:45 <topranks> Moving row A outbound traffic from direct CR link to routing via Spinie (T347191) [production]
17:45 <fnegri@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcontrol1005.eqiad.wmnet with OS bookworm [production]
17:42 <vgutierrez> repool cp4051 and cp5030 [production]
17:40 <ebernhardson@deploy2002> helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
17:40 <ebernhardson@deploy2002> helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply [production]
17:23 <vgutierrez> depool cp5030 [production]
17:19 <vgutierrez> restart haproxy on cp4051 [production]
17:14 <bd808@deploy2002> helmfile [eqiad] DONE helmfile.d/services/toolhub: apply [production]
17:14 <fnegri@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcontrol1005.eqiad.wmnet with reason: host reimage [production]
17:13 <bd808@deploy2002> helmfile [eqiad] START helmfile.d/services/toolhub: apply [production]
17:13 <bd808@deploy2002> helmfile [codfw] DONE helmfile.d/services/toolhub: apply [production]
17:12 <bd808@deploy2002> helmfile [codfw] START helmfile.d/services/toolhub: apply [production]
17:11 <fnegri@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcontrol1005.eqiad.wmnet with reason: host reimage [production]