2023-11-03
§
|
11:08 |
<fnegri@cumin1001> |
START - Cookbook sre.hosts.reimage for host cloudnet1006.eqiad.wmnet with OS bookworm |
[production] |
11:07 |
<jynus@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host backup1011.eqiad.wmnet with OS bookworm |
[production] |
10:53 |
<jynus@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on backup1011.eqiad.wmnet with reason: host reimage |
[production] |
10:50 |
<jynus@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on backup1011.eqiad.wmnet with reason: host reimage |
[production] |
10:34 |
<jynus@cumin1001> |
START - Cookbook sre.hosts.reimage for host backup1011.eqiad.wmnet with OS bookworm |
[production] |
10:08 |
<mvernon@cumin1001> |
END (PASS) - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies (exit_code=0) rolling restart_daemons on A:swift-fe |
[production] |
09:59 |
<mvernon@cumin1001> |
START - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies rolling restart_daemons on A:swift-fe |
[production] |
09:59 |
<Emperor> |
roll-restart swift frontends |
[production] |
04:01 |
<eileen> |
civicrm upgraded from 84ec2957 to 5be02f1b |
[production] |
01:23 |
<thcipriani@deploy2002> |
Finished scap: Backport for [[gerrit:971287|Disable namespaceDupes.php for now (T350443)]] (duration: 10m 29s) |
[production] |
01:18 |
<thcipriani@deploy2002> |
thcipriani: Continuing with sync |
[production] |
01:14 |
<thcipriani@deploy2002> |
thcipriani: Backport for [[gerrit:971287|Disable namespaceDupes.php for now (T350443)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
01:13 |
<thcipriani@deploy2002> |
Started scap: Backport for [[gerrit:971287|Disable namespaceDupes.php for now (T350443)]] |
[production] |
2023-11-02
§
|
22:31 |
<Amir1> |
killed update collation on s5 |
[production] |
22:13 |
<brett> |
import acme-chief 0.36-2 into bookworm-wikimedia repo |
[production] |
21:22 |
<inflatador> |
bking@cumin2002 enabling elastic snapshots on eqiad clusters T348686 |
[production] |
20:32 |
<mabualruz@deploy2002> |
Finished scap: Backport for [[gerrit:971244|Enable native math rendering mode on testwiki (T311620)]] (duration: 14m 06s) |
[production] |
20:27 |
<mabualruz@deploy2002> |
mabualruz and physikerwelt: Continuing with sync |
[production] |
20:20 |
<mabualruz@deploy2002> |
mabualruz and physikerwelt: Backport for [[gerrit:971244|Enable native math rendering mode on testwiki (T311620)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
20:18 |
<mabualruz@deploy2002> |
Started scap: Backport for [[gerrit:971244|Enable native math rendering mode on testwiki (T311620)]] |
[production] |
20:04 |
<cmooney@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
20:04 |
<cmooney@cumin1001> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update DNS entries for codfw CR IPs moved to new interfaces. - cmooney@cumin1001" |
[production] |
20:01 |
<cmooney@cumin1001> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update DNS entries for codfw CR IPs moved to new interfaces. - cmooney@cumin1001" |
[production] |
19:59 |
<cmooney@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
18:52 |
<sukhe@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host doh1001.wikimedia.org with OS bookworm |
[production] |
18:46 |
<topranks> |
shutting down uplink from asw-b-codfw et-2/0/51 to cr1-codfw in advance of cable move (T347191) |
[production] |
18:44 |
<topranks> |
Making cr2-codfw VRRP Master for row B traffic over new link from ssw1-a8-codfw (T347191) |
[production] |
18:35 |
<sukhe@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on doh1001.wikimedia.org with reason: host reimage |
[production] |
18:32 |
<sukhe@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on doh1001.wikimedia.org with reason: host reimage |
[production] |
18:22 |
<dduvall@deploy2002> |
rebuilt and synchronized wikiversions files: group2 wikis to 1.42.0-wmf.3 refs T348356 |
[production] |
18:22 |
<sukhe@cumin2002> |
START - Cookbook sre.hosts.reimage for host doh1001.wikimedia.org with OS bookworm |
[production] |
18:21 |
<topranks> |
Shutting asw-b-codfw uplink to cr2-codfw down in advance of cable move (T347191) |
[production] |
18:09 |
<ebernhardson@deploy2002> |
helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
18:09 |
<ebernhardson@deploy2002> |
helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
18:07 |
<topranks> |
Making cr1-codfw VRRP Master for row A traffic again on ssw1-a1-codfw interface (T347191) |
[production] |
17:50 |
<topranks> |
Shutting asw-a-codfw uplink to cr1-codfw down in advance of cable move (T347191) |
[production] |
17:45 |
<topranks> |
Moving row A outbound traffic from direct CR link to routing via Spinie (T347191) |
[production] |
17:45 |
<fnegri@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcontrol1005.eqiad.wmnet with OS bookworm |
[production] |
17:42 |
<vgutierrez> |
repool cp4051 and cp5030 |
[production] |
17:40 |
<ebernhardson@deploy2002> |
helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
17:40 |
<ebernhardson@deploy2002> |
helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
17:23 |
<vgutierrez> |
depool cp5030 |
[production] |
17:19 |
<vgutierrez> |
restart haproxy on cp4051 |
[production] |
17:14 |
<bd808@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/toolhub: apply |
[production] |
17:14 |
<fnegri@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcontrol1005.eqiad.wmnet with reason: host reimage |
[production] |
17:13 |
<bd808@deploy2002> |
helmfile [eqiad] START helmfile.d/services/toolhub: apply |
[production] |
17:13 |
<bd808@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/toolhub: apply |
[production] |
17:12 |
<bd808@deploy2002> |
helmfile [codfw] START helmfile.d/services/toolhub: apply |
[production] |
17:11 |
<fnegri@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcontrol1005.eqiad.wmnet with reason: host reimage |
[production] |
17:11 |
<bd808@deploy2002> |
helmfile [staging] DONE helmfile.d/services/toolhub: apply |
[production] |