2024-07-19
§
|
10:00 |
<pfischer@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
10:00 |
<pfischer@deploy1002> |
helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
09:58 |
<cgoubert@cumin1002> |
END (ERROR) - Cookbook sre.hosts.convert-disks (exit_code=97) for host mw2439 |
[production] |
09:54 |
<elukey@cumin1002> |
END (FAIL) - Cookbook sre.hosts.dhcp (exit_code=99) for host sretest2001.codfw.wmnet |
[production] |
09:42 |
<cgoubert@cumin1002> |
END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts mw2439.codfw.wmnet |
[production] |
09:41 |
<cgoubert@cumin1002> |
START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts mw2439.codfw.wmnet |
[production] |
09:41 |
<cgoubert@cumin1002> |
START - Cookbook sre.hosts.convert-disks for host mw2439 |
[production] |
09:35 |
<cgoubert@cumin1002> |
END (FAIL) - Cookbook sre.hosts.convert-disks (exit_code=99) for host mw2439 |
[production] |
09:35 |
<cgoubert@cumin1002> |
END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts mw2439.codfw.wmnet |
[production] |
09:35 |
<cgoubert@cumin1002> |
START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts mw2439.codfw.wmnet |
[production] |
09:35 |
<cgoubert@cumin1002> |
START - Cookbook sre.hosts.convert-disks for host mw2439 |
[production] |
09:32 |
<cgoubert@cumin1002> |
END (FAIL) - Cookbook sre.hosts.convert-disks (exit_code=99) for host mw2439 |
[production] |
09:31 |
<cgoubert@cumin1002> |
END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts mw2439.codfw.wmnet |
[production] |
09:21 |
<cgoubert@cumin1002> |
START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts mw2439.codfw.wmnet |
[production] |
09:21 |
<cgoubert@cumin1002> |
START - Cookbook sre.hosts.convert-disks for host mw2439 |
[production] |
08:16 |
<pfischer@deploy1002> |
helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
08:16 |
<pfischer@deploy1002> |
helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
08:15 |
<pfischer@deploy1002> |
helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
08:15 |
<pfischer@deploy1002> |
helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
08:15 |
<pfischer@deploy1002> |
helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
08:15 |
<pfischer@deploy1002> |
helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
08:08 |
<cgoubert@cumin1002> |
END (FAIL) - Cookbook sre.hosts.convert-disks (exit_code=99) for host mw2438 |
[production] |
08:08 |
<cgoubert@cumin1002> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host mw2438.mgmt.codfw.wmnet with reboot policy GRACEFUL |
[production] |
08:05 |
<elukey@cumin1002> |
START - Cookbook sre.hosts.dhcp for host sretest2001.codfw.wmnet |
[production] |
02:50 |
<eileen> |
civicrm upgraded from 384fe444 to a9ef8ab9 |
[production] |
00:28 |
<zabe@deploy1002> |
sync-world aborted: Backport for [[gerrit:1055311|Set some site names for new-ish wikis (T363270 T360303 T360310 T363263)]] (duration: 01m 33s) |
[production] |
00:26 |
<zabe@deploy1002> |
Started scap sync-world: Backport for [[gerrit:1055311|Set some site names for new-ish wikis (T363270 T360303 T360310 T363263)]] |
[production] |
2024-07-18
§
|
23:57 |
<topranks> |
re-enable ssw<->ssw bgp in codfw to move east-west traffic away from CRs T369274 |
[production] |
23:46 |
<topranks> |
move IP GW for vlan private1-d-codfw to ssw1-d1-codfw and ssw1-d8-codfw T369274 |
[production] |
23:44 |
<cmooney@cumin1002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
23:44 |
<cmooney@cumin1002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update dns entries for migrated codfw gw IPs - cmooney@cumin1002" |
[production] |
23:44 |
<topranks> |
remove VRRP group for private1-d-codfw vlan on cr1-codfw and cr2-codfw |
[production] |
23:43 |
<cmooney@cumin1002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update dns entries for migrated codfw gw IPs - cmooney@cumin1002" |
[production] |
23:40 |
<cmooney@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
23:36 |
<topranks> |
move outbound gateway for private1-d-codfw vlan from cr1-codfw to ssw1-d1-codfw |
[production] |
23:31 |
<topranks> |
disable IPv6 RA generation for private1-d-codfw vlan on cr1-codfw and cr2-codfw T369274 |
[production] |
23:17 |
<topranks> |
enable IPv6 RA generation for private1-d-codfw vlan from ssw1-d1-codfw and ssw1-d8-codfw T369274 |
[production] |
23:16 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db2145 (T367856)', diff saved to https://phabricator.wikimedia.org/P66838 and previous config saved to /var/cache/conftool/dbconfig/20240718-231639-marostegui.json |
[production] |
23:16 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 6:00:00 on db2145.codfw.wmnet with reason: Maintenance |
[production] |
23:16 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 6:00:00 on db2145.codfw.wmnet with reason: Maintenance |
[production] |
23:05 |
<topranks> |
Remove VRRP group for vlan private1-c-codfw on cr1-codfw and cr2-codfw |
[production] |
22:49 |
<topranks> |
Re-route outbound traffic for private1-c-codfw vlan on to ssw1-d1-codfw |
[production] |
22:33 |
<topranks> |
Disable IPv6 RA generation for private1-c-codfw vlan on cr1-codfw and cr2-codfw T369274 |
[production] |
22:19 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on elastic1100.eqiad.wmnet with reason: catch up on indexing |
[production] |
22:19 |
<bking@cumin2002> |
START - Cookbook sre.hosts.downtime for 4:00:00 on elastic1100.eqiad.wmnet with reason: catch up on indexing |
[production] |
22:15 |
<topranks> |
add IP interfaces for private1-c-codfw vlan to ssw1-d1-codfw and ssw1-d8-codfw |
[production] |
22:03 |
<topranks> |
move GW IPs for public1-d-codfw vlan to ssw1-d1-codfw and ssw1-d8-codfw T369274 |
[production] |
21:58 |
<topranks> |
remove VRRP group on cr1-codfw and cr2-codfw for public1-d-codfw vlan T369274 |
[production] |
21:57 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.elasticsearch.force-shard-allocation (exit_code=0) |
[production] |
21:57 |
<bking@cumin2002> |
START - Cookbook sre.elasticsearch.force-shard-allocation |
[production] |