2020-02-18
ยง
|
21:54 |
<bblack@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
21:26 |
<XioNoX> |
rollback tcp-mss clamping in eqiad/eqord |
[production] |
21:07 |
<jeh> |
power down and set incinga downtime on cloudvirt1022 T243536 |
[production] |
21:07 |
<jeh> |
power down and set incinga downtime on cloudvirt1022 T241884 |
[production] |
20:54 |
<otto@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Enabling EventStreamConfig extension on metawiki - T242122 (duration: 01m 03s) |
[production] |
20:47 |
<ppchelko@deploy1001> |
Finished deploy [changeprop/deploy@e2fe8ca]: respect service name in consumer group T244387 (duration: 07m 59s) |
[production] |
20:45 |
<otto@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Enabling EventStreamConfig extension on testwiki - T242122 (duration: 01m 04s) |
[production] |
20:39 |
<ppchelko@deploy1001> |
Started deploy [changeprop/deploy@e2fe8ca]: respect service name in consumer group T244387 |
[production] |
20:06 |
<jforrester@deploy1001> |
Synchronized php-1.35.0-wmf.19/includes/libs/StatusValue.php: T245155 StatusValue: Fix __toString() to not choke on special parameters (duration: 01m 04s) |
[production] |
20:03 |
<jforrester@deploy1001> |
rebuilt and synchronized wikiversions files: group0 to 1.35.0-wmf.20 T233868 |
[production] |
19:52 |
<jforrester@deploy1001> |
Finished scap: testwiki to 1.35.0-wmf.20 and re-build l10n cache T233868 (duration: 61m 01s) |
[production] |
19:41 |
<papaul> |
shutting down dns2001 for 10G card troubleshooting |
[production] |
19:30 |
<James_F> |
Running `foreachwiki sql.php php-1.35.0-wmf.19/maintenance/archives/patch-watchlist_expiry.sql` for T244631 |
[production] |
18:51 |
<jforrester@deploy1001> |
Started scap: testwiki to 1.35.0-wmf.20 and re-build l10n cache T233868 |
[production] |
18:49 |
<jforrester@deploy1001> |
Pruned MediaWiki: 1.35.0-wmf.18 (duration: 15m 29s) |
[production] |
18:25 |
<James_F> |
Running `scap prep` for 1.35.0-wmf.20 ref. T233868 |
[production] |
18:01 |
<James_F> |
1.35.0-wmf.20 was branched at c664b4f1b933d110bd69f074c399695bd6b17d13 for T233868 |
[production] |
18:01 |
<marxarelli> |
completed promotion of 1.35.0-wmf.19 to all wikis (T233867) |
[production] |
17:52 |
<dduvall@deploy1001> |
rebuilt and synchronized wikiversions files: Re-roll all wikis to 1.35.0-wmf.19 (T233867) |
[production] |
17:47 |
<marxarelli> |
re-rolling wmf.19 to all wikis (T233867) with eyes particularly on (T245202) |
[production] |
17:28 |
<bblack> |
cp3 (esams edge) - revert GRE MTU mitigations - T232602 |
[production] |
17:00 |
<papaul> |
restting ps1-a8-codfw see T245164 |
[production] |
16:34 |
<pt1979@cumin2001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
16:32 |
<pt1979@cumin2001> |
START - Cookbook sre.hosts.downtime |
[production] |
16:12 |
<otto@deploy1001> |
helmfile [EQIAD] Ran 'apply' command on namespace 'eventgate-main' for release 'production' . |
[production] |
16:11 |
<otto@deploy1001> |
helmfile [EQIAD] Ran 'apply' command on namespace 'eventgate-main' for release 'canary' . |
[production] |
16:09 |
<otto@deploy1001> |
helmfile [CODFW] Ran 'apply' command on namespace 'eventgate-main' for release 'production' . |
[production] |
16:08 |
<otto@deploy1001> |
helmfile [CODFW] Ran 'apply' command on namespace 'eventgate-main' for release 'canary' . |
[production] |
16:03 |
<otto@deploy1001> |
helmfile [STAGING] Ran 'apply' command on namespace 'eventgate-main' for release 'production' . |
[production] |
16:02 |
<ottomata> |
deploying new 'canary' and 'production' releases for eventgate-main. (These releases use a new nodePort, and so will not be active until LVS is modified. The old 'main' release and nodePort is left as is.) - T242861 |
[production] |
16:02 |
<otto@deploy1001> |
helmfile [STAGING] Ran 'apply' command on namespace 'eventgate-main' for release 'canary' . |
[production] |
15:51 |
<bblack> |
dns2001 - shutdown for hw/reimage work - T242017 |
[production] |
15:47 |
<bblack> |
dns2001 - stopping bgp to drain service for hw/reimage work - T242017 |
[production] |
15:41 |
<otto@deploy1001> |
helmfile [EQIAD] Ran 'apply' command on namespace 'eventgate-analytics' for release 'production' . |
[production] |
15:39 |
<otto@deploy1001> |
helmfile [EQIAD] Ran 'apply' command on namespace 'eventgate-analytics' for release 'canary' . |
[production] |
15:36 |
<jynus> |
stopping db1140:s3 instance |
[production] |
15:35 |
<otto@deploy1001> |
helmfile [CODFW] Ran 'apply' command on namespace 'eventgate-analytics' for release 'production' . |
[production] |
15:34 |
<otto@deploy1001> |
helmfile [CODFW] Ran 'apply' command on namespace 'eventgate-analytics' for release 'canary' . |
[production] |
15:34 |
<otto@deploy1001> |
helmfile [CODFW] Ran 'apply' command on namespace 'eventgate-analytics' for release 'canary' . |
[production] |
15:14 |
<otto@deploy1001> |
helmfile [CODFW] Ran 'apply' command on namespace 'eventgate-analytics' for release 'canary' . |
[production] |
15:08 |
<vgutierrez@puppetmaster1001> |
conftool action : set/weight=100; selector: dc=eqiad,cluster=cache_text,service=ats-be,name=cp1089.eqiad.wmnet |
[production] |
15:04 |
<otto@deploy1001> |
helmfile [CODFW] Ran 'apply' command on namespace 'eventgate-analytics' for release 'production' . |
[production] |
14:56 |
<bblack> |
esams repooled in DNS |
[production] |
14:54 |
<otto@deploy1001> |
helmfile [CODFW] Ran 'apply' command on namespace 'eventgate-analytics' for release 'production' . |
[production] |
14:53 |
<ottomata> |
deploying new 'canary' and 'production' releases for eventgate-analytics. (These releases use a new nodePort, and so will not be active until LVS is modified. The old 'analytics' release and nodePort is left as is.) - T242861 |
[production] |
14:47 |
<otto@deploy1001> |
helmfile [STAGING] Ran 'apply' command on namespace 'eventgate-analytics' for release 'canary' . |
[production] |
14:47 |
<otto@deploy1001> |
helmfile [STAGING] Ran 'apply' command on namespace 'eventgate-analytics' for release 'production' . |
[production] |
14:39 |
<XioNoX> |
remove cr2-esams VRRP handicap - T243080 |
[production] |
14:34 |
<XioNoX> |
restore default esams-eqiad link cost - T243080 |
[production] |
14:33 |
<XioNoX> |
re-enable cr2-esams BGP transit/peering - T243080 |
[production] |