2019-06-05
ยง
|
22:15 |
<chaomodus> |
restarting gerrit on cobalt due to it being down (seems like Java out of heap space) |
[production] |
20:43 |
<mforns@deploy1001> |
Finished deploy [analytics/refinery@0660e70]: deploying analytics/refinery up to 0660e70153dec892ae20bee7119a72cc17e8ec87 (duration: 19m 30s) |
[production] |
20:39 |
<reedy@deploy1001> |
Synchronized wmf-config/flaggedrevs.php: Turn off some FR config T225138 (duration: 00m 54s) |
[production] |
20:25 |
<akosiaris@deploy1001> |
scap-helm blubberoid finished |
[production] |
20:25 |
<akosiaris@deploy1001> |
scap-helm blubberoid cluster codfw completed |
[production] |
20:25 |
<akosiaris@deploy1001> |
scap-helm blubberoid cluster eqiad completed |
[production] |
20:25 |
<akosiaris@deploy1001> |
scap-helm blubberoid upgrade -f blubberoid-values.yaml production stable/blubberoid [namespace: blubberoid, clusters: eqiad,codfw] |
[production] |
20:23 |
<mforns@deploy1001> |
Started deploy [analytics/refinery@0660e70]: deploying analytics/refinery up to 0660e70153dec892ae20bee7119a72cc17e8ec87 |
[production] |
19:57 |
<hashar> |
contint1001: docker container prune -f && docker image prune -f # reclaimed 166 MB and 3.4 GB |
[production] |
19:48 |
<marostegui> |
Check data consistency on db1091 against db1135 - T225060 |
[production] |
19:45 |
<reedy@deploy1001> |
Synchronized wmf-config/flaggedrevs.php: T225115 (duration: 00m 54s) |
[production] |
17:36 |
<marostegui> |
Start replication db1091 - T225060 |
[production] |
17:32 |
<marostegui> |
Start MySQL with replication stopped on db1091 - T225060 |
[production] |
16:29 |
<otto@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Revert user-blocks-change to use eventbus and old schema - T211248 (duration: 00m 54s) |
[production] |
16:22 |
<otto@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: use eventgate-main for 2 events on all wikis - T211248 (duration: 00m 55s) |
[production] |
16:11 |
<reedy@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Add wgEventServiceStreamConfig and switch 2 topics in group0 T222822 (duration: 00m 56s) |
[production] |
16:11 |
<XioNoX> |
remove BGP to AS38082 on cr4-ulsfo (left the IXP) |
[production] |
15:46 |
<reedy@deploy1001> |
Scap failed!: Call to mwscript eval.php returned: None |
[production] |
15:44 |
<reedy@deploy1001> |
Finished scap: Rebuild .8 i18n for FlaggedRevs (duration: 41m 14s) |
[production] |
15:36 |
<moritzm> |
installing exim4 security updates |
[production] |
15:03 |
<reedy@deploy1001> |
Started scap: Rebuild .8 i18n for FlaggedRevs |
[production] |
14:24 |
<marostegui> |
Poweroff db1091 for BBU replacement - T225060 |
[production] |
13:57 |
<elukey> |
restart mcrouter on MediaWiki app/api canaries to pick up new config change (timeouts before marking a memcached shard as TKO from 3 to 10) - T203786 |
[production] |
13:56 |
<jijiki> |
enabling puppet and pooling on mw* canaries |
[production] |
13:17 |
<jynus> |
start es2,es3 backup on codfw |
[production] |
13:17 |
<zfilipin@deploy1001> |
rebuilt and synchronized wikiversions files: group1 wikis to 1.34.0-wmf.8 |
[production] |
13:03 |
<hashar> |
restarting Jenkins |
[production] |
12:53 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-eqiad.php: More traffic to db1135 (duration: 00m 54s) |
[production] |
12:46 |
<Lucas_WMDE> |
EU SWAT finished |
[production] |
12:32 |
<ladsgroup@deploy1001> |
Synchronized php-1.34.0-wmf.8/extensions/WikimediaMessages/: SWAT: [[gerrit:514460|Fix wikidata copyright message (T224536)]] (duration: 00m 56s) |
[production] |
11:43 |
<pmiazga@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:514449|Enable the new history page in the advanced mobile contributions mode (T219895)]] (duration: 00m 56s) |
[production] |
11:27 |
<urbanecm@deploy1001> |
Synchronized wmf-config/flaggedrevs.php: [[:gerrit:514413|Remove project namespace from flaggedrevs on ruwikisource]] (T225037) (duration: 00m 54s) |
[production] |
10:57 |
<ladsgroup@deploy1001> |
Synchronized php-1.34.0-wmf.8/extensions/FlaggedRevs: [[gerrit:514456|Add ext.flaggedRevs.icons to modules registeration]] (duration: 00m 57s) |
[production] |
10:14 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-eqiad.php: More traffic to db1135 (duration: 00m 55s) |
[production] |
10:09 |
<godog> |
mount sdb3 on ms-be1022 - T225079 |
[production] |
09:50 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-eqiad.php: Pool db1135 with very low weight on s4 (duration: 00m 55s) |
[production] |
09:26 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-codfw.php: Pool without traffic db1135 into s4 T225060 (duration: 00m 55s) |
[production] |
09:25 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-eqiad.php: Pool without traffic db1135 into s4 T225060 (duration: 00m 56s) |
[production] |
08:42 |
<onimisionipe> |
removing maps2001 from cassandra cluster. It is going to be reimaged - T224395 |
[production] |
08:40 |
<_joe_> |
rolling restart of php7 on the api servers, to test a different strategy of restarting compared to the appservers. |
[production] |
08:21 |
<_joe_> |
performing a rolling restart of the php appservers via cumin to test speed and safety of the operations proposed in T224857 |
[production] |
08:13 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
08:12 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.downtime |
[production] |
08:12 |
<moritzm> |
rebooting pybal-test2001 for tests with new qemu |
[production] |
08:12 |
<ema> |
pool cp3035 w/ ATS backend T222937 |
[production] |
08:12 |
<marostegui> |
Reboot db1091 T225060 |
[production] |
08:05 |
<moritzm> |
installing qemu security updates on Ganeti hosts |
[production] |
07:45 |
<marostegui> |
Transfer dbprov1001.eqiad.wmnet:snapshot.s4.2019-06-04--21-37-03.tar.gz to db1135 to provision it on s4 T225060 |
[production] |
07:33 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-eqiad.php: Clarify db1091 status (duration: 00m 56s) |
[production] |
07:22 |
<ema> |
depool cp3035 and reimage as upload_ats T222937 |
[production] |