2019-04-05
§
|
08:56 |
<hashar> |
Reloaded Zuul for operations/software/gerrit/plugins/barricade https://gerrit.wikimedia.org/r/#/c/integration/config/+/501507/ |
[releng] |
08:55 |
<arturo> |
T220101 reimaging+renaming labtestservices2002 to cloudservices2002-dev |
[production] |
08:43 |
<akosiaris> |
upgrade kubernetes staging cluster to 1.11.9 |
[production] |
08:32 |
<elukey> |
roll restart of aqs on aqs100* to pick up new druid settings |
[production] |
08:21 |
<legoktm> |
deploying https://gerrit.wikimedia.org/r/501416 |
[releng] |
08:07 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-eqiad.php: Fully repool db1075 (duration: 00m 59s) |
[production] |
08:06 |
<gehel@cumin2001> |
START - Cookbook sre.elasticsearch.rolling-reboot |
[production] |
07:51 |
<elukey> |
restart gerrit on cobalt (timeouts and general slowdown) |
[production] |
07:34 |
<jijiki> |
Repooling thumbor1004 until we replace its memory - T215411 |
[production] |
07:18 |
<moritzm> |
upgrading mw1262-mw1265 to HHVM 3.18.5+dfsg-1+wmf8+deb9u2 and wikidiff 1.8.1 (T203069) |
[production] |
06:56 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-eqiad.php: More traffic to db1075 (duration: 00m 57s) |
[production] |
06:04 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-eqiad.php: More traffic to db1075 (duration: 01m 00s) |
[production] |
05:32 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-eqiad.php: Repool db1075 with low weight (duration: 00m 58s) |
[production] |
05:15 |
<marostegui> |
Fully upgrade and reboot db1075 |
[production] |
05:14 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-eqiad.php: Depool db1075 (duration: 00m 59s) |
[production] |
04:49 |
<gilles> |
T216594 Start purge of namespace 0 on ruwiki |
[production] |
02:27 |
<eileen> |
update civicrm revision changed from 7560af93df to 3c55850631, config revision is 9ad5ef3e15 |
[production] |
01:52 |
<bd808> |
Stopped webservice. Perl CGI processes are hanging and consuming all resources on grid node. (T220164) |
[tools.osm4wiki] |
00:35 |
<bstorm_> |
added the rsyncd ports to security groups for the osmdb cluster |
[clouddb-services] |
00:09 |
<bd808@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:497866|wikitech: Lock LDAP accounts when users are blocked]], [[gerrit:501123|Disable Phabricator accounts when blocked on wikitech]] (T168692) 2/2 (duration: 00m 57s) |
[production] |
00:07 |
<bd808@deploy1001> |
Synchronized wmf-config/wikitech.php: SWAT: [[gerrit:497866|wikitech: Lock LDAP accounts when users are blocked]], [[gerrit:501123|Disable Phabricator accounts when blocked on wikitech]] (T168692) (duration: 00m 59s) |
[production] |
2019-04-04
§
|
23:58 |
<bstorm_> |
pg_basebackup finished on clouddb1004 and streaming replication is flowing |
[clouddb-services] |
23:52 |
<bd808@deploy1001> |
Synchronized php-1.33.0-wmf.23/extensions/LdapAuthentication: SWAT: [[gerrit:501412|Also set an LDAP password policy on Block]] (T168692) (duration: 01m 01s) |
[production] |
23:38 |
<bd808@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:501393|Add smn and sms to wmgExtraLanguageNames]] (T220118) (duration: 01m 02s) |
[production] |
23:37 |
<Krinkle> |
Reloading Zuul to deploy https://gerrit.wikimedia.org/r/501442 |
[releng] |
23:00 |
<bstorm_> |
Restarting lighttpd webservice since it seems to have gone insane -- possibly due to the osm database restart |
[tools.osm4wiki] |
22:27 |
<Krinkle> |
Reloading Zuul to deploy https://gerrit.wikimedia.org/r/500360 |
[releng] |
21:48 |
<Krinkle> |
Reloading Zuul to deploy https://gerrit.wikimedia.org/r/501428 |
[releng] |
21:22 |
<XioNoX> |
renumber AS58587 to AS10075 in eqsin |
[production] |
21:21 |
<bd808> |
Uncordoned tools-worker-1013.tools.eqiad.wmflabs after reboot and forced puppet run |
[tools] |
21:20 |
<mforns> |
Restarted turnilo to clear deleted datasource |
[analytics] |
21:17 |
<bblack> |
DNS deploying https://gerrit.wikimedia.org/r/c/operations/dns/+/500731 which can affect resolution of our CNAME records. If dns-related issues, can revert at will! |
[production] |
21:09 |
<herron> |
restarting eqiad ELK stack for security updates |
[production] |
20:53 |
<bd808> |
Rebooting tools-worker-1013 |
[tools] |
20:50 |
<bd808> |
Draining tools-worker-1013.tools.eqiad.wmflabs |
[tools] |
20:47 |
<Krinkle> |
Updating docker-pkg files on contint1001 for https://gerrit.wikimedia.org/r/501407 |
[releng] |
20:47 |
<Amir1> |
97f84c2 going prod |
[wikilabels] |
20:45 |
<marxarelli> |
promotion of 1.33.0-wmf.24 rolled back to group0 and holding. cc: T206678, T220037 |
[production] |
20:42 |
<Amir1> |
8f44694 going prod |
[wikilabels] |
20:41 |
<dduvall@deploy1001> |
rebuilt and synchronized wikiversions files: Revert "group2/group1 wikis to 1.33.0-wmf.24" |
[production] |
20:39 |
<Amir1> |
8f44694 going staging |
[wikilabels] |
20:36 |
<marxarelli> |
rolling back again following still high rates of DBTransactionError (avg ~ 800/min) |
[production] |
20:29 |
<bd808> |
Released floating IP and deleted instance tools-checker-01 via Horizon |
[tools] |
20:28 |
<bd808> |
Shutdown tools-checker-01 via Horizon |
[tools] |
20:17 |
<bd808> |
Repooled tools-webgrid-lighttpd-0906 after reboot, apt-get dist-upgrade, and forced puppet run |
[tools] |
20:16 |
<dduvall@deploy1001> |
rebuilt and synchronized wikiversions files: all wikis to 1.33.0-wmf.24 |
[production] |
20:13 |
<bd808> |
Hard reboot of tools-sgewebgrid-lighttpd-0906 via Horizon |
[tools] |
20:11 |
<marxarelli> |
promoting 1.33.0-wmf.24 to all wikis |
[production] |
20:11 |
<marxarelli> |
error rates look good after proper syncs and re-deploy. cc: T220037 |
[production] |
20:09 |
<bd808> |
Repooled tools-webgrid-lighttpd-0912 after reboot, apt-get dist-upgrade, and forced puppet run |
[tools] |