2019-07-19
§
|
08:37 |
<ariel@deploy1001> |
Started deploy [dumps/dumps@440faa0]: more error reporting for stubs/abstracts/pagelogs; more public table dumps by default |
[production] |
08:36 |
<gehel@cumin1001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
08:24 |
<gehel> |
repooling wdqs2004 - T228122 |
[production] |
08:22 |
<gehel> |
repooling wdqs2003 - T228122 |
[production] |
08:20 |
<vgutierrez> |
restart pybal on lvs2003 |
[production] |
08:16 |
<vgutierrez> |
restart pybal on lvs2006 |
[production] |
08:10 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-eqiad.php: Pool db1109 into API (duration: 00m 54s) |
[production] |
07:57 |
<moritzm> |
installing idp1001 T228403 |
[production] |
07:38 |
<moritzm> |
rebooting tungsten for kernel update |
[production] |
07:38 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
07:38 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.downtime |
[production] |
07:25 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
07:25 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.downtime |
[production] |
07:25 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
07:25 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.downtime |
[production] |
07:03 |
<elukey> |
restart php-fpm on mw1330 - op-cache hit ratio low |
[production] |
07:02 |
<jynus> |
reloading dbproxy1004/9 |
[production] |
07:01 |
<elukey> |
depool wdqs2004 from all services (waiting for maintenance) |
[production] |
06:32 |
<legoktm@deploy1001> |
Synchronized php-1.34.0-wmf.13/extensions/EventBus/includes/EventBus.php: Add more debugging to figure out which events are invalid: T225199 (duration: 00m 55s) |
[production] |
06:30 |
<legoktm@deploy1001> |
Synchronized php-1.34.0-wmf.14/extensions/EventBus/includes/EventBus.php: Add more debugging to figure out which events are invalid: T225199 (duration: 00m 55s) |
[production] |
06:15 |
<elukey> |
clear opcache on mwdebug* |
[production] |
05:26 |
<fsero> |
repool ms-fe2005 - T228196 |
[production] |
05:11 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-codfw.php: Repool db2116 (duration: 00m 55s) |
[production] |
04:11 |
<eileen> |
I think I didn't push the turn it on commit - tried again process-control config revision is 9f7eba2193 |
[production] |
03:03 |
<eileen> |
process-control config revision is 7598dc1bf9 (jobs reenabled) |
[production] |
01:52 |
<XioNoX> |
enable outbound sampling on eqiad's router |
[production] |
00:52 |
<sbassett@deploy1001> |
Synchronized private/PrivateSettings.php: Add even more severe rate limits for eswikiquote and some other, smaller wikis (T227416) (duration: 00m 58s) |
[production] |
00:38 |
<mutante> |
mwmaint2001 - puppet fails - not removing a bunch of log dirs for maintenance crons |
[production] |
00:10 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2250.codfw.wmnet |
[production] |
00:08 |
<eileen> |
process-control config revision is 7598dc1bf9 - jobs disabled |
[production] |
00:04 |
<mutante> |
install1002 - exported indices for new scap version - copied back from buster to stretch - upgraded scap version on mw2250 - scap pull now works and starts to rsync (T228482, T228328, T226948) |
[production] |
2019-07-18
§
|
23:50 |
<mutante> |
built new scap version 3.11.1-1 on boron, copied to install1002, imported package with reprepro, copied from stretch to jessie and buster (T228482) |
[production] |
23:22 |
<Lucas_WMDE> |
Evening SWAT done |
[production] |
23:17 |
<lucaswerkmeister-wmde@deploy1001> |
Synchronized wmf-config/InitialiseSettings-labs.php: SWAT: [[gerrit:523141|Configure Citoid+Wikibase integration on Beta (production no-op) (T228411)]] (duration: 00m 54s) |
[production] |
23:13 |
<lucaswerkmeister-wmde@deploy1001> |
Synchronized wmf-config/Wikibase.php: SWAT: [[gerrit:523140|Set $wgWBRepoSettings[enableRefTabs] in Wikibase.php (T228414)]] (duration: 01m 16s) |
[production] |
23:09 |
<lucaswerkmeister-wmde@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:523139|Define settings for Citoid+Wikibase integration (T228414)]] (duration: 00m 55s) |
[production] |
22:23 |
<gehel@puppetmaster1001> |
conftool action : set/pooled=inactive; selector: dc=eqiad,name=wdqs1008.eqiad.wmnet |
[production] |
22:16 |
<gehel@cumin1001> |
END (ERROR) - Cookbook sre.wdqs.data-transfer (exit_code=97) |
[production] |
22:00 |
<eevans@> |
helmfile [STAGING] Ran 'apply' command on namespace 'sessionstore' for release 'staging' . |
[production] |
21:49 |
<bd808> |
Cleaned up stale striker logs on labweb1001 and labweb1002. Logs go to journald now so log rotate is not triggered to rotate out logs from before that change. |
[production] |
21:42 |
<eevans@> |
helmfile [STAGING] Ran 'apply' command on namespace 'sessionstore' for release 'staging' . |
[production] |
21:35 |
<bd808@deploy1001> |
Finished deploy [striker/deploy@91594df]: Fixes for deprecation warnings and editing Tool models (T228222, T228332) (duration: 01m 13s) |
[production] |
21:34 |
<bd808@deploy1001> |
Started deploy [striker/deploy@91594df]: Fixes for deprecation warnings and editing Tool models (T228222, T228332) |
[production] |
21:15 |
<mutante> |
gerrit (cobalt) - scheduled 1h downtime, rebooting for kernel upgrade |
[production] |
21:03 |
<jforrester@deploy1001> |
Synchronized php-1.34.0-wmf.14/extensions/Flow: T228290 Fix fatal in ChangesListFormatter::getLogTextLinks() (duration: 01m 02s) |
[production] |
20:57 |
<mutante> |
gerrit2001 - icinga downtime for 1h |
[production] |
20:56 |
<mutante> |
gerrit2001 - reboot for kernel upgrade |
[production] |
20:51 |
<mutante> |
gerrit2001 - apt-get upgrade; apt-get autoremove ; puppet agent -tv |
[production] |
19:55 |
<eevans@> |
helmfile [STAGING] Ran 'apply' command on namespace 'sessionstore' for release 'staging' . |
[production] |
19:33 |
<jforrester@deploy1001> |
Synchronized wmf-config/CommonSettings.php: T228374 Enable SecureLinkFixer in beta cluster (2/2) (duration: 00m 55s) |
[production] |