2017-09-03
§
|
17:52 |
<elukey> |
depooled cp4024 (ulsfo upload) due to kernel errors in dmesg |
[production] |
15:57 |
<ema> |
restart varnish backend on cp1073 (mailbox lag) |
[production] |
09:41 |
<ema@neodymium> |
conftool action : set/pooled=yes; selector: name=cp4024.ulsfo.wmnet,service=varnish-fe |
[production] |
09:41 |
<ema@neodymium> |
conftool action : set/pooled=yes; selector: name=cp4024.ulsfo.wmnet,service=nginx |
[production] |
09:25 |
<ema> |
cp4024 is down, powercycled |
[production] |
09:13 |
<_joe_> |
manually fixed value of /conftool/v1/pools/eqiad/api_appserver/apache2/mw1208.eqiad.wmnet in codfw to allow replication to restart |
[production] |
00:59 |
<legoktm> |
legoktm@terbium:~$ mwscript extensions/WikimediaMaintenance/filebackend/setZoneAccess.php --wiki=hiwikiversity --backend=local-multiwrite # T174859 |
[production] |
2017-09-01
§
|
19:00 |
<JeanFred> |
Deploy latest from Git master: 2be9e28 (T166528), d3aa65a, 61086ce, 637a1c0, 6bbcb0a (T174146), 566ab17, eae2643, 9b8e2f4 |
[tools.heritage] |
18:30 |
<joal> |
Rerun Workflow webrequest-load-wf-misc-2017-9-1-16 after very weird failure |
[analytics] |
16:18 |
<bblack> |
restart varnish backend on cp1074 (mailbox lag) |
[production] |
15:15 |
<urandom> |
Restarting Cassandra: restbase-dev100[5-6]-{a,b} |
[production] |
15:06 |
<urandom> |
Restarting Cassandra: restbase-dev1004-{a,b} |
[production] |
15:00 |
<reedy@tin> |
Synchronized php-1.30.0-wmf.16/extensions/Popups/: T174724 (duration: 00m 46s) |
[production] |
14:48 |
<bd808> |
Deployed 4cab928 (Link to Striker for tool creation) T149458 |
[tools.admin] |
14:28 |
<marostegui> |
Add 150G to /srv partition on labsdb1001 |
[production] |
13:42 |
<marostegui> |
Rename table pr_index on enwiki on db1089 - T174782 |
[production] |
13:10 |
<marostegui> |
Stop MySQL on db1026 as it will be decommissioned - T174763 |
[production] |
12:32 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Restore db1081 original weight - T168661 (duration: 00m 43s) |
[production] |
12:19 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Remove db1026 as it will be decommissioned - T174763 (duration: 00m 43s) |
[production] |
12:17 |
<marostegui@tin> |
Synchronized wmf-config/db-codfw.php: Remove db1026 as it will be decommissioned - T174763 (duration: 00m 43s) |
[production] |
11:50 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Increase db1081 weight - T168661 (duration: 00m 43s) |
[production] |
11:23 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Repool db1081 with low weight - T168661 (duration: 00m 44s) |
[production] |
10:35 |
<elukey> |
stop puppet on thorium and disable root rsyncs - T174756 |
[production] |
10:31 |
<marostegui> |
Upgrade MariaDB to 10.0.32 on db1081 - T168661 |
[production] |
10:29 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Depool db1081 - T168661 (duration: 00m 43s) |
[production] |
10:06 |
<elukey> |
killed root rsyncs on thorium, disabled puppet |
[analytics] |
09:10 |
<godog> |
bounce graphite-web on graphite1001, problematic query? using all CPU |
[production] |
09:04 |
<ema> |
lvs1007 upgrade to pybal 1.13.11 - one-packet-scheduling, instrumentation fixes. T104442, T103882 |
[production] |
08:59 |
<godog> |
depool restbase200[135] before reimage - T169939 |
[production] |
07:22 |
<elukey> |
restart apache2 and hue on thorium, Analytics sites down, investigating |
[production] |
07:06 |
<marostegui> |
Power reset db2044 as it is unresponsive - T174764 |
[production] |
03:54 |
<krinkle@tin> |
Synchronized wmf-config/InitialiseSettings.php: I4dfc33f66c3 - Enable jQuery 3 on nlwiki sister projects (duration: 00m 43s) |
[production] |
01:31 |
<ottomata> |
restarted hue (a few minutes ago) not totally sure why it died |
[analytics] |
00:57 |
<tstarling@tin> |
Synchronized php-1.30.0-wmf.16/includes/parser/Parser.php: (no justification provided) (duration: 00m 44s) |
[production] |
00:13 |
<RainbowSprinkles> |
gerrit: flushed all non-login caches, things might be sluggish for the next ~15mins or so |
[production] |