2018-10-09
ยง
|
22:26 |
<shdubsh> |
repairing /dev/sdl1 on ms-be2040 - T199198 |
[production] |
21:52 |
<bblack> |
cp1085: varnish backend restart for mbox lag |
[production] |
21:50 |
<mutante> |
releases1001 - restarted jenkins (it went from 200 -> 503 -> 403) curl localhost:8080 works again after restart, icinga check still getting 403 now |
[production] |
21:41 |
<ejegg|food> |
updated fundraising CiviCRM from 7a0d14015e to 1165e7ed79 |
[production] |
20:08 |
<mutante> |
repair /dev/sdg1 on ms-be2041 - T199198 |
[production] |
19:37 |
<XioNoX> |
disable igmp-snooping on asw2-c-eqiad - T201039 |
[production] |
19:25 |
<XioNoX> |
disable igmp-snooping on asw2-b-eqiad - T201039 |
[production] |
19:20 |
<XioNoX> |
bounce igmp-snooping on asw2-b-eqiad |
[production] |
19:00 |
<Krinkle> |
Re-enable beta-scap-eqiad job |
[releng] |
18:24 |
<ottomata> |
adding Accept header to all varnishkafka generated webrequest logs |
[analytics] |
18:24 |
<ottomata> |
adding Accept header to all varnishkafka generated webrequest logs |
[production] |
18:19 |
<Krinkle> |
Messing with scap in beta to test T121597 / D1114 |
[releng] |
17:21 |
<SMalyshev> |
depooled wdq23 again, sigh |
[production] |
15:32 |
<thcipriani> |
deployment-deploy01:sudo rm -rf /tmp/scap_l10n_* |
[releng] |
15:10 |
<joal> |
restart Mediawiki-history-reduced |
[analytics] |
15:08 |
<joal> |
restart wikidata-coeditors oozie job |
[analytics] |
15:08 |
<joal> |
restart wikidata-specialentites oozie job |
[analytics] |
15:00 |
<joal> |
restart wikidata-article-placeholder oozie job |
[analytics] |
14:57 |
<joal> |
restart mediawiki-history denormalize oozie job |
[analytics] |
14:56 |
<joal> |
Restart check_denormalize oozie job |
[analytics] |
14:53 |
<joal> |
Restart clickstream oozie job to pick new spark-lib |
[analytics] |
13:56 |
<ottomata> |
bouncing oozie server on an-coord1001 |
[analytics] |
13:54 |
<moritzm> |
rebooting prometheus1004 for kernel security update |
[production] |
13:46 |
<joal> |
Restarting oozie-api job |
[analytics] |
13:41 |
<moritzm> |
rebooting prometheus1003 for kernel security update |
[production] |
13:36 |
<joal> |
fully restart projectview_geo oozier job |
[analytics] |
13:28 |
<moritzm> |
rebooting prometheus2004 for kernel security update |
[production] |
13:26 |
<joal> |
Full restart of aqs oozie job |
[analytics] |
13:25 |
<joal> |
full restart of projectview_hourly |
[analytics] |
13:14 |
<joal> |
rerun failed aqs-hourl jobs |
[analytics] |
13:13 |
<moritzm> |
rebooting prometheus2003 for kernel security update |
[production] |
12:54 |
<gehel> |
silencing wdqs-public lag alerts (service still functional, and SLO unclear) - T199228 |
[production] |
12:48 |
<elukey> |
re-run all the failed projectview-hourly-coord and aqs-hourly-coord workflows (restarting them via hue) |
[analytics] |
12:47 |
<elukey> |
re-run apis-wf-2018-10-9-8 |
[analytics] |
12:45 |
<moritzm> |
installing imagemagick security updates |
[production] |
11:47 |
<END> |
(ERROR) - Cookbook sre.switchdc.services.00-reduce-ttl-and-sleep (exit_code=2) (volans@neodymium) |
[production] |
11:46 |
<START> |
- Cookbook sre.switchdc.services.00-reduce-ttl-and-sleep (volans@neodymium) |
[production] |
11:45 |
<akosiaris> |
dry-run services switchover from codfw to eqiad in preparation for Thursday |
[production] |
11:37 |
<END> |
(ERROR) - Cookbook sre.switchdc.services.00-reduce-ttl-and-sleep (exit_code=2) (volans@neodymium) |
[production] |
11:37 |
<START> |
- Cookbook sre.switchdc.services.00-reduce-ttl-and-sleep (volans@neodymium) |
[production] |
11:14 |
<volans> |
live-test of the inverted switchdc (eqiad->codfw) completed, all good - T203777 |
[production] |
11:14 |
<END> |
(PASS) - Cookbook sre.switchdc.mediawiki.08-update-tendril (exit_code=0) (volans@neodymium) |
[production] |
11:13 |
<START> |
- Cookbook sre.switchdc.mediawiki.08-update-tendril (volans@neodymium) |
[production] |
11:12 |
<END> |
(PASS) - Cookbook sre.switchdc.mediawiki.08-start-maintenance (exit_code=0) (volans@neodymium) |
[production] |
11:11 |
<START> |
- Cookbook sre.switchdc.mediawiki.08-start-maintenance (volans@neodymium) |
[production] |
11:11 |
<END> |
(PASS) - Cookbook sre.switchdc.mediawiki.08-restore-ttl (exit_code=0) (volans@neodymium) |
[production] |
11:11 |
<START> |
- Cookbook sre.switchdc.mediawiki.08-restore-ttl (volans@neodymium) |
[production] |
11:11 |
<END> |
(PASS) - Cookbook sre.switchdc.mediawiki.07-set-readwrite (exit_code=0) (volans@neodymium) |
[production] |
11:11 |
<[DRY-RUN]> |
MediaWiki read-only period ends at: 2018-10-09 11:11:05.042622 (volans@neodymium) |
[production] |
11:11 |
<START> |
- Cookbook sre.switchdc.mediawiki.07-set-readwrite (volans@neodymium) |
[production] |