2018-09-10
§
|
09:36 |
<START> |
- Cookbook sre.switchdc.mediawiki.00-reduce-ttl (volans@sarin) |
[production] |
09:32 |
<END> |
(PASS) - Cookbook sre.switchdc.mediawiki.00-disable-puppet (exit_code=0) (volans@sarin) |
[production] |
09:31 |
<START> |
- Cookbook sre.switchdc.mediawiki.00-disable-puppet (volans@sarin) |
[production] |
09:30 |
<volans> |
starting execution of "cookbook sre.switchdc.mediawiki --live-test codfw eqiad" - T199073 |
[production] |
08:22 |
<marostegui> |
Drop users metric and wikilytics from core databases |
[production] |
08:04 |
<marostegui> |
Drop unused root grants from core servers |
[production] |
07:46 |
<moritzm> |
installing ghostscript security updates |
[production] |
07:18 |
<volans> |
restarted pdfrender on scb2004 - T174916 |
[production] |
07:04 |
<oblivian@deploy1001> |
Synchronized wmf-config/throttle.php: Deploy throttle rule for Czech School T203909 (duration: 00m 51s) |
[production] |
02:51 |
<l10nupdate@deploy1001> |
ResourceLoader cache refresh completed at Mon Sep 10 02:51:00 UTC 2018 (duration 10m 52s) |
[production] |
02:40 |
<l10nupdate@deploy1001> |
scap sync-l10n completed (1.32.0-wmf.20) (duration: 13m 48s) |
[production] |
00:46 |
<tstarling@deploy1001> |
Synchronized wmf-config/set-time-limit.php: (no justification provided) (duration: 00m 49s) |
[production] |
00:12 |
<tstarling@deploy1001> |
Synchronized w/infinite-loop.php: Testing for T97192 (duration: 00m 48s) |
[production] |
00:07 |
<tstarling@deploy1001> |
Synchronized wmf-config/PhpAutoPrepend.php: T97192 (duration: 00m 49s) |
[production] |
00:04 |
<tstarling@deploy1001> |
Synchronized wmf-config/set-time-limit.php: T97192 (duration: 00m 52s) |
[production] |
2018-09-08
§
|
09:45 |
<gtirloni> |
tools restarted cron and truncated /var/log/exim4/paniclog (T196137) |
[production] |
04:22 |
<krinkle@deploy1001> |
Synchronized multiversion/: Ia27a8f7ed612f (duration: 00m 49s) |
[production] |
04:16 |
<krinkle@deploy1001> |
Synchronized wmf-config/profiler.php: Ia27a8f7ed612f (duration: 00m 54s) |
[production] |
01:10 |
<mutante> |
also rsyncing /var/lib/tor-instances/ data for second instance and restarting service (T196701) |
[production] |
00:53 |
<mutante> |
radium - stopping rsync.service |
[production] |
00:27 |
<mutante> |
torrelay1001 - reset internal state (sighup) with "arm" and pressing x twice |
[production] |
00:18 |
<mutante> |
to watch what is happenin on torrelay1001 - sudo -u debian-tor arm - if asked for password it's in passwords::tor in private |
[production] |
00:16 |
<mutante> |
tor relay switched over from radium to torrelay1001, fixed /var/lib/tor permissions, restarted service, flipped DNS CNAME (5M TTL), traffic can be seen with "arm", monitoring all green (T196701) |
[production] |
2018-09-07
§
|
23:26 |
<mutante> |
ms-be2042 - repairing /dev/sdj1 (T199198) |
[production] |
23:25 |
<mutante> |
ms-be2041 - repairing /dev/sdh1 (T199198) |
[production] |
23:23 |
<mutante> |
ms-be1041 - repairing xfs per https://wikitech.wikimedia.org/wiki/Swift/How_To#Repair_xfs_free_blocks_counter_corruption (T199198) |
[production] |
22:17 |
<mutante> |
gerrit - restarting for config change to move log files to /var/log/gerrit/ |
[production] |
22:16 |
<mutante> |
- cobalt (gerrit) - applying change to move log file location, manually moved logs to /var/log/gerrit, remove old log dir, let puppet re-create it, like on gerrit2001 |
[production] |
21:31 |
<mutante> |
gerrit2001, moving gerrit logfiles to /var/log/gerrit, removing old gerrit logdir, letting puppet re-create it as symlink |
[production] |
18:20 |
<mutante> |
LDAP: correction, 'monipe' replaced with 'onimisionipe' in wmf group (T202708) |
[production] |
18:12 |
<mutante> |
LDAP: added user 'monipe' to group 'wmf' (T202708) |
[production] |
18:02 |
<legoktm@deploy1001> |
Synchronized php-1.32.0-wmf.20/extensions/EUCopyrightCampaign/: Update MEPs - https://gerrit.wikimedia.org/r/458628 (for real this time) (duration: 00m 50s) |
[production] |
17:52 |
<legoktm@deploy1001> |
Synchronized php-1.32.0-wmf.20/extensions/EUCopyrightCampaign/: Update MEPs - https://gerrit.wikimedia.org/r/458628 (duration: 00m 50s) |
[production] |
17:45 |
<XioNoX> |
apply firewall changes on pfw3-eqiad - T203793 |
[production] |
17:40 |
<XioNoX> |
apply firewall changes on pfw3-codfw - T203793 |
[production] |
16:42 |
<XioNoX> |
explicitely permit install1002/2002:80 in filter labs-in4 on cr1/2-eqiad - T190424 |
[production] |
14:56 |
<moritzm> |
uploaded linux-meta 1.20+deb9u1 to apt.wikimedia.org/stretch-wikimedia (provides a new meta package for Linux 4.14) |
[production] |
14:29 |
<moritzm> |
installing PHP security updates on krypton |
[production] |
14:27 |
<moritzm> |
installing libtirpc security updates on trusty |
[production] |
12:35 |
<elukey> |
reboot kafka200[2,3] (eventbus codfw) for kernel + openjdk-8 upgrades |
[production] |
10:03 |
<Amir1> |
ladsgroup@mwmaint1001:~$ mwscript extensions/CentralAuth/maintenance/deleteLocalPasswords.php --wiki=fawiki --user Ladsgroup --prefix (T201009) |
[production] |
09:38 |
<hashar@deploy1001> |
Synchronized php-1.32.0-wmf.20/extensions/UniversalLanguageSelector: Revert "Simplify by using native JavaScript instead of jQuery" - T203750 (duration: 00m 55s) |
[production] |
08:49 |
<ema> |
passive checks awol on einsteinium, restarting icinga -- T196336 |
[production] |
08:45 |
<jynus> |
reloading apache with bad config for tendril for testing (small downtime) |
[production] |
08:33 |
<marostegui> |
Rebooting haproxies to pick up new config after all the tests - T201021 |
[production] |
08:16 |
<banyek> |
genarting false alert about https auth on dbmonitor1001 |
[production] |
07:24 |
<moritzm> |
rebooting mw2270-mw2290 for kernel security updates |
[production] |
06:51 |
<moritzm> |
rebooting mw2240-mw2269 for kernel security updates |
[production] |
06:49 |
<moritzm> |
rebooting mw2240-mw2269 for kernel security updates |
[production] |
04:58 |
<marostegui> |
Disable puppet on dbproxy1006 for logging testing - T201021 |
[production] |