2017-04-21
|
09:20 |
<moritzm> |
rebooting etherpad1001 (running etherpad.wikimedia.org) for update to Linux 4.9 |
[production] |
09:10 |
<jynus> |
stopping and upgrading/reconfiguring db2062 (depooled) T116557 |
[production] |
08:49 |
<jynus@naos> |
Synchronized wmf-config/db-codfw.php: Depool db2062 (duration: 01m 20s) |
[production] |
08:32 |
<akosiaris> |
looking at tcpircbot (logmsgbot) problems at tegmen |
[production] |
08:20 |
<elukey> |
rolling restart of aqs (nodejs) on aqs* to pick up upgrades |
[production] |
08:01 |
<moritzm> |
rolling restart of hhvm on application servers in eqiad to pick up ICU security update |
[production] |
07:47 |
<marostegui> |
Stop MySQL on db1071 and db1063 to reclone db1063 - T163109 |
[production] |
07:43 |
<moritzm> |
installing further icu security updates |
[production] |
06:21 |
<marostegui> |
Restart MySQL on db1065 for maintenance - T163351 |
[production] |
06:09 |
<marostegui> |
Deploy alter table enwiki.revision db1067 - T132416 |
[production] |
2017-04-20
|
22:28 |
<twentyafterfour> |
enable rate limiting in phabricator |
[production] |
22:17 |
<paravoid> |
setting tw_reuse to 1 on dbproxy1003 |
[production] |
21:47 |
<twentyafterfour> |
started phd on iridium |
[production] |
21:31 |
<twentyafterfour> |
stopped phd on iridium to reduce load on the database |
[production] |
19:26 |
<Amir1> |
deploy finished |
[production] |
19:24 |
<Amir1> |
start of ladsgroup@naos:/srv/mediawiki-staging/php-1.29.0-wmf.20$ scap sync-file php-1.29.0-wmf.20/extensions/ORES/includes/Hooks.php '[[gerrit:349271|Disable ORES in Recentchangeslinked]] (T163063)' |
[production] |
19:15 |
<mutante> |
test logging in fundraising channel |
[production] |
19:06 |
<mutante> |
fixing duplicate ircecho situation - since today it should run from tegmen, the active icinga server |
[production] |
17:51 |
<mutante> |
restarted icinga-wm (ircecho) to pick up config change |
[production] |
17:13 |
<jynus> |
stopping replication on db1040 |
[production] |
17:09 |
<andrewbogott> |
disabling puppet on serpens, seaborgium, pollux, dubnium, labservices1001, labservices1002 for tentative rollout of https://gerrit.wikimedia.org/r/#/c/348920/ |
[production] |
16:58 |
<jynus> |
moving GTID s4 eqiad replicas under db1068 |
[production] |
16:46 |
<ema> |
repool varnish-be on cp2017 |
[production] |
16:18 |
<ema> |
depool varnish-be on cp2017 |
[production] |
16:08 |
<elukey> |
uploaded piwik 2.17.1-1 to jessie-wikimedia main |
[production] |
15:17 |
<Amir1> |
deleting duplicate rows in ores_classification dated after revision 775502802 (dated April 15th) (T163337) |
[production] |
15:16 |
<XioNoX> |
disabling pybal on lvs2002 for T163323 |
[production] |
14:32 |
<moritzm> |
upgrading tor on radium to 0.2.9.10 |
[production] |
14:23 |
<moritzm> |
rebooting radium (tor relay) for kernel update to Linux 4.9 |
[production] |
14:09 |
<moritzm> |
rebooting osmium for kernel update to Linux 4.9 |
[production] |
14:06 |
<gehel> |
rolling restart of kartotherian / tilerator on maps codfw cluster |
[production] |
13:58 |
<gehel> |
rolling restart of kartotherian / tilerator on maps eqiad cluster |
[production] |
13:58 |
<marostegui> |
Stop MySQL on db1068 and db1081 for maintenance - T163110 |
[production] |
13:57 |
<jynus> |
running reset slave all on db2019 |
[production] |
13:53 |
<gehel> |
rolling restart of kartotherian / tilerator on maps-test cluster |
[production] |
13:18 |
<moritzm> |
restarting hhvm on mw2097/2098 to pick up icu security update |
[production] |
13:11 |
<elukey> |
upgrading Piwik to 2.17.1 (brief downtime during the maintenance announced) |
[production] |
12:12 |
<elukey> |
restart Yarn Resource manager on analytics1001 (hadoop master) to pick up new JVM settings |
[production] |
12:11 |
<moritzm> |
installing icu security updates |
[production] |
11:32 |
<_joe_> |
removing hack for jobqueue's refreshlinks T163418 from the jobrunners |
[production] |
11:23 |
<jynus> |
changing db2071 to replicate from db2016 |
[production] |
10:32 |
<moritzm> |
installing remaining dbus updates from jessie point update |
[production] |
10:07 |
<elukey> |
restart Yarn Resource manager on analytics1002 (hadoop master standby) to pick up new JVM settings |
[production] |
09:47 |
<Amir1> |
running the cleanup script for ores_classification in enwiki |
[production] |
09:38 |
<_joe_> |
live-hack redeployed, running scap pull on codfw jobrunners T163418 |
[production] |
09:38 |
<_joe_> |
live-hack redeployed, running scap pull on codfw jobrunners |
[production] |
09:34 |
<hashar@naos> |
Synchronized rpc/RunJobs.php: Revert "rpc: raise exception instead of die" - causes monitoring spam (duration: 01m 20s) |
[production] |
09:17 |
<_joe_> |
removed the live hack, running scap pull again on mw2154 |
[production] |
09:14 |
<_joe_> |
scap pull of live hack for T163418 on mw2154 |
[production] |
08:47 |
<_joe_> |
live-patching ./includes/jobqueue/jobs/RefreshLinksJob.php to drop all recursive jobs, T163418 |
[production] |