2016-06-08
§
|
16:36 |
<hashar> |
Disabled puppet on contint1001 to prevent it from bringing back Jenkins |
[production] |
16:32 |
<otto@palladium> |
conftool action : set/pooled=no; selector: scb1002.eqiad.wmnet (tags: ['dc=eqiad', 'cluster=scb', 'service=mathoid']) |
[production] |
16:32 |
<otto@palladium> |
conftool action : set/pooled=no; selector: scb1002.eqiad.wmnet (tags: ['dc=eqiad', 'cluster=scb', 'service=ores']) |
[production] |
16:32 |
<otto@palladium> |
conftool action : set/pooled=no; selector: scb1002.eqiad.wmnet (tags: ['dc=eqiad', 'cluster=scb', 'service=mobileapps']) |
[production] |
16:32 |
<otto@palladium> |
conftool action : set/pooled=no; selector: scb1002.eqiad.wmnet (tags: ['dc=eqiad', 'cluster=scb', 'service=cxserver']) |
[production] |
16:32 |
<otto@palladium> |
conftool action : set/pooled=no; selector: scb1002.eqiad.wmnet (tags: ['dc=eqiad', 'cluster=scb', 'service=citoid']) |
[production] |
16:32 |
<otto@palladium> |
conftool action : set/pooled=no; selector: scb1002.eqiad.wmnet (tags: ['dc=eqiad', 'cluster=scb', 'service=graphoid']) |
[production] |
16:24 |
<ottomata> |
restarting hadoop-yarn-resourcemanager on analytics1002 to make analytics1001 active |
[production] |
16:07 |
<mobrovac> |
scb1002 enabling back puppet |
[production] |
16:02 |
<elukey> |
temporary set a 10TB upperbound to the Kafka webrequest_text topic to free space (T136690) |
[production] |
15:43 |
<ottomata> |
restarting zk in codfw and eqiad 1 by 1 to apply maxClientCnxns=1024 |
[production] |
15:12 |
<ottomata> |
restarting zookeeper 1 by 1 in eqiad |
[production] |
15:03 |
<_joe_> |
contint1001: systemctl mask zuul,zuul-merger |
[production] |
14:57 |
<elukey> |
rolling out the new Varnishkafka version in cache misc (didn't do it before since there was an outage ongoing) |
[production] |
14:53 |
<jynus> |
rebooting gallium with netboot for hardware maintenance |
[production] |
14:44 |
<mobrovac> |
scb1001 enabling and running puppet on scb1001 |
[production] |
13:44 |
<jynus> |
running fsck.ext3 /dev/sda2 in read-write mode for gallium |
[production] |
13:42 |
<ottomata> |
powercycling scb2001 and scb2002 |
[production] |
13:30 |
<akosiaris> |
disabling puppet on scb1001 & scb1002 |
[production] |
13:30 |
<mobrovac> |
change-prop stopped on scb1002 |
[production] |
13:29 |
<akosiaris> |
stopping changeprop on scb1001 |
[production] |
13:26 |
<ottomata> |
powercycling scb1002 |
[production] |
13:18 |
<ottomata> |
powercycling scb1001 |
[production] |
13:08 |
<elukey> |
rolling out new varnishkafka package in cache misc |
[production] |
12:09 |
<jynus> |
mounted temporarily / partition from gallium sda on db1085:/mnt |
[production] |
10:40 |
<moritzm> |
uploaded jenkins 1.651.2 for jessie-wikimedia to carbon |
[production] |
10:13 |
<elukey> |
rolling out the new varnishkafka package to cache maps |
[production] |
10:04 |
<aaron@tin> |
Synchronized php-1.28.0-wmf.5/includes/deferred/LinksDeletionUpdate.php: fd44d649787ede78687b4cd2ef21e44a4c8b843b (duration: 00m 33s) |
[production] |
08:28 |
<hashar> |
stopping Jenkins / zuul / zuul-merger / puppet on gallium |
[production] |
08:15 |
<elukey> |
lowering down webrequest_text kafka topic retention time from 7 days to 4 days to free disk space (T136690) |
[production] |
08:14 |
<hashar> |
Jenkins has bunch of executors dead for what ever reason preventing jobs from running :( |
[production] |
07:53 |
<mobrovac> |
change-prop deploying 84d56e53a |
[production] |
06:59 |
<moritzm> |
enabling ferm on palladium (will lead to temporary puppet failures) |
[production] |
02:58 |
<l10nupdate@tin> |
ResourceLoader cache refresh completed at Wed Jun 8 02:58:28 UTC 2016 (duration 6m 31s) |
[production] |
02:51 |
<mwdeploy@tin> |
scap sync-l10n completed (1.28.0-wmf.5) (duration: 06m 49s) |
[production] |
02:51 |
<legoktm> |
/ on gallium is currently read-only for some reason |
[production] |
02:29 |
<mwdeploy@tin> |
scap sync-l10n completed (1.28.0-wmf.4) (duration: 11m 11s) |
[production] |
00:11 |
<awight_> |
update fundraising-tools from b2425aef2154d6b689900f4848cca02880321230 to 28bc2da677caa795c58f906db76a1f8d612ac899 |
[production] |
2016-06-07
§
|
23:46 |
<aaron@tin> |
Synchronized php-1.28.0-wmf.5/includes/deferred/LinksUpdate.php: 6d85caaa9bb5918cb2888fc82f2c7c346cf746a2 (duration: 00m 25s) |
[production] |
23:35 |
<SMalyshev> |
redeploying WDQS to update the Updater for T128947 fix |
[production] |
23:35 |
<tgr@tin> |
Synchronized wmf-config/InitialiseSettings.php: SWAT [[gerrit:292518]] User rights configuration for meta. wmf-supportsafety group (duration: 00m 26s) |
[production] |
23:20 |
<tgr@tin> |
Finished scap: (no message) (duration: 24m 51s) |
[production] |
23:02 |
<awight> |
update paymentswiki from 28e10141454ef53085aed4c6619a34d3a4b43c58 to de11bfe2273d0bcaa0e713389b2d91e8b3567a1d; add PP cert |
[production] |
22:56 |
<tgr> |
scapping AuthManager backports + feature switch enabled on group0 T135504 |
[production] |
22:56 |
<tgr@tin> |
Started scap: (no message) |
[production] |
22:10 |
<mutante> |
icinga config broken: Error: Could not find any host matching 'relforge1001' |
[production] |
21:35 |
<twentyafterfour> |
restarted apache on iridium to deploy D250 |
[production] |
20:02 |
<andrewbogott> |
dist-upgrade on labvirt1010, in hopes of resolving a nova-compute lockup (possibly related to a kvm upgrade earlier today) |
[production] |
20:00 |
<thcipriani@tin> |
rebuilt wikiversions.php and synchronized wikiversions files: group0 to 1.28.0-wmf.5 |
[production] |
19:44 |
<jynus> |
restarting es2017 due to a bunch of ACPI errors (probably memory-caused) |
[production] |