2015-08-30
§
|
20:53 |
<hashar> |
beta-scap-eqiad failing due to some mwdeploy not being able to ssh to other hosts. Attempted to add the ssh key again following https://phabricator.wikimedia.org/T109007#1537572, which fixed it |
[releng] |
14:38 |
<multichill> |
Made local change to unused_images.py to get it to work, see https://phabricator.wikimedia.org/T110829 |
[tools.heritage] |
13:24 |
<valhallasw`cloud> |
force-restarting grrrit-wm |
[tools.lolrrit-wm] |
13:23 |
<valhallasw`cloud> |
killed wikibugs-backup and grrrit-wm on tools-webproxy-01 |
[tools] |
13:20 |
<valhallasw`cloud> |
disabling 503 error page |
[tools] |
13:01 |
<YuviPanda> |
rebooted tools-bastion-01 to see if that remounts NFS |
[tools] |
12:58 |
<godog> |
lvchange -ay labstore/others on labstore1002 |
[production] |
12:52 |
<godog> |
start-nfs on labstore1002 |
[production] |
12:31 |
<godog> |
lvchange -ay labstore/tools on labstore1002 |
[production] |
12:30 |
<godog> |
also disabled puppet on labstore1002 while investigating |
[production] |
12:15 |
<godog> |
trying to manually assemble missing raid on labstore1002 with mdadm --assemble /dev/md/slice51 --uuid 0747643d:b89b36ff:57156095:c33694fc --verbose |
[production] |
11:18 |
<YuviPanda> |
powered labstore1002 back up |
[production] |
11:17 |
<YuviPanda> |
shut down labstore1002, going to powercycle from mgmt |
[production] |
10:57 |
<valhallasw`cloud> |
started wikibugs from tools-webproxy-01 as well, still need to check if the phab<->redis part is still alive |
[tools] |
10:55 |
<valhallasw`cloud> |
restarted grrrit-wm from tools-webproxy-01 |
[tools] |
10:53 |
<valhallasw`cloud> |
Set error page on tools webserver via Hiera + some manual hacking (https://wikitech.wikimedia.org/wiki/Hiera:Tools) |
[tools] |
10:34 |
<YuviPanda> |
disabled backups on labstore1002 to prevent overwriting of good backups on 2001 |
[production] |
10:07 |
<YuviPanda> |
ran start-nfs on labstore1002 |
[production] |
09:54 |
<YuviPanda> |
rebooted labstore1002 |
[production] |
09:14 |
<multichill> |
Updated ~/pywikibot to latest version, but still getting a FamilyMaintenanceWarning |
[tools.heritage] |
04:16 |
<l10nupdate@tin> |
ResourceLoader cache refresh completed at Sun Aug 30 04:16:17 UTC 2015 (duration 16m 16s) |
[production] |
02:23 |
<l10nupdate@tin> |
LocalisationUpdate completed (1.26wmf20) at 2015-08-30 02:23:07+00:00 |
[production] |
02:20 |
<l10nupdate@tin> |
Synchronized php-1.26wmf20/cache/l10n: l10nupdate for 1.26wmf20 (duration: 05m 36s) |
[production] |
2015-08-29
§
|
15:26 |
<jynus> |
killing idle mysql connections from phabricator and setting wait and interactive timeout to 60 |
[production] |
09:30 |
<jynus> |
SCAP failed, cannot depool db1028 |
[production] |
09:28 |
<jynus@tin> |
Synchronized wmf-config/db-eqiad.php: Depool db1028, return ES servers back from maintenance (duration: 00m 03s) |
[production] |
09:05 |
<jynus> |
about to depool db1028 due to disk issue |
[production] |
04:17 |
<l10nupdate@tin> |
ResourceLoader cache refresh completed at Sat Aug 29 04:17:55 UTC 2015 (duration 17m 54s) |
[production] |
02:24 |
<l10nupdate@tin> |
LocalisationUpdate completed (1.26wmf20) at 2015-08-29 02:24:01+00:00 |
[production] |
02:21 |
<l10nupdate@tin> |
Synchronized php-1.26wmf20/cache/l10n: l10nupdate for 1.26wmf20 (duration: 05m 48s) |
[production] |
01:01 |
<bd808> |
Deleted local mwdeploy user on deployment-tmh01 that was causing scap failures |
[releng] |
00:21 |
<bd808> |
stopping and starting jobrunner and jobchron on deployment-tmh01 |
[releng] |
2015-08-28
§
|
23:44 |
<krenair@tin> |
Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/234679/ (duration: 06m 56s) |
[production] |
23:40 |
<bd808> |
Cherry-picked https://gerrit.wikimedia.org/r/#/c/234699/ |
[releng] |
22:51 |
<bd808@tin> |
Synchronized wmf-config/CommonSettings-labs.php: Use ffmpeg instead of avconv on labs beta (I250fe33) (duration: 06m 05s) |
[production] |
22:05 |
<ori> |
disabling puppet on tin for a few minutes to test an ssh-agent-proxy change |
[production] |
20:17 |
<bd808> |
cherry-picked https://gerrit.wikimedia.org/r/#/c/234599 to set up new tmh01 as scap target |
[releng] |
20:15 |
<bd808> |
restored 3 cherry picks that were lost when rebuilding the ops/puppet git repo |
[releng] |
20:07 |
<bd808> |
deployment-puppetmaster has only one cherry-pick; looks like maybe dcausse dropped the prior stack when working on Icc95ac8 |
[releng] |
20:04 |
<catrope@tin> |
Synchronized php-1.26wmf20/resources/src/mediawiki.legacy/shared.css: T110716 (duration: 00m 12s) |
[production] |
18:17 |
<bd808> |
Cleaned up some puppet groups for deployment-prep that no longer exist in ops/puppet |
[releng] |
18:09 |
<robh> |
updating ldap-codfw cert |
[production] |
18:03 |
<bd808> |
Building deployment-tmh01.deployment-prep.eqiad.wmflabs to replace deployment-videoscaler01 |
[releng] |
18:01 |
<bd808> |
Nope, I deleted deployment-videoscaler01 |
[releng] |