2019-12-16
§
|
10:17 |
<elukey> |
stop hadoop-related timers on stat1007 |
[analytics] |
10:14 |
<hashar> |
Restarting CI Jenkins due to out of sync state between Zuul Gearman and what is actually running (some jobs got lost) |
[production] |
10:04 |
<joal> |
Killing user-app eating all cluster (application_1573208467349_190044) |
[analytics] |
09:50 |
<marostegui> |
Stop replication in the same position in labsdb1010 and labsdb1012 - T238399 |
[production] |
09:35 |
<hashar> |
doc1001: sudo -u doc-uploader rm -fR /srv/docroot/org/wikimedia/doc/DOCKER-mediawiki-core |
[releng] |
09:24 |
<hashar> |
Reloading Jenkins CI |
[production] |
09:14 |
<godog> |
upgrade hw raid firmware on ms-be2016 and reboot - T240798 |
[production] |
09:14 |
<filippo@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
09:13 |
<filippo@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
09:05 |
<joal> |
Rerun webrequest-load-wf-text-2019-12-14-18 with updated error-checking parameters (all false positive) |
[analytics] |
09:04 |
<Urbanecm> |
mwscript importImages.php --wiki=commonswiki --comment-ext=txt --user=Coffeeandcrumbs /home/urbanecm/T240825 (T240825) |
[production] |
08:54 |
<ema> |
cp1077: ats-backend-restart to increase RAM cache size T238494 |
[production] |
08:53 |
<moritzm> |
powercycling ms-be2016 T240798 |
[production] |
08:49 |
<elukey> |
re-run webrequest-load 2019-12-14-13 and 2019-12-15-12 with higher mapreduce limits (modified version of refinery on hdfs /user/elukey with https://gerrit.wikimedia.org/r/#/c/analytics/refinery/+/557794/) |
[analytics] |
08:36 |
<ema> |
cp1075: repool all services T240826 |
[production] |
08:12 |
<ema> |
cp1075: wipe varnish-fe and ats-be caches due to missed purges T240826 |
[production] |
08:08 |
<ema> |
cp1075: manually start vhtcpd.service T240826 |
[production] |
07:52 |
<ema> |
cp1075: depool, vhtcpd not running |
[production] |
07:38 |
<marostegui> |
Disable auto-learn on db21[03-35] T240823 |
[production] |
07:27 |
<marostegui> |
Disable auto-learn on db[1126-1138].eqiad.wmnet T240823 |
[production] |
07:22 |
<elukey> |
stop camus timers as prep step for maintenance (if we'll do it) |
[analytics] |
07:13 |
<_joe_> |
restarting cpjobqueue on scb1001 to check if processing rate of recentChanges recovers T240518 |
[production] |
07:11 |
<marostegui> |
Stop replication in the same position in labsdb1010 and labsdb1012 - T238399 |
[production] |
07:09 |
<onimisionipe> |
depool maps2001 for postgres reinit - T239728 |
[production] |
06:59 |
<onimisionipe> |
pool maps2004. osm import is complete - T239728 |
[production] |
06:58 |
<_joe_> |
clearing apcu across multiple api servers to allow metrics to be collected again (task coming soon) |
[production] |
06:56 |
<marostegui> |
Force re-learn cycle on db1130 |
[production] |
06:42 |
<marostegui> |
Depool labsdb1010 - T238399 |
[production] |
06:39 |
<marostegui> |
Recreate views on commonswiki,testcommonswiki for protected_titles on all labsdb hosts - T233135 |
[production] |
06:29 |
<marostegui> |
Remove triggers for ar_comment on db1125:3314 T234704 |
[production] |
06:28 |
<marostegui> |
Stop replication on db1121 for schema change |
[production] |
06:28 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1121 for schema change', diff saved to https://phabricator.wikimedia.org/P9871 and previous config saved to /var/cache/conftool/dbconfig/20191216-062809-marostegui.json |
[production] |
03:52 |
<tstarling@deploy1001> |
Synchronized docroot/mediawiki.org/keys/keys.html: (no justification provided) (duration: 00m 57s) |
[production] |
03:49 |
<tstarling@deploy1001> |
Synchronized docroot/mediawiki.org/keys/keys.txt: (no justification provided) (duration: 01m 01s) |
[production] |
2019-12-14
§
|
22:56 |
<legoktm> |
dropped concurrency down to 1 |
[library-upgrader] |
22:52 |
<legoktm> |
restarted libup-celery now that gerrit-replica has been restarted |
[library-upgrader] |
22:50 |
<hashar> |
Restarted Gerrit on gerrit2001 # T240763 |
[production] |
20:06 |
<legoktm> |
systemctl stop libup-celery # because of T240763 |
[library-upgrader] |
19:49 |
<James_F> |
Zuul: Deploying https://gerrit.wikimedia.org/r/557275 |
[releng] |
19:49 |
<James_F> |
Zuul: Deploying https://gerrit.wikimedia.org/r/557138 |
[releng] |
17:49 |
<Jayprakash12345> |
Support for locally uploaded books in Wikisource (T240683) |
[tools.bookreader] |
11:47 |
<legoktm> |
temporarily bump concurrency to 2 to clear backlog |
[library-upgrader] |
10:48 |
<valhallasw`cloud> |
re-enabling puppet on tools-sgeexec-0912, likely left-over from NFS maintenance (no reason was specified). |
[tools] |
08:05 |
<legoktm> |
tools.coverage@tools-sgebastion-07:~/extensions$ rm -rf \$DOC_BASENAME/ -v |
[tools.coverage] |
03:29 |
<Cam11598> |
7:29:02 PM <ChanServ> Flags +AV were set on masumreza in #cvn-unifications. |
[cvn] |
03:28 |
<Cam11598> |
7:27:57 PM <ChanServ> Flags +AV were set on masumreza in #cvn-sw. |
[cvn] |