401-450 of 10000 results (40ms)
2019-02-25 §
12:27 <gilles@deploy1001> Started deploy [3d2png/deploy@ca39432]: (no justification provided) [production]
12:22 <gilles@deploy1001> Finished deploy [3d2png/deploy@ca39432]: (no justification provided) (duration: 01m 15s) [production]
12:21 <gilles@deploy1001> Started deploy [3d2png/deploy@ca39432]: (no justification provided) [production]
12:21 <chicocvenancio> PAWS: killed proxy and hub pods to attempt to get it to see routes to open notebooks servers to no avail. Restarted BernhardHumm's notebook pod T217010 [tools]
12:19 <gtirloni> deleted local crontab on tools-bastion-03 (T217019) [tools.wikiloves]
11:49 <moritzm> rolling out intel-microcode 3.20180807a.2 on all jessie/stretch servers, tests on a number of previously unsupported servers with Westmere CPU were successful and I've verified that all other microcode files are identical compared to the current 3.20180807a.1 microcode [production]
11:19 <jijiki> Reimageing thumbor1001 - T214597 [production]
10:40 <jdrewniak@deploy1001> Synchronized portals: Wikimedia Portals Update: [[gerrit:492636| Bumping portals to master (T128546, T202497)]] (duration: 00m 46s) [production]
10:39 <jdrewniak@deploy1001> Synchronized portals/wikipedia.org/assets: Wikimedia Portals Update: [[gerrit:492636| Bumping portals to master (T128546, T202497)]] (duration: 00m 46s) [production]
10:32 <gtirloni> restarted nfsd on labstore1004 [admin]
10:31 <gtirloni> labstore1004 restarted nfsd and killed stuck rpc.mountd.real processed (T216988) [production]
10:16 <jijiki> Depooling thumbor1001 to reimage - T214597 [production]
09:54 <marostegui> Deploy schema change on db1074, this will generate lag on labsdb:s2 - T187295 [production]
09:50 <gtirloni> rebooted tools-sgeexec-09{16,22,40} (T216988) [tools]
09:41 <gtirloni> rebooted tools-sgeexec-09{16,22,40} [tools]
09:31 <gtirloni> commented cronjobs, stop webservices and truncated Worker*.err files (T216988) [tools.iabot]
09:07 <marostegui@deploy1001> Synchronized wmf-config/InitialiseSettings.php: Increase ParserCache TTL from 24 days to 30 - T210992 (duration: 00m 46s) [production]
08:52 <marostegui> Deploy schema change on s2 on codfw master - lag will happen on s2 codfw - T187295 [production]
08:49 <_joe_> generating mcrouter certificate for mw2151 T192457 [production]
08:37 <zhuyifei1999_> uncordon tools-worker-1015.tools.eqiad.wmflabs [tools]
08:34 <legoktm> hard rebooted tools-worker-1015 via horizon [tools]
07:48 <zhuyifei1999_> systemd stuck in D state. :( [tools]
07:44 <zhuyifei1999_> I saved dmesg and process list to a few files in /root if that helps debugging [tools]
07:43 <zhuyifei1999_> D states are not responding to SIGKILL. Will reboot. [tools]
07:37 <zhuyifei1999_> tools-worker-1015.tools.eqiad.wmflabs having severe NFS issues (all NFS accessing processes are stuck in D state). Draining. [tools]
07:09 <marostegui@deploy1001> Synchronized wmf-config/db-eqiad.php: Fully repool db1104 after MySQL upgrade (duration: 00m 45s) [production]
06:28 <marostegui@deploy1001> Synchronized wmf-config/db-eqiad.php: Repool db1104 in API after MySQL upgrade (duration: 00m 45s) [production]
06:13 <marostegui@deploy1001> Synchronized wmf-config/db-eqiad.php: Slowly repool db1104 after MySQL upgrade (duration: 00m 45s) [production]
06:02 <marostegui> Stop MySQL on db1104 for mysql upgrade [production]
06:02 <marostegui@deploy1001> Synchronized wmf-config/db-eqiad.php: Depool db1104 for MySQL upgrade (duration: 00m 50s) [production]
2019-02-24 §
22:59 <Lokal_Profil> migrated to Stretch [tools.slumpartikel]
22:13 <Lokal_Profil> migrated to Stretch [tools.wakt]
21:56 <Lokal_Profil> migrated to Stretch [tools.mapillary-commons]
21:49 <eileen> civicrm revision changed from 1b5d974569 to d1fc603677, config revision is 00f9c08766 [production]
21:09 <Krinkle> Reloading Zuul to deploy https://gerrit.wikimedia.org/r/492561, T216964 [releng]
18:20 <elukey> clean up 2017/2018 log files in /var/log/jmxtrans on kafka1013-22 - root partitions filling up [production]
18:15 <elukey> clean up 2017/2018 log files in /var/log/jmxtrans - root partition almost filled up [production]
13:35 <framawiki> moved webservice to stretch, now using kubernetes backend https://wikitech.wikimedia.org/wiki/News/Toolforge_Trusty_deprecation [tools.totoazero]
13:26 <framawiki> moved jobs to stretch, recreated the venv https://wikitech.wikimedia.org/wiki/News/Toolforge_Trusty_deprecation [tools.totoazero]
10:24 <elukey> restart check webrequest service on an-coord1001 (failed due to /mnt/hdfs being unavail) [analytics]
10:22 <elukey> force remount of /mnt/hdfs on an-coord1001 (fuse-hdfs stuck) [production]
04:54 <legoktm> rebuilding docker image for https://gerrit.wikimedia.org/r/485241 [releng]
04:41 <legoktm> legoktm@contint1001:/srv/zuul/git/mediawiki/tools$ sudo -u zuul rm -rf phan [releng]
2019-02-23 §
22:25 <legoktm> deploying https://gerrit.wikimedia.org/r/492497 [releng]
02:32 <Krinkle> Reloading Zuul to deploy https://gerrit.wikimedia.org/r/492427 [releng]
01:59 <Krinkle> Reloading Zuul to deploy ttps://gerrit.wikimedia.org/r/492425 [releng]
2019-02-22 §
20:59 <Krinkle> Updating docker-pkg files on contint1001 for https://gerrit.wikimedia.org/r/492377 [releng]
18:02 <gehel> rolling upgrade on elasticsearch / cirrus / eqiad completed - T215931 [production]
18:00 <gehel@cumin2001> END (PASS) - Cookbook sre.elasticsearch.force-shard-allocation (exit_code=0) [production]
18:00 <gehel@cumin2001> START - Cookbook sre.elasticsearch.force-shard-allocation [production]