51-100 of 10000 results (24ms)
2020-12-22 §
08:52 <hashar> gerrit: running jhat heap analyzer on gerrit2001 # T263008 [production]
07:27 <elukey> reboot stat100[4-8] (analytics hadoop clients) for kernel upgrades [production]
00:20 <crusnov@deploy1001> Finished deploy [netbox/deploy@b17db99]: Redeploy of 2.9.10 to netbox-dev for dep test (duration: 00m 54s) [production]
00:19 <crusnov@deploy1001> Started deploy [netbox/deploy@b17db99]: Redeploy of 2.9.10 to netbox-dev for dep test [production]
2020-12-21 §
23:20 <legoktm@deploy1001> Synchronized docroot/noc/conf/index.php: noc: Fix "Currently active MediaWiki versions" (T235338) (duration: 00m 54s) [production]
22:26 <crusnov@deploy1001> Finished deploy [netbox/deploy@0362a12]: Deploy of 2.9.10 to netbox-dev for script testing p2 (duration: 00m 05s) [production]
22:26 <crusnov@deploy1001> Started deploy [netbox/deploy@0362a12]: Deploy of 2.9.10 to netbox-dev for script testing p2 [production]
22:26 <crusnov@deploy1001> Finished deploy [netbox/deploy@0362a12]: Deploy of 2.9.10 to netbox-dev for script testing (duration: 01m 01s) [production]
22:25 <sbassett> Deployed security patch T270453 [production]
22:25 <crusnov@deploy1001> Started deploy [netbox/deploy@0362a12]: Deploy of 2.9.10 to netbox-dev for script testing [production]
22:18 <chaomodus> Re-enabling puppet on Netbox production instances after havintg tested netbox2001 with new puppet code T266487 [production]
21:42 <legoktm> manually imported debs to buster-wikimedia thirdparty/pyall component (T241195) [production]
21:09 <chaomodus> merging change 643354 for Netbox 2.9 support, puppet disabled on production machines until testing completed T266487 [production]
19:47 <dcausse> repool wdqs1011 [production]
18:30 <dancy@deploy1001> Finished scap: Backport of l10n changes for T270619 (duration: 21m 12s) [production]
18:28 <robh@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on rdb1011.eqiad.wmnet with reason: REIMAGE [production]
18:26 <robh@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on rdb1011.eqiad.wmnet with reason: REIMAGE [production]
18:18 <robh@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on rdb1012.eqiad.wmnet with reason: REIMAGE [production]
18:16 <robh@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on rdb1012.eqiad.wmnet with reason: REIMAGE [production]
18:14 <volans> uploaded python3-wmflib_0.0.5 to apt.wikimedia.org buster-wikimedia [production]
18:09 <dancy@deploy1001> Started scap: Backport of l10n changes for T270619 [production]
17:58 <jiji@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc2024.codfw.wmnet with reason: REIMAGE [production]
17:56 <legoktm@deploy1001> Synchronized /srv/mediawiki-staging/php-1.36.0-wmf.22/extensions/FeaturedFeeds/includes/FeaturedFeeds.php: Don't load entire feed just to output the link to it (T266900) (duration: 01m 01s) [production]
17:56 <jiji@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc1024.eqiad.wmnet with reason: REIMAGE [production]
17:56 <jiji@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mc2024.codfw.wmnet with reason: REIMAGE [production]
17:54 <jiji@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mc1024.eqiad.wmnet with reason: REIMAGE [production]
17:33 <robh@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on rdb1012.eqiad.wmnet with reason: REIMAGE [production]
17:31 <robh@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on rdb1012.eqiad.wmnet with reason: REIMAGE [production]
14:43 <jbond42> disable puppet to upgrade puppet master packages [production]
14:43 <jbond42> upload puppet_5.5.22-1 to wikimedia-buster [production]
14:20 <jbond42> update puppet on puppetmaster1001 [production]
14:16 <jbond42> update puppet on puppetmaster1003 [production]
14:15 <marostegui@cumin1001> dbctl commit (dc=all): 'db1112 (re)pooling @ 100%: After cloning db1154:3313', diff saved to https://phabricator.wikimedia.org/P13616 and previous config saved to /var/cache/conftool/dbconfig/20201221-141555-root.json [production]
14:15 <moritzm> installung sleuthkit security updates on buster [production]
14:00 <marostegui@cumin1001> dbctl commit (dc=all): 'db1112 (re)pooling @ 75%: After cloning db1154:3313', diff saved to https://phabricator.wikimedia.org/P13615 and previous config saved to /var/cache/conftool/dbconfig/20201221-140051-root.json [production]
13:45 <marostegui@cumin1001> dbctl commit (dc=all): 'db1112 (re)pooling @ 50%: After cloning db1154:3313', diff saved to https://phabricator.wikimedia.org/P13614 and previous config saved to /var/cache/conftool/dbconfig/20201221-134548-root.json [production]
13:30 <marostegui@cumin1001> dbctl commit (dc=all): 'db1112 (re)pooling @ 25%: After cloning db1154:3313', diff saved to https://phabricator.wikimedia.org/P13613 and previous config saved to /var/cache/conftool/dbconfig/20201221-133044-root.json [production]
12:57 <hashar> Gerrit briefly paused due to erroneous run of `jmap -clstats` [production]
12:22 <hashar> Running jhat on gerrit1001 to analyze a heap dump, expect CPU usage [production]
11:48 <moritzm> installing libxstream-java security updates on buster [production]
11:31 <moritzm> installing php-pear security updates on buster [production]
09:47 <_joe_> logging out of the long-running root screen session on maps1010 [production]
09:46 <_joe_> logging out of the long-running root screen session on maps1001 [production]
09:46 <_joe_> systemctl reset-failed on deneb, timeout downloading a docker image from the registry [production]
09:24 <dcausse> depooling wdqs1011 (lag) [production]
09:19 <_joe_> powercycling wdqs1011, unresponsive to ssh [production]
08:31 <dcausse@deploy1001> Finished deploy [wdqs/wdqs@512d713]: GUI updates (T269224+i18n updates) (duration: 08m 57s) [production]
08:22 <dcausse@deploy1001> Started deploy [wdqs/wdqs@512d713]: GUI updates (T269224+i18n updates) [production]
08:15 <marostegui> Add ips to the x2 instances on dbctl T269324 [production]
07:52 <jiji@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc2023.codfw.wmnet with reason: REIMAGE [production]