2020-12-22
§
|
18:29 |
<andrew@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on labstore1004.eqiad.wmnet with reason: REIMAGE |
[production] |
18:27 |
<andrew@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on labstore1004.eqiad.wmnet with reason: REIMAGE |
[production] |
17:27 |
<andrewbogott> |
shutting down labstore1004 in preparation for move and reimage |
[production] |
16:51 |
<mforns@deploy1001> |
Finished deploy [analytics/refinery@21c0c89] (thin): Regular analytics weekly train THIN [analytics/refinery@Ie7bce02179547ee4c6756d52f9956f492c5b4df6] (duration: 00m 08s) |
[production] |
16:51 |
<mforns@deploy1001> |
Started deploy [analytics/refinery@21c0c89] (thin): Regular analytics weekly train THIN [analytics/refinery@Ie7bce02179547ee4c6756d52f9956f492c5b4df6] |
[production] |
16:48 |
<volans> |
restarted ferm on ms-be1026 (failed with DNS query for 'ms-be1055.eqiad.wmnet' failed: query timed out ) |
[production] |
16:15 |
<bstorm> |
downtimed and stopped puppet on labstore1004 and labstore1005 for failover T266202 |
[production] |
15:23 |
<jgiannelos@deploy1001> |
helmfile [eqiad] Ran 'sync' command on namespace 'wikifeeds' for release 'production' . |
[production] |
15:12 |
<jgiannelos@deploy1001> |
helmfile [codfw] Ran 'sync' command on namespace 'wikifeeds' for release 'production' . |
[production] |
15:08 |
<jgiannelos@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'wikifeeds' for release 'staging' . |
[production] |
11:52 |
<marostegui> |
Set db1151 to writable T269324 |
[production] |
11:10 |
<jbond42> |
upload puppet 5.5.22 to jessie-wikimedia |
[production] |
11:02 |
<jbond42> |
upload puppet 5.5.22 to stretch-wikimedia |
[production] |
10:51 |
<volans@cumin2001> |
test SAL message from wmflib, please ignore |
[production] |
10:06 |
<volans> |
upgraded python3-wmflib to 0.0.5 on cumin2001 |
[production] |
08:52 |
<hashar> |
gerrit: running jhat heap analyzer on gerrit2001 # T263008 |
[production] |
07:27 |
<elukey> |
reboot stat100[4-8] (analytics hadoop clients) for kernel upgrades |
[production] |
00:20 |
<crusnov@deploy1001> |
Finished deploy [netbox/deploy@b17db99]: Redeploy of 2.9.10 to netbox-dev for dep test (duration: 00m 54s) |
[production] |
00:19 |
<crusnov@deploy1001> |
Started deploy [netbox/deploy@b17db99]: Redeploy of 2.9.10 to netbox-dev for dep test |
[production] |
2020-12-21
§
|
23:20 |
<legoktm@deploy1001> |
Synchronized docroot/noc/conf/index.php: noc: Fix "Currently active MediaWiki versions" (T235338) (duration: 00m 54s) |
[production] |
22:26 |
<crusnov@deploy1001> |
Finished deploy [netbox/deploy@0362a12]: Deploy of 2.9.10 to netbox-dev for script testing p2 (duration: 00m 05s) |
[production] |
22:26 |
<crusnov@deploy1001> |
Started deploy [netbox/deploy@0362a12]: Deploy of 2.9.10 to netbox-dev for script testing p2 |
[production] |
22:26 |
<crusnov@deploy1001> |
Finished deploy [netbox/deploy@0362a12]: Deploy of 2.9.10 to netbox-dev for script testing (duration: 01m 01s) |
[production] |
22:25 |
<sbassett> |
Deployed security patch T270453 |
[production] |
22:25 |
<crusnov@deploy1001> |
Started deploy [netbox/deploy@0362a12]: Deploy of 2.9.10 to netbox-dev for script testing |
[production] |
22:18 |
<chaomodus> |
Re-enabling puppet on Netbox production instances after havintg tested netbox2001 with new puppet code T266487 |
[production] |
21:42 |
<legoktm> |
manually imported debs to buster-wikimedia thirdparty/pyall component (T241195) |
[production] |
21:09 |
<chaomodus> |
merging change 643354 for Netbox 2.9 support, puppet disabled on production machines until testing completed T266487 |
[production] |
19:47 |
<dcausse> |
repool wdqs1011 |
[production] |
18:30 |
<dancy@deploy1001> |
Finished scap: Backport of l10n changes for T270619 (duration: 21m 12s) |
[production] |
18:28 |
<robh@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on rdb1011.eqiad.wmnet with reason: REIMAGE |
[production] |
18:26 |
<robh@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on rdb1011.eqiad.wmnet with reason: REIMAGE |
[production] |
18:18 |
<robh@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on rdb1012.eqiad.wmnet with reason: REIMAGE |
[production] |
18:16 |
<robh@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on rdb1012.eqiad.wmnet with reason: REIMAGE |
[production] |
18:14 |
<volans> |
uploaded python3-wmflib_0.0.5 to apt.wikimedia.org buster-wikimedia |
[production] |
18:09 |
<dancy@deploy1001> |
Started scap: Backport of l10n changes for T270619 |
[production] |
17:58 |
<jiji@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc2024.codfw.wmnet with reason: REIMAGE |
[production] |
17:56 |
<legoktm@deploy1001> |
Synchronized /srv/mediawiki-staging/php-1.36.0-wmf.22/extensions/FeaturedFeeds/includes/FeaturedFeeds.php: Don't load entire feed just to output the link to it (T266900) (duration: 01m 01s) |
[production] |
17:56 |
<jiji@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc1024.eqiad.wmnet with reason: REIMAGE |
[production] |
17:56 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mc2024.codfw.wmnet with reason: REIMAGE |
[production] |
17:54 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mc1024.eqiad.wmnet with reason: REIMAGE |
[production] |
17:33 |
<robh@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on rdb1012.eqiad.wmnet with reason: REIMAGE |
[production] |
17:31 |
<robh@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on rdb1012.eqiad.wmnet with reason: REIMAGE |
[production] |
14:43 |
<jbond42> |
disable puppet to upgrade puppet master packages |
[production] |
14:43 |
<jbond42> |
upload puppet_5.5.22-1 to wikimedia-buster |
[production] |
14:20 |
<jbond42> |
update puppet on puppetmaster1001 |
[production] |
14:16 |
<jbond42> |
update puppet on puppetmaster1003 |
[production] |
14:15 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1112 (re)pooling @ 100%: After cloning db1154:3313', diff saved to https://phabricator.wikimedia.org/P13616 and previous config saved to /var/cache/conftool/dbconfig/20201221-141555-root.json |
[production] |
14:15 |
<moritzm> |
installung sleuthkit security updates on buster |
[production] |
14:00 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1112 (re)pooling @ 75%: After cloning db1154:3313', diff saved to https://phabricator.wikimedia.org/P13615 and previous config saved to /var/cache/conftool/dbconfig/20201221-140051-root.json |
[production] |