2019-10-01
19:00 |
<ayounsi@cumin1001> |
START - Cookbook sre.hosts.rotate-pdu-password |
[production] |
18:59 |
<dduvall@deploy1001> |
Pruned MediaWiki: 1.34.0-wmf.20 (duration: 02m 11s) |
[production] |
18:57 |
<dduvall@deploy1001> |
Pruned MediaWiki: 1.34.0-wmf.19 (duration: 02m 12s) |
[production] |
18:54 |
<dduvall@deploy1001> |
Pruned MediaWiki: 1.34.0-wmf.17 (duration: 02m 48s) |
[production] |
18:48 |
<dduvall@deploy1001> |
Pruned MediaWiki: 1.34.0-wmf.16 (duration: 18m 45s) |
[production] |
17:53 |
<ayounsi@cumin1001> |
END (ERROR) - Cookbook sre.hosts.rotate-pdu-password (exit_code=97) |
[production] |
17:52 |
<thcipriani> |
gerrit restart for new config changes incoming |
[production] |
17:52 |
<ayounsi@cumin1001> |
START - Cookbook sre.hosts.rotate-pdu-password |
[production] |
17:50 |
<ayounsi@cumin1001> |
END (ERROR) - Cookbook sre.hosts.rotate-pdu-password (exit_code=97) |
[production] |
17:48 |
<ayounsi@cumin1001> |
START - Cookbook sre.hosts.rotate-pdu-password |
[production] |
17:48 |
<XioNoX> |
rotate PDU passwords - T233053 |
[production] |
17:18 |
<filippo@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
17:14 |
<filippo@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
17:09 |
<krinkle@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: T156095 - c28baa1862401 (duration: 00m 59s) |
[production] |
17:07 |
<mutante> |
Welcome new deployer Andrew Kostka (WMDE) (T233202) |
[production] |
17:07 |
<marxarelli> |
cutting wmf/1.34.0-wmf.25 |
[production] |
16:16 |
<_joe_> |
manually downgrading php-geoip on deploy*, it was still at the 7.0-only version from the distro |
[production] |
16:14 |
<@> |
helmfile [CODFW] Ran 'sync' command on namespace 'restrouter' for release 'production' . |
[production] |
16:14 |
<@> |
helmfile [CODFW] Ran 'sync' command on namespace 'restrouter' for release 'production' . |
[production] |
16:10 |
<@> |
helmfile [EQIAD] Ran 'sync' command on namespace 'restrouter' for release 'production' . |
[production] |
16:06 |
<@> |
helmfile [STAGING] Ran 'sync' command on namespace 'restrouter' for release 'staging' . |
[production] |
15:36 |
<_joe_> |
temporarily uninstalling the math-rendering-related packages from mwdebug2002, test for T195847 |
[production] |
15:36 |
<elukey> |
powercycle an-conf1001 to test some bios settings |
[production] |
15:12 |
<jbond42> |
puppetmaster2001 is back online |
[production] |
14:33 |
<dcausse> |
created cirrussearch indices for nqowiki (T234326) |
[production] |
14:18 |
<moritzm> |
rebooting krb1001 for some tests |
[production] |
14:17 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
14:17 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.downtime |
[production] |
14:10 |
<hashar> |
Restarting CI Jenkins |
[production] |
14:08 |
<cdanis> |
cdanis@puppetmaster2001.codfw.wmnet ~ (cd /var/lib/git/labs/private ; git rev-parse HEAD | sudo tee /srv/config-master/labsprivate-sha1.txt ) |
[production] |
14:08 |
<cdanis> |
cdanis@puppetmaster2001.codfw.wmnet ~ (cd /var/lib/git/operations/puppet ; git rev-parse HEAD | sudo tee /srv/config-master/puppet-sha1.txt ) |
[production] |
14:08 |
<herron> |
beginning rolling reboots of eqiad and codfw logstash collectors |
[production] |
14:02 |
<moritzm> |
rebooting mw1265 for some tests |
[production] |
14:01 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
14:01 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.downtime |
[production] |
13:59 |
<cdanis> |
cdanis@puppetmaster2001.codfw.wmnet ~ sudo touch /srv/config-master/puppet-sha1.txt /srv/config-master/labsprivate-sha1.txt && sudo chown gitpuppet:gitpuppet /srv/config-master/puppet-sha1.txt /srv/config-master/labsprivate-sha1.txt |
[production] |
13:42 |
<jbond@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
13:40 |
<jbond@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
13:24 |
<jbond42> |
reimage puppetmaster2001 |
[production] |
12:37 |
<hashar> |
Gerrit misbehaved temporarily due to human operator error (hashar ran jstack -l -m, which brings the JVM to a halt) |
[production] |
11:16 |
<jbond42> |
update puppet.ulsfo.wmnet to point to puppetmaster1001 |
[production] |
10:45 |
<jbond42> |
update puppet.eqsin.wmnet to point to puppetmaster1001 |
[production] |
10:17 |
<moritzm> |
upgrading ferm on remaining mw servers 2.4.2pre T153468 |
[production] |
09:35 |
<moritzm> |
run systemctl reset-failed on puppetmaster2002 to clear failed puppet-master.service |
[production] |
09:19 |
<moritzm> |
upgrading ferm on a number of systems to 2.4.2pre T153468 |
[production] |
09:07 |
<vgutierrez> |
restarting acme-chief on acmechief1001 to catch up with python3-cryptography upgrades - T234131 |
[production] |
09:04 |
<vgutierrez> |
upgrading python3-cryptography to version 2.6.1-3+deb10u1~wmf1 on acme-chief hosts - T234131 |
[production] |
09:03 |
<moritzm> |
rebalancing ganeti/row_B after rolling reboot |
[production] |
08:57 |
<vgutierrez> |
upgrading python3-cryptography to version 2.6.1-3+deb10u1~wmf1 on acmechief-test1001 - T234131 |
[production] |
08:41 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |