2021-04-28
ยง
|
12:55 |
<moritzm> |
upgrading snapshot hosts to PHP 7.4.32 |
[production] |
12:48 |
<jayme> |
restarting pybal on lvs2009 - T271573 |
[production] |
12:45 |
<moritzm> |
upgrading labweb to PHP 7.4.32 |
[production] |
12:43 |
<jmm@cumin2001> |
START - Cookbook sre.cassandra.roll-restart |
[production] |
12:42 |
<jayme> |
restarting pybal on lvs5003,lvs4007 - T271573 |
[production] |
12:39 |
<jayme> |
restarting pybal on lvs2010 - T271573 |
[production] |
12:36 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) |
[production] |
12:28 |
<apergos> |
manually edited /srv/deployment/dumps/dumps-cache/config on snapshots1011,12,13 to change deploy1001 to deploy1002 (where did it get the old value from? these are new installs!) |
[production] |
12:16 |
<moritzm> |
rolling restart of cassandra in restbase-dev to pick up Java security updates |
[production] |
12:15 |
<jmm@cumin2001> |
START - Cookbook sre.cassandra.roll-restart |
[production] |
12:15 |
<jmm@cumin2001> |
END (FAIL) - Cookbook sre.cassandra.roll-restart (exit_code=99) |
[production] |
12:15 |
<jmm@cumin2001> |
START - Cookbook sre.cassandra.roll-restart |
[production] |
11:53 |
<jayme> |
switching SRV record _etcd._tcp to new etcd cluster (for codfw, eqsin, ulsfo) |
[production] |
11:22 |
<Urbanecm> |
EU B&C window done |
[production] |
11:20 |
<urbanecm@deploy1002> |
Synchronized php-1.37.0-wmf.3/extensions/Popups/: 8d0ae5e8fedefa911fc216bfc810d7a6169ea7e5: Separate reference preview settings in beta & non-beta (T281235) (duration: 01m 08s) |
[production] |
11:16 |
<urbanecm@deploy1002> |
Synchronized wmf-config/InitialiseSettings.php: ddbc378e41783356e28cd90bbefa08624ea2844c: Enable partial action blocks on testwiki (T280528) (duration: 01m 07s) |
[production] |
11:05 |
<aborrero@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudgw1002.eqiad.wmnet with reason: REIMAGE |
[production] |
11:03 |
<aborrero@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudgw1001.eqiad.wmnet with reason: REIMAGE |
[production] |
11:03 |
<aborrero@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cloudgw1002.eqiad.wmnet with reason: REIMAGE |
[production] |
11:01 |
<aborrero@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cloudgw1001.eqiad.wmnet with reason: REIMAGE |
[production] |
10:44 |
<jbond42> |
updated the check-raid nrpe script to python3 |
[production] |
09:40 |
<moritzm> |
restarting Tomcat on idp-test1001 to pick up Java security updates |
[production] |
09:21 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1098:3316 (re)pooling @ 100%: Repool db1098:3316', diff saved to https://phabricator.wikimedia.org/P15618 and previous config saved to /var/cache/conftool/dbconfig/20210428-092103-root.json |
[production] |
09:19 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host contint1001.wikimedia.org |
[production] |
09:12 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host contint1001.wikimedia.org |
[production] |
09:09 |
<moritzm> |
restarting jenkins* on releases to pick up Java security updates |
[production] |
09:08 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host contint2001.wikimedia.org |
[production] |
09:06 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1098:3316 (re)pooling @ 75%: Repool db1098:3316', diff saved to https://phabricator.wikimedia.org/P15617 and previous config saved to /var/cache/conftool/dbconfig/20210428-090559-root.json |
[production] |
08:59 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host contint2001.wikimedia.org |
[production] |
08:50 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1098:3316 (re)pooling @ 50%: Repool db1098:3316', diff saved to https://phabricator.wikimedia.org/P15616 and previous config saved to /var/cache/conftool/dbconfig/20210428-085056-root.json |
[production] |
08:42 |
<urbanecm@deploy1002> |
Synchronized wmf-config/InterwikiSortOrders.php: 96ad0d4ad294c442b4936a63ae1cd9de9c098aa9: Add alt, bcl, diq, mad, mni, mnw, nia, skr, tay and trv to InterwikiSortOrders (duration: 01m 08s) |
[production] |
08:41 |
<urbanecm@deploy1002> |
sync-file aborted: 96ad0d4ad294c442b4936a63ae1cd9de9c098aa9: Add alt, bcl, diq, mad, mni, mnw, nia, skr, tay and trv to InterwikiSortOrders (duration: 00m 02s) |
[production] |
08:36 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Fully repool db1098:3317', diff saved to https://phabricator.wikimedia.org/P15615 and previous config saved to /var/cache/conftool/dbconfig/20210428-083625-marostegui.json |
[production] |
08:35 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1098:3316 (re)pooling @ 25%: Repool db1098:3316', diff saved to https://phabricator.wikimedia.org/P15614 and previous config saved to /var/cache/conftool/dbconfig/20210428-083552-root.json |
[production] |
08:34 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1098:3317 (re)pooling @ 25%: Repool db1098:3316', diff saved to https://phabricator.wikimedia.org/P15613 and previous config saved to /var/cache/conftool/dbconfig/20210428-083458-root.json |
[production] |
08:26 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1098:3317 (re)pooling @ 100%: Repool db1098:3317', diff saved to https://phabricator.wikimedia.org/P15612 and previous config saved to /var/cache/conftool/dbconfig/20210428-082625-root.json |
[production] |
08:25 |
<effie> |
update php7.2 on jobrunners and parsoid servers && rolling php7.2-fpm restarts |
[production] |
08:11 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1098:3317 (re)pooling @ 75%: Repool db1098:3317', diff saved to https://phabricator.wikimedia.org/P15611 and previous config saved to /var/cache/conftool/dbconfig/20210428-081121-root.json |
[production] |
07:56 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1098:3317 (re)pooling @ 50%: Repool db1098:3317', diff saved to https://phabricator.wikimedia.org/P15610 and previous config saved to /var/cache/conftool/dbconfig/20210428-075618-root.json |
[production] |
07:52 |
<effie> |
update php7.2 on api servers && rolling php7.2-fpm restarts |
[production] |
07:41 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1098:3317 (re)pooling @ 25%: Repool db1098:3317', diff saved to https://phabricator.wikimedia.org/P15609 and previous config saved to /var/cache/conftool/dbconfig/20210428-074114-root.json |
[production] |
07:40 |
<marostegui> |
Deploy schema change on db1098:3316 and db1098:3316 T266486 T268392 T273360 |
[production] |
07:27 |
<effie> |
update php7.2 on appservers && rolling php7.2-fpm restarts |
[production] |
07:26 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1098 for schema change and kernel upgrade', diff saved to https://phabricator.wikimedia.org/P15608 and previous config saved to /var/cache/conftool/dbconfig/20210428-072609-marostegui.json |
[production] |
07:19 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
07:14 |
<elukey@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
07:12 |
<elukey> |
add AAAA record for kafka-main200[3,4,5].codfw.wmnet |
[production] |
07:10 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
07:05 |
<elukey@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
07:04 |
<elukey> |
add AAAA record for kafka-main2002.codfw.wmnet |
[production] |