2021-03-10
ยง
|
18:16 |
<mforns> |
starting deployment of refinery (session length oozie job) |
[analytics] |
18:05 |
<Majavah> |
create deployment-ircd02 for T277081 |
[releng] |
17:48 |
<mutante> |
new Wikimedia project language "trv" added - Seediq is an Atayalic language spoken in the mountains of Northern Taiwan by the Seediq and Taroko people. |
[production] |
17:45 |
<pt1979@cumin2001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on kafka-logging2003.codfw.wmnet with reason: REIMAGE |
[production] |
17:42 |
<pt1979@cumin2001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on kafka-logging2003.codfw.wmnet with reason: REIMAGE |
[production] |
17:26 |
<marxarelli> |
`rm -rf /srv/dump` on deployment-db06 and reenabling puppet |
[releng] |
17:25 |
<marxarelli> |
`rm -rf /srv/restore` on deployment-db08 and reenabling puppet |
[releng] |
17:24 |
<marxarelli> |
`rm -rf /srv/backup /srv/restore` on deployment-db07 and reenabling puppet |
[releng] |
17:19 |
<pt1979@cumin2001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on kafka-logging2002.codfw.wmnet with reason: REIMAGE |
[production] |
17:17 |
<pt1979@cumin2001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on kafka-logging2002.codfw.wmnet with reason: REIMAGE |
[production] |
17:09 |
<Majavah> |
set beta cluster mediawiki as read write on mw config (T276968) |
[releng] |
17:03 |
<Majavah> |
make deployment-db06 read-write T276968 |
[releng] |
16:56 |
<aborrero@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudvirt1030.eqiad.wmnet |
[production] |
16:54 |
<razzi> |
rebalance kafka partitions for webrequest_upload partition 15 |
[analytics] |
16:52 |
<pt1979@cumin2001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on kafka-logging2001.codfw.wmnet with reason: REIMAGE |
[production] |
16:51 |
<arturo> |
rebooting cloudvirt1030 for T275753 |
[admin] |
16:50 |
<aborrero@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host cloudvirt1030.eqiad.wmnet |
[production] |
16:50 |
<pt1979@cumin2001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on kafka-logging2001.codfw.wmnet with reason: REIMAGE |
[production] |
16:50 |
<Majavah> |
`reset slave;` on new master deployment-db06 T276968 |
[releng] |
16:49 |
<Majavah> |
add deployment-db07 as a replica of db06 for T276968 |
[releng] |
16:48 |
<arturo> |
briefly stopping VM content-similarity-prototype to migrate hypervisor |
[wmf-research-tools] |
16:48 |
<arturo> |
briefly stopping VM toolsbeta-test-k8s-etcd-8 to migrate hypervisor |
[toolsbeta] |
16:48 |
<arturo> |
briefly stopping VM toolhub-beta01 to migrate hypervisor |
[toolhub] |
16:48 |
<arturo> |
briefly stopping VM maps-beta-1 to migrate hypervisor |
[entity-detection] |
16:47 |
<robh@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kafka-logging1001.eqiad.wmnet with reason: REIMAGE |
[production] |
16:45 |
<Urbanecm> |
root@deployment-db07:/opt/wmf-mariadb104/bin# ./mysql_upgrade -h 127.0.0.1 # T276968 |
[releng] |
16:45 |
<robh@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on kafka-logging1001.eqiad.wmnet with reason: REIMAGE |
[production] |
16:20 |
<robh@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kafka-logging1001.eqiad.wmnet with reason: REIMAGE |
[production] |
16:18 |
<robh@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on kafka-logging1001.eqiad.wmnet with reason: REIMAGE |
[production] |
16:12 |
<Majavah> |
deployment-db08 CHANGE MASTER to MASTER_USER='repl', MASTER_PASSWORD='redacted', MASTER_PORT=3306, MASTER_HOST='deployment-db06.deployment-prep.eqiad1.wikimedia.cloud', MASTER_LOG_FILE='deployment-db06-bin.000059', MASTER_LOG_POS=522469730; (T276968) |
[releng] |
16:06 |
<Urbanecm> |
start root@deployment-db07:/srv/sqldata.db06# rsync --progress -r deployment-db06:/srv/sqldata/ . (T276968) |
[releng] |
15:57 |
<Majavah> |
set deployment-db06 as readonly from mysql side T276968 |
[releng] |
15:54 |
<Urbanecm> |
Start `root@deployment-db08:/opt/wmf-mariadb104/bin# ./mysql_upgrade -h 127.0.0.1` (T276968) |
[releng] |
15:54 |
<Urbanecm> |
Start mariadb on db08 (T276968) |
[releng] |
15:33 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1127 (re)pooling @ 100%: Repool db1127 after schema change', diff saved to https://phabricator.wikimedia.org/P14744 and previous config saved to /var/cache/conftool/dbconfig/20210310-153324-root.json |
[production] |
15:22 |
<Urbanecm> |
rsync deployment-db06:/srv/sqldata to deployment-db08:/srv/sqldata in a tmux session on deploymdeployment-db08 (T276968) |
[releng] |
15:22 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host sodium.wikimedia.org |
[production] |
15:18 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1127 (re)pooling @ 60%: Repool db1127 after schema change', diff saved to https://phabricator.wikimedia.org/P14743 and previous config saved to /var/cache/conftool/dbconfig/20210310-151820-root.json |
[production] |
15:16 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host sodium.wikimedia.org |
[production] |
15:03 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1127 (re)pooling @ 30%: Repool db1127 after schema change', diff saved to https://phabricator.wikimedia.org/P14742 and previous config saved to /var/cache/conftool/dbconfig/20210310-150316-root.json |
[production] |
14:53 |
<klausman@puppetmaster1001> |
conftool action : set/pooled=yes:weight=1; selector: cluster=ml_serve,service=kubemaster |
[production] |
14:52 |
<Majavah> |
delete deployment-db08 /srv/sqldata to attempt procedure in https://phabricator.wikimedia.org/T276968#6900199 |
[releng] |
14:52 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2061.codfw.wmnet |
[production] |
14:48 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1127 (re)pooling @ 10%: Repool db1127 after schema change', diff saved to https://phabricator.wikimedia.org/P14741 and previous config saved to /var/cache/conftool/dbconfig/20210310-144813-root.json |
[production] |
14:44 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host ms-be2061.codfw.wmnet |
[production] |
14:43 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2060.codfw.wmnet |
[production] |
14:35 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1127', diff saved to https://phabricator.wikimedia.org/P14740 and previous config saved to /var/cache/conftool/dbconfig/20210310-143547-marostegui.json |
[production] |
14:35 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host ms-be2060.codfw.wmnet |
[production] |
14:34 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2059.codfw.wmnet |
[production] |
14:26 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host ms-be2059.codfw.wmnet |
[production] |