2019-10-23
ยง
|
11:16 |
<urbanecm@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: 0889da0: Add custom Minerva wordmark for Hebrew wikivoyage (2/2; T234278) (duration: 01m 01s) |
[production] |
11:10 |
<arturo> |
re-create VM `toolsbeta-test-k8s-haproxy-2` to test https://gerrit.wikimedia.org/r/545532 (T236074) |
[toolsbeta] |
11:09 |
<urbanecm@deploy1001> |
Synchronized static/images/mobile/copyright: SWAT: 0889da0: Add custom Minerva wordmark for Hebrew wikivoyage (1/2; T234278) (duration: 01m 01s) |
[production] |
11:06 |
<urbanecm@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: cf8e2f1: Set $wgArticleCountMethod to any for frwikiquote (T236212) (duration: 01m 12s) |
[production] |
10:46 |
<ema> |
cp-ats: rolling ATS backend restart to apply https://gerrit.wikimedia.org/r/#/c/operations/puppet/+/545522/ T233274 |
[production] |
10:13 |
<jynus> |
reverting dbtree revision to HEAD~1 T224589 |
[production] |
10:11 |
<jynus> |
deploying new version of dbtree T224589 |
[production] |
10:04 |
<ema> |
cp1075: ats-backend-restart to test https://gerrit.wikimedia.org/r/#/c/operations/puppet/+/545508/ |
[production] |
09:54 |
<arturo> |
manually restart mariadb in cloudinfra-db02 to fix replication |
[cloudinfra] |
09:42 |
<godog> |
bounce burrow-logging-eqiad.service on kafkamon1001 |
[production] |
09:41 |
<arturo> |
the cloudinfra-db01 VM was rebooted bc the hypervisor rebooted |
[cloudinfra] |
09:41 |
<arturo> |
manually start mariadb in cloudinfra-db01 |
[cloudinfra] |
09:40 |
<godog> |
roll restart logstash to pick up new rsyslog-notice partitions |
[production] |
09:31 |
<godog> |
bump rsyslog-notice topic to 6 partitions |
[production] |
09:23 |
<arturo> |
cloudvirt1026 reboot ended OK |
[admin] |
09:13 |
<arturo> |
9 tools-sgeexec nodes and 6 other related VMs are down because hypervisor is rebooting |
[tools] |
09:12 |
<arturo> |
rebooting cloudvirt1026 for kernel upgrade |
[admin] |
09:12 |
<hashar> |
Pooling back integration-agent-docker-1006 , went offline for unknown reason? |
[releng] |
09:09 |
<arturo> |
cloudvirt1025 reboot ended OK |
[admin] |
09:09 |
<hashar> |
CI Jenkins: downgrading AnsiColor plugin from 0.6.2 to 0.5.3 # T236222 |
[releng] |
09:03 |
<arturo> |
tools-sgebastion-08 is down because hypervisor is rebooting |
[tools] |
09:03 |
<arturo> |
paws-master-01/03 and a couple of other servers are down because hypervisor is rebooting |
[paws] |
09:00 |
<moritzm> |
rebooting logstash2021 for some firmware tests |
[production] |
09:00 |
<arturo> |
rebooting cloudvirt1025 for kernel upgrade |
[admin] |
08:59 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
08:59 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.downtime |
[production] |
08:54 |
<moritzm> |
installing systemd bugfix update on mw canaries |
[production] |
08:51 |
<arturo> |
icinga downtime cloudvirt1025/1026 for reboots |
[admin] |
08:50 |
<aborrero@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
08:50 |
<aborrero@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
08:42 |
<godog> |
roll restart rsyslog on cirrus and wqds hosts to pick up changes to logback topic partitions |
[production] |
08:33 |
<hashar> |
contint1001: backing up Zuul logs to /var/log/zuul/backup-T236114 so we get gerrit activities traces for T236114 |
[releng] |
08:28 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repool db2091:3312 after table compression', diff saved to https://phabricator.wikimedia.org/P9452 and previous config saved to /var/cache/conftool/dbconfig/20191023-082826-marostegui.json |
[production] |
08:23 |
<godog> |
roll restart logstash in codfw/eqiad to pick up new kafka partitions |
[production] |
08:22 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Change weights to x100 on s8 eqiad - T231018', diff saved to https://phabricator.wikimedia.org/P9451 and previous config saved to /var/cache/conftool/dbconfig/20191023-082246-marostegui.json |
[production] |
08:11 |
<godog> |
kafka-logging eqiad set 12 partitions for ^mwlog- ^logback- and eqiad.client.error topics |
[production] |
08:09 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Change weights to x100 on s8 codfw - T231018', diff saved to https://phabricator.wikimedia.org/P9450 and previous config saved to /var/cache/conftool/dbconfig/20191023-080857-marostegui.json |
[production] |
07:55 |
<godog> |
kafka-logging delete unused topic syslog-notice |
[production] |
07:51 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Change weights to x100 on s7 eqiad - T231018', diff saved to https://phabricator.wikimedia.org/P9449 and previous config saved to /var/cache/conftool/dbconfig/20191023-075106-marostegui.json |
[production] |
07:48 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Change weights to x100 on s7 codfw - T231018', diff saved to https://phabricator.wikimedia.org/P9448 and previous config saved to /var/cache/conftool/dbconfig/20191023-074828-marostegui.json |
[production] |
07:46 |
<XioNoX> |
powering down cr2-esams for relocation (for real this time) |
[production] |
07:38 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Change weights to x100 on s6 eqiad - T231018', diff saved to https://phabricator.wikimedia.org/P9447 and previous config saved to /var/cache/conftool/dbconfig/20191023-073831-marostegui.json |
[production] |
07:35 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Change weights to x100 on s6 codfw - T231018', diff saved to https://phabricator.wikimedia.org/P9446 and previous config saved to /var/cache/conftool/dbconfig/20191023-073556-marostegui.json |
[production] |
07:30 |
<XioNoX> |
powering down cr2-esams for relocation |
[production] |
07:28 |
<hashar> |
logstash: refreshing index fields for logstash-* indices (via https://logstash.wikimedia.org/app/kibana#/management/kibana/indices/logstash-* ) # T234564 |
[production] |
07:05 |
<XioNoX> |
redirect ns2 to eqiad - T235805 |
[production] |
07:04 |
<marostegui> |
Enable slow query log 1/10 on db1089 (enwiki) T223151 |
[production] |
07:02 |
<ayounsi@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
07:02 |
<ayounsi@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
06:59 |
<XioNoX> |
depool esams - T235805 |
[production] |