2020-08-05
ยง
|
15:11 |
<pt1979@cumin2001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
15:08 |
<godog> |
bounce logstash on logstash100[789] - udp loss reported |
[production] |
15:05 |
<pt1979@cumin2001> |
START - Cookbook sre.dns.netbox |
[production] |
14:48 |
<elukey> |
reboot stat1008 for unexpected maintenance (GPU stuck) |
[production] |
14:33 |
<otto@deploy1001> |
helmfile [eqiad] Ran 'sync' command on namespace 'eventgate-main' for release 'canary' . |
[production] |
14:32 |
<otto@deploy1001> |
helmfile [eqiad] Ran 'sync' command on namespace 'eventgate-main' for release 'production' . |
[production] |
14:27 |
<otto@deploy1001> |
helmfile [codfw] Ran 'sync' command on namespace 'eventgate-main' for release 'canary' . |
[production] |
14:27 |
<otto@deploy1001> |
helmfile [codfw] Ran 'sync' command on namespace 'eventgate-main' for release 'production' . |
[production] |
14:25 |
<moritzm> |
installing nmap bugfix updates from buster point release |
[production] |
14:24 |
<otto@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'eventgate-main' for release 'production' . |
[production] |
14:24 |
<otto@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'eventgate-main' for release 'canary' . |
[production] |
14:20 |
<sukhe@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
14:20 |
<sukhe@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
14:14 |
<moritzm> |
installing pillow security updates |
[production] |
14:03 |
<moritzm> |
installing node-minimist security updates |
[production] |
13:51 |
<moritzm> |
installing Linux update to 4.9.132 from buster point update (no reboots, just the package updates) |
[production] |
13:32 |
<jayme> |
updated helmfile to 0.125.2-0 and helm-diff to 3.1.2-1 on contint* and deploy* |
[production] |
13:28 |
<volans@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
13:24 |
<volans@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
13:04 |
<elukey> |
restart yarn resource managers on an-master100[12] to pick up new Yarn settings - https://gerrit.wikimedia.org/r/c/operations/puppet/+/618529 |
[production] |
13:00 |
<moritzm> |
installing libjpeg-turbo security updates on stretch |
[production] |
12:52 |
<XioNoX> |
netmon1002:/srv/deployment/librenms/librenms$ sudo -u librenms ./lnms migrate |
[production] |
12:49 |
<jayme> |
imported helm-diff_3.1.2-1 to buster-wikimedia, jessie-wikimedia and stretch-wikimedia |
[production] |
12:46 |
<moritzm> |
installing imagemagick security updates on buster |
[production] |
12:33 |
<moritzm> |
installing net-snmp security updates on icinga hosts |
[production] |
11:36 |
<awight> |
EU Bacon reclosed |
[production] |
11:36 |
<awight@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:614891|Switch test wikis to new version of vector by default (3/3) (T254227)]] (duration: 01m 07s) |
[production] |
11:29 |
<awight> |
EU Bacon reopened |
[production] |
11:28 |
<awight> |
EU Bacon complete |
[production] |
11:26 |
<awight@deploy1001> |
Synchronized wmf-config: Config: [[gerrit:618481|FileImporter: full default deployment (T232542)]] (duration: 01m 04s) |
[production] |
11:23 |
<jayme> |
imported helm-diff_3.1.2-0 to jessie-wikimedia and stretch-wikimedia |
[production] |
11:22 |
<jayme> |
imported helm-diff_3.1.2-0 to buster-wikimedia |
[production] |
11:19 |
<awight@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:618303|Add import sources for lijwikisource (T259633)]] (duration: 01m 07s) |
[production] |
11:13 |
<awight@deploy1001> |
sync-file aborted: Config: [[gerrit:618303|Add import sources for lijwikisource (T259633)]] (duration: 00m 13s) |
[production] |
11:10 |
<lucaswerkmeister-wmde@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:595542|Enable Data Bridge on Test Wikidata clients (T232584)]] (duration: 01m 20s) |
[production] |
10:39 |
<XioNoX> |
reboot cr3-ulsfo - T259621 |
[production] |
10:28 |
<XioNoX> |
drain traffic away cr3-ulsfo - T259621 |
[production] |
10:21 |
<moritzm> |
installing libssh security updates |
[production] |
10:18 |
<XioNoX> |
reboot cr4-ulsfo - T259621 |
[production] |
09:58 |
<XioNoX> |
drain traffic away cr4-ulsfo |
[production] |
09:53 |
<XioNoX> |
depool ulsfo - T259621 |
[production] |
09:32 |
<elukey> |
set ticket max renewable lifetime to 7d on all kerberos clients (was zero, the default) |
[production] |
09:07 |
<jayme> |
imported helmfile_0.125.2-0 to jessie-wikimedia |
[production] |
09:07 |
<jayme> |
imported helmfile_0.125.2-0 to stretch-wikimedia |
[production] |
09:05 |
<jayme> |
imported helmfile_0.125.2-0 to buster-wikimedia |
[production] |
08:39 |
<marostegui> |
Remove revision triggers on db1125:3317 |
[production] |
08:39 |
<marostegui> |
Stop replication on db1079 for MCR, this will generate lag on s7 on labsdb |
[production] |
08:39 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1079 for MCR', diff saved to https://phabricator.wikimedia.org/P12173 and previous config saved to /var/cache/conftool/dbconfig/20200805-083916-marostegui.json |
[production] |
08:38 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Fully repool db1094', diff saved to https://phabricator.wikimedia.org/P12172 and previous config saved to /var/cache/conftool/dbconfig/20200805-083833-marostegui.json |
[production] |
08:29 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Slowly repool db1094', diff saved to https://phabricator.wikimedia.org/P12171 and previous config saved to /var/cache/conftool/dbconfig/20200805-082908-marostegui.json |
[production] |