2020-08-06
§
|
04:37 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Slowly repool db1079', diff saved to https://phabricator.wikimedia.org/P12179 and previous config saved to /var/cache/conftool/dbconfig/20200806-043758-marostegui.json |
[production] |
03:04 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=wtp2019.codfw.wmnet |
[production] |
02:24 |
<eileen> |
process-control config revision is 525eb71235 turn off delete deleted contacts |
[production] |
01:52 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
01:52 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
01:19 |
<dzahn@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
01:19 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
01:17 |
<dzahn@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
01:17 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
00:35 |
<mutante> |
wtp2019 - reimaging - parsoid service does not work, unlike on all other wtp*, making sure it's clean |
[production] |
00:00 |
<mutante> |
LDAP - removed demon from nda group |
[production] |
2020-08-05
§
|
23:57 |
<eileen> |
civicrm revision changed from 150c3476c4 to 72452e28a9, config revision is b6ece03513 |
[production] |
23:02 |
<shdubsh> |
logstash in codfw looks stuck -- restarting |
[production] |
19:41 |
<brennen@deploy1001> |
rebuilt and synchronized wikiversions files: Revert group1 wikis to 1.36.0-wmf.2 |
[production] |
19:39 |
<pt1979@cumin2001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
19:37 |
<pt1979@cumin2001> |
START - Cookbook sre.hosts.downtime |
[production] |
19:13 |
<brennen@deploy1001> |
Synchronized php: group1 wikis to 1.36.0-wmf.3 (duration: 01m 44s) |
[production] |
19:11 |
<brennen@deploy1001> |
rebuilt and synchronized wikiversions files: group1 wikis to 1.36.0-wmf.3 |
[production] |
18:26 |
<Lucas_WMDE> |
Morning backport window done |
[production] |
18:25 |
<lucaswerkmeister-wmde@deploy1001> |
Synchronized php-1.36.0-wmf.3/extensions/ContentTranslation/: Backport: [[gerrit:618566|Pass jQuery objects into jqueryMsg]] (duration: 01m 11s) |
[production] |
18:14 |
<mutante> |
test !log |
[production] |
18:10 |
<lucaswerkmeister-wmde@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:618343|Re-enable growth study quick survey (T257015)]] (duration: 01m 12s) |
[production] |
17:30 |
<shdubsh> |
test prometheus-icinga-exporter upgrade on icinga2001 |
[production] |
16:50 |
<elukey> |
powercycle stat1005 after GPU issue |
[production] |
15:56 |
<otto@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: EventStreamConfig - Add eventgate-logging-external streams and destination_event_service settings - T251935 (duration: 01m 05s) |
[production] |
15:50 |
<hnowlan@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'api-gateway' for release 'staging' . |
[production] |
15:43 |
<hnowlan@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'api-gateway' for release 'staging' . |
[production] |
15:11 |
<pt1979@cumin2001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
15:08 |
<godog> |
bounce logstash on logstash100[789] - udp loss reported |
[production] |
15:05 |
<pt1979@cumin2001> |
START - Cookbook sre.dns.netbox |
[production] |
14:48 |
<elukey> |
reboot stat1008 for unexpected maintenance (GPU stuck) |
[production] |
14:33 |
<otto@deploy1001> |
helmfile [eqiad] Ran 'sync' command on namespace 'eventgate-main' for release 'canary' . |
[production] |
14:32 |
<otto@deploy1001> |
helmfile [eqiad] Ran 'sync' command on namespace 'eventgate-main' for release 'production' . |
[production] |
14:27 |
<otto@deploy1001> |
helmfile [codfw] Ran 'sync' command on namespace 'eventgate-main' for release 'canary' . |
[production] |
14:27 |
<otto@deploy1001> |
helmfile [codfw] Ran 'sync' command on namespace 'eventgate-main' for release 'production' . |
[production] |
14:25 |
<moritzm> |
installing nmap bugfix updates from buster point release |
[production] |
14:24 |
<otto@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'eventgate-main' for release 'production' . |
[production] |
14:24 |
<otto@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'eventgate-main' for release 'canary' . |
[production] |
14:20 |
<sukhe@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
14:20 |
<sukhe@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
14:14 |
<moritzm> |
installing pillow security updates |
[production] |
14:03 |
<moritzm> |
installing node-minimist security updates |
[production] |
13:51 |
<moritzm> |
installing Linux update to 4.9.132 from buster point update (no reboots, just the package updates) |
[production] |
13:32 |
<jayme> |
updated helmfile to 0.125.2-0 and helm-diff to 3.1.2-1 on contint* and deploy* |
[production] |
13:28 |
<volans@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
13:24 |
<volans@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
13:04 |
<elukey> |
restart yarn resource managers on an-master100[12] to pick up new Yarn settings - https://gerrit.wikimedia.org/r/c/operations/puppet/+/618529 |
[production] |
13:00 |
<moritzm> |
installing libjpeg-turbo security updates on stretch |
[production] |
12:52 |
<XioNoX> |
netmon1002:/srv/deployment/librenms/librenms$ sudo -u librenms ./lnms migrate |
[production] |
12:49 |
<jayme> |
imported helm-diff_3.1.2-1 to buster-wikimedia, jessie-wikimedia and stretch-wikimedia |
[production] |