2020-07-09
ยง
|
12:58 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single |
[production] |
12:57 |
<akosiaris@deploy1001> |
helmfile [CODFW] Ran 'sync' command on namespace 'proton' for release 'production' . |
[production] |
12:57 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) |
[production] |
12:56 |
<akosiaris@deploy1001> |
helmfile [EQIAD] Ran 'sync' command on namespace 'proton' for release 'production' . |
[production] |
12:56 |
<akosiaris@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'proton' for release 'production' . |
[production] |
12:54 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single |
[production] |
12:54 |
<moritzm> |
rebooting install* servers for kernel security update |
[production] |
12:43 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) |
[production] |
12:40 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single |
[production] |
12:40 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) |
[production] |
12:38 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single |
[production] |
12:38 |
<moritzm> |
rebooting urldownloader1001/2001 for kernel update (failed over, these are now the inactive ones) |
[production] |
12:23 |
<jmm@cumin2001> |
END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=99) |
[production] |
12:22 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single |
[production] |
12:22 |
<moritzm> |
rebooting dbmonitor1001 / tendril.wikimedia.org for kernek update |
[production] |
12:11 |
<XioNoX> |
enable asw2-b-eqiad:ae3 (to cloudsw1-c8) - T251632 |
[production] |
11:56 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) |
[production] |
11:54 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single |
[production] |
11:52 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) |
[production] |
11:50 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single |
[production] |
11:50 |
<moritzm> |
rebooting debmonitor1001 for kernel update |
[production] |
11:42 |
<urbanecm@deploy1001> |
Synchronized php-1.35.0-wmf.40/extensions/Translate/tag/SpecialPageTranslation.php: 6541d3ff51f52fe8a1bdbfa86022f8d97d6c7680: DeprecatablePropertyArray: Use MW_VERSION instead of array_key_exists (T257531) (duration: 01m 05s) |
[production] |
11:28 |
<urbanecm@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: 3a7c1c33e58637437f819edf039008a00dc5be27: Rename namespace on kn.wikipedia.org (T255337) (duration: 01m 04s) |
[production] |
11:24 |
<urbanecm@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: 0a3c1f94a702b527842ed4f34d8bf41b26235e64: Add *.oireachtas.ie to the wgCopyUploadsDomains whitelist for commonswiki (T256543) (duration: 01m 04s) |
[production] |
11:19 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) |
[production] |
11:17 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single |
[production] |
11:10 |
<aborrero@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
11:10 |
<aborrero@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
11:09 |
<urbanecm@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: e6f442c6900524482806aeb1b5162e65bf7c97ac: Enable Quicksurveys for Desktop Improvements Project (T246977) (duration: 01m 06s) |
[production] |
11:01 |
<vgutierrez> |
restart ats-tls on cp1085 |
[production] |
10:55 |
<_joe_> |
restarting php7.2-fpm on mw1282, workers failing with sigill |
[production] |
10:54 |
<_joe_> |
depool mw1282 |
[production] |
10:54 |
<mvolz@deploy1001> |
helmfile [EQIAD] Ran 'sync' command on namespace 'citoid' for release 'production' . |
[production] |
10:34 |
<mvolz@deploy1001> |
helmfile [CODFW] Ran 'sync' command on namespace 'citoid' for release 'production' . |
[production] |
10:23 |
<_joe_> |
rolling restart the remaining restbases in eqiad, and all of codfw |
[production] |
10:22 |
<mvolz@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'citoid' for release 'staging' . |
[production] |
10:09 |
<_joe_> |
restarting restbase on rb1020-22 |
[production] |
09:53 |
<_joe_> |
restarting restbase on restbase1024,1023 |
[production] |
09:36 |
<_joe_> |
restarting restbase on rb1026,1027 to switch to proton on k8s |
[production] |
09:34 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
09:31 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
09:28 |
<_joe_> |
restarting restbase on restbase1025 to pick up the switch to k8s of proton |
[production] |
09:27 |
<godog> |
bounce thanos-compact on thanos-fe2001 |
[production] |
09:07 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hadoop.change-distro (exit_code=0) |
[production] |
08:52 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1079', diff saved to https://phabricator.wikimedia.org/P11828 and previous config saved to /var/cache/conftool/dbconfig/20200709-085228-marostegui.json |
[production] |
08:44 |
<marostegui> |
Stop haproxy on dbproxy1017 before upgrading to buster - T255408 |
[production] |
08:23 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repool db1136', diff saved to https://phabricator.wikimedia.org/P11827 and previous config saved to /var/cache/conftool/dbconfig/20200709-082355-marostegui.json |
[production] |
08:23 |
<moritzm> |
imported osm2pgsql 0.96.0+ds-1~bpo9+1 to "main" component T256877 |
[production] |
08:22 |
<elukey@cumin1001> |
START - Cookbook sre.hadoop.change-distro |
[production] |
08:20 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hadoop.stop-cluster (exit_code=0) |
[production] |