2020-07-01
§
|
10:14 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
10:14 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
10:09 |
<jayme> |
draining and docker restart (one at a time) kubernetes[2001-2004].codfw.wmnet |
[production] |
09:52 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
09:52 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
09:46 |
<jayme> |
cordoning kubernetes[2001-2004].codfw.wmnet,kubernetes[1001-1004].eqiad.wmnet - T256786 |
[production] |
09:42 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
09:42 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
09:34 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
09:34 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
09:23 |
<jayme> |
restarting dockerd on kubestage1002.eqiad.wmnet - T256786 |
[production] |
09:15 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
09:15 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
09:08 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
09:08 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
08:53 |
<jayme> |
draining kubernetes staging node kubestage1001.eqiad.wmnet - T256786 |
[production] |
08:52 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
08:52 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
08:44 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
08:44 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
08:29 |
<XioNoX> |
disable BGP to nfacct in eqiad - T256790 |
[production] |
08:23 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
08:23 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
08:08 |
<jayme@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'mobileapps' for release 'staging' . |
[production] |
08:05 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
08:05 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
08:01 |
<vgutierrez> |
rolling restart of esams cache nodes to catch up on kernel upgrades |
[production] |
07:42 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
07:42 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
07:40 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
07:40 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
07:39 |
<ema> |
cp2041: restart purged, varnishkafka after librdkafka1 upgrade to 0.11.6-1.1wmf1 T256444 |
[production] |
05:47 |
<_joe_> |
restarting nfacctd on netflow1001, it's segfaulting |
[production] |
04:01 |
<krinkle@deploy1001> |
Synchronized php-1.35.0-wmf.39/maintenance/findBadBlobs.php: I47c11190b665 (duration: 01m 08s) |
[production] |
00:14 |
<krinkle@deploy1001> |
Synchronized private/PrivateSettings.php: T254795 - Set $wmgXhguiDBuser and $wmgXhguiDBpasswor (duration: 01m 06s) |
[production] |
2020-06-30
§
|
21:48 |
<crusnov@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) |
[production] |
21:46 |
<crusnov@cumin1001> |
START - Cookbook sre.hosts.reboot-single |
[production] |
21:45 |
<crusnov@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) |
[production] |
21:43 |
<crusnov@cumin1001> |
START - Cookbook sre.hosts.reboot-single |
[production] |
21:42 |
<crusnov@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) |
[production] |
21:40 |
<crusnov@cumin1001> |
START - Cookbook sre.hosts.reboot-single |
[production] |
21:40 |
<crusnov@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) |
[production] |
21:38 |
<crusnov@cumin1001> |
START - Cookbook sre.hosts.reboot-single |
[production] |
21:38 |
<crusnov@cumin1001> |
END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=99) |
[production] |
21:38 |
<crusnov@cumin1001> |
START - Cookbook sre.hosts.reboot-single |
[production] |
19:19 |
<hashar@deploy1001> |
rebuilt and synchronized wikiversions files: group 0 wikis to 1.35.0-wmf.39 # T254176 |
[production] |
18:31 |
<cdanis> |
T256790 ✔️ cdanis@netflow2001.codfw.wmnet ~ 🕝☕ sudo apt install valgrind |
[production] |
18:27 |
<tgr> |
Morning deploys done |
[production] |
18:23 |
<tgr@deploy1001> |
Synchronized php-1.35.0-wmf.39/extensions/ElectronPdfService/src/ElectronPdfServiceHooks.php: Backport: [[gerrit:608485|Hotfix: "Undefined index: print" (T256761)]] (duration: 01m 05s) |
[production] |
18:11 |
<shdubsh> |
restart varnishmtail,atsmtail,ncredirmtail on ncredir,cp hosts in codfw and eqsin |
[production] |