2020-07-01
ยง
|
11:07 |
<jayme@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'blubberoid' for release 'staging' . |
[production] |
11:02 |
<ema> |
restbase2009 depooled T256863 |
[production] |
11:02 |
<ema@puppetmaster1001> |
conftool action : set/pooled=no; selector: name=restbase2009.codfw.wmnet |
[production] |
11:01 |
<arturo> |
live-hacking puppetmaster with https://gerrit.wikimedia.org/r/c/operations/puppet/+/608849 (T256737) |
[tools] |
10:50 |
<ema> |
power on restbase2009 |
[production] |
10:45 |
<jayme> |
draining and docker restart (one at a time) kubernetes[1001-1004].eqiad.wmnet - T256786 |
[production] |
10:34 |
<ema> |
power-cycle restbase2009 |
[production] |
10:17 |
<XioNoX> |
renumber NTT transit links - T254877 |
[production] |
10:16 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
10:16 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
10:14 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
10:14 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
10:09 |
<jayme> |
draining and docker restart (one at a time) kubernetes[2001-2004].codfw.wmnet |
[production] |
10:02 |
<wm-bot> |
<lucaswerkmeister-wmde> deployed e0b49bc2a8 (toolforge.org) |
[tools.wdmm] |
10:02 |
<wm-bot> |
<lucaswerkmeister-wmde> deployed e69222c7b6 (service.template) |
[tools.wdmm] |
09:59 |
<wm-bot> |
<lucaswerkmeister-wmde> deployed 1abeb708cf |
[tools.wdmm] |
09:52 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
09:52 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
09:46 |
<jayme> |
cordoning kubernetes[2001-2004].codfw.wmnet,kubernetes[1001-1004].eqiad.wmnet - T256786 |
[production] |
09:42 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
09:42 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
09:34 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
09:34 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
09:23 |
<jayme> |
restarting dockerd on kubestage1002.eqiad.wmnet - T256786 |
[production] |
09:22 |
<RhinosF1> |
moved meetbot stuff to new host |
[tools.zppixbot] |
09:15 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
09:15 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
09:08 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
09:08 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
09:00 |
<RhinosF1> |
decom complete |
[tools.zppixbot-test] |
08:53 |
<jayme> |
draining kubernetes staging node kubestage1001.eqiad.wmnet - T256786 |
[production] |
08:52 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
08:52 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
08:51 |
<RhinosF1> |
decom |
[tools.zppixbot-test] |
08:47 |
<James_F> |
Zuul: Configure the REL1_35 test and gate pipelines T256377 |
[releng] |
08:44 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
08:44 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
08:38 |
<James_F> |
Zuul: Stop defining the REL1_33 pipelines at all T256087 |
[releng] |
08:37 |
<James_F> |
Zuul: Stop ascribing stuff to REL1_33 pipelines T256087 |
[releng] |
08:29 |
<XioNoX> |
disable BGP to nfacct in eqiad - T256790 |
[production] |
08:23 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
08:23 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
08:08 |
<jayme@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'mobileapps' for release 'staging' . |
[production] |
08:05 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
08:05 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
08:01 |
<vgutierrez> |
rolling restart of esams cache nodes to catch up on kernel upgrades |
[production] |
07:42 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
07:42 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
07:40 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
07:40 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |