2021-06-10
ยง
|
17:24 |
<razzi> |
sudo systemctl restart hadoop-hdfs-namenode on an-master1002 |
[analytics] |
17:24 |
<razzi> |
sudo systemctl restart hadoop-hdfs-zkfc on an-master1002 |
[analytics] |
17:12 |
<razzi> |
sudo -u hdfs /usr/bin/hdfs haadmin -failover an-master1002-eqiad-wmnet an-master1001-eqiad-wmnet |
[analytics] |
17:11 |
<moritzm> |
updating bullseye installer image to latest daily image (kernel ABI changed again) T275873 |
[production] |
17:09 |
<jgiannelos@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'mobileapps' for release 'production' . |
[production] |
17:06 |
<jgiannelos@deploy1002> |
helmfile [codfw] Ran 'sync' command on namespace 'mobileapps' for release 'production' . |
[production] |
17:03 |
<jgiannelos@deploy1002> |
helmfile [staging] Ran 'sync' command on namespace 'mobileapps' for release 'staging' . |
[production] |
16:53 |
<razzi@cumin1001> |
END (FAIL) - Cookbook sre.hadoop.roll-restart-masters (exit_code=99) |
[production] |
16:51 |
<moritzm> |
installing rails security updates |
[production] |
16:37 |
<krinkle@deploy1002> |
Synchronized wmf-config/CommonSettings.php: no-op for Beta I2a42c222003 (duration: 01m 07s) |
[production] |
16:34 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
16:29 |
<pt1979@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
16:25 |
<razzi> |
rolling restart hadoop masters to pick up https://gerrit.wikimedia.org/r/c/operations/puppet/+/698194 |
[analytics] |
16:24 |
<razzi@cumin1001> |
START - Cookbook sre.hadoop.roll-restart-masters |
[production] |
16:11 |
<dcaro> |
cleaning up the nova-fullstack VMs after merging puppet fix https://gerrit.wikimedia.org/r/c/operations/puppet/+/699237 |
[admin-monitoring] |
15:09 |
<papaul> |
power down ms-be2038 for BBU replacement |
[production] |
15:08 |
<dcaro> |
stopping the nova-fullstack service to allow troubleshooting the current VMs |
[admin-monitoring] |
14:07 |
<ottomata> |
altered event.wmdebannerevent event.eventRate field to change type from BIGINT to DOUBLE - T282562 |
[analytics] |
12:50 |
<majavah> |
deploying https://phabricator.wikimedia.org/R2080:072ca798be2c8e95dfe6b86d8b86e28872155e67 |
[tools.sge-jobs] |
12:32 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1098:3316 (re)pooling @ 100%: Repool db1098:3316', diff saved to https://phabricator.wikimedia.org/P16417 and previous config saved to /var/cache/conftool/dbconfig/20210610-123201-root.json |
[production] |
12:16 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1098:3316 (re)pooling @ 75%: Repool db1098:3316', diff saved to https://phabricator.wikimedia.org/P16416 and previous config saved to /var/cache/conftool/dbconfig/20210610-121657-root.json |
[production] |
12:01 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1098:3316 (re)pooling @ 60%: Repool db1098:3316', diff saved to https://phabricator.wikimedia.org/P16415 and previous config saved to /var/cache/conftool/dbconfig/20210610-120153-root.json |
[production] |
11:46 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1098:3316 (re)pooling @ 50%: Repool db1098:3316', diff saved to https://phabricator.wikimedia.org/P16414 and previous config saved to /var/cache/conftool/dbconfig/20210610-114650-root.json |
[production] |
11:31 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1098:3316 (re)pooling @ 40%: Repool db1098:3316', diff saved to https://phabricator.wikimedia.org/P16413 and previous config saved to /var/cache/conftool/dbconfig/20210610-113146-root.json |
[production] |
11:16 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1098:3316 (re)pooling @ 30%: Repool db1098:3316', diff saved to https://phabricator.wikimedia.org/P16412 and previous config saved to /var/cache/conftool/dbconfig/20210610-111643-root.json |
[production] |
11:01 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1098:3316 (re)pooling @ 20%: Repool db1098:3316', diff saved to https://phabricator.wikimedia.org/P16411 and previous config saved to /var/cache/conftool/dbconfig/20210610-110139-root.json |
[production] |
11:00 |
<jbond@deploy1002> |
Finished deploy [netbox/deploy@e9f2382]: deploy v2.10.4-wmf4 to netbox-next (duration: 00m 53s) |
[production] |
10:59 |
<jbond@deploy1002> |
Started deploy [netbox/deploy@e9f2382]: deploy v2.10.4-wmf4 to netbox-next |
[production] |
10:58 |
<wm-bot> |
Finished rebooting the nodes ['cloudcephmon2002-dev', 'cloudcephmon2003-dev', 'cloudcephmon2004-dev'] (T281248) - cookbook ran by dcaro@vulcanus |
[admin] |
10:58 |
<wm-bot> |
Finished rebooting node cloudcephmon2004-dev.codfw.wmnet (T281248) - cookbook ran by dcaro@vulcanus |
[admin] |
10:55 |
<wm-bot> |
Rebooting node cloudcephmon2004-dev.codfw.wmnet (T281248) - cookbook ran by dcaro@vulcanus |
[admin] |
10:55 |
<wm-bot> |
Finished rebooting node cloudcephmon2003-dev.codfw.wmnet (T281248) - cookbook ran by dcaro@vulcanus |
[admin] |
10:52 |
<wm-bot> |
Rebooting node cloudcephmon2003-dev.codfw.wmnet (T281248) - cookbook ran by dcaro@vulcanus |
[admin] |
10:52 |
<wm-bot> |
Finished rebooting node cloudcephmon2002-dev.codfw.wmnet (T281248) - cookbook ran by dcaro@vulcanus |
[admin] |
10:49 |
<wm-bot> |
Rebooting node cloudcephmon2002-dev.codfw.wmnet (T281248) - cookbook ran by dcaro@vulcanus |
[admin] |
10:49 |
<wm-bot> |
Rebooting the nodes cloudcephmon2002-dev,cloudcephmon2003-dev,cloudcephmon2004-dev (T281248) - cookbook ran by dcaro@vulcanus |
[admin] |
10:48 |
<wm-bot> |
Finished rebooting the nodes ['cloudcephosd2001-dev', 'cloudcephosd2002-dev', 'cloudcephosd2003-dev'] (T281248) - cookbook ran by dcaro@vulcanus |
[admin] |
10:48 |
<wm-bot> |
Finished rebooting node cloudcephosd2003-dev.codfw.wmnet (T281248) - cookbook ran by dcaro@vulcanus |
[admin] |
10:47 |
<topranks> |
T283163: Adding "metric-out minimum-igp" to BGP group Confed_eqord on eqiad, codfw and eqdfw CRs. |
[production] |
10:46 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1098:3316 (re)pooling @ 10%: Repool db1098:3316', diff saved to https://phabricator.wikimedia.org/P16410 and previous config saved to /var/cache/conftool/dbconfig/20210610-104635-root.json |
[production] |
10:45 |
<wm-bot> |
Rebooting node cloudcephosd2003-dev.codfw.wmnet (T281248) - cookbook ran by dcaro@vulcanus |
[admin] |
10:45 |
<wm-bot> |
Finished rebooting node cloudcephosd2002-dev.codfw.wmnet (T281248) - cookbook ran by dcaro@vulcanus |
[admin] |
10:43 |
<urbanecm@deploy1002> |
Synchronized php-1.37.0-wmf.9/extensions/WikiEditor/modules/jquery.wikiEditor.js: 8a17c43c5470b84ba58239bb2cf947dbebf1979f: Fix call to renamed var (T284716) (duration: 01m 25s) |
[production] |
10:42 |
<wm-bot> |
Rebooting node cloudcephosd2002-dev.codfw.wmnet (T281248) - cookbook ran by dcaro@vulcanus |
[admin] |
10:42 |
<wm-bot> |
Finished rebooting node cloudcephosd2001-dev.codfw.wmnet (T281248) - cookbook ran by dcaro@vulcanus |
[admin] |
10:39 |
<wm-bot> |
Rebooting node cloudcephosd2001-dev.codfw.wmnet (T281248) - cookbook ran by dcaro@vulcanus |
[admin] |
10:39 |
<wm-bot> |
Rebooting the nodes cloudcephosd2001-dev,cloudcephosd2002-dev,cloudcephosd2003-dev (T281248) - cookbook ran by dcaro@vulcanus |
[admin] |
10:31 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1098:3316 (re)pooling @ 5%: Repool db1098:3316', diff saved to https://phabricator.wikimedia.org/P16409 and previous config saved to /var/cache/conftool/dbconfig/20210610-103132-root.json |
[production] |
10:30 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repool db1113:3316', diff saved to https://phabricator.wikimedia.org/P16408 and previous config saved to /var/cache/conftool/dbconfig/20210610-103032-marostegui.json |
[production] |
10:29 |
<mvolz@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'citoid' for release 'production' . |
[production] |