2022-09-07
ยง
|
13:52 |
<akosiaris@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on rdb1009.eqiad.wmnet with reason: host reimage |
[production] |
13:49 |
<akosiaris@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on rdb1009.eqiad.wmnet with reason: host reimage |
[production] |
13:48 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
13:46 |
<samtar@deploy1002> |
Finished scap: Backport for [[gerrit:825762|private/readme.php: Add $wgPhonosApiKeyGoogle (T315491)]] (duration: 04m 51s) |
[production] |
13:42 |
<samtar@deploy1002> |
samtar and samtar: Backport for [[gerrit:825762|private/readme.php: Add $wgPhonosApiKeyGoogle (T315491)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet |
[production] |
13:42 |
<samtar@deploy1002> |
Started scap: Backport for [[gerrit:825762|private/readme.php: Add $wgPhonosApiKeyGoogle (T315491)]] |
[production] |
13:38 |
<samtar@deploy1002> |
Synchronized php-1.39.0-wmf.27/extensions/GrowthExperiments/modules/ext.growthExperiments.MentorDashboard.Vue/components/MenteeOverview/MenteeFiltersForm.vue: Backport: [[gerrit:830199|Mentee overview(vue): prevent clicks on more recent edit buttons to submit the filters (T316926)]] (duration: 04m 07s) |
[production] |
13:38 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
13:37 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply |
[production] |
13:37 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply |
[production] |
13:36 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
13:36 |
<akosiaris@cumin1001> |
START - Cookbook sre.hosts.reimage for host rdb1009.eqiad.wmnet with OS bullseye |
[production] |
13:12 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2122 (re)pooling @ 100%: Pooling after upgrade', diff saved to https://phabricator.wikimedia.org/P34107 and previous config saved to /var/cache/conftool/dbconfig/20220907-131223-root.json |
[production] |
12:57 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2122 (re)pooling @ 75%: Pooling after upgrade', diff saved to https://phabricator.wikimedia.org/P34106 and previous config saved to /var/cache/conftool/dbconfig/20220907-125718-root.json |
[production] |
12:57 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2120 (re)pooling @ 100%: Pooling after cloning another host', diff saved to https://phabricator.wikimedia.org/P34105 and previous config saved to /var/cache/conftool/dbconfig/20220907-125706-root.json |
[production] |
12:42 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2122 (re)pooling @ 50%: Pooling after upgrade', diff saved to https://phabricator.wikimedia.org/P34104 and previous config saved to /var/cache/conftool/dbconfig/20220907-124213-root.json |
[production] |
12:42 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2120 (re)pooling @ 75%: Pooling after cloning another host', diff saved to https://phabricator.wikimedia.org/P34103 and previous config saved to /var/cache/conftool/dbconfig/20220907-124201-root.json |
[production] |
12:31 |
<jbond> |
re-enable puppet |
[production] |
12:27 |
<moritzm> |
installing runc security updates on codfw staging hosts |
[production] |
12:27 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2122 (re)pooling @ 25%: Pooling after upgrade', diff saved to https://phabricator.wikimedia.org/P34102 and previous config saved to /var/cache/conftool/dbconfig/20220907-122708-root.json |
[production] |
12:26 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2120 (re)pooling @ 50%: Pooling after cloning another host', diff saved to https://phabricator.wikimedia.org/P34101 and previous config saved to /var/cache/conftool/dbconfig/20220907-122656-root.json |
[production] |
12:12 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2122 (re)pooling @ 10%: Pooling after upgrade', diff saved to https://phabricator.wikimedia.org/P34100 and previous config saved to /var/cache/conftool/dbconfig/20220907-121204-root.json |
[production] |
12:11 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2120 (re)pooling @ 25%: Pooling after cloning another host', diff saved to https://phabricator.wikimedia.org/P34099 and previous config saved to /var/cache/conftool/dbconfig/20220907-121152-root.json |
[production] |
12:08 |
<jbond> |
disable puppet fleet wide to fix issues |
[production] |
11:56 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2122 (re)pooling @ 5%: Pooling after upgrade', diff saved to https://phabricator.wikimedia.org/P34098 and previous config saved to /var/cache/conftool/dbconfig/20220907-115659-root.json |
[production] |
11:56 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2120 (re)pooling @ 10%: Pooling after cloning another host', diff saved to https://phabricator.wikimedia.org/P34097 and previous config saved to /var/cache/conftool/dbconfig/20220907-115647-root.json |
[production] |
11:41 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2122 (re)pooling @ 1%: Pooling after upgrade', diff saved to https://phabricator.wikimedia.org/P34096 and previous config saved to /var/cache/conftool/dbconfig/20220907-114154-root.json |
[production] |
11:41 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2120 (re)pooling @ 5%: Pooling after cloning another host', diff saved to https://phabricator.wikimedia.org/P34095 and previous config saved to /var/cache/conftool/dbconfig/20220907-114142-root.json |
[production] |
11:34 |
<jbond> |
change default puppet file permissions ro root:root |
[production] |
11:18 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1201 (re)pooling @ 100%: Pooling for the first time', diff saved to https://phabricator.wikimedia.org/P34094 and previous config saved to /var/cache/conftool/dbconfig/20220907-111821-root.json |
[production] |
11:05 |
<hnowlan@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/api-gateway: sync |
[production] |
11:04 |
<hnowlan@deploy1002> |
helmfile [codfw] START helmfile.d/services/api-gateway: sync |
[production] |
11:03 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1201 (re)pooling @ 75%: Pooling for the first time', diff saved to https://phabricator.wikimedia.org/P34093 and previous config saved to /var/cache/conftool/dbconfig/20220907-110316-root.json |
[production] |
11:01 |
<btullis@deploy1002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'sync'. |
[production] |
11:01 |
<btullis@deploy1002> |
helmfile [dse-k8s-eqiad] START helmfile.d/admin 'sync'. |
[production] |
11:01 |
<btullis@deploy1002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'sync'. |
[production] |
11:00 |
<btullis@deploy1002> |
helmfile [dse-k8s-eqiad] START helmfile.d/admin 'sync'. |
[production] |
11:00 |
<cgoubert@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on 7 hosts with reason: Downtime pending inclusion in production |
[production] |
11:00 |
<cgoubert@cumin1001> |
START - Cookbook sre.hosts.downtime for 7 days, 0:00:00 on 7 hosts with reason: Downtime pending inclusion in production |
[production] |
11:00 |
<hnowlan@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/api-gateway: sync |
[production] |
10:59 |
<hnowlan@deploy1002> |
helmfile [eqiad] START helmfile.d/services/api-gateway: sync |
[production] |
10:59 |
<hnowlan@deploy1002> |
helmfile [staging] DONE helmfile.d/services/api-gateway: sync |
[production] |
10:59 |
<hnowlan@deploy1002> |
helmfile [staging] START helmfile.d/services/api-gateway: sync |
[production] |
10:53 |
<cgoubert@puppetmaster1001> |
conftool action : set/pooled=inactive; selector: dc=eqiad,cluster=parsoid,name=wtp1046.eqiad.wmnet |
[production] |
10:53 |
<cgoubert@puppetmaster1001> |
conftool action : set/pooled=inactive; selector: dc=eqiad,cluster=parsoid,name=wtp1045.eqiad.wmnet |
[production] |
10:53 |
<cgoubert@puppetmaster1001> |
conftool action : set/pooled=inactive; selector: dc=eqiad,cluster=parsoid,name=wtp1044.eqiad.wmnet |
[production] |
10:52 |
<cgoubert@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on wtp[1044-1046].eqiad.wmnet with reason: Downtiming replaced wtp servers |
[production] |
10:52 |
<cgoubert@cumin1001> |
START - Cookbook sre.hosts.downtime for 7 days, 0:00:00 on wtp[1044-1046].eqiad.wmnet with reason: Downtiming replaced wtp servers |
[production] |
10:48 |
<cgoubert@puppetmaster1001> |
conftool action : set/pooled=no:weight=10; selector: dc=eqiad,cluster=parsoid,name=parse1017.eqiad.wmnet |
[production] |
10:48 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1201 (re)pooling @ 50%: Pooling for the first time', diff saved to https://phabricator.wikimedia.org/P34092 and previous config saved to /var/cache/conftool/dbconfig/20220907-104811-root.json |
[production] |