2023-02-07
ยง
|
16:08 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1187 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P43763 and previous config saved to /var/cache/conftool/dbconfig/20230207-160852-root.json |
[production] |
16:02 |
<urbanecm@deploy1002> |
urbanecm: Backport for [[gerrit:886985|Restore mediawiki.page-undelete hook (T329064)]], [[gerrit:887346|Restore mediawiki.page-undelete hook (T329064)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet |
[production] |
16:00 |
<urbanecm@deploy1002> |
Started scap: Backport for [[gerrit:886985|Restore mediawiki.page-undelete hook (T329064)]], [[gerrit:887346|Restore mediawiki.page-undelete hook (T329064)]] |
[production] |
15:53 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1187 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P43762 and previous config saved to /var/cache/conftool/dbconfig/20230207-155347-root.json |
[production] |
15:53 |
<moritzm> |
installing tiff security updates |
[production] |
15:48 |
<jiji@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mc2045.codfw.wmnet with OS bullseye |
[production] |
15:47 |
<urbanecm@deploy1002> |
Finished scap: 20a79c55b7073e791e297a5389fa66819f596178: Don't add custom attributes in unwrapParsoidSections() (T328268) (duration: 07m 34s) |
[production] |
15:43 |
<jiji@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mc1043.eqiad.wmnet with OS bullseye |
[production] |
15:39 |
<urbanecm@deploy1002> |
Started scap: 20a79c55b7073e791e297a5389fa66819f596178: Don't add custom attributes in unwrapParsoidSections() (T328268) |
[production] |
15:38 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1187 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P43761 and previous config saved to /var/cache/conftool/dbconfig/20230207-153842-root.json |
[production] |
15:32 |
<jiji@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc2045.codfw.wmnet with reason: host reimage |
[production] |
15:29 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mc2045.codfw.wmnet with reason: host reimage |
[production] |
15:28 |
<jiji@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc1043.eqiad.wmnet with reason: host reimage |
[production] |
15:26 |
<urbanecm@deploy1002> |
Finished scap: Backport for [[gerrit:886997|Add "Page Frame" to DiscussionTools beta feature on enwiki (T327456)]] (duration: 10m 39s) |
[production] |
15:25 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mc1043.eqiad.wmnet with reason: host reimage |
[production] |
15:23 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1187 (re)pooling @ 5%: Repooling', diff saved to https://phabricator.wikimedia.org/P43760 and previous config saved to /var/cache/conftool/dbconfig/20230207-152337-root.json |
[production] |
15:20 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host people1003.eqiad.wmnet |
[production] |
15:17 |
<urbanecm@deploy1002> |
matmarex and urbanecm: Backport for [[gerrit:886997|Add "Page Frame" to DiscussionTools beta feature on enwiki (T327456)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet |
[production] |
15:16 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host people1003.eqiad.wmnet |
[production] |
15:15 |
<urbanecm@deploy1002> |
Started scap: Backport for [[gerrit:886997|Add "Page Frame" to DiscussionTools beta feature on enwiki (T327456)]] |
[production] |
15:14 |
<volans@cumin2002> |
END (PASS) - Cookbook sre.discovery.service-route (exit_code=0) depool restbase-async in eqiad: T327925 |
[production] |
15:14 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host sretest1001.eqiad.wmnet |
[production] |
15:13 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.reimage for host mc1043.eqiad.wmnet with OS bullseye |
[production] |
15:13 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.reimage for host mc2045.codfw.wmnet with OS bullseye |
[production] |
15:12 |
<vgutierrez> |
repool codfw edge site - T327925 |
[production] |
15:09 |
<volans@cumin2002> |
END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) restbase-async.discovery.wmnet on all recursors |
[production] |
15:09 |
<volans@cumin2002> |
START - Cookbook sre.dns.wipe-cache restbase-async.discovery.wmnet on all recursors |
[production] |
15:09 |
<volans@cumin2002> |
START - Cookbook sre.discovery.service-route depool restbase-async in eqiad: T327925 |
[production] |
15:08 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host sretest1001.eqiad.wmnet |
[production] |
15:07 |
<volans@cumin2002> |
END (PASS) - Cookbook sre.discovery.datacenter-route (exit_code=0) pool all active/active services in codfw: T327925 |
[production] |
15:05 |
<marostegui> |
dbmaint deploy schema change on s8 T328807 T328828 |
[production] |
15:04 |
<vgutierrez> |
restart pybal in lvs2010 - T327925 |
[production] |
15:01 |
<marostegui> |
dbmaint deploy schema change on s6 T328807 |
[production] |
15:00 |
<vgutierrez> |
restart pybal in lvs2009 - T327925 |
[production] |
14:59 |
<marostegui> |
dbmaint deploy schema change on s6 T328828 |
[production] |
14:53 |
<moritzm> |
adding nfraison to pwstore T328915 |
[production] |
14:46 |
<volans@cumin2002> |
START - Cookbook sre.discovery.datacenter-route pool all active/active services in codfw: T327925 |
[production] |
14:40 |
<filippo@puppetmaster1001> |
conftool action : set/pooled=no; selector: name=thanos-fe2002.codfw.wmnet,service=thanos-web |
[production] |
14:40 |
<filippo@puppetmaster1001> |
conftool action : set/pooled=yes; selector: name=thanos-fe2001.codfw.wmnet,service=thanos-web |
[production] |
14:36 |
<claime> |
repooled appserver, api_appserver, jobrunner, parsoid - T327925 |
[production] |
14:36 |
<mvernon@cumin2002> |
END (PASS) - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies (exit_code=0) rolling restart_daemons on A:codfw and (A:swift-fe or A:swift-fe-canary or A:swift-fe-codfw or A:swift-fe-eqiad) |
[production] |
14:36 |
<cgoubert@cumin1001> |
conftool action : set/pooled=yes; selector: dc=codfw,cluster=api_appserver |
[production] |
14:35 |
<cgoubert@cumin1001> |
conftool action : set/pooled=yes; selector: dc=codfw,cluster=jobrunner |
[production] |
14:35 |
<cgoubert@cumin1001> |
conftool action : set/pooled=yes; selector: dc=codfw,cluster=appserver |
[production] |
14:35 |
<cgoubert@cumin1001> |
conftool action : set/pooled=yes; selector: dc=codfw,cluster=parsoid |
[production] |
14:32 |
<Emperor> |
pool ms-fe2009 (codfw as a whole still depooled) T327925 |
[production] |
14:27 |
<jbond> |
enable puppet in codfw, uslfo, esams post switch upgrade T327925 |
[production] |
14:26 |
<claime> |
depooled appserver, api_appserver, jobrunner, parsoid - T327925 |
[production] |
14:25 |
<mvernon@cumin2002> |
START - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies rolling restart_daemons on A:codfw and (A:swift-fe or A:swift-fe-canary or A:swift-fe-codfw or A:swift-fe-eqiad) |
[production] |
14:21 |
<cgoubert@cumin1001> |
conftool action : set/pooled=no; selector: dc=codfw,cluster=parsoid |
[production] |