2024-05-16
ยง
|
14:28 |
<ladsgroup@deploy1002> |
Finished scap: Backport for [[gerrit:1032493|Stop writing to the old columns of pagelinks in s6 (T352010)]] (duration: 15m 42s) |
[production] |
14:28 |
<hnowlan> |
migrated 5% of commons traffic to k8s |
[production] |
14:28 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2174.codfw.wmnet with reason: host reimage |
[production] |
14:25 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db2174.codfw.wmnet with reason: host reimage |
[production] |
14:19 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'db2176 (re)pooling @ 2%: post reimage repool', diff saved to https://phabricator.wikimedia.org/P62510 and previous config saved to /var/cache/conftool/dbconfig/20240516-141957-arnaudb.json |
[production] |
14:19 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es1024 (re)pooling @ 50%: After reimage', diff saved to https://phabricator.wikimedia.org/P62509 and previous config saved to /var/cache/conftool/dbconfig/20240516-141932-root.json |
[production] |
14:15 |
<ladsgroup@deploy1002> |
ladsgroup: Continuing with sync |
[production] |
14:15 |
<ladsgroup@deploy1002> |
ladsgroup: Backport for [[gerrit:1032493|Stop writing to the old columns of pagelinks in s6 (T352010)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
14:13 |
<ladsgroup@deploy1002> |
Started scap: Backport for [[gerrit:1032493|Stop writing to the old columns of pagelinks in s6 (T352010)]] |
[production] |
14:09 |
<Lucas_WMDE> |
START lucaswerkmeister-wmde@mwmaint1002:~$ time mwscript extensions/DiscussionTools/maintenance/persistRevisionThreadItems.php --wiki enwiki --current --all --start '["76318767"]' 2>&1 | tee -a ~/T315510-enwiki-5; date |
[production] |
14:08 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.reimage for host db2174.codfw.wmnet with OS bookworm |
[production] |
14:07 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2174.codfw.wmnet with reason: reimage |
[production] |
14:07 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db2174.codfw.wmnet with reason: reimage |
[production] |
14:06 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'T364290 db2174', diff saved to https://phabricator.wikimedia.org/P62508 and previous config saved to /var/cache/conftool/dbconfig/20240516-140620-arnaudb.json |
[production] |
14:04 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'db2176 (re)pooling @ 1%: post reimage repool', diff saved to https://phabricator.wikimedia.org/P62507 and previous config saved to /var/cache/conftool/dbconfig/20240516-140451-arnaudb.json |
[production] |
14:04 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es1024 (re)pooling @ 25%: After reimage', diff saved to https://phabricator.wikimedia.org/P62506 and previous config saved to /var/cache/conftool/dbconfig/20240516-140426-root.json |
[production] |
14:04 |
<jsn@deploy1002> |
Finished scap: Backport for [[gerrit:1032429|Make EntitySchemaValue::getArrayValue() match EntityIdValue (T362955 T362001)]], [[gerrit:1032430|Make EntitySchemaValue::getArrayValue() match EntityIdValue (T362955 T362001)]] (duration: 16m 11s) |
[production] |
14:03 |
<Emperor> |
depool, restart swift-proxy, repool ms-fe1010 as ~12% connection failures reported by envoy since late 14th May T360913 |
[production] |
13:59 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db2176.codfw.wmnet with OS bookworm |
[production] |
13:51 |
<jsn@deploy1002> |
jsn and lucaswerkmeister-wmde: Continuing with sync |
[production] |
13:50 |
<jsn@deploy1002> |
jsn and lucaswerkmeister-wmde: Backport for [[gerrit:1032429|Make EntitySchemaValue::getArrayValue() match EntityIdValue (T362955 T362001)]], [[gerrit:1032430|Make EntitySchemaValue::getArrayValue() match EntityIdValue (T362955 T362001)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
13:49 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es1024 (re)pooling @ 10%: After reimage', diff saved to https://phabricator.wikimedia.org/P62505 and previous config saved to /var/cache/conftool/dbconfig/20240516-134918-root.json |
[production] |
13:48 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host es1024.eqiad.wmnet with OS bookworm |
[production] |
13:47 |
<jsn@deploy1002> |
Started scap: Backport for [[gerrit:1032429|Make EntitySchemaValue::getArrayValue() match EntityIdValue (T362955 T362001)]], [[gerrit:1032430|Make EntitySchemaValue::getArrayValue() match EntityIdValue (T362955 T362001)]] |
[production] |
13:37 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2176.codfw.wmnet with reason: host reimage |
[production] |
13:34 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db2176.codfw.wmnet with reason: host reimage |
[production] |
13:32 |
<jsn@deploy1002> |
Finished scap: Backport for [[gerrit:1031028|Enable async jobqueue-powered URL uploads on commons (T295007)]] (duration: 18m 18s) |
[production] |
13:31 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on es1024.eqiad.wmnet with reason: host reimage |
[production] |
13:27 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on es1024.eqiad.wmnet with reason: host reimage |
[production] |
13:19 |
<jsn@deploy1002> |
jsn and hnowlan: Continuing with sync |
[production] |
13:18 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1173 (T352010)', diff saved to https://phabricator.wikimedia.org/P62503 and previous config saved to /var/cache/conftool/dbconfig/20240516-131800-ladsgroup.json |
[production] |
13:17 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.reimage for host db2176.codfw.wmnet with OS bookworm |
[production] |
13:16 |
<jsn@deploy1002> |
jsn and hnowlan: Backport for [[gerrit:1031028|Enable async jobqueue-powered URL uploads on commons (T295007)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
13:15 |
<arnaudb@cumin1002> |
END (ERROR) - Cookbook sre.mysql.upgrade (exit_code=97) for db2176.codfw.wmnet |
[production] |
13:15 |
<arnaudb@cumin1002> |
START - Cookbook sre.mysql.upgrade for db2176.codfw.wmnet |
[production] |
13:14 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'T364290 db2176', diff saved to https://phabricator.wikimedia.org/P62502 and previous config saved to /var/cache/conftool/dbconfig/20240516-131429-arnaudb.json |
[production] |
13:14 |
<jsn@deploy1002> |
Started scap: Backport for [[gerrit:1031028|Enable async jobqueue-powered URL uploads on commons (T295007)]] |
[production] |
13:12 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.reimage for host es1024.eqiad.wmnet with OS bookworm |
[production] |
13:11 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool es1024 T364289', diff saved to https://phabricator.wikimedia.org/P62501 and previous config saved to /var/cache/conftool/dbconfig/20240516-131111-root.json |
[production] |
13:02 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1173', diff saved to https://phabricator.wikimedia.org/P62500 and previous config saved to /var/cache/conftool/dbconfig/20240516-130252-ladsgroup.json |
[production] |
12:47 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1173', diff saved to https://phabricator.wikimedia.org/P62499 and previous config saved to /var/cache/conftool/dbconfig/20240516-124743-ladsgroup.json |
[production] |
10:48 |
<fnegri@cumin1002> |
END (PASS) - Cookbook sre.wikireplicas.update-views (exit_code=0) |
[production] |
10:46 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'db1202 (re)pooling @ 100%: Maint over', diff saved to https://phabricator.wikimedia.org/P62497 and previous config saved to /var/cache/conftool/dbconfig/20240516-104601-ladsgroup.json |
[production] |
10:43 |
<claime> |
New redirects for T25216 T204830 T31186 operational |
[production] |
10:37 |
<fnegri@cumin1002> |
START - Cookbook sre.wikireplicas.update-views |
[production] |
10:32 |
<claime> |
cumin 'A:all-mw' -b30 "run-puppet-agent -q" - T25216 T204830 T31186 |
[production] |
10:31 |
<claime> |
cumin 'A:all-mw' "enable-puppet 'New redirects T25216 T204830 T31186 - cgoubert'" |
[production] |
10:31 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Test pc4 master switch', diff saved to https://phabricator.wikimedia.org/P62496 and previous config saved to /var/cache/conftool/dbconfig/20240516-103148-marostegui.json |
[production] |
10:30 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'db1202 (re)pooling @ 75%: Maint over', diff saved to https://phabricator.wikimedia.org/P62495 and previous config saved to /var/cache/conftool/dbconfig/20240516-103055-ladsgroup.json |
[production] |
10:30 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Test pc4 master switch', diff saved to https://phabricator.wikimedia.org/P62494 and previous config saved to /var/cache/conftool/dbconfig/20240516-103039-marostegui.json |
[production] |