2024-02-08
20:07 |
<cmooney@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
19:01 |
<brennen> |
train 1.42.0-wmf.17 (T354435): currently rolled back to group0; blocked pending a fix for edit metrics (further details to come) |
[production] |
18:58 |
<ejegg> |
re-enabled fundraising scheduled jobs |
[production] |
18:49 |
<ejegg> |
standalone SmashPig upgraded from 20d6434e to 669a9fe3 |
[production] |
18:48 |
<brennen@deploy2002> |
rebuilt and synchronized wikiversions files: group0 wikis to 1.42.0-wmf.17 refs T354435 |
[production] |
18:41 |
<ejegg> |
jobs disabled for option change |
[production] |
18:03 |
<bd808@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/toolhub: apply |
[production] |
18:02 |
<bd808@deploy2002> |
helmfile [eqiad] START helmfile.d/services/toolhub: apply |
[production] |
18:02 |
<bd808@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/toolhub: apply |
[production] |
18:01 |
<bd808@deploy2002> |
helmfile [codfw] START helmfile.d/services/toolhub: apply |
[production] |
18:01 |
<bd808@deploy2002> |
helmfile [staging] DONE helmfile.d/services/toolhub: apply |
[production] |
18:00 |
<bd808@deploy2002> |
helmfile [staging] START helmfile.d/services/toolhub: apply |
[production] |
17:52 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2020 (re)pooling @ 100%: After network maintenance', diff saved to https://phabricator.wikimedia.org/P56563 and previous config saved to /var/cache/conftool/dbconfig/20240208-175206-root.json |
[production] |
17:51 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2103 (re)pooling @ 100%: After network maintenance', diff saved to https://phabricator.wikimedia.org/P56562 and previous config saved to /var/cache/conftool/dbconfig/20240208-175149-root.json |
[production] |
17:45 |
<mutante> |
deploy1002/deploy2002 - change in scap foreachwikiindblist deployed (gerrit:992263) |
[production] |
17:37 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2020 (re)pooling @ 75%: After network maintenance', diff saved to https://phabricator.wikimedia.org/P56561 and previous config saved to /var/cache/conftool/dbconfig/20240208-173701-root.json |
[production] |
17:36 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2103 (re)pooling @ 75%: After network maintenance', diff saved to https://phabricator.wikimedia.org/P56560 and previous config saved to /var/cache/conftool/dbconfig/20240208-173644-root.json |
[production] |
17:29 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2122 (re)pooling @ 100%: After schema change', diff saved to https://phabricator.wikimedia.org/P56559 and previous config saved to /var/cache/conftool/dbconfig/20240208-172902-root.json |
[production] |
17:21 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2020 (re)pooling @ 50%: After network maintenance', diff saved to https://phabricator.wikimedia.org/P56558 and previous config saved to /var/cache/conftool/dbconfig/20240208-172156-root.json |
[production] |
17:21 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2103 (re)pooling @ 50%: After network maintenance', diff saved to https://phabricator.wikimedia.org/P56557 and previous config saved to /var/cache/conftool/dbconfig/20240208-172139-root.json |
[production] |
17:15 |
<brennen@deploy2002> |
Synchronized php: group1 wikis to 1.42.0-wmf.17 refs T354435 (duration: 06m 52s) |
[production] |
17:13 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2122 (re)pooling @ 75%: After schema change', diff saved to https://phabricator.wikimedia.org/P56556 and previous config saved to /var/cache/conftool/dbconfig/20240208-171358-root.json |
[production] |
17:09 |
<brennen@deploy2002> |
rebuilt and synchronized wikiversions files: group1 wikis to 1.42.0-wmf.17 refs T354435 |
[production] |
17:06 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2020 (re)pooling @ 25%: After network maintenance', diff saved to https://phabricator.wikimedia.org/P56555 and previous config saved to /var/cache/conftool/dbconfig/20240208-170651-root.json |
[production] |
17:06 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2103 (re)pooling @ 25%: After network maintenance', diff saved to https://phabricator.wikimedia.org/P56554 and previous config saved to /var/cache/conftool/dbconfig/20240208-170634-root.json |
[production] |
17:01 |
<brennen> |
train 1.42.0-wmf.17 (T354435): blockers resolved, rolling to group1 |
[production] |
16:58 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2122 (re)pooling @ 50%: After schema change', diff saved to https://phabricator.wikimedia.org/P56553 and previous config saved to /var/cache/conftool/dbconfig/20240208-165853-root.json |
[production] |
16:51 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2020 (re)pooling @ 10%: After network maintenance', diff saved to https://phabricator.wikimedia.org/P56552 and previous config saved to /var/cache/conftool/dbconfig/20240208-165147-root.json |
[production] |
16:51 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.mysql.clone (exit_code=0) Will create a clone of db2169.codfw.wmnet onto db2194.codfw.wmnet |
[production] |
16:51 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2103 (re)pooling @ 10%: After network maintenance', diff saved to https://phabricator.wikimedia.org/P56551 and previous config saved to /var/cache/conftool/dbconfig/20240208-165129-root.json |
[production] |
16:48 |
<cgoubert@cumin2002> |
conftool action : set/pooled=yes; selector: name=(mw2379|mw2380|mw2382|mw2383|mw2384|mw2385|mw2386|mw2387|mw2388|mw2389|mw2390|mw2391|mw2392|mw2393|mw2394|mw2396|mw2397|mw2398|mw2399|mw2400|mw2298|mw2299|mw2300).* |
[production] |
16:48 |
<claime> |
Repooling mw2379|mw2380|mw2382|mw2383|mw2384|mw2385|mw2386|mw2387|mw2388|mw2389|mw2390|mw2391|mw2392|mw2393|mw2394|mw2396|mw2397|mw2398|mw2399|mw2400|mw2298|mw2299|mw2300 - T355862 |
[production] |
16:43 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2122 (re)pooling @ 25%: After schema change', diff saved to https://phabricator.wikimedia.org/P56550 and previous config saved to /var/cache/conftool/dbconfig/20240208-164348-root.json |
[production] |
16:40 |
<claime> |
Uncordoning mw2377.codfw.wmnet mw2378.codfw.wmnet mw2381.codfw.wmnet mw2395.codfw.wmnet mw2291.codfw.wmnet mw2292.codfw.wmnet mw2293.codfw.wmnet mw2294.codfw.wmnet mw2295.codfw.wmnet mw2296.codfw.wmnet mw2297.codfw.wmnet - T355862 |
[production] |
16:37 |
<cmooney@cumin1002> |
END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for asw-a-codfw,cr[1-2]-codfw,lsw1-a3-codfw.mgmt |
[production] |
16:37 |
<cmooney@cumin1002> |
START - Cookbook sre.hosts.remove-downtime for asw-a-codfw,cr[1-2]-codfw,lsw1-a3-codfw.mgmt |
[production] |
16:36 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2020 (re)pooling @ 5%: After network maintenance', diff saved to https://phabricator.wikimedia.org/P56549 and previous config saved to /var/cache/conftool/dbconfig/20240208-163642-root.json |
[production] |
16:36 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2103 (re)pooling @ 5%: After network maintenance', diff saved to https://phabricator.wikimedia.org/P56548 and previous config saved to /var/cache/conftool/dbconfig/20240208-163624-root.json |
[production] |
16:31 |
<hnowlan@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host mw2282.codfw.wmnet with OS bullseye |
[production] |
16:28 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2122 (re)pooling @ 10%: After schema change', diff saved to https://phabricator.wikimedia.org/P56547 and previous config saved to /var/cache/conftool/dbconfig/20240208-162843-root.json |
[production] |
16:26 |
<hnowlan@cumin2002> |
START - Cookbook sre.hosts.reimage for host mw2282.codfw.wmnet with OS bullseye |
[production] |
16:23 |
<topranks> |
Server move completed codfw rack A3 T355862 |
[production] |
16:15 |
<Dreamy_Jazz> |
Running `mwscript extensions/MediaModeration/maintenance/scanFilesInScanTable.php --wiki=commonswiki --use-jobqueue --sleep 30 --verbose 2>&1 | tee ~/scan-files-in-scan-table-commonswiki-sleep-30-no-render-now.txt` on a tmux session - See https://wikitech.wikimedia.org/wiki/MediaModeration |
[production] |
16:13 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2122 (re)pooling @ 5%: After schema change', diff saved to https://phabricator.wikimedia.org/P56546 and previous config saved to /var/cache/conftool/dbconfig/20240208-161338-root.json |
[production] |
16:10 |
<cmooney@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on 39 hosts with reason: Migrating servers in codfw rack A3 to lsw1-a3-codfw |
[production] |
16:10 |
<hnowlan@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host mw2282.codfw.wmnet with OS bullseye |
[production] |
16:09 |
<cmooney@cumin1002> |
START - Cookbook sre.hosts.downtime for 0:30:00 on 39 hosts with reason: Migrating servers in codfw rack A3 to lsw1-a3-codfw |
[production] |
16:09 |
<cmooney@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on asw-a-codfw,cr[1-2]-codfw,lsw1-a3-codfw.mgmt with reason: server uplink migration codfw rack a3 |
[production] |
16:09 |
<cmooney@cumin1002> |
START - Cookbook sre.hosts.downtime for 1:00:00 on asw-a-codfw,cr[1-2]-codfw,lsw1-a3-codfw.mgmt with reason: server uplink migration codfw rack a3 |
[production] |
16:07 |
<topranks> |
Commencing server uplink moves from old switch to new in codfw rack A3 T355862 |
[production] |