2025-07-15
§
|
06:30 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.mysql.depool (exit_code=0) db1185 - Depool db1185.eqiad.wmnet to then clone it to db1230.eqiad.wmnet - marostegui@cumin1002 |
[production] |
06:29 |
<marostegui@cumin1002> |
START - Cookbook sre.mysql.depool db1185 - Depool db1185.eqiad.wmnet to then clone it to db1230.eqiad.wmnet - marostegui@cumin1002 |
[production] |
06:29 |
<marostegui@cumin1002> |
START - Cookbook sre.mysql.clone of db1185.eqiad.wmnet onto db1230.eqiad.wmnet |
[production] |
06:29 |
<jmm@cumin2002> |
DONE (PASS) - Cookbook sre.idm.logout (exit_code=0) Logging AndyRussG out of all services on: 2394 hosts |
[production] |
06:18 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db1230.eqiad.wmnet with reason: maintenance |
[production] |
06:06 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool db1230 T399446', diff saved to https://phabricator.wikimedia.org/P79042 and previous config saved to /var/cache/conftool/dbconfig/20250715-060600-root.json |
[production] |
06:05 |
<marostegui@dns1006> |
END - running authdns-update |
[production] |
06:04 |
<marostegui@dns1006> |
START - running authdns-update |
[production] |
06:02 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Promote db1210 to s5 primary and set section read-write T399446', diff saved to https://phabricator.wikimedia.org/P79041 and previous config saved to /var/cache/conftool/dbconfig/20250715-060223-marostegui.json |
[production] |
06:01 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Set s5 eqiad as read-only for maintenance - T399446', diff saved to https://phabricator.wikimedia.org/P79040 and previous config saved to /var/cache/conftool/dbconfig/20250715-060114-root.json |
[production] |
05:54 |
<marostegui> |
Starting s5 eqiad failover from db1230 to db1210 - T399446 |
[production] |
05:50 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Set db1210 with weight 0 T399446', diff saved to https://phabricator.wikimedia.org/P79039 and previous config saved to /var/cache/conftool/dbconfig/20250715-055011-root.json |
[production] |
05:49 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 23 hosts with reason: Primary switchover s5 T399446 |
[production] |
04:01 |
<mwpresync@deploy1003> |
Pruned MediaWiki: 1.45.0-wmf.7 (duration: 01m 42s) |
[production] |
03:48 |
<mwpresync@deploy1003> |
Finished scap sync-world: testwikis to 1.45.0-wmf.10 refs T392180 (duration: 45m 36s) |
[production] |
03:03 |
<mwpresync@deploy1003> |
Started scap sync-world: testwikis to 1.45.0-wmf.10 refs T392180 |
[production] |
2025-07-14
§
|
23:53 |
<zabe@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1169266|Disable categorylinks read new on wikis which depend on missing index]] (duration: 09m 09s) |
[production] |
23:47 |
<zabe@deploy1003> |
zabe: Continuing with sync |
[production] |
23:45 |
<zabe@deploy1003> |
zabe: Backport for [[gerrit:1169266|Disable categorylinks read new on wikis which depend on missing index]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
23:44 |
<zabe@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1169266|Disable categorylinks read new on wikis which depend on missing index]] |
[production] |
23:35 |
<zabe@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1169246|Set categorylinks to read new on more wikis (T397912)]] (duration: 08m 26s) |
[production] |
23:29 |
<zabe@deploy1003> |
zabe: Continuing with sync |
[production] |
23:28 |
<zabe@deploy1003> |
zabe: Backport for [[gerrit:1169246|Set categorylinks to read new on more wikis (T397912)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
23:26 |
<zabe@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1169246|Set categorylinks to read new on more wikis (T397912)]] |
[production] |
22:28 |
<ryankemper@cumin1003> |
END (FAIL) - Cookbook sre.wdqs.restart (exit_code=99) |
[production] |
21:28 |
<ryankemper@cumin1003> |
START - Cookbook sre.wdqs.restart |
[production] |
20:57 |
<dancy@deploy1003> |
Installation of scap version "4.188.2" completed for 2 hosts |
[production] |
20:56 |
<dancy@deploy1003> |
Installing scap version "4.188.2" for 2 host(s) |
[production] |
20:49 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host lvs1017.eqiad.wmnet with OS bookworm |
[production] |
20:49 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - brett@cumin2002" |
[production] |
20:46 |
<dreamyjazz@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1169226|Readers Use Cases Survey: Set token param name (T398870)]] (duration: 09m 30s) |
[production] |
20:45 |
<brett@cumin2002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - brett@cumin2002" |
[production] |
20:40 |
<dreamyjazz@deploy1003> |
dani, dreamyjazz: Continuing with sync |
[production] |
20:38 |
<dreamyjazz@deploy1003> |
dani, dreamyjazz: Backport for [[gerrit:1169226|Readers Use Cases Survey: Set token param name (T398870)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
20:36 |
<dreamyjazz@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1169226|Readers Use Cases Survey: Set token param name (T398870)]] |
[production] |
20:28 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on lvs1017.eqiad.wmnet with reason: host reimage |
[production] |
20:25 |
<brett@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on lvs1017.eqiad.wmnet with reason: host reimage |
[production] |
20:24 |
<dreamyjazz@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1148390|Set hCaptcha config (T382148)]] (duration: 13m 14s) |
[production] |
20:18 |
<dreamyjazz@deploy1003> |
dreamyjazz, reedy: Continuing with sync |
[production] |
20:13 |
<dreamyjazz@deploy1003> |
dreamyjazz, reedy: Backport for [[gerrit:1148390|Set hCaptcha config (T382148)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
20:11 |
<dreamyjazz@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1148390|Set hCaptcha config (T382148)]] |
[production] |
20:08 |
<brett@cumin2002> |
START - Cookbook sre.hosts.reimage for host lvs1017.eqiad.wmnet with OS bookworm |
[production] |
20:08 |
<brett@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host lvs1017.eqiad.wmnet with OS bullseye |
[production] |
19:58 |
<dancy@deploy1003> |
Installing scap version "4.188.0" for 1 host(s) |
[production] |
19:42 |
<brett@cumin2002> |
START - Cookbook sre.hosts.reimage for host lvs1017.eqiad.wmnet with OS bullseye |
[production] |
18:58 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2229 (T399249)', diff saved to https://phabricator.wikimedia.org/P79037 and previous config saved to /var/cache/conftool/dbconfig/20250714-185800-marostegui.json |
[production] |
18:42 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2229', diff saved to https://phabricator.wikimedia.org/P79036 and previous config saved to /var/cache/conftool/dbconfig/20250714-184253-marostegui.json |
[production] |
18:27 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2229', diff saved to https://phabricator.wikimedia.org/P79035 and previous config saved to /var/cache/conftool/dbconfig/20250714-182745-marostegui.json |
[production] |
18:12 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2229 (T399249)', diff saved to https://phabricator.wikimedia.org/P79034 and previous config saved to /var/cache/conftool/dbconfig/20250714-181238-marostegui.json |
[production] |
17:59 |
<eevans@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on aqs1012.eqiad.wmnet with reason: Drive replacement |
[production] |