2024-06-27
§
|
06:15 |
<arnaudb> |
Starting es6 eqiad failover from es1037 to es1038 - T368401 |
[production] |
06:10 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Set es1038 with weight 0 T368401', diff saved to https://phabricator.wikimedia.org/P65507 and previous config saved to /var/cache/conftool/dbconfig/20240627-061055-arnaudb.json |
[production] |
06:10 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 6 hosts with reason: Primary switchover es6 T368401 |
[production] |
06:10 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 1:00:00 on 6 hosts with reason: Primary switchover es6 T368401 |
[production] |
06:09 |
<arnaudb@deploy1002> |
Finished scap: Backport for [[gerrit:1049555|mariadb: disable writes on es6 (T368401)]] (duration: 08m 00s) |
[production] |
06:04 |
<arnaudb@deploy1002> |
arnaudb: Continuing with sync |
[production] |
06:04 |
<arnaudb@deploy1002> |
arnaudb: Backport for [[gerrit:1049555|mariadb: disable writes on es6 (T368401)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
06:01 |
<arnaudb@deploy1002> |
Started scap: Backport for [[gerrit:1049555|mariadb: disable writes on es6 (T368401)]] |
[production] |
03:55 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1216.eqiad.wmnet with reason: Maintenance |
[production] |
03:55 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1216.eqiad.wmnet with reason: Maintenance |
[production] |
03:55 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1214 (T364069)', diff saved to https://phabricator.wikimedia.org/P65506 and previous config saved to /var/cache/conftool/dbconfig/20240627-035544-marostegui.json |
[production] |
03:40 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1214', diff saved to https://phabricator.wikimedia.org/P65505 and previous config saved to /var/cache/conftool/dbconfig/20240627-034037-marostegui.json |
[production] |
03:25 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1214', diff saved to https://phabricator.wikimedia.org/P65504 and previous config saved to /var/cache/conftool/dbconfig/20240627-032530-marostegui.json |
[production] |
03:10 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1214 (T364069)', diff saved to https://phabricator.wikimedia.org/P65503 and previous config saved to /var/cache/conftool/dbconfig/20240627-031023-marostegui.json |
[production] |
00:56 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db2175 (T367856)', diff saved to https://phabricator.wikimedia.org/P65502 and previous config saved to /var/cache/conftool/dbconfig/20240627-005613-marostegui.json |
[production] |
00:56 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2175.codfw.wmnet with reason: Maintenance |
[production] |
00:55 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2175.codfw.wmnet with reason: Maintenance |
[production] |
00:55 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2148 (T367856)', diff saved to https://phabricator.wikimedia.org/P65501 and previous config saved to /var/cache/conftool/dbconfig/20240627-005549-marostegui.json |
[production] |
00:40 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2148', diff saved to https://phabricator.wikimedia.org/P65500 and previous config saved to /var/cache/conftool/dbconfig/20240627-004042-marostegui.json |
[production] |
00:25 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2148', diff saved to https://phabricator.wikimedia.org/P65499 and previous config saved to /var/cache/conftool/dbconfig/20240627-002535-marostegui.json |
[production] |
00:10 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2148 (T367856)', diff saved to https://phabricator.wikimedia.org/P65498 and previous config saved to /var/cache/conftool/dbconfig/20240627-001028-marostegui.json |
[production] |
2024-06-26
§
|
23:56 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5021.eqsin.wmnet with OS bullseye |
[production] |
23:26 |
<mutante> |
people1004 - stopped confd which logs every 3 seconds that it can't find any templates (T356296) |
[production] |
23:23 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5021.eqsin.wmnet with reason: host reimage |
[production] |
23:20 |
<brett@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cp5021.eqsin.wmnet with reason: host reimage |
[production] |
23:10 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db1214 (T364069)', diff saved to https://phabricator.wikimedia.org/P65497 and previous config saved to /var/cache/conftool/dbconfig/20240626-231020-marostegui.json |
[production] |
23:10 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1214.eqiad.wmnet with reason: Maintenance |
[production] |
23:10 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1214.eqiad.wmnet with reason: Maintenance |
[production] |
23:09 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1211 (T364069)', diff saved to https://phabricator.wikimedia.org/P65496 and previous config saved to /var/cache/conftool/dbconfig/20240626-230958-marostegui.json |
[production] |
22:54 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1211', diff saved to https://phabricator.wikimedia.org/P65495 and previous config saved to /var/cache/conftool/dbconfig/20240626-225451-marostegui.json |
[production] |
22:47 |
<brett@cumin2002> |
START - Cookbook sre.hosts.reimage for host cp5021.eqsin.wmnet with OS bullseye |
[production] |
22:41 |
<brett@puppetmaster1001> |
conftool action : set/pooled=no; selector: name=cp5021.eqsin.wmnet |
[production] |
22:39 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1211', diff saved to https://phabricator.wikimedia.org/P65494 and previous config saved to /var/cache/conftool/dbconfig/20240626-223944-marostegui.json |
[production] |
22:26 |
<brett@puppetmaster1001> |
conftool action : set/pooled=yes; selector: name=cp5020.eqsin.wmnet |
[production] |
22:24 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1211 (T364069)', diff saved to https://phabricator.wikimedia.org/P65493 and previous config saved to /var/cache/conftool/dbconfig/20240626-222434-marostegui.json |
[production] |
22:22 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5020.eqsin.wmnet with OS bullseye |
[production] |
21:50 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5020.eqsin.wmnet with reason: host reimage |
[production] |
21:46 |
<brett@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cp5020.eqsin.wmnet with reason: host reimage |
[production] |
21:40 |
<cjming> |
end of UTC late backport window |
[production] |
21:38 |
<cjming@deploy1002> |
Finished scap: Backport for [[gerrit:1050005|Homepage: don't load yesterdays edits on desktop (T368405)]] (duration: 08m 48s) |
[production] |
21:33 |
<cjming@deploy1002> |
cjming, migr: Continuing with sync |
[production] |
21:32 |
<cjming@deploy1002> |
cjming, migr: Backport for [[gerrit:1050005|Homepage: don't load yesterdays edits on desktop (T368405)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
21:29 |
<cjming@deploy1002> |
Started scap: Backport for [[gerrit:1050005|Homepage: don't load yesterdays edits on desktop (T368405)]] |
[production] |
21:29 |
<hashar> |
restarting CI Jenkins |
[production] |
21:13 |
<brett@cumin2002> |
START - Cookbook sre.hosts.reimage for host cp5020.eqsin.wmnet with OS bullseye |
[production] |
21:13 |
<brett@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp5020.eqsin.wmnet with OS bullseye |
[production] |
21:05 |
<cjming@deploy1002> |
Finished scap: Backport for [[gerrit:1050002|Homepage: log rendering time for each module and each wiki (T368405)]] (duration: 14m 01s) |
[production] |
20:59 |
<eileen> |
config revision changed from 0b822cd3 to 994e7b81 |
[production] |
20:57 |
<cjming@deploy1002> |
cjming, migr: Continuing with sync |
[production] |
20:55 |
<cjming@deploy1002> |
cjming, migr: Backport for [[gerrit:1050002|Homepage: log rendering time for each module and each wiki (T368405)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |