2024-06-27
07:32 |
<slyngshede@cumin1002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM idp-test1004.wikimedia.org - slyngshede@cumin1002" |
[production] |
07:31 |
<kartik@deploy1002> |
kcvelaga, kartik: Continuing with sync |
[production] |
07:30 |
<kartik@deploy1002> |
kcvelaga, kartik: Backport for [[gerrit:1048393|Add Metrics Platform stream configuration and registration for MinT for Wikipedia Readers feature by Language and Product Localization team. (T368028)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
07:27 |
<kartik@deploy1002> |
Started scap: Backport for [[gerrit:1048393|Add Metrics Platform stream configuration and registration for MinT for Wikipedia Readers feature by Language and Product Localization team. (T368028)]] |
[production] |
07:24 |
<slyngshede@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
07:24 |
<slyngshede@cumin1002> |
START - Cookbook sre.ganeti.makevm for new host idp-test1004.wikimedia.org |
[production] |
07:18 |
<kartik@deploy1002> |
Finished scap: Backport for [[gerrit:1049898|Enable MinT for Wikipedia readers MVP on a set of pilot wikis (T363465)]] (duration: 14m 19s) |
[production] |
07:13 |
<kartik@deploy1002> |
kartik: Continuing with sync |
[production] |
07:06 |
<kartik@deploy1002> |
kartik: Backport for [[gerrit:1049898|Enable MinT for Wikipedia readers MVP on a set of pilot wikis (T363465)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
07:04 |
<kartik@deploy1002> |
Started scap: Backport for [[gerrit:1049898|Enable MinT for Wikipedia readers MVP on a set of pilot wikis (T363465)]] |
[production] |
06:45 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'weight es1038 T368401', diff saved to https://phabricator.wikimedia.org/P65510 and previous config saved to /var/cache/conftool/dbconfig/20240627-064506-arnaudb.json |
[production] |
06:40 |
<arnaudb@deploy1002> |
Finished scap: Backport for [[gerrit:1050096|Revert "mariadb: disable writes on es6"]] (duration: 07m 43s) |
[production] |
06:35 |
<arnaudb@deploy1002> |
arnaudb: Continuing with sync |
[production] |
06:35 |
<arnaudb@deploy1002> |
arnaudb: Backport for [[gerrit:1050096|Revert "mariadb: disable writes on es6"]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
06:32 |
<arnaudb@deploy1002> |
Started scap: Backport for [[gerrit:1050096|Revert "mariadb: disable writes on es6"]] |
[production] |
06:23 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'weight es1037 T368401', diff saved to https://phabricator.wikimedia.org/P65509 and previous config saved to /var/cache/conftool/dbconfig/20240627-062338-arnaudb.json |
[production] |
06:16 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Promote es1038 to es6 primary T368401', diff saved to https://phabricator.wikimedia.org/P65508 and previous config saved to /var/cache/conftool/dbconfig/20240627-061639-arnaudb.json |
[production] |
06:15 |
<arnaudb> |
Starting es6 eqiad failover from es1037 to es1038 - T368401 |
[production] |
06:10 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Set es1038 with weight 0 T368401', diff saved to https://phabricator.wikimedia.org/P65507 and previous config saved to /var/cache/conftool/dbconfig/20240627-061055-arnaudb.json |
[production] |
06:10 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 6 hosts with reason: Primary switchover es6 T368401 |
[production] |
06:10 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 1:00:00 on 6 hosts with reason: Primary switchover es6 T368401 |
[production] |
06:09 |
<arnaudb@deploy1002> |
Finished scap: Backport for [[gerrit:1049555|mariadb: disable writes on es6 (T368401)]] (duration: 08m 00s) |
[production] |
06:04 |
<arnaudb@deploy1002> |
arnaudb: Continuing with sync |
[production] |
06:04 |
<arnaudb@deploy1002> |
arnaudb: Backport for [[gerrit:1049555|mariadb: disable writes on es6 (T368401)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
06:01 |
<arnaudb@deploy1002> |
Started scap: Backport for [[gerrit:1049555|mariadb: disable writes on es6 (T368401)]] |
[production] |
03:55 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1216.eqiad.wmnet with reason: Maintenance |
[production] |
03:55 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1216.eqiad.wmnet with reason: Maintenance |
[production] |
03:55 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1214 (T364069)', diff saved to https://phabricator.wikimedia.org/P65506 and previous config saved to /var/cache/conftool/dbconfig/20240627-035544-marostegui.json |
[production] |
03:40 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1214', diff saved to https://phabricator.wikimedia.org/P65505 and previous config saved to /var/cache/conftool/dbconfig/20240627-034037-marostegui.json |
[production] |
03:25 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1214', diff saved to https://phabricator.wikimedia.org/P65504 and previous config saved to /var/cache/conftool/dbconfig/20240627-032530-marostegui.json |
[production] |
03:10 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1214 (T364069)', diff saved to https://phabricator.wikimedia.org/P65503 and previous config saved to /var/cache/conftool/dbconfig/20240627-031023-marostegui.json |
[production] |
00:56 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db2175 (T367856)', diff saved to https://phabricator.wikimedia.org/P65502 and previous config saved to /var/cache/conftool/dbconfig/20240627-005613-marostegui.json |
[production] |
00:56 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2175.codfw.wmnet with reason: Maintenance |
[production] |
00:55 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2175.codfw.wmnet with reason: Maintenance |
[production] |
00:55 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2148 (T367856)', diff saved to https://phabricator.wikimedia.org/P65501 and previous config saved to /var/cache/conftool/dbconfig/20240627-005549-marostegui.json |
[production] |
00:40 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2148', diff saved to https://phabricator.wikimedia.org/P65500 and previous config saved to /var/cache/conftool/dbconfig/20240627-004042-marostegui.json |
[production] |
00:25 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2148', diff saved to https://phabricator.wikimedia.org/P65499 and previous config saved to /var/cache/conftool/dbconfig/20240627-002535-marostegui.json |
[production] |
00:10 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2148 (T367856)', diff saved to https://phabricator.wikimedia.org/P65498 and previous config saved to /var/cache/conftool/dbconfig/20240627-001028-marostegui.json |
[production] |
2024-06-26
23:56 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5021.eqsin.wmnet with OS bullseye |
[production] |
23:26 |
<mutante> |
people1004 - stopped confd which logs every 3 seconds that it can't find any templates (T356296) |
[production] |
23:23 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5021.eqsin.wmnet with reason: host reimage |
[production] |
23:20 |
<brett@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cp5021.eqsin.wmnet with reason: host reimage |
[production] |
23:10 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db1214 (T364069)', diff saved to https://phabricator.wikimedia.org/P65497 and previous config saved to /var/cache/conftool/dbconfig/20240626-231020-marostegui.json |
[production] |
23:10 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1214.eqiad.wmnet with reason: Maintenance |
[production] |
23:10 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1214.eqiad.wmnet with reason: Maintenance |
[production] |
23:09 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1211 (T364069)', diff saved to https://phabricator.wikimedia.org/P65496 and previous config saved to /var/cache/conftool/dbconfig/20240626-230958-marostegui.json |
[production] |
22:54 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1211', diff saved to https://phabricator.wikimedia.org/P65495 and previous config saved to /var/cache/conftool/dbconfig/20240626-225451-marostegui.json |
[production] |
22:47 |
<brett@cumin2002> |
START - Cookbook sre.hosts.reimage for host cp5021.eqsin.wmnet with OS bullseye |
[production] |
22:41 |
<brett@puppetmaster1001> |
conftool action : set/pooled=no; selector: name=cp5021.eqsin.wmnet |
[production] |
22:39 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1211', diff saved to https://phabricator.wikimedia.org/P65494 and previous config saved to /var/cache/conftool/dbconfig/20240626-223944-marostegui.json |
[production] |