2024-04-19
|
06:21 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1202 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P60994 and previous config saved to /var/cache/conftool/dbconfig/20240419-062130-root.json |
[production] |
06:08 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1234', diff saved to https://phabricator.wikimedia.org/P60993 and previous config saved to /var/cache/conftool/dbconfig/20240419-060810-ladsgroup.json |
[production] |
06:06 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1202 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P60992 and previous config saved to /var/cache/conftool/dbconfig/20240419-060625-root.json |
[production] |
05:53 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1234 (T352010)', diff saved to https://phabricator.wikimedia.org/P60991 and previous config saved to /var/cache/conftool/dbconfig/20240419-055303-ladsgroup.json |
[production] |
05:51 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1202 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P60990 and previous config saved to /var/cache/conftool/dbconfig/20240419-055118-root.json |
[production] |
05:36 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1202 (re)pooling @ 5%: Repooling', diff saved to https://phabricator.wikimedia.org/P60989 and previous config saved to /var/cache/conftool/dbconfig/20240419-053612-root.json |
[production] |
05:26 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db1202.eqiad.wmnet with OS bookworm |
[production] |
05:21 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1202 (re)pooling @ 1%: Repooling', diff saved to https://phabricator.wikimedia.org/P60988 and previous config saved to /var/cache/conftool/dbconfig/20240419-052107-root.json |
[production] |
05:06 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1202.eqiad.wmnet with reason: host reimage |
[production] |
05:04 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db1202.eqiad.wmnet with reason: host reimage |
[production] |
05:02 |
<marostegui> |
dbmaint Upgrade s7 eqiad to Bookworm and MariaDB 10.6 T362745 |
[production] |
05:02 |
<marostegui> |
dbmaint Upgrade s7 codfw to Bookworm and MariaDB 10.6 T362745 |
[production] |
04:50 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.reimage for host db1202.eqiad.wmnet with OS bookworm |
[production] |
04:49 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool db1202', diff saved to https://phabricator.wikimedia.org/P60987 and previous config saved to /var/cache/conftool/dbconfig/20240419-044906-root.json |
[production] |
04:49 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1173.eqiad.wmnet with reason: Maintenance |
[production] |
04:48 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 4:00:00 on db1173.eqiad.wmnet with reason: Maintenance |
[production] |
04:48 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2114.codfw.wmnet with reason: Maintenance |
[production] |
04:47 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 4:00:00 on db2114.codfw.wmnet with reason: Maintenance |
[production] |
2024-04-18
|
23:42 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Depooling db1177 (T352010)', diff saved to https://phabricator.wikimedia.org/P60986 and previous config saved to /var/cache/conftool/dbconfig/20240418-234247-ladsgroup.json |
[production] |
23:42 |
<ladsgroup@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1177.eqiad.wmnet with reason: Maintenance |
[production] |
23:42 |
<ladsgroup@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1177.eqiad.wmnet with reason: Maintenance |
[production] |
23:42 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1172 (T352010)', diff saved to https://phabricator.wikimedia.org/P60985 and previous config saved to /var/cache/conftool/dbconfig/20240418-234225-ladsgroup.json |
[production] |
23:27 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1172', diff saved to https://phabricator.wikimedia.org/P60984 and previous config saved to /var/cache/conftool/dbconfig/20240418-232717-ladsgroup.json |
[production] |
23:12 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1172', diff saved to https://phabricator.wikimedia.org/P60983 and previous config saved to /var/cache/conftool/dbconfig/20240418-231210-ladsgroup.json |
[production] |
23:06 |
<mutante> |
graphite - switched SSL cert provider from cergen to cfssl - restarted envoyproxy |
[production] |
22:57 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1172 (T352010)', diff saved to https://phabricator.wikimedia.org/P60982 and previous config saved to /var/cache/conftool/dbconfig/20240418-225702-ladsgroup.json |
[production] |
22:31 |
<bking@cumin2002> |
END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) (T362508, excessive lag) xfer wikidata from wdqs2022.codfw.wmnet -> wdqs2023.codfw.wmnet w/ force delete existing files, repooling both afterwards |
[production] |
21:34 |
<damilare> |
civicrm upgraded from 28adb4da to e95e03d9 |
[production] |
21:11 |
<bking@cumin2002> |
START - Cookbook sre.wdqs.data-transfer (T362508, excessive lag) xfer wikidata from wdqs2022.codfw.wmnet -> wdqs2023.codfw.wmnet w/ force delete existing files, repooling both afterwards |
[production] |
21:01 |
<cjming> |
end of UTC late backport window |
[production] |
21:00 |
<cjming@deploy1002> |
Finished scap: Backport for [[gerrit:1020939|Add templateeditor right to sysops in dawiki and fix typo in group name (T361461)]] (duration: 16m 24s) |
[production] |
20:48 |
<cjming@deploy1002> |
cjming and nmw03: Continuing with sync |
[production] |
20:46 |
<cjming@deploy1002> |
cjming and nmw03: Backport for [[gerrit:1020939|Add templateeditor right to sysops in dawiki and fix typo in group name (T361461)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
20:43 |
<cjming@deploy1002> |
Started scap: Backport for [[gerrit:1020939|Add templateeditor right to sysops in dawiki and fix typo in group name (T361461)]] |
[production] |
20:42 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on wdqs2023.codfw.wmnet with reason: T362508 |
[production] |
20:42 |
<bking@cumin2002> |
START - Cookbook sre.hosts.downtime for 1:00:00 on wdqs2023.codfw.wmnet with reason: T362508 |
[production] |
20:42 |
<cjming@deploy1002> |
Finished scap: Backport for [[gerrit:1021518|Temporarily restore wgMinervaApplyKnownTemplateHacks for cached HTML (T362747)]] (duration: 17m 14s) |
[production] |
20:32 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Depooling db1234 (T352010)', diff saved to https://phabricator.wikimedia.org/P60980 and previous config saved to /var/cache/conftool/dbconfig/20240418-203256-ladsgroup.json |
[production] |
20:32 |
<ladsgroup@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1234.eqiad.wmnet with reason: Maintenance |
[production] |
20:32 |
<ladsgroup@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1234.eqiad.wmnet with reason: Maintenance |
[production] |
20:32 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1232 (T352010)', diff saved to https://phabricator.wikimedia.org/P60979 and previous config saved to /var/cache/conftool/dbconfig/20240418-203234-ladsgroup.json |
[production] |
20:30 |
<cjming@deploy1002> |
jdlrobson and cjming: Continuing with sync |
[production] |
20:27 |
<cjming@deploy1002> |
jdlrobson and cjming: Backport for [[gerrit:1021518|Temporarily restore wgMinervaApplyKnownTemplateHacks for cached HTML (T362747)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
20:25 |
<cjming@deploy1002> |
Started scap: Backport for [[gerrit:1021518|Temporarily restore wgMinervaApplyKnownTemplateHacks for cached HTML (T362747)]] |
[production] |
20:23 |
<cjming@deploy1002> |
Finished scap: Backport for [[gerrit:1021510|Revert "WikimediaEvents: Set IPoid URL and enable ip_reputation/score (2nd attempt)"]], [[gerrit:1020942|Revert "ext-EventLogging: Add mediawiki.ip_reputation.score"]] (duration: 14m 56s) |
[production] |
20:17 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1232', diff saved to https://phabricator.wikimedia.org/P60978 and previous config saved to /var/cache/conftool/dbconfig/20240418-201727-ladsgroup.json |
[production] |
20:11 |
<cjming@deploy1002> |
cjming and phuedx: Continuing with sync |
[production] |
20:11 |
<cjming@deploy1002> |
cjming and phuedx: Backport for [[gerrit:1021510|Revert "WikimediaEvents: Set IPoid URL and enable ip_reputation/score (2nd attempt)"]], [[gerrit:1020942|Revert "ext-EventLogging: Add mediawiki.ip_reputation.score"]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
20:09 |
<cjming@deploy1002> |
Started scap: Backport for [[gerrit:1021510|Revert "WikimediaEvents: Set IPoid URL and enable ip_reputation/score (2nd attempt)"]], [[gerrit:1020942|Revert "ext-EventLogging: Add mediawiki.ip_reputation.score"]] |
[production] |
20:02 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1232', diff saved to https://phabricator.wikimedia.org/P60977 and previous config saved to /var/cache/conftool/dbconfig/20240418-200218-ladsgroup.json |
[production] |