2024-04-19
§
|
08:40 |
<isaranto@deploy1002> |
helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'experimental' for release 'main' . |
[production] |
08:17 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db1194.eqiad.wmnet with OS bookworm |
[production] |
07:55 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1194.eqiad.wmnet with reason: host reimage |
[production] |
07:52 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db1194.eqiad.wmnet with reason: host reimage |
[production] |
07:38 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.reimage for host db1194.eqiad.wmnet with OS bookworm |
[production] |
07:36 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool db1194', diff saved to https://phabricator.wikimedia.org/P61001 and previous config saved to /var/cache/conftool/dbconfig/20240419-073638-root.json |
[production] |
07:24 |
<moritzm> |
installing Linux 6.1.85 on Bookworm hosts |
[production] |
07:15 |
<moritzm> |
installing PHP 7.4 security updates on cloudweb and bullseye snapshot hosts |
[production] |
07:03 |
<moritzm> |
imported PHP 1:7.4.33-1+0~20221108.73+debian10~1.gbpa00350a+wmf11u2 to component/php74 (backport of latest PHP security fixes) |
[production] |
06:51 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1202 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P60999 and previous config saved to /var/cache/conftool/dbconfig/20240419-065142-root.json |
[production] |
06:38 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Depooling db1235 (T352010)', diff saved to https://phabricator.wikimedia.org/P60998 and previous config saved to /var/cache/conftool/dbconfig/20240419-063847-ladsgroup.json |
[production] |
06:38 |
<ladsgroup@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1235.eqiad.wmnet with reason: Maintenance |
[production] |
06:38 |
<ladsgroup@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1235.eqiad.wmnet with reason: Maintenance |
[production] |
06:38 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1234 (T352010)', diff saved to https://phabricator.wikimedia.org/P60997 and previous config saved to /var/cache/conftool/dbconfig/20240419-063825-ladsgroup.json |
[production] |
06:36 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1202 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P60996 and previous config saved to /var/cache/conftool/dbconfig/20240419-063636-root.json |
[production] |
06:23 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1234', diff saved to https://phabricator.wikimedia.org/P60995 and previous config saved to /var/cache/conftool/dbconfig/20240419-062317-ladsgroup.json |
[production] |
06:21 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1202 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P60994 and previous config saved to /var/cache/conftool/dbconfig/20240419-062130-root.json |
[production] |
06:08 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1234', diff saved to https://phabricator.wikimedia.org/P60993 and previous config saved to /var/cache/conftool/dbconfig/20240419-060810-ladsgroup.json |
[production] |
06:06 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1202 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P60992 and previous config saved to /var/cache/conftool/dbconfig/20240419-060625-root.json |
[production] |
05:53 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1234 (T352010)', diff saved to https://phabricator.wikimedia.org/P60991 and previous config saved to /var/cache/conftool/dbconfig/20240419-055303-ladsgroup.json |
[production] |
05:51 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1202 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P60990 and previous config saved to /var/cache/conftool/dbconfig/20240419-055118-root.json |
[production] |
05:36 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1202 (re)pooling @ 5%: Repooling', diff saved to https://phabricator.wikimedia.org/P60989 and previous config saved to /var/cache/conftool/dbconfig/20240419-053612-root.json |
[production] |
05:26 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db1202.eqiad.wmnet with OS bookworm |
[production] |
05:21 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1202 (re)pooling @ 1%: Repooling', diff saved to https://phabricator.wikimedia.org/P60988 and previous config saved to /var/cache/conftool/dbconfig/20240419-052107-root.json |
[production] |
05:06 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1202.eqiad.wmnet with reason: host reimage |
[production] |
05:04 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db1202.eqiad.wmnet with reason: host reimage |
[production] |
05:02 |
<marostegui> |
dbmaint Upgrade s7 eqiad to Bookworm and MariaDB 10.6 T362745 |
[production] |
05:02 |
<marostegui> |
dbmaint Upgrade s7 codfw to Bookworm and MariaDB 10.6 T362745 |
[production] |
04:50 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.reimage for host db1202.eqiad.wmnet with OS bookworm |
[production] |
04:49 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool db1202', diff saved to https://phabricator.wikimedia.org/P60987 and previous config saved to /var/cache/conftool/dbconfig/20240419-044906-root.json |
[production] |
04:49 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1173.eqiad.wmnet with reason: Maintenance |
[production] |
04:48 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 4:00:00 on db1173.eqiad.wmnet with reason: Maintenance |
[production] |
04:48 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2114.codfw.wmnet with reason: Maintenance |
[production] |
04:47 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 4:00:00 on db2114.codfw.wmnet with reason: Maintenance |
[production] |
2024-04-18
§
|
23:42 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Depooling db1177 (T352010)', diff saved to https://phabricator.wikimedia.org/P60986 and previous config saved to /var/cache/conftool/dbconfig/20240418-234247-ladsgroup.json |
[production] |
23:42 |
<ladsgroup@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1177.eqiad.wmnet with reason: Maintenance |
[production] |
23:42 |
<ladsgroup@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1177.eqiad.wmnet with reason: Maintenance |
[production] |
23:42 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1172 (T352010)', diff saved to https://phabricator.wikimedia.org/P60985 and previous config saved to /var/cache/conftool/dbconfig/20240418-234225-ladsgroup.json |
[production] |
23:27 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1172', diff saved to https://phabricator.wikimedia.org/P60984 and previous config saved to /var/cache/conftool/dbconfig/20240418-232717-ladsgroup.json |
[production] |
23:12 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1172', diff saved to https://phabricator.wikimedia.org/P60983 and previous config saved to /var/cache/conftool/dbconfig/20240418-231210-ladsgroup.json |
[production] |
23:06 |
<mutante> |
graphite - switched SSL cert provider from cergen to cfssl - restarted envoyproxy |
[production] |
22:57 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1172 (T352010)', diff saved to https://phabricator.wikimedia.org/P60982 and previous config saved to /var/cache/conftool/dbconfig/20240418-225702-ladsgroup.json |
[production] |
22:31 |
<bking@cumin2002> |
END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) (T362508, excessive lag) xfer wikidata from wdqs2022.codfw.wmnet -> wdqs2023.codfw.wmnet w/ force delete existing files, repooling both afterwards |
[production] |
21:34 |
<damilare> |
civicrm upgraded from 28adb4da to e95e03d9 |
[production] |
21:11 |
<bking@cumin2002> |
START - Cookbook sre.wdqs.data-transfer (T362508, excessive lag) xfer wikidata from wdqs2022.codfw.wmnet -> wdqs2023.codfw.wmnet w/ force delete existing files, repooling both afterwards |
[production] |
21:01 |
<cjming> |
end of UTC late backport window |
[production] |
21:00 |
<cjming@deploy1002> |
Finished scap: Backport for [[gerrit:1020939|Add templateeditor right to sysops in dawiki and fix typo in group name (T361461)]] (duration: 16m 24s) |
[production] |
20:48 |
<cjming@deploy1002> |
cjming and nmw03: Continuing with sync |
[production] |
20:46 |
<cjming@deploy1002> |
cjming and nmw03: Backport for [[gerrit:1020939|Add templateeditor right to sysops in dawiki and fix typo in group name (T361461)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
20:43 |
<cjming@deploy1002> |
Started scap: Backport for [[gerrit:1020939|Add templateeditor right to sysops in dawiki and fix typo in group name (T361461)]] |
[production] |