2024-01-31
§
|
06:35 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.reimage for host db2142.codfw.wmnet with OS bookworm |
[production] |
06:28 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2107', diff saved to https://phabricator.wikimedia.org/P55910 and previous config saved to /var/cache/conftool/dbconfig/20240131-062846-marostegui.json |
[production] |
06:22 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.reimage for host db2114.codfw.wmnet with OS bookworm |
[production] |
06:21 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1224 (re)pooling @ 10%: After onsite maintenance', diff saved to https://phabricator.wikimedia.org/P55909 and previous config saved to /var/cache/conftool/dbconfig/20240131-062109-root.json |
[production] |
06:19 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool db2114 T354506', diff saved to https://phabricator.wikimedia.org/P55908 and previous config saved to /var/cache/conftool/dbconfig/20240131-061932-root.json |
[production] |
06:13 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2107 (T355609)', diff saved to https://phabricator.wikimedia.org/P55907 and previous config saved to /var/cache/conftool/dbconfig/20240131-061340-marostegui.json |
[production] |
06:06 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1224 (re)pooling @ 5%: After onsite maintenance', diff saved to https://phabricator.wikimedia.org/P55906 and previous config saved to /var/cache/conftool/dbconfig/20240131-060602-root.json |
[production] |
06:03 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db2107 (T355609)', diff saved to https://phabricator.wikimedia.org/P55905 and previous config saved to /var/cache/conftool/dbconfig/20240131-060337-marostegui.json |
[production] |
06:03 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2107.codfw.wmnet with reason: Maintenance |
[production] |
06:03 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 6:00:00 on db2107.codfw.wmnet with reason: Maintenance |
[production] |
05:53 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2097.codfw.wmnet with reason: Maintenance |
[production] |
05:53 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 6:00:00 on db2097.codfw.wmnet with reason: Maintenance |
[production] |
05:50 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1224 (re)pooling @ 1%: After onsite maintenance', diff saved to https://phabricator.wikimedia.org/P55904 and previous config saved to /var/cache/conftool/dbconfig/20240131-055057-root.json |
[production] |
05:41 |
<eileen> |
civicrm upgraded from 6de61520 to 520337a0 |
[production] |
05:30 |
<fab@deploy2002> |
Finished deploy [airflow-dags/research@97c6a4e]: (no justification provided) (duration: 00m 14s) |
[production] |
05:30 |
<fab@deploy2002> |
Started deploy [airflow-dags/research@97c6a4e]: (no justification provided) |
[production] |
03:29 |
<eileen> |
tools upgraded from 02281338 to c823e692 |
[production] |
03:05 |
<fab@deploy2002> |
Finished deploy [airflow-dags/research@6a97a34]: (no justification provided) (duration: 00m 23s) |
[production] |
03:05 |
<fab@deploy2002> |
Started deploy [airflow-dags/research@6a97a34]: (no justification provided) |
[production] |
2024-01-30
§
|
23:54 |
<mutante> |
LDAP - added aklapper to group releng T356043 |
[production] |
23:07 |
<eevans@cumin1002> |
END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for sessionstore1006.eqiad.wmnet |
[production] |
23:07 |
<eevans@cumin1002> |
START - Cookbook sre.hosts.remove-downtime for sessionstore1006.eqiad.wmnet |
[production] |
22:49 |
<eevans@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 30 days, 0:00:00 on sessionstore1006.eqiad.wmnet with reason: Bootstrapping — T353402 |
[production] |
22:48 |
<eevans@cumin1002> |
START - Cookbook sre.hosts.downtime for 30 days, 0:00:00 on sessionstore1006.eqiad.wmnet with reason: Bootstrapping — T353402 |
[production] |
22:40 |
<bking@cumin2002> |
END (FAIL) - Cookbook sre.elasticsearch.rolling-operation (exit_code=99) Operation.RESTART (1 nodes at a time) for ElasticSearch cluster cloudelastic: activate first private IP host config - bking@cumin2002 - T355617 |
[production] |
22:20 |
<eevans@cumin1002> |
END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for sessionstore1005.eqiad.wmnet |
[production] |
22:20 |
<eevans@cumin1002> |
START - Cookbook sre.hosts.remove-downtime for sessionstore1005.eqiad.wmnet |
[production] |
22:10 |
<cjming> |
end of UTC late backport window |
[production] |
22:09 |
<cjming@deploy2002> |
Finished scap: Backport for [[gerrit:994254|[eswiki] Add 13 namespaces to $wgExemptFromUserRobotsControl (T355033)]] (duration: 08m 24s) |
[production] |
22:02 |
<cjming@deploy2002> |
cjming and superpes: Continuing with sync |
[production] |
22:02 |
<cjming@deploy2002> |
cjming and superpes: Backport for [[gerrit:994254|[eswiki] Add 13 namespaces to $wgExemptFromUserRobotsControl (T355033)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
22:00 |
<cjming@deploy2002> |
Started scap: Backport for [[gerrit:994254|[eswiki] Add 13 namespaces to $wgExemptFromUserRobotsControl (T355033)]] |
[production] |
21:59 |
<cjming@deploy2002> |
Finished scap: Backport for [[gerrit:994211|[ukwiki] Change autoconfirmed setting (T355972)]], [[gerrit:994214|[ganwiki] Add 'suppressredirect' to transwiki usergroup and change assignment and revocation methods (T354850)]], [[gerrit:994220|[ganwiki] Add new namespace aliases (T355854)]] (duration: 09m 32s) |
[production] |
21:53 |
<cjming@deploy2002> |
superpes and cjming: Continuing with sync |
[production] |
21:51 |
<cjming@deploy2002> |
superpes and cjming: Backport for [[gerrit:994211|[ukwiki] Change autoconfirmed setting (T355972)]], [[gerrit:994214|[ganwiki] Add 'suppressredirect' to transwiki usergroup and change assignment and revocation methods (T354850)]], [[gerrit:994220|[ganwiki] Add new namespace aliases (T355854)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
21:50 |
<eevans@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 30 days, 0:00:00 on sessionstore1005.eqiad.wmnet with reason: Bootstrapping — T353402 |
[production] |
21:50 |
<eevans@cumin1002> |
START - Cookbook sre.hosts.downtime for 30 days, 0:00:00 on sessionstore1005.eqiad.wmnet with reason: Bootstrapping — T353402 |
[production] |
21:49 |
<cjming@deploy2002> |
Started scap: Backport for [[gerrit:994211|[ukwiki] Change autoconfirmed setting (T355972)]], [[gerrit:994214|[ganwiki] Add 'suppressredirect' to transwiki usergroup and change assignment and revocation methods (T354850)]], [[gerrit:994220|[ganwiki] Add new namespace aliases (T355854)]] |
[production] |
21:44 |
<cjming@deploy2002> |
Finished scap: Backport for [[gerrit:994143|Run CheckerJob against read-only clusters (T354793)]] (duration: 07m 41s) |
[production] |
21:42 |
<mutante> |
LDAP - added jnuche to group releng (T356043) - already done/approved in the past in T301149 |
[production] |
21:41 |
<mutante> |
LDAP - added jhuneidi to group releng (T356043) - already done/approved in the past in T210028 |
[production] |
21:40 |
<mutante> |
LDAP - added brennen to group releng (T356043) - already done/approved in the past in T215365 |
[production] |
21:38 |
<cjming@deploy2002> |
cjming and ebernhardson: Continuing with sync |
[production] |
21:38 |
<cjming@deploy2002> |
cjming and ebernhardson: Backport for [[gerrit:994143|Run CheckerJob against read-only clusters (T354793)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
21:37 |
<cjming@deploy2002> |
Started scap: Backport for [[gerrit:994143|Run CheckerJob against read-only clusters (T354793)]] |
[production] |
21:36 |
<cjming@deploy2002> |
Finished scap: Backport for [[gerrit:994142|Run CheckerJob against read-only clusters (T354793)]] (duration: 07m 49s) |
[production] |
21:34 |
<bking@cumin2002> |
START - Cookbook sre.elasticsearch.rolling-operation Operation.RESTART (1 nodes at a time) for ElasticSearch cluster cloudelastic: activate first private IP host config - bking@cumin2002 - T355617 |
[production] |
21:30 |
<cjming@deploy2002> |
ebernhardson and cjming: Continuing with sync |
[production] |
21:30 |
<cjming@deploy2002> |
ebernhardson and cjming: Backport for [[gerrit:994142|Run CheckerJob against read-only clusters (T354793)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
21:28 |
<cjming@deploy2002> |
Started scap: Backport for [[gerrit:994142|Run CheckerJob against read-only clusters (T354793)]] |
[production] |