2024-03-07
§
|
23:21 |
<htriedman@deploy2002> |
Finished deploy [airflow-dags/platform_eng@00efab7]: (no justification provided) (duration: 00m 27s) |
[production] |
23:21 |
<htriedman@deploy2002> |
Started deploy [airflow-dags/platform_eng@00efab7]: (no justification provided) |
[production] |
22:49 |
<ejegg> |
donorwiki upgraded from bc49e5a6 to 9b31d4fe |
[production] |
22:47 |
<inflatador> |
bking@pcc-worker1006 deleted all dirs older than 22 Jan to free up space |
[production] |
22:23 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'db2156 (re)pooling @ 100%: Maint over', diff saved to https://phabricator.wikimedia.org/P58661 and previous config saved to /var/cache/conftool/dbconfig/20240307-222330-ladsgroup.json |
[production] |
22:17 |
<rzl@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5 days, 0:00:00 on db2124.codfw.wmnet with reason: index corruption |
[production] |
22:16 |
<rzl@cumin2002> |
START - Cookbook sre.hosts.downtime for 5 days, 0:00:00 on db2124.codfw.wmnet with reason: index corruption |
[production] |
22:10 |
<rzl@cumin2002> |
dbctl commit (dc=all): 'Depool db2124', diff saved to https://phabricator.wikimedia.org/P58659 and previous config saved to /var/cache/conftool/dbconfig/20240307-221056-rzl.json |
[production] |
22:08 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'db2156 (re)pooling @ 75%: Maint over', diff saved to https://phabricator.wikimedia.org/P58658 and previous config saved to /var/cache/conftool/dbconfig/20240307-220824-ladsgroup.json |
[production] |
21:53 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'db2156 (re)pooling @ 25%: Maint over', diff saved to https://phabricator.wikimedia.org/P58657 and previous config saved to /var/cache/conftool/dbconfig/20240307-215319-ladsgroup.json |
[production] |
21:38 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'db2156 (re)pooling @ 10%: Maint over', diff saved to https://phabricator.wikimedia.org/P58656 and previous config saved to /var/cache/conftool/dbconfig/20240307-213814-ladsgroup.json |
[production] |
21:19 |
<brennen@deploy2002> |
Finished scap: Backport for [[gerrit:1009337|Fixes: Less_Exception_Compiler (T359414 T357740)]] (duration: 14m 41s) |
[production] |
21:09 |
<brennen@deploy2002> |
brennen and jdlrobson: Continuing with sync |
[production] |
21:07 |
<brennen@deploy2002> |
brennen and jdlrobson: Backport for [[gerrit:1009337|Fixes: Less_Exception_Compiler (T359414 T357740)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
21:04 |
<brennen@deploy2002> |
Started scap: Backport for [[gerrit:1009337|Fixes: Less_Exception_Compiler (T359414 T357740)]] |
[production] |
20:50 |
<dancy@deploy2002> |
Finished deploy [cassandra/logstash-logback-encoder@c200e79]: (no justification provided) (duration: 00m 35s) |
[production] |
20:50 |
<dancy@deploy2002> |
Started deploy [cassandra/logstash-logback-encoder@c200e79]: (no justification provided) |
[production] |
20:49 |
<dancy@deploy2002> |
Finished deploy [cassandra/logstash-logback-encoder@162f72f]: (no justification provided) (duration: 00m 56s) |
[production] |
20:49 |
<dancy@deploy2002> |
Started deploy [cassandra/logstash-logback-encoder@162f72f]: (no justification provided) |
[production] |
18:49 |
<btullis> |
running a wikidata dump manually on snapshot1009 for partitions 25,27 |
[production] |
18:22 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 60 days, 0:00:00 on wdqs[1022-1025].eqiad.wmnet with reason: T337013 |
[production] |
18:22 |
<bking@cumin2002> |
START - Cookbook sre.hosts.downtime for 60 days, 0:00:00 on wdqs[1022-1025].eqiad.wmnet with reason: T337013 |
[production] |
18:19 |
<bearloga@deploy2002> |
Finished deploy [airflow-dags/analytics_product@15edf4a]: (no justification provided) (duration: 00m 08s) |
[production] |
18:19 |
<bearloga@deploy2002> |
Started deploy [airflow-dags/analytics_product@15edf4a]: (no justification provided) |
[production] |
17:43 |
<cwhite> |
set aside WAL for prometheus@k8s in codfw and restart - T354399 |
[production] |
17:28 |
<cwhite> |
set aside WAL for prometheus@k8s in eqiad and restart - T354399 |
[production] |
17:25 |
<dancy@deploy2002> |
Finished scap: testing T358117 (duration: 11m 15s) |
[production] |
17:22 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2217 (T352010)', diff saved to https://phabricator.wikimedia.org/P58654 and previous config saved to /var/cache/conftool/dbconfig/20240307-172227-ladsgroup.json |
[production] |
17:14 |
<dancy@deploy2002> |
Started scap: testing T358117 |
[production] |
17:07 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2217', diff saved to https://phabricator.wikimedia.org/P58653 and previous config saved to /var/cache/conftool/dbconfig/20240307-170720-ladsgroup.json |
[production] |
16:52 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2217', diff saved to https://phabricator.wikimedia.org/P58652 and previous config saved to /var/cache/conftool/dbconfig/20240307-165213-ladsgroup.json |
[production] |
16:48 |
<cgoubert@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/mw-parsoid: apply |
[production] |
16:47 |
<cgoubert@deploy2002> |
helmfile [codfw] START helmfile.d/services/mw-parsoid: apply |
[production] |
16:47 |
<cgoubert@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/mw-parsoid: apply |
[production] |
16:47 |
<cgoubert@deploy2002> |
helmfile [eqiad] START helmfile.d/services/mw-parsoid: apply |
[production] |
16:44 |
<dancy@deploy2002> |
Installation of scap version "4.70.0" completed for 373 hosts |
[production] |
16:43 |
<dancy@deploy2002> |
Installing scap version "4.70.0" for 373 hosts |
[production] |
16:38 |
<jhancock@cumin2002> |
START - Cookbook sre.hosts.reimage for host dbprov2006.codfw.wmnet with OS bullseye |
[production] |
16:38 |
<jhancock@cumin2002> |
START - Cookbook sre.hosts.reimage for host dbprov2005.codfw.wmnet with OS bullseye |
[production] |
16:37 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2217 (T352010)', diff saved to https://phabricator.wikimedia.org/P58651 and previous config saved to /var/cache/conftool/dbconfig/20240307-163706-ladsgroup.json |
[production] |
16:29 |
<cdanis> |
T343529 ✔ cdanis@prometheus2005.codfw.wmnet ~ 🕦☕sudo systemctl restart thanos-sidecar@k8s.service |
[production] |
16:20 |
<jnuche@deploy2002> |
rebuilt and synchronized wikiversions files: group2 wikis to 1.42.0-wmf.21 refs T354439 |
[production] |
16:19 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2112.codfw.wmnet with reason: Maintenance |
[production] |
16:19 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db2112.codfw.wmnet with reason: Maintenance |
[production] |
16:19 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1184.eqiad.wmnet with reason: Maintenance |
[production] |
16:19 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db1184.eqiad.wmnet with reason: Maintenance |
[production] |
16:18 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2165.codfw.wmnet with reason: Maintenance |
[production] |
16:18 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db2165.codfw.wmnet with reason: Maintenance |
[production] |
16:18 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1209.eqiad.wmnet with reason: Maintenance |
[production] |
16:18 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db1209.eqiad.wmnet with reason: Maintenance |
[production] |