1-50 of 10000 results (64ms)
2024-03-07 §
23:21 <htriedman@deploy2002> Finished deploy [airflow-dags/platform_eng@00efab7]: (no justification provided) (duration: 00m 27s) [production]
23:21 <htriedman@deploy2002> Started deploy [airflow-dags/platform_eng@00efab7]: (no justification provided) [production]
22:49 <ejegg> donorwiki upgraded from bc49e5a6 to 9b31d4fe [production]
22:47 <inflatador> bking@pcc-worker1006 deleted all dirs older than 22 Jan to free up space [production]
22:23 <ladsgroup@cumin1002> dbctl commit (dc=all): 'db2156 (re)pooling @ 100%: Maint over', diff saved to https://phabricator.wikimedia.org/P58661 and previous config saved to /var/cache/conftool/dbconfig/20240307-222330-ladsgroup.json [production]
22:17 <rzl@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5 days, 0:00:00 on db2124.codfw.wmnet with reason: index corruption [production]
22:16 <rzl@cumin2002> START - Cookbook sre.hosts.downtime for 5 days, 0:00:00 on db2124.codfw.wmnet with reason: index corruption [production]
22:10 <rzl@cumin2002> dbctl commit (dc=all): 'Depool db2124', diff saved to https://phabricator.wikimedia.org/P58659 and previous config saved to /var/cache/conftool/dbconfig/20240307-221056-rzl.json [production]
22:08 <ladsgroup@cumin1002> dbctl commit (dc=all): 'db2156 (re)pooling @ 75%: Maint over', diff saved to https://phabricator.wikimedia.org/P58658 and previous config saved to /var/cache/conftool/dbconfig/20240307-220824-ladsgroup.json [production]
21:53 <ladsgroup@cumin1002> dbctl commit (dc=all): 'db2156 (re)pooling @ 25%: Maint over', diff saved to https://phabricator.wikimedia.org/P58657 and previous config saved to /var/cache/conftool/dbconfig/20240307-215319-ladsgroup.json [production]
21:38 <ladsgroup@cumin1002> dbctl commit (dc=all): 'db2156 (re)pooling @ 10%: Maint over', diff saved to https://phabricator.wikimedia.org/P58656 and previous config saved to /var/cache/conftool/dbconfig/20240307-213814-ladsgroup.json [production]
21:19 <brennen@deploy2002> Finished scap: Backport for [[gerrit:1009337|Fixes: Less_Exception_Compiler (T359414 T357740)]] (duration: 14m 41s) [production]
21:09 <brennen@deploy2002> brennen and jdlrobson: Continuing with sync [production]
21:07 <brennen@deploy2002> brennen and jdlrobson: Backport for [[gerrit:1009337|Fixes: Less_Exception_Compiler (T359414 T357740)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
21:04 <brennen@deploy2002> Started scap: Backport for [[gerrit:1009337|Fixes: Less_Exception_Compiler (T359414 T357740)]] [production]
20:50 <dancy@deploy2002> Finished deploy [cassandra/logstash-logback-encoder@c200e79]: (no justification provided) (duration: 00m 35s) [production]
20:50 <dancy@deploy2002> Started deploy [cassandra/logstash-logback-encoder@c200e79]: (no justification provided) [production]
20:49 <dancy@deploy2002> Finished deploy [cassandra/logstash-logback-encoder@162f72f]: (no justification provided) (duration: 00m 56s) [production]
20:49 <dancy@deploy2002> Started deploy [cassandra/logstash-logback-encoder@162f72f]: (no justification provided) [production]
18:49 <btullis> running a wikidata dump manually on snapshot1009 for partitions 25,27 [production]
18:22 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 60 days, 0:00:00 on wdqs[1022-1025].eqiad.wmnet with reason: T337013 [production]
18:22 <bking@cumin2002> START - Cookbook sre.hosts.downtime for 60 days, 0:00:00 on wdqs[1022-1025].eqiad.wmnet with reason: T337013 [production]
18:19 <bearloga@deploy2002> Finished deploy [airflow-dags/analytics_product@15edf4a]: (no justification provided) (duration: 00m 08s) [production]
18:19 <bearloga@deploy2002> Started deploy [airflow-dags/analytics_product@15edf4a]: (no justification provided) [production]
17:43 <cwhite> set aside WAL for prometheus@k8s in codfw and restart - T354399 [production]
17:28 <cwhite> set aside WAL for prometheus@k8s in eqiad and restart - T354399 [production]
17:25 <dancy@deploy2002> Finished scap: testing T358117 (duration: 11m 15s) [production]
17:22 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2217 (T352010)', diff saved to https://phabricator.wikimedia.org/P58654 and previous config saved to /var/cache/conftool/dbconfig/20240307-172227-ladsgroup.json [production]
17:14 <dancy@deploy2002> Started scap: testing T358117 [production]
17:07 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2217', diff saved to https://phabricator.wikimedia.org/P58653 and previous config saved to /var/cache/conftool/dbconfig/20240307-170720-ladsgroup.json [production]
16:52 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2217', diff saved to https://phabricator.wikimedia.org/P58652 and previous config saved to /var/cache/conftool/dbconfig/20240307-165213-ladsgroup.json [production]
16:48 <cgoubert@deploy2002> helmfile [codfw] DONE helmfile.d/services/mw-parsoid: apply [production]
16:47 <cgoubert@deploy2002> helmfile [codfw] START helmfile.d/services/mw-parsoid: apply [production]
16:47 <cgoubert@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-parsoid: apply [production]
16:47 <cgoubert@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-parsoid: apply [production]
16:44 <dancy@deploy2002> Installation of scap version "4.70.0" completed for 373 hosts [production]
16:43 <dancy@deploy2002> Installing scap version "4.70.0" for 373 hosts [production]
16:38 <jhancock@cumin2002> START - Cookbook sre.hosts.reimage for host dbprov2006.codfw.wmnet with OS bullseye [production]
16:38 <jhancock@cumin2002> START - Cookbook sre.hosts.reimage for host dbprov2005.codfw.wmnet with OS bullseye [production]
16:37 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2217 (T352010)', diff saved to https://phabricator.wikimedia.org/P58651 and previous config saved to /var/cache/conftool/dbconfig/20240307-163706-ladsgroup.json [production]
16:29 <cdanis> T343529 ✔ cdanis@prometheus2005.codfw.wmnet ~ 🕦☕sudo systemctl restart thanos-sidecar@k8s.service [production]
16:20 <jnuche@deploy2002> rebuilt and synchronized wikiversions files: group2 wikis to 1.42.0-wmf.21 refs T354439 [production]
16:19 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2112.codfw.wmnet with reason: Maintenance [production]
16:19 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on db2112.codfw.wmnet with reason: Maintenance [production]
16:19 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1184.eqiad.wmnet with reason: Maintenance [production]
16:19 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on db1184.eqiad.wmnet with reason: Maintenance [production]
16:18 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2165.codfw.wmnet with reason: Maintenance [production]
16:18 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on db2165.codfw.wmnet with reason: Maintenance [production]
16:18 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1209.eqiad.wmnet with reason: Maintenance [production]
16:18 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on db1209.eqiad.wmnet with reason: Maintenance [production]