201-250 of 10000 results (44ms)
2024-03-29 §
09:21 <filippo@cumin2002> START - Cookbook sre.puppet.migrate-host for host alert1001.wikimedia.org [production]
09:18 <filippo@cumin2002> END (FAIL) - Cookbook sre.puppet.migrate-host (exit_code=99) for host alert1001.wikimedia.org [production]
09:18 <filippo@cumin2002> START - Cookbook sre.puppet.migrate-host for host alert1001.wikimedia.org [production]
08:36 <dcausse> repooling wdqs1013 (T360993) [production]
05:47 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling db1174 (T352010)', diff saved to https://phabricator.wikimedia.org/P59009 and previous config saved to /var/cache/conftool/dbconfig/20240329-054724-ladsgroup.json [production]
05:47 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1174.eqiad.wmnet with reason: Maintenance [production]
05:47 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1174.eqiad.wmnet with reason: Maintenance [production]
2024-03-28 §
23:57 <ejegg> donorwiki upgraded from c7f1325c to 5e39bdc5 [production]
23:56 <ejegg> payments-wiki upgraded from cca87e29 to 5e39bdc5 [production]
21:25 <tgr@deploy1002> Finished scap: Backport for [[gerrit:1015145|Enter deprecation trial for third-party cookie blocking (T359957)]], [[gerrit:1014634|Add CommunityConfiguration log channel (T361072)]] (duration: 19m 30s) [production]
21:14 <tgr@deploy1002> urbanecm and tgr: Continuing with sync [production]
21:08 <tgr@deploy1002> urbanecm and tgr: Backport for [[gerrit:1015145|Enter deprecation trial for third-party cookie blocking (T359957)]], [[gerrit:1014634|Add CommunityConfiguration log channel (T361072)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
21:06 <tgr@deploy1002> Started scap: Backport for [[gerrit:1015145|Enter deprecation trial for third-party cookie blocking (T359957)]], [[gerrit:1014634|Add CommunityConfiguration log channel (T361072)]] [production]
20:59 <inflatador> bking@mwmaint1002 sudo apt-get install ripgrep (faster recursive grep) [production]
20:58 <jhuneidi@deploy1002> rebuilt and synchronized wikiversions files: group2 wikis to 1.42.0-wmf.24 refs T360156 [production]
20:52 <vriley@cumin1002> START - Cookbook sre.hosts.reimage for host dbprov1005.eqiad.wmnet with OS bullseye [production]
20:22 <hashar@deploy1002> Finished deploy [integration/docroot@c89a404]: add CodeMirror to opensource.yaml - T359986 (duration: 00m 06s) [production]
20:22 <hashar@deploy1002> Started deploy [integration/docroot@c89a404]: add CodeMirror to opensource.yaml - T359986 [production]
20:18 <jhuneidi@deploy1002> Synchronized php: group1 wikis to 1.42.0-wmf.24 refs T360156 (duration: 12m 33s) [production]
20:07 <ryankemper@cumin2002> END (PASS) - Cookbook sre.elasticsearch.ban (exit_code=0) Banning hosts: elastic2090* for ban elastic2090 before reimage - ryankemper@cumin2002 - T353878 [production]
20:07 <ryankemper@cumin2002> START - Cookbook sre.elasticsearch.ban Banning hosts: elastic2090* for ban elastic2090 before reimage - ryankemper@cumin2002 - T353878 [production]
20:06 <ryankemper@cumin2002> END (PASS) - Cookbook sre.elasticsearch.ban (exit_code=0) Unbanning all hosts in search_codfw [production]
20:06 <ryankemper@cumin2002> START - Cookbook sre.elasticsearch.ban Unbanning all hosts in search_codfw [production]
20:06 <jhuneidi@deploy1002> rebuilt and synchronized wikiversions files: group1 wikis to 1.42.0-wmf.24 refs T360156 [production]
19:51 <pfischer@deploy1002> helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
19:51 <pfischer@deploy1002> helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply [production]
19:48 <ryankemper> T353878 Updated cross cluster remote seed conf with latest master info: `ryankemper@mwmaint1002:~/elastic$ python push_cross_cluster_conf.py https://search.svc.codfw.wmnet:9443/_cluster/settings --ccc chi=chi_codfw_masters.lst psi=psi_codfw_masters.lst omega=omega_codfw_masters.lst` [production]
19:45 <jhuneidi@deploy1002> rebuilt and synchronized wikiversions files: group0 wikis to 1.42.0-wmf.24 refs T360156 [production]
19:36 <pfischer@deploy1002> helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
19:35 <pfischer@deploy1002> helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply [production]
19:34 <damilare> civicrm upgraded from 2e0ac12f to ed776060 [production]
19:28 <jhuneidi@deploy1002> Finished scap: Backport for [[gerrit:1015204|objectcache: Restore default keyspace for LocalServerCache service (T358346 T361177)]] (duration: 34m 00s) [production]
19:16 <jhuneidi@deploy1002> tgr and jhuneidi: Continuing with sync [production]
19:14 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host dbprov1005.eqiad.wmnet with OS bullseye [production]
19:04 <mutante> CI (contint) - replacing envoy SSL cert (puppet CA -> cfssl) [production]
18:58 <vriley@cumin1002> START - Cookbook sre.hosts.reimage for host dbprov1005.eqiad.wmnet with OS bullseye [production]
18:56 <jhuneidi@deploy1002> tgr and jhuneidi: Backport for [[gerrit:1015204|objectcache: Restore default keyspace for LocalServerCache service (T358346 T361177)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
18:54 <jhuneidi@deploy1002> Started scap: Backport for [[gerrit:1015204|objectcache: Restore default keyspace for LocalServerCache service (T358346 T361177)]] [production]
18:52 <pfischer@deploy1002> helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
18:52 <pfischer@deploy1002> helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply [production]
18:51 <pfischer@deploy1002> helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
18:51 <pfischer@deploy1002> helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply [production]
18:48 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host dbprov1005.eqiad.wmnet with OS bullseye [production]
18:25 <vriley@cumin1002> START - Cookbook sre.hosts.reimage for host dbprov1005.eqiad.wmnet with OS bullseye [production]
18:11 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1171.eqiad.wmnet with reason: Maintenance [production]
18:11 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1171.eqiad.wmnet with reason: Maintenance [production]
17:39 <joal@deploy1002> Finished deploy [airflow-dags/analytics@f64680f]: Regular deploy of Analytics airflow dags [airflow-dags/analytics@f64680fc] (duration: 00m 27s) [production]
17:39 <joal@deploy1002> Started deploy [airflow-dags/analytics@f64680f]: Regular deploy of Analytics airflow dags [airflow-dags/analytics@f64680fc] [production]
17:27 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host dbprov1005.eqiad.wmnet with OS bullseye [production]
17:27 <ryankemper@cumin2002> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) for hosts elastic2037.codfw.wmnet [production]