2023-12-19
18:43 <mforns@deploy2002> Finished deploy [analytics/refinery@28dccef] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@28dccefe] (duration: 03m 16s) [production]
18:39 <mforns@deploy2002> Started deploy [analytics/refinery@28dccef] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@28dccefe] [production]
18:39 <mforns@deploy2002> Finished deploy [analytics/refinery@28dccef] (thin): Regular analytics weekly train THIN [analytics/refinery@28dccefe] (duration: 00m 06s) [production]
18:39 <mforns@deploy2002> Started deploy [analytics/refinery@28dccef] (thin): Regular analytics weekly train THIN [analytics/refinery@28dccefe] [production]
18:39 <mforns@deploy2002> Finished deploy [analytics/refinery@28dccef]: Regular analytics weekly train [analytics/refinery@28dccefe] (duration: 09m 18s) [production]
18:29 <mforns@deploy2002> Started deploy [analytics/refinery@28dccef]: Regular analytics weekly train [analytics/refinery@28dccefe] [production]
18:29 <xcollazo@deploy2002> Finished deploy [airflow-dags/analytics@d275e4f]: Deploy latest DAG changes to Analytics Airflow instance (duration: 00m 31s) [production]
18:28 <xcollazo@deploy2002> Started deploy [airflow-dags/analytics@d275e4f]: Deploy latest DAG changes to Analytics Airflow instance [production]
18:25 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on testhost2001.codfw.wmnet with reason: host reimage [production]
18:22 <pt1979@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on testhost2001.codfw.wmnet with reason: host reimage [production]
18:07 <pt1979@cumin2002> START - Cookbook sre.hosts.reimage for host testhost2001.codfw.wmnet with OS bullseye [production]
18:06 <pt1979@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host testhost2001.codfw.wmnet with OS bullseye [production]
17:51 <pt1979@cumin2002> START - Cookbook sre.hosts.reimage for host testhost2001.codfw.wmnet with OS bullseye [production]
16:23 <aikochou@deploy2002> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'articletopic-outlink' for release 'main' . [production]
16:15 <aikochou@deploy2002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'articletopic-outlink' for release 'main' . [production]
16:12 <mvernon@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 14 days, 0:00:00 on moss-be[2001-2003].codfw.wmnet with reason: not in service, being used to test a destructive cookbook [production]
16:12 <mvernon@cumin2002> START - Cookbook sre.hosts.downtime for 14 days, 0:00:00 on moss-be[2001-2003].codfw.wmnet with reason: not in service, being used to test a destructive cookbook [production]
16:04 <ayounsi@cumin1001> END (FAIL) - Cookbook sre.network.peering (exit_code=99) with action 'email' for AS: 327700 [production]
16:04 <ayounsi@cumin1001> START - Cookbook sre.network.peering with action 'email' for AS: 327700 [production]
16:02 <ayounsi@cumin1001> END (FAIL) - Cookbook sre.network.peering (exit_code=99) with action 'email' for AS: 139901 [production]
16:00 <ayounsi@cumin1001> START - Cookbook sre.network.peering with action 'email' for AS: 139901 [production]
16:00 <ayounsi@cumin1001> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 15133 [production]
15:58 <ayounsi@cumin1001> START - Cookbook sre.network.peering with action 'email' for AS: 15133 [production]
15:55 <ayounsi@cumin1001> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 5398 [production]
15:55 <cgoubert@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on mw2448.codfw.wmnet with reason: hw failure [production]
15:55 <cgoubert@cumin2002> START - Cookbook sre.hosts.downtime for 7 days, 0:00:00 on mw2448.codfw.wmnet with reason: hw failure [production]
15:55 <ayounsi@cumin1001> START - Cookbook sre.network.peering with action 'email' for AS: 5398 [production]
15:42 <lucaswerkmeister-wmde@deploy2002> Finished scap: Backport for [[gerrit:983758|Change virtual domain of botpassword to plural (T351559)]] (duration: 07m 01s) [production]
15:38 <moritzm> installing gnutls28 security updates on bookworm [production]
15:37 <lucaswerkmeister-wmde@deploy2002> lucaswerkmeister-wmde and ladsgroup: Continuing with sync [production]
15:37 <lucaswerkmeister-wmde@deploy2002> lucaswerkmeister-wmde and ladsgroup: Backport for [[gerrit:983758|Change virtual domain of botpassword to plural (T351559)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
15:35 <lucaswerkmeister-wmde@deploy2002> Started scap: Backport for [[gerrit:983758|Change virtual domain of botpassword to plural (T351559)]] [production]
15:33 <lucaswerkmeister-wmde@deploy2002> Finished scap: Backport for [[gerrit:984174|Use main replica DB in importExistingFilesToScanTable.php]] (duration: 07m 47s) [production]
15:27 <lucaswerkmeister-wmde@deploy2002> kharlan and lucaswerkmeister-wmde: Continuing with sync [production]
15:27 <lucaswerkmeister-wmde@deploy2002> kharlan and lucaswerkmeister-wmde: Backport for [[gerrit:984174|Use main replica DB in importExistingFilesToScanTable.php]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
15:25 <lucaswerkmeister-wmde@deploy2002> Started scap: Backport for [[gerrit:984174|Use main replica DB in importExistingFilesToScanTable.php]] [production]
15:23 <taavi@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 30 days, 0:00:00 on cloudvirt1063.eqiad.wmnet with reason: host is down, downtiming in icinga too [production]
15:23 <taavi@cumin1001> START - Cookbook sre.hosts.downtime for 30 days, 0:00:00 on cloudvirt1063.eqiad.wmnet with reason: host is down, downtiming in icinga too [production]
15:22 <lucaswerkmeister-wmde@deploy2002> Finished scap: Backport for [[gerrit:984172|Make SearchEntitiesIntegrationTest an ApiTestCase (T353334)]], [[gerrit:984173|Use link batch in search APIs (T353334)]] (duration: 08m 49s) [production]
15:16 <lucaswerkmeister-wmde@deploy2002> lucaswerkmeister-wmde: Continuing with sync [production]
15:15 <moritzm> installing exim4 bugfix updates from Bookworm point release [production]
15:15 <lucaswerkmeister-wmde@deploy2002> lucaswerkmeister-wmde: Backport for [[gerrit:984172|Make SearchEntitiesIntegrationTest an ApiTestCase (T353334)]], [[gerrit:984173|Use link batch in search APIs (T353334)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
15:13 <lucaswerkmeister-wmde@deploy2002> Started scap: Backport for [[gerrit:984172|Make SearchEntitiesIntegrationTest an ApiTestCase (T353334)]], [[gerrit:984173|Use link batch in search APIs (T353334)]] [production]
15:10 <moritzm> installing nagios-plugins-contrib bugfix updates from Bookworm point release [production]
14:44 <hnowlan@deploy2002> helmfile [eqiad] DONE helmfile.d/services/changeprop-jobqueue: apply [production]
14:43 <hnowlan@deploy2002> helmfile [eqiad] START helmfile.d/services/changeprop-jobqueue: apply [production]
14:43 <hnowlan@deploy2002> helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: apply [production]
14:42 <hnowlan@deploy2002> helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: apply [production]
14:33 <jgiannelos@deploy2002> helmfile [eqiad] DONE helmfile.d/services/proton: apply [production]
14:32 <jgiannelos@deploy2002> helmfile [eqiad] START helmfile.d/services/proton: apply [production]