KeTa

KeTa

Estonian Language Data Infrastructure (Keeleandmete Teadustaristu – KeTa) is a research infrastructure that supports R&D activities that use language data. It provides services for collecting, preserving, making accessible, and reusing Estonian language data both as datasets and through various digital tools. KeTa’s mission is to offer a comprehensive infrastructure and services that meet the demands and challenges of the fields of linguistics and language technology. KeTa brings together the resources of Estonian universities, research and development institutions, and other organizations related to linguistics to support and promote the use and development of language technology. KeTa plays a crucial role in aggregating high-quality language datasets for the development and evaluation of large language models. KeTa is a member of CLARIN ERIC (Common Language Resources and Technology Infrastructure of European Research Infrastructure Consortium) network.