Curriculum Learning for large language models in low-resource languages

Large language models (LLMs) are at the core of the current AI revolution and have laid the groundwork for tremendous advancements in Natural Language Processing. Building LLMs requires huge amounts of data, which are not available for low-resource languages. As a result, LLMs shine in high-resource languages like English but lag behind in many others, especially those where training resources are scarce, including many regional languages in Europe. The data scarcity problem is usually alleviated by augmenting the training corpora in the target

IGARRITZ: an adapted web environment for Basque text prediction

Students with limited motor skills, for example those with limitations caused by cerebral palsy, typically use adapted tools to write text, such as eye-tracking hardware that lets them select letters on the computer and choose among the words the system predicts. These systems usually include resources for writing in Basque, or offer the option of selecting predictions from Basque word lists loaded into them. The goal of any text prediction is to reduce the effort of writing, and to make it possible to write longer texts faster.
