Euralex 2024

Name: Euralex 2024
Start: 2024-10-08T08:30:00+02:00
End: 2024-10-12T19:00:00+02:00
Location: Hotel Croatia

8–12 Oct 2024

Hotel Croatia

Europe/Warsaw timezone

The Automatic Determination of Translation Equivalents in Lexicography: What Works and What Doesn't?

12 Oct 2024, 09:00

30m

Ragusa Hall (Hotel Croatia)

Ragusa Hall

Hotel Croatia

Paralel Sessions

Michaela Denisová Gilles-Maurice de Schryver Pavel Rychlý

Cross-lingual embedding models act as facilitator of lexical knowledge transfer and offer many advantages, notably their applicability to low-resource and nonstandard language pairs, making them a valuable tool for retrieving translation equivalents in lexicography. Despite their potential, these models have primarily been developed with a focus on Natural Language Processing (NLP), leading to significant issues, including flawed training and evaluation data, as well as inadequate evaluation metrics and procedures. In this paper, we introduce cross-lingual embedding models for lexicography, addressing the challenges and limitations inherent in the current NLP-focused research. We demonstrate the problematic aspects across three baseline cross-lingual embedding models and three language pairs and outline possible solutions. We show the importance of high-quality data, advocating that its role is vital compared to algorithmic optimisation in enhancing the effectiveness of these models.

Michaela Denisová Gilles-Maurice de Schryver Pavel Rychlý

There are no materials yet.

Euralex 2024

The Automatic Determination of Translation Equivalents in Lexicography: What Works and What Doesn't?

Ragusa Hall

Hotel Croatia

Speakers

Description

Co-authors

Presentation materials

Choose timezone

Euralex 2024

Speakers

Description

Co-authors

Presentation materials