8–12 Oct 2024
Hotel Croatia
Europe/Warsaw timezone

Sustainability and Lexicography: Evaluating the Methodological Approach for the Development of an Automatic Multilingual Semantic Tagger

9 Oct 2024, 17:00
1h 30m
Tihi salon (Hotel Croatia)

Speakers

Iván Arias Arias, María José Domínguez Vázquez, Carlos Valcárcel Riveiro

Description

Understanding the semantic value of linguistic utterances is crucial for linguistics, lexicography, automatic text interpretation, and various NLP tasks. To capture subtle semantic distinctions, machines retrieve stored data from corpora, lexicons, and terminologies, and rely on taggers and rule-based systems. Tools for developing new lexical resources already exist: alongside dictionaries, which are excellent repositories of information, corpus managers make it possible to retrieve and statistically measure the distributional properties of vocabulary and to encode its syntagmatic properties. However, these tools do not always allow semantic, formal, and statistical data on relational and/or categorial meaning to be searched simultaneously, since most of them lack semantic annotation. To address this gap, we launched the research project ESMASES+ in September 2023. Its main goal is to create an automatic, sustainable, and multilingual semantic annotator. The tagger is designed to automatically delineate the ontological meaning of nouns in Spanish, French, Galician, and German, and it draws on lexical data from previous projects.
