Article
Overview of GUA-SPA at IberLEF 2023: Guarani-Spanish Code Switching Analysis
Procesamiento del Lenguaje Natural
  • Luis Chiruzzo, Universidad de la República
  • Marvin Agüero-Torales, Universidad de Granada
  • Gustavo Giménez-Lugo, Universidade Tecnológica Federal do Paraná
  • Aldo Alvarez, Universidad Nacional de Itapúa
  • Yliana Rodríguez, Universidad de la República
  • Santiago Góngora, Universidad de la República
  • Thamar Solorio, University of Houston & Mohamed bin Zayed University of Artificial Intelligence
Document Type
Article
Abstract

We present GUA-SPA at IberLEF 2023, the first shared task on detecting and analyzing code-switching between Guarani and Spanish. The challenge consisted of three tasks: identifying the language of each token, named entity recognition (NER), and a novel task of classifying how a Spanish span is used in the code-switched context. We annotated a corpus of 1,500 texts extracted from news articles and tweets, around 25 thousand tokens, with the information needed for the three tasks. Three teams took part in the evaluation phase, obtaining generally good results for Task 1 and more mixed results for Tasks 2 and 3.

DOI
10.26342/2023-71-25
Publication Date
9-1-2023
Keywords
  • Code-switching
  • Guarani
  • NER
  • Spanish
Comments

IR conditions: non-described

Citation Information
Luis Chiruzzo, Marvin Agüero-Torales, Gustavo Giménez-Lugo, Aldo Alvarez, et al. "Overview of GUA-SPA at IberLEF 2023: Guarani-Spanish Code Switching Analysis" Procesamiento del Lenguaje Natural Iss. 71 (2023) pp. 321-328 ISSN: 1135-5948
Available at: http://works.bepress.com/thamar-solorio/1/