Ana gezinime geç Aramaya geç Ana içeriğe geç

CorefInst: Leveraging LLMs for Multilingual Coreference Resolution

  • NLP Research Group
  • Istanbul Technical University

Araştırma sonucu: Dergiye katkıMakalebilirkişi

Özet

Coreference Resolution (CR) is a crucial yet challenging task in natural language understanding, often constrained by task-specific architectures and encoder-based language models that demand extensive training and lack adaptability. This study introduces the first multilingual CR methodology which leverages decoder-only LLMs to handle both overt and zero mentions. The article explores how to model the CR task for LLMs via five different instruction sets using a controlled inference method. The approach is evaluated across three LLMs: Llama 3.1, Gemma 2, and Mistral 0.3. The results indicate that LLMs, when instruction-tuned with a suitable instruction set, can surpass state-of-the-art task-specific architectures. Specifically, our best model, a fully fine-tuned Llama 3.1 for multilingual CR, outperforms the leading multilingual CR model (i.e., Corpipe 24 single stage variant) by 2 percentage points on average across all languages in the CorefUD v1.2 dataset collection.

Orijinal dilİngilizce
Sayfa (başlangıç-bitiş)64-80
Sayfa sayısı17
DergiTransactions of the Association for Computational Linguistics
Hacim14
DOI'lar
Yayın durumuYayınlandı - 2026

Bibliyografik not

Publisher Copyright:
© 2026 Association for Computational Linguistics. This is an open-access article distributed under the terms of the https://creativecommons.org/licenses/by/4.0/, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. For a full description of the license, please visit https://creativecommons.org/licenses/by/4.0/legalcode.

Parmak izi

CorefInst: Leveraging LLMs for Multilingual Coreference Resolution' araştırma başlıklarına git. Birlikte benzersiz bir parmak izi oluştururlar.

Alıntı Yap