Let Guidelines Guide You: A Prescriptive Guideline-Centered Data Annotation Methodology
Mar 2, 2026·,,,,,·
1 min read
Federico Ruggeri
Eleonora Misino
Arianna Muti
Katerina Korre
Alberto Barrón-Cedeño
Paolo Torroni
Abstract
We introduce Guideline-Centered Annotation Methodology (GCAM), a novel data methodology designed to report the annotation guidelines associated with each data instance. GCAM addresses four key limitations of the standard application of the prescriptive annotation methodology by reducing the information loss during annotation, ensuring adherence to guidelines, and enabling the efficient reuse of annotated data across multiple tasks that rely on the same guidelines. We evaluate GCAM with a focus on text classification tasks through (i) a human annotation study and (ii) an experimental evaluation with several machine learning models. guaranteeing a transparent evaluation of the successful application of the prescriptive paradigm and enabling a fine-grained model error analysis.
Type
Publication
Transactions of the Association for Computational Linguistics (TACL)
Add the full text or supplementary notes for the publication here using Markdown formatting.