2511.17402

PUCP-Metrix: A Comprehensive Open-Source Repository of Linguistic Metrics for Spanish

Authors: Javier Alonso Villegas Luis, Marco Antonio Sobrevilla Cabezudo

Linguistic features remain essential for interpretability and tasks involving style, structure, and readability, but existing Spanish tools offer limited coverage. We present PUCP-Metrix, an open-source repository of 182 linguistic metrics spanning lexical diversity, syntactic and semantic complexity, cohesion, psycholinguistics, and readability. PUCP-Metrix enables fine-grained, interpretable text analysis. We evaluate its usefulness on Automated Readability Assessment and Machine-Generated Text Detection, showing competitive performance compared to an existing repository and strong neural baselines. PUCP-Metrix offers a comprehensive, extensible resource for Spanish, supporting diverse NLP applications.

Subject: Computation and Language

Published: 2025-11-21 17:03:00 UTC