ALEE generates AMR-based English minimal pairs with fine-grained semantic shifts, translates them, and evaluates embedding models on 275+ languages to expose cross-lingual gaps linked to training data and tokenization.
Computational Linguistics , volume =
5 Pith papers cite this work. Polarity classification is still indexing.
fields
cs.CL 5years
2026 5verdicts
UNVERDICTED 5representative citing papers
PluRule is a new multimodal multilingual benchmark showing that state-of-the-art vision-language models perform only marginally better than a trivial baseline at detecting specific rule violations in pluralistic online communities.
GiLT augments Transformers with semantic dependency graphs by modulating attention to improve syntactic generalization while keeping perplexity competitive and enabling better finetuning on downstream tasks.
Proposes treating Pāṇini's Astādhyāyī as a unifying computational architecture and benchmark foundation for Indic language NLP to improve accuracy, data efficiency, and transfer.
A symbolic system extracts events from 450 property crime reports, with 54.1% high-confidence outputs, 93.7% mapped via PropBank-VerbNet-WordNet, and 100% human agreement on incident initiation, stolen items, and temporal cues.
citing papers explorer
-
ALEE: Any-Language Evaluation of Embeddings via English-Centric Minimal Pairs
ALEE generates AMR-based English minimal pairs with fine-grained semantic shifts, translates them, and evaluates embedding models on 275+ languages to expose cross-lingual gaps linked to training data and tokenization.
-
PluRule: A Benchmark for Moderating Pluralistic Communities on Social Media
PluRule is a new multimodal multilingual benchmark showing that state-of-the-art vision-language models perform only marginally better than a trivial baseline at detecting specific rule violations in pluralistic online communities.
-
GiLT: Augmenting Transformer Language Models with Dependency Graphs
GiLT augments Transformers with semantic dependency graphs by modulating attention to improve syntactic generalization while keeping perplexity competitive and enabling better finetuning on downstream tasks.
-
A P\={a}ninian Foundation for Indic Language Processing
Proposes treating Pāṇini's Astādhyāyī as a unifying computational architecture and benchmark foundation for Indic language NLP to improve accuracy, data efficiency, and transfer.
-
Ontology for Policing: Conceptual Knowledge Learning for Semantic Understanding and Reasoning in Law Enforcement Reports
A symbolic system extracts events from 450 property crime reports, with 54.1% high-confidence outputs, 93.7% mapped via PropBank-VerbNet-WordNet, and 100% human agreement on incident initiation, stolen items, and temporal cues.