Natural Language Decompositions of Implicit Content Enable Better Text Representations
2 Pith papers cite this work. Polarity classification is still indexing.
fields: cs.CL (2)
years: 2026 (2)
verdicts: UNVERDICTED (2)
representative citing papers
Scene Abstraction framework builds structured scene representations for lexical meaning via LLM prompting, with COCA-Scenes dataset and human experiments showing 82.4% identification accuracy and 86.4% preference over ATOMIC baselines.
citing papers explorer
- ALEE: Any-Language Evaluation of Embeddings via English-Centric Minimal Pairs
  ALEE generates AMR-based English minimal pairs with fine-grained semantic shifts, translates them, and evaluates embedding models on 275+ languages to expose cross-lingual gaps linked to training data and tokenization.
- Scene Abstraction for Lexical Semantics: Structured Representations of Situated Meaning
  Scene Abstraction framework builds structured scene representations for lexical meaning via LLM prompting, with COCA-Scenes dataset and human experiments showing 82.4% identification accuracy and 86.4% preference over ATOMIC baselines.