ParsTwiNER: A Corpus for Named Entity Recognition at Informal Persian
2 Pith papers cite this work. Polarity classification is still indexing.
fields: cs.CL (2)
representative citing papers
Trains a 125M-parameter Persian PLM on a curated 45GB corpus using vector semantic deduplication for domain balance, topping QA and NLI benchmarks while remaining competitive on NER and classification.
citing papers explorer
- AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models
AGIEval shows GPT-4 exceeding average human scores on SAT Math at 95% and Chinese college entrance English at 92.5%, while revealing weaker results on complex reasoning tasks.
- IHUBERT: Vector-Based Semantic Deduplication and Domain-Balanced Pretraining for Persian Resources
Trains a 125M-parameter Persian PLM on a curated 45GB corpus using vector semantic deduplication for domain balance, topping QA and NLI benchmarks while remaining competitive on NER and classification.