SCAN is a framework for fine-grained LLM capability assessment via automatic taxonomy construction from queries, query synthesis for coverage, visualization tools, and a PC2-enhanced LLM-as-a-judge method, applied to 21 models showing intra-family variations.
How can I optimize a Python script that processes large datasets and visualizes the results?
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2025 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
SCAN: Structured Capability Assessment and Navigation for LLMs
SCAN is a framework for fine-grained LLM capability assessment via automatic taxonomy construction from queries, query synthesis for coverage, visualization tools, and a PC2-enhanced LLM-as-a-judge method, applied to 21 models showing intra-family variations.