SkyNative introduces an encoder-free architecture using raw patch tokens and modality-specific parameters in a unified autoregressive model to improve image-grounded reasoning in remote sensing vision-language tasks.
Gpt-4o mini: advancing cost-efficient intelligence, 2024
3 Pith papers cite this work. Polarity classification is still indexing.
verdicts
UNVERDICTED 3representative citing papers
DeltaPrompts generates 200k synthetic high-divergence reasoning prompts to escape zero-delta saturation in multimodal distillation, yielding up to 15% relative gains on chart, document, and perception benchmarks across multiple settings.
ArgLLMs build argumentation frameworks from LLMs to support explainable and contestable formal reasoning for claim verification.
citing papers explorer
-
SkyNative: A Native Multimodal Framework for Remote Sensing Visual Evidence Reasoning
SkyNative introduces an encoder-free architecture using raw patch tokens and modality-specific parameters in a unified autoregressive model to improve image-grounded reasoning in remote sensing vision-language tasks.
-
DeltaPrompts: Escaping the Zero-Delta Trap in Multimodal Distillation
DeltaPrompts generates 200k synthetic high-divergence reasoning prompts to escape zero-delta saturation in multimodal distillation, yielding up to 15% relative gains on chart, document, and perception benchmarks across multiple settings.
-
Argumentative Large Language Models for Explainable and Contestable Claim Verification
ArgLLMs build argumentation frameworks from LLMs to support explainable and contestable formal reasoning for claim verification.