Llava-mole: Sparse mixture of lora experts for mitigating data con- flicts in instruction finetuning mllms

· 2024 · arXiv 2401.16160

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

read on arXiv browse 4 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

View-Aware Semantic Alignment for Aerial-Ground Person Re-Identification

cs.CV · 2026-05-18 · conditional · novelty 6.0

ViSA proposes expert-driven token generation and dual-branch local fusion modules for view-aware semantic alignment in AGPReID, reporting up to 10.06% mAP gains on the CARGO benchmark.

SMoES: Soft Modality-Guided Expert Specialization in MoE-VLMs

cs.CV · 2026-04-27 · unverdicted · novelty 6.0

SMoES improves MoE-VLM performance and efficiency via soft modality-guided expert routing and inter-bin mutual information regularization, yielding 0.9-4.2% task gains and 56% communication reduction.

Little by Little: Continual Learning via Incremental Mixture of Rank-1 Associative Memory Experts

cs.LG · 2025-06-26 · unverdicted · novelty 6.0

MoRAM frames continual learning as incremental addition of rank-1 adapters viewed as self-activating key-value associative memory units in a mixture-of-experts setup.

LoRA-Mixer: Coordinate Modular LoRA Experts Through Serial Attention Routing

cs.LG · 2025-06-17 · unverdicted · novelty 6.0

LoRA-Mixer routes modular LoRA experts into attention projection matrices with an adaptive Routing Specialization Loss to improve multi-task performance while using fewer trainable parameters than prior LoRA-MoE methods.

citing papers explorer

Showing 4 of 4 citing papers.

View-Aware Semantic Alignment for Aerial-Ground Person Re-Identification cs.CV · 2026-05-18 · conditional · none · ref 30
ViSA proposes expert-driven token generation and dual-branch local fusion modules for view-aware semantic alignment in AGPReID, reporting up to 10.06% mAP gains on the CARGO benchmark.
SMoES: Soft Modality-Guided Expert Specialization in MoE-VLMs cs.CV · 2026-04-27 · unverdicted · none · ref 6
SMoES improves MoE-VLM performance and efficiency via soft modality-guided expert routing and inter-bin mutual information regularization, yielding 0.9-4.2% task gains and 56% communication reduction.
Little by Little: Continual Learning via Incremental Mixture of Rank-1 Associative Memory Experts cs.LG · 2025-06-26 · unverdicted · none · ref 13
MoRAM frames continual learning as incremental addition of rank-1 adapters viewed as self-activating key-value associative memory units in a mixture-of-experts setup.
LoRA-Mixer: Coordinate Modular LoRA Experts Through Serial Attention Routing cs.LG · 2025-06-17 · unverdicted · none · ref 23
LoRA-Mixer routes modular LoRA experts into attention projection matrices with an adaptive Routing Specialization Loss to improve multi-task performance while using fewer trainable parameters than prior LoRA-MoE methods.

Llava-mole: Sparse mixture of lora experts for mitigating data con- flicts in instruction finetuning mllms

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer