Residual Paving decomposes selective refusal editing into an early-layer router for intervention decisions and later-layer residual experts for edits, with oracle routing showing that learned route selectivity is the primary bottleneck across six backbones.
Title resolution pending
10 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 10roles
method 2polarities
use method 2representative citing papers
Optimistic bilevel optimization with manifold lower-level minimizers is differentiable if the optimistic selection is unique, yielding a pseudoinverse hyper-gradient and a convergent HG-MS algorithm whose rate depends on intrinsic manifold dimension.
Symmetric Nucleus Subsampling and Expert Embedding Engine reduce modality gaps in multimodal embeddings by over 90% and outperform baselines in data curation for downstream models.
SeqLoRA applies bilevel optimization to sequential LoRA adaptation for continual multi-concept text-to-image generation with theoretical bounds on forgetting and interference.
NeuroMAS reframes multi-agent language systems as neural architectures where LLM agents learn coordination via reinforcement learning rather than predefined roles.
StaR-MoE adds sensitivity-aware routing alignment and asymmetric capacity regularization to expandable MoE architectures for class-incremental learning, reducing interference from routing drift and improving average and last-task accuracy on four benchmarks.
GRPO with reference-free rewards improves NLLB-200 translation quality on 13 languages up to +5.03 chrF++, competing with supervised fine-tuning on complex languages without target data.
Adapted MelBERT MIP-only reaches 0.7281 F1 on Chinese token-level metaphor detection, outperforming RoBERTa and Qwen QLoRA, with all artifacts released for reproducibility.
IConFace performs unified reference-aware and no-reference blind face restoration by asymmetrically conditioning identity from references and structure from the degraded image.
An explanatory book that supplies a clear mental map and intuition for how Vision-Language Models combine vision and language capabilities.
citing papers explorer
-
Residual Paving: Diagnosing the Routing Bottleneck in Selective Refusal Editing
Residual Paving decomposes selective refusal editing into an early-layer router for intervention decisions and later-layer residual experts for edits, with oracle routing showing that learned route selectivity is the primary bottleneck across six backbones.
-
Select-then-differentiate: Solving Bilevel Optimization with Manifold Lower-level Solution Sets
Optimistic bilevel optimization with manifold lower-level minimizers is differentiable if the optimistic selection is unique, yielding a pseudoinverse hyper-gradient and a convergent HG-MS algorithm whose rate depends on intrinsic manifold dimension.
-
Multimodal Data Curation Through Ranked Retrieval
Symmetric Nucleus Subsampling and Expert Embedding Engine reduce modality gaps in multimodal embeddings by over 90% and outperform baselines in data curation for downstream models.
-
SeqLoRA: Bilevel Orthogonal Adaptation for Continual Multi-Concept Generation
SeqLoRA applies bilevel optimization to sequential LoRA adaptation for continual multi-concept text-to-image generation with theoretical bounds on forgetting and interference.
-
NeuroMAS: Multi-Agent Systems as Neural Networks with Joint Reinforcement Learning
NeuroMAS reframes multi-agent language systems as neural architectures where LLM agents learn coordination via reinforcement learning rather than predefined roles.
-
Stable Routing for Mixture-of-Experts in Class-Incremental Learning
StaR-MoE adds sensitivity-aware routing alignment and asymmetric capacity regularization to expandable MoE architectures for class-incremental learning, reducing interference from routing drift and improving average and last-task accuracy on four benchmarks.
-
Reference-Free Reinforcement Learning Fine-Tuning for MT: A Seq2Seq Perspective
GRPO with reference-free rewards improves NLLB-200 translation quality on 13 languages up to +5.03 chrF++, competing with supervised fine-tuning on complex languages without target data.
-
A Reproducible Multi-Architecture Baseline for Token-Level Chinese Metaphor Identification under the MIPVU Framework
Adapted MelBERT MIP-only reaches 0.7281 F1 on Chinese token-level metaphor detection, outperforming RoBERTa and Qwen QLoRA, with all artifacts released for reproducibility.
-
IConFace: Identity-Structure Asymmetric Conditioning for Unified Reference-Aware Face Restoration
IConFace performs unified reference-aware and no-reference blind face restoration by asymmetrically conditioning identity from references and structure from the degraded image.
-
From Pixels to Prompts: Vision-Language Models
An explanatory book that supplies a clear mental map and intuition for how Vision-Language Models combine vision and language capabilities.