Title resolution pending

Edward J · 2021

16 Pith papers cite this work. Polarity classification is still indexing.

16 Pith papers citing it

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

citation-role summary

method 2

citation-polarity summary

use method 2

representative citing papers

SteeringDiffusion: A Bottlenecked Activation Control Interface for Diffusion Models

cs.CV · 2026-05-03 · unverdicted · novelty 7.0

SteeringDiffusion supplies a bottlenecked, prompt-conditioned activation interface for frozen diffusion models that delivers smooth monotonic content-style control via one runtime scalar and timestep gating.

DeepParse: Hybrid Log Parsing with LLM-Synthesized Regex Masks

cs.SE · 2026-04-22 · unverdicted · novelty 7.0

DeepParse mines reusable regex patterns with an LLM from few log samples and applies them via Drain to achieve 97.6% average parsing accuracy on 16 datasets, outperforming baselines and cutting anomaly detection false alarms by over 30%.

Randomized Antipodal Search Done Right for Data Pareto Improvement of LLM Unlearning

cs.LG · 2026-04-17 · unverdicted · novelty 7.0

RASLIK uses randomized antipodal search on linearized influence kernels to achieve data Pareto improvement in LLM unlearning, outperforming baselines with sublinear complexity and double gains in quality and efficiency.

Can MLLMs Reason About Visual Persuasion? Evaluating the Efficacy and Faithfulness of Reasoning

cs.CV · 2026-05-09 · conditional · novelty 6.0

Diverse teacher-generated rationales improve MLLM visual persuasiveness prediction via supervised fine-tuning, while a new three-dimensional faithfulness framework shows that prediction accuracy alone does not ensure faithful reasoning and that decision sensitivity best matches human preferences.

Safety Drift After Fine-Tuning: Evidence from High-Stakes Domains

cs.CY · 2026-04-27 · unverdicted · novelty 6.0

Benign fine-tuning of foundation models induces large, heterogeneous, and often contradictory changes in safety metrics across general and domain-specific benchmarks.

UAF: A Unified Audio Front-end LLM for Full-Duplex Speech Interaction

cs.AI · 2026-04-21 · unverdicted · novelty 6.0

UAF is the first unified audio front-end LLM that turns multiple front-end tasks into one sequence prediction model processing streaming audio chunks and reference prompts to output semantic and control tokens for full-duplex interaction.

Traj-CoA: Patient Trajectory Modeling via Chain-of-Agents for Lung Cancer Risk Prediction

cs.AI · 2025-10-12 · unverdicted · novelty 6.0

Traj-CoA is a multi-agent LLM framework that sequentially processes noisy five-year EHR data via worker agents into EHRMem for manager-agent lung cancer risk prediction and outperforms four categories of baselines in zero-shot evaluation.

WhisperRT -- Turning Whisper into a Causal Streaming Model

cs.CL · 2025-08-17 · conditional · novelty 6.0

WhisperRT converts Whisper to a causal streaming ASR model via encoder causality, decoder synchronization on partial states, and fine-tuning, achieving better performance than non-fine-tuned streaming methods on sub-300ms chunks with lower complexity.

Slot-MLLM: Object-Centric Visual Tokenization for Multimodal LLM

cs.CV · 2025-05-23 · unverdicted · novelty 6.0

Slot-MLLM introduces a slot-attention-based object-centric visual tokenizer with Q-Former encoder, diffusion decoder, and residual vector quantization for improved local visual comprehension and generation in multimodal LLMs.

Long Context Transfer from Language to Vision

cs.CV · 2024-06-24 · unverdicted · novelty 6.0

Extending language model context length enables LMMs to process over 200K visual tokens from long videos without video training, achieving SOTA on Video-MME via dense frame sampling.

Zephyr: Direct Distillation of LM Alignment

cs.LG · 2023-10-25 · accept · novelty 6.0

Zephyr-7B achieves state-of-the-art chat benchmark results among 7B models by distilling alignment via dDPO on AI feedback preferences, surpassing the 70B Llama-2-Chat model on MT-Bench with no human data required.

Beyond Surface Artifacts: Capturing Shared Latent Forgery Knowledge Across Modalities

cs.CV · 2026-04-09 · unverdicted · novelty 5.0

Introduces MAF framework and DeepModal-Bench to capture universal cross-modal forgery traces for better generalization in multimodal deepfake detection.

Tool-MCoT: Tool Augmented Multimodal Chain-of-Thought for Content Safety Moderation

cs.CL · 2026-03-15 · unverdicted · novelty 5.0

A small language model fine-tuned on tool-augmented chain-of-thought data generated by a larger LLM learns to selectively call tools, delivering better content moderation accuracy at lower inference cost.

InvDesFlow-AL: active learning-based workflow for inverse design of functional materials

cond-mat.mtrl-sci · 2025-05-14 · unverdicted · novelty 5.0

InvDesFlow-AL combines active learning with diffusion generative models to improve crystal structure prediction accuracy by 33% and identifies Li2AuH6 as a candidate BCS superconductor with 140 K transition temperature.

Aggregating Low Rank Adapters in Federated Fine-tuning

cs.LG · 2025-01-10 · unverdicted · novelty 5.0

Proposes and benchmarks a new aggregation technique for LoRA adapters in federated fine-tuning against existing methods on GLUE tasks.

Reinforcement Learning for LLM Post-Training: A Survey

cs.CL · 2024-07-23 · unverdicted · novelty 3.0

A survey deriving a unified policy gradient framework for LLM post-training methods and providing technical comparisons of PPO, GRPO, DPO variants.

citing papers explorer

Showing 16 of 16 citing papers.

SteeringDiffusion: A Bottlenecked Activation Control Interface for Diffusion Models cs.CV · 2026-05-03 · unverdicted · none · ref 6
SteeringDiffusion supplies a bottlenecked, prompt-conditioned activation interface for frozen diffusion models that delivers smooth monotonic content-style control via one runtime scalar and timestep gating.
DeepParse: Hybrid Log Parsing with LLM-Synthesized Regex Masks cs.SE · 2026-04-22 · unverdicted · none · ref 14
DeepParse mines reusable regex patterns with an LLM from few log samples and applies them via Drain to achieve 97.6% average parsing accuracy on 16 datasets, outperforming baselines and cutting anomaly detection false alarms by over 30%.
Randomized Antipodal Search Done Right for Data Pareto Improvement of LLM Unlearning cs.LG · 2026-04-17 · unverdicted · none · ref 16
RASLIK uses randomized antipodal search on linearized influence kernels to achieve data Pareto improvement in LLM unlearning, outperforming baselines with sublinear complexity and double gains in quality and efficiency.
Can MLLMs Reason About Visual Persuasion? Evaluating the Efficacy and Faithfulness of Reasoning cs.CV · 2026-05-09 · conditional · none · ref 42
Diverse teacher-generated rationales improve MLLM visual persuasiveness prediction via supervised fine-tuning, while a new three-dimensional faithfulness framework shows that prediction accuracy alone does not ensure faithful reasoning and that decision sensitivity best matches human preferences.
Safety Drift After Fine-Tuning: Evidence from High-Stakes Domains cs.CY · 2026-04-27 · unverdicted · none · ref 24
Benign fine-tuning of foundation models induces large, heterogeneous, and often contradictory changes in safety metrics across general and domain-specific benchmarks.
UAF: A Unified Audio Front-end LLM for Full-Duplex Speech Interaction cs.AI · 2026-04-21 · unverdicted · none · ref 14
UAF is the first unified audio front-end LLM that turns multiple front-end tasks into one sequence prediction model processing streaming audio chunks and reference prompts to output semantic and control tokens for full-duplex interaction.
Traj-CoA: Patient Trajectory Modeling via Chain-of-Agents for Lung Cancer Risk Prediction cs.AI · 2025-10-12 · unverdicted · none · ref 48
Traj-CoA is a multi-agent LLM framework that sequentially processes noisy five-year EHR data via worker agents into EHRMem for manager-agent lung cancer risk prediction and outperforms four categories of baselines in zero-shot evaluation.
WhisperRT -- Turning Whisper into a Causal Streaming Model cs.CL · 2025-08-17 · conditional · none · ref 13
WhisperRT converts Whisper to a causal streaming ASR model via encoder causality, decoder synchronization on partial states, and fine-tuning, achieving better performance than non-fine-tuned streaming methods on sub-300ms chunks with lower complexity.
Slot-MLLM: Object-Centric Visual Tokenization for Multimodal LLM cs.CV · 2025-05-23 · unverdicted · none · ref 20
Slot-MLLM introduces a slot-attention-based object-centric visual tokenizer with Q-Former encoder, diffusion decoder, and residual vector quantization for improved local visual comprehension and generation in multimodal LLMs.
Long Context Transfer from Language to Vision cs.CV · 2024-06-24 · unverdicted · none · ref 26
Extending language model context length enables LMMs to process over 200K visual tokens from long videos without video training, achieving SOTA on Video-MME via dense frame sampling.
Zephyr: Direct Distillation of LM Alignment cs.LG · 2023-10-25 · accept · none · ref 67
Zephyr-7B achieves state-of-the-art chat benchmark results among 7B models by distilling alignment via dDPO on AI feedback preferences, surpassing the 70B Llama-2-Chat model on MT-Bench with no human data required.
Beyond Surface Artifacts: Capturing Shared Latent Forgery Knowledge Across Modalities cs.CV · 2026-04-09 · unverdicted · none · ref 18
Introduces MAF framework and DeepModal-Bench to capture universal cross-modal forgery traces for better generalization in multimodal deepfake detection.
Tool-MCoT: Tool Augmented Multimodal Chain-of-Thought for Content Safety Moderation cs.CL · 2026-03-15 · unverdicted · none · ref 18
A small language model fine-tuned on tool-augmented chain-of-thought data generated by a larger LLM learns to selectively call tools, delivering better content moderation accuracy at lower inference cost.
InvDesFlow-AL: active learning-based workflow for inverse design of functional materials cond-mat.mtrl-sci · 2025-05-14 · unverdicted · none · ref 21
InvDesFlow-AL combines active learning with diffusion generative models to improve crystal structure prediction accuracy by 33% and identifies Li2AuH6 as a candidate BCS superconductor with 140 K transition temperature.
Aggregating Low Rank Adapters in Federated Fine-tuning cs.LG · 2025-01-10 · unverdicted · none · ref 14
Proposes and benchmarks a new aggregation technique for LoRA adapters in federated fine-tuning against existing methods on GLUE tasks.
Reinforcement Learning for LLM Post-Training: A Survey cs.CL · 2024-07-23 · unverdicted · none · ref 78
A survey deriving a unified policy gradient framework for LLM post-training methods and providing technical comparisons of PPO, GRPO, DPO variants.

Title resolution pending

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer