Title resolution pending

Reinforcement Learning, Control as Probabilistic Inference: Tutorial, Review , author= · 2018

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

browse 4 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Plan First, Diffuse Later: Extrinsic Graph Guidance for Long-Horizon Diffusion Planning

cs.RO · 2026-05-16 · unverdicted · novelty 6.0

XDiffuser combines extrinsic graph planning with diffusion models to guide denoising and improve performance on long-horizon robotic tasks including multi-agent coordination and TSP-style problems.

Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents

cs.AI · 2024-08-13 · unverdicted · novelty 6.0

Agent Q integrates MCTS-guided search, self-critique, and off-policy DPO to train LLM agents that outperform behavior cloning and reinforced fine-tuning baselines in WebShop and achieve up to 95.4% success in real-world booking scenarios.

Aligning Modalities in Vision Large Language Models via Preference Fine-tuning

cs.LG · 2024-02-18 · unverdicted · novelty 5.0

POVID generates AI-created preference data to fine-tune vision-language models with DPO, reducing hallucinations and improving benchmark scores.

Controllable Molecular Generative Foundation Models

cs.LG · 2026-05-14

citing papers explorer

Showing 1 of 1 citing paper after filters.

Controllable Molecular Generative Foundation Models cs.LG · 2026-05-14 · unreviewed · ref 29

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer