Venom: A PyTorch Generative Modeling Toolkit

Liang Yan

arxiv: 2605.17605 · v1 · pith:M3TUKSQInew · submitted 2026-05-17 · 💻 cs.LG

Venom: A PyTorch Generative Modeling Toolkit

Liang Yan This is my paper

Pith reviewed 2026-05-20 14:27 UTC · model grok-4.3

classification 💻 cs.LG

keywords generative modelsPyTorch toolkitdiffusion modelsVAEsGANsnormalizing flowsenergy-based modelseducational software

0 comments

The pith

A unified PyTorch toolkit can bring together diffusion models, VAEs, GANs and other generative families under consistent APIs.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces VENOM as an educational toolkit that implements several generative modeling approaches in PyTorch. These include diffusion and score-based models, flow matching, variational autoencoders, normalizing flows, GANs, and energy-based models. The goal is to provide a single interface starting with MNIST examples so that users can easily compare training objectives, sampling methods, and conditioning techniques. This matters because the field has many separately coded paradigms, making it hard for newcomers to see the connections and differences without switching between multiple codebases. The toolkit focuses on readability and reproducible scripts rather than scaling to large models.

Core claim

VENOM is an educational PyTorch toolkit that implements representative generative modeling families under a unified, MNIST-first interface. It includes diffusion and score-based models, flow matching and one-step generators, variational autoencoders, normalizing flows, generative adversarial networks, and energy-based models. The package offers separate training and sampling scripts, guidance examples, and tutorial notebooks to support teaching, prototyping, and lightweight benchmarking.

What carries the argument

The unified training and sampling APIs that allow consistent use across different generative model families while preserving their individual mathematical properties.

If this is right

Users gain access to bilingual tutorial notebooks for learning each model family.
Classifier and classifier-free guidance can be demonstrated with shared code examples.
Lightweight benchmarking becomes possible by reusing the same evaluation setup across models.
The organization by model family aids in teaching the distinct training objectives side by side.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The consistent interface could help reveal underlying similarities between models such as diffusion and flow matching.
Similar unified toolkits might be developed for other machine learning subfields to reduce fragmentation.
Prototyping new conditioning mechanisms could be faster when starting from the existing API structure.

Load-bearing premise

That the various generative models can share the same training and sampling interface without losing accuracy in representing their unique structures or requiring users to write special-case code.

What would settle it

A concrete test would be to implement a new model family, such as a specific type of score-based model, and verify whether all its sampling and training steps fit within the provided unified scripts without modification.

read the original abstract

Modern generative modeling has grown into a broad collection of related but often separately implemented paradigms, including denoising diffusion models, score-based stochastic differential equations, flow matching, variational autoencoders, normalizing flows, adversarial models, and energy-based models. For newcomers, this fragmentation makes it difficult to compare training objectives, inference procedures, sampling algorithms, and conditioning mechanisms within a single coherent codebase. We introduce V ENOM, an educational PyTorch toolkit that implements representative generative modeling families under a unified, MNIST-first interface. V ENOM emphasizes breadth, readability, reproducible entry points, and consistent training and sampling APIs rather than large-scale performance engineering. The package currently includes diffusion and score-based models, flow matching and one-step generators, variational autoencoders, normalizing flows, generative adversarial networks, and energy-based models. It provides separate training and sampling scripts, classifier and classifier-free guidance examples, bilingual tutorial notebooks, and a model-family organization that supports teaching, prototyping, and lightweight benchmarking.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Venom is a clean educational PyTorch wrapper for existing generative model families with consistent APIs, but it adds no new methods, benchmarks, or validation.

read the letter

The main point is that this paper presents Venom as a unified PyTorch toolkit for diffusion, flow matching, VAEs, normalizing flows, GANs, and energy-based models, all starting from MNIST examples with shared training and sampling interfaces plus bilingual notebooks. It aims at readability and teaching rather than scale or new algorithms. That packaging can genuinely help newcomers avoid hunting across separate codebases to see how the objectives and sampling steps differ in practice. The emphasis on reproducible entry points and separate scripts is a reasonable design choice for an educational tool. The code organization by model family also looks practical for prototyping and lightweight comparisons. The soft spot is the lack of any reported benchmarks, reproduction checks, or timing results that would confirm the implementations match the original papers or that the unified API avoids forcing model-specific bypasses. The stress-test worry about whether one set of calls can faithfully cover the distinct math without inconsistencies is reasonable to raise here, since the description alone does not show how the abstraction layer handles the differences in loss functions or iterative versus direct sampling. If the code requires frequent workarounds, the claimed convenience drops. This work is mainly for instructors and students who want a single starting point for exploring generative models rather than for researchers needing new capabilities or rigorous comparisons. It is not a research contribution in the usual sense. I would still send it to peer review because a documented, open educational toolkit can be worth referee time if the implementations hold up under code inspection.

Referee Report

1 major / 1 minor

Summary. The manuscript introduces VENOM, a PyTorch generative modeling toolkit that implements representative families including diffusion and score-based models, flow matching and one-step generators, variational autoencoders, normalizing flows, generative adversarial networks, and energy-based models under a unified, MNIST-first interface. It provides consistent training and sampling APIs, separate scripts, classifier and classifier-free guidance examples, bilingual tutorial notebooks, and emphasizes breadth, readability, and reproducibility for educational and prototyping purposes.

Significance. Should the unified interface prove effective in capturing the distinct objectives, sampling procedures, and conditioning mechanisms of these generative modeling families without significant loss of fidelity or need for bypasses, VENOM could provide a valuable resource for teaching and comparing these paradigms in a single coherent codebase. The focus on MNIST and reproducible entry points supports its educational goals, and the inclusion of multiple model families in one package is a strength for newcomers.

major comments (1)

[Abstract] The claim that the toolkit implements these families 'under a unified, MNIST-first interface' with 'consistent training and sampling APIs' is central but lacks any supporting details, code examples, or verification in the provided manuscript that the abstraction does not collapse important mathematical differences between, e.g., iterative sampling in diffusion models and direct sampling in VAEs.

minor comments (1)

The abstract contains 'V ENOM' with a space; this should be corrected to 'VENOM' for consistency.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the constructive review and for recognizing the potential educational value of VENOM. We address the single major comment below and will revise the manuscript to incorporate additional details.

read point-by-point responses

Referee: [Abstract] The claim that the toolkit implements these families 'under a unified, MNIST-first interface' with 'consistent training and sampling APIs' is central but lacks any supporting details, code examples, or verification in the provided manuscript that the abstraction does not collapse important mathematical differences between, e.g., iterative sampling in diffusion models and direct sampling in VAEs.

Authors: We agree that the manuscript is concise and does not currently include code examples or explicit verification of the abstraction. The unified interface is implemented via a shared base class that standardizes the training loop and sampling entry points while delegating family-specific logic (loss computation, forward process, and sampling procedure) to each subclass. This design preserves core mathematical distinctions, such as the iterative denoising steps required by diffusion and score-based models versus the direct latent decoding in VAEs or the single-pass generation in GANs and flow-matching models. To address the concern, the revised manuscript will include a short code example illustrating the common API across families together with a brief discussion confirming that distinct sampling procedures are retained without bypasses. revision: yes

Circularity Check

0 steps flagged

No circularity: software toolkit description with no derivations or predictions

full rationale

The paper presents VENOM as an educational PyTorch package implementing various generative models (diffusion, flow matching, VAEs, GANs, etc.) under a unified MNIST-first interface. No mathematical derivation chain, first-principles results, fitted predictions, or load-bearing self-citations exist. The abstract and description focus on implementation, APIs, tutorials, and reproducibility rather than deriving new results from prior ones. Any discussion of unified APIs concerns design assumptions or potential inconsistencies, not circular reduction of claims to inputs. This is a standard non-finding for a software paper.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

This is a software toolkit description with no mathematical derivations, empirical fits, or new theoretical entities.

pith-pipeline@v0.9.0 · 5684 in / 1057 out tokens · 57651 ms · 2026-05-20T14:27:01.578599+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

VENOM emphasizes breadth, readability, reproducible entry points, and consistent training and sampling APIs rather than large-scale performance engineering. The package currently includes diffusion and score-based models, flow matching..., variational autoencoders, normalizing flows, generative adversarial networks, and energy-based models.
IndisputableMonolith/Foundation/RealityFromDistinction.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Family-level organization. Each major generative modeling paradigm is implemented in a dedicated sub-package, including venom.diffusion, venom.vae, venom.flows, venom.gan, and venom.ebm.

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

47 extracted references · 47 canonical work pages · 5 internal anchors

[1]

Stochastic Interpolants: A Unifying Framework for Flows and Diffusions

Michael S. Albergo, Nicholas M. Boffi, and Eric Vanden-Eijnden. Stochastic interpolants: A unifying framework for flows and diffusions.arXiv preprint arXiv:2303.08797, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023
[2]

Wasserstein generative adversarial networks

Martin Arjovsky, Soumith Chintala, and Leon Bottou. Wasserstein generative adversarial networks. In International Conference on Machine Learning, 2017

work page 2017
[3]

Importance weighted autoencoders

Yuri Burda, Roger Grosse, and Ruslan Salakhutdinov. Importance weighted autoencoders. InInterna- tional Conference on Learning Representations, 2016

work page 2016
[4]

InfoGAN: Interpretable representation learning by information maximizing generative adversarial nets

Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya Sutskever, and Pieter Abbeel. InfoGAN: Interpretable representation learning by information maximizing generative adversarial nets. InAdvances in Neural Information Processing Systems, 2016

work page 2016
[5]

Diffusion models beat GANs on image synthesis

Prafulla Dhariwal and Alexander Quinn Nichol. Diffusion models beat GANs on image synthesis. In Advances in Neural Information Processing Systems, 2021

work page 2021
[6]

NICE: Non-linear Independent Components Estimation

Laurent Dinh, David Krueger, and Yoshua Bengio. NICE: Non-linear independent components estima- tion.arXiv preprint arXiv:1410.8516, 2014. 5

work page internal anchor Pith review Pith/arXiv arXiv 2014
[7]

Density estimation using real NVP

Laurent Dinh, Jascha Sohl-Dickstein, and Samy Bengio. Density estimation using real NVP. In International Conference on Learning Representations, 2017

work page 2017
[8]

Implicit generation and modeling with energy based models

Yilun Du and Igor Mordatch. Implicit generation and modeling with energy based models. InAdvances in Neural Information Processing Systems, 2019

work page 2019
[9]

Neural spline flows

Conor Durkan, Artur Bekasov, Iain Murray, and George Papamakarios. Neural spline flows. InAdvances in Neural Information Processing Systems, 2019

work page 2019
[10]

One step diffusion via shortcut models

Kevin Frans, Danijar Hafner, Sergey Levine, and Pieter Abbeel. One step diffusion via shortcut models. InInternational Conference on Learning Representations, 2025

work page 2025
[11]

Mean Flows for One-step Generative Modeling

Zhengyang Geng, Mingyang Deng, Xingjian Bai, J. Zico Kolter, and Kaiming He. Mean flows for one-step generative modeling.arXiv preprint arXiv:2505.13447, 2025

work page internal anchor Pith review Pith/arXiv arXiv 2025
[12]

Generative adversarial nets

Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. Generative adversarial nets. InAdvances in Neural Information Processing Systems, 2014

work page 2014
[13]

Will Grathwohl, Ricky T. Q. Chen, Jesse Bettencourt, Ilya Sutskever, and David Duvenaud. FFJORD: Free-form continuous dynamics for scalable reversible generative models. InInternational Conference on Learning Representations, 2019

work page 2019
[14]

Your classifier is secretly an energy based model and you should treat it like one

Will Grathwohl, Kuan-Chieh Wang, J¨orn-Henrik Jacobsen, David Duvenaud, Mohammad Norouzi, and Kevin Swersky. Your classifier is secretly an energy based model and you should treat it like one. In International Conference on Learning Representations, 2020

work page 2020
[15]

Improved training of wasserstein GANs

Ishaan Gulrajani, Faruk Ahmed, Martin Arjovsky, Vincent Dumoulin, and Aaron Courville. Improved training of wasserstein GANs. InAdvances in Neural Information Processing Systems, 2017

work page 2017
[16]

Noise-contrastive estimation: A new estimation principle for unnormalized statistical models

Michael Gutmann and Aapo Hyv¨arinen. Noise-contrastive estimation: A new estimation principle for unnormalized statistical models. InInternational Conference on Artificial Intelligence and Statistics, 2010

work page 2010
[17]

Burgess, Xavier Glorot, Matthew Botvinick, Shakir Mohamed, and Alexander Lerchner

Irina Higgins, Loic Matthey, Arka Pal, Christopher P. Burgess, Xavier Glorot, Matthew Botvinick, Shakir Mohamed, and Alexander Lerchner. beta-V AE: Learning basic visual concepts with a constrained variational framework. InInternational Conference on Learning Representations, 2017

work page 2017
[18]

Geoffrey E. Hinton. Training products of experts by minimizing contrastive divergence.Neural Computation, 14(8):1771–1800, 2002

work page 2002
[19]

Classifier-free diffusion guidance

Jonathan Ho and Tim Salimans. Classifier-free diffusion guidance. InNeurIPS Workshop on Deep Generative Models and Downstream Applications, 2021

work page 2021
[20]

Flow++: Improving flow-based generative models with variational dequantization and architecture design

Jonathan Ho, Xi Chen, Aravind Srinivas, Yan Duan, and Pieter Abbeel. Flow++: Improving flow-based generative models with variational dequantization and architecture design. InInternational Conference on Machine Learning, 2019

work page 2019
[21]

Denoising diffusion probabilistic models

Jonathan Ho, Ajay Jain, and Pieter Abbeel. Denoising diffusion probabilistic models. InAdvances in Neural Information Processing Systems, 2020

work page 2020
[22]

Estimation of non-normalized statistical models by score matching.Journal of Machine Learning Research, 6:695–709, 2005

Aapo Hyv ¨arinen. Estimation of non-normalized statistical models by score matching.Journal of Machine Learning Research, 6:695–709, 2005. 6

work page 2005
[23]

Elucidating the design space of diffusion-based generative models

Tero Karras, Miika Aittala, Timo Aila, and Samuli Laine. Elucidating the design space of diffusion-based generative models. InAdvances in Neural Information Processing Systems, 2022

work page 2022
[24]

Kingma and Prafulla Dhariwal

Diederik P. Kingma and Prafulla Dhariwal. Glow: Generative flow with invertible 1x1 convolutions. In Advances in Neural Information Processing Systems, 2018

work page 2018
[25]

Kingma and Max Welling

Diederik P. Kingma and Max Welling. Auto-encoding variational bayes. InInternational Conference on Learning Representations, 2014

work page 2014
[26]

Kingma, Tim Salimans, Rafal Jozefowicz, Xi Chen, Ilya Sutskever, and Max Welling

Diederik P. Kingma, Tim Salimans, Rafal Jozefowicz, Xi Chen, Ilya Sutskever, and Max Welling. Improved variational inference with inverse autoregressive flow. InAdvances in Neural Information Processing Systems, 2016

work page 2016
[27]

A tutorial on energy-based learning

Yann LeCun, Sumit Chopra, Raia Hadsell, Marc’Aurelio Ranzato, and Fu Jie Huang. A tutorial on energy-based learning. InPredicting Structured Data. MIT Press, 2006

work page 2006
[28]

Geometric GAN

Jae Hyun Lim and Jong Chul Ye. Geometric GAN.arXiv preprint arXiv:1705.02894, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017
[29]

Yaron Lipman, Ricky T. Q. Chen, Heli Ben-Hamu, Maximilian Nickel, and Matt Le. Flow matching for generative modeling. InInternational Conference on Learning Representations, 2023

work page 2023
[30]

Flow straight and fast: Learning to generate and transfer data with rectified flow

Xingchao Liu, Chengyue Gong, and Qiang Liu. Flow straight and fast: Learning to generate and transfer data with rectified flow. InInternational Conference on Learning Representations, 2023

work page 2023
[31]

DPM-Solver: A fast ODE solver for diffusion probabilistic model sampling in around 10 steps

Cheng Lu, Yuhao Zhou, Fan Bao, Jianfei Chen, Chongxuan Li, and Jun Zhu. DPM-Solver: A fast ODE solver for diffusion probabilistic model sampling in around 10 steps. InAdvances in Neural Information Processing Systems, 2022

work page 2022
[32]

Xudong Mao, Qing Li, Haoran Xie, Raymond Y . K. Lau, Zhen Wang, and Stephen Paul Smolley. Least squares generative adversarial networks. InInternational Conference on Computer Vision, 2017

work page 2017
[33]

Conditional Generative Adversarial Nets

Mehdi Mirza and Simon Osindero. Conditional generative adversarial nets.arXiv preprint arXiv:1411.1784, 2014

work page internal anchor Pith review Pith/arXiv arXiv 2014
[34]

Spectral normalization for generative adversarial networks

Takeru Miyato, Toshiki Kataoka, Masanori Koyama, and Yuichi Yoshida. Spectral normalization for generative adversarial networks. InInternational Conference on Learning Representations, 2018

work page 2018
[35]

Improved denoising diffusion probabilistic models

Alexander Quinn Nichol and Prafulla Dhariwal. Improved denoising diffusion probabilistic models. In International Conference on Machine Learning, 2021

work page 2021
[36]

Conditional image synthesis with auxiliary classifier gans

Augustus Odena, Christopher Olah, and Jonathon Shlens. Conditional image synthesis with auxiliary classifier gans. InInternational Conference on Machine Learning, 2017

work page 2017
[37]

Masked autoregressive flow for density estimation

George Papamakarios, Theo Pavlakou, and Iain Murray. Masked autoregressive flow for density estimation. InAdvances in Neural Information Processing Systems, 2017

work page 2017
[38]

Unsupervised representation learning with deep convolutional generative adversarial networks

Alec Radford, Luke Metz, and Soumith Chintala. Unsupervised representation learning with deep convolutional generative adversarial networks. InInternational Conference on Learning Representations, 2016

work page 2016
[39]

Variational inference with normalizing flows

Danilo Jimenez Rezende and Shakir Mohamed. Variational inference with normalizing flows. In International Conference on Machine Learning, 2015. 7

work page 2015
[40]

Progressive distillation for fast sampling of diffusion models

Tim Salimans and Jonathan Ho. Progressive distillation for fast sampling of diffusion models. In International Conference on Learning Representations, 2022

work page 2022
[41]

Denoising diffusion implicit models

Jiaming Song, Chenlin Meng, and Stefano Ermon. Denoising diffusion implicit models. InInternational Conference on Learning Representations, 2021

work page 2021
[42]

Generative modeling by estimating gradients of the data distribution

Yang Song and Stefano Ermon. Generative modeling by estimating gradients of the data distribution. In Advances in Neural Information Processing Systems, 2019

work page 2019
[43]

Kingma, Abhishek Kumar, Stefano Ermon, and Ben Poole

Yang Song, Jascha Sohl-Dickstein, Diederik P. Kingma, Abhishek Kumar, Stefano Ermon, and Ben Poole. Score-based generative modeling through stochastic differential equations. InInternational Conference on Learning Representations, 2021

work page 2021
[44]

Consistency models

Yang Song, Prafulla Dhariwal, Mark Chen, and Ilya Sutskever. Consistency models. InInternational Conference on Machine Learning, 2023

work page 2023
[45]

Improving and generalizing flow-based generative models with minibatch optimal transport.Transactions on Machine Learning Research, 2024

Alexander Tong, Kilian Fatras, Nikolay Malkin, Guillaume Huguet, Yanlei Zhang, Jarrid Rector- Brooks, Guy Wolf, and Yoshua Bengio. Improving and generalizing flow-based generative models with minibatch optimal transport.Transactions on Machine Learning Research, 2024

work page 2024
[46]

Neural discrete representation learning

Aaron van den Oord, Oriol Vinyals, and Koray Kavukcuoglu. Neural discrete representation learning. InAdvances in Neural Information Processing Systems, 2017

work page 2017
[47]

Poisson flow generative models

Yilun Xu, Ziming Liu, Max Tegmark, and Tommi Jaakkola. Poisson flow generative models. In Advances in Neural Information Processing Systems, 2022. 8

work page 2022

[1] [1]

Stochastic Interpolants: A Unifying Framework for Flows and Diffusions

Michael S. Albergo, Nicholas M. Boffi, and Eric Vanden-Eijnden. Stochastic interpolants: A unifying framework for flows and diffusions.arXiv preprint arXiv:2303.08797, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023

[2] [2]

Wasserstein generative adversarial networks

Martin Arjovsky, Soumith Chintala, and Leon Bottou. Wasserstein generative adversarial networks. In International Conference on Machine Learning, 2017

work page 2017

[3] [3]

Importance weighted autoencoders

Yuri Burda, Roger Grosse, and Ruslan Salakhutdinov. Importance weighted autoencoders. InInterna- tional Conference on Learning Representations, 2016

work page 2016

[4] [4]

InfoGAN: Interpretable representation learning by information maximizing generative adversarial nets

Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya Sutskever, and Pieter Abbeel. InfoGAN: Interpretable representation learning by information maximizing generative adversarial nets. InAdvances in Neural Information Processing Systems, 2016

work page 2016

[5] [5]

Diffusion models beat GANs on image synthesis

Prafulla Dhariwal and Alexander Quinn Nichol. Diffusion models beat GANs on image synthesis. In Advances in Neural Information Processing Systems, 2021

work page 2021

[6] [6]

NICE: Non-linear Independent Components Estimation

Laurent Dinh, David Krueger, and Yoshua Bengio. NICE: Non-linear independent components estima- tion.arXiv preprint arXiv:1410.8516, 2014. 5

work page internal anchor Pith review Pith/arXiv arXiv 2014

[7] [7]

Density estimation using real NVP

Laurent Dinh, Jascha Sohl-Dickstein, and Samy Bengio. Density estimation using real NVP. In International Conference on Learning Representations, 2017

work page 2017

[8] [8]

Implicit generation and modeling with energy based models

Yilun Du and Igor Mordatch. Implicit generation and modeling with energy based models. InAdvances in Neural Information Processing Systems, 2019

work page 2019

[9] [9]

Neural spline flows

Conor Durkan, Artur Bekasov, Iain Murray, and George Papamakarios. Neural spline flows. InAdvances in Neural Information Processing Systems, 2019

work page 2019

[10] [10]

One step diffusion via shortcut models

Kevin Frans, Danijar Hafner, Sergey Levine, and Pieter Abbeel. One step diffusion via shortcut models. InInternational Conference on Learning Representations, 2025

work page 2025

[11] [11]

Mean Flows for One-step Generative Modeling

Zhengyang Geng, Mingyang Deng, Xingjian Bai, J. Zico Kolter, and Kaiming He. Mean flows for one-step generative modeling.arXiv preprint arXiv:2505.13447, 2025

work page internal anchor Pith review Pith/arXiv arXiv 2025

[12] [12]

Generative adversarial nets

Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. Generative adversarial nets. InAdvances in Neural Information Processing Systems, 2014

work page 2014

[13] [13]

Will Grathwohl, Ricky T. Q. Chen, Jesse Bettencourt, Ilya Sutskever, and David Duvenaud. FFJORD: Free-form continuous dynamics for scalable reversible generative models. InInternational Conference on Learning Representations, 2019

work page 2019

[14] [14]

Your classifier is secretly an energy based model and you should treat it like one

Will Grathwohl, Kuan-Chieh Wang, J¨orn-Henrik Jacobsen, David Duvenaud, Mohammad Norouzi, and Kevin Swersky. Your classifier is secretly an energy based model and you should treat it like one. In International Conference on Learning Representations, 2020

work page 2020

[15] [15]

Improved training of wasserstein GANs

Ishaan Gulrajani, Faruk Ahmed, Martin Arjovsky, Vincent Dumoulin, and Aaron Courville. Improved training of wasserstein GANs. InAdvances in Neural Information Processing Systems, 2017

work page 2017

[16] [16]

Noise-contrastive estimation: A new estimation principle for unnormalized statistical models

Michael Gutmann and Aapo Hyv¨arinen. Noise-contrastive estimation: A new estimation principle for unnormalized statistical models. InInternational Conference on Artificial Intelligence and Statistics, 2010

work page 2010

[17] [17]

Burgess, Xavier Glorot, Matthew Botvinick, Shakir Mohamed, and Alexander Lerchner

Irina Higgins, Loic Matthey, Arka Pal, Christopher P. Burgess, Xavier Glorot, Matthew Botvinick, Shakir Mohamed, and Alexander Lerchner. beta-V AE: Learning basic visual concepts with a constrained variational framework. InInternational Conference on Learning Representations, 2017

work page 2017

[18] [18]

Geoffrey E. Hinton. Training products of experts by minimizing contrastive divergence.Neural Computation, 14(8):1771–1800, 2002

work page 2002

[19] [19]

Classifier-free diffusion guidance

Jonathan Ho and Tim Salimans. Classifier-free diffusion guidance. InNeurIPS Workshop on Deep Generative Models and Downstream Applications, 2021

work page 2021

[20] [20]

Flow++: Improving flow-based generative models with variational dequantization and architecture design

Jonathan Ho, Xi Chen, Aravind Srinivas, Yan Duan, and Pieter Abbeel. Flow++: Improving flow-based generative models with variational dequantization and architecture design. InInternational Conference on Machine Learning, 2019

work page 2019

[21] [21]

Denoising diffusion probabilistic models

Jonathan Ho, Ajay Jain, and Pieter Abbeel. Denoising diffusion probabilistic models. InAdvances in Neural Information Processing Systems, 2020

work page 2020

[22] [22]

Estimation of non-normalized statistical models by score matching.Journal of Machine Learning Research, 6:695–709, 2005

Aapo Hyv ¨arinen. Estimation of non-normalized statistical models by score matching.Journal of Machine Learning Research, 6:695–709, 2005. 6

work page 2005

[23] [23]

Elucidating the design space of diffusion-based generative models

Tero Karras, Miika Aittala, Timo Aila, and Samuli Laine. Elucidating the design space of diffusion-based generative models. InAdvances in Neural Information Processing Systems, 2022

work page 2022

[24] [24]

Kingma and Prafulla Dhariwal

Diederik P. Kingma and Prafulla Dhariwal. Glow: Generative flow with invertible 1x1 convolutions. In Advances in Neural Information Processing Systems, 2018

work page 2018

[25] [25]

Kingma and Max Welling

Diederik P. Kingma and Max Welling. Auto-encoding variational bayes. InInternational Conference on Learning Representations, 2014

work page 2014

[26] [26]

Kingma, Tim Salimans, Rafal Jozefowicz, Xi Chen, Ilya Sutskever, and Max Welling

Diederik P. Kingma, Tim Salimans, Rafal Jozefowicz, Xi Chen, Ilya Sutskever, and Max Welling. Improved variational inference with inverse autoregressive flow. InAdvances in Neural Information Processing Systems, 2016

work page 2016

[27] [27]

A tutorial on energy-based learning

Yann LeCun, Sumit Chopra, Raia Hadsell, Marc’Aurelio Ranzato, and Fu Jie Huang. A tutorial on energy-based learning. InPredicting Structured Data. MIT Press, 2006

work page 2006

[28] [28]

Geometric GAN

Jae Hyun Lim and Jong Chul Ye. Geometric GAN.arXiv preprint arXiv:1705.02894, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017

[29] [29]

Yaron Lipman, Ricky T. Q. Chen, Heli Ben-Hamu, Maximilian Nickel, and Matt Le. Flow matching for generative modeling. InInternational Conference on Learning Representations, 2023

work page 2023

[30] [30]

Flow straight and fast: Learning to generate and transfer data with rectified flow

Xingchao Liu, Chengyue Gong, and Qiang Liu. Flow straight and fast: Learning to generate and transfer data with rectified flow. InInternational Conference on Learning Representations, 2023

work page 2023

[31] [31]

DPM-Solver: A fast ODE solver for diffusion probabilistic model sampling in around 10 steps

Cheng Lu, Yuhao Zhou, Fan Bao, Jianfei Chen, Chongxuan Li, and Jun Zhu. DPM-Solver: A fast ODE solver for diffusion probabilistic model sampling in around 10 steps. InAdvances in Neural Information Processing Systems, 2022

work page 2022

[32] [32]

Xudong Mao, Qing Li, Haoran Xie, Raymond Y . K. Lau, Zhen Wang, and Stephen Paul Smolley. Least squares generative adversarial networks. InInternational Conference on Computer Vision, 2017

work page 2017

[33] [33]

Conditional Generative Adversarial Nets

Mehdi Mirza and Simon Osindero. Conditional generative adversarial nets.arXiv preprint arXiv:1411.1784, 2014

work page internal anchor Pith review Pith/arXiv arXiv 2014

[34] [34]

Spectral normalization for generative adversarial networks

Takeru Miyato, Toshiki Kataoka, Masanori Koyama, and Yuichi Yoshida. Spectral normalization for generative adversarial networks. InInternational Conference on Learning Representations, 2018

work page 2018

[35] [35]

Improved denoising diffusion probabilistic models

Alexander Quinn Nichol and Prafulla Dhariwal. Improved denoising diffusion probabilistic models. In International Conference on Machine Learning, 2021

work page 2021

[36] [36]

Conditional image synthesis with auxiliary classifier gans

Augustus Odena, Christopher Olah, and Jonathon Shlens. Conditional image synthesis with auxiliary classifier gans. InInternational Conference on Machine Learning, 2017

work page 2017

[37] [37]

Masked autoregressive flow for density estimation

George Papamakarios, Theo Pavlakou, and Iain Murray. Masked autoregressive flow for density estimation. InAdvances in Neural Information Processing Systems, 2017

work page 2017

[38] [38]

Unsupervised representation learning with deep convolutional generative adversarial networks

Alec Radford, Luke Metz, and Soumith Chintala. Unsupervised representation learning with deep convolutional generative adversarial networks. InInternational Conference on Learning Representations, 2016

work page 2016

[39] [39]

Variational inference with normalizing flows

Danilo Jimenez Rezende and Shakir Mohamed. Variational inference with normalizing flows. In International Conference on Machine Learning, 2015. 7

work page 2015

[40] [40]

Progressive distillation for fast sampling of diffusion models

Tim Salimans and Jonathan Ho. Progressive distillation for fast sampling of diffusion models. In International Conference on Learning Representations, 2022

work page 2022

[41] [41]

Denoising diffusion implicit models

Jiaming Song, Chenlin Meng, and Stefano Ermon. Denoising diffusion implicit models. InInternational Conference on Learning Representations, 2021

work page 2021

[42] [42]

Generative modeling by estimating gradients of the data distribution

Yang Song and Stefano Ermon. Generative modeling by estimating gradients of the data distribution. In Advances in Neural Information Processing Systems, 2019

work page 2019

[43] [43]

Kingma, Abhishek Kumar, Stefano Ermon, and Ben Poole

Yang Song, Jascha Sohl-Dickstein, Diederik P. Kingma, Abhishek Kumar, Stefano Ermon, and Ben Poole. Score-based generative modeling through stochastic differential equations. InInternational Conference on Learning Representations, 2021

work page 2021

[44] [44]

Consistency models

Yang Song, Prafulla Dhariwal, Mark Chen, and Ilya Sutskever. Consistency models. InInternational Conference on Machine Learning, 2023

work page 2023

[45] [45]

Improving and generalizing flow-based generative models with minibatch optimal transport.Transactions on Machine Learning Research, 2024

Alexander Tong, Kilian Fatras, Nikolay Malkin, Guillaume Huguet, Yanlei Zhang, Jarrid Rector- Brooks, Guy Wolf, and Yoshua Bengio. Improving and generalizing flow-based generative models with minibatch optimal transport.Transactions on Machine Learning Research, 2024

work page 2024

[46] [46]

Neural discrete representation learning

Aaron van den Oord, Oriol Vinyals, and Koray Kavukcuoglu. Neural discrete representation learning. InAdvances in Neural Information Processing Systems, 2017

work page 2017

[47] [47]

Poisson flow generative models

Yilun Xu, Ziming Liu, Max Tegmark, and Tommi Jaakkola. Poisson flow generative models. In Advances in Neural Information Processing Systems, 2022. 8

work page 2022