Awesome-Diffusion-Models

Awesome License: MIT Made With Love

This repository contains a collection of resources and papers on Diffusion Models.

Please refer to this page as this page may not contain all the information due to page constraints.

Contents

Resources

Introductory Posts

:fast_forward: DiffusionFastForward: 01-Diffusion-Theory
Mikolaj Czerkawski (@mikonvergence)
[Website]
4 Feb 2023

How diffusion models work: the math from scratch
Sergios Karagiannakos,Nikolas Adaloglou
[Website]
24 Sep 2022

A Path to the Variational Diffusion Loss
Alex Alemi
[Website] [Colab]
15 Sep 2022

The Annotated Diffusion Model
Niels Rogge, Kashif Rasul
[Website]
06 Jun 2022

The recent rise of diffusion-based models
Maciej Domagała
[Website]
06 Jun 2022

Introduction to Diffusion Models for Machine Learning
Ryan O’Connor
[Website]
12 May 2022

Improving Diffusion Models as an Alternative To GANs
Arash Vahdat and Karsten Kreis
[Website-Part 1] [Website-Part 2]
26 Apr 2022

An introduction to Diffusion Probabilistic Models
Ayan Das
[Website]
04 Dec 2021

Introduction to deep generative modeling: Diffusion-based Deep Generative Models
Jakub Tomczak
[Website]
30 Aug 2021

What are Diffusion Models?
Lilian Weng
[Website]
11 Jul 2021

Diffusion Models as a kind of VAE
Angus Turner
[Website]
29 Jun 2021

Generative Modeling by Estimating Gradients of the Data Distribution
Yang Song
[Website]
5 May 2021

Introductory Papers

Understanding Diffusion Models: A Unified Perspective
Calvin Luo
arXiv 2022. [Paper]
25 Aug 2022

How to Train Your Energy-Based Models
Yang Song, Diederik P. Kingma
arXiv 2022. [Paper]
9 Jan 2021

Introductory Videos

:fast_forward: DiffusionFastForward
Mikolaj Czerkawski (@mikonvergence)
[Video]
4 Mar 2023

Diffusion models from scratch in PyTorch
DeepFindr
[Video]
18 Jul 2022

Diffusion Models | Paper Explanation | Math Explained
Outlier
[Video]
6 Jun 2022

What are Diffusion Models?
Ari Seff
[Video]
20 Apr 2022

Diffusion models explained
AI Coffee Break with Letitia
[Video]
23 Mar 2022

Introductory Lectures

Denoising Diffusion-based Generative Modeling: Foundations and Applications
Karsten Kreis, Ruiqi Gao, Arash Vahdat
[Page]
19 Jun 2022

Diffusion Probabilistic Models
Jascha Sohl-Dickstein, MIT 6.S192 - Lecture 22
[Video]
19 Apr 2022

Tutorial and Jupyter Notebook

:fast_forward: DiffusionFastForward: train from scratch in colab
Mikolaj Czerkawski (@mikonvergence)
[Github] [notebook]

diffusion-for-beginners
ozanciga
[Github]

Beyond Diffusion: What is Personalized Image Generation and How Can You Customize Image Synthesis?
J. Rafid Siddiqui
[Github] [Medium]

Diffusion_models_tutorial
FilippoMB
[Github]

ScoreDiffusionModel
JeongJiHeon
[Github]

Minimal implementation of diffusion models
VSehwag
[Github]

diffusion_tutorial
sunlin-ai
[Github]

Denoising diffusion probabilistic models
acids-ircam
[Github]

Centipede Diffusion
Zalring
[Notebook]

Deforum Stable Diffusion
deforum
[Notebook]

Stable Diffusion Interpolation
None
[Notebook]

Keras Stable Diffusion: GPU starter example
None
[Notebook]

Huemin Jax Diffusion
huemin-art
[Notebook]

Disco Diffusion
alembics
[Notebook]

Simplified Disco Diffusion
entmike
[Notebook]

WAS’s Disco Diffusion - Portrait Generator Playground
WASasquatch
[Notebook]

Diffusers - Hugging Face
huggingface
[Notebook]

Papers

Survey

A Survey on Video Diffusion Models
Zhen Xing, Qijun Feng, Haoran Chen, Qi Dai, Han Hu, Hang Xu, Zuxuan Wu and Yu-Gang Jiang arXiv 2023. [Paper]
16 Oct 2023

State of the Art on Diffusion Models for Visual Computing
Ryan Po, Wang Yifan, Vladislav Golyanik, Kfir Aberman, Jonathan T. Barron, Amit H. Bermano, Eric Ryan Chan, Tali Dekel, Aleksander Holynski, Angjoo Kanazawa, C. Karen Liu, Lingjie Liu, Ben Mildenhall, Matthias Nießner, Björn Ommer, Christian Theobalt, Peter Wonka, Gordon Wetzstein
arXiv 2023. [Paper]
11 Oct 2023

Memory in Plain Sight: A Survey of the Uncanny Resemblances between Diffusion Models and Associative Memories
Benjamin Hoover, Hendrik Strobelt, Dmitry Krotov, Judy Hoffman, Zsolt Kira, Duen Horng Chau
arXiv 2023. [Paper]
28 Sep 2023

A Survey of Diffusion Based Image Generation Models: Issues and Their Solutions
Tianyi Zhang, Zheng Wang, Jing Huang, Mohiuddin Muhammad Tasnim, Wei Shi
arXiv 2023. [Paper]
25 Aug 2023

Diffusion Models for Image Restoration and Enhancement – A Comprehensive Survey
Xin Li, Yulin Ren, Xin Jin, Cuiling Lan, Xingrui Wang, Wenjun Zeng, Xinchao Wang, Zhibo Chen
arXiv 2023. [Paper]
18 Aug 2023

A Comprehensive Survey on Generative Diffusion Models for Structured Data
Heejoon Koo, To Eun Kim
arXiv 2023. [Paper]
7 Jun 2023

On the Design Fundamentals of Diffusion Models: A Survey
Ziyi Chang, George A. Koulieris, Hubert P. H. Shum
arXiv 2023. [Paper]
7 Jun 2023

Diffusion Models in NLP: A Survey
Hao Zou, Zae Myung Kim, Dongyeop Kang
arXiv 2023. [Paper]
24 May 2023

Diffusion Models for Time Series Applications: A Survey
Lequan Lin, Zhengkun Li, Ruikun Li, Xuliang Li, Junbin Gao
arXiv 2023. [Paper]
1 May 2023

A Comprehensive Survey on Knowledge Distillation of Diffusion Models
Weijian Luo
arXiv 2023. [Paper]
9 Apr 2023

A Survey on Graph Diffusion Models: Generative AI in Science for Molecule, Protein and Material
Mengchun Zhang, Maryam Qamar, Taegoo Kang, Yuna Jung, Chenshuang Zhang, Sung-Ho Bae, Chaoning Zhang
arXiv 2023. [Paper]
4 Apr 2023

Audio Diffusion Model for Speech Synthesis: A Survey on Text To Speech and Speech Enhancement in Generative AI
Chenshuang Zhang, Chaoning Zhang, Sheng Zheng, Mengchun Zhang, Maryam Qamar, Sung-Ho Bae, In So Kweon
arXiv 2023. [Paper]
23 Mar 2023

Diffusion Models in NLP: A Survey
Yuansong Zhu, Yu Zhao
arXiv 2023. [Paper]
14 Mar 2023

Text-to-image Diffusion Model in Generative AI: A Survey
Chenshuang Zhang, Chaoning Zhang, Mengchun Zhang, In So Kweon
arXiv 2023. [Paper]
14 Mar 2023

Diffusion Models for Non-autoregressive Text Generation: A Survey
Yifan Li, Kun Zhou, Wayne Xin Zhao, Ji-Rong Wen
arXiv 2023. [Paper]
12 Mar 2023

Diffusion Models in Bioinformatics: A New Wave of Deep Learning Revolution in Action
Zhiye Guo, Jian Liu, Yanli Wang, Mengrui Chen, Duolin Wang, Dong Xu, Jianlin Cheng
arXiv 2023. [Paper]
13 Feb 2023

Generative Diffusion Models on Graphs: Methods and Applications
Wenqi Fan, Chengyi Liu, Yunqing Liu, Jiatong Li, Hang Li, Hui Liu, Jiliang Tang, Qing Li
arXiv 2023. [Paper]
6 Feb 2023

Diffusion Models for Medical Image Analysis: A Comprehensive Survey
Amirhossein Kazerouni, Ehsan Khodapanah Aghdam, Moein Heidari, Reza Azad, Mohsen Fayyaz, Ilker Hacihaliloglu, Dorit Merhof
arXiv 2022. [Paper] [Github]
14 Nov 2022

Efficient Diffusion Models for Vision: A Survey
Anwaar Ulhaq, Naveed Akhtar, Ganna Pogrebna
arXiv 2022. [Paper]
7 Oct 2022

Diffusion Models in Vision: A Survey
Florinel-Alin Croitoru, Vlad Hondru, Radu Tudor Ionescu, Mubarak Shah
arXiv 2022. [Paper]
10 Sep 2022

A Survey on Generative Diffusion Model
Hanqun Cao, Cheng Tan, Zhangyang Gao, Guangyong Chen, Pheng-Ann Heng, Stan Z. Li
arXiv 2022. [Paper]
6 Sep 2022

Diffusion Models: A Comprehensive Survey of Methods and Applications
Ling Yang, Zhilong Zhang, Shenda Hong, Wentao Zhang
arXiv 2022. [Paper]
2 Sep 2022

Vision

Generation

DiffEnc: Variational Diffusion with a Learned Encoder
Beatrix M. G. Nielsen, Anders Christensen, Andrea Dittadi, Ole Winther
arXiv 2023. [Paper]
30 Oct 2023

Upgrading VAE Training With Unlimited Data Plans Provided by Diffusion Models
Tim Z. Xiao, Johannes Zenn, Robert Bamler
arXiv 2023. [Paper]
30 Oct 2023

Successfully Applying Lottery Ticket Hypothesis to Diffusion Model
Chao Jiang, Bo Hui, Bohan Liu, Da Yan
arXiv 2023. [Paper]
28 Oct 2023

Noise-Free Score Distillation
Oren Katzir, Or Patashnik, Daniel Cohen-Or, Dani Lischinski
arXiv 2023. [Paper]
26 Oct 2023

The statistical thermodynamics of generative diffusion models
Luca Ambrogioni
arXiv 2023. [Paper]
26 Oct 2023

Improving Denoising Diffusion Models via Simultaneous Estimation of Image and Noise
Zhenkai Zhang, Krista A. Ehinger, Tom Drummond
arXiv 2023. [Paper]
26 Oct 2023

Hierarchical Semi-Implicit Variational Inference with Application to Diffusion Model Acceleration
Longlin Yu, Tianyu Xie, Yu Zhu, Tong Yang, Xiangyu Zhang, Cheng Zhang
arXiv 2023. [Paper] [Github]
26 Oct 2023

RePoseDM: Recurrent Pose Alignment and Gradient Guidance for Pose Guided Image Synthesis
Anant Khandelwal
arXiv 2023. [Paper]
24 Oct 2023

Improved Techniques for Training Consistency Models
Yang Song, Prafulla Dhariwal
arXiv 2023. [Paper]
22 Oct 2023

ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection
Zhongzhan Huang, Pan Zhou, Shuicheng Yan, Liang Lin
NeurIPS 2023. [Paper] [Github]
20 Oct 2023

Particle Guidance: non-I.I.D. Diverse Sampling with Diffusion Models
Gabriele Corso, Yilun Xu, Valentin de Bortoli, Regina Barzilay, Tommi Jaakkola
arXiv 2023. [Paper] [Github]
19 Oct 2023

Closed-Form Diffusion Models
Christopher Scarvelis, Haitz Sáez de Ocáriz Borde, Justin Solomon
arXiv 2023. [Paper]
19 Oct 2023

Elucidating The Design Space of Classifier-Guided Diffusion Generation
Jiajun Ma, Tianyang Hu, Wenjia Wang, Jiacheng Sun
arXiv 2023. [Paper] [Github]
17 Oct 2023

BayesDiff: Estimating Pixel-wise Uncertainty in Diffusion via Bayesian Inference
Siqi Kou, Lei Gan, Dequan Wang, Chongxuan Li, Zhijie Deng
arXiv 2023. [Paper]
17 Oct 2023

Unsupervised Discovery of Interpretable Directions in h-space of Pre-trained Diffusion Models
Zijian Zhang, Luping Liu. Zhijie Lin, Yichen Zhu, Zhou Zhao
arXiv 2023. [Paper]
15 Oct 2023

Towards More Accurate Diffusion Model Acceleration with A Timestep Aligner
Mengfei Xia, Yujun Shen, Changsong Lei, Yu Zhou, Ran Yi, Deli Zhao, Wenping Wang, Yong-jin Liu
arXiv 2023. [Paper]
14 Oct 2023

Unseen Image Synthesis with Diffusion Models
Ye Zhu, Yu Wu, Zhiwei Deng, Olga Russakovsky, Yan Yan
arXiv 2023. [Paper]
13 Oct 2023

Debias the Training of Diffusion Models
Hu Yu, Li Shen, Jie Huang, Man Zhou, Hongsheng Li, Feng Zhao
arXiv 2023. [Paper]
12 Oct 2023

Neural Diffusion Models
Grigory Bartosh, Dmitry Vetrov, Christian A. Naesseth
arXiv 2023. [Paper]
12 Oct 2023

Efficient Integrators for Diffusion Generative Models
Kushagra Pandey, Maja Rudolph, Stephan Mandt
arXiv 2023. [Paper]
11 Oct 2023

Learning Stackable and Skippable LEGO Bricks for Efficient, Reconfigurable, and Variable-Resolution Diffusion Modeling
Huangjie Zheng, Zhendong Wang, Jianbo Yuan, Guanghan Ning, Pengcheng He, Quanzeng You, Hongxia Yang, Mingyuan Zhou
arXiv 2023. [Paper]
10 Oct 2023

Language Model Beats Diffusion – Tokenizer is Key to Visual Generation
Lijun Yu, José Lezama, Nitesh B. Gundavarapu, Luca Versari, Kihyuk Sohn, David Minnen, Yong Cheng, Agrim Gupta, Xiuye Gu, Alexander G. Hauptmann, Boqing Gong, Ming-Hsuan Yang, Irfan Essa, David A. Ross, Lu Jiang
arXiv 2023. [Paper] [Github]
9 Oct 2023

The Emergence of Reproducibility and Consistency in Diffusion Models
Huijie Zhang, Jinfan Zhou, Yifu Lu, Minzhe Guo, Liyue Shen, Qing Qu
arXiv 2023. [Paper]
8 Oct 2023

DiffNAS: Bootstrapping Diffusion Models by Prompting for Better Architectures
Wenhao Li, Xiu Su, Shan You, Fei Wang, Chen Qian, Chang Xu
arXiv 2023. [Paper]
7 Oct 2023

Observation-Guided Diffusion Probabilistic Models
Junoh Kang, Jinyoung Choi, Sungik Choi, Bohyung Han
arXiv 2023. [Paper]
6 Oct 2023

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
Simian Luo, Yiqin Tan, Longbo Huang, Jian Li, Hang Zhao
arXiv 2023. [Paper]
6 Oct 2023

Denoising Diffusion Step-aware Models
Shuai Yang, Yukang Chen, Luozhou Wang, Shu Liu, Yingcong Chen
arXiv 2023. [Paper]
5 Oct 2023

EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models
Yefei He, Jing Liu, Weijia Wu, Hong Zhou, Bohan Zhuang
arXiv 2023. [Paper]
5 Oct 2023

Learning Energy-Based Prior Model with Diffusion-Amortized MCMC
Peiyu Yu, Yaxuan Zhu, Sirui Xie, Xiaojian Ma, Ruiqi Gao, Song-Chun Zhu, Ying Nian Wu
NeurIPS 2023. [Paper] [Github]
5 Oct 2023

On Memorization in Diffusion Models
Xiangming Gu, Chao Du, Tianyu Pang, Chongxuan Li, Min Lin, Ye Wang
arXiv 2023. [Paper] [Github]
4 Oct 2023

Sequential Data Generation with Groupwise Diffusion Process
Sangyun Lee, Gayoung Lee, Hyunsu Kim, Junho Kim, Youngjung Uh
arXiv 2023. [Paper]
2 Oct 2023

Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion
Dongjun Kim, Chieh-Hsin Lai, Wei-Hsiang Liao, Naoki Murata, Yuhta Takida, Toshimitsu Uesaka, Yutong He, Yuki Mitsufuji, Stefano Ermon
arXiv 2023. [Paper]
1 Oct 2023

Completing Visual Objects via Bridging Generation and Segmentation
Xiang Li, Yinpeng Chen, Chung-Ching Lin, Rita Singh, Bhiksha Raj, Zicheng Liu
arXiv 2023. [Paper]
1 Oct 2023

Decoding Realistic Images from Brain Activity with Contrastive Self-supervision and Latent Diffusion
Jingyuan Sun, Mingxiao Li, Marie-Francine Moens
arXiv 2023. [Paper]
30 Sep 2023

FashionFlow: Leveraging Diffusion Models for Dynamic Fashion Video Synthesis from Static Imagery
Tasin Islam, Alina Miron, XiaoHui Liu, Yongmin Li
arXiv 2023. [Paper]
29 Sep 2023

Denoising Diffusion Bridge Models
Linqi Zhou, Aaron Lou, Samar Khanna, Stefano Ermon
arXiv 2023. [Paper]
29 Sep 2023

DeeDiff: Dynamic Uncertainty-Aware Early Exiting for Accelerating Diffusion Model Generation
Shengkun Tang, Yaqing Wang, Caiwen Ding, Yi Liang, Yao Li, Dongkuan Xu
arXiv 2023. [Paper]
29 Sep 2023

Distilling ODE Solvers of Diffusion Models into Smaller Steps
Sanghwan Kim, Hao Tang, Fisher Yu
arXiv 2023. [Paper]
28 Sep 2023

Factorized Diffusion Architectures for Unsupervised Image Generation and Segmentation
Xin Yuan, Michael Maire
arXiv 2023. [Paper]
27 Sep 2023

Generative Escher Meshes
Noam Aigerman, Thibault Groueix
arXiv 2023. [Paper]
25 Sep 2023

Soft Mixture Denoising: Beyond the Expressive Bottleneck of Diffusion Models
Yangming Li, Boris van Breugel, Mihaela van der Schaar
arXiv 2023. [Paper]
25 Sep 2023

GLOBER: Coherent Non-autoregressive Video Generation via GLOBal Guided Video DecodER
Mingzhen Sun, Weining Wang, Zihan Qin, Jiahui Sun, Sihan Chen, Jing Liu
arXiv 2023. [Paper] [Github]
23 Sep 2023

Score Mismatching for Generative Modeling
Senmao Ye, Fei Liu
arXiv 2023. [Paper]
20 Sep 2023

Generalised Probabilistic Diffusion Scale-Spaces
Pascal Peter
arXiv 2023. [Paper]
15 Sep 2023

Generative Image Dynamics
Zhengqi Li, Richard Tucker, Noah Snavely, Aleksander Holynski
arXiv 2023. [Paper] [Project]
14 Sep 2023

Beta Diffusion
Mingyuan Zhou, Tianqi Chen, Zhendong Wang, Huangjie Zheng
NeurIPS 2023. [Paper]
14 Sep 2023

Adapt and Diffuse: Sample-adaptive Reconstruction via Latent Diffusion Models
Zalan Fabian, Berk Tinaz, Mahdi Soltanolkotabi
arXiv 2023. [Paper]
12 Sep 2023

Elucidating the solution space of extended reverse-time SDE for diffusion models
Qinpeng Cui, Xinyi Zhang, Zongqing Lu, Qingmin Liao
arXiv 2023. [Paper]
12 Sep 2023

Learning Energy-Based Models by Cooperative Diffusion Recovery Likelihood
Yaxuan Zhu, Jianwen Xie, Yingnian Wu, Ruiqi Gao
arXiv 2023. [Paper]
10 Sep 2023

Relay Diffusion: Unifying diffusion process across resolutions for image synthesis
Jiayan Teng, Wendi Zheng, Ming Ding, Wenyi Hong, Jianqiao Wangni, Zhuoyi Yang, Jie Tang
arXiv 2023. [Paper]
4 Sep 2023

Gradient Domain Diffusion Models for Image Synthesis
Yuanhao Gong
arXiv 2023. [Paper]
5 Sep 2023

Hierarchical Masked 3D Diffusion Model for Video Outpainting
Fanda Fan, Chaoxu Guo, Litong Gong, Biao Wang, Tiezheng Ge, Yuning Jiang, Chunjie Luo, Jianfeng Zhan
arXiv 2023. [Paper] [Github]
5 Sep 2023

Diffusion Models with Deterministic Normalizing Flow Priors
Mohsen Zand, Ali Etemad, Michael Greenspan
arXiv 2023. [Paper] [Github]
3 Sep 2023

Diffusion Inertial Poser: Human Motion Reconstruction from Arbitrary Sparse IMU Configurations
Tom Van Wouwe, Seunghwan Lee, Antoine Falisse, Scott Delp, C. Karen Liu
AAAI 2024. [Paper]
31 Aug 2023

Conditioning Score-Based Generative Models by Neuro-Symbolic Constraints
Davide Scassola, Sebastiano Saccani, Ginevra Carbone, Luca Bortolussi
arXiv 2023. [Paper]
31 Aug 2023

Elucidating the Exposure Bias in Diffusion Models
Mang Ning, Mingxiao Li, Jianlin Su, Albert Ali Salah, Itir Onal Ertugrul
arXiv 2023. [Paper]
29 Aug 2023

Residual Denoising Diffusion Models
Jiawei Liu, Qiang Wang, Huijie Fan, Yinong Wang, Yandong Tang, Liangqiong Qu
arXiv 2023. [Paper] [Github]
25 Aug 2023

Efficient Transfer Learning in Diffusion Models via Adversarial Noise
Xiyu Wang, Baijiong Lin, Daochang Liu, Chang Xu
arXiv 2023. [Paper]
23 Aug 2023

Boosting Diffusion Models with an Adaptive Momentum Sampler
Xiyu Wang, Anh-Dung Dinh, Daochang Liu, Chang Xu
arXiv 2023. [Paper]
23 Aug 2023

Make-It-4D: Synthesizing a Consistent Long-Term Dynamic Scene Video from a Single Image
Liao Shen, Xingyi Li, Huiqiang Sun, Juewen Peng, Ke Xian, Zhiguo Cao, Guosheng Lin
ACM MM 2023. [Paper]
20 Aug 2023

Spiking-Diffusion: Vector Quantized Discrete Diffusion Model with Spiking Neural Networks
Mingxuan Liu, Rui Wen, Hong Chen
arXiv 2023. [Paper]
20 Aug 2023

SciRE-Solver: Efficient Sampling of Diffusion Probabilistic Models by Score-integrand Solver with Recursive Derivative Estimation
Shigui Li, Wei Chen, Delu Zeng
arXiv 2023. [Paper]
15 Aug 2023

Improved Order Analysis and Design of Exponential Integrator for Diffusion Models Sampling
Qinsheng Zhang, Jiaming Song, Yongxin Chen
arXiv 2023. [Paper]
4 Aug 2023

Patched Denoising Diffusion Models For High-Resolution Image Synthesis
Zheng Ding, Mengqi Zhang, Jiajun Wu, Zhuowen Tu
arXiv 2023. [Paper]
2 Aug 2023

Spatial-Frequency U-Net for Denoising Diffusion Probabilistic Models
Xin Yuan, Linjie Li, Jianfeng Wang, Zhengyuan Yang, Kevin Lin, Zicheng Liu, Lijuan Wang
arXiv 2023. [Paper]
27 Jul 2023

Synthesis of Batik Motifs using a Diffusion – Generative Adversarial Network
One Octadion, Novanto Yudistira, Diva Kurnianingtyas
arXiv 2023. [Paper]
22 Jul 2023

DPM-OT: A New Diffusion Probabilistic Model Based on Optimal Transport
Zezeng Li, ShengHao Li, Zhanpeng Wang, Na Lei, Zhongxuan Luo, Xianfeng Gu
arXiv 2023. [Paper] [Github]
21 Jul 2023

Diffusion Sampling with Momentum for Mitigating Divergence Artifacts
Suttisak Wizadwongsa, Worameth Chinchuthakun, Pramook Khungurn, Amit Raj, Supasorn Suwajanakorn
arXiv 2023. [Paper]
20 Jul 2023

Flow Matching in Latent Space
Quan Dao, Hao Phung, Binh Nguyen, Anh Tran
arXiv 2023. [Paper] [Project]
17 Jul 2023

Manifold-Guided Sampling in Diffusion Models for Unbiased Image Generation
Xingzhe Su, Wenwen Qiang, Zeen Song, Hang Gao, Fengge Wu, Changwen Zheng
arXiv 2023. [Paper]
17 Jul 2023

Complexity Matters: Rethinking the Latent Space for Generative Modeling
Tianyang Hu, Fei Chen, Haonan Wang, Jiawei Li, Wenjia Wang, Jiacheng Sun, Zhenguo Li
arXiv 2023. [Paper]
17 Jul 2023

Collaborative Score Distillation for Consistent Visual Synthesis
Subin Kim, Kyungmin Lee, June Suk Choi, Jongheon Jeong, Kihyuk Sohn, Jinwoo Shin
arXiv 2023. [Paper] [Project] [Github]
4 Jul 2023

ProtoDiffusion: Classifier-Free Diffusion Guidance with Prototype Learning
Gulcin Baykal, Halil Faruk Karagoz, Taha Binhuraib, Gozde Unal
arXiv 2023. [Paper]
4 Jul 2023

SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
Dustin Podell, Zion English, Kyle Lacey, Andreas Blattmann, Tim Dockhorn, Jonas Müller, Joe Penna, Robin Rombach
arXiv 2023. [Paper] [Github]
4 Jul 2023

Bidirectional Temporal Diffusion Model for Temporally Consistent Human Animation
Tserendorj Adiya, Sanghun Kim, Jung Eun Lee, Jae Shin Yoon, Hwasup Lim
arXiv 2023. [Paper]
2 Jul 2023

Spiking Denoising Diffusion Probabilistic Models
Jiahang Cao, Ziqing Wang, Hanzhong Guo, Hao Cheng, Qiang Zhang, Renjing Xu
arXiv 2023. [Paper]
29 Jun 2023

DomainStudio: Fine-Tuning Diffusion Models for Domain-Driven Image Generation using Limited Data
Jingyuan Zhu, Huimin Ma, Jiansheng Chen, Jian Yuan
arXiv 2023. [Paper]
25 Jun 2023

Decoupled Diffusion Models with Explicit Transition Probability
Yuhang Huang, Zheng Qin, Xinwang Liu, Kai Xu
arXiv 2023. [Paper]
23 Jun 2023

Continuous Layout Editing of Single Images with Diffusion Models
Zhiyuan Zhang, Zhitong Huang, Jing Liao
arXiv 2023. [Paper]
22 Jun 2023

Semi-Implicit Denoising Diffusion Models (SIDDMs)
Yanwu Xu, Mingming Gong, Shaoan Xie, Wei Wei, Matthias Grundmann, kayhan Batmanghelich, Tingbo Hou
arXiv 2023. [Paper]
21 Jun 2023

Eliminating Lipschitz Singularities in Diffusion Models
Zhantao Yang, Ruili Feng, Han Zhang, Yujun Shen, Kai Zhu, Lianghua Huang, Yifei Zhang, Yu Liu, Deli Zhao, Jingren Zhou, Fan Cheng
arXiv 2023. [Paper]
20 Jun 2023

GD-VDM: Generated Depth for better Diffusion-based Video Generation
Ariel Lapid, Idan Achituve, Lior Bracha, Ethan Fetaya
arXiv 2023. [Paper]
19 Jun 2023

Image Harmonization with Diffusion Model
Jiajie Li, Jian Wang, Chen Wang, Jinjun Xiong
arXiv 2023. [Paper]
17 Jun 2023

Training Diffusion Classifiers with Denoising Assistance
Chandramouli Sastry, Sri Harsha Dumpala, Sageev Oore
arXiv 2023. [Paper]
15 Jun 2023

Conditional Human Sketch Synthesis with Explicit Abstraction Control
Dar-Yen Chen
arXiv 2023. [Paper]
15 Jun 2023

Fast Training of Diffusion Models with Masked Transformers
Hongkai Zheng, Weili Nie, Arash Vahdat, Anima Anandkumar
arXiv 2023. [Paper] [Github]
15 Jun 2023

Relation-Aware Diffusion Model for Controllable Poster Layout Generation
Fengheng Li, An Liu, Wei Feng, Honghe Zhu, Yaoyu Li, Zheng Zhang, Jingjing Lv, Xin Zhu, Junjie Shen, Zhangang Lin, Jingping Shao
arXiv 2023. [Paper]
15 Jun 2023

OMS-DPM: Optimizing the Model Schedule for Diffusion Probabilistic Models
Enshu Liu, Xuefei Ning, Zinan Lin, Huazhong Yang, Yu Wang
arXiv 2023. [Paper]
15 Jun 2023

DORSal: Diffusion for Object-centric Representations of Scenes $\textit{et al.}$
Allan Jabri, Sjoerd van Steenkiste, Emiel Hoogeboom, Mehdi S. M. Sajjadi, Thomas Kipf
arXiv 2023. [Paper]
13 Jun 2023

Fast Diffusion Model
Zike Wu, Pan Zhou, Kenji Kawaguchi, Hanwang Zhang
arXiv 2023. [Paper] [Github]
12 Jun 2023

ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process
Changyao Tian, Chenxin Tao, Jifeng Dai, Hao Li, Ziheng Li, Lewei Lu, Xiaogang Wang, Hongsheng Li, Gao Huang, Xizhou Zhu
arXiv 2023. [Paper]
8 Jun 2023

Multi-Architecture Multi-Expert Diffusion Models
Yunsung Lee, Jin-Young Kim, Hyojun Go, Myeongho Jeong, Shinhyeok Oh, Seungtaek Choi
arXiv 2023. [Paper]
8 Jun 2023

Interpreting and Improving Diffusion Models Using the Euclidean Distance Function
Frank Permenter, Chenyang Yuan
arXiv 2023. [Paper]
8 Jun 2023

Video Diffusion Models with Local-Global Context Guidance
Siyuan Yang, Lu Zhang, Yu Liu, Zhizhuo Jiang, You He
IJCAI 2023. [Paper] [Github]
5 Jun 2023

Brain Diffusion for Visual Exploration: Cortical Discovery using Large Scale Generative Models
Andrew F. Luo, Margaret M. Henderson, Leila Wehbe, Michael J. Tarr
arXiv 2023. [Paper]
5 Jun 2023

Faster Training of Diffusion Models and Improved Density Estimation via Parallel Score Matching
Etrit Haxholli, Marco Lorenzi
arXiv 2023. [Paper]
5 Jun 2023

Temporal Dynamic Quantization for Diffusion Models
Junhyuk So, Jungwon Lee, Daehyun Ahn, Hyungjun Kim, Eunhyeok Park
arXiv 2023. [Paper]
4 Jun 2023

Conditional Generation from Unconditional Diffusion Models using Denoiser Representations
Alexandros Graikos, Srikar Yellapragada, Dimitris Samaras
BMVC 2023. [Paper] [Github]
2 Jun 2023

Conditioning Diffusion Models via Attributes and Semantic Masks for Face Generation
Nico Giambi, Giuseppe Lisanti
arXiv 2023. [Paper]
1 Jun 2023

Differential Diffusion: Giving Each Pixel Its Strength
Eran Levin, Ohad Fried
arXiv 2023. [Paper]
1 Jun 2023

Addressing Discrepancies in Semantic and Visual Alignment in Neural Networks
Natalie Abreu, Nathan Vaska, Victoria Helus
arXiv 2023. [Paper]
1 Jun 2023

Addressing Negative Transfer in Diffusion Models
Hyojun Go, JinYoung Kim, Yunsung Lee, Seunghyun Lee, Shinhyeok Oh, Hyeongdon Moon, Seungtaek Choi
arXiv 2023. [Paper]
1 Jun 2023

A Geometric Perspective on Diffusion Models
Defang Chen, Zhenyu Zhou, Jian-Ping Mei, Chunhua Shen, Chun Chen, Can Wang
arXiv 2023. [Paper]
31 May 2023

Spontaneous symmetry breaking in generative diffusion models
Gabriel Raya, Luca Ambrogioni
arXiv 2023. [Paper]
31 May 2023

Perturbation-Assisted Sample Synthesis: A Novel Approach for Uncertainty Quantification
Yifei Liu, Rex Shen, Xiaotong Shen
arXiv 2023. [Paper]
30 May 2023

One-Line-of-Code Data Mollification Improves Optimization of Likelihood-based Generative Models
Ba-Hien Tran, Giulio Franzese, Pietro Michiardi, Maurizio Filippone
arXiv 2023. [Paper]
30 May 2023

Ambient Diffusion: Learning Clean Distributions from Corrupted Data
Giannis Daras, Kulin Shah, Yuval Dagan, Aravind Gollakota, Alexandros G. Dimakis, Adam Klivans
arXiv 2023. [Paper]
30 May 2023

Towards Accurate Data-free Quantization for Diffusion Models
Changyuan Wang, Ziwei Wang, Xiuwei Xu, Yansong Tang, Jie Zhou, Jiwen Lu
arXiv 2023. [Paper]
30 May 2023

BRIGHT: Bi-level Feature Representation of Image Collections using Groups of Hash Tables
Dingdong Yang, Yizhi Wang, Ali Mahdavi-Amiri, Hao Zhang
arXiv 2023. [Paper] [Project]
29 May 2023

Diff-Instruct: A Universal Approach for Transferring Knowledge From Pre-trained Diffusion Models
Weijian Luo, Tianyang Hu, Shifeng Zhang, Jiacheng Sun, Zhenguo Li, Zhihua Zhang
arXiv 2023. [Paper]
29 May 2023

Learning to Jump: Thinning and Thickening Latent Counts for Generative Modeling
Tianqi Chen, Mingyuan Zhou
ICML 2023. [Paper] [Github]
28 May 2023

Reconstructing the Mind’s Eye: fMRI-to-Image with Contrastive Learning and Diffusion Priors
Paul S. Scotti, Atmadeep Banerjee, Jimmie Goode, Stepan Shabalin, Alex Nguyen, Ethan Cohen, Aidan J. Dempster, Nathalie Verlinde, Elad Yundler, David Weisberg, Kenneth A. Norman, Tanishq Mathew Abraham
arXiv 2023. [Paper] [Github]
29 May 2023

Contrast, Attend and Diffuse to Decode High-Resolution Images from Brain Activities
Jingyuan Sun, Mingxiao Li, Zijiao Chen, Yunhao Zhang, Shaonan Wang, Marie-Francine Moens
arXiv 2023. [Paper]
26 May 2023

Parallel Sampling of Diffusion Models
Andy Shih, Suneel Belkhale, Stefano Ermon, Dorsa Sadigh, Nima Anari
arXiv 2023. [Paper] [Github]
25 May 2023

Trans-Dimensional Generative Modeling via Jump Diffusion Models
Andrew Campbell, William Harvey, Christian Weilbach, Valentin De Bortoli, Tom Rainforth, Arnaud Doucet
arXiv 2023. [Paper]
25 May 2023

UDPM: Upsampling Diffusion Probabilistic Models
Shady Abu-Hussein, Raja Giryes
arXiv 2023. [Paper]
25 May 2023

Unifying GANs and Score-Based Diffusion as Generative Particle Models
Jean-Yves Franceschi, Mike Gartrell, Ludovic Dos Santos, Thibaut Issenhuth, Emmanuel de Bézenac, Mickaël Chen, Alain Rakotomamonjy
arXiv 2023. [Paper]
25 May 2023

DuDGAN: Improving Class-Conditional GANs via Dual-Diffusion
Taesun Yeom, Minhyeok Lee
arXiv 2023. [Paper]
24 May 2023

Alleviating Exposure Bias in Diffusion Models through Sampling with Shifted Time Steps
Mingxiao Li, Tingyu Qu, Wei Sun, Marie-Francine Moens
arXiv 2023. [Paper]
24 May 2023

Robust Classification via a Single Diffusion Model
Huanran Chen, Yinpeng Dong, Zhengyi Wang, Xiao Yang, Chengqi Duan, Hang Su, Jun Zhu
arXiv 2023. [Paper]
24 May 2023

On the Generalization of Diffusion Model
Mingyang Yi, Jiacheng Sun, Zhenguo Li
arXiv 2023. [Paper]
24 May 2023

VDT: An Empirical Study on Video Diffusion with Transformers
Haoyu Lu, Guoxing Yang, Nanyi Fei, Yuqi Huo, Zhiwu Lu, Ping Luo, Mingyu Ding
arXiv 2023. [Paper] [Github]
22 May 2023

Cinematic Mindscapes: High-quality Video Reconstruction from Brain Activity
Zijiao Chen, Jiaxin Qing, Juan Helen Zhou
arXiv 2023. [Paper] [Project]
19 May 2023

PTQD: Accurate Post-Training Quantization for Diffusion Models
Yefei He, Luping Liu, Jing Liu, Weijia Wu, Hong Zhou, Bohan Zhuang
arXiv 2023. [Paper]
18 May 2023

Blackout Diffusion: Generative Diffusion Models in Discrete-State Spaces
Javier E Santos, Zachary R. Fox, Nicholas Lubbers, Yen Ting Lin
arXiv 2023. [Paper]
18 May 2023

Structural Pruning for Diffusion Models
Gongfan Fang, Xinyin Ma, Xinchao Wang
arXiv 2023. [Paper] [Github]
18 May 2023

Catch-Up Distillation: You Only Need to Train Once for Accelerating Sampling
Shitong Shao, Xu Dai, Shouyi Yin, Lujun Li, Huanran Chen, Yang Hu
arXiv 2023. [Paper]
18 May 2023

Controllable Mind Visual Diffusion Model
Bohan Zeng, Shanglin Li, Xuhui Liu, Sicheng Gao, Xiaolong Jiang, Xu Tang, Yao Hu, Jianzhuang Liu, Baochang Zhang
arXiv 2023. [Paper]
17 May 2023

Analyzing Bias in Diffusion-based Face Generation Models
Malsha V. Perera, Vishal M. Patel
arXiv 2023. [Paper]
10 May 2023

Improved Techniques for Maximum Likelihood Estimation for Diffusion ODEs
Kaiwen Zheng, Cheng Lu, Jianfei Chen, Jun Zhu
ICML 2023. [Paper]
6 May 2023

LEO: Generative Latent Image Animator for Human Video Synthesis
Yaohui Wang, Xin Ma, Xinyuan Chen, Antitza Dantcheva, Bo Dai, Yu Qiao
arXiv 2023. [Paper] [Project] [Github]
6 May 2023

Iterative α-(de)Blending: a Minimalist Deterministic Diffusion Model
Eric Heitz, Laurent Belcour, Thomas Chambon
SIGGRAPH 2023. [Paper]
5 May 2023

Reconstructing seen images from human brain activity via guided stochastic search
Reese Kneeland, Jordyn Ojeda, Ghislain St-Yves, Thomas Naselaris
arXiv 2023. [Paper]
30 Apr 2023

Motion-Conditioned Diffusion Model for Controllable Video Synthesis
Tsai-Shien Chen, Chieh Hubert Lin, Hung-Yu Tseng, Tsung-Yi Lin, Ming-Hsuan Yang
arXiv 2023. [Paper] [Project]
27 Apr 2023

Score-based Generative Modeling Through Backward Stochastic Differential Equations: Inversion and Generation
Zihao Wang
arXiv 2023. [Paper]
26 Apr 2023

Exploring Compositional Visual Generation with Latent Classifier Guidance
Changhao Shi, Haomiao Ni, Kai Li, Shaobo Han, Mingfu Liang, Martin Renqiang Min
CVPR Workshop 2023. [Paper]
25 Apr 2023

Patch Diffusion: Faster and More Data-Efficient Training of Diffusion Models
Zhendong Wang, Yifan Jiang, Huangjie Zheng, Peihao Wang, Pengcheng He, Zhangyang Wang, Weizhu Chen, Mingyuan Zhou
arXiv 2023. [Paper]
25 Apr 2023

Variational Diffusion Auto-encoder: Deep Latent Variable Model with Unconditional Diffusion Prior
Georgios Batzolis, Jan Stanczuk, Carola-Bibiane Schönlieb
arXiv 2023. [Paper]
24 Apr 2023

LaMD: Latent Motion Diffusion for Video Generation
Yaosi Hu, Zhenzhong Chen, Chong Luo
arXiv 2023. [Paper]
23 Apr 2023

Lookahead Diffusion Probabilistic Models for Refining Mean Estimation
Guoqiang Zhang, Niwa Kenta, W. Bastiaan Kleijn
CVPR 2023. [Paper] [Github]
22 Apr 2023

NeuralField-LDM: Scene Generation with Hierarchical Latent Diffusion Models
Seung Wook Kim, Bradley Brown, Kangxue Yin, Karsten Kreis, Katja Schwarz, Daiqing Li, Robin Rombach, Antonio Torralba, Sanja Fidler
CVPR 2023. [Paper]
19 Apr 2023

Attributing Image Generative Models using Latent Fingerprints
Guangyu Nie, Changhoon Kim, Yezhou Yang, Yi Ren
arXiv 2023. [Paper]
17 Apr 2023

Identity Encoder for Personalized Diffusion
Yu-Chuan Su, Kelvin C.K. Chan, Yandong Li, Yang Zhao, Han Zhang, Boqing Gong, Huisheng Wang, Xuhui Jia
arXiv 2023. [Paper]
14 Apr 2023

Memory Efficient Diffusion Probabilistic Models via Patch-based Generation
Shinei Arakawa, Hideki Tsunashima, Daichi Horita, Keitaro Tanaka, Shigeo Morishima
arXiv 2023. [Paper]
14 Apr 2023

DCFace: Synthetic Face Generation with Dual Condition Diffusion Model
Minchul Kim, Feng Liu, Anil Jain, Xiaoming Liu
arXiv 2023. [Paper] [Github]
14 Apr 2023

DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-Efficient Fine-Tuning
Enze Xie, Lewei Yao, Han Shi, Zhili Liu, Daquan Zhou, Zhaoqiang Liu, Jiawei Li, Zhenguo Li
arXiv 2023. [Paper]
13 Apr 2023

RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment
Hanze Dong, Wei Xiong, Deepanshu Goyal, Rui Pan, Shizhe Diao, Jipeng Zhang, Kashun Shum, Tong Zhang
arXiv 2023. [Paper]
13 Apr 2023

DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion
Johanna Karras, Aleksander Holynski, Ting-Chun Wang, Ira Kemelmacher-Shlizerman
arXiv 2023. [Paper] [Project] [Github]
12 Apr 2023

Reflected Diffusion Models
Aaron Lou, Stefano Ermon
ICML 2023. [Paper] [Project] [Github]
10 Apr 2023

Binary Latent Diffusion
Ze Wang, Jiang Wang, Zicheng Liu, Qiang Qiu
arXiv 2023. [Paper]
10 Apr 2023

Diffusion Models as Masked Autoencoders
Chen Wei, Karttikeya Mangalam, Po-Yao Huang, Yanghao Li, Haoqi Fan, Hu Xu, Huiyu Wang, Cihang Xie, Alan Yuille, Christoph Feichtenhofer
arXiv 2023. [Paper] [Project]
6 Apr 2023

Few-shot Semantic Image Synthesis with Class Affinity Transfer
Marlène Careil, Jakob Verbeek, Stéphane Lathuilière
CVPR 2023. [Paper]
5 Apr 2023

EGC: Image Generation and Classification via a Diffusion Energy-Based Model
Qiushan Guo, Chuofan Ma, Yi Jiang, Zehuan Yuan, Yizhou Yu, Ping Luo
arxiv 2023. [Paper] [Project]
4 Apr 2023

Token Merging for Fast Stable Diffusion
Daniel Bolya, Judy Hoffman
arXiv 2023. [Paper] [Github]
30 Mar 2023

A Closer Look at Parameter-Efficient Tuning in Diffusion Models
Chendong Xiang, Fan Bao, Chongxuan Li, Hang Su, Jun Zhu
arXiv 2023. [Paper]
31 Mar 2023

-Diff: Infinite Resolution Diffusion with Subsampled Mollified States
Sam Bond-Taylor, Chris G. Willcocks
arXiv 2023. [Paper]
31 Mar 2023

3D-aware Image Generation using 2D Diffusion Models
Jianfeng Xiang, Jiaolong Yang, Binbin Huang, Xin Tong
arXiv 2023. [Paper] [Project]
31 Mar 2023

Consistent View Synthesis with Pose-Guided Diffusion Models
Hung-Yu Tseng, Qinbo Li, Changil Kim, Suhib Alsisan, Jia-Bin Huang, Johannes Kopf
CVPR 2023. [Paper]
30 Mar 2023

DiffCollage: Parallel Generation of Large Content with Diffusion Models
Qinsheng Zhang, Jiaming Song, Xun Huang, Yongxin Chen, Ming-Yu Liu
CVPR 2023. [Paper] [Project]
30 Mar 2023

Masked Diffusion Transformer is a Strong Image Synthesizer
Shanghua Gao, Pan Zhou, Ming-Ming Cheng, Shuicheng Yan
arXiv 2023. [Paper] [Github]
25 Mar 2023

Conditional Image-to-Video Generation with Latent Flow Diffusion Models
Haomiao Ni, Changhao Shi, Kai Li, Sharon X. Huang, Martin Renqiang Min
CVPR 2023. [Paper] [Github]
24 Mar 2023

NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation
Shengming Yin, Chenfei Wu, Huan Yang, Jianfeng Wang, Xiaodong Wang, Minheng Ni, Zhengyuan Yang, Linjie Li, Shuguang Liu, Fan Yang, Jianlong Fu, Gong Ming, Lijuan Wang, Zicheng Liu, Houqiang Li, Nan Duan
arXiv 2023. [Paper] [Project]
22 Mar 2023

Object-Centric Slot Diffusion
Jindong Jiang, Fei Deng, Gautam Singh, Sungjin Ahn
arXiv 2023. [Paper]
20 Mar 2023

LDMVFI: Video Frame Interpolation with Latent Diffusion Models
Duolikun Danier, Fan Zhang, David Bull
arXiv 2023. [Paper]
16 Mar 2023

Efficient Diffusion Training via Min-SNR Weighting Strategy
Tiankai Hang, Shuyang Gu, Chen Li, Jianmin Bao, Dong Chen, Han Hu, Xin Geng, Baining Guo
arXiv 2023. [Paper]
16 Mar 2023

VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation
CVPR 2023. [Paper]
15 Mar 2023

Interpretable ODE-style Generative Diffusion Model via Force Field Construction
Weiyang Jin, Yongpei Zhu, Yuxi Peng
arXiv 2023. [Paper]
14 Mar 2023

Regularized Vector Quantization for Tokenized Image Synthesis
Jiahui Zhang, Fangneng Zhan, Christian Theobalt, Shijian Lu
arXiv 2023. [Paper]
11 Mar 2023

PARASOL: Parametric Style Control for Diffusion Image Synthesis
Gemma Canet Tarrés, Dan Ruta, Tu Bui, John Collomosse
arXiv 2023. [Paper]
11 Mar 2023

Brain-Diffuser: Natural scene reconstruction from fMRI signals using generative latent diffusion
Furkan Ozcelik, Rufin VanRullen
arXiv 2023. [Paper]
9 Mar 2023

Multilevel Diffusion: Infinite Dimensional Score-Based Diffusion Models for Image Generation
Paul Hagemann, Lars Ruthotto, Gabriele Steidl, Nicole Tianjiao Yang
arXiv 2023. [Paper]
8 Mar 2023

TRACT: Denoising Diffusion Models with Transitive Closure Time-Distillation
David Berthelot, Arnaud Autef, Jierui Lin, Dian Ang Yap, Shuangfei Zhai, Siyuan Hu, Daniel Zheng, Walter Talbott, Eric Gu
arXiv 2023. [Paper]
7 Mar 2023

Generative Diffusions in Augmented Spaces: A Complete Recipe
Kushagra Pandey, Stephan Mandt
arXiv 2023. [Paper]
3 Mar 2023

Consistency Models
Yang Song, Prafulla Dhariwal, Mark Chen, Ilya Sutskever
arXiv 2023. [Paper]
2 Mar 2023

Diffusion Probabilistic Fields
Peiye Zhuang, Samira Abnar, Jiatao Gu, Alex Schwing, Joshua M. Susskind, Miguel Ángel Bautista
ICLR 2023. [Paper]
1 Mar 2023

Unsupervised Discovery of Semantic Latent Directions in Diffusion Models
Yong-Hyun Park, Mingi Kwon, Junghyo Jo, Youngjung Uh
arXiv 2023. [Paper]
24 Feb 2023

Reduce, Reuse, Recycle: Compositional Generation with Energy-Based Diffusion Models and MCMC
Yilun Du, Conor Durkan, Robin Strudel, Joshua B. Tenenbaum, Sander Dieleman, Rob Fergus, Jascha Sohl-Dickstein, Arnaud Doucet, Will Grathwohl
arXiv 2023. [Paper] [Project]
22 Feb 2023

Learning 3D Photography Videos via Self-supervised Diffusion on Single Images
Xiaodong Wang, Chenfei Wu, Shengming Yin, Minheng Ni, Jianfeng Wang, Linjie Li, Zhengyuan Yang, Fan Yang, Lijuan Wang, Zicheng Liu, Yuejian Fang, Nan Duan
arXiv 2023. [Paper]
21 Feb 2023

On Calibrating Diffusion Probabilistic Models
Tianyu Pang, Cheng Lu, Chao Du, Min Lin, Shuicheng Yan, Zhijie Deng
arXiv 2023. [Paper] [Github]
21 Feb 2023

Diffusion Models and Semi-Supervised Learners Benefit Mutually with Few Labels
Zebin You, Yong Zhong, Fan Bao, Jiacheng Sun, Chongxuan Li, Jun Zhu
arXiv 2023. [Paper]
21 Feb 2023

Cross-domain Compositing with Pretrained Diffusion Models
Roy Hachnochi, Mingrui Zhao, Nadav Orzech, Rinon Gal, Ali Mahdavi-Amiri, Daniel Cohen-Or, Amit Haim Bermano
arXiv 2023. [Paper] [Github]
20 Feb 2023

Restoration based Generative Models
Jaemoo Choi, Yesom Park, Myungjoo Kang
arXiv 2023. [Paper]
20 Feb 2023

Consistent Diffusion Models: Mitigating Sampling Drift by Learning to be Consistent
Giannis Daras, Yuval Dagan, Alexandros G. Dimakis, Constantinos Daskalakis
arXiv 2023. [Paper] [Github]
17 Feb 2023

LayoutDiffuse: Adapting Foundational Diffusion Models for Layout-to-Image Generation
Jiaxin Cheng, Xiao Liang, Xingjian Shi, Tong He, Tianjun Xiao, Mu Li
arXiv 2023. [Paper]
16 Feb 2023

Video Probabilistic Diffusion Models in Projected Latent Space
Sihyun Yu, Kihyuk Sohn, Subin Kim, Jinwoo Shin
arXiv 2023. [Paper] [Github]
15 Feb 2023

DiffFaceSketch: High-Fidelity Face Image Synthesis with Sketch-Guided Latent Diffusion Model
Yichen Peng, Chunqi Zhao, Haoran Xie, Tsukasa Fukusato, Kazunori Miyata
arXiv 2023. [Paper]
14 Feb 2023

Where to Diffuse, How to Diffuse, and How to Get Back: Automated Learning for Multivariate Diffusions
Raghav Singhal, Mark Goldstein, Rajesh Ranganath
arXiv 2023. [Paper]
14 Feb 2023

Preconditioned Score-based Generative Models
Li Zhang, Hengyuan Ma, Xiatian Zhu, Jianfeng Feng
arXiv 2023. [Paper] Github]
13 Feb 2023

Star-Shaped Denoising Diffusion Probabilistic Models
Andrey Okhotin, Dmitry Molchanov, Vladimir Arkhipkin, Grigory Bartosh, Aibek Alanov, Dmitry Vetrov
arXiv 2023. [Paper]
10 Feb 2023

UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models
Wenliang Zhao, Lujia Bai, Yongming Rao, Jie Zhou, Jiwen Lu
arXiv 2023. [Paper] [Project] [Github]
9 Feb 2023

Geometry of Score Based Generative Models
Sandesh Ghimire, Jinyang Liu, Armand Comas, Davin Hill, Aria Masoomi, Octavia Camps, Jennifer Dy
arXiv 2023. [Paper]
9 Feb 2023

Q-Diffusion: Quantizing Diffusion Models
Xiuyu Li, Long Lian, Yijiang Liu, Huanrui Yang, Zhen Dong, Daniel Kang, Shanghang Zhang, Kurt Keutzer
arXiv 2023. [Paper]
8 Feb 2023

PFGM++: Unlocking the Potential of Physics-Inspired Generative Models
Yilun Xu, Ziming Liu, Yonglong Tian, Shangyuan Tong, Max Tegmark, Tommi Jaakkola
arXiv 2023. [Paper] [Github]
8 Feb 2023

Long Horizon Temperature Scaling
Andy Shih, Dorsa Sadigh, Stefano Ermon
arXiv 2023. [Paper]
7 Feb 2023

Spatial Functa: Scaling Functa to ImageNet Classification and Generation
Matthias Bauer, Emilien Dupont, Andy Brock, Dan Rosenbaum, Jonathan Schwarz, Hyunjik Kim
arXiv 2023. [Paper]
6 Feb 2023

ShiftDDPMs: Exploring Conditional Diffusion Models by Shifting Diffusion Trajectories
Zijian Zhang, Zhou Zhao, Jun Yu, Qi Tian
AAAI 2023. [Paper]
5 Feb 2023

Divide and Compose with Score Based Generative Models
Sandesh Ghimire, Armand Comas, Davin Hill, Aria Masoomi, Octavia Camps, Jennifer Dy
arXiv 2023. [Paper] [Github]
5 Feb 2023

Stable Target Field for Reduced Variance Score Estimation in Diffusion Models
Yilun Xu, Shangyuan Tong, Tommi Jaakkola
ICLR 2023. [Paper] [Github]
1 Feb 2023

DisDiff: Unsupervised Disentanglement of Diffusion Probabilistic Models
Tao Yang, Yuwang Wang, Yan Lv, Nanning Zheng
NeurIPS 2023. [Paper]
31 Jan 2023

Optimizing DDPM Sampling with Shortcut Fine-Tuning
Ying Fan, Kangwook Lee
arXiv 2023. [Paper]
31 Jan 2023

Learning Data Representations with Joint Diffusion Models
Kamil Deja, Tomasz Trzcinski, Jakub M. Tomczak
arXiv 2023. [Paper]
31 Jan 2023

ERA-Solver: Error-Robust Adams Solver for Fast Sampling of Diffusion Probabilistic Models
Shengmeng Li, Luping Liu, Zenghao Chai, Runnan Li, Xu Tan
arXiv 2023. [Paper]
30 Jan 2023

Don’t Play Favorites: Minority Guidance for Diffusion Models
Soobin Um, Jong Chul Ye
arXiv 2023. [Paper] [Github]
29 Jan 2023

Accelerating Guided Diffusion Sampling with Splitting Numerical Methods
Suttisak Wizadwongsa, Supasorn Suwajanakorn
ICLR 2023. [Paper]
27 Jan 2023

Input Perturbation Reduces Exposure Bias in Diffusion Models
Mang Ning, Enver Sangineto, Angelo Porrello, Simone Calderara, Rita Cucchiara
arXiv 2023. [Paper] [Github]
27 Jan 2023

Minimizing Trajectory Curvature of ODE-based Generative Models
Sangyun Lee, Beomsu Kim, Jong Chul Ye
arXiv 2023. [Paper]
27 Jan 2023

On the Importance of Noise Scheduling for Diffusion Models
Ting Chen
arXiv 2023. [Paper]
26 Jan 2023

simple diffusion: End-to-end diffusion for high resolution images
Emiel Hoogeboom, Jonathan Heek, Tim Salimans
arXiv 2023. [Paper]
26 Jan 2023

Fast Inference in Denoising Diffusion Models via MMD Finetuning
Emanuele Aiello, Diego Valsesia, Enrico Magli
arXiv 2023. [Paper] [Github]
19 Jan 2023

Exploring Transformer Backbones for Image Diffusion Models
Princy Chahal
arXiv 2022. [Paper]
27 Dec 2022

Unsupervised Representation Learning from Pre-trained Diffusion Probabilistic Models
Zijian Zhang, Zhou Zhao, Zhijie Lin
arXiv 2022. [Paper]
26 Dec 2022

Scalable Adaptive Computation for Iterative Generation
Allan Jabri, David Fleet, Ting Chen
arXiv 2022. [Paper]
22 Dec 2022

Hierarchically branched diffusion models for efficient and interpretable multi-class conditional generation
Alex M. Tseng, Tommaso Biancalani, Max Shen, Gabriele Scalia
arXiv 2022. [Paper]
21 Dec 2022

MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
Ludan Ruan, Yiyang Ma, Huan Yang, Huiguo He, Bei Liu, Jianlong Fu, Nicholas Jing Yuan, Qin Jin, Baining Guo
arXiv 2022. [Paper] [Github]
19 Dec 2022

Scalable Diffusion Models with Transformers
William Peebles, Saining Xie
arXiv 2022. [Paper] [Project] [Github]
19 Dec 2022

DAG: Depth-Aware Guidance with Denoising Diffusion Probabilistic Models
Gyeongnyeon Kim, Wooseok Jang, Gyuseong Lee, Susung Hong, Junyoung Seo, Seungryong Kim
arXiv 2022. [Paper] [Project]
17 Dec 2022

Towards Practical Plug-and-Play Diffusion Models
Hyojun Go, Yunsung Lee, Jin-Young Kim, Seunghyun Lee, Myeongho Jeong, Hyun Seung Lee, Seungtaek Choi
arXiv 2022. [Paper]
12 Dec 2022

Semantic Brain Decoding: from fMRI to conceptually similar image reconstruction of visual stimuli
Matteo Ferrante, Tommaso Boccato, Nicola Toschi
arXiv 2022. [Paper]
13 Dec 2022

MAGVIT: Masked Generative Video Transformer
Lijun Yu, Yong Cheng, Kihyuk Sohn, José Lezama, Han Zhang, Huiwen Chang, Alexander G. Hauptmann, Ming-Hsuan Yang, Yuan Hao, Irfan Essa, Lu Jiang
arXiv 2022. [Paper] Project]
10 Dec 2022

Diffusion Video Autoencoders: Toward Temporally Consistent Face Video Editing via Disentangled Video Encoding
Gyeongman Kim, Hajin Shim, Hyunsu Kim, Yunjey Choi, Junho Kim, Eunho Yang
arXiv 2022. [Paper]
6 Dec 2022

Fine-grained Image Editing by Pixel-wise Guidance Using Diffusion Models
Naoki Matsunaga, Masato Ishii, Akio Hayakawa, Kenji Suzuki, Takuya Narihira
arXiv 2022. [Paper]
5 Dec 2022

VIDM: Video Implicit Diffusion Models
Kangfu Mei, Vishal M. Patel
arXiv 2022. [Paper] [Project] [Github]
1 Dec 2022

Why Are Conditional Generative Models Better Than Unconditional Ones?
Fan Bao, Chongxuan Li, Jiacheng Sun, Jun Zhu
arXiv 2022. [Paper]
1 Dec 2022

High-Fidelity Guided Image Synthesis with Latent Diffusion Models
Jaskirat Singh, Stephen Gould, Liang Zheng
arXiv 2022. [Paper] [Project]
30 Nov 2022

Score-based Continuous-time Discrete Diffusion Models
Haoran Sun, Lijun Yu, Bo Dai, Dale Schuurmans, Hanjun Dai
arXiv 2022. [Paper]
30 Nov 2022

Wavelet Diffusion Models are fast and scalable Image Generators
Hao Phung, Quan Dao, Anh Tran
arXiv 2022. [Paper]
29 Nov 2022

Dimensionality-Varying Diffusion Process
Han Zhang, Ruili Feng, Zhantao Yang, Lianghua Huang, Yu Liu, Yifei Zhang, Yujun Shen, Deli Zhao, Jingren Zhou, Fan Cheng
arXiv 2022. [Paper]
29 Nov 2022

Refining Generative Process with Discriminator Guidance in Score-based Diffusion Models
Dongjun Kim, Yeongmin Kim, Wanmo Kang, Il-Chul Moon
arXiv 2022. [Paper]
28 Nov 2022

Diffusion Probabilistic Model Made Slim
Xingyi Yang, Daquan Zhou, Jiashi Feng, Xinchao Wang
arXiv 2022. [Paper]
27 Nov 2022

Fast Sampling of Diffusion Models via Operator Learning
Hongkai Zheng, Weili Nie, Arash Vahdat, Kamyar Azizzadenesheli, Anima Anandkumar
arXiv 2022. [Paper]
24 Nov 2022

Latent Video Diffusion Models for High-Fidelity Video Generation with Arbitrary Lengths
Yingqing He, Tianyu Yang, Yong Zhang, Ying Shan, Qifeng Chen
arXiv 2022. [Paper]
23 Nov 2022

Paint by Example: Exemplar-based Image Editing with Diffusion Models
Binxin Yang, Shuyang Gu, Bo Zhang, Ting Zhang, Xuejin Chen, Xiaoyan Sun, Dong Chen, Fang Wen
arXiv 2022. [Paper]
23 Nov 2022

SinDiffusion: Learning a Diffusion Model from a Single Natural Image
Weilun Wang, Jianmin Bao, Wengang Zhou, Dongdong Chen, Dong Chen, Lu Yuan, Houqiang Li
arXiv 2022. [Paper] [Github]
22 Nov 2022

Accelerating Diffusion Sampling with Classifier-based Feature Distillation
Wujie Sun, Defang Chen, Can Wang, Deshi Ye, Yan Feng, Chun Chen
arXiv 2022. [Paper]
22 Nov 2022

SceneComposer: Any-Level Semantic Image Synthesis
Yu Zeng, Zhe Lin, Jianming Zhang, Qing Liu, John Collomosse, Jason Kuen, Vishal M. Patel
arXiv 2022. [Paper] [Project]
21 Nov 2022

Diffusion-Based Scene Graph to Image Generation with Masked Contrastive Pre-Training
Ling Yang, Zhilin Huang, Yang Song, Shenda Hong, Guohao Li, Wentao Zhang, Bin Cui, Bernard Ghanem, Ming-Hsuan Yang
arXiv 2022. [Paper]
21 Nov 2022

SinFusion: Training Diffusion Models on a Single Image or Video
Yaniv Nikankin, Niv Haim, Michal Irani
arXiv 2022. [Paper]
21 Nov 2022

MagicVideo: Efficient Video Generation With Latent Diffusion Models
Daquan Zhou, Weimin Wang, Hanshu Yan, Weiwei Lv, Yizhe Zhu, Jiashi Feng
arXiv 2022. [Paper] [Project]
20 Nov 2022

Seeing Beyond the Brain: Conditional Diffusion Model with Sparse Masked Modeling for Vision Decoding
Zijiao Chen, Jiaxin Qing, Tiange Xiang, Wan Lin Yue, Juan Helen Zhou
arXiv 2022. [Paper] [Project] [Github]
13 Nov 2022

Few-shot Image Generation with Diffusion Models
Jingyuan Zhu, Huimin Ma, Jiansheng Chen, Jian Yuan
arXiv 2022. [Paper]
7 Nov 2022

From Denoising Diffusions to Denoising Markov Models
Joe Benton, Yuyang Shi, Valentin De Bortoli, George Deligiannidis, Arnaud Doucet
arXiv 2022. [Paper] [Github]
7 Nov 2022

Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models
Muyang Li, Ji Lin, Chenlin Meng, Stefano Ermon, Song Han, Jun-Yan Zhu
NeurIPS 2022. [Paper] [Github]
4 Nov 2022

An optimal control perspective on diffusion-based generative modeling
Julius Berner, Lorenz Richter, Karen Ullrich
NeurIPS Workshop 2022. [Paper]
2 Nov 2022

Entropic Neural Optimal Transport via Diffusion Processes
Nikita Gushchin, Alexander Kolesov, Alexander Korotin, Dmitry Vetrov, Evgeny Burnaev
arXiv 2022. [Paper]
2 Nov 2022

DPM-Solver++: Fast Solver for Guided Sampling of Diffusion Probabilistic Models
Cheng Lu, Yuhao Zhou, Fan Bao, Jianfei Chen, Chongxuan Li, Jun Zhu
NeurIPS 2022 (Oral). [Paper] [Github]
2 Nov 2022

Score-based Denoising Diffusion with Non-Isotropic Gaussian Noise Models
Vikram Voleti, Christopher Pal, Adam Oberman
NeurIPS Workshop 2022. [Paper]
21 Oct 2022

Deep Equilibrium Approaches to Diffusion Models
Ashwini Pokle, Zhengyang Geng, Zico Kolter
NeurIPS 2022. [Paper] [Github]
23 Oct 2022

Representation Learning with Diffusion Models
Jeremias Traub
arXiv 2022. [Paper] [Github]
20 Oct 2022

Self-Guided Diffusion Models
Vincent Tao Hu, David W Zhang, Yuki M. Asano, Gertjan J. Burghouts, Cees G. M. Snoek
arXiv 2022. [Paper] [Project]
12 Oct 2022

GENIE: Higher-Order Denoising Diffusion Solvers
Tim Dockhorn, Arash Vahdat, Karsten Kreis
NeurIPS 2022. [Paper] [Project [Github]
11 Oct 2022

f-DM: A Multi-stage Diffusion Model via Progressive Signal Transformation
Jiatao Gu, Shuangfei Zhai, Yizhe Zhang, Miguel Angel Bautista, Josh Susskind
arXiv 2022. [Paper] [Project]
10 Oct 2022

On Distillation of Guided Diffusion Models
Chenlin Meng, Ruiqi Gao, Diederik P. Kingma, Stefano Ermon, Jonathan Ho, Tim Salimans
arXiv 2022. [Paper]
6 Oct 2022

Improving Sample Quality of Diffusion Model Using Self-Attention Guidance
Susung Hong, Gyuseong Lee, Wooseok Jang, Seungryong Kim
arXiv 2022. [Paper] [Project]
3 Oct 2022

OCD: Learning to Overfit with Conditional Diffusion Models
Shahar Shlomo Lutati, Lior Wolf
arXiv 2022. [Paper] [Github]
2 Oct 2022

Generated Faces in the Wild: Quantitative Comparison of Stable Diffusion, Midjourney and DALL-E 2
Ali Borji
arXiv 2022. [Paper] [Github]
2 Oct 2022

Denoising MCMC for Accelerating Diffusion-Based Generative Models
Beomsu Kim, Jong Chul Ye
arXiv 2022. [Paper] [Github]
29 Sep 2022

All are Worth Words: a ViT Backbone for Score-based Diffusion Models
Fan Bao, Chongxuan Li, Yue Cao, Jun Zhu
arXiv 2022. [Paper]
25 Sep 2022

Neural Wavelet-domain Diffusion for 3D Shape Generation
Ka-Hei Hui, Ruihui Li, Jingyu Hu, Chi-Wing Fu
arXiv 2022. [Paper]
19 Sep 2022

Can segmentation models be trained with fully synthetically generated data?
Virginia Fernandez, Walter Hugo Lopez Pinaya, Pedro Borges, Petru-Daniel Tudosiu, Mark S Graham, Tom Vercauteren, M Jorge Cardoso
arXiv 2022. [Paper]
17 Sep 2022

Blurring Diffusion Models
Emiel Hoogeboom, Tim Salimans
arXiv 2022. [Paper]
12 Sep 2022

Soft Diffusion: Score Matching for General Corruptions
Giannis Daras, Mauricio Delbracio, Hossein Talebi, Alexandros G. Dimakis, Peyman Milanfar
arXiv 2022. [Paper]
12 Sep 2022

Improved Masked Image Generation with Token-Critic
José Lezama, Huiwen Chang, Lu Jiang, Irfan Essa
arXiv 2022. [Paper]
9 Sep 2022

Let us Build Bridges: Understanding and Extending Diffusion Generative Models
Xingchao Liu, Lemeng Wu, Mao Ye, Qiang Liu
arXiv 2022. [Paper]
31 Aug 2022

Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis
Wan-Cyuan Fan, Yen-Chun Chen, DongDong Chen, Yu Cheng, Lu Yuan, Yu-Chiang Frank Wang
arXiv 2022. [Paper]
29 Aug 2022

Adaptively-Realistic Image Generation from Stroke and Sketch with Diffusion Model
Shin-I Cheng, Yu-Jie Chen, Wei-Chen Chiu, Hsin-Ying Lee, Hung-Yu Tseng
arXiv 2022. [Paper] [Project]
26 Aug 2022

Cold Diffusion: Inverting Arbitrary Image Transforms Without Noise
Arpit Bansal, Eitan Borgnia, Hong-Min Chu, Jie S. Li, Hamid Kazemi, Furong Huang, Micah Goldblum, Jonas Geiping, Tom Goldstein
arXiv 2022. [Paper] [Github]
19 Aug 2022

Enhancing Diffusion-Based Image Synthesis with Robust Classifier Guidance
Bahjat Kawar, Roy Ganz, Michael Elad
arXiv 2022. [Paper]
18 Aug 2022

Your ViT is Secretly a Hybrid Discriminative-Generative Diffusion Model
Xiulong Yang, Sheng-Min Shih, Yinlin Fu, Xiaoting Zhao, Shihao Ji
arXiv 2022. [Paper] [Github]
16 Aug 2022

Applying Regularized Schrödinger-Bridge-Based Stochastic Process in Generative Modeling
Ki-Ung Song
arXiv 2022. [Paper] [Github]
15 Aug 2022

Analog Bits: Generating Discrete Data using Diffusion Models with Self-Conditioning
Ting Chen, Ruixiang Zhang, Geoffrey Hinton
arXiv 2022. [Paper]
8 Aug 2022

Pyramidal Denoising Diffusion Probabilistic Models
Dohoon Ryu, Jong Chul Ye
arXiv 2022. [Paper]
3 Aug 2022

Progressive Deblurring of Diffusion Models for Coarse-to-Fine Image Synthesis
Sangyun Lee, Hyungjin Chung, Jaehyeon Kim, Jong Chul Ye
arxiv 2022. [Paper] [Github]
16 Jul 2022

Improving Diffusion Model Efficiency Through Patching
Troy Luhman, Eric Luhman
arXiv 2022. [Paper] [Github]
9 Jul 2022

Accelerating Score-based Generative Models with Preconditioned Diffusion Sampling
Hengyuan Ma, Li Zhang, Xiatian Zhu, Jianfeng Feng
ECCV 2022. [Paper]
5 Jul 2022

SPI-GAN: Distilling Score-based Generative Models with Straight-Path Interpolations
Jinsung Jeon, Noseong Park
arxiv 2022. [Paper]
29 Jun 2022

Entropy-driven Sampling and Training Scheme for Conditional Diffusion Generation
Shengming Li, Guangcong Zheng, Hui Wang, Taiping Yao, Yang Chen, Shoudong Ding, Xi Li
arXiv 2022. [Paper]
23 Jun 2022

Generative Modelling With Inverse Heat Dissipation
Severi Rissanen, Markus Heinonen, Arno Solin
arXiv 2022. [Paper] [Project]
21 Jun 2022

Diffusion models as plug-and-play priors
Alexandros Graikos, Nikolay Malkin, Nebojsa Jojic, Dimitris Samaras
NeurIPS 2022. [Paper] [Github]
17 Jun 2022

A Flexible Diffusion Model
Weitao Du, Tao Yang, He Zhang, Yuanqi Du
ICML 2023. [Paper]
17 Jun 2022

Lossy Compression with Gaussian Diffusion
Lucas Theis, Tim Salimans, Matthew D. Hoffman, Fabian Mentzer
arXiv 2022. [Paper]
17 Jun 2022

Maximum Likelihood Training for Score-Based Diffusion ODEs by High-Order Denoising Score Matching
Cheng Lu, Kaiwen Zheng, Fan Bao, Jianfei Chen, Chongxuan Li, Jun Zhu
ICML 2022. [Paper] [Github]
16 Jun 2022

Estimating the Optimal Covariance with Imperfect Mean in Diffusion Probabilistic Models
Fan Bao, Chongxuan Li, Jiacheng Sun, Jun Zhu, Bo Zhang
ICML 2022. [Paper] [Github]
15 Jun 2022

Diffusion Models for Video Prediction and Infilling
Tobias Höppe, Arash Mehrjou, Stefan Bauer, Didrik Nielsen, Andrea Dittadi
arXiv 2022. [Paper]
15 Jun 2022

Discrete Contrastive Diffusion for Cross-Modal and Conditional Generation
Ye Zhu, Yu Wu, Kyle Olszewski, Jian Ren, Sergey Tulyakov, Yan Yan
arXiv 2022. [Paper] [Github]
15 Jun 2022

gDDIM: Generalized denoising diffusion implicit models
Qinsheng Zhang, Molei Tao, Yongxin Chen
arXiv 2022. [Paper] [Github]
11 Jun 2022

How Much is Enough? A Study on Diffusion Times in Score-based Generative Models
Giulio Franzese, Simone Rossi, Lixuan Yang, Alessandro Finamore, Dario Rossi, Maurizio Filippone, Pietro Michiardi
arXiv 2022. [Paper]
10 Jun 2022

Image Generation with Multimodal Priors using Denoising Diffusion Probabilistic Models
Nithin Gopalakrishnan Nair, Wele Gedara Chaminda Bandara, Vishal M Patel
arXiv 2022. [Paper]
10 Jun 2022

Accelerating Score-based Generative Models for High-Resolution Image Synthesis
Hengyuan Ma, Li Zhang, Xiatian Zhu, Jingfeng Zhang, Jianfeng Feng
arXiv 2022. [Paper]
8 Jun 2022

Diffusion-GAN: Training GANs with Diffusion
Zhendong Wang, Huangjie Zheng, Pengcheng He, Weizhu Chen, Mingyuan Zhou
arXiv 2022. [Paper]
5 Jun 2022

DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps
Cheng Lu, Yuhao Zhou, Fan Bao, Jianfei Chen, Chongxuan Li, Jun Zhu
NeurrIPS 2022. [Paper] [Github]
2 Jun 2022

Elucidating the Design Space of Diffusion-Based Generative Models
Tero Karras, Miika Aittala, Timo Aila, Samuli Laine
NeurIPS 2022. [Paper]
1 Jun 2022

On Analyzing Generative and Denoising Capabilities of Diffusion-based Deep Generative Models
Kamil Deja, Anna Kuzina, Tomasz Trzciński, Jakub M. Tomczak
NeurIPS 2022. [Paper]
31 May 2022

Few-Shot Diffusion Models
Giorgio Giannone, Didrik Nielsen, Ole Winther
arXiv 2022. [Paper]
30 May 2022

A Continuous Time Framework for Discrete Denoising Models
Andrew Campbell, Joe Benton, Valentin De Bortoli, Tom Rainforth, George Deligiannidis, Arnaud Doucet
arXiv 2022. [Paper]
30 May 2022

Maximum Likelihood Training of Implicit Nonlinear Diffusion Models
Dongjun Kim, Byeonghu Na, Se Jung Kwon, Dongsoo Lee, Wanmo Kang, Il-Chul Moon
NeurIPS 2022. [Paper]
27 May 2022

Accelerating Diffusion Models via Early Stop of the Diffusion Process
Zhaoyang Lyu, Xudong XU, Ceyuan Yang, Dahua Lin, Bo Dai
ICML 2022. [Paper]
25 May 2022

Flexible Diffusion Modeling of Long Videos
William Harvey, Saeid Naderiparizi, Vaden Masrani, Christian Weilbach, Frank Wood
arXiv 2022. [Paper] [Github]
23 May 2022

MCVD: Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation
Vikram Voleti, Alexia Jolicoeur-Martineau, Christopher Pal
NeurIPS 2022. [Paper] [Github]
19 May 2022

On Conditioning the Input Noise for Controlled Image Generation with Diffusion Models
Vedant Singh, Surgan Jandial, Ayush Chopra, Siddharth Ramesh, Balaji Krishnamurthy, Vineeth N. Balasubramanian
CVPR Workshop 2022. [Paper]
8 May 2022

Subspace Diffusion Generative Models
Bowen Jing, Gabriele Corso, Renato Berlinghieri, Tommi Jaakkola
arXiv 2022. [Paper] [Github]
3 May 2022

Fast Sampling of Diffusion Models with Exponential Integrator
Qinsheng Zhang, Yongxin Chen
arXiv 2022. [Paper]
29 Apr 2022

Semi-Parametric Neural Image Synthesis
Andreas Blattmann, Robin Rombach, Kaan Oktay, Jonas Müller, Björn Ommer
NeurIPS 2022. [Paper]
25 Apr 2022

Video Diffusion Models
Jonathan Ho, Tim Salimans, Alexey Gritsenko, William Chan, Mohammad Norouzi, David J. Fleet
NeurIPS 2022. [Paper]
7 Apr 2022

Perception Prioritized Training of Diffusion Models
Jooyoung Choi, Jungbeom Lee, Chaehun Shin, Sungwon Kim, Hyunwoo Kim, Sungroh Yoon
CVPR 2022. [Paper] [Github]
1 Apr 2022

Generating High Fidelity Data from Low-density Regions using Diffusion Models
Vikash Sehwag, Caner Hazirbas, Albert Gordo, Firat Ozgenel, Cristian Canton Ferrer
arXiv 2022. [Paper]
31 Mar 2022

Diffusion Models for Counterfactual Explanations
Guillaume Jeanneret, Loïc Simon, Frédéric Jurie
arXiv 2022. [Paper]
29 Mar 2022

Denoising Likelihood Score Matching for Conditional Score-based Data Generation
Chen-Hao Chao, Wei-Fang Sun, Bo-Wun Cheng, Yi-Chen Lo, Chia-Che Chang, Yu-Lun Liu, Yu-Lin Chang, Chia-Ping Chen, Chun-Yi Lee
ICLR 2022. [Paper]
27 Mar 2022

Diffusion Probabilistic Modeling for Video Generation
Ruihan Yang, Prakhar Srivastava, Stephan Mandt
arXiv 2022. [Paper] [Github]
16 Mar 2022

Dynamic Dual-Output Diffusion Models
Yaniv Benny, Lior Wolf
CVPR 2022. [Paper]
8 Mar 2022

Conditional Simulation Using Diffusion Schrödinger Bridges
Yuyang Shi, Valentin De Bortoli, George Deligiannidis, Arnaud Doucet
arXiv 2022. [Paper]
27 Feb 2022

Diffusion Causal Models for Counterfactual Estimation
Pedro Sanchez, Sotirios A. Tsaftaris
PMLR 2022. [Paper]
21 Feb 2022

Pseudo Numerical Methods for Diffusion Models on Manifolds
Luping Liu, Yi Ren, Zhijie Lin, Zhou Zhao
ICLR 2022. [Paper] [Github]
20 Feb 2022

Truncated Diffusion Probabilistic Models
Huangjie Zheng, Pengcheng He, Weizhu Chen, Mingyuan Zhou
arXiv 2022. [Paper]
19 Feb 2022

Understanding DDPM Latent Codes Through Optimal Transport
Valentin Khrulkov, Ivan Oseledets
arXiv 2022. [Paper]
14 Feb 2022

Learning Fast Samplers for Diffusion Models by Differentiating Through Sample Quality
Daniel Watson, William Chan, Jonathan Ho, Mohammad Norouzi
ICLR 2022. [Paper]
11 Feb 2022

Diffusion bridges vector quantized Variational AutoEncoders
Max Cohen, Guillaume Quispe, Sylvain Le Corff, Charles Ollion, Eric Moulines
ICML 2022. [Paper]
10 Feb 2022

Progressive Distillation for Fast Sampling of Diffusion Models
Tim Salimans, Jonathan Ho
ICLR 2022. [Paper]
1 Feb 2022

Analytic-DPM: an Analytic Estimate of the Optimal Reverse Variance in Diffusion Probabilistic Models
Fan Bao, Chongxuan Li, Jun Zhu, Bo Zhang
ICLR 2022. [Paper]
17 Jan 2022

DiffuseVAE: Efficient, Controllable and High-Fidelity Generation from Low-Dimensional Latents
Kushagra Pandey, Avideep Mukherjee, Piyush Rai, Abhishek Kumar
arXiv 2022. [Paper] [Github]
2 Jan 2022

Itô-Taylor Sampling Scheme for Denoising Diffusion Probabilistic Models using Ideal Derivatives
Hideyuki Tachibana, Mocho Go, Muneyoshi Inahara, Yotaro Katayama, Yotaro Watanabe
arXiv 2021. [Paper]
26 Dec 2021

GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
Alex Nichol, Prafulla Dhariwal, Aditya Ramesh, Pranav Shyam, Pamela Mishkin, Bob McGrew, Ilya Sutskever, Mark Chen
ICML 2021. [Paper] [Github]
20 Dec 2021

High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, Björn Ommer
arXiv 2021. [Paper] [Github]
20 Dec 2021

Heavy-tailed denoising score matching
Jacob Deasy, Nikola Simidjievski, Pietro Liò
arXiv 2021. [Paper]
17 Dec 2021

High Fidelity Visualization of What Your Self-Supervised Representation Knows About
Florian Bordes, Randall Balestriero, Pascal Vincent
arXiv 2021. [Paper]
16 Dec 2021

Tackling the Generative Learning Trilemma with Denoising Diffusion GANs
Zhisheng Xiao, Karsten Kreis, Arash Vahdat
arXiv 2021. [Paper] [Project]
15 Dec 2021

Score-Based Generative Modeling with Critically-Damped Langevin Diffusion
Tim Dockhorn, Arash Vahdat, Karsten Kreis
ICLR 2022. [Paper] [Project]
14 Dec 2021

More Control for Free! Image Synthesis with Semantic Diffusion Guidance
Xihui Liu, Dong Huk Park, Samaneh Azadi, Gong Zhang, Arman Chopikyan, Yuxiao Hu, Humphrey Shi, Anna Rohrbach, Trevor Darrell
arXiv 2021. [Paper]
10 Dec 2021

Global Context with Discrete Diffusion in Vector Quantised Modelling for Image Generation
Minghui Hu, Yujie Wang, Tat-Jen Cham, Jianfei Yang, P.N.Suganthan
arXiv 2021. [Paper]
3 Dec 2021

Diffusion Autoencoders: Toward a Meaningful and Decodable Representation
Konpat Preechakul, Nattanat Chatthee, Suttisak Wizadwongsa, Supasorn Suwajanakorn
CVPR 2022. [Paper] [Project] [Github]
30 Dec 2021

Conditional Image Generation with Score-Based Diffusion Models
Georgios Batzolis, Jan Stanczuk, Carola-Bibiane Schönlieb, Christian Etmann
arXiv 2021. [Paper]
26 Nov 2021

Unleashing Transformers: Parallel Token Prediction with Discrete Absorbing Diffusion for Fast High-Resolution Image Generation from Vector-Quantized Codes
Sam Bond-Taylor, Peter Hessey, Hiroshi Sasaki, Toby P. Breckon, Chris G. Willcocks
arXiv 2021. [Paper] [Github]
24 Nov 2021

Diffusion Normalizing Flow
Qinsheng Zhang, Yongxin Chen
NeurIPS 2021. [Paper] [Github]
14 Oct 2021

Denoising Diffusion Gamma Models
Eliya Nachmani, Robin San Roman, Lior Wolf
arXiv 2021. [Paper]
10 Oct 2021

Score-based Generative Neural Networks for Large-Scale Optimal Transport
Max Daniels, Tyler Maunu, Paul Hand
arXiv 2021. [Paper]
7 Oct 2021

Score-Based Generative Classifiers
Roland S. Zimmermann, Lukas Schott, Yang Song, Benjamin A. Dunn, David A. Klindt
arXiv 2021. [Paper]
1 Oct 2021

Classifier-Free Diffusion Guidance
Jonathan Ho, Tim Salimans
NeurIPS Workshop 2021. [Paper]
28 Sep 2021

Bilateral Denoising Diffusion Models
Max W. Y. Lam, Jun Wang, Rongjie Huang, Dan Su, Dong Yu
arXiv 2021. [Paper] [Project]
26 Aug 2021

ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis
Patrick Esser, Robin Rombach, Andreas Blattmann, Björn Ommer
NeurIPS 2021. [Paper] [Project]
19 Aug 2021

ILVR: Conditioning Method for Denoising Diffusion Probabilistic Models
Jooyoung Choi, Sungwon Kim, Yonghyun Jeong, Youngjune Gwon, Sungroh Yoon
ICCV 2021 (Oral). [Paper] [Github]
6 Aug 2021

SDEdit: Guided Image Synthesis and Editing with Stochastic Differential Equations
Chenlin Meng, Yutong He, Yang Song, Jiaming Song, Jiajun Wu, Jun-Yan Zhu, Stefano Ermon
ICLR 2022. [Paper] [Project] [Github]
2 Aug 2021

Structured Denoising Diffusion Models in Discrete State-Spaces
Jacob Austin, Daniel D. Johnson, Jonathan Ho, Daniel Tarlow, Rianne van den Berg
NeurIPS 2021. [Paper]
7 Jul 2021

Variational Diffusion Models
Diederik P. Kingma, Tim Salimans, Ben Poole, Jonathan Ho
arXiv 2021. [Paper] [Github]
1 Jul 2021

Diffusion Priors In Variational Autoencoders
Antoine Wehenkel, Gilles Louppe
ICML Workshop 2021. [Paper]
29 Jun 2021

Deep Generative Learning via Schrödinger Bridge
Gefei Wang, Yuling Jiao, Qian Xu, Yang Wang, Can Yang
ICML 2021. [Paper]
19 Jun 2021

Non Gaussian Denoising Diffusion Models
Eliya Nachmani, Robin San Roman, Lior Wolf
arXiv 2021. [Paper] [Project]
14 Jun 2021

D2C: Diffusion-Denoising Models for Few-shot Conditional Generation
Abhishek Sinha, Jiaming Song, Chenlin Meng, Stefano Ermon
NeurIPS 2021. [Paper] [Project] [Github]
12 Jun 2021

Score-based Generative Modeling in Latent Space
Arash Vahdat, Karsten Kreis, Jan Kautz
arXiv 2021. [Paper]
10 Jun 2021

Learning to Efficiently Sample from Diffusion Probabilistic Models
Daniel Watson, Jonathan Ho, Mohammad Norouzi, William Chan
arXiv 2021. [Paper]
7 Jun 2021

A Variational Perspective on Diffusion-Based Generative Models and Score Matching
Chin-Wei Huang, Jae Hyun Lim, Aaron Courville
NeurIPS 2021. [Paper] [Github]
5 Jun 2021

Soft Truncation: A Universal Training Technique of Score-based Diffusion Model for High Precision Score Estimation
Dongjun Kim, Seungjae Shin, Kyungwoo Song, Wanmo Kang, Il-Chul Moon
ICML 2022. [Paper]
10 Jun 2021

Diffusion Schrödinger Bridge with Applications to Score-Based Generative Modeling
Valentin De Bortoli, James Thornton, Jeremy Heng, Arnaud Doucet
arXiv 2021. [Paper] [Project] [Github]
1 Jun 2021

On Fast Sampling of Diffusion Probabilistic Models
Zhifeng Kong, Wei Ping
ICML Workshop 2021. [Paper] [Github]
31 May 2021

Cascaded Diffusion Models for High Fidelity Image Generation
Jonathan Ho, Chitwan Saharia, William Chan, David J. Fleet, Mohammad Norouzi, Tim Salimans
JMLR 2021. [Paper] [Project]
30 May 2021

Gotta Go Fast When Generating Data with Score-Based Models
Alexia Jolicoeur-Martineau, Ke Li, Rémi Piché-Taillefer, Tal Kachman, Ioannis Mitliagkas
arXiv 2021. [Paper] [Github]
28 May 2021

Diffusion Models Beat GANs on Image Synthesis
Prafulla Dhariwal, Alex Nichol
arXiv 2021. [Paper] [Github]
11 May 2021

Image Super-Resolution via Iterative Refinement
Chitwan Saharia, Jonathan Ho, William Chan, Tim Salimans, David J. Fleet, Mohammad Norouzi
arXiv 2021. [Paper] [Project] [Github]
15 Apr 2021

Noise Estimation for Generative Diffusion Models
Robin San-Roman, Eliya Nachmani, Lior Wolf
arXiv 2021. [Paper]
6 Apr 2021

Improved Denoising Diffusion Probabilistic Models
Alex Nichol, Prafulla Dhariwal
ICLR 2021. [Paper] [Github]
18 Feb 2021

Maximum Likelihood Training of Score-Based Diffusion Models
Yang Song, Conor Durkan, Iain Murray, Stefano Ermon
arXiv 2021. [Paper]
22 Jan 2021

Knowledge Distillation in Iterative Generative Models for Improved Sampling Speed
Eric Luhman, Troy Luhman
arXiv 2021. [Paper] [Github]
7 Jan 2021

Learning Energy-Based Models by Diffusion Recovery Likelihood
Ruiqi Gao, Yang Song, Ben Poole, Ying Nian Wu, Diederik P. Kingma
ICLR 2021. [Paper] [Github]
15 Dec 2020

Score-Based Generative Modeling through Stochastic Differential Equations
Yang Song, Jascha Sohl-Dickstein, Diederik P. Kingma, Abhishek Kumar, Stefano Ermon, Ben Poole
ICLR 2021 (Oral). [Paper] [Github]
26 Nov 2020

Variational (Gradient) Estimate of the Score Function in Energy-based Latent Variable Models
Fan Bao, Kun Xu, Chongxuan Li, Lanqing Hong, Jun Zhu, Bo Zhang
ICML 2021. [Paper]
16 Oct 2020

Denoising Diffusion Implicit Models
Jiaming Song, Chenlin Meng, Stefano Ermon
ICLR 2021. [Paper] [Github]
6 Oct 2020

Adversarial score matching and improved sampling for image generation
Alexia Jolicoeur-Martineau, Rémi Piché-Taillefer, Rémi Tachet des Combes, Ioannis Mitliagkas
ICLR 2021. [Paper] [Github]
11 Sep 2020

Denoising Diffusion Probabilistic Models
Jonathan Ho, Ajay Jain, Pieter Abbeel
NeurIPS 2020. [Paper] [Github] [Github2]
19 Jun 2020

Improved Techniques for Training Score-Based Generative Models
Yang Song, Stefano Ermon
NeurIPS 2020. [Paper] [Github]
16 Jun 2020

Generative Modeling by Estimating Gradients of the Data Distribution
Yang Song, Stefano Ermon
NeurIPS 2019. [Paper] [Project] [Github]
12 Jul 2019

Neural Stochastic Differential Equations: Deep Latent Gaussian Models in the Diffusion Limit
Belinda Tzen, Maxim Raginsky
arXiv 2019. [Paper]
23 May 2019

Deep Unsupervised Learning using Nonequilibrium Thermodynamics
Jascha Sohl-Dickstein, Eric A. Weiss, Niru Maheswaranathan, Surya Ganguli
ICML 2015. [Paper] [Github]
2 Mar 2015

Classification

Likelihood-based Out-of-Distribution Detection with Denoising Diffusion Probabilistic Models
Joseph Goodier, Neill D. F. Campbell
BMVC 2023. [Paper]
26 Oct 2023

Multi-scale Diffusion Denoised Smoothing
Jongheon Jeong, Jinwoo Shin
NeurIPS 2023. [Paper]
25 Oct 2023

DiffRef3D: A Diffusion-based Proposal Refinement Framework for 3D Object Detection
Se-Ho Kim, Inyong Koo, Inyoung Lee, Byeongjun Park, Changick Kim
arXiv 2023. [Paper]
25 Oct 2023

Denoising Task Routing for Diffusion Models
Byeongjun Park, Sangmin Woo, Hyojun Go, Jin-Young Kim, Changick Kim
arXiv 2023. [Paper]
11 Oct 2023

Leveraging Diffusion-Based Image Variations for Robust Training on Poisoned Data
Lukas Struppek, Martin B. Hentschel, Clifton Poth, Dominik Hintersdorf, Kristian Kersting
arXiv 2023. [Paper] [Github]
10 Oct 2023

Dream the Impossible: Outlier Imagination with Diffusion Models
Xuefeng Du, Yiyou Sun, Xiaojin Zhu, Yixuan Li
NeurIPS 2023. [Paper] [Github]
23 Sep 2023

Zero-Shot Object Counting with Language-Vision Models
Jingyi Xu, Hieu Le, Dimitris Samaras
CVPR 2023. [Paper] [Github]
22 Sep 2023

PSDiff: Diffusion Model for Person Search with Iterative and Collaborative Refinement
Chengyou Jia, Minnan Luo, Zhuohang Dang, Guang Dai, Xiaojun Chang, Jingdong Wang, Qinghua Zheng
arXiv 2023. [Paper]
20 Sep 2023

Beyond Generation: Harnessing Text to Image Models for Object Detection and Segmentation
Yunhao Ge, Jiashu Xu, Brian Nlong Zhao, Neel Joshi, Laurent Itti, Vibhav Vineet
arXiv 2023. [Paper] [Github]
12 Sep 2023

DiffusionEngine: Diffusion Model is Scalable Data Engine for Object Detection
Manlin Zhang, Jie Wu, Yuxi Ren, Ming Li, Jie Qin, Xuefeng Xiao, Wei Liu, Rui Wang, Min Zheng, Andy J. Ma
arXiv 2023. [Paper] [Project] [Github]
7 Sep 2023

Diffusion-based 3D Object Detection with Random Boxes
Xin Zhou, Jinghua Hou, Tingting Yao, Dingkang Liang, Zhe Liu, Zhikang Zou, Xiaoqing Ye, Jianwei Cheng, Xiang Bai
PRCV 2023. [Paper]
5 Sep 2023

Diffusion Model as Representation Learner
Xingyi Yang, Xinchao Wang
ICCV 2023. [Paper]
21 Aug 2023

DiffusionTrack: Diffusion Model For Multi-Object Tracking
Run Luo, Zikai Song, Lintao Ma, Jinlin Wei, Wei Yang, Min Yang
arXiv 2023. [Paper]
19 Aug 2023

DiffGuard: Semantic Mismatch-Guided Out-of-Distribution Detection using Pre-trained Diffusion Models
Ruiyuan Gao, Chenchen Zhao, Lanqing Hong, Qiang Xu
arXiv 2023. [Paper]
15 Aug 2023

IDiff-Face: Synthetic-based Face Recognition through Fizzy Identity-Conditioned Diffusion Models
Fadi Boutros, Jonas Henry Grebe, Arjan Kuijper, Naser Damer
ICCV 2023. [Paper]
9 Aug 2023

Exploiting Synthetic Data for Data Imbalance Problems: Baselines from a Data Perspective
Moon Ye-Bin, Nam Hyeon-Woo, Wonseok Choi, Nayeong Kim, Suha Kwak, Tae-Hyun Oh
arXiv 2023. [Paper]
2 Aug 2023

Diffusion Model for Camouflaged Object Detection
Zhennan Chen, Rongrong Gao, Tian-Zhu Xiang, Fan Lin
ECAI 2023. [Paper]
1 Aug 2023

DiffPose: SpatioTemporal Diffusion Model for Video-Based Human Pose Estimation
Runyang Feng, Yixing Gao, Tze Ho Elden Tse, Xueqing Ma, Hyung Jin Chang
arXiv 2023. [Paper]
31 Jul 2023

MetaDiff: Meta-Learning with Conditional Diffusion for Few-Shot Learning
Baoquan Zhang, Demin Yu
arXiv 2023. [Paper]
31 Jul 2023

Generative Prompt Model for Weakly Supervised Object Localization
Yuzhong Zhao, Qixiang Ye, Weijia Wu, Chunhua Shen, Fang Wan
ICCV 2023. [Paper] [Github]
19 Jul 2023

Diffusion Models Beat GANs on Image Classification
Soumik Mukhopadhyay, Matthew Gwilliam, Vatsal Agarwal, Namitha Padmanabhan, Archana Swaminathan, Srinidhi Hegde, Tianyi Zhou, Abhinav Shrivastava
arXiv 2023. [Paper]
17 Jul 2023

Diffusion to Confusion: Naturalistic Adversarial Patch Generation Based on Diffusion Model for Object Detector
Shuo-Yen Lin, Ernie Chu, Che-Hsien Lin, Jun-Cheng Chen, Jia-Ching Wang
arXiv 2023. [Paper]
16 Jul 2023

DreamTeacher: Pretraining Image Backbones with Deep Generative Models
Daiqing Li, Huan Ling, Amlan Kar, David Acuna, Seung Wook Kim, Karsten Kreis, Antonio Torralba, Sanja Fidler
arXiv 2023. [Paper] [Project]
14 Jul 2023

ProtoDiff: Learning to Learn Prototypical Networks by Task-Guided Diffusion
Yingjun Du, Zehao Xiao, Shengcai Liao, Cees Snoek
arXiv 2023. [Paper]
26 Jun 2023

Masked Diffusion Models are Fast Learners
Jiachen Lei, Peng Cheng, Zhongjie Ba, Kui Ren
arXiv 2023. [Paper]
20 Jun 2023

Renderers are Good Zero-Shot Representation Learners: Exploring Diffusion Latents for Metric Learning
Michael Tang, David Shustin
arXiv 2023. [Paper]
19 Jun 2023

The Big Data Myth: Using Diffusion Models for Dataset Generation to Train Deep Detection Models
Roy Voetman, Maya Aghaei, Klaas Dijkstra
arXiv 2023. [Paper]
16 Jun 2023

When Hyperspectral Image Classification Meets Diffusion Models: An Unsupervised Feature Learning Framework
Jingyi Zhou, Jiamu Sheng, Jiayuan Fan, Peng Ye, Tong He, Bin Wang, Tao Chen
arXiv 2023. [Paper]
15 Jun 2023

DDLP: Unsupervised Object-Centric Video Prediction with Deep Dynamic Latent Particles
Tal Daniel, Aviv Tamar
arXiv 2023. [Paper]
9 Jun 2023

ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process
Changyao Tian, Chenxin Tao, Jifeng Dai, Hao Li, Ziheng Li, Lewei Lu, Xiaogang Wang, Hongsheng Li, Gao Huang, Xizhou Zhu
arXiv 2023. [Paper]
8 Jun 2023

Conditional Generation from Unconditional Diffusion Models using Denoiser Representations
Alexandros Graikos, Srikar Yellapragada, Dimitris Samaras
BMVC 2023. [Paper] [Github]
2 Jun 2023

DiffCLIP: Leveraging Stable Diffusion for Language Grounded 3D Classification
Sitian Shen, Zilin Zhu, Linqian Fan, Harry Zhang, Xinxiao Wu
arXiv 2023. [Paper]
25 May 2023

Training on Thin Air: Improve Image Classification with Generated Data
Yongchao Zhou, Hshmat Sahak, Jimmy Ba
arXiv 2023. [Paper] [Project] [Github]
24 May 2023

Is Synthetic Data From Diffusion Models Ready for Knowledge Distillation?
Zheng Li, Yuxuan Li, Penghai Zhao, Renjie Song, Xiang Li, Jian Yang
arXiv 2023. [Paper] [Github]
22 May 2023

Boosting Human-Object Interaction Detection with Text-to-Image Diffusion Model
Jie Yang, Bingliang Li, Fengyu Yang, Ailing Zeng, Lei Zhang, Ruimao Zhang
arXiv 2023. [Paper]
20 May 2023

Meta-DM: Applications of Diffusion Models on Few-Shot Learning
Wentao Hu, Xiurong Jiang, Jiarun Liu, Yuqi Yang, Hui Tian
arXiv 2023. [Paper]
14 May 2023

Class-Balancing Diffusion Models
Yiming Qin, Huangjie Zheng, Jiangchao Yao, Mingyuan Zhou, Ya Zhang
CVPR 2023. [Paper]
30 Apr 2023

Synthetic Data from Diffusion Models Improves ImageNet Classification
Shekoofeh Azizi, Simon Kornblith, Chitwan Saharia, Mohammad Norouzi, David J. Fleet
arXiv 2023. [Paper]
17 Apr 2023

OVTrack: Open-Vocabulary Multiple Object Tracking
Siyuan Li, Tobias Fischer, Lei Ke, Henghui Ding, Martin Danelljan, Fisher Yu
arXiv 2023. [Paper]
17 Apr 2023

Your Diffusion Model is Secretly a Zero-Shot Classifier
Alexander C. Li, Mihir Prabhudesai, Shivam Duggal, Ellis Brown, Deepak Pathak
arXiv 2023. [Paper] [Project]
28 Mar 2023

Text-to-Image Diffusion Models are Zero-Shot Classifiers
Kevin Clark, Priyank Jaini
arXiv 2023. [Paper]
27 Mar 2023

Diffusion Denoised Smoothing for Certified and Adversarial Robust Out-Of-Distribution Detection
Nicola Franco, Daniel Korth, Jeanette Miriam Lorenz, Karsten Roscher, Stephan Guennemann
arXiv 2023. [Paper]
27 Mar 2023

CIFAKE: Image Classification and Explainable Identification of AI-Generated Synthetic Images
Jordan J. Bird, Ahmad Lotfi
arXiv 2023. [Paper]
24 Mar 2023

Denoising Diffusion Autoencoders are Unified Self-supervised Learners
Weilai Xiang, Hongyu Yang, Di Huang, Yunhong Wang
arXiv 2023. [Paper] )]
17 Mar 2023

Boosting Zero-shot Classification with Synthetic Data Diversity via Stable Diffusion
Jordan Shipard, Arnold Wiliem, Kien Nguyen Thanh, Wei Xiang, Clinton Fookes
arXiv 2023. [Paper]
7 Feb 2023

Fake it till you make it: Learning(s) from a synthetic ImageNet clone
Mert Bulent Sariyildiz, Karteek Alahari, Diane Larlus, Yannis Kalantidis
CVPR 2023. [Paper] [Project]
16 Dec 2022

DiffAlign : Few-shot learning using diffusion based synthesis and alignment
Aniket Roy, Anshul Shah, Ketul Shah, Anirban Roy, Rama Chellappa
arXiv 2022. [Paper]
11 Dec 2022

Diffusion Denoising Process for Perceptron Bias in Out-of-distribution Detection
Luping Liu, Yi Ren, Xize Cheng, Zhou Zhao
arXiv 2022. [Paper] [Github]
21 Nov 2022

DiffusionDet: Diffusion Model for Object Detection
Shoufa Chen, Peize Sun, Yibing Song, Ping Luo
arXiv 2022. [Paper] [Github]
17 Nov 2022

Denoising Diffusion Models for Out-of-Distribution Detection
Mark S. Graham, Walter H.L. Pinaya, Petru-Daniel Tudosiu, Parashkev Nachev, Sebastien Ourselin, M. Jorge Cardoso
arXiv 2022. [Paper] [Github]
14 Nov 2022

A simple, efficient and scalable contrastive masked autoencoder for learning visual representations
Shlok Mishra, Joshua Robinson, Huiwen Chang, David Jacobs, Aaron Sarna, Aaron Maschinot, Dilip Krishnan
arXiv 2022. [Paper]
30 Oct 2022

From Points to Functions: Infinite-dimensional Representations in Diffusion Models
Sarthak Mittal, Guillaume Lajoie, Stefan Bauer, Arash Mehrjou
arXiv 2022. [Paper] [Github]
25 Oct 2022

Boomerang: Local sampling on image manifolds using diffusion models
Lorenzo Luzi, Ali Siahkoohi, Paul M Mayer, Josue Casco-Rodriguez, Richard Baraniuk
arXiv 2022. [Paper] [Colab]
21 Oct 2022

Meta-Learning via Classifier(-free) Guidance
Elvis Nava, Seijin Kobayashi, Yifei Yin, Robert K. Katzschmann, Benjamin F. Grewe
arXiv 2022. [Paper]
17 Oct 2022

Segmentation

One-shot Localization and Segmentation of Medical Images with Foundation Models
Deepa Anand, Gurunath Reddy M, Vanika Singhal, Dattesh D. Shanbhag, Shriram KS, Uday Patil, Chitresh Bhushan, Kavitha Manickam, Dawei Gui, Rakesh Mullick, Avinash Gopal, Parminder Bhatia, Taha Kass-Hout
arXiv 2023. [Paper]
28 Oct 2023

Semantic-preserving image coding based on Conditional Diffusion models
Francesco Pezone, Osman Musa, Giuseppe Caire, Sergio Barbarossa
arXiv 2023. [Paper]
24 Oct 2023

Diffusion-based Data Augmentation for Nuclei Image Segmentation
Xinyi Yu, Guanbin Li, Wei Lou, Siqi Liu, Xiang Wan, Yan Chen, Haofeng Li
arXiv 2023. [Paper]
22 Oct 2023

EMIT-Diff: Enhancing Medical Image Segmentation via Text-Guided Diffusion Model
Zheyuan Zhang, Lanhong Yao, Bin Wang, Debesh Jha, Elif Keles, Alpay Medetalibeyoglu, Ulas Bagci
arXiv 2023. [Paper]
19 Oct 2023

Towards Training-free Open-world Segmentation via Image Prompting Foundation Models
Lv Tang, Peng-Tao Jiang, Hao-Ke Xiao, Bo Li
arXiv 2023. [Paper]
17 Oct 2023

Towards Generic Semi-Supervised Framework for Volumetric Medical Image Segmentation
Haonan Wang, Xiaomeng Li
NeurIPS 2023. [Paper] [Github]
17 Oct 2023

Image Augmentation with Controlled Diffusion for Weakly-Supervised Semantic Segmentation
Wangyu Wu, Tianhong Dai, Xiaowei Huang, Fei Ma, Jimin Xiao
arXiv 2023. [Paper]
15 Oct 2023

Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis
Nithin Gopalakrishnan Nair, Anoop Cherian, Suhas Lohit, Ye Wang, Toshiaki Koike-Akino, Vishal M. Patel, Tim K. Marks
ICCV 2023. [Paper]
30 Sep 2023

Factorized Diffusion Architectures for Unsupervised Image Generation and Segmentation
Xin Yuan, Michael Maire
arXiv 2023. [Paper]
27 Sep 2023

Dataset Diffusion: Diffusion-based Synthetic Dataset Generation for Pixel-Level Semantic Segmentation
Quang Nguyen, Truong Vu, Anh Tran, Khoi Nguyen
arXiv 2023. [Paper]
25 Sep 2023

MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation
Jiahao Xie, Wei Li, Xiangtai Li, Ziwei Liu, Yew Soon Ong, Chen Change Loy
arXiv 2023. [Paper] [Github]
22 Sep 2023

Beyond Generation: Harnessing Text to Image Models for Object Detection and Segmentation
Yunhao Ge, Jiashu Xu, Brian Nlong Zhao, Neel Joshi, Laurent Itti, Vibhav Vineet
arXiv 2023. [Paper] [Github]
12 Sep 2023

Introducing Shape Prior Module in Diffusion Model for Medical Image Segmentation
Zhiqing Zhang, Guojia Fan, Tianyong Liu, Nan Li, Yuyang Liu, Ziyu Liu, Canwei Dong, Shoujun Zhou
arXiv 2023. [Paper]
12 Sep 2023

From Text to Mask: Localizing Entities Using the Attention of Text-to-Image Diffusion Models
Changming Xiao, Qi Yang, Feng Zhou, Changshui Zhang
arXiv 2023. [Paper]
8 Sep 2023

SLiMe: Segment Like Me
Aliasghar Khani, Saeid Asgari Taghanaki, Aditya Sanghi, Ali Mahdavi Amiri, Ghassan Hamarneh
arXiv 2023. [Paper] [Github]
6 Sep 2023

Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter
Jinglong Wang, Xiawei Li, Jing Zhang, Qingyuan Xu, Qin Zhou, Qian Yu, Lu Sheng, Dong Xu
arXiv 2023. [Paper]
6 Sep 2023

GenSelfDiff-HIS: Generative Self-Supervision Using Diffusion for Histopathological Image Segmentation
Vishnuvardhan Purma, Suhas Srinath, Seshan Srirangarajan, Aanchal Kakkar, Prathosh A. P
arXiv 2023. [Paper] [Github]
4 Sep 2023

Attention as Annotation: Generating Images and Pseudo-masks for Weakly Supervised Semantic Segmentation with Diffusion
Ryota Yoshihashi, Yuya Otsuka, Kenji Doi, Tomohiro Tanaka
AAAI 2022. [Paper]
4 Sep 2023

ArSDM: Colonoscopy Images Synthesis with Adaptive Refinement Semantic Diffusion Models
Yuhao Du, Yuncheng Jiang, Shuangyi Tan, Xusheng Wu, Qi Dou, Zhen Li, Guanbin Li, Xiang Wan
arXiv 2023. [Paper]
3 Sep 2023

Ref-Diff: Zero-shot Referring Image Segmentation with Generative Models
Minheng Ni, Yabo Zhang, Kailai Feng, Xiaoming Li, Yiwen Guo, Wangmeng Zuo
arXiv 2023. [Paper]
31 Aug 2023

Modality Cycles with Masked Conditional Diffusion for Unsupervised Anomaly Segmentation in MRI
Ziyun Liang, Harry Anthony, Felix Wagner, Konstantinos Kamnitsas
arXiv 2023. [Paper]
30 Aug 2023

A Recycling Training Strategy for Medical Image Segmentation with Diffusion Denoising Models
Yunguan Fu, Yiwen Li, Shaheer U Saeed, Matthew J Clarkson, Yipeng Hu
arXiv 2023. [Paper] [Github]
30 Aug 2023

Diffuse, Attend, and Segment: Unsupervised Zero-Shot Segmentation using Stable Diffusion
Junjiao Tian, Lavisha Aggarwal, Andrea Colaco, Zsolt Kira, Mar Gonzalez-Franco
arXiv 2023. [Paper]
23 Aug 2023

Diffusion-based Image Translation with Label Guidance for Domain Adaptive Semantic Segmentation
Duo Peng, Ping Hu, Qiuhong Ke, Jun Liu
arXiv 2023. [Paper]
23 Aug 2023

DMCVR: Morphology-Guided Diffusion Model for 3D Cardiac Volume Reconstruction
Xiaoxiao He, Chaowei Tan, Ligong Han, Bo Liu, Leon Axel, Kang Li, Dimitris N. Metaxas
MICCAI 2023. [Paper] [Github]
18 Aug 2023

Masked Diffusion as Self-supervised Representation Learner
Zixuan Pan, Jianxu Chen, Yiyu Shi
arXiv 2023. [Paper]
10 Aug 2023

DermoSegDiff: A Boundary-aware Segmentation Diffusion Model for Skin Lesion Delineation
Afshin Bozorgpour, Yousef Sadegheih, Amirhossein Kazerouni, Reza Azad, Dorit Merhof
MICCAI Workshop 2023. [Paper] [Github]
5 Aug 2023

DiffusePast: Diffusion-based Generative Replay for Class Incremental Semantic Segmentation
Jingfan Chen, Yuxi Wang, Pengfei Wang, Xiao Chen, Zhaoxiang Zhang, Zhen Lei, Qing Li
arXiv 2023. [Paper]
2 Aug 2023

DAVIS: High-Quality Audio-Visual Separation with Generative Diffusion Models
Chao Huang, Susan Liang, Yapeng Tian, Anurag Kumar, Chenliang Xu
arXiv 2023. [Paper]
31 Jul 2023

Pre-Training with Diffusion models for Dental Radiography segmentation
Jérémy Rousseau, Christian Alaka, Emma Covili, Hippolyte Mayard, Laura Misrachi, Willy Au
arXiv 2023. [Paper]
26 Jul 2023

FEDD – Fair, Efficient, and Diverse Diffusion-based Lesion Segmentation and Malignancy Classification
Héctor Carrión, Narges Norouzi
MICCAI 2023. [Paper] [Github]
21 Jul 2023

DreamTeacher: Pretraining Image Backbones with Deep Generative Models
Daiqing Li, Huan Ling, Amlan Kar, David Acuna, Seung Wook Kim, Karsten Kreis, Antonio Torralba, Sanja Fidler
arXiv 2023. [Paper] [Project]
14 Jul 2023

Prompting Diffusion Representations for Cross-Domain Semantic Segmentation
Rui Gong, Martin Danelljan, Han Sun, Julio Delgado Mangas, Luc Van Gool
arXiv 2023. [Paper]
5 Jul 2023

DifFSS: Diffusion Model for Few-Shot Semantic Segmentation
Weimin Tan, Siyuan Chen, Bo Yan
arXiv 2023. [Paper]
3 Jul 2023

Towards Better Certified Segmentation via Diffusion Models
Othmane Laousy, Alexandre Araujo, Guillaume Chassagnon, Marie-Pierre Revel, Siddharth Garg, Farshad Khorrami, Maria Vakalopoulou
arXiv 2023. [Paper]
16 Jun 2023

Diffusion Models for Zero-Shot Open-Vocabulary Segmentation
Laurynas Karazija, Iro Laina, Andrea Vedaldi, Christian Rupprecht
arXiv 2023. [Paper]
15 Jun 2023

Annotator Consensus Prediction for Medical Image Segmentation with Diffusion Models
Tomer Amit, Shmuel Shichrur, Tal Shaharabany, Lior Wolf
arXiv 2023. [Paper]
15 Jun 2023

Generative Semantic Communication: Diffusion Models Beyond Bit Recovery
Eleonora Grassucci, Sergio Barbarossa, Danilo Comminiello
arXiv 2023. [Paper] [Github]
7 Jun 2023

Conditional Diffusion Models for Weakly Supervised Medical Image Segmentation
Xinrong Hu, Yu-Jen Chen, Tsung-Yi Ho, Yiyu Shi
arXiv 2023. [Paper]
6 Jun 2023

DFormer: Diffusion-guided Transformer for Universal Image Segmentation
Hefeng Wang, Jiale Cao, Rao Muhammad Anwer, Jin Xie, Fahad Shahbaz Khan, Yanwei Pang
arXiv 2023. [Paper] [Github]
6 Jun 2023

Denoising Diffusion Semantic Segmentation with Mask Prior Modeling
Zeqiang Lai, Yuchen Duan, Jifeng Dai, Ziheng Li, Ying Fu, Hongsheng Li, Yu Qiao, Wenhai Wang
arXiv 2023. [Paper]
2 Jun 2023

Multi-Level Global Context Cross Consistency Model for Semi-Supervised Ultrasound Image Segmentation with Diffusion Model
Fenghe Tang, Jianrui Ding, Lingtao Wang, Min Xian, Chunping Ning
arXiv 2023. [Paper] [Github]
16 May 2023

Echo from noise: synthetic ultrasound image generation using diffusion models for real image segmentation
David Stojanovski, Uxio Hermida, Pablo Lamata, Arian Beqiri, Alberto Gomez
arXiv 2023. [Paper]
9 May 2023

Personalize Segment Anything Model with One Shot
Renrui Zhang, Zhengkai Jiang, Ziyu Guo, Shilin Yan, Junting Pan, Hao Dong, Peng Gao, Hongsheng Li
arXiv 2023. [Paper] [Github]
4 May 2023

Personalize Segment Anything Model with One Shot
Renrui Zhang, Zhengkai Jiang, Ziyu Guo, Shilin Yan, Junting Pan, Hao Dong, Peng Gao, Hongsheng Li
arXiv 2023. [Paper] [Github]
4 May 2023

Unsupervised Discovery of 3D Hierarchical Structure with Generative Diffusion Features
Nurislam Tursynbek, Marc Niethammer
arXiv 2023. [Paper]
28 Apr 2023

DiffuseExpand: Expanding dataset for 2D medical image segmentation using diffusion models
Shitong Shao, Xiaohan Yuan, Zhen Huang, Ziming Qiu, Shuai Wang, Kevin Zhou
arXiv 2023. [Paper] [Github]
26 Apr 2023

Realistic Data Enrichment for Robust Image Segmentation in Histopathology
Sarah Cechnicka, James Ball, Callum Arthurs, Candice Roufosse, Bernhard Kainz
arXiv 2023. [Paper]
19 Apr 2023

Denoising Diffusion Medical Models
Pham Ngoc Huy, Tran Minh Quan
IEEE ISBI 2023. [Paper]
19 Apr 2023

Ambiguous Medical Image Segmentation using Diffusion Models
Aimon Rahman, Jeya Maria Jose Valanarasu, Ilker Hacihaliloglu, Vishal M Patel
CVPR 2023. [Paper] [Github]
10 Apr 2023

BerDiff: Conditional Bernoulli Diffusion Model for Medical Image Segmentation
Tao Chen, Chenhui Wang, Hongming Shan
arXiv 2023. [Paper]
10 Apr 2023

Distribution Aligned Diffusion and Prototype-guided network for Unsupervised Domain Adaptive Segmentation
Haipeng Zhou, Lei Zhu, Yuyin Zhou
arXiv 2023. [Paper]
22 Mar 2023

Semantic Latent Space Regression of Diffusion Autoencoders for Vertebral Fracture Grading
Matthias Keicher, Matan Atad, David Schinz, Alexandra S. Gersing, Sarah C. Foreman, Sophia S. Goller, Juergen Weissinger, Jon Rischewski, Anna-Sophia Dietrich, Benedikt Wiestler, Jan S. Kirschke, Nassir Navab
arXiv 2023. [Paper]
21 Mar 2023

LD-ZNet: A Latent Diffusion Approach for Text-Based Image Segmentation
Koutilya Pnvr, Bharat Singh, Pallabi Ghosh, Behjat Siddiquie, David Jacobs
arXiv 2023. [Paper]
22 Mar 2023

DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models
Weijia Wu, Yuzhong Zhao, Mike Zheng Shou, Hong Zhou, Chunhua Shen
arXiv 2023. [Paper] [Project]
21 Mar 2023

Object-Centric Slot Diffusion
Jindong Jiang, Fei Deng, Gautam Singh, Sungjin Ahn
arXiv 2023. [Paper]
20 Mar 2023

Diff-UNet: A Diffusion Embedded Network for Volumetric Segmentation
Zhaohu Xing, Liang Wan, Huazhu Fu, Guang Yang, Lei Zhu
arXiv 2023. [Paper] [Github]
18 Mar 2023

DiffusionSeg: Adapting Diffusion Towards Unsupervised Object Discovery
Chaofan Ma, Yuhuan Yang, Chen Ju, Fei Zhang, Jinxiang Liu, Yu Wang, Ya Zhang, Yanfeng Wang
arXiv 2023. [Paper]
17 Mar 2023

Stochastic Segmentation with Conditional Categorical Diffusion Models
Lukas Zbinden, Lars Doorenbos, Theodoros Pissas, Raphael Sznitman, Pablo Márquez-Neila
ICCV 2023. [Paper] [Github]
15 Mar 2023

DiffBEV: Conditional Diffusion Model for Bird’s Eye View Perception
Jiayu Zou, Zheng Zhu, Yun Ye, Xingang Wang
arXiv 2023. [Paper]
15 Mar 2023

Importance of Aligning Training Strategy with Evaluation for Diffusion Models in 3D Multiclass Segmentation
Yunguan Fu, Yiwen Li, Shaheer U. Saeed, Matthew J. Clarkson, Yipeng Hu
arXiv 2023. [Paper] [Github]
10 Mar 2023

MaskDiff: Modeling Mask Distribution with Diffusion Probabilistic Model for Few-Shot Instance Segmentation
Minh-Quan Le, Tam V. Nguyen, Trung-Nghia Le, Thanh-Toan Do, Minh N. Do, Minh-Triet Tran
arXiv 2023. [Paper]
9 Mar 2023

Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models
Jiarui Xu, Sifei Liu, Arash Vahdat, Wonmin Byeon, Xiaolong Wang, Shalini De Mello
arXiv 2023. [Paper] [Project]
8 Mar 2023

MedSegDiff-V2: Diffusion based Medical Image Segmentation with Transformer
Junde Wu, Rao Fu, Huihui Fang, Yu Zhang, Yanwu Xu
arXiv 2023. [Paper]
19 Jan 2023

DiffusionInst: Diffusion Model for Instance Segmentation
Zhangxuan Gu, Haoxing Chen, Zhuoer Xu, Jun Lan, Changhua Meng, Weiqiang Wang
arXiv 2022. [Paper] [Github]
6 DEc 2022

Multi-Class Segmentation from Aerial Views using Recursive Noise Diffusion
Benedikt Kolbeinsson, Krystian Mikolajczyk
arXiv 2022. [Paper]
1 Dec 2022

Peekaboo: Text to Image Diffusion Models are Zero-Shot Segmentors
Ryan Burgert, Kanchana Ranasinghe, Xiang Li, Michael S. Ryoo
arXiv 2022. [Paper]
23 Nov 2022

Improved HER2 Tumor Segmentation with Subtype Balancing using Deep Generative Networks
Mathias Öttl, Jana Mönius, Matthias Rübner, Carol I. Geppert, Jingna Qiu, Frauke Wilm, Arndt Hartmann, Matthias W. Beckmann, Peter A. Fasching, Andreas Maier, Ramona Erber, Katharina Breininger
arXiv 2022. [Paper]
11 Nov 2022

MedSegDiff: Medical Image Segmentation with Diffusion Probabilistic Model
Junde Wu, Huihui Fang, Yu Zhang, Yehui Yang, Yanwu Xu
arXiv 2022. [Paper]
1 Nov 2022

Accelerating Diffusion Models via Pre-segmentation Diffusion Sampling for Medical Image Segmentation
Xutao Guo, Yanwu Yang, Chenfei Ye, Shang Lu, Yang Xiang, Ting Ma
arXiv 2022. [Paper]
27 Oct 2022

Anatomically constrained CT image translation for heterogeneous blood vessel segmentation
Giammarco La Barbera, Haithem Boussaid, Francesco Maso, Sabine Sarnacki, Laurence Rouet, Pietro Gori, Isabelle Bloch
BMVC 2022. [Paper]
4 Oct 2022

Diffusion Adversarial Representation Learning for Self-supervised Vessel Segmentation
Boah Kim, Yujin Oh, Jong Chul Ye
arXiv 2022. [Paper]
29 Sep 2022

Can segmentation models be trained with fully synthetically generated data?
Virginia Fernandez, Walter Hugo Lopez Pinaya, Pedro Borges, Petru-Daniel Tudosiu, Mark S Graham, Tom Vercauteren, M Jorge Cardoso
arXiv 2022. [Paper]
17 Sep 2022

Let us Build Bridges: Understanding and Extending Diffusion Generative Models
Xingchao Liu, Lemeng Wu, Mao Ye, Qiang Liu
arXiv 2022. [Paper]
31 Aug 2022

Semantic Image Synthesis via Diffusion Models
Weilun Wang, Jianmin Bao, Wengang Zhou, Dongdong Chen, Dong Chen, Lu Yuan, Houqiang Li
arXiv 2022. [Paper]
30 Jun 2022

Remote Sensing Change Detection (Segmentation) using Denoising Diffusion Probabilistic Models
Wele Gedara Chaminda Bandara, Nithin Gopalakrishnan Nair, Vishal M. Patel
arXiv 2022. [Paper] [Github]
23 Jun 2022

Diffusion models as plug-and-play priors
Alexandros Graikos, Nikolay Malkin, Nebojsa Jojic, Dimitris Samaras
arXiv 2022. [Paper]
17 Jun 2022

Fast Unsupervised Brain Anomaly Detection and Segmentation with Diffusion Models
Walter H. L. Pinaya, Mark S. Graham, Robert Gray, Pedro F Da Costa, Petru-Daniel Tudosiu, Paul Wright, Yee H. Mah, Andrew D. MacKinnon, James T. Teo, Rolf Jager, David Werring, Geraint Rees, Parashkev Nachev, Sebastien Ourselin, M. Jorge Cardos
MICCAI 2022. [Paper]
7 Jun 2022

Decoder Denoising Pretraining for Semantic Segmentation
Emmanuel Brempong Asiedu, Simon Kornblith, Ting Chen, Niki Parmar, Matthias Minderer, Mohammad Norouzi
arXiv 2022. [Paper]
23 May 2022

Diffusion Models for Implicit Image Segmentation Ensembles
Julia Wolleb, Robin Sandkühler, Florentin Bieder, Philippe Valmaggia, Philippe C. Cattin
MIDL 2021. [Paper]
6 Dec 2021

Label-Efficient Semantic Segmentation with Diffusion Models
Dmitry Baranchuk, Ivan Rubachev, Andrey Voynov, Valentin Khrulkov, Artem Babenko
ICLR 2021. [Paper] [Github]
6 Dec 2021

SegDiff: Image Segmentation with Diffusion Probabilistic Models
Tomer Amit, Eliya Nachmani, Tal Shaharbany, Lior Wolf
arXiv 2021. [Paper]
1 Dec 2021

Argmax Flows and Multinomial Diffusion: Learning Categorical Distributions
Emiel Hoogeboom, Didrik Nielsen, Priyank Jaini, Patrick Forré, Max Welling
NeurIPS 2021. [Paper]
10 Feb 2021

Image Translation

Latent Diffusion Counterfactual Explanations
Karim Farid, Simon Schrodi, Max Argus, Thomas Brox
arXiv 2023. [Paper]
10 Oct 2023

Phasic Content Fusing Diffusion Model with Directional Distribution Consistency for Few-Shot Model Adaption
Teng Hu, Jiangning Zhang, Liang Liu, Ran Yi, Siqi Kou, Haokun Zhu, Xu Chen, Yabiao Wang, Chengjie Wang, Lizhuang Ma
ICCV 2023. [Paper]
7 Sep 2023

Latent Painter
Shih-Chieh Su
arXiv 2023. [Paper]
31 Aug 2023

Zero-shot Inversion Process for Image Attribute Editing with Diffusion Models
Zhanbo Feng, Zenan Ling, Ci Gong, Feng Zhou, Jie Li, Robert C. Qiu
arXiv 2023. [Paper]
30 Aug 2023

DiffI2I: Efficient Diffusion Model for Image-to-Image Translation
Bin Xia, Yulun Zhang, Shiyin Wang, Yitong Wang, Xinglong Wu, Yapeng Tian, Wenming Yang, Radu Timotfe, Luc Van Gool
arXiv 2023. [Paper]
26 Aug 2023

SSMG: Spatial-Semantic Map Guided Diffusion Model for Free-form Layout-to-Image Generation
Chengyou Jia, Minnan Luo, Zhuohang Dang, Guang Dai, Xiaojun Chang, Mengmeng Wang, Jingdong Wang
arXiv 2023. [Paper]
20 Aug 2023

MeDM: Mediating Image Diffusion Models for Video-to-Video Translation with Temporal Correspondence Guidance
Ernie Chu, Tzuhsuan Huang, Shuo-Yen Lin, Jun-Cheng Chen
arXiv 2023. [Paper] [Project]
19 Aug 2023

StyleDiffusion: Controllable Disentangled Style Transfer via Diffusion Models
Zhizhong Wang, Lei Zhao, Wei Xing
arXiv 2023. [Paper]
15 Aug 2023

Inversion-by-Inversion: Exemplar-based Sketch-to-Photo Synthesis via Stochastic Differential Equations without Training
Ximing Xing, Chuang Wang, Haitao Zhou, Zhihao Hu, Chongxuan Li, Dong Xu, Qian Yu
arXiv 2023. [Paper]
15 Aug 2023

Taming the Power of Diffusion Models for High-Quality Virtual Try-On with Appearance Flow
Junhong Gou, Siyu Sun, Jianfu Zhang, Jianlou Si, Chen Qian, Liqing Zhang
ACM MM 2023. [Paper]
11 Aug 2023

Head Rotation in Denoising Diffusion Models
Andrea Asperti, Gabriele Colasuonno, Antonio Guerra
arXiv 2023. [Paper]
11 Aug 2023

Photorealistic and Identity-Preserving Image-Based Emotion Manipulation with Latent Diffusion Models
Ioannis Pikoulis, Panagiotis P. Filntisis, Petros Maragos
arXiv 2023. [Paper]
6 Aug 2023

SDDM: Score-Decomposed Diffusion Models on Manifolds for Unpaired Image-to-Image Translation
Shikun Sun, Longhui Wei, Junliang Xing, Jia Jia, Qi Tian
ICML 2023. [Paper]
4 Aug 2023

Interpolating between Images with Diffusion Models
Clinton J. Wang, Polina Golland
ICML Workshop 2023. [Paper] [Project] [Github]
24 Jul 2023

TF-ICON: Diffusion-Based Training-Free Cross-Domain Image Composition
Shilin Lu, Yanzhu Liu, Adams Wai-Kin Kong
ICCV 2023. [Paper] [Github]
24 Jul 2023

DiffuseGAE: Controllable and High-fidelity Image Manipulation from Disentangled Representation
Yipeng Leng, Qiangjuan Huang, Zhiyuan Wang, Yangyang Liu, Haoyu Zhang
arXiv 2023. [Paper]
12 Jul 2023

DIFF-NST: Diffusion Interleaving For deFormable Neural Style Transfer
Dan Ruta, Gemma Canet Tarrés, Andrew Gilbert, Eli Shechtman, Nicholas Kolkin, John Collomosse
arXiv 2023. [Paper]
9 Jul 2023

Applying a Color Palette with Local Control using Diffusion Models
Vaibhav Vavilala, David Forsyth
arXiv 2023. [Paper]
6 Jul 2023

DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models
Chong Mou, Xintao Wang, Jiechong Song, Ying Shan, Jian Zhang
arXiv 2023. [Paper] [Project]
5 Jul 2023

DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing
Yujun Shi, Chuhui Xue, Jiachun Pan, Wenqing Zhang, Vincent Y. F. Tan, Song Bai
arXiv 2023. [Paper]
26 Jun 2023

ArtFusion: Controllable Arbitrary Style Transfer using Dual Conditional Latent Diffusion Models
Dar-Yen Chen
arXiv 2023. [Paper] [Github]
15 Jun 2023

InfoDiffusion: Representation Learning Using Information Maximizing Diffusion Models
Yingheng Wang, Yair Schiff, Aaron Gokaslan, Weishen Pan, Fei Wang, Christopher De Sa, Volodymyr Kuleshov
ICML 2023. [Paper]
14 Jun 2023

TryOnDiffusion: A Tale of Two UNets
Luyang Zhu, Dawei Yang, Tyler Zhu, Fitsum Reda, William Chan, Chitwan Saharia, Mohammad Norouzi, Ira Kemelmacher-Shlizerman
CVPR 2023. [Paper]
14 Jun 2023

Improving Diffusion-based Image Translation using Asymmetric Gradient Guidance
Gihyun Kwon, Jong Chul Ye
arXiv 2023. [Paper]
7 Jun 2023

DiffSketching: Sketch Control Image Synthesis with Diffusion Models
Qiang Wang, Di Kong, Fengyin Lin, Yonggang Qi
arXiv 2023. [Paper]
30 May 2023

Real-World Image Variation by Aligning Diffusion Inversion Chain
Yuechen Zhang, Jinbo Xing, Eric Lo, Jiaya Jia
arXiv 2023. [Paper]
30 May 2023

Photoswap: Personalized Subject Swapping in Images
Jing Gu, Yilin Wang, Nanxuan Zhao, Tsu-Jui Fu, Wei Xiong, Qing Liu, Zhifei Zhang, He Zhang, Jianming Zhang, HyunJoon Jung, Xin Eric Wang
arXiv 2023. [Paper] [Project]
29 May 2023

Diversify Your Vision Datasets with Automatic Diffusion-Based Augmentation
Lisa Dunlap, Alyssa Umino, Han Zhang, Jiezhi Yang, Joseph E. Gonzalez, Trevor Darrell
arXiv 2023. [Paper] [Github]
25 May 2023

Unpaired Image-to-Image Translation via Neural Schrödinger Bridge
Beomsu Kim, Gihyun Kwon, Kwanyoung Kim, Jong Chul Ye
arXiv 2023. [Paper] [Github]
24 May 2023

SAR-to-Optical Image Translation via Thermodynamics-inspired Network
Mingjin Zhang, Jiamin Xu, Chengyu He, Wenteng Shang, Yunsong Li, Xinbo Gao
arXiv 2023. [Paper]
23 May 2023

Null-text Guidance in Diffusion Models is Secretly a Cartoon-style Creator
Jing Zhao, Heliang Zheng, Chaoyue Wang, Long Lan, Wanrong Huang, Wenjing Yang
arXiv 2023. [Paper] [Project] [Github]
11 May 2023

ReGeneration Learning of Diffusion Models with Rich Prompts for Zero-Shot Image Translation
Yupei Lin, Sen Zhang, Xiaojun Yang, Xiao Wang, Yukai Shi
arXiv 2023. [Paper] [Project]
8 May 2023

Hierarchical Diffusion Autoencoders and Disentangled Image Manipulation
Zeyu Lu, Chengyue Wu, Xinyuan Chen, Yaohui Wang, Yu Qiao, Xihui Liu
arXiv 2023. [Paper]
24 Apr 2023

DiffusionRig: Learning Personalized Priors for Facial Appearance Editing
Zheng Ding, Xuaner Zhang, Zhihao Xia, Lars Jebe, Zhuowen Tu, Xiuming Zhang
CVPR 2023. [Paper] [Project] [Github]
13 Apr 2023

Face Animation with an Attribute-Guided Diffusion Model
Bohan Zeng, Xuhui Liu, Sicheng Gao, Boyu Liu, Hong Li, Jianzhuang Liu, Baochang Zhang
arXiv 2023. [Paper]
6 Apr 2023

Reference-based Image Composition with Sketch via Structure-aware Diffusion Model
Kangyeol Kim, Sunghyun Park, Junsoo Lee, Jaegul Choo
arXiv 2023. [Paper]
31 Mar 2023

Training-free Style Transfer Emerges from h-space in Diffusion models
Jaeseok Jeong, Mingi Kwon, Youngjung Uh
arXiv 2023. [Paper] [Project] [Github]
27 Mar 2023

Diffusion-based Target Sampler for Unsupervised Domain Adaptation
Yulong Zhang, Shuhao Chen, Yu Zhang, Jiangang Lu
arXiv 2023. [Paper]
17 Mar 2023

StyO: Stylize Your Face in Only One-Shot
Bonan Li, Zicheng Zhang, Xuecheng Nie, Congying Han, Yinhan Hu, Tiande Guo
arXiv 2023. [Paper]
6 Mar 2023

DiffFashion: Reference-based Fashion Design with Structure-aware Transfer by Diffusion Models
Shidong Cao, Wenhao Chai, Shengyu Hao, Yanting Zhang, Hangyue Chen, Gaoang Wang
arXiv 2023. [Paper]
14 Feb 2023

I2SB: Image-to-Image Schrödinger Bridge
Guan-Horng Liu, Arash Vahdat, De-An Huang, Evangelos A. Theodorou, Weili Nie, Anima Anandkumar
arXiv 2023. [Paper] [Project]
12 Feb 2023

Zero-shot-Learning Cross-Modality Data Translation Through Mutual Information Guided Stochastic Diffusion
Zihao Wang, Yingyu Yang, Maxime Sermesant, Hervé Delingette, Ona Wu
arXiv 2023. [Paper]
31 Jan 2023

DiffFace: Diffusion-based Face Swapping with Facial Guidance
Kihong Kim, Yunho Kim, Seokju Cho, Junyoung Seo, Jisu Nam, Kychul Lee, Seungryong Kim, KwangHee Lee
arXiv 2022. [Paper] [Project]
27 Dec 2022

HS-Diffusion: Learning a Semantic-Guided Diffusion Model for Head Swapping
Qinghe Wang, Lijie Liu, Miao Hua, Qian He, Pengfei Zhu, Bing Cao, Qinghua Hu
arXiv 2022. [Paper]
13 Dec 2022

Inversion-Based Creativity Transfer with Diffusion Models
Yuxin Zhang, Nisha Huang, Fan Tang, Haibin Huang, Chongyang Ma, Weiming Dong, Changsheng Xu
CVPR 2023. [Paper] [Github]
23 Nov 2022

Person Image Synthesis via Denoising Diffusion Model
Ankan Kumar Bhunia, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, Jorma Laaksonen, Mubarak Shah, Fahad Shahbaz Khan
arXiv 2022. [Paper]
22 Nov 2022

Unifying Diffusion Models’ Latent Space, with Applications to CycleDiffusion and Guidance
Chen Henry Wu, Fernando De la Torre
arXiv 2022. [Paper] [Github-1] [Github-2]
11 Oct 2022

Anatomically constrained CT image translation for heterogeneous blood vessel segmentation
Giammarco La Barbera, Haithem Boussaid, Francesco Maso, Sabine Sarnacki, Laurence Rouet, Pietro Gori, Isabelle Bloch
BMVC 2022. [Paper]
4 Oct 2022

Diffusion-based Image Translation using Disentangled Style and Content Representation
Gihyun Kwon, Jong Chul Ye
arXiv 2022. [Paper]
30 Sep 2022

MIDMs: Matching Interleaved Diffusion Models for Exemplar-based Image Translation
Junyoung Seo, Gyuseong Lee, Seokju Cho, Jiyoung Lee, Seungryong Kim
arXiv 2022. [Paper] [Project]
22 Sep 2022

Restoring Vision in Adverse Weather Conditions with Patch-Based Denoising Diffusion Models
Ozan Özdenizci, Robert Legenstein
arXiv 2022. [Paper]
29 Jul 2022

Non-Uniform Diffusion Models
Georgios Batzolis, Jan Stanczuk, Carola-Bibiane Schönlieb, Christian Etmann
arXiv 2022. [Paper]
20 Jul 2022

Unsupervised Medical Image Translation with Adversarial Diffusion Models
Muzaffer Özbey, Salman UH Dar, Hasan A Bedel, Onat Dalmaz, Şaban Özturk, Alper Güngör, Tolga Çukur
arXiv 2022. [Paper]
17 Jul 2022

EGSDE: Unpaired Image-to-Image Translation via Energy-Guided Stochastic Differential Equations
Min Zhao, Fan Bao, Chongxuan Li, Jun Zhu
arXiv 2022. [Paper]
14 Jul 2022

Discrete Contrastive Diffusion for Cross-Modal and Conditional Generation
Ye Zhu, Yu Wu, Kyle Olszewski, Jian Ren, Sergey Tulyakov, Yan Yan
arXiv 2022. [Paper] [Github]
15 Jun 2022

Pretraining is All You Need for Image-to-Image Translation
Tengfei Wang, Ting Zhang, Bo Zhang, Hao Ouyang, Dong Chen, Qifeng Chen, Fang Wen
arXiv 2022. [Paper] [Project] [Github]
25 May 2022

VQBB: Image-to-image Translation with Vector Quantized Brownian Bridge
Bo Li, Kaitao Xue, Bin Liu, Yu-Kun Lai
arXiv 2022. [Paper]
16 May 2022

The Swiss Army Knife for Image-to-Image Translation: Multi-Task Diffusion Models
Julia Wolleb, Robin Sandkühler, Florentin Bieder, Philippe C. Cattin
arXiv 2022. [Paper]
6 Apr 2022

Dual Diffusion Implicit Bridges for Image-to-Image Translation
Xuan Su, Jiaming Song, Chenlin Meng, Stefano Ermon
arXiv 2022. [Paper]
16 Mar 2022

Denoising Diffusion Restoration Models
Bahjat Kawar, Michael Elad, Stefano Ermon, Jiaming Song
NeurIPS 2022. [Paper]
27 Jan 2022

DiffuseMorph: Unsupervised Deformable Image Registration Along Continuous Trajectory Using Diffusion Models
Boah Kim, Inhwa Han, Jong Chul Ye
arXiv 2021. [Paper]
9 Dec 2021

Diffusion Autoencoders: Toward a Meaningful and Decodable Representation
Konpat Preechakul, Nattanat Chatthee, Suttisak Wizadwongsa, Supasorn Suwajanakorn
arXiv 2021. [Paper] [Project]
30 Dec 2021

Conditional Image Generation with Score-Based Diffusion Models
Georgios Batzolis, Jan Stanczuk, Carola-Bibiane Schönlieb, Christian Etmann
arXiv 2021. [Paper]
26 Nov 2021

ILVR: Conditioning Method for Denoising Diffusion Probabilistic Models
Jooyoung Choi, Sungwon Kim, Yonghyun Jeong, Youngjune Gwon, Sungroh Yoon
ICCV 2021 (Oral). [Paper] [Github]
6 Aug 2021

UNIT-DDPM: UNpaired Image Translation with Denoising Diffusion Probabilistic Models
Hiroshi Sasaki, Chris G. Willcocks, Toby P. Breckon
arXiv 2021. [Paper]
12 Apr 2021

Inverse Problems

EDiffSR: An Efficient Diffusion Probabilistic Model for Remote Sensing Image Super-Resolution
Yi Xiao, Qiangqiang Yuan, Kui Jiang, Jiang He, Xianyu Jin, Liangpei Zhang
arXiv 2023. [Paper]
30 Oct 2023

Global Structure-Aware Diffusion Process for Low-Light Image Enhancement
Jinhui Hou, Zhiyu Zhu, Junhui Hou, Hui Liu, Huanqiang Zeng, Hui Yuan
arXiv 2023. [Paper]
26 Oct 2023

From Posterior Sampling to Meaningful Diversity in Image Restoration
Noa Cohen, Hila Manor, Yuval Bahat, Tomer Michaeli
arXiv 2023. [Paper]
24 Oct 2023

Diffusion-Model-Assisted Supervised Learning of Generative Models for Density Estimation
Yanfang Liu, Minglei Yang, Zezhong Zhang, Feng Bao, Yanzhao Cao, Guannan Zhang
arXiv 2023. [Paper]
22 Oct 2023

High-Quality 3D Face Reconstruction with Affine Convolutional Networks
Zhiqian Lin, Jiangke Lin, Lincheng Li, Yi Yuan, Zhengxia Zou
arXiv 2023. [Paper]
22 Oct 2023

Image Super-resolution Via Latent Diffusion: A Sampling-space Mixture Of Experts And Frequency-augmented Decoder Approach
Feng Luo, Jinxi Xiang, Jun Zhang, Xiao Han, Wei Yang
arXiv 2023. [Paper]
18 Oct 2023

Towards image compression with perfect realism at ultra-low bitrates
Marlène Careil, Matthew J. Muckley, Jakob Verbeek, Stéphane Lathuilière
arXiv 2023. [Paper]
16 Oct 2023

AutoDIR: Automatic All-in-One Image Restoration with Latent Diffusion
Yitong Jiang, Zhaoyang Zhang, Tianfan Xue, Jinwei Gu
arXiv 2023. [Paper]
16 Oct 2023

Exploring the Design Space of Diffusion Autoencoders for Face Morphing
Zander Blasingame, Chen Liu
arXiv 2023. [Paper]
14 Oct 2023

Diffusion Prior Regularized Iterative Reconstruction for Low-dose CT
Wenjun Xia, Yongyi Shi, Chuang Niu, Wenxiang Cong, Ge Wang
arXiv 2023. [Paper]
10 Oct 2023

SMRD: SURE-based Robust MRI Reconstruction with Diffusion Models
Batu Ozturkler, Chao Liu, Benjamin Eckart, Morteza Mardani, Jiaming Song, Jan Kautz
MICCAI 2023. [Paper] [Github]
3 Oct 2023

Conditional Diffusion Distillation
Kangfu Mei, Mauricio Delbracio, Hossein Talebi, Zhengzhong Tu, Vishal M. Patel, Peyman Milanfar
arXiv 2023. [Paper]
2 Oct 2023

CommIN: Semantic Image Communications as an Inverse Problem with INN-Guided Diffusion Models
Jiakang Chen, Di You, Deniz Gündüz, Pier Luigi Dragotti
arXiv 2023. [Paper]
2 Oct 2023

Prompt-tuning latent diffusion models for inverse problems
Hyungjin Chung, Jong Chul Ye, Peyman Milanfar, Mauricio Delbracio
arXiv 2023. [Paper]
2 Oct 2023

Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis
Nithin Gopalakrishnan Nair, Anoop Cherian, Suhas Lohit, Ye Wang, Toshiaki Koike-Akino, Vishal M. Patel, Tim K. Marks
ICCV 2023. [Paper]
30 Sep 2023

Generating Visual Scenes from Touch
Fengyu Yang, Jiacheng Zhang, Andrew Owens
ICCV 2023. [Paper] [Project]
26 Sep 2023

Bootstrap Diffusion Model Curve Estimation for High Resolution Low-Light Image Enhancement
Jiancheng Huang, Yifan Liu, Shifeng Chen
arXiv 2023. [Paper]
26 Sep 2023

Multiple Noises in Diffusion Model for Semi-Supervised Multi-Domain Translation
Tsiry Mayet, Simon Bernard, Clement Chatelain, Romain Herault
arXiv 2023. [Paper]
25 Sep 2023

Domain-Guided Conditional Diffusion Model for Unsupervised Domain Adaptation
Yulong Zhang, Shuhao Chen, Weisen Jiang, Yu Zhang, Jiangang Lu, James T. Kwok
arXiv 2023. [Paper]
23 Sep 2023

License Plate Super-Resolution Using Diffusion Models
Sawsan AlHalawani, Bilel Benjdira, Adel Ammar, Anis Koubaa, Anas M. Ali
arXiv 2023. [Paper]
21 Sep 2023

Deshadow-Anything: When Segment Anything Model Meets Zero-shot shadow removal
Xiao Feng Zhang, Tian Yi Song, Jia Wei Yao
arXiv 2023. [Paper]
21 Sep 2023

Face Aging via Diffusion-based Editing
Xiangyi Chen, Stéphane Lathuilière
arXiv 2023. [Paper]
20 Sep 2023

PGDiff: Guiding Diffusion Models for Versatile Face Restoration via Partial Guidance
Peiqing Yang, Shangchen Zhou, Qingyi Tao, Chen Change Loy
NeurIPS 2023. [Paper] [Github]
19 Sep 2023

Reconstruct-and-Generate Diffusion Model for Detail-Preserving Image Denoising
Yujin Wang, Lingen Li, Tianfan Xue, Jinwei Gu
arXiv 2023. [Paper]
19 Sep 2023

Gradpaint: Gradient-Guided Inpainting with Diffusion Models
Asya Grechka, Guillaume Couairon, Matthieu Cord
arXiv 2023. [Paper]
18 Sep 2023

AdBooster: Personalized Ad Creative Generation using Stable Diffusion Outpainting
Veronika Shilova, Ludovic Dos Santos, Flavian Vasile, Gaëtan Racic, Ugo Tanielian
arXiv 2023. [Paper]
8 Sep 2023

Underwater Image Enhancement by Transformer-based Diffusion Model with Non-uniform Sampling for Skip Strategy
Yi Tang, Takafumi Iwaguchi, Hiroshi Kawasaki
ACM MM 2023. [Paper] [Github]
7 Sep 2023

Efficient Bayesian Computational Imaging with a Surrogate Score-Based Prior
Berthy T. Feng, Katherine L. Bouman
arXiv 2023. [Paper]
5 Sep 2023

Diffusion Modeling with Domain-conditioned Prior Guidance for Accelerated MRI and qMRI Reconstruction
Wanyu Bian, Albert Jang, Fang Liu
arXiv 2023. [Paper]
2 Sep 2023

Correlated and Multi-frequency Diffusion Modeling for Highly Under-sampled MRI Reconstruction
Yu Guan, Chuanming Yu, Shiyu Lu, Zhuoxu Cui, Dong Liang, Qiegen Liu
arXiv 2023. [Paper] [Github]
2 Sep 2023

Fast Diffusion EM: a diffusion model for blind inverse problems with application to deconvolution
Charles Laroche, Andrés Almansa, Eva Coupete
arXiv 2023. [Paper] [Github]
1 Sep 2023

Unsupervised CT Metal Artifact Reduction by Plugging Diffusion Priors in Dual Domains
Xuan Liu, Yaoqin Xie, Songhui Diao, Shan Tan, Xiaokun Liang
arXiv 2023. [Paper]
31 Aug 2023

Stage-by-stage Wavelet Optimization Refinement Diffusion Model for Sparse-View CT Reconstruction
Kai Xu, Shiyu Lu, Bin Huang, Weiwen Wu, Qiegen Liu
arXiv 2023. [Paper]
30 Aug 2023

Physics-Informed DeepMRI: Bridging the Gap from Heat Diffusion to k-Space Interpolation
Zhuo-Xu Cui, Congcong Liu, Xiaohong Fan, Chentao Cao, Jing Cheng, Qingyong Zhu, Yuanyuan Liu, Sen Jia, Yihang Zhou, Haifeng Wang, Yanjie Zhu, Jianping Zhang, Qiegen Liu, Dong Liang
arXiv 2023. [Paper]
30 Aug 2023

DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior
Xinqi Lin, Jingwen He, Ziyan Chen, Zhaoyang Lyu, Ben Fei, Bo Dai, Wanli Ouyang, Yu Qiao, Chao Dong
arXiv 2023. [Paper] [Github]
29 Aug 2023

Pixel-Aware Stable Diffusion for Realistic Image Super-resolution and Personalized Stylization
Tao Yang, Peiran Ren, Xuansong Xie, Lei Zhang
AAAI 2024. [Paper]
28 Aug 2023

Data-iterative Optimization Score Model for Stable Ultra-Sparse-View CT Reconstruction
Weiwen Wu, Yanyang Wang
arXiv 2023. [Paper]
28 Aug 2023

Residual Denoising Diffusion Models
Jiawei Liu, Qiang Wang, Huijie Fan, Yinong Wang, Yandong Tang, Liangqiong Qu
arXiv 2023. [Paper] [Github]
25 Aug 2023

Diff-Retinex: Rethinking Low-light Image Enhancement with A Generative Diffusion Model
Xunpeng Yi, Han Xu, Hao Zhang, Linfeng Tang, Jiayi Ma
ICCV 2023. [Paper]
25 Aug 2023

Full-dose PET Synthesis from Low-dose PET Using High-efficiency Diffusion Denoising Probabilistic Model
Shaoyan Pan, Elham Abouei, Junbo Peng, Joshua Qian, Jacob F Wynne, Tonghe Wang, Chih-Wei Chang, Justin Roper, Jonathon A Nye, Hui Mao, Xiaofeng Yang
arXiv 2023. [Paper]
24 Aug 2023

InverseSR: 3D Brain MRI Super-Resolution Using a Latent Diffusion Model
Jueqi Wang, Jacob Levman, Walter Hugo Lopez Pinaya, Petru-Daniel Tudosiu, M. Jorge Cardoso, Razvan Marinescu
MICCAI 2023. [Paper] [Github]
23 Aug 2023

High-quality Image Dehazing with Diffusion Model
Hu Yu, Jie Huang, Kaiwen Zheng, Man Zhou, Feng Zhao
arXiv 2023. [Paper]
23 Aug 2023

Frequency Compensated Diffusion Model for Real-scene Dehazing
Jing Wang, Songtao Wu, Kuanhong Xu, Zhiqiang Yuan
arXiv 2023. [Paper]
21 Aug 2023

Contrastive Diffusion Model with Auxiliary Guidance for Coarse-to-Fine PET Reconstruction
Zeyu Han, Yuhan Wang, Luping Zhou, Peng Wang, Binyu Yan, Jiliu Zhou, Yan Wang, Dinggang Shen
MICCAI 2023. [Paper] [Github]
20 Aug 2023

DiffLLE: Diffusion-guided Domain Calibration for Unsupervised Low-light Image Enhancement
Shuzhou Yang, Xuanyu Zhang, Yinhuai Wang, Jiwen Yu, Yuhan Wang, Jian Zhang
arXiv 2023. [Paper]
18 Aug 2023

Learning A Coarse-to-Fine Diffusion Transformer for Image Restoration
Liyan Wang, Qinyu Yang, Cong Wang, Wei Wang, Jinshan Pan, Zhixun Su
arXiv 2023. [Paper]
17 Aug 2023

Monte Carlo guided Diffusion for Bayesian linear inverse problems
Gabriel Cardoso, Yazid Janati El Idrissi, Sylvain Le Corff, Eric Moulines
arXiv 2023. [Paper]
15 Aug 2023

Geometry of the Visual Cortex with Applications to Image Inpainting and Enhancement
Francesco Ballerin, Erlend Grong
arXiv 2023. [Paper] [Github]
15 Aug 2023

YODA: You Only Diffuse Areas. An Area-Masked Diffusion Approach For Image Super-Resolution
Brian B. Moser, Stanislav Frolov, Federico Raue, Sebastian Palacio, Andreas Dengel
arXiv 2023. [Paper]
15 Aug 2023

TextDiff: Mask-Guided Residual Diffusion Models for Scene Text Image Super-Resolution
Baolin Liu, Zongyuan Yang, Pengfei Wang, Junjie Zhou, Ziqi Liu, Ziyi Song, Yan Liu, Yongping Xiong
AAAI 2024. [Paper]
13 Aug 2023

CLE Diffusion: Controllable Light Enhancement Diffusion Model
Yuyang Yin, Dejia Xu, Chuangchuang Tan, Ping Liu, Yao Zhao, Yunchao Wei
arXiv 2023. [Paper] [Project] [Github]
13 Aug 2023

Diffusion-Augmented Depth Prediction with Sparse Annotations
Jiaqi Li, Yiran Wang, Zihao Huang, Jinghong Zheng, Ke Xian, Zhiguo Cao, Jianming Zhang
arXiv 2023. [Paper]
4 Aug 2023

Painterly Image Harmonization using Diffusion Model
Lingxiao Lu, Jiangtong Li, Junyan Cao, Li Niu, Liqing Zhang
arXiv 2023. [Paper]
4 Aug 2023

Reference-Free Isotropic 3D EM Reconstruction using Diffusion Models
Kyungryun Lee, Won-Ki Jeong
arXiv 2023. [Paper]
3 Aug 2023

Learning Fourier-Constrained Diffusion Bridges for MRI Reconstruction
Muhammad U. Mirza, Onat Dalmaz, Hasan A. Bedel, Gokberk Elmas, Yilmaz Korkmaz, Alper Gungor, Salman UH Dar, Tolga Çukur
arXiv 2023. [Paper]
2 Aug 2023

Ultrasound Image Reconstruction with Denoising Diffusion Restoration Models
Yuxin Zhang, Clément Huneau, Jérôme Idier, Diana Mateus
MICCAI Workshop 2023. [Paper] [Github]
29 Jul 2023

LLDiffusion: Learning Degradation Representations in Diffusion Models for Low-Light Image Enhancement
Tao Wang, Kaihao Zhang, Ziqian Shao, Wenhan Luo, Bjorn Stenger, Tae-Kyun Kim, Wei Liu, Hongdong Li
arXiv 2023. [Paper]
27 Jul 2023

Artifact Restoration in Histology Images with Diffusion Probabilistic Models
Zhenqi He, Junjun He, Jin Ye, Yiqing Shen
arXiv 2023. [Paper] [Github]
26 Jul 2023

ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting
Zongsheng Yue, Jianyi Wang, Chen Change Loy
arXiv 2023. [Paper] [Github]
23 Jul 2023

Iterative Reconstruction Based on Latent Diffusion Model for Sparse Data Reconstruction
Linchao He, Hongyu Yan, Mengting Luo, Kunming Luo, Wang Wang, Wenchao Du, Hu Chen, Hongyu Yang, Yi Zhang
arXiv 2023. [Paper]
22 Jul 2023

PartDiff: Image Super-resolution with Partial Diffusion Models
Kai Zhao, Alex Ling Yu Hung, Kaifeng Pang, Haoxin Zheng, Kyunghyun Sung
arXiv 2023. [Paper]
21 Jul 2023

Reference-based Painterly Inpainting via Diffusion: Crossing the Wild Reference Domain Gap
Dejia Xu, Xingqian Xu, Wenyan Cong, Humphrey Shi, Zhangyang Wang
arXiv 2023. [Paper] [Project]
20 Jul 2023

AnyDoor: Zero-shot Object-level Image Customization
Xi Chen, Lianghua Huang, Yu Liu, Yujun Shen, Deli Zhao, Hengshuang Zhao
arXiv 2023. [Paper] [Project]
18 Jul 2023

Towards Authentic Face Restoration with Iterative Diffusion Models and Beyond
Yang Zhao, Tingbo Hou, Yu-Chuan Su, Xuhui Jia. Yandong Li, Matthias Grundmann
ICCV 2023. [Paper]
18 Jul 2023

Flow Matching in Latent Space
Quan Dao, Hao Phung, Binh Nguyen, Anh Tran
arXiv 2023. [Paper] [Project]
17 Jul 2023

Identity-Preserving Aging of Face Images via Latent Diffusion Models
Sudipta Banerjee, Govind Mittal, Ameya Joshi, Chinmay Hegde, Nasir Memon
IJCB 2023. [Paper]
17 Jul 2023

Solving Inverse Problems with Latent Diffusion Models via Hard Data Consistency
Bowen Song, Soo Min Kwon, Zecheng Zhang, Xinyu Hu, Qing Qu, Liyue Shen
arXiv 2023. [Paper]
16 Jul 2023

ExposureDiffusion: Learning to Expose for Low-light Image Enhancement
Yufei Wang, Yi Yu, Wenhan Yang, Lanqing Guo, Lap-Pui Chau, Alex C. Kot, Bihan Wen
arXiv 2023. [Paper]
15 Jul 2023

DDGM: Solving inverse problems by Diffusive Denoising of Gradient-based Minimization
Kyle Luther, H. Sebastian Seung
arXiv 2023. [Paper]
11 Jul 2023

Stimulating the Diffusion Model for Image Denoising via Adaptive Embedding and Ensembling
Tong Li, Hansen Feng, Lizhi Wang, Zhiwei Xiong, Hua Huang
arXiv 2023. [Paper]
8 Jul 2023

IPO-LDM: Depth-aided 360-degree Indoor RGB Panorama Outpainting via Latent Diffusion Model
Tianhao Wu, Chuanxia Zheng, Tat-Jen Cham
arXiv 2023. [Paper] [Github]
6 Jul 2023

Single Image LDR to HDR Conversion using Conditional Diffusion
Dwip Dalal, Gautam Vashishtha, Prajwal Singh, Shanmuganathan Raman
arXiv 2023. [Paper]
6 Jul 2023

ACDMSR: Accelerated Conditional Diffusion Models for Single Image Super-Resolution
Axi Niu, Pham Xuan Trung, Kang Zhang, Jinqiu Sun, Yu Zhu, In So Kweon, Yanning Zhang
arXiv 2023. [Paper]
3 Jul 2023

LEDITS: Real Image Editing with DDPM Inversion and Semantic Guidance
Linoy Tsaban, Apolinário Passos
arXiv 2023. [Paper]
2 Jul 2023

Solving Linear Inverse Problems Provably via Posterior Sampling with Latent Diffusion Models
Litu Rout, Negin Raoof, Giannis Daras, Constantine Caramanis, Alexandros G. Dimakis, Sanjay Shakkottai
arXiv 2023. [Paper] [Github]
2 Jul 2023

Content-Preserving Diffusion Model for Unsupervised AS-OCT image Despeckling
Li Sanqian, Higashita Risa, Fu Huazhu, Li Heng, Niu Jingxuan, Liu Jiang
arXiv 2023. [Paper]
30 Jun 2023

Self-Supervised MRI Reconstruction with Unrolled Diffusion Models
Yilmaz Korkmaz, Tolga Cukur, Vishal Patel
arXiv 2023. [Paper]
29 Jun 2023

SVNR: Spatially-variant Noise Removal with Denoising Diffusion
Naama Pearl, Yaron Brodsky, Dana Berman, Assaf Zomet, Alex Rav Acha, Daniel Cohen-Or, Dani Lischinski
arXiv 2023. [Paper]
28 Jun 2023

Easing Color Shifts in Score-Based Diffusion Models
Katherine Deck, Tobias Bischoff
arXiv 2023. [Paper]
27 Jun 2023

Diffusion Model Based Low-Light Image Enhancement for Space Satellite
Yiman Zhu, Lu Wang, Jingyi Yuan, Yu Guo
arXiv 2023. [Paper]
25 Jun 2023

DiffInfinite: Large Mask-Image Synthesis via Parallel Random Patch Diffusion in Histopathology
Marco Aversa, Gabriel Nobis, Miriam Hägele, Kai Standvoss, Mihaela Chirica, Roderick Murray-Smith, Ahmed Alaa, Lukas Ruff, Daniela Ivanova, Wojciech Samek, Frederick Klauschen, Bruno Sanguinetti, Luis Oala
arXiv 2023. [Paper]
23 Jun 2023

Wind Noise Reduction with a Diffusion-based Stochastic Regeneration Model
Jean-Marie Lemercier, Joachim Thiemann, Raphael Koning, Timo Gerkmann
arXiv 2023. [Paper]
22 Jun 2023

DiffuseIR:Diffusion Models For Isotropic Reconstruction of 3D Microscopic Images
Mingjie Pan, Yulu Gan, Fangxu Zhou, Jiaming Liu, Aimin Wang, Shanghang Zhang, Dawei Li
arXiv 2023. [Paper]
21 Jun 2023

HSR-Diff:Hyperspectral Image Super-Resolution via Conditional Diffusion Models
Chanyue Wu, Dong Wang, Hanyu Mao, Ying Li
arXiv 2023. [Paper]
21 Jun 2023

Diffusion with Forward Models: Solving Stochastic Inverse Problems Without Direct Supervision
Ayush Tewari, Tianwei Yin, George Cazenavette, Semon Rezchikov, Joshua B. Tenenbaum, Frédo Durand, William T. Freeman, Vincent Sitzmann
arXiv 2023. [Paper]
20 Jun 2023

Deep Ultrasound Denoising Using Diffusion Probabilistic Models
Hojat Asgariandehkordi, Sobhan Goudarzi, Adrian Basarab, Hassan Rivaz
arXiv 2023. [Paper]
12 Jun 2023

Towards Visual Foundational Models of Physical Scenes
Chethan Parameshwara, Alessandro Achille, Matthew Trager, Xiaolong Li, Jiawei Mo, Matthew Trager, Ashwin Swaminathan, CJ Taylor, Dheera Venkatraman, Xiaohan Fei, Stefano Soatto
arXiv 2023. [Paper]
6 Jun 2023

INDigo: An INN-Guided Probabilistic Diffusion Algorithm for Inverse Problems
Di You, Andreas Floros, Pier Luigi Dragotti
arXiv 2023. [Paper]
5 Jun 2023

The Surprising Effectiveness of Diffusion Models for Optical Flow and Monocular Depth Estimation
Saurabh Saxena, Charles Herrmann, Junhwa Hur, Abhishek Kar, Mohammad Norouzi, Deqing Sun, David J. Fleet
arXiv 2023. [Paper]
2 Jun 2023

Dissecting Arbitrary-scale Super-resolution Capability from Pre-trained Diffusion Generative Models
Ruibin Li, Qihua Zhou, Song Guo, Jie Zhang, Jingcai Guo, Xinyang Jiang, Yifei Shen, Zhenhua Han
arXiv 2023. [Paper]
1 Jun 2023

Low-Light Image Enhancement with Wavelet-based Diffusion Models
Hai Jiang, Ao Luo, Songchen Han, Haoqiang Fan, Shuaicheng Liu
arXiv 2023. [Paper]
1 Jun 2023

A Unified Conditional Framework for Diffusion-based Image Restoration
Yi Zhang, Xiaoyu Shi, Dasong Li, Xiaogang Wang, Jian Wang, Hongsheng Li
arXiv 2023. [Paper]
31 May 2023

Direct Diffusion Bridge using Data Consistency for Inverse Problems
Hyungjin Chung, Jeongsol Kim, Jong Chul Ye
arXiv 2023. [Paper]
31 May 2023

Accelerating Diffusion Models for Inverse Problems through Shortcut Sampling
Gongye Liu, Haoze Sun, Jiayi Li, Fei Yin, Yujiu Yang
arXiv 2023. [Paper]
26 May 2023

Look Ma, No Hands! Agent-Environment Factorization of Egocentric Videos
Matthew Chang, Aditya Prakash, Saurabh Gupta
arXiv 2023. [Paper] [Project]
25 May 2023

A Diffusion Probabilistic Prior for Low-Dose CT Image Denoising
Xuan Liu, Yaoqin Xie, Songhui Diao, Shan Tan, Xiaokun Liang
arXiv 2023. [Paper]
25 May 2023

Solving Diffusion ODEs with Optimal Boundary Conditions for Better Image Super-Resolution
Yiyang Ma, Huan Yang, Wenhan Yang, Jianlong Fu, Jiaying Liu
arXiv 2023. [Paper]
24 May 2023

WaveDM: Wavelet-Based Diffusion Models for Image Restoration
Yi Huang, Jiancheng Huang, Jianzhuang Liu, Yu Dong, Jiaxi Lv, Shifeng Chen
arXiv 2023. [Paper]
23 May 2023

Dual-Diffusion: Dual Conditional Denoising Diffusion Probabilistic Models for Blind Super-Resolution Reconstruction in RSIs
Mengze Xu, Jie Ma, Yuanyuan Zhu
arXiv 2023. [Paper] [Github]
20 May 2023

UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild
Can Qin, Shu Zhang, Ning Yu, Yihao Feng, Xinyi Yang, Yingbo Zhou, Huan Wang, Juan Carlos Niebles, Caiming Xiong, Silvio Savarese, Stefano Ermon, Yun Fu, Ran Xu
arXiv 2023. [Paper]
18 May 2023

Pyramid Diffusion Models For Low-light Image Enhancement
Dewei Zhou, Zongxin Yang, Yi Yang
arXiv 2023. [Paper]
17 May 2023

A Conditional Denoising Diffusion Probabilistic Model for Radio Interferometric Image Reconstruction
Ruoqi Wang, Zhuoyang Chen, Qiong Luo, Feng Wang
arXiv 2023. [Paper] 16 May 2023

Denoising Diffusion Models for Plug-and-Play Image Restoration
Yuanzhi Zhu, Kai Zhang, Jingyun Liang, Jiezhang Cao, Bihan Wen, Radu Timofte, Luc Van Gool
arXiv 2023. [Paper] [Github]
15 May 2023

Exploiting Diffusion Prior for Real-World Image Super-Resolution
Jianyi Wang, Zongsheng Yue, Shangchen Zhou, Kelvin C.K. Chan, Chen Change Loy
arXiv 2023. [Paper] [Project] [Github]
11 May 2023

Atmospheric Turbulence Correction via Variational Deep Diffusion
Xijun Wang, Santiago López-Tapia, Aggelos K. Katsaggelos
arXiv 2023. [Paper]
8 May 2023

Controllable Light Diffusion for Portraits
David Futschik, Kelvin Ritland, James Vecore, Sean Fanello, Sergio Orts-Escolano, Brian Curless, Daniel Sýkora, Rohit Pandey
arXiv 2023. [Paper]
8 May 2023

DiffBFR: Bootstrapping Diffusion Model Towards Blind Face Restoration
Xinmin Qiu, Congying Han, ZiCheng Zhang, Bonan Li, Tiande Guo, Xuecheng Nie
arXiv 2023. [Paper]
8 May 2023

Real-World Denoising via Diffusion Model
Cheng Yang, Lijing Liang, Zhixun Su
arXiv 2023. [Paper]
8 May 2023

A Variational Perspective on Solving Inverse Problems with Diffusion Models
Morteza Mardani, Jiaming Song, Jan Kautz, Arash Vahdat
arXiv 2023. [Paper]
7 May 2023

Synthesizing PET images from High-field and Ultra-high-field MR images Using Joint Diffusion Attention Model
Taofeng Xie, Chentao Cao, Zhuoxu Cui, Yu Guo, Caiying Wu, Xuemei Wang, Qingneng Li, Zhanli Hu, Tao Sun, Ziru Sang, Yihang Zhou, Yanjie Zhu, Dong Liang, Qiyu Jin, Guoqing Chen, Haifeng Wang
arXiv 2023. [Paper]
6 May 2023

DocDiff: Document Enhancement via Residual Diffusion Models
Zongyuan Yang, Baolin Liu, Yongping Xiong, Lan Yi, Guibin Wu, Xiaojun Tang, Ziqi Liu, Junjie Zhou, Xing Zhang
arXiv 2023. [Paper] [Github]
6 May 2023

Solving Inverse Problems with Score-Based Generative Priors learned from Noisy Data
Asad Aali, Marius Arvinte, Sidharth Kumar, Jonathan I. Tamir
arXiv 2023. [Paper]
2 May 2023

Self-similarity-based super-resolution of photoacoustic angiography from hand-drawn doodles
Yuanzheng Ma, Wangting Zhou, Rui Ma, Sihua Yang, Yansong Tang, Xun Guan
arXiv 2023. [Paper]
2 May 2023

Score-Based Diffusion Models as Principled Priors for Inverse Imaging
Berthy T. Feng, Jamie Smith, Michael Rubinstein, Huiwen Chang, Katherine L. Bouman, William T. Freeman
arXiv 2023. [Paper]
23 Apr 2023

Improved Diffusion-based Image Colorization via Piggybacked Models
Hanyuan Liu, Jinbo Xing, Minshan Xie, Chengze Li, Tien-Tsin Wong
arXiv 2023. [Paper] [Project]
21 Apr 2023

DiFaReli: Diffusion Face Relighting
Puntawat Ponglertnapakorn, Nontawat Tritrong, Supasorn Suwajanakorn
arXiv 2023. [Paper] [Project]
19 Apr 2023

Inpaint Anything: Segment Anything Meets Image Inpainting
Tao Yu, Runseng Feng, Ruoyu Feng, Jinming Liu, Xin Jin, Wenjun Zeng, Zhibo Chen
arXiv 2023. [Paper] [Github]
13 Apr 2023

Refusion: Enabling Large-Size Realistic Image Restoration with Latent-Space Diffusion Models
Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao, Jens Sjölund, Thomas B. Schön
arXiv 2023. [Paper] [Github]
17 Apr 2023

SPIRiT-Diffusion: Self-Consistency Driven Diffusion Model for Accelerated MRI
Zhuo-Xu Cui, Chentao Cao, Jing Cheng, Sen Jia, Hairong Zheng, Dong Liang, Yanjie Zhu
arXiv 2023. [Paper]
11 Apr 2023

Zero-shot CT Field-of-view Completion with Unconditional Generative Diffusion Prior
Kaiwen Xu, Aravind R. Krishnan, Thomas Z. Li, Yuankai Huo, Kim L. Sandler, Fabien Maldonado, Bennett A. Landman
arXiv 2023. [Paper]
7 Apr 2023

SketchFFusion: Sketch-guided image editing with diffusion model
Weihang Mao, Bo Han, Zihao Wang
arXiv 2023. [Paper]
6 Apr 2023

Inst-Inpaint: Instructing to Remove Objects with Diffusion Models
Ahmet Burak Yildirim, Vedat Baday, Erkut Erdem, Aykut Erdem, Aysegul Dundar
arXiv 2023. [Paper] [Project]
6 Apr 2023

Towards Coherent Image Inpainting Using Denoising Diffusion Implicit Models
Guanhua Zhang, Jiabao Ji, Yang Zhang, Mo Yu, Tommi Jaakkola, Shiyu Chang
arXiv 2023. [Paper] [Github]
6 Apr 2023

Zero-shot Medical Image Translation via Frequency-Guided Diffusion Models
Yunxiang Li, Hua-Chieh Shao, Xiao Liang, Liyuan Chen, Ruiqi Li, Steve Jiang, Jing Wang, You Zhang
arXiv 2023. [Paper]
5 Apr 2023

Waving Goodbye to Low-Res: A Diffusion-Wavelet Approach for Image Super-Resolution
Brian Moser, Stanislav Frolov, Federico Raue, Sebastian Palacio, Andreas Dengel
arXiv 2023. [Paper]
4 Apr 2023

CoreDiff: Contextual Error-Modulated Generalized Diffusion Model for Low-Dose CT Denoising and Generalization
Qi Gao, Zilong Li, Junping Zhang, Yi Zhang, Hongming Shan
arXiv 2023. [Paper]
4 Apr 2023

Generative Diffusion Prior for Unified Image Restoration and Enhancement
Ben Fei, Zhaoyang Lyu, Liang Pan, Junzhe Zhang, Weidong Yang, Tianyue Luo, Bo Zhang, Bo Dai
CVPR 2023. [Paper]
3 Apr 2023

Implicit Diffusion Models for Continuous Super-Resolution
Sicheng Gao, Xuhui Liu, Bohan Zeng, Sheng Xu, Yanjing Li, Xiaoyan Luo, Jianzhuang Liu, Xiantong Zhen, Baochang Zhang
CVPR 2023. [Paper]
29 Mar 2023

DiracDiffusion: Denoising and Incremental Reconstruction with Assured Data-Consistency
Zalan Fabian, Berk Tinaz, Mahdi Soltanolkotabi
arXiv 2023. [Paper]
25 Mar 2023

MindDiffuser: Controlled Image Reconstruction from Human Brain Activity with Semantic and Structural Diffusion
Yizhuo Lu, Changde Du, Dianpeng Wang, Huiguang He
arXiv 2023. [Paper]
24 Mar 2023

DisC-Diff: Disentangled Conditional Diffusion Model for Multi-Contrast MRI Super-Resolution
Ye Mao, Lan Jiang, Xi Chen, Chao Li
arXiv 2023. [Paper]
23 Mar 2023

Sub-volume-based Denoising Diffusion Probabilistic Model for Cone-beam CT Reconstruction from Incomplete Data
Wenjun Xia, Chuang Niu, Wenxiang Cong, Ge Wang
arXiv 2023. [Paper]
22 Mar 2023

A Perceptual Quality Assessment Exploration for AIGC Images
Zicheng Zhang, Chunyi Li, Wei Sun, Xiaohong Liu, Xiongkuo Min, Guangtao Zhai
arXiv 2023. [Paper]
22 Mar 2023

Inversion by Direct Iteration: An Alternative to Denoising Diffusion for Image Restoration
Mauricio Delbracio, Peyman Milanfar
arXiv 2023. [Paper]
20 Mar 2023

Efficient Neural Generation of 4K Masks for Homogeneous Diffusion Inpainting
Karl Schrader, Pascal Peter, Niklas Kämper, Joachim Weickert
arXiv 2023. [Paper]
17 Mar 2023

Denoising Diffusion Post-Processing for Low-Light Image Enhancement
Savvas Panagiotou, Anna S. Bosman
arXiv 2023. [Paper]
16 Mar 2023

SUD2: Supervision by Denoising Diffusion Models for Image Reconstruction
Matthew A. Chan, Sean I. Young, Christopher A. Metzler
arXiv 2023. [Paper]
16 Mar 2023

DiffIR: Efficient Diffusion Model for Image Restoration
Bin Xia, Yulun Zhang, Shiyin Wang, Yitong Wang, Xinglong Wu, Yapeng Tian, Wenming Yang, Luc Van Gool
arXiv 2023. [Paper]
16 Mar 2023

ResDiff: Combining CNN and Diffusion Model for Image Super-Resolution
Shuyao Shang, Zhengyang Shan, Guangxing Liu, Jinglin Zhang
arXiv 2023. [Paper]
15 Mar 2023

Class-Guided Image-to-Image Diffusion: Cell Painting from Brightfield Images with Class Labels
Jan Oscar Cross-Zamirski, Praveen Anand, Guy Williams, Elizabeth Mouchet, Yinhai Wang, Carola-Bibiane Schönlieb
arXiv 2023. [Paper] [Github]
15 Mar 2023

Diffusion Models for Contrast Harmonization of Magnetic Resonance Images
Alicia Durrer, Julia Wolleb, Florentin Bieder, Tim Sinnecker, Matthias Weigel, Robin Sandkühler, Cristina Granziera, Özgür Yaldizli, Philippe C. Cattin
arXiv 2023. [Paper]
14 Mar 2023

Synthesizing Realistic Image Restoration Training Pairs: A Diffusion Approach
Tao Yang, Peiran Ren, Xuansong xie, Lei Zhang
arXiv 2023. [Paper]
13 Mar 2023

DR2: Diffusion-based Robust Degradation Remover for Blind Face Restoration
Zhixin Wang, Xiaoyun Zhang, Ziying Zhang, Huangjie Zheng, Mingyuan Zhou, Ya Zhang, Yanfeng Wang
CVPR 2023. [Paper]
13 Mar 2023

DDS2M: Self-Supervised Denoising Diffusion Spatio-Spectral Model for Hyperspectral Image Restoration
Yuchun Miao, Lefei Zhang, Liangpei Zhang, Dacheng Tao
arXiv 2023. [Paper]
12 Mar 2023

Fast Diffusion Sampler for Inverse Problems by Geometric Decomposition
Hyungjin Chung, Suhyeon Lee, Jong Chul Ye
arXiv 2023. [Paper]
10 Mar 2023

Generalized Diffusion MRI Denoising and Super-Resolution using Swin Transformers
Amir Sadikov, Jamie Wren-Jarvis, Xinlei Pan, Lanya T. Cai, Pratik Mukherjee
arXiv 2023. [Paper]
10 Mar 2023

DiffusionDepth: Diffusion Denoising Approach for Monocular Depth Estimation
Yiqun Duan, Zheng Zhu, Xianda Guo
arxiv 2023. [Paper] [Github]
9 Mar 2023

Learning Enhancement From Degradation: A Diffusion Model For Fundus Image Enhancement
Puijin Cheng, Li Lin, Yijin Huang, Huaqing He, Wenhan Luo, Xiaoying Tang
arXiv 2023. [Paper] [Github]
8 Mar 2023

Unlimited-Size Diffusion Restoration
Yinhuai Wang, Jiwen Yu, Runyi Yu, Jian Zhang
arXiv 2023. [Paper]
1 Mar 2023

Unsupervised Out-of-Distribution Detection with Diffusion Inpainting
Zhenzhen Liu, Jin Peng Zhou, Yufan Wang, Kilian Q. Weinberger
arXiv 2023. [Paper]
20 Feb 2023

Restoration based Generative Models
Jaemoo Choi, Yesom Park, Myungjoo Kang
arXiv 2023. [Paper]
20 Feb 2023

Explicit Diffusion of Gaussian Mixture Model Based Image Priors
Martin Zach, Thomas Pock, Erich Kobler, Antonin Chambolle
arXiv 2023. [Paper]
16 Feb 2023

Denoising Diffusion Probabilistic Models for Robust Image Super-Resolution in the Wild
Hshmat Sahak, Daniel Watson, Chitwan Saharia, David Fleet
arXiv 2023. [Paper]
15 Feb 2023

CDPMSR: Conditional Diffusion Probabilistic Models for Single Image Super-Resolution
Axi Niu, Kang Zhang, Trung X. Pham, Jinqiu Sun, Yu Zhu, In So Kweon, Yanning Zhang
arXiv 2023. [Paper]
14 Feb 2023

How to Trust Your Diffusion Model: A Convex Optimization Approach to Conformal Risk Control
Jacopo Teneggi, Matt Tivnan, J Webster Stayman, Jeremias Sulam
arXiv 2023. [Paper]
7 Feb 2023

DDM2: Self-Supervised Diffusion MRI Denoising with Generative Diffusion Models
Tiange Xiang, Mahmut Yurt, Ali B Syed, Kawin Setsompop, Akshay Chaudhari
ICLR 2023. [Paper] [Github]
6 Feb 2023

Diffusion Model for Generative Image Denoising
Yutong Xie, Minne Yuan, Bin Dong, Quanzheng Li
arXiv 2023. [Paper]
5 Feb 2023

A Theoretical Justification for Image Inpainting using Denoising Diffusion Probabilistic Models
Litu Rout, Advait Parulekar, Constantine Caramanis, Sanjay Shakkottai
arXiv 2023. [Paper]
2 Feb 2023

GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Inverse Problems with Denoising Diffusion Restoration
Naoki Murata, Koichi Saito, Chieh-Hsin Lai, Yuhta Takida, Toshimitsu Uesaka, Yuki Mitsufuji, Stefano Ermon
arXiv 2023. [Paper]
30 Jan 2023

Accelerating Guided Diffusion Sampling with Splitting Numerical Methods
Suttisak Wizadwongsa, Supasorn Suwajanakorn
ICLR 2023. [Paper]
27 Jan 2023

Diffusion Denoising for Low-Dose-CT Model
Runyi Li
arXiv 2023. [Paper]
27 Jan 2023

Screen Space Indirect Lighting with Visibility Bitmask
Olivier Therrien, Yannick Levesque, Guillaume Gilet
Visual Computer 2023. [Paper]
26 Jan 2023

Dual Diffusion Architecture for Fisheye Image Rectification: Synthetic-to-Real Generalization
Shangrong Yang, Chunyu Lin, Kang Liao, Yao Zhao
arXiv 2023. [Paper]
26 Jan 2023

RainDiffusion:When Unsupervised Learning Meets Diffusion Models for Real-world Image Deraining
Mingqiang Wei, Yiyang Shen, Yongzhen Wang, Haoran Xie, Fu Lee Wang
arXiv 2023. [Paper]
23 Jan 2023

Dif-Fusion: Towards High Color Fidelity in Infrared and Visible Image Fusion with Diffusion Models
Mingqiang Wei, Yiyang Shen, Yongzhen Wang, Haoran Xie, Fu Lee Wang
arXiv 2023. [Paper]
23 Jan 2023

Removing Structured Noise with Diffusion Models
Tristan S.W. Stevens, Jean-Luc Robert, Faik C. Meral Jason Yu, Jun Seob Shin, Ruud J.G. van Sloun
arXiv 2023. [Paper]
20 Jan 2023

Image Restoration with Mean-Reverting Stochastic Differential Equations
Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao, Jens Sjölund, Thomas B. Schön
arXiv 2023. [Paper] [Github]
20 Jan 2023

DiffusionCT: Latent Diffusion Model for CT Image Standardization
Md Selim, Jie Zhang, Michael A. Brooks, Ge Wang, Jin Chen
arXiv 2023. [Paper]
20 Jan 2023

Targeted Image Reconstruction by Sampling Pre-trained Diffusion Model
Jiageng Zheng
arXiv 2023. [Paper]
18 Jan 2023

Annealed Score-Based Diffusion Model for MR Motion Artifact Reduction
Gyutaek Oh, Jeong Eun Lee, Jong Chul Ye
arXiv 2023. [Paper]
8 Jan 2023

Exploring Vision Transformers as Diffusion Learners
He Cao, Jianan Wang, Tianhe Ren, Xianbiao Qi, Yihao Chen, Yuan Yao, Lei Zhang
arXiv 2022. [Paper]
28 Dec 2022

Towards Blind Watermarking: Combining Invertible and Non-invertible Mechanisms
Rui Ma, Mengxi Guo, Yi Hou, Fan Yang, Yuan Li, Huizhu Jia, Xiaodong Xie
arXiv 2022. [Paper] [Github]
24 Dec 2022

Bi-Noising Diffusion: Towards Conditional Diffusion Models with Generative Restoration Priors
Kangfu Mei, Nithin Gopalakrishnan Nair, Vishal M. Patel
arXiv 2022. [Paper] [Project]
14 Dec 2022

SPIRiT-Diffusion: SPIRiT-driven Score-Based Generative Modeling for Vessel Wall imaging
Chentao Cao, Zhuo-Xu Cui, Jing Cheng, Sen Jia, Hairong Zheng, Dong Liang, Yanjie Zhu
arXiv 2022. [Paper]
14 Dec 2022

Universal Generative Modeling in Dual-domain for Dynamic MR Imaging
Chuanming Yu, Yu Guan, Ziwen Ke, Dong Liang, Qiegen Liu
arXiv 2022. [Paper]
15 Dec 2022

DifFace: Blind Face Restoration with Diffused Error Contraction
Zongsheng Yue, Chen Change Loy
arXiv 2022. [Paper] [Github]
13 Dec 2022

ShadowDiffusion: When Degradation Prior Meets Diffusion Model for Shadow Removal
Lanqing Guo, Chong Wang, Wenhan Yang, Siyu Huang, Yufei Wang, Hanspeter Pfister, Bihan Wen
arXiv 2022. [Paper]
9 Dec 2022

One Sample Diffusion Model in Projection Domain for Low-Dose CT Imaging
Bin Huang, Liu Zhang, Shiyu Lu, Boyu Lin, Weiwen Wu, Qiegen Liu
arXiv 2022. [Paper]
7 Dec 2022

SDM: Spatial Diffusion Model for Large Hole Image Inpainting
Wenbo Li, Xin Yu, Kun Zhou, Yibing Song, Zhe Lin, Jiaya Jia
arXiv 2022. [Paper]
6 Dec 2022

ADIR: Adaptive Diffusion for Image Reconstruction
Shady Abu-Hussein, Tom Tirer, Raja Giryes
arXiv 2022. [Paper] [Project]
6 Dec 2022

Image Deblurring with Domain Generalizable Diffusion Models
Mengwei Ren, Mauricio Delbracio, Hossein Talebi, Guido Gerig, Peyman Milanfar
arXiv 2022. [Paper]
4 Dec 2022

Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model
Yinhuai Wang, Jiwen Yu, Jian Zhang
arXiv 2022. [Paper] [Github]
1 Dec 2022

FREDSR: Fourier Residual Efficient Diffusive GAN for Single Image Super Resolution
Kyoungwan Woo, Achyuta Rajaram
arXiv 2022. [Paper]
30 Nov 2022

CHIMLE: Conditional Hierarchical IMLE for Multimodal Conditional Image Synthesis
Shichong Peng, Alireza Moazeni, Ke Li
arXiv 2022. [Paper]
25 Nov 2022

DOLCE: A Model-Based Probabilistic Diffusion Framework for Limited-Angle CT Reconstruction
Jiaming Liu, Rushil Anirudh, Jayaraman J. Thiagarajan, Stewart He, K. Aditya Mohan, Ulugbek S. Kamilov, Hyojin Kim
arXiv 2022. [Paper]
22 Nov 2022

Diffusion Model Based Posterior Sampling for Noisy Linear Inverse Problems
Xiangming Meng, Yoshiyuki Kabashima
arXiv 2022. [Paper] [Github]
20 Nov 2022

Parallel Diffusion Models of Operator and Image for Blind Inverse Problems
Hyungjin Chung, Jeongsol Kim, Sehui Kim, Jong Chul Ye
arXiv 2022. [Paper]
19 Nov 2022

Solving 3D Inverse Problems using Pre-trained 2D Diffusion Models
Hyungjin Chung, Dohoon Ryu, Michael T. McCann, Marc L. Klasky, Jong Chul Ye
arXiv 2022. [Paper]
19 Nov 2022

Patch-Based Denoising Diffusion Probabilistic Model for Sparse-View CT Reconstruction
Wenjun Xia, Wenxiang Cong, Ge Wang
arXiv 2022. [Paper]
18 Nov 2022

A Structure-Guided Diffusion Model for Large-Hole Diverse Image Completion
Daichi Horita, Jiaolong Yang, Dong Chen, Yuki Koyama, Kiyoharu Aizawa
BMVC 2023. [Paper]
18 Nov 2022

Conffusion: Confidence Intervals for Diffusion Models
Eliahu Horwitz, Yedid Hoshen
arXiv 2022. [Paper]
17 Nov 2022

Superresolution Reconstruction of Single Image for Latent features
Xin Wang, Jing-Ke Yan, Jing-Ye Cai, Jian-Hua Deng, Qin Qin, Qin Wang, Heng Xiao, Yao Cheng, Peng-Fei Ye
arXiv 2022. [Paper]
16 Nov 2022

Learning to Kindle the Starlight
Yu Yuan, Jiaqi Wu, Lindong Wang, Zhongliang Jing, Henry Leung, Shuyuan Zhu, Han Pan
arXiv 2022. [Paper]
16 Nov 2022

ShadowDiffusion: Diffusion-based Shadow Removal using Classifier-driven Attention and Structure Preservation
Yeying Jin, Wenhan Yang, Wei Ye, Yuan Yuan, Robby T. Tan
arXiv 2022. [Paper]
15 Nov 2022

DriftRec: Adapting diffusion models to blind image restoration tasks
Simon Welker, Henry N. Chapman, Timo Gerkmann
arXiv 2022. [Paper]
12 Nov 2022

From Denoising Diffusions to Denoising Markov Models
Joe Benton, Yuyang Shi, Valentin De Bortoli, George Deligiannidis, Arnaud Doucet
arXiv 2022. [Paper] [Github]
7 Nov 2022

Quantized Compressed Sensing with Score-Based Generative Models
Xiangming Meng, Yoshiyuki Kabashima
arXiv 2022. [Paper] [Github]
2 Nov 2022

Intelligent Painter: Picture Composition With Resampling Diffusion Model
Wing-Fung Ku, Wan-Chi Siu, Xi Cheng, H. Anthony Chan
arXiv 2022. [Paper]
31 Oct 2022

Multitask Brain Tumor Inpainting with Diffusion Models: A Methodological Report
Pouria Rouzrokh, Bardia Khosravi, Shahriar Faghani, Mana Moassefi, Sanaz Vahdati, Bradley J. Erickson
arXiv 2022. [Paper] [Github]
21 Oct 2022

DiffGAR: Model-Agnostic Restoration from Generative Artifacts Using Image-to-Image Diffusion Models
Yueqin Yin, Lianghua Huang, Yu Liu, Kaiqi Huang
arXiv 2022. [Paper]
16 Oct 2022

Low-Dose CT Using Denoising Diffusion Probabilistic Model for 20× Speedup
Wenjun Xia, Qing Lyu, Ge Wang
arXiv 2022. [Paper]
29 Sep 2022

Diffusion Posterior Sampling for General Noisy Inverse Problems
Hyungjin Chung, Jeongsol Kim, Michael T. Mccann, Marc L. Klasky, Jong Chul Ye
arXiv 2022. [Paper] [Github]
29 Sep 2022

Face Super-Resolution Using Stochastic Differential Equations
Marcelo dos Santos, Rayson Laroca, Rafael O. Ribeiro, João Neves, Hugo Proença, David Menotti
arXiv 2022. [Paper] [Github]
24 Sep 2022

JPEG Artifact Correction using Denoising Diffusion Restoration Models
Bahjat Kawar, Jiaming Song, Stefano Ermon, Michael Elad
arXiv 2022. [Paper]
23 Sep 2022

T2V-DDPM: Thermal to Visible Face Translation using Denoising Diffusion Probabilistic Models
Nithin Gopalakrishnan Nair, Vishal M. Patel
arXiv 2022. [Paper]
19 Sep 2022

Delving Globally into Texture and Structure for Image Inpainting
Haipeng Liu, Yang Wang, Meng Wang, Yong Rui
ACM 2022. [Paper] [Github]
17 Sep 2022

PET image denoising based on denoising diffusion probabilistic models
Kuang Gong, Keith A. Johnson, Georges El Fakhri, Quanzheng Li, Tinsu Pan
arXiv 2022. [Paper]
13 Sep 2022

Self-Score: Self-Supervised Learning on Score-Based Models for MRI Reconstruction
Zhuo-Xu Cui, Chentao Cao, Shaonan Liu, Qingyong Zhu, Jing Cheng, Haifeng Wang, Yanjie Zhu, Dong Liang
IEEE TMI 2022. [Paper]
2 Sep 2022

AT-DDPM: Restoring Faces degraded by Atmospheric Turbulence using Denoising Diffusion Probabilistic Models
Nithin Gopalakrishnan Nair, Kangfu Mei, Vishal M Patel
arXiv 2022. [Paper]
24 Aug 2022

Cold Diffusion: Inverting Arbitrary Image Transforms Without Noise
Arpit Bansal, Eitan Borgnia, Hong-Min Chu, Jie S. Li, Hamid Kazemi, Furong Huang, Micah Goldblum, Jonas Geiping, Tom Goldstein
arXiv 2022. [Paper] [Github]
19 Aug 2022

High-Frequency Space Diffusion Models for Accelerated MRI
Chentao Cao, Zhuo-Xu Cui, Shaonan Liu, Dong Liang, Yanjie Zhu
arXiv 2022. [Paper]
10 Aug 2022

Restoring Vision in Adverse Weather Conditions with Patch-Based Denoising Diffusion Models
Ozan Özdenizci, Robert Legenstein
arXiv 2022. [Paper] [Github]
29 Jul 2022

Non-Uniform Diffusion Models
Georgios Batzolis, Jan Stanczuk, Carola-Bibiane Schönlieb, Christian Etmann
arXiv 2022. [Paper]
20 Jul 2022

Unsupervised Medical Image Translation with Adversarial Diffusion Models
Muzaffer Özbey, Salman UH Dar, Hasan A Bedel, Onat Dalmaz, Şaban Özturk, Alper Güngör, Tolga Çukur
arXiv 2022. [Paper]
17 Jul 2022

Adaptive Diffusion Priors for Accelerated MRI Reconstruction
Salman UH Dar, Şaban Öztürk, Yilmaz Korkmaz, Gokberk Elmas, Muzaffer Özbey, Alper Güngör, Tolga Çukur
arXiv 2022. [Paper]
12 Jul 2022

A Novel Unified Conditional Score-based Generative Framework for Multi-modal Medical Image Completion
Xiangxi Meng, Yuning Gu, Yongsheng Pan, Nizhuan Wang, Peng Xue, Mengkang Lu, Xuming He, Yiqiang Zhan, Dinggang Shen
arXiv 2022. [Paper]
7 Jul 2022

SAR Despeckling using a Denoising Diffusion Probabilistic Model
Malsha V. Perera, Nithin Gopalakrishnan Nair, Wele Gedara Chaminda Bandara, Vishal M. Patel
arXiv 2022. [Paper]
9 Jun 2022

Improving Diffusion Models for Inverse Problems using Manifold Constraints
Hyungjin Chung, Byeongsu Sim, Dohoon Ryu, Jong Chul Ye
arXiv 2022. [Paper]
2 Jun 2022

The Swiss Army Knife for Image-to-Image Translation: Multi-Task Diffusion Models
Julia Wolleb, Robin Sandkühler, Florentin Bieder, Philippe C. Cattin
arXiv 2022. [Paper]
6 Apr 2022

MR Image Denoising and Super-Resolution Using Regularized Reverse Diffusion
Hyungjin Chung, Eun Sun Lee, Jong Chul Ye
arXiv 2022. [Paper]
23 Mar 2022

Towards performant and reliable undersampled MR reconstruction via diffusion model sampling
Cheng Peng, Pengfei Guo, S. Kevin Zhou, Vishal Patel, Rama Chellappa
arXiv 2022. [Paper] [Github]
8 Mar 2022

Measurement-conditioned Denoising Diffusion Probabilistic Model for Under-sampled Medical Image Reconstruction
Yutong Xie, Quanzheng Li
MICCAI 2022. [Paper] [Github]
5 Mar 2022

MRI Reconstruction via Data Driven Markov Chain with Joint Uncertainty Estimation
Guanxiong Luo, Martin Heide, Martin Uecker
arXiv 2022. [Paper] [Github]
3 Feb 2022

Unsupervised Denoising of Retinal OCT with Diffusion Probabilistic Model
Dewei Hu, Yuankai K. Tao, Ipek Oguz
arXiv 2022. [Paper] [Github]
27 Jan 2022

Denoising Diffusion Restoration Models
Bahjat Kawar, Michael Elad, Stefano Ermon, Jiaming Song
ICLR 2022 Workshop (Oral). [Paper]
27 Jan 2022

RePaint: Inpainting using Denoising Diffusion Probabilistic Models
Andreas Lugmayr, Martin Danelljan, Andres Romero, Fisher Yu, Radu Timofte, Luc Van Gool
CVPR 2022. [Paper] [Github]
24 Jan 2022

DiffuseVAE: Efficient, Controllable and High-Fidelity Generation from Low-Dimensional Latents
Kushagra Pandey, Avideep Mukherjee, Piyush Rai, Abhishek Kumar
arXiv 2022. [Paper] [Github]
2 Jan 2022

High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, Björn Ommer
CVPR 2022. [Paper] [Github]
20 Dec 2021

Come-Closer-Diffuse-Faster: Accelerating Conditional Diffusion Models for Inverse Problems through Stochastic Contraction
Hyungjin Chung, Byeongsu Sim, Jong Chul Ye
CVPR 2022. [Paper]
9 Dec 2021

Deblurring via Stochastic Refinement
Jay Whang, Mauricio Delbracio, Hossein Talebi, Chitwan Saharia, Alexandros G. Dimakis, Peyman Milanfar
CVPR 2022. [Paper]
5 Dec 2021

Conditional Image Generation with Score-Based Diffusion Models
Georgios Batzolis, Jan Stanczuk, Carola-Bibiane Schönlieb, Christian Etmann
arXiv 2021. [Paper]
26 Nov 2021

Solving Inverse Problems in Medical Imaging with Score-Based Generative Models
Yang Song, Liyue Shen, Lei Xing, Stefano Ermon
NeurIPS Workshop 2021. [Paper] [Github]
15 Nov 2021

S3RP: Self-Supervised Super-Resolution and Prediction for Advection-Diffusion Process
Chulin Wang, Kyongmin Yeo, Xiao Jin, Andres Codas, Levente J. Klein, Bruce Elmegreen
NeurIPS 2022. [Paper]
8 Nov 2021

Score-based diffusion models for accelerated MRI
Hyungjin Chung, Jong chul Ye
MIA 2021. [Paper] [Github]
8 Oct 2021

Autoregressive Diffusion Models
Emiel Hoogeboom, Alexey A. Gritsenko, Jasmijn Bastings, Ben Poole, Rianne van den Berg, Tim Salimans
ICLR 2022. [Paper]
5 Oct 2021

ILVR: Conditioning Method for Denoising Diffusion Probabilistic Models
Jooyoung Choi, Sungwon Kim, Yonghyun Jeong, Youngjune Gwon, Sungroh Yoon
ICCV 2021 (Oral). [Paper] [Github]
6 Aug 2021

Cascaded Diffusion Models for High Fidelity Image Generation
Jonathan Ho, Chitwan Saharia, William Chan, David J. Fleet, Mohammad Norouzi, Tim Salimans
arXiv 2021. [Paper] [Project]
30 May 2021

SRDiff: Single Image Super-Resolution with Diffusion Probabilistic Models
Haoying Li, Yifan Yang, Meng Chang, Huajun Feng, Zhihai Xu, Qi Li, Yueting Chen
ACM 2022. [Paper]
30 Apr 2021

Image Super-Resolution via Iterative Refinement
Chitwan Saharia, Jonathan Ho, William Chan, Tim Salimans, David J. Fleet, Mohammad Norouzi
arXiv 2021. [Paper] [Project] [Github]
15 Apr 2021

Medical Imaging

Diffusion-based Data Augmentation for Nuclei Image Segmentation
Xinyi Yu, Guanbin Li, Wei Lou, Siqi Liu, Xiang Wan, Yan Chen, Haofeng Li
arXiv 2023. [Paper]
22 Oct 2023

EMIT-Diff: Enhancing Medical Image Segmentation via Text-Guided Diffusion Model
Zheyuan Zhang, Lanhong Yao, Bin Wang, Debesh Jha, Elif Keles, Alpay Medetalibeyoglu, Ulas Bagci
arXiv 2023. [Paper]
19 Oct 2023

Towards Generic Semi-Supervised Framework for Volumetric Medical Image Segmentation
Haonan Wang, Xiaomeng Li
NeurIPS 2023. [Paper] [Github]
17 Oct 2023

Self-supervised Fetal MRI 3D Reconstruction Based on Radiation Diffusion Generation Model
Junpeng Tan, Xin Zhang, Yao Lv, Xiangmin Xu, Gang Li
arXiv 2023. [Paper]
16 Oct 2023

JSMoCo: Joint Coil Sensitivity and Motion Correction in Parallel MRI with a Self-Calibrating Score-Based Diffusion Model
Lixuan Chen, Xuanyu Tian, Jiangjie Wu, Ruimin Feng, Guoyan Lao, Yuyao Zhang, Hongjiang Wei
arXiv 2023. [Paper]
14 Oct 2023

Histogram- and Diffusion-Based Medical Out-of-Distribution Detection
Evi M. C. Huijben, Sina Amirrajab, Josien P. W. Pluim
arXiv 2023. [Paper]
12 Oct 2023

Echocardiography video synthesis from end diastolic semantic map via diffusion model
Phi Nguyen Van, Duc Tran Minh, Hieu Pham Huy, Long Tran Quoc
arXiv 2023. [Paper]
11 Oct 2023

Diffusion Prior Regularized Iterative Reconstruction for Low-dose CT
Wenjun Xia, Yongyi Shi, Chuang Niu, Wenxiang Cong, Ge Wang
arXiv 2023. [Paper]
10 Oct 2023

Image Compression and Decompression Framework Based on Latent Diffusion Model for Breast Mammography
InChan Hwang, MinJae Woo
arXiv 2023. [Paper]
8 Oct 2023

Latent Diffusion Model for Medical Image Standardization and Enhancement
Md Selim, Jie Zhang, Faraneh Fathi, Michael A. Brooks, Ge Wang, Guoqiang Yu, Jin Chen
arXiv 2023. [Paper]
8 Oct 2023

Characterizing the Features of Mitotic Figures Using a Conditional Diffusion Probabilistic Model
Cagla Deniz Bahadir, Benjamin Liechty, David J. Pisapia, Mert R. Sabuncu
MICCAI Workshop 2023. [Paper]
5 Oct 2023

MedSyn: Text-guided Anatomy-aware Synthesis of High-Fidelity 3D CT Images
Yanwu Xu, Li Sun, Wei Peng, Shyam Visweswaran, Kayhan Batmanghelich
arXiv 2023. [Paper]
5 Oct 2023

Blind CT Image Quality Assessment Using DDPM-derived Content and Transformer-based Evaluator
Yongyi Shi, Wenjun Xia, Ge Wang, Xuanqin Mou
arXiv 2023. [Paper]
4 Oct 2023

SMRD: SURE-based Robust MRI Reconstruction with Diffusion Models
Batu Ozturkler, Chao Liu, Benjamin Eckart, Morteza Mardani, Jiaming Song, Jan Kautz
MICCAI 2023. [Paper] [Github]
3 Oct 2023

DiffGAN-F2S: Symmetric and Efficient Denoising Diffusion GANs for Structural Connectivity Prediction from Brain fMRI
Qiankun Zuo, Ruiheng Li, Yi Di, Hao Tian, Changhong Jing, Xuhang Chen, Shuqiang Wang
arXiv 2023. [Paper]
28 Sep 2023

Enhancing Knee Osteoarthritis severity level classification using diffusion augmented images
Paleti Nikhil Chowdary, Gorantla V N S L Vishnu Vardhan, Menta Sai Akshay, Menta Sai Aashish, Vadlapudi Sai Aravind, Garapati Venkata Krishna Rayalu, Aswathy P
arXiv 2023. [Paper]
17 Sep 2023

Introducing Shape Prior Module in Diffusion Model for Medical Image Segmentation
Zhiqing Zhang, Guojia Fan, Tianyong Liu, Nan Li, Yuyang Liu, Ziyu Liu, Canwei Dong, Shoujun Zhou
arXiv 2023. [Paper]
12 Sep 2023

Treatment-aware Diffusion Probabilistic Model for Longitudinal MRI Generation and Diffuse Glioma Growth Prediction
Qinghui Liu, Elies Fuster-Garcia, Ivar Thokle Hovden, Donatas Sederevicius, Karoline Skogen, Bradley J MacIntosh, Edvard Grødem, Till Schellhorn, Petter Brandal, Atle Bjørnerud, Kyrre Eeg Emblem
arXiv 2023. [Paper]
11 Sep 2023

Efficient Bayesian Computational Imaging with a Surrogate Score-Based Prior
Berthy T. Feng, Katherine L. Bouman
arXiv 2023. [Paper]
5 Sep 2023

Segmentation of 3D pore space from CT images using curvilinear skeleton: application to numerical simulation of microbial decomposition
Olivier Monga, Zakaria Belghali, Mouad Klai, Lucie Druoton, Dominique Michelucci, Valerie Pot
arXiv 2023. [Paper]
4 Sep 2023

GenSelfDiff-HIS: Generative Self-Supervision Using Diffusion for Histopathological Image Segmentation
Vishnuvardhan Purma, Suhas Srinath, Seshan Srirangarajan, Aanchal Kakkar, Prathosh A. P
arXiv 2023. [Paper] [Github]
4 Sep 2023

Correlated and Multi-frequency Diffusion Modeling for Highly Under-sampled MRI Reconstruction
Yu Guan, Chuanming Yu, Shiyu Lu, Zhuoxu Cui, Dong Liang, Qiegen Liu
arXiv 2023. [Paper] [Github]
2 Sep 2023

Diffusion Modeling with Domain-conditioned Prior Guidance for Accelerated MRI and qMRI Reconstruction
Wanyu Bian, Albert Jang, Fang Liu
arXiv 2023. [Paper]
2 Sep 2023

PathLDM: Text conditioned Latent Diffusion Model for Histopathology
Srikar Yellapragada, Alexandros Graikos, Prateek Prasanna, Tahsin Kurc, Joel Saltz, Dimitris Samaras
arXiv 2023. [Paper]
1 Sep 2023

Unsupervised CT Metal Artifact Reduction by Plugging Diffusion Priors in Dual Domains
Xuan Liu, Yaoqin Xie, Songhui Diao, Shan Tan, Xiaokun Liang
arXiv 2023. [Paper]
31 Aug 2023

A Recycling Training Strategy for Medical Image Segmentation with Diffusion Denoising Models
Yunguan Fu, Yiwen Li, Shaheer U Saeed, Matthew J Clarkson, Yipeng Hu
arXiv 2023. [Paper] [Github]
30 Aug 2023

Physics-Informed DeepMRI: Bridging the Gap from Heat Diffusion to k-Space Interpolation
Zhuo-Xu Cui, Congcong Liu, Xiaohong Fan, Chentao Cao, Jing Cheng, Qingyong Zhu, Yuanyuan Liu, Sen Jia, Yihang Zhou, Haifeng Wang, Yanjie Zhu, Jianping Zhang, Qiegen Liu, Dong Liang
arXiv 2023. [Paper]
30 Aug 2023

Stage-by-stage Wavelet Optimization Refinement Diffusion Model for Sparse-View CT Reconstruction
Kai Xu, Shiyu Lu, Bin Huang, Weiwen Wu, Qiegen Liu
arXiv 2023. [Paper]
30 Aug 2023

Modality Cycles with Masked Conditional Diffusion for Unsupervised Anomaly Segmentation in MRI
Ziyun Liang, Harry Anthony, Felix Wagner, Konstantinos Kamnitsas
arXiv 2023. [Paper]
30 Aug 2023

Data-iterative Optimization Score Model for Stable Ultra-Sparse-View CT Reconstruction
Weiwen Wu, Yanyang Wang
arXiv 2023. [Paper]
28 Aug 2023

Full-dose PET Synthesis from Low-dose PET Using High-efficiency Diffusion Denoising Probabilistic Model
Shaoyan Pan, Elham Abouei, Junbo Peng, Joshua Qian, Jacob F Wynne, Tonghe Wang, Chih-Wei Chang, Justin Roper, Jonathon A Nye, Hui Mao, Xiaofeng Yang
arXiv 2023. [Paper]
24 Aug 2023

Augmenting medical image classifiers with synthetic data from latent diffusion models
Luke W. Sagers, James A. Diao, Luke Melas-Kyriazi, Matthew Groh, Pranav Rajpurkar, Adewole S. Adamson, Veronica Rotemberg, Roxana Daneshjou, Arjun K. Manrai
arXiv 2023. [Paper]
23 Aug 2023

InverseSR: 3D Brain MRI Super-Resolution Using a Latent Diffusion Model
Jueqi Wang, Jacob Levman, Walter Hugo Lopez Pinaya, Petru-Daniel Tudosiu, M. Jorge Cardoso, Razvan Marinescu
MICCAI 2023. [Paper] [Github]
23 Aug 2023

Texture Generation on 3D Meshes with Point-UV Diffusion
Xin Yu, Peng Dai, Wenbo Li, Lan Ma, Zhengzhe Liu, Xiaojuan Qi
ICCV 2023. [Paper]
21 Aug 2023

Contrastive Diffusion Model with Auxiliary Guidance for Coarse-to-Fine PET Reconstruction
Zeyu Han, Yuhan Wang, Luping Zhou, Peng Wang, Binyu Yan, Jiliu Zhou, Yan Wang, Dinggang Shen
MICCAI 2023. [Paper] [Github]
20 Aug 2023

Denoising diffusion-based MR to CT image translation enables whole spine vertebral segmentation in 2D and 3D without manual annotations
Robert Graf, Joachim Schmitt, Sarah Schlaeger, Hendrik Kristian Möller, Vasiliki Sideri-Lampretsa, Anjany Sekuboyina, Sandro Manuel Krieg, Benedikt Wiestler, Bjoern Menze, Daniel Rueckert, Jan Stefan Kirschke
arXiv 2023. [Paper]
18 Aug 2023

DMCVR: Morphology-Guided Diffusion Model for 3D Cardiac Volume Reconstruction
Xiaoxiao He, Chaowei Tan, Ligong Han, Bo Liu, Leon Axel, Kang Li, Dimitris N. Metaxas
MICCAI 2023. [Paper] [Github]
18 Aug 2023

Denoising Diffusion Probabilistic Model for Retinal Image Generation and Segmentation
Alnur Alimanov, Md Baharul Islam
ICCP 2023. [Paper]
16 Aug 2023

Shape-guided Conditional Latent Diffusion Models for Synthesising Brain Vasculature
Yash Deo, Haoran Dou, Nishant Ravikumar, Alejandro F. Frangi, Toni Lassila
arXiv 2023. [Paper]
13 Aug 2023

Masked Diffusion as Self-supervised Representation Learner
Zixuan Pan, Jianxu Chen, Yiyu Shi
arXiv 2023. [Paper]
10 Aug 2023

Synthetic Augmentation with Large-scale Unconditional Pre-training
Jiarong Ye, Haomiao Ni, Peng Jin, Sharon X. Huang, Yuan Xue
MICCAI 2023. [Paper] [Github]
8 Aug 2023

Energy-Guided Diffusion Model for CBCT-to-CT Synthesis
Linjie Fu, Xia Li, Xiuding Cai, Dong Miao, Yu Yao, Yali Shen
arXiv 2023. [Paper]
7 Aug 2023

DermoSegDiff: A Boundary-aware Segmentation Diffusion Model for Skin Lesion Delineation
Afshin Bozorgpour, Yousef Sadegheih, Amirhossein Kazerouni, Reza Azad, Dorit Merhof
MICCAI Workshop 2023. [Paper] [Github]
5 Aug 2023

Synthesising Rare Cataract Surgery Samples with Guided Diffusion Models
Yannik Frisch, Moritz Fuchs, Antoine Sanner, Felix Anton Ucar, Marius Frenzel, Joana Wasielica-Poslednik, Adrian Gericke, Felix Mathias Wagner, Thomas Dratsch, Anirban Mukhopadhyay
arXiv 2023. [Paper]
3 Aug 2023

Diffusion Models for Counterfactual Generation and Anomaly Detection in Brain Images
Alessandro Fontanella, Grant Mair, Joanna Wardlaw, Emanuele Trucco, Amos Storkey
arXiv 2023. [Paper]
3 Aug 2023

Reference-Free Isotropic 3D EM Reconstruction using Diffusion Models
Kyungryun Lee, Won-Ki Jeong
arXiv 2023. [Paper]
3 Aug 2023

A vision transformer-based framework for knowledge transfer from multi-modal to mono-modal lymphoma subtyping models
Bilel Guetarni, Feryal Windal, Halim Benhabiles, Marianne Petit, Romain Dubois, Emmanuelle Leteurtre, Dominique Collard
arXiv 2023. [Paper]
2 Aug 2023

Learning Fourier-Constrained Diffusion Bridges for MRI Reconstruction
Muhammad U. Mirza, Onat Dalmaz, Hasan A. Bedel, Gokberk Elmas, Yilmaz Korkmaz, Alper Gungor, Salman UH Dar, Tolga Çukur
arXiv 2023. [Paper]
2 Aug 2023

C-DARL: Contrastive diffusion adversarial representation learning for label-free blood vessel segmentation
Boah Kim, Yujin Oh, Bradford J. Wood, Ronald M. Summers, Jong Chul Ye
arXiv 2023. [Paper]
31 Jul 2023

Ultrasound Image Reconstruction with Denoising Diffusion Restoration Models
Yuxin Zhang, Clément Huneau, Jérôme Idier, Diana Mateus
MICCAI Workshop 2023. [Paper] [Github]
29 Jul 2023

Pre-Training with Diffusion models for Dental Radiography segmentation
Jérémy Rousseau, Christian Alaka, Emma Covili, Hippolyte Mayard, Laura Misrachi, Willy Au
arXiv 2023. [Paper]
26 Jul 2023

Iterative Reconstruction Based on Latent Diffusion Model for Sparse Data Reconstruction
Linchao He, Hongyu Yan, Mengting Luo, Kunming Luo, Wang Wang, Wenchao Du, Hu Chen, Hongyu Yang, Yi Zhang
arXiv 2023. [Paper]
22 Jul 2023

FSDiffReg: Feature-wise and Score-wise Diffusion-guided Unsupervised Deformable Image Registration for Cardiac Images
Yi Qin, Xiaomeng Li
MICCAI 2023. [Paper] [Github]
22 Jul 2023

FEDD – Fair, Efficient, and Diverse Diffusion-based Lesion Segmentation and Malignancy Classification
Héctor Carrión, Narges Norouzi
MICCAI 2023. [Paper] [Github]
21 Jul 2023

PartDiff: Image Super-resolution with Partial Diffusion Models
Kai Zhao, Alex Ling Yu Hung, Kaifeng Pang, Haoxin Zheng, Kyunghyun Sung
arXiv 2023. [Paper]
21 Jul 2023

Make-A-Volume: Leveraging Latent Diffusion Models for Cross-Modality 3D Brain MRI Synthesis
Lingting Zhu, Zeyue Xue, Zhenchao Jin, Xian Liu, Jingzhen He, Ziwei Liu, Lequan Yu
MICCAI 2023. [Paper]
19 Jul 2023

DiffDP: Radiotherapy Dose Prediction via a Diffusion Model
Zhenghao Feng, Lu Wen, Peng Wang, Binyu Yan, Xi Wu, Jiliu Zhou, Yan Wang
arXiv 2023. [Paper]
19 Jul 2023

DreaMR: Diffusion-driven Counterfactual Explanation for Functional MRI
Hasan Atakan Bedel, Tolga Çukur
arXiv 2023. [Paper]
18 Jul 2023

TractCloud: Registration-free tractography parcellation with a novel local-global streamline point cloud representation
Tengfei Xue, Yuqian Chen, Chaoyi Zhang, Alexandra J. Golby, Nikos Makris, Yogesh Rathi, Weidong Cai, Fan Zhang, Lauren J. O’Donnell
arXiv 2023. [Paper] [Project] [Github]
18 Jul 2023

Solving Inverse Problems with Latent Diffusion Models via Hard Data Consistency
Bowen Song, Soo Min Kwon, Zecheng Zhang, Xinyu Hu, Qing Qu, Liyue Shen
arXiv 2023. [Paper]
16 Jul 2023

Fast Adaptation with Bradley-Terry Preference Models in Text-To-Image Classification and Generation
Victor Gallego
EYSM 2023. [Paper]
15 Jul 2023

Improving Nonalcoholic Fatty Liver Disease Classification Performance With Latent Diffusion Models
Romain Hardy, Cornelia Ilin, Joe Klepich, Ryan Mitchell, Steve Hall, Jericho Villareal
arXiv 2023. [Paper]
13 Jul 2023

DDGM: Solving inverse problems by Diffusive Denoising of Gradient-based Minimization
Kyle Luther, H. Sebastian Seung
arXiv 2023. [Paper]
11 Jul 2023

LLCaps: Learning to Illuminate Low-Light Capsule Endoscopy with Curved Wavelet Attention and Reverse Diffusion
Long Bai, Tong Chen, Yanan Wu, An Wang, Mobarakol Islam, Hongliang Ren
arXiv 2023. [Paper] [Github]
5 Jul 2023

Synchronous Image-Label Diffusion Probability Model with Application to Stroke Lesion Segmentation on Non-contrast CT
Jianhai Zhang, Tonghua Wan, Ethan MacDonald, Bijoy Menon, Aravind Ganesh, Qiu Wu
arXiv 2023. [Paper]
4 Jul 2023

Investigating Data Memorization in 3D Latent Diffusion Models for Medical Image Synthesis
Salman Ul Hassan Dar, Arman Ghanaat, Jannik Kahmann, Isabelle Ayx, Theano Papavassiliu, Stefan O. Schoenberg, Sandy Engelhardt
arXiv 2023. [Paper]
3 Jul 2023

Content-Preserving Diffusion Model for Unsupervised AS-OCT image Despeckling
Li Sanqian, Higashita Risa, Fu Huazhu, Li Heng, Niu Jingxuan, Liu Jiang
arXiv 2023. [Paper]
30 Jun 2023

Self-Supervised MRI Reconstruction with Unrolled Diffusion Models
Yilmaz Korkmaz, Tolga Cukur, Vishal Patel
arXiv 2023. [Paper]
29 Jun 2023

DoseDiff: Distance-aware Diffusion Model for Dose Prediction in Radiotherapy
Yiwen Zhang, Chuanpu Li, Liming Zhong, Zeli Chen, Wei Yang, Xuetao Wang
arXiv 2023. [Paper]
28 Jun 2023

DiffMix: Diffusion Model-based Data Synthesis for Nuclei Segmentation and Classification in Imbalanced Pathology Image Datasets
Hyun-Jic Oh, Won-Ki Jeong
arXiv 2023. [Paper]
25 Jun 2023

DiffInfinite: Large Mask-Image Synthesis via Parallel Random Patch Diffusion in Histopathology
Marco Aversa, Gabriel Nobis, Miriam Hägele, Kai Standvoss, Mihaela Chirica, Roderick Murray-Smith, Ahmed Alaa, Lukas Ruff, Daniela Ivanova, Wojciech Samek, Frederick Klauschen, Bruno Sanguinetti, Luis Oala
arXiv 2023. [Paper]
23 Jun 2023

DiffuseIR:Diffusion Models For Isotropic Reconstruction of 3D Microscopic Images
Mingjie Pan, Yulu Gan, Fangxu Zhou, Jiaming Liu, Aimin Wang, Shanghang Zhang, Dawei Li
arXiv 2023. [Paper]
21 Jun 2023

TauPETGen: Text-Conditional Tau PET Image Synthesis Based on Latent Diffusion Models
Se-In Jang, Cristina Lois, Emma Thibault, J. Alex Becker, Yafei Dong, Marc D. Normandin, Julie C. Price, Keith A. Johnson, Georges El Fakhri, Kuang Gong
arXiv 2023. [Paper]
21 Jun 2023

SANO: Score-Based Diffusion Model for Anomaly Localization in Dermatology
Alvaro Gonzalez-Jimenez, Simone Lionetti, Marc Pouly, Alexander A. Navarini
CVPR Workshop 2023. [Paper]
18 Jun 2023

Aligning Synthetic Medical Images with Clinical Knowledge using Human Feedback
Shenghuan Sun, Gregory M. Goldgof, Atul Butte, Ahmed M. Alaa
arXiv 2023. [Paper]
16 Jun 2023

Annotator Consensus Prediction for Medical Image Segmentation with Diffusion Models
Tomer Amit, Shmuel Shichrur, Tal Shaharabany, Lior Wolf
arXiv 2023. [Paper]
15 Jun 2023

Deep Ultrasound Denoising Using Diffusion Probabilistic Models
Hojat Asgariandehkordi, Sobhan Goudarzi, Adrian Basarab, Hassan Rivaz
arXiv 2023. [Paper]
12 Jun 2023

Conditional Diffusion Models for Weakly Supervised Medical Image Segmentation
Xinrong Hu, Yu-Jen Chen, Tsung-Yi Ho, Yiyu Shi
arXiv 2023. [Paper]
6 Jun 2023

Interpretable Alzheimer’s Disease Classification Via a Contrastive Diffusion Autoencoder
Ayodeji Ijishakin, Ahmed Abdulaal, Adamos Hadjivasiliou, Sophie Martin, James Cole
arXiv 2023. [Paper]
5 Jun 2023

Optimizing Sampling Patterns for Compressed Sensing MRI with Diffusion Generative Models
Sriram Ravula, Brett Levac, Ajil Jalal, Jonathan I. Tamir, Alexandros G. Dimakis
arXiv 2023. [Paper]
5 Jun 2023

Brain tumor segmentation using synthetic MR images – A comparison of GANs and diffusion models
Muhammad Usman Akbar, Måns Larsson, Anders Eklund
arXiv 2023. [Paper]
5 Jun 2023

Unsupervised Anomaly Detection in Medical Images Using Masked Diffusion Model
Hasan Iqbal, Umar Khalid, Jing Hua, Chen Chen
arXiv 2023. [Paper]
31 May 2023

Mask, Stitch, and Re-Sample: Enhancing Robustness and Generalizability in Anomaly Detection through Automatic Diffusion Models
Cosmin I. Bercea, Michael Neumayr, Daniel Rueckert, Julia A. Schnabel
arXiv 2023. [Paper]
31 May 2023

Synthetic CT Generation from MRI using 3D Transformer-based Denoising Diffusion Model
Shaoyan Pan, Elham Abouei, Jacob Wynne, Tonghe Wang, Richard L. J. Qiu, Yuheng Li, Chih-Wei Chang, Junbo Peng, Justin Roper, Pretesh Patel, David S. Yu, Hui Mao, Xiaofeng Yang
arXiv 2023. [Paper]
31 May 2023

Conditional Diffusion Models for Semantic 3D Medical Image Synthesis
Zolnamar Dorjsembe, Hsing-Kuo Pao, Sodtavilan Odonchimed, Furen Xiao
arXiv 2023. [Paper]
29 May 2023

GenerateCT: Text-Guided 3D Chest CT Generation
Ibrahim Ethem Hamamci, Sezgin Er, Enis Simsar, Alperen Tezcan, Ayse Gulnihan Simsek, Furkan Almas, Sevval Nil Esirgun, Hadrien Reynaud, Sarthak Pati, Christian Bluethgen, Bjoern Menze
arXiv 2023. [Paper] [Github]
25 May 2023

A Diffusion Probabilistic Prior for Low-Dose CT Image Denoising
Xuan Liu, Yaoqin Xie, Songhui Diao, Shan Tan, Xiaokun Liang
arXiv 2023. [Paper]
25 May 2023

Multi-Level Global Context Cross Consistency Model for Semi-Supervised Ultrasound Image Segmentation with Diffusion Model
Fenghe Tang, Jianrui Ding, Lingtao Wang, Min Xian, Chunping Ning
arXiv 2023. [Paper] [Github]
16 May 2023

Beware of diffusion models for synthesizing medical images – A comparison with GANs in terms of memorizing brain tumor images
Muhammad Usman Akbar, Wuhao Wang, Anders Eklund
arXiv 2023. [Paper]
12 May 2023

Generation of Structurally Realistic Retinal Fundus Images with Diffusion Models
Sojung Go, Younghoon Ji, Sang Jun Park, Soochahn Lee
arXiv 2023. [Paper]
11 May 2023

Echo from noise: synthetic ultrasound image generation using diffusion models for real image segmentation
David Stojanovski, Uxio Hermida, Pablo Lamata, Arian Beqiri, Alberto Gomez
arXiv 2023. [Paper]
9 May 2023

Synthesizing PET images from High-field and Ultra-high-field MR images Using Joint Diffusion Attention Model
Taofeng Xie, Chentao Cao, Zhuoxu Cui, Yu Guo, Caiying Wu, Xuemei Wang, Qingneng Li, Zhanli Hu, Tao Sun, Ziru Sang, Yihang Zhou, Yanjie Zhu, Dong Liang, Qiyu Jin, Guoqing Chen, Haifeng Wang
arXiv 2023. [Paper]
6 May 2023

Solving Inverse Problems with Score-Based Generative Priors learned from Noisy Data
Asad Aali, Marius Arvinte, Sidharth Kumar, Jonathan I. Tamir
arXiv 2023. [Paper]
2 May 2023

Self-similarity-based super-resolution of photoacoustic angiography from hand-drawn doodles
Yuanzheng Ma, Wangting Zhou, Rui Ma, Sihua Yang, Yansong Tang, Xun Guan
arXiv 2023. [Paper]
2 May 2023

High-Fidelity Image Synthesis from Pulmonary Nodule Lesion Maps using Semantic Diffusion Model
Xuan Zhao, Benjamin Hou
MIDL 2023. [Paper]
2 May 2023

Unsupervised Discovery of 3D Hierarchical Structure with Generative Diffusion Features
Nurislam Tursynbek, Marc Niethammer
arXiv 2023. [Paper]
28 Apr 2023

Cycle-guided Denoising Diffusion Probability Model for 3D Cross-modality MRI Synthesis
Shaoyan Pan, Chih-Wei Chang, Junbo Peng, Jiahan Zhang, Richard L.J. Qiu, Tonghe Wang, Justin Roper, Tian Liu, Hui Mao, Xiaofeng Yang
arXiv 2023. [Paper]
28 Apr 2023

DiffuseExpand: Expanding dataset for 2D medical image segmentation using diffusion models
Shitong Shao, Xiaohan Yuan, Zhen Huang, Ziming Qiu, Shuai Wang, Kevin Zhou
arXiv 2023. [Paper] [Github]
26 Apr 2023

Realistic Data Enrichment for Robust Image Segmentation in Histopathology
Sarah Cechnicka, James Ball, Callum Arthurs, Candice Roufosse, Bernhard Kainz
arXiv 2023. [Paper]
19 Apr 2023

Denoising Diffusion Medical Models
Pham Ngoc Huy, Tran Minh Quan
IEEE ISBI 2023. [Paper]
19 Apr 2023

A Multi-Institutional Open-Source Benchmark Dataset for Breast Cancer Clinical Decision Support using Synthetic Correlated Diffusion Imaging Data
Chi-en Amy Tai, Hayden Gunraj, Alexander Wong
arXiv 2023. [Paper]
12 Apr 2023

Cancer-Net BCa-S: Breast Cancer Grade Prediction using Volumetric Deep Radiomic Features from Synthetic Correlated Diffusion Imaging
Chi-en Amy Tai, Hayden Gunraj, Alexander Wong
arXiv 2023. [Paper]
12 Apr 2023

SPIRiT-Diffusion: Self-Consistency Driven Diffusion Model for Accelerated MRI
Zhuo-Xu Cui, Chentao Cao, Jing Cheng, Sen Jia, Hairong Zheng, Dong Liang, Yanjie Zhu
arXiv 2023. [Paper]
11 Apr 2023

Mask-conditioned latent diffusion for generating gastrointestinal polyp images
Roman Macháček, Leila Mozaffari, Zahra Sepasdar, Sravanthi Parasa, Pål Halvorsen, Michael A. Riegler, Vajira Thambawita
arXiv 2023. [Paper]
11 Apr 2023

BerDiff: Conditional Bernoulli Diffusion Model for Medical Image Segmentation
Tao Chen, Chenhui Wang, Hongming Shan
arXiv 2023. [Paper]
10 Apr 2023

Ambiguous Medical Image Segmentation using Diffusion Models
Aimon Rahman, Jeya Maria Jose Valanarasu, Ilker Hacihaliloglu, Vishal M Patel
CVPR 2023. [Paper] [Github]
10 Apr 2023

MedGen3D: A Deep Generative Framework for Paired 3D Image and Mask Generation
Kun Han, Yifeng Xiong, Chenyu You, Pooya Khosravi, Shanlin Sun, Xiangyi Yan, James Duncan, Xiaohui Xie
arxiv 2023. [Paper] [Project]
8 Apr 2023

Towards Realistic Ultrasound Fetal Brain Imaging Synthesis
Michelle Iskandar, Harvey Mannering, Zhanxiang Sun, Jacqueline Matthew, Hamideh Kerdegari, Laura Peralta, Miguel Xochicale
arXiv 2023. [Paper] [Gitub]
8 Apr 2023

Zero-shot CT Field-of-view Completion with Unconditional Generative Diffusion Prior
Kaiwen Xu, Aravind R. Krishnan, Thomas Z. Li, Yuankai Huo, Kim L. Sandler, Fabien Maldonado, Bennett A. Landman
arXiv 2023. [Paper]
7 Apr 2023

Zero-shot Medical Image Translation via Frequency-Guided Diffusion Models
Yunxiang Li, Hua-Chieh Shao, Xiao Liang, Liyuan Chen, Ruiqi Li, Steve Jiang, Jing Wang, You Zhang
arXiv 2023. [Paper]
5 Apr 2023

CoreDiff: Contextual Error-Modulated Generalized Diffusion Model for Low-Dose CT Denoising and Generalization
Qi Gao, Zilong Li, Junping Zhang, Yi Zhang, Hongming Shan
arXiv 2023. [Paper]
4 Apr 2023

ViT-DAE: Transformer-driven Diffusion Autoencoder for Histopathology Image Analysis
Xuan Xu, Saarthak Kapse, Rajarsi Gupta, Prateek Prasanna
MICCAI 2023. [Paper]
3 Apr 2023

Pay Attention: Accuracy Versus Interpretability Trade-off in Fine-tuned Diffusion Models
Mischa Dombrowski, Hadrien Reynaud, Johanna P. Müller, Matthew Baugh, Bernhard Kainz
arXiv 2023. [Paper]
31 Mar 2023

DDMM-Synth: A Denoising Diffusion Model for Cross-modal Medical Image Synthesis with Sparse-view Measurement Embedding
Xiaoyue Li, Kai Shang, Gaoang Wang, Mark D. Butala
arXiv 2023. [Paper]
28 Mar 2023

Diffusion Models for Memory-efficient Processing of 3D Medical Images
Florentin Bieder, Julia Wolleb, Alicia Durrer, Robin Sandkühler, Philippe C. Cattin
MIDL 2023. [Paper]
27 Mar 2023

Multi-task Learning of Histology and Molecular Markers for Classifying Diffuse Glioma
Xiaofei Wang, Stephen Price, Chao Li
arXiv 2023. [Paper]
26 Mar 2023

CoLa-Diff: Conditional Latent Diffusion Model for Multi-Modal MRI Synthesis
Lan Jiang, Ye Mao, Xi Chen, Xiangfeng Wang, Chao Li
arXiv 2023. [Paper]
24 Mar 2023

DisC-Diff: Disentangled Conditional Diffusion Model for Multi-Contrast MRI Super-Resolution
Ye Mao, Lan Jiang, Xi Chen, Chao Li
arXiv 2023. [Paper]
23 Mar 2023

Medical diffusion on a budget: textual inversion for medical image generation
Bram de Wilde, Anindo Saha, Richard P.G. ten Broek, Henkjan Huisman
arXiv 2023. [Paper]
23 Mar 2023

Sub-volume-based Denoising Diffusion Probabilistic Model for Cone-beam CT Reconstruction from Incomplete Data
Wenjun Xia, Chuang Niu, Wenxiang Cong, Ge Wang
arXiv 2023. [Paper]
22 Mar 2023

Feature-Conditioned Cascaded Video Diffusion Models for Precise Echocardiogram Synthesis
Hadrien Reynaud, Mengyun Qiao, Mischa Dombrowski, Thomas Day, Reza Razavi, Alberto Gomez, Paul Leeson, Bernhard Kainz
arXiv 2023. [Paper]
22 Mar 2023

Distribution Aligned Diffusion and Prototype-guided network for Unsupervised Domain Adaptive Segmentation
Haipeng Zhou, Lei Zhu, Yuyin Zhou
arXiv 2023. [Paper]
22 Mar 2023

Semantic Latent Space Regression of Diffusion Autoencoders for Vertebral Fracture Grading
Matthias Keicher, Matan Atad, David Schinz, Alexandra S. Gersing, Sarah C. Foreman, Sophia S. Goller, Juergen Weissinger, Jon Rischewski, Anna-Sophia Dietrich, Benedikt Wiestler, Jan S. Kirschke, Nassir Navab
arXiv 2023. [Paper]
21 Mar 2023

NASDM: Nuclei-Aware Semantic Histopathology Image Generation Using Diffusion Models
Aman Shrivastava, P. Thomas Fletcher
arXiv 2023. [Paper]
20 Mar 2023

Cascaded Latent Diffusion Models for High-Resolution Chest X-ray Synthesis
Tobias Weber, Michael Ingrisch, Bernd Bischl, David Rügamer
arXiv 2023. [Paper]
20 Mar 2023

DiffMIC: Dual-Guidance Diffusion Network for Medical Image Classification
Yijun Yang, Huazhu Fu, Angelica Aviles-Rivero, Carola-Bibiane Schönlieb, Lei Zhu
arXiv 2023. [Paper]
19 Mar 2023

Diff-UNet: A Diffusion Embedded Network for Volumetric Segmentation
Zhaohu Xing, Liang Wan, Huazhu Fu, Guang Yang, Lei Zhu
arXiv 2023. [Paper] [Github]
18 Mar 2023

Reversing the Abnormal: Pseudo-Healthy Generative Networks for Anomaly Detection
Cosmin I Bercea, Benedikt Wiestler, Daniel Rueckert, Julia A Schnabel
arXiv 2023. [Paper]
15 Mar 2023

Improving 3D Imaging with Pre-Trained Perpendicular 2D Diffusion Models
Suhyeon Lee, Hyungjin Chung, Minyoung Park, Jonghyuk Park, Wi-Sun Ryu, Jong Chul Ye
arXiv 2023. [Paper]
15 Mar 2023

Class-Guided Image-to-Image Diffusion: Cell Painting from Brightfield Images with Class Labels
Jan Oscar Cross-Zamirski, Praveen Anand, Guy Williams, Elizabeth Mouchet, Yinhai Wang, Carola-Bibiane Schönlieb
arXiv 2023. [Paper] [Github]
15 Mar 2023

Stochastic Segmentation with Conditional Categorical Diffusion Models
Lukas Zbinden, Lars Doorenbos, Theodoros Pissas, Raphael Sznitman, Pablo Márquez-Neila
ICCV 2023. [Paper] [Github]
15 Mar 2023

Diffusion Models for Contrast Harmonization of Magnetic Resonance Images
Alicia Durrer, Julia Wolleb, Florentin Bieder, Tim Sinnecker, Matthias Weigel, Robin Sandkühler, Cristina Granziera, Özgür Yaldizli, Philippe C. Cattin
arXiv 2023. [Paper]
14 Mar 2023

Efficiently Training Vision Transformers on Structural MRI Scans for Alzheimer’s Disease Detection
Nikhil J. Dhinagar, Sophia I. Thomopoulos, Emily Laltoo, Paul M. Thompson
arXiv 2023. [Paper]
14 Mar 2023

Diffusion-Based Hierarchical Multi-Label Object Detection to Analyze Panoramic Dental X-rays
Ibrahim Ethem Hamamci, Sezgin Er, Enis Simsar, Anjany Sekuboyina, Mustafa Gundogar, Bernd Stadlinger, Albert Mehl, Bjoern Menze
arXiv 2023. [Paper]
11 Mar 2023

AugDiff: Diffusion based Feature Augmentation for Multiple Instance Learning in Whole Slide Image
Zhuchen Shao, Liuxi Dai, Yifeng Wang, Haoqian Wang, Yongbing Zhang
arXiv 2023. [Paper]
11 Mar 2023

Brain Diffuser: An End-to-End Brain Image to Brain Network Pipeline
Xuhang Chen, Baiying Lei, Chi-Man Pun, Shuqiang Wang
arXiv 2023. [Paper]
11 Mar 2023

Fast Diffusion Sampler for Inverse Problems by Geometric Decomposition
Hyungjin Chung, Suhyeon Lee, Jong Chul Ye
arXiv 2023. [Paper]
10 Mar 2023

Generalized Diffusion MRI Denoising and Super-Resolution using Swin Transformers
Amir Sadikov, Jamie Wren-Jarvis, Xinlei Pan, Lanya T. Cai, Pratik Mukherjee
arXiv 2023. [Paper]
10 Mar 2023

Importance of Aligning Training Strategy with Evaluation for Diffusion Models in 3D Multiclass Segmentation
Yunguan Fu, Yiwen Li, Shaheer U. Saeed, Matthew J. Clarkson, Yipeng Hu
arXiv 2023. [Paper] [Github]
10 Mar 2023

Patched Diffusion Models for Unsupervised Anomaly Detection in Brain MRI
Finn Behrendt, Debayan Bhattacharya, Julia Krüger, Roland Opfer, Alexander Schlaefer
MIDL 2023. [Paper]
7 Mar 2023

Bi-parametric prostate MR image synthesis using pathology and sequence-conditioned stable diffusion
Shaheer U. Saeed, Tom Syer, Wen Yan, Qianye Yang, Mark Emberton, Shonit Punwani, Matthew J. Clarkson, Dean C. Barratt, Yipeng Hu
arXiv 2023. [Paper]
3 Mar 2023

Dissolving Is Amplifying: Towards Fine-Grained Anomaly Detection
Jian Shi, Pengyi Zhang, Ni Zhang, Hakim Ghazzai, Yehia Massoud
arXiv 2023. [Paper]
28 Feb 2023

DDM2: Self-Supervised Diffusion MRI Denoising with Generative Diffusion Models
Tiange Xiang, Mahmut Yurt, Ali B Syed, Kawin Setsompop, Akshay Chaudhari
ICLR 2023. [Paper] [Github]
6 Feb 2023

Zero-shot-Learning Cross-Modality Data Translation Through Mutual Information Guided Stochastic Diffusion
Zihao Wang, Yingyu Yang, Maxime Sermesant, Hervé Delingette, Ona Wu
arXiv 2023. [Paper]
31 Jan 2023

Diffusion Denoising for Low-Dose-CT Model
Runyi Li
arXiv 2023. [Paper]
27 Jan 2023

DiffusionCT: Latent Diffusion Model for CT Image Standardization
Md Selim, Jie Zhang, Michael A. Brooks, Ge Wang, Jin Chen
arXiv 2023. [Paper]
20 Jan 2023

MedSegDiff-V2: Diffusion based Medical Image Segmentation with Transformer
Junde Wu, Rao Fu, Huihui Fang, Yu Zhang, Yanwu Xu
arXiv 2023. [Paper]
19 Jan 2023

The role of noise in denoising models for anomaly detection in medical images
Antanas Kascenas, Pedro Sanchez, Patrick Schrempf, Chaoyang Wang, William Clackett, Shadia S. Mikhael, Jeremy P. Voisey, Keith Goatman, Alexander Weir, Nicolas Pugeault, Sotirios A. Tsaftaris, Alison Q. O’Neil
arXiv 2023. [Paper] [Github]
19 Jan 2023

Diffusion-based Data Augmentation for Skin Disease Classification: Impact Across Original Medical Datasets to Fully Synthetic Images
Mohamed Akrout, Bálint Gyepesi, Péter Holló, Adrienn Poór, Blága Kincső, Stephen Solis, Katrina Cirone, Jeremy Kawahara, Dekker Slade, Latif Abid, Máté Kovács, István Fazekas
arXiv 2023. [Paper]
12 Jan 2023

Annealed Score-Based Diffusion Model for MR Motion Artifact Reduction
Gyutaek Oh, Jeong Eun Lee, Jong Chul Ye
arXiv 2023. [Paper]
8 Jan 2023

Denoising Diffusion Probabilistic Models for Generation of Realistic Fully-Annotated Microscopy Image Data Sets
Dennis Eschweiler, Johannes Stegmaier
arXiv 2023. [Paper]
2 Jan 2023

Diffusion Model based Semi-supervised Learning on Brain Hemorrhage Images for Efficient Midline Shift Quantification
Shizhan Gong, Cheng Chen, Yuqi Gong, Nga Yan Chan, Wenao Ma, Calvin Hoi-Kwan Mak, Jill Abrigo, Qi Dou
arXiv 2023. [Paper]
1 Jan 2023

SADM: Sequence-Aware Diffusion Model for Longitudinal Medical Image Generation
Jee Seok Yoon, Chenghao Zhang, Heung-Il Suk, Jia Guo, Xiaoxiao Li
arXiv 2022. [Paper]
16 Dec 2022

Universal Generative Modeling in Dual-domain for Dynamic MR Imaging
Chuanming Yu, Yu Guan, Ziwen Ke, Dong Liang, Qiegen Liu
arXiv 2022. [Paper]
15 Dec 2022

Generating Realistic 3D Brain MRIs Using a Conditional Diffusion Probabilistic Model
Wei Peng, Ehsan Adeli, Qingyu Zhao, Kilian M. Pohl
arXiv 2022. [Paper] [Github]
15 Dec 2022

SPIRiT-Diffusion: SPIRiT-driven Score-Based Generative Modeling for Vessel Wall imaging
Chentao Cao, Zhuo-Xu Cui, Jing Cheng, Sen Jia, Hairong Zheng, Dong Liang, Yanjie Zhu
arXiv 2022. [Paper]
14 Dec 2022

Diffusion Probabilistic Models beat GANs on Medical Images
Gustav Müller-Franzes, Jan Moritz Niehues, Firas Khader, Soroosh Tayebi Arasteh, Christoph Haarburger, Christiane Kuhl, Tianci Wang, Tianyu Han, Sven Nebelung, Jakob Nikolas Kather, Daniel Truhn
arXiv 2022. [Paper]
14 Dec 2022

One Sample Diffusion Model in Projection Domain for Low-Dose CT Imaging
Bin Huang, Liu Zhang, Shiyu Lu, Boyu Lin, Weiwen Wu, Qiegen Liu
arXiv 2022. [Paper]
7 Dec 2022

Neural Cell Video Synthesis via Optical-Flow Diffusion
Manuel Serna-Aguilera, Khoa Luu, Nathaniel Harris, Min Zou
arXiv 2022. [Paper]
6 Dec 2022

Improving dermatology classifiers across populations using images generated by large diffusion models
Luke W. Sagers, James A. Diao, Matthew Groh, Pranav Rajpurkar, Adewole S. Adamson, Arjun K. Manrai
NeurIPS Workshop 2022. [Paper]
23 Nov 2022

RoentGen: Vision-Language Foundation Model for Chest X-ray Generation
Pierre Chambon, Christian Bluethgen, Jean-Benoit Delbrouck, Rogier Van der Sluijs, Małgorzata Połacin, Juan Manuel Zambrano Chaves, Tanishq Mathew Abraham, Shivanshu Purohit, Curtis P. Langlotz, Akshay Chaudhari
arXiv 2022. [Paper]
23 Nov 2022

DOLCE: A Model-Based Probabilistic Diffusion Framework for Limited-Angle CT Reconstruction
Jiaming Liu, Rushil Anirudh, Jayaraman J. Thiagarajan, Stewart He, K. Aditya Mohan, Ulugbek S. Kamilov, Hyojin Kim
arXiv 2022. [Paper]
22 Nov 2022

Solving 3D Inverse Problems using Pre-trained 2D Diffusion Models
Hyungjin Chung, Dohoon Ryu, Michael T. McCann, Marc L. Klasky, Jong Chul Ye
arXiv 2022. [Paper]
19 Nov 2022

Patch-Based Denoising Diffusion Probabilistic Model for Sparse-View CT Reconstruction
Wenjun Xia, Wenxiang Cong, Ge Wang
arXiv 2022. [Paper]
18 Nov 2022

Brain PET Synthesis from MRI Using Joint Probability Distribution of Diffusion Model at Ultrahigh Fields
Xie Taofeng, Cao Chentao, Cui Zhuoxu, Li Fanshi, Wei Zidong, Zhu Yanjie, Li Ye, Liang Dong, Jin Qiyu, Chen Guoqing, Wang Haifeng
arXiv 2022. [Paper]
16 Nov 2022

Improved HER2 Tumor Segmentation with Subtype Balancing using Deep Generative Networks
Mathias Öttl, Jana Mönius, Matthias Rübner, Carol I. Geppert, Jingna Qiu, Frauke Wilm, Arndt Hartmann, Matthias W. Beckmann, Peter A. Fasching, Andreas Maier, Ramona Erber, Katharina Breininger
arXiv 2022. [Paper]
11 Nov 2022

An unobtrusive quality supervision approach for medical image annotation
Sonja Kunzmann, Mathias Öttl, Prathmesh Madhu, Felix Denzinger, Andreas Maier
arXiv 2022. [Paper]
11 Nov 2022

Medical Diffusion – Denoising Diffusion Probabilistic Models for 3D Medical Image Generation
Firas Khader, Gustav Mueller-Franzes, Soroosh Tayebi Arasteh, Tianyu Han, Christoph Haarburger, Maximilian Schulze-Hagen, Philipp Schad, Sandy Engelhardt, Bettina Baessler, Sebastian Foersch, Johannes Stegmaier, Christiane Kuhl, Sven Nebelung, Jakob Nikolas Kather, Daniel Truhn
arXiv 2022. [Paper]
7 Nov 2022

Generation of Anonymous Chest Radiographs Using Latent Diffusion Models for Training Thoracic Abnormality Classification Systems
Kai Packhäuser, Lukas Folle, Florian Thamm, Andreas Maier
arXiv 2022. [Paper]
2 Nov 2022

Spot the fake lungs: Generating Synthetic Medical Images using Neural Diffusion Models
Hazrat Ali, Shafaq Murad, Zubair Shah
arXiv 2022. [Paper] [Project]
2 Nov 2022

MedSegDiff: Medical Image Segmentation with Diffusion Probabilistic Model
Junde Wu, Huihui Fang, Yu Zhang, Yehui Yang, Yanwu Xu
arXiv 2022. [Paper]
1 Nov 2022

Accelerating Diffusion Models via Pre-segmentation Diffusion Sampling for Medical Image Segmentation
Xutao Guo, Yanwu Yang, Chenfei Ye, Shang Lu, Yang Xiang, Ting Ma
arXiv 2022. [Paper]
27 Oct 2022

Multitask Brain Tumor Inpainting with Diffusion Models: A Methodological Report
Pouria Rouzrokh, Bardia Khosravi, Shahriar Faghani, Mana Moassefi, Sanaz Vahdati, Bradley J. Erickson
arXiv 2022. [Paper] [Github]
21 Oct 2022

Adapting Pretrained Vision-Language Foundational Models to Medical Imaging Domains
Pierre Chambon, Christian Bluethgen, Curtis P. Langlotz, Akshay Chaudhari
arXiv 2022. [Paper]
9 Oct 2022

Anatomically constrained CT image translation for heterogeneous blood vessel segmentation
Giammarco La Barbera, Haithem Boussaid, Francesco Maso, Sabine Sarnacki, Laurence Rouet, Pietro Gori, Isabelle Bloch
BMVC 2022. [Paper]
4 Oct 2022

Low-Dose CT Using Denoising Diffusion Probabilistic Model for 20× Speedup
Wenjun Xia, Qing Lyu, Ge Wang
arXiv 2022. [Paper]
29 Sep 2022

Diffusion Adversarial Representation Learning for Self-supervised Vessel Segmentation
Boah Kim, Yujin Oh, Jong Chul Ye
arXiv 2022. [Paper]
29 Sep 2022

Conversion Between CT and MRI Images Using Diffusion and Score-Matching Models
Qing Lyu, Ge Wang
arXiv 2022. [Paper]
24 Sep 2022

Brain Imaging Generation with Latent Diffusion Models
Walter H. L. Pinaya, Petru-Daniel Tudosiu, Jessica Dafflon, Pedro F da Costa, Virginia Fernandez, Parashkev Nachev, Sebastien Ourselin, M. Jorge Cardoso
arXiv 2022. [Paper]
15 Sep 2022

PET image denoising based on denoising diffusion probabilistic models
Kuang Gong, Keith A. Johnson, Georges El Fakhri, Quanzheng Li, Tinsu Pan
arXiv 2022. [Paper]
13 Sep 2022

Self-Score: Self-Supervised Learning on Score-Based Models for MRI Reconstruction
Zhuo-Xu Cui, Chentao Cao, Shaonan Liu, Qingyong Zhu, Jing Cheng, Haifeng Wang, Yanjie Zhu, Dong Liang
IEEE TMI 2022. [Paper]
2 Sep 2022

High-Frequency Space Diffusion Models for Accelerated MRI
Chentao Cao, Zhuo-Xu Cui, Shaonan Liu, Dong Liang, Yanjie Zhu
arXiv 2022. [Paper]
10 Aug 2022

What is Healthy? Generative Counterfactual Diffusion for Lesion Localization
Pedro Sanchez, Antanas Kascenas, Xiao Liu, Alison Q. O’Neil, Sotirios A. Tsaftaris
MICCAI 2022. [Paper] [Github]
25 Jul 2022

Unsupervised Medical Image Translation with Adversarial Diffusion Models
Muzaffer Özbey, Salman UH Dar, Hasan A Bedel, Onat Dalmaz, Şaban Özturk, Alper Güngör, Tolga Çukur
arXiv 2022. [Paper]
17 Jul 2022

Adaptive Diffusion Priors for Accelerated MRI Reconstruction
Salman UH Dar, Şaban Öztürk, Yilmaz Korkmaz, Gokberk Elmas, Muzaffer Özbey, Alper Güngör, Tolga Çukur
arXiv 2022. [Paper]
12 Jul 2022

A Novel Unified Conditional Score-based Generative Framework for Multi-modal Medical Image Completion
Xiangxi Meng, Yuning Gu, Yongsheng Pan, Nizhuan Wang, Peng Xue, Mengkang Lu, Xuming He, Yiqiang Zhan, Dinggang Shen
arXiv 2022. [Paper]
7 Jul 2022

Cross-Modal Transformer GAN: A Brain Structure-Function Deep Fusing Framework for Alzheimer’s Disease
Junren Pan, Shuqiang Wang
arXiv 2022. [Paper]
20 Jun 2022

Diffusion Deformable Model for 4D Temporal Medical Image Generation
Boah Kim, Jong Chul Ye
MICCAI 2022. [Paper] [Github]
27 Jun 2022

Fast Unsupervised Brain Anomaly Detection and Segmentation with Diffusion Models
Walter H. L. Pinaya, Mark S. Graham, Robert Gray, Pedro F Da Costa, Petru-Daniel Tudosiu, Paul Wright, Yee H. Mah, Andrew D. MacKinnon, James T. Teo, Rolf Jager, David Werring, Geraint Rees, Parashkev Nachev, Sebastien Ourselin, M. Jorge Cardos
MICCAI 2022. [Paper]
7 Jun 2022

Improving Diffusion Models for Inverse Problems using Manifold Constraints
Hyungjin Chung, Byeongsu Sim, Dohoon Ryu, Jong Chul Ye
arXiv 2022. [Paper]
2 Jun 2022

AnoDDPM: Anomaly Detection with Denoising Diffusion Probabilistic Models using Simplex Noise
Julian Wyatt, Adam Leach, Sebastian M. Schmon, Chris G. Willcocks
CVPR Workshop 2022. [Paper] [Github]
1 Jun 2022

The Swiss Army Knife for Image-to-Image Translation: Multi-Task Diffusion Models
Julia Wolleb, Robin Sandkühler, Florentin Bieder, Philippe C. Cattin
arXiv 2022. [Paper]
6 Apr 2022

MR Image Denoising and Super-Resolution Using Regularized Reverse Diffusion
Hyungjin Chung, Eun Sun Lee, Jong Chul Ye
arXiv 2022. [Paper]
23 Mar 2022

Diffusion Models for Medical Anomaly Detection
Julia Wolleb, Florentin Bieder, Robin Sandkühler, Philippe C. Cattin
MICCAI 2022. [Paper] [Github]
8 Mar 2022

Towards performant and reliable undersampled MR reconstruction via diffusion model sampling
Cheng Peng, Pengfei Guo, S. Kevin Zhou, Vishal Patel, Rama Chellappa
arXiv 2022. [Paper] [Github]
8 Mar 2022

Measurement-conditioned Denoising Diffusion Probabilistic Model for Under-sampled Medical Image Reconstruction
Yutong Xie, Quanzheng Li
MICCAI 2022. [Paper] [Github]
5 Mar 2022

MRI Reconstruction via Data Driven Markov Chain with Joint Uncertainty Estimation
Guanxiong Luo, Martin Heide, Martin Uecker
arXiv 2022. [Paper] [Github]
3 Feb 2022

Unsupervised Denoising of Retinal OCT with Diffusion Probabilistic Model
Dewei Hu, Yuankai K. Tao, Ipek Oguz
arXiv 2022. [Paper] [Github]
27 Jan 2022

Come-Closer-Diffuse-Faster: Accelerating Conditional Diffusion Models for Inverse Problems through Stochastic Contraction
Hyungjin Chung, Byeongsu Sim, Jong Chul Ye
CVPR 2021. [Paper]
9 Dec 2021

Solving Inverse Problems in Medical Imaging with Score-Based Generative Models
Yang Song, Liyue Shen, Lei Xing, Stefano Ermon
NeurIPS Workshop 2021. [Paper] [Github]
15 Nov 2021

Score-based diffusion models for accelerated MRI
Hyungjin Chung, Jong chul Ye
MIA 2021. [Paper] [Github]
8 Oct 2021

Multi-modal Learning

IterInv: Iterative Inversion for Pixel-Level T2I Models
Chuanming Tang, Kai Wang, Joost van de Weijer
arXiv 2023. [Paper]
30 Oct 2023

VideoCrafter1: Open Diffusion Models for High-Quality Video Generation
Haoxin Chen, Menghan Xia, Yingqing He, Yong Zhang, Xiaodong Cun, Shaoshu Yang, Jinbo Xing, Yaofang Liu, Qifeng Chen, Xintao Wang, Chao Weng, Ying Shan
arXiv 2023. [Paper]
30 Oct 2023

IMPRESS: Evaluating the Resilience of Imperceptible Perturbations Against Unauthorized Data Usage in Diffusion-Based Generative AI
Bochuan Cao, Changjiang Li, Ting Wang, Jinyuan Jia, Bo Li, Jinghui Chen
NeurIPS 2023. [Paper]
30 Oct 2023

CustomNet: Zero-shot Object Customization with Variable-Viewpoints in Text-to-Image Diffusion Models
Ziyang Yuan, Mingdeng Cao, Xintao Wang, Zhongang Qi, Chun Yuan, Ying Shan
arXiv 2023. [Paper]
30 Oct 2023

Seeing Through the Conversation: Audio-Visual Speech Separation based on Diffusion Model
Suyeon Lee, Chaeyoung Jung, Youngjoon Jang, Jaehun Kim, Joon Son Chung
arXiv 2023. [Paper]
30 Oct 2023

Text-to-3D with Classifier Score Distillation
Xin Yu, Yuan-Chen Guo, Yangguang Li, Ding Liang, Song-Hai Zhang, Xiaojuan Qi
arXiv 2023. [Paper]
30 Oct 2023

Customizing 360-Degree Panoramas through Text-to-Image Diffusion Models
Hai Wang, Xiaoyu Xiang, Yuchen Fan, Jing-Hao Xue
arXiv 2023. [Paper]
28 Oct 2023

SD4Match: Learning to Prompt Stable Diffusion Model for Semantic Matching
Xinghui Li, Jingyi Lu, Kai Han, Victor Prisacariu
arXiv 2023. [Paper]
26 Oct 2023

CADS: Unleashing the Diversity of Diffusion Models through Condition-Annealed Sampling
Seyedmorteza Sadat, Jakob Buhmann, Derek Bradely, Otmar Hilliges, Romann M. Weber
arXiv 2023. [Paper]
26 Oct 2023

Exploring Iterative Refinement with Diffusion Models for Video Grounding
Xiao Liang, Tao Shi, Yaoyuan Liang, Te Tao, Shao-Lun Huang
arXiv 2023. [Paper]
26 Oct 2023

A Picture is Worth a Thousand Words: Principled Recaptioning Improves Image Generation
Eyal Segalis, Dani Valevski, Danny Lumen, Yossi Matias, Yaniv Leviathan
arXiv 2023. [Paper]
25 Oct 2023

CommonCanvas: An Open Diffusion Model Trained with Creative-Commons Images
Aaron Gokaslan, A. Feder Cooper, Jasmine Collins, Landan Seguin, Austin Jacobson, Mihir Patel, Jonathan Frankle, Cory Stephenson, Volodymyr Kuleshov
arXiv 2023. [Paper]
25 Oct 2023

On the Proactive Generation of Unsafe Images From Text-To-Image Models Using Benign Prompts
Yixin Wu, Ning Yu, Michael Backes, Yun Shen, Yang Zhang
arXiv 2023. [Paper]
25 Oct 2023

Fuse Your Latents: Video Editing with Multi-source Latent Diffusion Models
Tianyi Lu, Xing Zhang, Jiaxi Gu, Hang Xu, Renjing Pei, Songcen Xu, Zuxuan Wu
arXiv 2023. [Paper]
25 Oct 2023

Adapt Anything: Tailor Any Image Classifiers across Domains And Categories Using Text-to-Image Diffusion Models
Weijie Chen, Haoyu Wang, Shicai Yang, Lei Zhang, Wei Wei, Yanning Zhang, Luojun Lin, Di Xie, Yueting Zhuang
arXiv 2023. [Paper]
25 Oct 2023

Text Guided Video Editing Competition
Jay Zhangjie Wu, Xiuyu Li, Difei Gao, Zhen Dong, Jinbin Bai, Aishani Singh, Xiaoyu Xiang, Youzeng Li, Zuwei Huang, Yuanxi Sun, Rui He, Feng Hu, Junhua Hu, Hai Huang, Hanyu Zhu, Xu Cheng, Jie Tang, Mike Zheng Shou, Kurt Keutzer, Forrest Iandola
arXiv 2023. [Paper]
24 Oct 2023

Language-driven Scene Synthesis using Multi-conditional Diffusion Model
An Vuong, Minh Nhat Vu, Toan Tien Nguyen, Baoru Huang, Dzung Nguyen, Thieu Vo, Anh Nguyen
arXiv 2023. [Paper]
24 Oct 2023

FreeNoise: Tuning-Free Longer Video Diffusion via Noise Rescheduling
Haonan Qiu, Menghan Xia, Yong Zhang, Yingqing He, Xintao Wang, Ying Shan, Ziwei Liu
arXiv 2023. [Paper] [Project]
23 Oct 2023

SyncFusion: Multimodal Onset-synchronized Video-to-Audio Foley Synthesis
Marco Comunità, Riccardo F. Gramaccioni, Emilian Postolache, Emanuele Rodolà, Danilo Comminiello, Joshua D. Reiss
arXiv 2023. [Paper]
23 Oct 2023

Matryoshka Diffusion Models
Jiatao Gu, Shuangfei Zhai, Yizhe Zhang, Josh Susskind, Navdeep Jaitly
arXiv 2023. [Paper]
23 Oct 2023

Large Language Models can Share Images, Too!
Young-Jun Lee, Jonghwan Hyeon, Ho-Jin Choi
arXiv 2023. [Paper]
23 Oct 2023

Prompt-Specific Poisoning Attacks on Text-to-Image Generative Models
Shawn Shan, Wenxin Ding, Josephine Passananti, Haitao Zheng, Ben Y. Zhao
arXiv 2023. [Paper]
20 Oct 2023

TexFusion: Synthesizing 3D Textures with Text-Guided Image Diffusion Models
Tianshi Cao, Karsten Kreis, Sanja Fidler, Nicholas Sharp, Kangxue Yin
arXiv 2023. [Paper]
20 Oct 2023

DPM-Solver-v3: Improved Diffusion ODE Solver with Empirical Model Statistics
Kaiwen Zheng, Cheng Lu, Jianfei Chen, Jun Zhu
NeurIPS 2023. [Paper] [Project]
20 Oct 2023

Localizing and Editing Knowledge in Text-to-Image Generative Models
Samyadeep Basu, Nanxuan Zhao, Vlad Morariu, Soheil Feizi, Varun Manjunatha
arXiv 2023. [Paper]
20 Oct 2023

TapMo: Shape-aware Motion Generation of Skeleton-free Characters
Jiaxu Zhang, Shaoli Huang, Zhigang Tu, Xin Chen, Xiaohang Zhan, Gang Yu, Ying Shan
arXiv 2023. [Paper]
19 Oct 2023

CycleNet: Rethinking Cycle Consistency in Text-Guided Diffusion for Image Manipulation
Sihan Xu, Ziqiao Ma, Yidong Huang, Honglak Lee, Joyce Chai
arXiv 2023. [Paper]
19 Oct 2023

DreamSpace: Dreaming Your Room Space with Text-Driven Panoramic Texture Propagation
Bangbang Yang, Wenqi Dong, Lin Ma, Wenbo Hu, Xiao Liu, Zhaopeng Cui, Yuewen Ma
arXiv 2023. [Paper]
19 Oct 2023

EMIT-Diff: Enhancing Medical Image Segmentation via Text-Guided Diffusion Model
Zheyuan Zhang, Lanhong Yao, Bin Wang, Debesh Jha, Elif Keles, Alpay Medetalibeyoglu, Ulas Bagci
arXiv 2023. [Paper]
19 Oct 2023

Diverse Diffusion: Enhancing Image Diversity in Text-to-Image Generation
Mariia Zameshina, Olivier Teytaud, Laurent Najman
arXiv 2023. [Paper]
19 Oct 2023

DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Jinbo Xing, Menghan Xia, Yong Zhang, Haoxin Chen, Xintao Wang, Tien-Tsin Wong, Ying Shan
arXiv 2023. [Paper]
18 Oct 2023

Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts
Xinhua Cheng, Tianyu Yang, Jianan Wang, Yu Li, Lei Zhang, Jian Zhang, Li Yuan
arXiv 2023. [Paper]
18 Oct 2023

Language Agents for Detecting Implicit Stereotypes in Text-to-image Models at Scale
Qichao Wang, Tian Bian, Yian Yin, Tingyang Xu, Hong Cheng, Helen M. Meng, Zibin Zheng, Liang Chen, Bingzhe Wu
arXiv 2023. [Paper]
18 Oct 2023

Elucidating The Design Space of Classifier-Guided Diffusion Generation
Jiajun Ma, Tianyang Hu, Wenjia Wang, Jiacheng Sun
arXiv 2023. [Paper] [Github]
17 Oct 2023

BayesDiff: Estimating Pixel-wise Uncertainty in Diffusion via Bayesian Inference
Siqi Kou, Lei Gan, Dequan Wang, Chongxuan Li, Zhijie Deng
arXiv 2023. [Paper]
17 Oct 2023

GenEval: An Object-Focused Framework for Evaluating Text-to-Image Alignment
Dhruba Ghosh, Hanna Hajishirzi, Ludwig Schmidt
arXiv 2023. [Paper]
17 Oct 2023

Towards Training-free Open-world Segmentation via Image Prompting Foundation Models
Lv Tang, Peng-Tao Jiang, Hao-Ke Xiao, Bo Li
arXiv 2023. [Paper]
17 Oct 2023

LAMP: Learn A Motion Pattern for Few-Shot-Based Video Generation
Ruiqi Wu, Liangyu Chen, Tong Yang, Chunle Guo, Chongyi Li, Xiangyu Zhang
arXiv 2023. [Paper] [Project] [Github]
16 Oct 2023

Scene Graph Conditioning in Latent Diffusion
Frank Fundel
arXiv 2023. [Paper] [Github]
16 Oct 2023

Ring-A-Bell! How Reliable are Concept Removal Methods for Diffusion Models?
Yu-Lin Tsai, Chia-Yi Hsu, Chulin Xie, Chih-Hsun Lin, Jia-You Chen, Bo Li, Pin-Yu Chen, Chia-Mu Yu, Chun-Ying Huang
arXiv 2023. [Paper]
16 Oct 2023

Zero-Shot Robotic Manipulation with Pretrained Image-Editing Diffusion Models
Kevin Black, Mitsuhiko Nakamoto, Pranav Atreya, Homer Walke, Chelsea Finn, Aviral Kumar, Sergey Levine
arXiv 2023. [Paper]
16 Oct 2023

ViPE: Visualise Pretty-much Everything
Hassan Shahmohammadi, Adhiraj Ghosh, Hendrik P. A. Lensch
arXiv 2023. [Paper]
16 Oct 2023

TOSS:High-quality Text-guided Novel View Synthesis from a Single Image
Yukai Shi, Jianan Wang, He Cao, Boshi Tang, Xianbiao Qi, Tianyu Yang, Yukun Huang, Shilong Liu, Lei Zhang, Heung-Yeung Shum
arXiv 2023. [Paper]
16 Oct 2023

LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts
Hanan Gani, Shariq Farooq Bhat, Muzammal Naseer, Salman Khan, Peter Wonka
arXiv 2023. [Paper]
16 Oct 2023

LOVECon: Text-driven Training-Free Long Video Editing with ControlNet
Zhenyi Liao, Zhijie Deng
arXiv 2023. [Paper]
15 Oct 2023

PaintHuman: Towards High-fidelity Text-to-3D Human Texturing via Denoised Score Distillation
Jianhui Yu, Hao Zhu, Liming Jiang, Chen Change Loy, Weidong Cai, Wayne Wu
arXiv 2023. [Paper]
14 Oct 2023

Compositional Abilities Emerge Multiplicatively: Exploring Diffusion Models on a Synthetic Task
Maya Okawa, Ekdeep Singh Lubana, Robert P. Dick, Hidenori Tanaka
ICML Workshop 2023. [Paper]
13 Oct 2023

Hypernymy Understanding Evaluation of Text-to-Image Models via WordNet Hierarchy
Anton Baryshnikov, Max Ryabinin
arXiv 2023. [Paper]
13 Oct 2023

Making Multimodal Generation Easier: When Diffusion Models Meet LLMs
Xiangyu Zhao, Bo Liu, Qijiong Liu, Guangyuan Shi, Xiao-Ming Wu
arXiv 2023. [Paper]
13 Oct 2023

R&B: Region and Boundary Aware Zero-shot Grounded Text-to-image Generation
Jiayu Xiao, Liang Li, Henglei Lv, Shuhui Wang, Qingming Huang
arXiv 2023. [Paper]
13 Oct 2023

DeltaSpace: A Semantic-aligned Feature Space for Flexible Text-guided Image Editing
Yueming Lyu, Kang Zhao, Bo Peng, Yue Jiang, Yingya Zhang, Jing Dong
arXiv 2023. [Paper]
12 Oct 2023

OmniControl: Control Any Joint at Any Time for Human Motion Generation
Yiming Xie, Varun Jampani, Lei Zhong, Deqing Sun, Huaizu Jiang
arXiv 2023. [Paper] [Project]
12 Oct 2023

HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion
Xian Liu, Jian Ren, Aliaksandr Siarohin, Ivan Skorokhodov, Yanyu Li, Dahua Lin, Xihui Liu, Ziwei Liu, Sergey Tulyakov
arXiv 2023. [Paper] [Project] [Github]
12 Oct 2023

GaussianDreamer: Fast Generation from Text to 3D Gaussian Splatting with Point Cloud Priors
Taoran Yi, Jiemin Fang, Guanjun Wu, Lingxi Xie, Xiaopeng Zhang, Wenyu Liu, Qi Tian, Xinggang Wang
arXiv 2023. [Paper]
12 Oct 2023

MotionDirector: Motion Customization of Text-to-Video Diffusion Models
Rui Zhao, Yuchao Gu, Jay Zhangjie Wu, David Junhao Zhang, Jiawei Liu, Weijia Wu, Jussi Keppo, Mike Zheng Shou
arXiv 2023. [Paper]
12 Oct 2023

Interpretable Diffusion via Information Decomposition
Xianghao Kong, Ollie Liu, Han Li, Dani Yogatama, Greg Ver Steeg
arXiv 2023. [Paper]
12 Oct 2023

DrivingDiffusion: Layout-Guided multi-view driving scene video generation with latent diffusion model
Xiaofan Li, Yifu Zhang, Xiaoqing Ye
arXiv 2023. [Paper] [Project] [Github]
11 Oct 2023

ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models
Yingqing He, Shaoshu Yang, Haoxin Chen, Xiaodong Cun, Menghan Xia, Yong Zhang, Xintao Wang, Ran He, Qifeng Chen, Ying Shan
arXiv 2023. [Paper] [Project] [Github]
11 Oct 2023

ConditionVideo: Training-Free Condition-Guided Text-to-Video Generation
Bo Peng, Xinyuan Chen, Yaohui Wang, Chaochao Lu, Yu Qiao
arXiv 2023. [Paper]
11 Oct 2023

Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models
Zeqiang Lai, Xizhou Zhu, Jifeng Dai, Yu Qiao, Wenhai Wang
arXiv 2023. [Paper]
11 Oct 2023

Multi-Concept T2I-Zero: Tweaking Only The Text Embeddings and Nothing Else
Hazarapet Tunanyan, Dejia Xu, Shant Navasardyan, Zhangyang Wang, Humphrey Shi
arXiv 2023. [Paper]
11 Oct 2023

Uni-paint: A Unified Framework for Multimodal Image Inpainting with Pretrained Diffusion Model
Shiyuan Yang, Xiaodong Chen, Jing Liao
arXiv 2023. [Paper]
11 Oct 2023

ObjectComposer: Consistent Generation of Multiple Objects Without Fine-tuning
Alec Helbling, Evan Montoya, Duen Horng Chau
arXiv 2023. [Paper]
10 Oct 2023

JointNet: Extending Text-to-Image Diffusion for Dense Distribution Modeling
Jingyang Zhang, Shiwei Li, Yuanxun Lu, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan, Yao Yao
arXiv 2023. [Paper]
10 Oct 2023

Improving Compositional Text-to-image Generation with Large Vision-Language Models
Song Wen, Guian Fang, Renrui Zhang, Peng Gao, Hao Dong, Dimitris Metaxas
arXiv 2023. [Paper]
10 Oct 2023

Geom-Erasing: Geometry-Driven Removal of Implicit Concept in Diffusion Models
Zhili Liu, Kai Chen, Yifan Zhang, Jianhua Han, Lanqing Hong, Hang Xu, Zhenguo Li, Dit-Yan Yeung, James Kwok
arXiv 2023 [Paper]
9 Oct 2023

FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing
Yuren Cong, Mengmeng Xu, Christian Simon, Shoufa Chen, Jiawei Ren, Yanping Xie, Juan-Manuel Perez-Rua, Bodo Rosenhahn, Tao Xiang, Sen He
arXiv 2023. [Paper]
9 Oct 2023

Language Model Beats Diffusion – Tokenizer is Key to Visual Generation
Lijun Yu, José Lezama, Nitesh B. Gundavarapu, Luca Versari, Kihyuk Sohn, David Minnen, Yong Cheng, Agrim Gupta, Xiuye Gu, Alexander G. Hauptmann, Boqing Gong, Ming-Hsuan Yang, Irfan Essa, David A. Ross, Lu Jiang
arXiv 2023. [Paper] [Github]
9 Oct 2023

IPDreamer: Appearance-Controllable 3D Object Generation with Image Prompts
Bohan Zeng, Shanglin Li, Yutang Feng, Hong Li, Sicheng Gao, Jiaming Liu, Huaxia Li, Xu Tang, Jianzhuang Liu, Baochang Zhang
arXiv 2023. [Paper]
9 Oct 2023

Diffusion Models as Masked Audio-Video Learners
Elvis Nunez, Yanzi Jin, Mohammad Rastegari, Sachin Mehta, Maxwell Horton
arXiv 2023. [Paper]
5 Oct 2023

Aligning Text-to-Image Diffusion Models with Reward Backpropagation
Mihir Prabhudesai, Anirudh Goyal, Deepak Pathak, Katerina Fragkiadaki
arXiv 2023. [Paper]
5 Oct 2023

Ctrl-Room: Controllable Text-to-3D Room Meshes Generation with Layout Constraints
Chuan Fang, Xiaotao Hu, Kunming Luo, Ping Tan
arXiv 2023. [Paper]
5 Oct 2023

MedSyn: Text-guided Anatomy-aware Synthesis of High-Fidelity 3D CT Images
Yanwu Xu, Li Sun, Wei Peng, Shyam Visweswaran, Kayhan Batmanghelich
arXiv 2023. [Paper]
5 Oct 2023

Kandinsky: an Improved Text-to-Image Synthesis with Image Prior and Latent Diffusion
Anton Razzhigaev, Arseniy Shakhmatov, Anastasia Maltseva, Vladimir Arkhipkin, Igor Pavlov, Ilya Ryabov, Angelina Kuts, Alexander Panchenko, Andrey Kuznetsov, Denis Dimitrov
arXiv 2023. [Paper]
5 Oct 2023

Realistic Speech-to-Face Generation with Speech-Conditioned Latent Diffusion Model with Face Prior
Jinting Wang, Li Liu, Jun Wang, Hei Victor Cheng
arXiv 2023. [Paper]
5 Oct 2023

T$^3$Bench: Benchmarking Current Progress in Text-to-3D Generation
Yuze He, Yushi Bai, Matthieu Lin, Wang Zhao, Yubin Hu, Jenny Sheng, Ran Yi, Juanzi Li, Yong-Jin Liu
arXiv 2023. [Paper] [Project] [Github]
4 Oct 2023

Boosting Dermatoscopic Lesion Segmentation via Diffusion Models with Visual and Textual Prompts
Shiyi Du, Xiaosong Wang, Yongyi Lu, Yuyin Zhou, Shaoting Zhang, Alan Yuille, Kang Li, Zongwei Zhou
arXiv 2023. [Paper]
4 Oct 2023

Magicremover: Tuning-free Text-guided Image inpainting with Diffusion Models
Siyuan Yang, Lu Zhang, Liqian Ma, Yu Liu, JingJing Fu, You He
arXiv 2023. [Paper]
4 Oct 2023

ED-NeRF: Efficient Text-Guided Editing of 3D Scene using Latent Space NeRF
Jangho Park, Gihyun Kwon, Jong Chul Ye
arXiv 2023. [Paper]
4 Oct 2023

SweetDreamer: Aligning Geometric Priors in 2D Diffusion for Consistent Text-to-3D
Weiyu Li, Rui Chen, Xuelin Chen, Ping Tan
arXiv 2023. [Paper] [Project]
4 Oct 2023

EditVal: Benchmarking Diffusion Based Text-Guided Image Editing Methods
Samyadeep Basu, Mehrdad Saberi, Shweta Bhardwaj, Atoosa Malemir Chegini, Daniela Massiceti, Maziar Sanjabi, Shell Xu Hu, Soheil Feizi
arXiv 2023. [Paper] [Project] [Github]
3 Oct 2023

FT-Shield: A Watermark Against Unauthorized Fine-tuning in Text-to-Image Diffusion Models
Yingqian Cui, Jie Ren, Yuping Lin, Han Xu, Pengfei He, Yue Xing, Wenqi Fan, Hui Liu, Jiliang Tang
arXiv 2023. [Paper]
3 Oct 2023

Amazing Combinatorial Creation: Acceptable Swap-Sampling for Text-to-Image Generation
Jun Li, Zedong Zhang, Jian Yang
arXiv 2023. [Paper] [Project]
3 Oct 2023

Transcending Domains through Text-to-Image Diffusion: A Source-Free Approach to Domain Adaptation
Shivang Chopra, Suraj Kothawade, Houda Aynaou, Aman Chadha
arXiv 2023. [Paper]
2 Oct 2023

Conditional Diffusion Distillation
Kangfu Mei, Mauricio Delbracio, Hossein Talebi, Zhengzhong Tu, Vishal M. Patel, Peyman Milanfar
arXiv 2023. [Paper]
2 Oct 2023

Direct Inversion: Boosting Diffusion-based Editing with 3 Lines of Code
Xuan Ju, Ailing Zeng, Yuxuan Bian, Shaoteng Liu, Qiang Xu
arXiv 2023. [Paper]
2 Oct 2023

Prompt-tuning latent diffusion models for inverse problems
Hyungjin Chung, Jong Chul Ye, Peyman Milanfar, Mauricio Delbracio
arXiv 2023. [Paper]
2 Oct 2023

DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models
Yongchan Kwon, Eric Wu, Kevin Wu, James Zou
arXiv 2023. [Paper]
2 Oct 2023

Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models
Hyeonho Jeong, Jong Chul Ye
arXiv 2023. [Paper] [Github]
2 Oct 2023

Music- and Lyrics-driven Dance Synthesis
Wenjie Yin, Qingyuan Yao, Yi Yu, Hang Yin, Danica Kragic, Mårten Björkman
arXiv 2023. [Paper]
30 Sep 2023

DiffPoseTalk: Speech-Driven Stylistic 3D Facial Animation and Head Pose Generation via Diffusion Models
Zhiyao Sun, Tian Lv, Sheng Ye, Matthieu Gaetan Lin, Jenny Sheng, Yu-Hui Wen, Minjing Yu, Yong-jin Liu
arXiv 2023. [Paper] [Project]
30 Sep 2023

PixArt-$\alpha$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Junsong Chen, Jincheng Yu, Chongjian Ge, Lewei Yao, Enze Xie, Yue Wu, Zhongdao Wang, James Kwok, Ping Luo, Huchuan Lu, Zhenguo Li
arXiv 2023. [Paper] [Project] [Github]
30 Sep 2023

InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists
Yulu Gan, Sungwoo Park, Alexander Schubert, Anthony Philippakis, Ahmed M. Alaa
arXiv 2023. [Paper]
30 Sep 2023

Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis
Nithin Gopalakrishnan Nair, Anoop Cherian, Suhas Lohit, Ye Wang, Toshiaki Koike-Akino, Vishal M. Patel, Tim K. Marks
ICCV 2023. [Paper]
30 Sep 2023

Directly Fine-Tuning Diffusion Models on Differentiable Rewards
Kevin Clark, Paul Vicol, Kevin Swersky, David J Fleet
arXiv 2023. [Paper]
29 Sep 2023

Text-image Alignment for Diffusion-based Perception
Neehar Kondapaneni, Markus Marks, Manuel Knott, Rogério Guimarães, Pietro Perona
arXiv 2023. [Paper]
29 Sep 2023

LLM-grounded Video Diffusion Models
Long Lian, Baifeng Shi, Adam Yala, Trevor Darrell, Boyi Li
arXiv 2023. [Paper] [Project] [Github]
29 Sep 2023

KV Inversion: KV Embeddings Learning for Text-Conditioned Real Image Action Editing
Jiancheng Huang, Yifan Liu, Jin Qin, Shifeng Chen
arXiv 2023. [Paper]
28 Sep 2023

CCEdit: Creative and Controllable Video Editing via Diffusion Models
Ruoyu Feng, Wenming Weng, Yanhui Wang, Yuhui Yuan, Jianmin Bao, Chong Luo, Zhibo Chen, Baining Guo
arXiv 2023. [Paper]
28 Sep 2023

Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation
David Junhao Zhang, Jay Zhangjie Wu, Jia-Wei Liu, Rui Zhao, Lingmin Ran, Yuchao Gu, Difei Gao, Mike Zheng Shou
arXiv 2023. [Paper]
27 Sep 2023

Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing
Kai Wang, Fei Yang, Shiqi Yang, Muhammad Atif Butt, Joost van de Weijer
arXiv 2023. [Paper]
27 Sep 2023

DreamCom: Finetuning Text-guided Inpainting Model for Image Composition
Lingxiao Lu, Bo Zhang, Li Niu
arXiv 2023. [Paper]
27 Sep 2023

Learning Using Generated Privileged Information by Text-to-Image Diffusion Models
Rafael-Edy Menadil, Mariana-Iuliana Georgescu, Radu Tudor Ionescu
arXiv 2023. [Paper]
26 Sep 2023

LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models
Yaohui Wang, Xinyuan Chen, Xin Ma, Shangchen Zhou, Ziqi Huang, Yi Wang, Ceyuan Yang, Yinan He, Jiashuo Yu, Peiqing Yang, Yuwei Guo, Tianxing Wu, Chenyang Si, Yuming Jiang, Cunjian Chen, Chen Change Loy, Bo Dai, Dahua Lin, Yu Qiao, Ziwei Liu
arXiv 2023. [Paper] [Project]
26 Sep 2023

Learning Using Generated Privileged Information by Text-to-Image Diffusion Models
Rafael-Edy Menadil, Mariana-Iuliana Georgescu, Radu Tudor Ionescu
arXiv 2023. [Paper]
26 Sep 2023

FEC: Three Finetuning-free Methods to Enhance Consistency for Real Image Editing
Songyan Chen, Jiancheng Huang
arXiv 2023. [Paper]
26 Sep 2023

Navigating Text-To-Image Customization:From LyCORIS Fine-Tuning to Model Evaluation
Shin-Ying Yeh, Yu-Guan Hsieh, Zhidong Gao, Bernard B W Yang, Giyeong Oh, Yanmin Gong
arXiv 2023. [Paper]
26 Sep 2023

Text-image guided Diffusion Model for generating Deepfake celebrity interactions
Yunzhuo Chen, Nur Al Hasan Haldar, Naveed Akhtar, Ajmal Mian
arXiv 2023. [Paper]
26 Sep 2023

Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator
Hanzhuo Huang, Yufan Feng, Cheng Shi, Lan Xu, Jingyi Yu, Sibei Yang
arXiv 2023. [Paper]
25 Sep 2023

COCO-Counterfactuals: Automatically Constructed Counterfactual Examples for Image-Text Pairs
Tiep Le, Vasudev Lal, Phillip Howard
arXiv 2023. [Paper]
23 Sep 2023

Zero-Shot Object Counting with Language-Vision Models
Jingyi Xu, Hieu Le, Dimitris Samaras
CVPR 2023. [Paper] [Github]
22 Sep 2023

MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation
Jiahao Xie, Wei Li, Xiangtai Li, Ziwei Liu, Yew Soon Ong, Chen Change Loy
arXiv 2023. [Paper] [Github]
22 Sep 2023

DurIAN-E: Duration Informed Attention Network For Expressive Text-to-Speech Synthesis
Yu Gu, Yianrao Bian, Guangzhi Lei, Chao Weng, Dan Su
arXiv 2023. [Paper]
22 Sep 2023

FreeU: Free Lunch in Diffusion U-Net
Chenyang Si, Ziqi Huang, Yuming Jiang, Ziwei Liu
arXiv 2023. [Paper]
20 Sep 2023

Investigating Personalization Methods in Text to Music Generation
Manos Plitsis, Theodoros Kouzelis, Georgios Paraskevopoulos, Vassilis Katsouros, Yannis Panagakis
arXiv 2023. [Paper] [Project]
20 Sep 2023

Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation
Yatong Bai, Trung Dang, Dung Tran, Kazuhito Koishida, Somayeh Sojoudi
arXiv 2023. [Paper]
19 Sep 2023

Forgedit: Text Guided Image Editing via Learning and Forgetting
Shiwen Zhang, Shuai Xiao, Weilin Huang
arXiv 2023. [Paper] [Github]
19 Sep 2023

What is a Fair Diffusion Model? Designing Generative Text-To-Image Models to Incorporate Various Worldviews
Zoe De Simone, Angie Boggust, Arvind Satyanarayan, Ashia Wilson
arXiv 2023. [Paper]
18 Sep 2023

Causal-Story: Local Causal Attention Utilizing Parameter-Efficient Tuning For Visual Story Synthesis
Tianyi Song, Jiuxin Cao, Kun Wang, Bo Liu, Xiaofeng Zhang
arXiv 2023. [Paper]
18 Sep 2023

Progressive Text-to-Image Diffusion with Soft Latent Direction
YuTeng Ye, Jiale Cai, Hang Zhou, Guanwen Li, Youjia Zhang, Zikai Song, Chenxing Gao, Junqing Yu, Wei Yang
arXiv 2023. [Paper]
18 Sep 2023

LivelySpeaker: Towards Semantic-Aware Co-Speech Gesture Generation
Yihao Zhi, Xiaodong Cun, Xuelin Chen, Xi Shen, Wen Guo, Shaoli Huang, Shenghua Gao
arXiv 2023. [Paper]
17 Sep 2023

PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-to-Speech Using Natural Language Descriptions
Reo Shimizu, Ryuichi Yamamoto, Masaya Kawamura, Yuma Shirahata, Hironori Doi, Tatsuya Komatsu, Kentaro Tachibana
arXiv 2023. [Paper]
15 Sep 2023

AV2Wav: Diffusion-Based Re-synthesis from Continuous Self-supervised Features for Audio-Visual Speech Enhancement
Ju-Chieh Chou, Chung-Ming Chien, Karen Livescu
arXiv 2023. [Paper]
14 Sep 2023

Viewpoint Textual Inversion: Unleashing Novel View Synthesis with Pretrained 2D Diffusion Models
James Burgess, Kuan-Chieh Wang, Serena Yeung
arXiv 2023. [Paper] [Github]
14 Sep 2023

Text-to-Image Models for Counterfactual Explanations: a Black-Box Approach
Guillaume Jeanneret, Loïc Simon, Frédéric Jurie
arXiv 2023. [Paper]
14 Sep 2023

Large-Vocabulary 3D Diffusion Model with Transformer
Ziang Cao, Fangzhou Hong, Tong Wu, Liang Pan, Ziwei Liu
arXiv 2023. [Paper] [Project] [Github]
14 Sep 2023

DiffTalker: Co-driven audio-image diffusion for talking faces via intermediate landmarks
Zipeng Qi, Xulong Zhang, Ning Cheng, Jing Xiao, Jianzong Wang
arXiv 2023. [Paper]
14 Sep 2023

Diffusion models for audio semantic communication
Eleonora Grassucci, Christian Marinoni, Andrea Rodriguez, Danilo Comminiello
arXiv 2023. [Paper]
13 Sep 2023

DreamStyler: Paint by Style Inversion with Text-to-Image Diffusion Models
Namhyuk Ahn, Junsoo Lee, Chunggi Lee, Kunhee Kim, Daesik Kim, Seung-Hun Nam, Kibeom Hong
arXiv 2023. [Paper]
13 Sep 2023

DCTTS: Discrete Diffusion Model with Contrastive Learning for Text-to-speech Generation
Zhichao Wu, Qiulin Li, Sixing Liu, Qun Yang
arXiv 2023. [Paper]
13 Sep 2023

InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation
Xingchao Liu, Xiwen Zhang, Jianzhu Ma, Jian Peng, Qiang Liu
arXiv 2023. [Paper] [Github]
12 Sep 2023

Fg-T2M: Fine-Grained Text-Driven Human Motion Generation via Diffusion Model
Yin Wang, Zhiying Leng, Frederick W. B. Li, Shun-Cheng Wu, Xiaohui Liang
ICCV 2023. [Paper]
12 Sep 2023

Prompting4Debugging: Red-Teaming Text-to-Image Diffusion Models by Finding Problematic Prompts
Zhi-Yi Chin, Chieh-Ming Jiang, Ching-Chun Huang, Pin-Yu Chen, Wei-Chen Chiu
arXiv 2023. [Paper]
12 Sep 2023

PhotoVerse: Tuning-Free Image Customization with Text-to-Image Diffusion Models
Li Chen, Mengyi Zhao, Yiheng Liu, Mingxu Ding, Yangyang Song, Shizun Wang, Xu Wang, Hao Yang, Jing Liu, Kang Du, Min Zheng
arXiv 2023. [Paper] [Project]
11 Sep 2023

PAI-Diffusion: Constructing and Serving a Family of Open Chinese Diffusion Models for Text-to-image Synthesis on the Cloud
Chengyu Wang, Zhongjie Duan, Bingyan Liu, Xinyi Zou, Cen Chen, Kui Jia, Jun Huang
arXiv 2023. [Paper]
11 Sep 2023

Diffusion-Based Co-Speech Gesture Generation Using Joint Text and Audio Representation
Anna Deichler, Shivam Mehta, Simon Alexanderson, Jonas Beskow
ICMI 2023. [Paper]
11 Sep 2023

Effective Real Image Editing with Accelerated Iterative Diffusion Inversion
Zhihong Pan, Riccardo Gherardi, Xiufeng Xie, Stephen Huang
ICCV 2023. [Paper]
10 Sep 2023

Prefix-diffusion: A Lightweight Diffusion Model for Diverse Image Captioning
Guisheng Liu, Yi Li, Zhengcong Fei, Haiyan Fu, Xiangyang Luo, Yanqing Guo
arXiv 2023. [Paper]
10 Sep 2023

Text-driven Editing of 3D Scenes without Retraining
Shuangkang Fang, Yufeng Wang, Yi Yang, Yi-Hsuan Tsai, Wenrui Ding, Shuchang Zhou, Ming-Hsuan Yang
arXiv 2023. [Paper]
10 Sep 2023

The Power of Sound (TPoS): Audio Reactive Video Generation with Stable Diffusion
Yujin Jeong, Wonjeong Ryoo, Seunghyun Lee, Dabin Seo, Wonmin Byeon, Sangpil Kim, Jinkyu Kim
arXiv 2023. [Paper]
8 Sep 2023

Create Your World: Lifelong Text-to-Image Diffusion
Gan Sun, Wenqi Liang, Jiahua Dong, Jun Li, Zhengming Ding, Yang Cong
arXiv 2023. [Paper]
8 Sep 2023

MaskDiffusion: Boosting Text-to-Image Consistency with Conditional Mask
Yupeng Zhou, Daquan Zhou, Zuo-Liang Zhu, Yaxing Wang, Qibin Hou, Jiashi Feng
arXiv 2023. [Paper]
8 Sep 2023

MoEController: Instruction-based Arbitrary Image Manipulation with Mixture-of-Expert Controllers
Sijia Li, Chen Chen, Haonan Lu
arXiv 2023. [Paper] [Project]
8 Sep 2023

From Text to Mask: Localizing Entities Using the Attention of Text-to-Image Diffusion Models
Changming Xiao, Qi Yang, Feng Zhou, Changshui Zhang
arXiv 2023. [Paper]
8 Sep 2023

InstructDiffusion: A Generalist Modeling Interface for Vision Tasks
Zigang Geng, Binxin Yang, Tiankai Hang, Chen Li, Shuyang Gu, Ting Zhang, Jianmin Bao, Zheng Zhang, Han Hu, Dong Chen, Baining Guo
arXiv 2023. [Paper] [Project] [Github]
7 Sep 2023

Text-to-feature diffusion for audio-visual few-shot learning
Otniel-Bogdan Mercea, Thomas Hummel, A. Sophia Koepke, Zeynep Akata
arXiv 2023. [Paper]
7 Sep 2023

Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffusion Model
Sungwon Hwang, Junha Hyung, Jaegul Choo
arXiv 2023. [Paper] [Project]
7 Sep 2023

Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation
Jiaxi Gu, Shicong Wang, Haoyu Zhao, Tianyi Lu, Xing Zhang, Zuxuan Wu, Songcen Xu, Wei Zhang, Yu-Gang Jiang, Hang Xu
arXiv 2023. [Paper]
7 Sep 2023

SyncDreamer: Generating Multiview-consistent Images from a Single-view Image
Yuan Liu, Cheng Lin, Zijiao Zeng, Xiaoxiao Long, Lingjie Liu, Taku Komura, Wenping Wang
arXiv 2023. [Paper] [Project] [Github]
7 Sep 2023

MCM: Multi-condition Motion Synthesis Framework for Multi-scenario
Zeyu Ling, Bo Han, Yongkang Wong, Mohan Kangkanhalli, Weidong Geng
arXiv 2023. [Paper]
6 Sep 2023

Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter
Jinglong Wang, Xiawei Li, Jing Zhang, Qingyuan Xu, Qin Zhou, Qian Yu, Lu Sheng, Dong Xu
arXiv 2023. [Paper]
6 Sep 2023

Generating Realistic Images from In-the-wild Sounds
Taegyeong Lee, Jeonghun Kang, Hyeonyu Kim, Taehwan Kim
ICCV 2023. [Paper]
5 Sep 2023

Generative-based Fusion Mechanism for Multi-Modal Tracking
Zhangyong Tang, Tianyang Xu, Xuefeng Zhu, Xiao-Jun Wu, Josef Kittler
arXiv 2023. [Paper]
4 Sep 2023

VGDiffZero: Text-to-image Diffusion Models Can Be Zero-shot Visual Grounders
Xuyang Liu, Siteng Huang, Yachen Kang, Honggang Chen, Donglin Wang
arXiv 2023. [Paper]
3 Sep 2023

Bridge Diffusion Model: bridge non-English language-native text-to-image diffusion model with English communities
Shanyuan Liu, Dawei Leng, Yuhui Yin
arXiv 2023. [Paper]
2 Sep 2023

MagicProp: Diffusion-based Video Editing via Motion-aware Appearance Propagation
Hanshu Yan, Jun Hao Liew, Long Mai, Shanchuan Lin, Jiashi Feng
arXiv 2023. [Paper]
2 Sep 2023

Iterative Multi-granular Image Editing using Diffusion Models
K J Joseph, Prateksha Udhayanan, Tripti Shukla, Aishwarya Agarwal, Srikrishna Karanam, Koustava Goswami, Balaji Vasan Srinivasan
arXiv 2023. [Paper]
1 Sep 2023

DiffuGen: Adaptable Approach for Generating Labeled Image Datasets using Stable Diffusion Models
Michael Shenoda, Edward Kim
arXiv 2023. [Paper]
1 Sep 2023

PathLDM: Text conditioned Latent Diffusion Model for Histopathology
Srikar Yellapragada, Alexandros Graikos, Prateek Prasanna, Tahsin Kurc, Joel Saltz, Dimitris Samaras
arXiv 2023. [Paper]
1 Sep 2023

VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation
Xin Li, Wenqing Chu, Ye Wu, Weihang Yuan, Fanglong Liu, Qi Zhang, Fu Li, Haocheng Feng, Errui Ding, Jingdong Wang
arXiv 2023. [Paper]
1 Sep 2023

Detecting Out-of-Context Image-Caption Pairs in News: A Counter-Intuitive Method
Eivind Moholdt, Sohail Ahmed Khan, Duc-Tien Dang-Nguyen
CBMI 2023. [Paper]
31 Aug 2023

Any-Size-Diffusion: Toward Efficient Text-Driven Synthesis for Any-Size HD Images
Qingping Zheng, Yuanfan Guo, Jiankang Deng, Jianhua Han, Ying Li, Songcen Xu, Hang Xu
arXiv 2023. [Paper]
31 Aug 2023

MVDream: Multi-view Diffusion for 3D Generation
Yichun Shi, Peng Wang, Jianglong Ye, Mai Long, Kejie Li, Xiao Yang
arXiv 2023. [Paper]
31 Aug 2023

Intriguing Properties of Diffusion Models: A Large-Scale Dataset for Evaluating Natural Attack Capability in Text-to-Image Generative Models
Takami Sato, Justin Yue, Nanze Chen, Ningfei Wang, Qi Alfred Chen
arXiv 2023. [Paper]
30 Aug 2023

DiffusionVMR: Diffusion Model for Video Moment Retrieval
Henghao Zhao, Kevin Qinghong Lin, Rui Yan, Zechao Li
ACM MM 2023. [Paper]
29 Aug 2023

C2G2: Controllable Co-speech Gesture Generation with Latent Diffusion Model
Longbin Ji, Pengfei Wei, Yi Ren, Jinglin Liu, Chen Zhang, Xiang Yin
arXiv 2023. [Paper]
29 Aug 2023

360-Degree Panorama Generation from Few Unregistered NFoV Images
Jionghao Wang, Ziyu Chen, Jun Ling, Rong Xie, Li Song
ACM MM 2023. [Paper] [Github]
28 Aug 2023

Priority-Centric Human Motion Generation in Discrete Latent Space
Hanyang Kong, Kehong Gong, Dongze Lian, Michael Bi Mi, Xinchao Wang
arXiv 2023. [Paper]
28 Aug 2023

SketchDreamer: Interactive Text-Augmented Creative Sketch Ideation
Zhiyu Qu, Tao Xiang, Yi-Zhe Song
BMVC 2023. [Paper] [Github]
27 Aug 2023

Empowering Dynamics-aware Text-to-Video Diffusion with Large Language Models
Hao Fei, Shengqiong Wu, Wei Ji, Hanwang Zhang, Tat-Seng Chua
arXiv 2023. [Paper] [Project]
26 Aug 2023

ORES: Open-vocabulary Responsible Visual Synthesis
Minheng Ni, Chenfei Wu, Xiaodong Wang, Shengming Yin, Lijuan Wang, Zicheng Liu, Nan Duan
arXiv 2023. [Paper]
26 Aug 2023

The DiffuseStyleGesture+ entry to the GENEA Challenge 2023
Sicheng Yang, Haiwei Xue, Zhensong Zhang, Minglei Li, Zhiyong Wu, Xiaofei Wu, Songcen Xu, Zonghong Dai
ICMI 2023. [Paper] [Github]
26 Aug 2023

EfficientDreamer: High-Fidelity and Robust 3D Creation via Orthogonal-view Diffusion Prior
Minda Zhao, Chaoyi Zhao, Xinyue Liang, Lincheng Li, Zeng Zhao, Zhipeng Hu, Changjie Fan, Xin Yu
arXiv 2023. [Paper]
25 Aug 2023

Unified Concept Editing in Diffusion Models
Rohit Gandikota, Hadas Orgad, Yonatan Belinkov, Joanna Materzyńska, David Bau
arXiv 2023. [Paper] [Project] [Github]
25 Aug 2023

Dense Text-to-Image Generation with Attention Modulation
Yunji Kim, Jiyoung Lee, Jin-Hwa Kim, Jung-Woo Ha, Jun-Yan Zhu
ICCV 2023. [Paper] [Github]
24 Aug 2023

APLA: Additional Perturbation for Latent Noise with Adversarial Training Enables Consistency
Yupu Yao, Shangqi Deng, Zihan Cao, Harry Zhang, Liang-Jian Deng
arXiv 2023. [Paper]
24 Aug 2023

Manipulating Embeddings of Stable Diffusion Prompts
Niklas Deckers, Julia Peters, Martin Potthast
arXiv 2023. [Paper]
23 Aug 2023

DF-3DFace: One-to-Many Speech Synchronized 3D Face Animation with Diffusion
Se Jin Park, Joanna Hong, Minsu Kim, Yong Man Ro
arXiv 2023. [Paper]
23 Aug 2023

IT3D: Improved Text-to-3D Generation with Explicit View Synthesis
Yiwen Chen, Chi Zhang, Xiaofeng Yang, Zhongang Cai, Gang Yu, Lei Yang, Guosheng Lin
arXiv 2023. [Paper] [Github]
22 Aug 2023

DiffCloth: Diffusion Based Garment Synthesis and Manipulation via Structural Cross-modal Semantic Alignment
Xujie Zhang, Binbin Yang, Michael C. Kampffmeyer, Wenqing Zhang, Shiyue Zhang, Guansong Lu, Liang Lin, Hang Xu, Xiaodan Liang
arXiv 2023. [Paper]
22 Aug 2023

MusicJam: Visualizing Music Insights via Generated Narrative Illustrations
Chuer Chen, Nan Cao, Jiani Hou, Yi Guo, Yulei Zhang, Yang Shi
arXiv 2023. [Paper]
22 Aug 2023

TADA! Text to Animatable Digital Avatars
Tingting Liao, Hongwei Yi, Yuliang Xiu, Jiaxaing Tang, Yangyi Huang, Justus Thies, Michael J. Black
arXiv 2023. [Paper]
21 Aug 2023

EVE: Efficient zero-shot text-based Video Editing with Depth Map Guidance and Temporal Consistency Constraints
Yutao Chen, Xingning Dong, Tian Gan, Chunluan Zhou, Ming Yang, Qingpei Guo
arXiv 2023. [Paper]
21 Aug 2023

Backdooring Textual Inversion for Concept Censorship
Yutong Wu, Jie Zhang, Florian Kerschbaum, Tianwei Zhang
arXiv 2023. [Paper] [Project] [Github]
21 Aug 2023

AltDiffusion: A Multilingual Text-to-Image Diffusion Model
Fulong Ye, Guang Liu, Xinya Wu, Ledell Wu
AAAI 2024. [Paper] [Github]
19 Aug 2023

DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability
Runhui Huang, Jianhua Han, Guansong Lu, Xiaodan Liang, Yihan Zeng, Wei Zhang, Hang Xu
ICCV 2023. [Paper]
18 Aug 2023

MATLABER: Material-Aware Text-to-3D via LAtent BRDF auto-EncodeR
Xudong Xu, Zhaoyang Lyu, Xingang Pan, Bo Dai
arXiv 2023. [Paper] [Project]
18 Aug 2023

Diff2Lip: Audio Conditioned Diffusion Models for Lip-Synchronization
Soumik Mukhopadhyay, Saksham Suri, Ravi Teja Gadde, Abhinav Shrivastava
arXiv 2023. [Paper] [Project] [Github]
18 Aug 2023

Guide3D: Create 3D Avatars from Text and Image Guidance
Yukang Cao, Yan-Pei Cao, Kai Han, Ying Shan, Kwan-Yee K. Wong
arXiv 2023. [Paper]
18 Aug 2023

Language-Guided Diffusion Model for Visual Grounding
Sijia Chen, Baochun Li
arXiv 2023. [Paper]
18 Aug 2023

SimDA: Simple Diffusion Adapter for Efficient Video Generation
Zhen Xing, Qi Dai, Han Hu, Zuxuan Wu, Yu-Gang Jiang
arXiv 2023. [Paper] [Project]
18 Aug 2023

StableVideo: Text-driven Consistency-aware Diffusion Video Editing
Wenhao Chai, Xun Guo, Gaoang Wang, Yan Lu
ICCV 2023. [Paper] [Github]
18 Aug 2023

Edit Temporal-Consistent Videos with Image Diffusion Model
Yuanzhi Wang, Yong Li, Xin Liu, Anbo Dai, Antoni Chan, Zhen Cui
arXiv 2023. [Paper]
17 Aug 2023

Watch Your Steps: Local Image and Scene Editing by Text Instructions
Ashkan Mirzaei, Tristan Aumentado-Armstrong, Marcus A. Brubaker, Jonathan Kelly, Alex Levinshtein, Konstantinos G. Derpanis, Igor Gilitschenski
arXiv 2023. [Paper] [Project]
17 Aug 2023

Learning to Generate Semantic Layouts for Higher Text-Image Correspondence in Text-to-Image Synthesis
Minho Park, Jooyeol Yun, Seunghwan Choi, Jaegul Choo
ICCV 2023. [Paper] [Project] [Github]
16 Aug 2023

DragNUWA: Fine-grained Control in Video Generation by Integrating Text, Image, and Trajectory
Shengming Yin, Chenfei Wu, Jian Liang, Jie Shi, Houqiang Li, Gong Ming, Nan Duan
arXiv 2023. [Paper] [Project]
16 Aug 2023

Dual-Stream Diffusion Net for Text-to-Video Generation
Binhui Liu, Xin Liu, Anbo Dai, Zhiyong Zeng, Zhen Cui, Jian Yang
arXiv 2023. [Paper]
16 Aug 2023

DiffV2S: Diffusion-based Video-to-Speech Synthesis with Vision-guided Speaker Embedding
Jeongsoo Choi, Joanna Hong, Yong Man Ro
arXiv 2023. [Paper]
15 Aug 2023

SGDiff: A Style Guided Diffusion Model for Fashion Synthesis
Zhengwentai Sun, Yanghong Zhou, Honghong He, P. Y. Mok
ACM MM 2023. [Paper]
15 Aug 2023

Dancing Avatar: Pose and Text-Guided Human Motion Videos Synthesis with Image Diffusion Model
Bosheng Qin, Wentao Ye, Qifan Yu, Siliang Tang, Yueting Zhuang
arXiv 2023. [Paper]
15 Aug 2023

Diffusion Based Augmentation for Captioning and Retrieval in Cultural Heritage
Dario Cioni, Lorenzo Berlincioni, Federico Becattini, Alberto del Bimbo
ICCV Workshop 2023. [Paper]
14 Aug 2023

Jurassic World Remake: Bringing Ancient Fossils Back to Life via Zero-Shot Long Image-to-Image Translation
Alexander Martin, Haitian Zheng, Jie An, Jiebo Luo
ACM MM 2023. [Paper]
14 Aug 2023

UniBrain: Unify Image Reconstruction and Captioning All in One Diffusion Model from Human Brain Activity
Weijian Mai, Zhijun Zhang
arXiv 2023. [Paper]
14 Aug 2023

Free-ATM: Exploring Unsupervised Learning on Diffusion-Generated Images with Free Attention Masks
David Junhao Zhang, Mutian Xu, Chuhui Xue, Wenqing Zhang, Xiaoguang Han, Song Bai, Mike Zheng Shou
arXiv 2023. [Paper]
13 Aug 2023

IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models
Hu Ye, Jun Zhang, Sibo Liu, Xiao Han, Wei Yang
arXiv 2023. [Paper] [Project] [Github]
13 Aug 2023

LAW-Diffusion: Complex Scene Generation by Diffusion with Layouts
Binbin Yang, Yi Luo, Ziliang Chen, Guangrun Wang, Xiaodan Liang, Liang Lin
arXiv 2023. [Paper]
13 Aug 2023

ModelScope Text-to-Video Technical Report
Jiuniu Wang, Hangjie Yuan, Dayou Chen, Yingya Zhang, Xiang Wang, Shiwei Zhang
arXiv 2023. [Paper]
12 Aug 2023

DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models
Weijia Wu, Yuzhong Zhao, Hao Chen, Yuchao Gu, Rui Zhao, Yefei He, Hong Zhou, Mike Zheng Shou, Chunhua Shen
arXiv 2023. [Paper] [Project] [Github]
11 Aug 2023

Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning
Chun-Mei Feng, Kai Yu, Yong Liu, Salman Khan, Wangmeng Zuo
ICCV 2023. [Paper] [Github]
11 Aug 2023

Masked-Attention Diffusion Guidance for Spatially Controlling Text-to-Image Generation
Yuki Endo
arXiv 2023. [Paper]
11 Aug 2023

Audio is all in one: speech-driven gesture synthetics using WavLM pre-trained model
Fan Zhang, Naye Ji, Fuxing Gao, Siyuan Zhao, Zhaohan Wang, Shunman Li
arXiv 2023. [Paper]
11 Aug 2023

Zero-shot Text-driven Physically Interpretable Face Editing
Yapeng Meng, Songru Yang, Xu Hu, Rui Zhao, Lincheng Li, Zhenwei Shi, Zhengxia Zou
arXiv 2023. [Paper]
11 Aug 2023

PromptPaint: Steering Text-to-Image Generation Through Paint Medium-like Interactions
John Joon Young Chung, Eytan Adar
UIST 2023. [Paper]
9 Aug 2023

LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation
Leigang Qu, Shengqiong Wu, Hao Fei, Liqiang Nie, Tat-Seng Chua
arXiv 2023. [Paper] [Project]
9 Aug 2023

Cloth2Tex: A Customized Cloth Texture Generation Pipeline for 3D Virtual Try-On
Daiheng Gao, Xu Chen, Xindi Zhang, Qi Wang, Ke Sun, Bang Zhang, Liefeng Bo, Qixing Huang
arXiv 2023. [Paper]
8 Aug 2023

MindDiffuser: Controlled Image Reconstruction from Human Brain Activity with Semantic and Structural Diffusion
Yizhuo Lu, Changde Du, Qiongyi zhou, Dianpeng Wang, Huiguang He
arXiv 2023. [Paper]
8 Aug 2023

FLIRT: Feedback Loop In-context Red Teaming
Ninareh Mehrabi, Palash Goyal, Christophe Dupuy, Qian Hu, Shalini Ghosh, Richard Zemel, Kai-Wei Chang, Aram Galstyan, Rahul Gupta
arXiv 2023. [Paper]
8 Aug 2023

DiffSynth: Latent In-Iteration Deflickering for Realistic Video Synthesis
Zhongjie Duan, Lizhou You, Chengyu Wang, Cen Chen, Ziheng Wu, Weining Qian, Jun Huang
arXiv 2023. [Paper] [Project] [Github]
7 Aug 2023

AvatarVerse: High-quality & Stable 3D Avatar Creation from Text and Pose
Huichao Zhang, Bowen Chen, Hao Yang, Liao Qu, Xu Wang, Li Chen, Chao Long, Feida Zhu, Kang Du, Min Zheng
arXiv 2023. [Paper] [Project]
7 Aug 2023

Towards Scene-Text to Scene-Text Translation
Onkar Susladkar, Prajwal Gatti, Anand Mishra
arXiv 2023. [Paper]
6 Aug 2023

Sketch and Text Guided Diffusion Model for Colored Point Cloud Generation
Zijie Wu, Yaonan Wang, Mingtao Feng, He Xie, Ajmal Mian
arXiv 2023. [Paper]
5 Aug 2023

ConceptLab: Creative Generation using Diffusion Prior Constraints
Elad Richardson, Kfir Goldberg, Yuval Alaluf, Daniel Cohen-Or
arXiv 2023. [Paper] [Project] [Github]
3 Aug 2023

DiffColor: Toward High Fidelity Text-Guided Image Colorization with Diffusion Models
Jianxin Lin, Peng Xiao, Yijun Wang, Rongju Zhang, Xiangxiang Zeng
arXiv 2023. [Paper]
3 Aug 2023

Synthesizing Long-Term Human Motions with Diffusion Models via Coherent Sampling
Zhao Yang, Bing Su, Ji-Rong Wen
ACM MM 2023. [Paper] [Github]
3 Aug 2023

Reverse Stable Diffusion: What prompt was used to generate this image?
Florinel-Alin Croitoru, Vlad Hondru, Radu Tudor Ionescu, Mubarak Shah
arXiv 2023. [Paper]
2 Aug 2023

Degeneration-Tuning: Using Scrambled Grid shield Unwanted Concepts from Stable Diffusion
Zixuan Ni, Longhui Wei, Jiacheng Li, Siliang Tang, Yueting Zhuang, Qi Tian
arXiv 2023. [Paper]
2 Aug 2023

ImageBrush: Learning Visual In-Context Instructions for Exemplar-Based Image Manipulation
Yasheng Sun, Yifan Yang, Houwen Peng, Yifei Shen, Yuqing Yang, Han Hu, Lili Qiu, Hideki Koike
arXiv 2023. [Paper]
2 Aug 2023

The Bias Amplification Paradox in Text-to-Image Generation
Preethi Seshadri, Sameer Singh, Yanai Elazar
arXiv 2023. [Paper]
1 Aug 2023

BAGM: A Backdoor Attack for Manipulating Text-to-Image Generative Models
Jordan Vice, Naveed Akhtar, Richard Hartley, Ajmal Mian
arXiv 2023. [Paper] [Github] [Dataset]
31 Jul 2023

MobileVidFactory: Automatic Diffusion-Based Social Media Video Generation for Mobile Devices from Text
Junchen Zhu, Huan Yang, Wenjing Wang, Huiguo He, Zixi Tuo, Yongsheng Yu, Wen-Huang Cheng, Lianli Gao, Jingkuan Song, Jianlong Fu, Jiebo Luo
arXiv 2023. [Paper]
31 Jul 2023

DAVIS: High-Quality Audio-Visual Separation with Generative Diffusion Models
Chao Huang, Susan Liang, Yapeng Tian, Anurag Kumar, Chenliang Xu
arXiv 2023. [Paper]
31 Jul 2023

Contrastive Conditional Latent Diffusion for Audio-visual Segmentation
Yuxin Mao, Jing Zhang, Mochu Xiang, Yunqiu Lv, Yiran Zhong, Yuchao Dai
arXiv 2023. [Paper]
31 Jul 2023

HD-Fusion: Detailed Text-to-3D Generation Leveraging Multiple Noise Estimation
Jinbo Wu, Xiaobo Gao, Xing Liu, Zhengyang Shen, Chen Zhao, Haocheng Feng, Jingtuo Liu, Errui Ding
arXiv 2023. [Paper]
30 Jul 2023

Seeing through the Brain: Image Reconstruction of Visual Perception from Human Brain Signals
Yu-Ting Lan, Kan Ren, Yansen Wang, Wei-Long Zheng, Dongsheng Li, Bao-Liang Lu, Lili Qiu
arXiv 2023. [Paper]
27 Jul 2023

VideoControlNet: A Motion-Guided Video-to-Video Translation Framework by Using Diffusion Model with ControlNet
Zhihao Hu, Dong Xu
arXiv 2023. [Paper] [Project]
26 Jul 2023

Points-to-3D: Bridging the Gap between Sparse Points and Shape-Controllable Text-to-3D Generation
Chaohui Yu, Qiang Zhou, Jingliang Li, Zhe Zhang, Zhibin Wang, Fan Wang
arXiv 2023. [Paper]
26 Jul 2023

Visual Instruction Inversion: Image Editing via Visual Prompting
Thao Nguyen, Yuheng Li, Utkarsh Ojha, Yong Jae Lee
arXiv 2023. [Paper] [Project] [Github]
26 Jul 2023

Composite Diffusion | whole >= \Sigma parts
Vikram Jamwal, Ramaneswaran S
arXiv 2023. [Paper]
25 Jul 2023

Fashion Matrix: Editing Photos by Just Talking
Zheng Chong, Xujie Zhang, Fuwei Zhao, Zhenyu Xie, Xiaodan Liang
arXiv 2023. [Paper] [Project] [Github]
25 Jul 2023

Understanding the Latent Space of Diffusion Models through the Lens of Riemannian Geometry
Yong-Hyun Park, Mingi Kwon, Jaewoong Choi, Junghyo Jo, Youngjung Uh
arXiv 2023. [Paper]
24 Jul 2023

InFusion: Inject and Attention Fusion for Multi Concept Zero-Shot Text-based Video Editing
Anant Khandelwal
ICCV Workshop 2023. [Paper]
22 Jul 2023

Subject-Diffusion:Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning
Jian Ma, Junhao Liang, Chen Chen, Haonan Lu
arXiv 2023. [Paper] [Project] [Github]
21 Jul 2023

Divide & Bind Your Attention for Improved Generative Semantic Nursing
Yumeng Li, Margret Keuper, Dan Zhang, Anna Khoreva
arXiv 2023. [Paper] [Project]
20 Jul 2023

AdjointDPM: Adjoint Sensitivity Method for Gradient Backpropagation of Diffusion Probabilistic Models
Jiachun Pan, Jun Hao Liew, Vincent Y. F. Tan, Jiashi Feng, Hanshu Yan
arXiv 2023. [Paper]
20 Jul 2023

BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion
Jinheng Xie, Yuexiang Li, Yawen Huang, Haozhe Liu, Wentian Zhang, Yefeng Zheng, Mike Zheng Shou
arXiv 2023. [Paper] [Github]
20 Jul 2023

Text2Layer: Layered Image Generation using Latent Diffusion Model
Xinyang Zhang, Wentian Zhao, Xin Lu, Jeff Chien
arXiv 2023. [Paper]
19 Jul 2023

FABRIC: Personalizing Diffusion Models with Iterative Feedback
Dimitri von Rütte, Elisabetta Fedele, Jonathan Thomm, Lukas Wolf
arXiv 2023. [Paper]
19 Jul 2023

TokenFlow: Consistent Diffusion Features for Consistent Video Editing
Michal Geyer, Omer Bar-Tal, Shai Bagon, Tali Dekel
arXiv 2023. [Paper] [Project] [Github]
19 Jul 2023

Multimodal Diffusion Segmentation Model for Object Segmentation from Manipulation Instructions
Yui Iioka, Yu Yoshida, Yuiga Wada, Shumpei Hatanaka, Komei Sugiura
arXiv 2023. [Paper]
17 Jul 2023

Not All Steps are Created Equal: Selective Diffusion Distillation for Image Manipulation
Luozhou Wang, Shuai Yang, Shu Liu, Ying-cong Chen
ICCV 2023. [Paper] [Github]
17 Jul 2023

Multimodal Motion Conditioned Diffusion Model for Skeleton-based Video Anomaly Detection
Alessandro Flaborea, Luca Collorone, Guido D’Amely, Stefano D’Arrigo, Bardh Prenkaj, Fabio Galasso
arXiv 2023. [Paper]
14 Jul 2023

HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models
Nataniel Ruiz, Yuanzhen Li, Varun Jampani, Wei Wei, Tingbo Hou, Yael Pritch, Neal Wadhwa, Michael Rubinstein, Kfir Aberman
arXiv 2023. [Paper] [Project] [Github]
13 Jul 2023

Exact Diffusion Inversion via Bi-directional Integration Approximation
Guoqiang Zhang, J. P. Lewis, W. Bastiaan Kleijn
arXiv 2023. [Paper]
10 Jul 2023

AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Yuwei Guo, Ceyuan Yang, Anyi Rao, Yaohui Wang, Yu Qiao, Dahua Lin, Bo Dai
arXiv 2023. [Paper] [Project] [Github]
10 Jul 2023

Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Feedback
Jaskirat Singh, Liang Zheng
arXiv 2023. [Paper] [Project] [Github]
10 Jul 2023

Augmenters at SemEval-2023 Task 1: Enhancing CLIP in Handling Compositionality and Ambiguity for Zero-Shot Visual WSD through Prompt Augmentation and Text-To-Image Diffusion
Jie S. Li, Yow-Ting Shiue, Yong-Siang Shih, Jonas Geiping
arXiv 2023. [Paper]
9 Jul 2023

Measuring the Success of Diffusion Models at Imitating Human Artists
Stephen Casper, Zifan Guo, Shreya Mogulothu, Zachary Marinov, Chinmay Deshpande, Rui-Jie Yew, Zheng Dai, Dylan Hadfield-Menell
ICML Workshop 2023. [Paper]
8 Jul 2023

How to Detect Unauthorized Data Usages in Text-to-image Diffusion Models
Zhenting Wang, Chen Chen, Yuchen Liu, Lingjuan Lyu, Dimitris Metaxas, Shiqing Ma
arXiv 2023. [Paper]
6 Jul 2023

Collaborative Score Distillation for Consistent Visual Synthesis
Subin Kim, Kyungmin Lee, June Suk Choi, Jongheon Jeong, Kihyuk Sohn, Jinwoo Shin
arXiv 2023. [Paper] [Project] [Github]
4 Jul 2023

SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
Dustin Podell, Zion English, Kyle Lacey, Andreas Blattmann, Tim Dockhorn, Jonas Müller, Joe Penna, Robin Rombach
arXiv 2023. [Paper] [Github]
4 Jul 2023

MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion
Shitao Tang, Fuyang Zhang, Jiacheng Chen, Peng Wang, Yasutaka Furukawa
arXiv 2023. [Paper] [Project]
3 Jul 2023

Counting Guidance for High Fidelity Text-to-Image Synthesis
Wonjun Kang, Kevin Galim, Hyung Il Koo
arXiv 2023. [Paper]
30 Jun 2023

Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation
Zibo Zhao, Wen Liu, Xin Chen, Xianfang Zeng, Rui Wang, Pei Cheng, Bin Fu, Tao Chen, Gang Yu, Shenghua Gao
arXiv 2023. [Paper]
29 Jun 2023

Generate Anything Anywhere in Any Scene
Yuheng Li, Haotian Liu, Yangming Wen, Yong Jae Lee
arXiv 2023. [Paper] [Project]
29 Jun 2023

Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models
Simian Luo, Chuanhao Yan, Chenxu Hu, Hang Zhao
arXiv 2023. [Paper] [Github]
29 Jun 2023

PFB-Diff: Progressive Feature Blending Diffusion for Text-driven Image Editing
Wenjing Huang, Shikui Tu, Lei Xu
arXiv 2023. [Paper]
28 Jun 2023

DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models
Ximing Xing, Chuang Wang, Haitao Zhou, Jing Zhang, Qian Yu, Dong Xu
arXiv 2023. [Paper]
26 Jun 2023

A-STAR: Test-time Attention Segregation and Retention for Text-to-image Synthesis
Aishwarya Agarwal, Srikrishna Karanam, K J Joseph, Apoorv Saxena, Koustava Goswami, Balaji Vasan Srinivasan
arXiv 2023. [Paper]
26 Jun 2023

Decompose and Realign: Tackling Condition Misalignment in Text-to-Image Diffusion Models
Luozhou Wang, Guibao Shen, Yijun Li, Ying-cong Chen
arXiv 2023. [Paper]
26 Jun 2023

Zero-shot spatial layout conditioning for text-to-image diffusion models
Guillaume Couairon, Marlène Careil, Matthieu Cord, Stéphane Lathuilière, Jakob Verbeek
arXiv 2023. [Paper]
23 Jun 2023

DreamTime: An Improved Optimization Strategy for Text-to-3D Content Creation
Yukun Huang, Jianan Wang, Yukai Shi, Xianbiao Qi, Zheng-Jun Zha, Lei Zhang
arXiv 2023. [Paper]
21 Jun 2023

Align, Adapt and Inject: Sound-guided Unified Image Generation
Yue Yang, Kaipeng Zhang, Yuying Ge, Wenqi Shao, Zeyue Xue, Yu Qiao, Ping Luo
arXiv 2023. [Paper]
20 Jun 2023

EMoG: Synthesizing Emotive Co-speech 3D Gesture with Diffusion Model
Lianying Yin, Yijun Wang, Tianyu He, Jinming Liu, Wei Zhao, Bohan Li, Xin Jin, Jianxin Lin
arXiv 2023. [Paper]
20 Jun 2023

RS5M: A Large Scale Vision-Language Dataset for Remote Sensing Vision-Language Foundation Model
Zilun Zhang, Tiancheng Zhao, Yulong Guo, Jianwei Yin
arXiv 2023. [Paper]
20 Jun 2023

Instruct-NeuralTalker: Editing Audio-Driven Talking Radiance Fields with Instructions
Yuqi Sun, Reian He, Weimin Tan, Bo Yan
arXiv 2023. [Paper]
19 Jun 2023

Conditional Text Image Generation with Diffusion Models
Yuanzhi Zhu, Zhaohai Li, Tianwei Wang, Mengchao He, Cong Yao
arXiv 2023. [Paper]
19 Jun 2023

Point-Cloud Completion with Pretrained Text-to-image Diffusion Models
Yoni Kasten, Ohad Rahamim, Gal Chechik
arXiv 2023. [Paper]
18 Jun 2023

Energy-Based Cross Attention for Bayesian Context Update in Text-to-Image Diffusion Models
Geon Yeong Park, Jeongsol Kim, Beomsu Kim, Sang Wan Lee, Jong Chul Ye
arXiv 2023. [Paper]
16 Jun 2023

Evaluating the Robustness of Text-to-image Diffusion Models against Real-world Attacks
Hongcheng Gao, Hao Zhang, Yinpeng Dong, Zhijie Deng
arXiv 2023. [Paper]
16 Jun 2023

CLIPSonic: Text-to-Audio Synthesis with Unlabeled Videos and Pretrained Language-Vision Models
Hao-Wen Dong, Xiaoyu Liu, Jordi Pons, Gautam Bhattacharya, Santiago Pascual, Joan Serrà, Taylor Berg-Kirkpatrick, Julian McAuley
arXiv 2023. [Paper]
16 Jun 2023

Taming Diffusion Models for Music-driven Conducting Motion Generation
Zhuoran Zhao, Jinbin Bai, Delong Chen, Debang Wang, Yubo Pan
arXiv 2023. [Paper]
15 Jun 2023

Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis
Shivam Mehta, Siyang Wang, Simon Alexanderson, Jonas Beskow, Éva Székely, Gustav Eje Henter
arXiv 2023. [Paper]
15 Jun 2023

Diffusion Models for Zero-Shot Open-Vocabulary Segmentation
Laurynas Karazija, Iro Laina, Andrea Vedaldi, Christian Rupprecht
arXiv 2023. [Paper]
15 Jun 2023

Linguistic Binding in Diffusion Models: Enhancing Attribute Correspondence through Attention Map Alignment
Royi Rassin, Eran Hirsch, Daniel Glickman, Shauli Ravfogel, Yoav Goldberg, Gal Chechik
arXiv 2023. [Paper]
15 Jun 2023

Training Multimedia Event Extraction With Generated Images and Captions
Zilin Du, Yunxin Li, Xu Guo, Yidan Sun, Boyang Li
arXiv 2023. [Paper]
15 Jun 2023

VidEdit: Zero-Shot and Spatially Aware Text-Driven Video Editing
Paul Couairon, Clément Rambour, Jean-Emmanuel Haugeard, Nicolas Thome
arXiv 2023. [Paper]
14 Jun 2023

Norm-guided latent space exploration for text-to-image generation
Dvir Samuel, Rami Ben-Ari, Nir Darshan, Haggai Maron, Gal Chechik
arXiv 2023. [Paper]
14 Jun 2023

Training-free Diffusion Model Adaptation for Variable-Sized Text-to-Image Synthesis
Zhiyu Jin, Xuli Shen, Bin Li, Xiangyang Xue
arXiv 2023. [Paper]
14 Jun 2023

GBSD: Generative Bokeh with Stage Diffusion
Jieren Deng, Xin Zhou, Hao Tian, Zhihong Pan, Derek Aguiar
arXiv 2023. [Paper]
14 Jun 2023

Diffusion in Diffusion: Cyclic One-Way Diffusion for Text-Vision-Conditioned Generation
Yongqi Yang, Ruoyu Wang, Zhihao Qian, Ye Zhu, Yu Wu
arXiv 2023. [Paper]
14 Jun 2023

Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
Shuai Yang, Yifan Zhou, Ziwei Liu, Chen Change Loy
arXiv 2023. [Paper]
13 Jun 2023

Paste, Inpaint and Harmonize via Denoising: Subject-Driven Image Editing with Pre-Trained Diffusion Model
Xin Zhang, Jiaxian Guo, Paul Yoo, Yutaka Matsuo, Yusuke Iwasawa
arXiv 2023. [Paper]
13 Jun 2023

Controlling Text-to-Image Diffusion by Orthogonal Finetuning
Zeju Qiu, Weiyang Liu, Haiwen Feng, Yuxuan Xue, Yao Feng, Zhen Liu, Dan Zhang, Adrian Weller, Bernhard Schölkopf
arXiv 2023. [Paper]
12 Jun 2023

MovieFactory: Automatic Movie Creation from Text using Large Generative Models for Language and Images
Junchen Zhu, Huan Yang, Huiguo He, Wenjing Wang, Zixi Tuo, Wen-Huang Cheng, Lianli Gao, Jingkuan Song, Jianlong Fu
arXiv 2023. [Paper]
12 Jun 2023

InstructP2P: Learning to Edit 3D Point Clouds with Text Instructions
Jiale Xu, Xintao Wang, Yan-Pei Cao, Weihao Cheng, Ying Shan, Shenghua Gao
arXiv 2023. [Paper]
12 Jun 2023

Language-Guided Traffic Simulation via Scene-Level Diffusion
Ziyuan Zhong, Davis Rempe, Yuxiao Chen, Boris Ivanovic, Yulong Cao, Danfei Xu, Marco Pavone, Baishakhi Ray
arXiv 2023. [Paper]
10 Jun 2023

BOOT: Data-free Distillation of Denoising Diffusion Models with Bootstrapping
Jiatao Gu, Shuangfei Zhai, Yizhe Zhang, Lingjie Liu, Josh Susskind
arXiv 2023. [Paper]
8 Jun 2023

Grounded Text-to-Image Synthesis with Attention Refocusing
Quynh Phung, Songwei Ge, Jia-Bin Huang
arXiv 2023. [Paper]
8 Jun 2023

SyncDiffusion: Coherent Montage via Synchronized Joint Diffusions
Yuseung Lee, Kunho Kim, Hyunjin Kim, Minhyuk Sung
arXiv 2023. [Paper] [Project] [Github]
8 Jun 2023

Improving Tuning-Free Real Image Editing with Proximal Guidance
Ligong Han, Song Wen, Qi Chen, Zhixing Zhang, Kunpeng Song, Mengwei Ren, Ruijiang Gao, Yuxiao Chen, Di Liu, Qilong Zhangli, Anastasis Stathopoulos, Jindong Jiang, Zhaoyang Xia, Akash Srivastava, Dimitris Metaxas
arXiv 2023. [Paper]
8 Jun 2023

WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models
Changhoon Kim, Kyle Min, Maitreya Patel, Sheng Cheng, Yezhou Yang
arXiv 2023. [Paper]
7 Jun 2023

ConceptBed: Evaluating Concept Learning Abilities of Text-to-Image Diffusion Models
Maitreya Patel, Tejas Gokhale, Chitta Baral, Yezhou Yang
arXiv 2023. [Paper]
7 Jun 2023

Designing a Better Asymmetric VQGAN for StableDiffusion
Zixin Zhu, Xuelu Feng, Dongdong Chen, Jianmin Bao, Le Wang, Yinpeng Chen, Lu Yuan, Gang Hua
arXiv 2023. [Paper] [Github]
7 Jun 2023

Multi-modal Latent Diffusion
Mustapha Bounoua, Giulio Franzese, Pietro Michiardi
arXiv 2023. [Paper]
7 Jun 2023

Integrating Geometric Control into Text-to-Image Diffusion Models for High-Quality Detection Data Generation via Text Prompt
Kai Chen, Enze Xie, Zhe Chen, Lanqing Hong, Zhenguo Li, Dit-Yan Yeung
arXiv 2023. [Paper]
7 Jun 2023

Improving Diffusion-based Image Translation using Asymmetric Gradient Guidance
Gihyun Kwon, Jong Chul Ye
arXiv 2023. [Paper]
7 Jun 2023

Stable Diffusion is Unstable
Chengbin Du, Yanxi Li, Zhongwei Qiu, Chang Xu
arXiv 2023. [Paper]
5 Jun 2023

LipVoicer: Generating Speech from Silent Videos Guided by Lip Reading
Yochai Yemini, Aviv Shamsian, Lior Bracha, Sharon Gannot, Ethan Fetaya
arXiv 2023. [Paper] [Project]
5 Jun 2023

HeadSculpt: Crafting 3D Head Avatars with Text
Xiao Han, Yukang Cao, Kai Han, Xiatian Zhu, Jiankang Deng, Yi-Zhe Song, Tao Xiang, Kwan-Yee K. Wong
arXiv 2023. [Paper] [Project]
5 Jun 2023

Instruct-Video2Avatar: Video-to-Avatar Generation with Instructions
Shaoxu Li
arXiv 2023. [Paper]
5 Jun 2023

Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark
Shuyu Yang, Yinan Zhou, Yaxiong Wang, Yujiao Wu, Li Zhu, Zhedong Zheng
arXiv 2023. [Paper]
5 Jun 2023

User-friendly Image Editing with Minimal Text Input: Leveraging Captioning and Injection Techniques
Sunwoo Kim, Wooseok Jang, Hyunsu Kim, Junho Kim, Yunjey Choi, Seungryong Kim, Gayeong Lee
arXiv 2023. [Paper]
5 Jun 2023

Detector Guidance for Multi-Object Text-to-Image Generation
Luping Liu, Zijian Zhang, Yi Ren, Rongjie Huang, Xiang Yin, Zhou Zhao
arXiv 2023. [Paper]
4 Jun 2023

VideoComposer: Compositional Video Synthesis with Motion Controllability
Xiang Wang, Hangjie Yuan, Shiwei Zhang, Dayou Chen, Jiuniu Wang, Yingya Zhang, Yujun Shen, Deli Zhao, Jingren Zhou
NeruIPS 2023. [Paper] [Project] [Github]
3 Jun 2023

Word-Level Explanations for Analyzing Bias in Text-to-Image Models
Alexander Lin, Lucas Monteiro Paes, Sree Harsha Tanneru, Suraj Srinivas, Himabindu Lakkaraju
arXiv 2023. [Paper]
3 Jun 2023

Efficient Text-Guided 3D-Aware Portrait Generation with Score Distillation Sampling on Distribution
Yiji Cheng, Fei Yin, Xiaoke Huang, Xintong Yu, Jiaxiang Liu, Shikun Feng, Yujiu Yang, Yansong Tang
arXiv 2023. [Paper]
3 Jun 2023

Probabilistic Adaptation of Text-to-Video Models
Mengjiao Yang, Yilun Du, Bo Dai, Dale Schuurmans, Joshua B. Tenenbaum, Pieter Abbeel
arXiv 2023. [Paper] [Project]
2 Jun 2023

Video Colorization with Pre-trained Text-to-Image Diffusion Models
Hanyuan Liu, Minshan Xie, Jinbo Xing, Chengze Li, Tien-Tsin Wong
arXiv 2023. [Paper]
2 Jun 2023

Audio-Visual Speech Enhancement with Score-Based Generative Models
Julius Richter, Simone Frintrop, Timo Gerkmann
arXiv 2023. [Paper]
2 Jun 2023

Privacy Distillation: Reducing Re-identification Risk of Multimodal Diffusion Models
Virginia Fernandez, Pedro Sanchez, Walter Hugo Lopez Pinaya, Grzegorz Jacenków, Sotirios A. Tsaftaris, Jorge Cardoso
arXiv 2023. [Paper]
2 Jun 2023

StableRep: Synthetic Images from Text-to-Image Models Make Strong Visual Representation Learners
Yonglong Tian, Lijie Fan, Phillip Isola, Huiwen Chang, Dilip Krishnan
arXiv 2023. [Paper]
1 Jun 2023

Diffusion Self-Guidance for Controllable Image Generation
Dave Epstein, Allan Jabri, Ben Poole, Alexei A. Efros, Aleksander Holynski
arXiv 2023. [Paper] [Project]
1 Jun 2023

StyleDrop: Text-to-Image Generation in Any Style
Kihyuk Sohn, Nataniel Ruiz, Kimin Lee, Daniel Castro Chin, Irina Blok, Huiwen Chang, Jarred Barber, Lu Jiang, Glenn Entis, Yuanzhen Li, Yuan Hao, Irfan Essa, Michael Rubinstein, Dilip Krishnan
arXiv 2023. [Paper] [Project]
1 Jun 2023

Intriguing Properties of Text-guided Diffusion Models
Qihao Liu, Adam Kortylewski, Yutong Bai, Song Bai, Alan Yuille
arXiv 2023. [Paper]
1 Jun 2023

Intelligent Grimm – Open-ended Visual Storytelling via Latent Diffusion Models
Chang Liu, Haoning Wu, Yujie Zhong, Xiaoyun Zhang, Weidi Xie
arXiv 2023. [Paper] [Project]
1 Jun 2023

ViCo: Detail-Preserving Visual Condition for Personalized Text-to-Image Generation
Shaozhe Hao, Kai Han, Shihao Zhao, Kwan-Yee K. Wong
arXiv 2023. [Paper] [Github]
1 Jun 2023

The Hidden Language of Diffusion Models
Hila Chefer, Oran Lang, Mor Geva, Volodymyr Polosukhin, Assaf Shocher, Michal Irani, Inbar Mosseri, Lior Wolf
arXiv 2023. [Paper] [Project]
1 Jun 2023

Cocktail: Mixing Multi-Modality Controls for Text-Conditional Image Generation
Minghui Hu, Jianbin Zheng, Daqing Liu, Chuanxia Zheng, Chaoyue Wang, Dacheng Tao, Tat-Jen Cham
arXiv 2023. [Paper] [Project] [Github]
1 Jun 2023

Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance
Jinbo Xing, Menghan Xia, Yuxin Liu, Yuechen Zhang, Yong Zhang, Yingqing He, Hanyuan Liu, Haoxin Chen, Xiaodong Cun, Xintao Wang, Ying Shan, Tien-Tsin Wong
arXiv 2023. [Paper] [Project]
1 Jun 2023

Inserting Anybody in Diffusion Models via Celeb Basis
Ge Yuan, Xiaodong Cun, Yong Zhang, Maomao Li, Chenyang Qi, Xintao Wang, Ying Shan, Huicheng Zheng
arXiv 2023. [Paper] [Project]
1 Jun 2023

Wuerstchen: Efficient Pretraining of Text-to-Image Models
Pablo Pernias, Dominic Rampas, Marc Aubreville
arXiv 2023. [Paper]
1 Jun 2023

UniDiff: Advancing Vision-Language Models with Generative and Discriminative Learning
Xiao Dong, Runhui Huang, Xiaoyong Wei, Zequn Jie, Jianxing Yu, Jian Yin, Xiaodan Liang
arXiv 2023. [Paper]
1 Jun 2023

FigGen: Text to Scientific Figure Generation
Juan A. Rodriguez, David Vazquez, Issam Laradji, Marco Pedersoli, Pau Rodriguez
ICLR 2023. [Paper]
1 Jun 2023

Diffusion Brush: A Latent Diffusion Model-based Editing Tool for AI-generated Images
Peyman Gholami, Robert Xiao
arXiv 2023. [Paper]
31 May 2023

Understanding and Mitigating Copying in Diffusion Models
Gowthami Somepalli, Vasu Singla, Micah Goldblum, Jonas Geiping, Tom Goldstein
CVPR 2023. [Paper] [Github]
31 May 2023

Control4D: Dynamic Portrait Editing by Learning 4D GAN from 2D Diffusion-based Editor
Ruizhi Shao, Jingxiang Sun, Cheng Peng, Zerong Zheng, Boyao Zhou, Hongwen Zhang, Yebin Liu
arXiv 2023. [Paper] [Project]
31 May 2023

Boosting Text-to-Image Diffusion Models with Fine-Grained Semantic Rewards
Guian Fang, Zutao Jiang, Jianhua Han, Guansong Lu, Hang Xu, Xiaodan Liang
arXiv 2023. [Paper] [Github]
31 May 2023

Perturbation-Assisted Sample Synthesis: A Novel Approach for Uncertainty Quantification
Yifei Liu, Rex Shen, Xiaotong Shen
arXiv 2023. [Paper]
30 May 2023

PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation
Jialu Li, Mohit Bansal
arXiv 2023. [Paper] [Project] [Github]
30 May 2023

Video ControlNet: Towards Temporally Consistent Synthetic-to-Real Video Translation Using Conditional Image Diffusion Models
Ernie Chu, Shuo-Yen Lin, Jun-Cheng Chen
arXiv 2023. [Paper]
30 May 2023

Nested Diffusion Processes for Anytime Image Generation
Noam Elata, Bahjat Kawar, Tomer Michaeli, Michael Elad
arXiv 2023. [Paper]
30 May 2023

StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity 3D Avatar Generation
Chi Zhang, Yiwen Chen, Yijun Fu, Zhenglin Zhou, Gang YU, Billzb Wang, Bin Fu, Tao Chen, Guosheng Lin, Chunhua Shen
arXiv 2023. [Paper]
30 May 2023

HiFA: High-fidelity Text-to-3D with Advanced Diffusion Guidance
Junzhe Zhu, Peiye Zhuang
arXiv 2023. [Paper]
30 May 2023

LayerDiffusion: Layered Controlled Image Editing with Diffusion Models
Pengzhi Li, QInxuan Huang, Yikang Ding, Zhiheng Li
arXiv 2023. [Paper]
30 May 2023

Controllable Text-to-Image Generation with GPT-4
Tianjun Zhang, Yi Zhang, Vibhav Vineet, Neel Joshi, Xin Wang
arXiv 2023. [Paper]
29 May 2023

Cognitively Inspired Cross-Modal Data Generation Using Diffusion Models
Zizhao Hu, Mohammad Rostami
NeurIPS 2023. [Paper]
28 May 2023

RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths
Zeyue Xue, Guanglu Song, Qiushan Guo, Boxiao Liu, Zhuofan Zong, Yu Liu, Ping Luo
arXiv 2023. [Paper]
29 May 2023

Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models
Yuchao Gu, Xintao Wang, Jay Zhangjie Wu, Yujun Shi, Yunpeng Chen, Zihan Fan, Wuyou Xiao, Rui Zhao, Shuning Chang, Weijia Wu, Yixiao Ge, Ying Shan, Mike Zheng Shou
arXiv 2023. [Paper] [Project]
29 May 2023

Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising
Fu-Yun Wang, Wenshuo Chen, Guanglu Song, Han-Jia Ye, Yu Liu, Hongsheng Li
arXiv 2023. [Paper] [Github]
29 May 2023

Text-Only Image Captioning with Multi-Context Data Generation
Feipeng Ma, Yizhou Zhou, Fengyun Rao, Yueyi Zhang, Xiaoyan Sun
arXiv 2023. [Paper]
29 May 2023

InstructEdit: Improving Automatic Masks for Diffusion-based Image Editing With User Instructions
Qian Wang, Biao Zhang, Michael Birsak, Peter Wonka
arXiv 2023. [Paper]
29 May 2023

Conditional Score Guidance for Text-Driven Image-to-Image Translation
Hyunsoo Lee, Minsoo Kang, Bohyung Han
arXiv 2023. [Paper]
29 May 2023

Text-to-image Editing by Image Information Removal
Zhongping Zhang, Jian Zheng, Jacob Zhiyuan Fang, Bryan A. Plummer
arXiv 2023. [Paper]
27 May 2023

Towards Consistent Video Editing with Text-to-Image Diffusion Models
Zicheng Zhang, Bonan Li, Xuecheng Nie, Congying Han, Tiande Guo, Luoqi Liu
arXiv 2023. [Paper]
27 May 2023

FISEdit: Accelerating Text-to-image Editing via Cache-enabled Sparse Diffusion Inference
Zihao Yu, Haoyang Li, Fangcheng Fu, Xupeng Miao, Bin Cui
arXiv 2023. [Paper]
27 May 2023

ControlVideo: Adding Conditional Control for One Shot Text-to-Video Editing
Min Zhao, Rongzhen Wang, Fan Bao, Chongxuan Li, Jun Zhu
arXiv 2023. [Paper] [Project]
26 May 2023

Improved Visual Story Generation with Adaptive Context Modeling
Zhangyin Feng, Yuchen Ren, Xinmiao Yu, Xiaocheng Feng, Duyu Tang, Shuming Shi, Bing Qin
arXiv 2023. [Paper]
26 May 2023

Negative-prompt Inversion: Fast Image Inversion for Editing with Text-guided Diffusion Models
Daiki Miyake, Akihiro Iohara, Yu Saito, Toshiyuki Tanaka
arXiv 2023. [Paper]
26 May 2023

Are Diffusion Models Vision-And-Language Reasoners?
Benno Krojer, Elinor Poole-Dayan, Vikram Voleti, Christopher Pal, Siva Reddy
arXiv 2023. [Paper] [Github]
25 May 2023

DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models
Ying Fan, Olivia Watkins, Yuqing Du, Hao Liu, Moonkyung Ryu, Craig Boutilier, Pieter Abbeel, Mohammad Ghavamzadeh, Kangwook Lee, Kimin Lee
arXiv 2023. [Paper]
25 May 2023

Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
Shihao Zhao, Dongdong Chen, Yen-Chun Chen, Jianmin Bao, Shaozhe Hao, Lu Yuan, Kwan-Yee K. Wong
arXiv 2023. [Paper] [Project] [Github]
25 May 2023

Parallel Sampling of Diffusion Models
Andy Shih, Suneel Belkhale, Stefano Ermon, Dorsa Sadigh, Nima Anari
arXiv 2023. [Paper] [Github]
25 May 2023

Break-A-Scene: Extracting Multiple Concepts from a Single Image
Omri Avrahami, Kfir Aberman, Ohad Fried, Daniel Cohen-Or, Dani Lischinski
SIGGRAPH Asia 2023. [Paper] [Project] [Github]
25 May 2023

Diversify Your Vision Datasets with Automatic Diffusion-Based Augmentation
Lisa Dunlap, Alyssa Umino, Han Zhang, Jiezhi Yang, Joseph E. Gonzalez, Trevor Darrell
arXiv 2023. [Paper] [Github]
25 May 2023

Prompt-Free Diffusion: Taking “Text” out of Text-to-Image Diffusion Models
Xingqian Xu, Jiayi Guo, Zhangyang Wang, Gao Huang, Irfan Essa, Humphrey Shi
arXiv 2023. [Paper] [Github]
25 May 2023

ProSpect: Expanded Conditioning for the Personalization of Attribute-aware Image Generation
Yuxin Zhang, Weiming Dong, Fan Tang, Nisha Huang, Haibin Huang, Chongyang Ma, Tong-Yee Lee, Oliver Deussen, Changsheng Xu
arXiv 2023. [Paper]
25 May 2023

ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation
Zhengyi Wang, Cheng Lu, Yikai Wang, Fan Bao, Chongxuan Li, Hang Su, Jun Zhu
arXiv 2023. [Paper] [Project]
25 May 2023

On Architectural Compression of Text-to-Image Diffusion Models
Bo-Kyeong Kim, Hyoung-Kyu Song, Thibault Castells, Shinkook Choi
arXiv 2023. [Paper]
25 May 2023

Custom-Edit: Text-Guided Image Editing with Customized Diffusion Models
Jooyoung Choi, Yunjey Choi, Yunji Kim, Junho Kim, Sungroh Yoon
arXiv 2023. [Paper]
25 May 2023

MultiFusion: Fusing Pre-Trained Models for Multi-Lingual, Multi-Modal Image Generation
Marco Bellagente, Manuel Brack, Hannah Teufel, Felix Friedrich, Björn Deiseroth, Constantin Eichenberg, Andrew Dai, Robert Baldock, Souradeep Nanda, Koen Oostermeijer, Andres Felipe Cruz-Salinas, Patrick Schramowski, Kristian Kersting, Samuel Weinbach
arXiv 2023. [Paper]
24 May 2023

ChatFace: Chat-Guided Real Face Editing via Diffusion Latent Space Manipulation
Dongxu Yue, Qin Guo, Munan Ning, Jiaxi Cui, Yuesheng Zhu, Li Yuan
arXiv 2023. [Paper]
24 May 2023

DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Models
Sungnyun Kim, Junsoo Lee, Kibeom Hong, Daesik Kim, Namhyuk Ahn
arXiv 2023. [Paper] [Github]
24 May 2023

I Spy a Metaphor: Large Language Models and Diffusion Models Co-Create Visual Metaphors
Tuhin Chakrabarty, Arkadiy Saakyan, Olivia Winn, Artemis Panagopoulou, Yue Yang, Marianna Apidianaki, Smaranda Muresan
arXiv 2023. [Paper]
24 May 2023

BLIP-Diffusion: Pre-trained Subject Representation for Controllable Text-to-Image Generation and Editing
Dongxu Li, Junnan Li, Steven C. H. Hoi
arXiv 2023. [Paper]
24 May 2023

Adversarial Nibbler: A Data-Centric Challenge for Improving the Safety of Text-to-Image Models
Alicia Parrish, Hannah Rose Kirk, Jessica Quaye, Charvi Rastogi, Max Bartolo, Oana Inel, Juan Ciro, Rafael Mosquera, Addison Howard, Will Cukierski, D. Sculley, Vijay Janapa Reddi, Lora Aroyo
arXiv 2023. [Paper]
22 May 2023

Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models
Ruichen Wang, Zekang Chen, Chen Chen, Jian Ma, Haonan Lu, Xiaodong Lin
arXiv 2023. [Paper]
23 May 2023

Unsafe Diffusion: On the Generation of Unsafe Images and Hateful Memes From Text-To-Image Models
Yiting Qu, Xinyue Shen, Xinlei He, Michael Backes, Savvas Zannettou, Yang Zhang
arXiv 2023. [Paper]
23 May 2023

Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models
Weifeng Chen, Jie Wu, Pan Xie, Hefeng Wu, Jiashi Li, Xin Xia, Xuefeng Xiao, Liang Lin
arXiv 2023. [Paper]
23 May 2023

Understanding Text-driven Motion Synthesis with Keyframe Collaboration via Diffusion Models
Dong Wei, Xiaoning Sun, Huaijiang Sun, Bin Li, Shengxiang Hu, Weiqing Li, Jianfeng Lu
arXiv 2023. [Paper]
23 May 2023

LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models
Long Lian, Boyi Li, Adam Yala, Trevor Darrell
arXiv 2023. [Paper]
23 May 2023

LaDI-VTON: Latent Diffusion Textual-Inversion Enhanced Virtual Try-On
Davide Morelli, Alberto Baldrati, Giuseppe Cartella, Marcella Cornia, Marco Bertini, Rita Cucchiara
arXiv 2023. [Paper]
22 May 2023

FACTIFY3M: A Benchmark for Multimodal Fact Verification with Explainability through 5W Question-Answering
Megha Chakraborty, Khusbu Pahwa, Anku Rani, Adarsh Mahor, Aditya Pakala, Arghya Sarkar, Harshit Dave, Ishan Paul, Janvita Reddy, Preethi Gurumurthy, Ritvik G, Samahriti Mukherjee, Shreyas Chatterjee, Kinjal Sensharma, Dwip Dalal, Suryavardan S, Shreyash Mishra, Parth Patwa, Aman Chadha, Amit Sheth, Amitava Das
arXiv 2023. [Paper]
22 May 2023

Training Diffusion Models with Reinforcement Learning
Kevin Black, Michael Janner, Yilun Du, Ilya Kostrikov, Sergey Levine
arXiv 2023. [Paper]
22 May 2023

If at First You Don’t Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection
Shyamgopal Karthik, Karsten Roth, Massimiliano Mancini, Zeynep Akata
arXiv 2023. [Paper] [Project]
22 May 2023

ControlVideo: Training-free Controllable Text-to-Video Generation
Yabo Zhang, Yuxiang Wei, Dongsheng Jiang, Xiaopeng Zhang, Wangmeng Zuo, Qi Tian
arXiv 2023. [Paper] [Github]
22 May 2023

AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation
Guy Yariv, Itai Gat, Lior Wolf, Yossi Adi, Idan Schwartz
arXiv 2023. [Paper]
22 May 2023

The CLIP Model is Secretly an Image-to-Prompt Converter
Yuxuan Ding, Chunna Tian, Haoxuan Ding, Lingqiao Liu
arXiv 2023. [Paper]
22 May 2023

InstructVid2Vid: Controllable Video Editing with Natural Language Instructions
Bosheng Qin, Juncheng Li, Siliang Tang, Tat-Seng Chua, Yueting Zhuang
arXiv 2023. [Paper]
21 May 2023

SneakyPrompt: Evaluating Robustness of Text-to-image Generative Models’ Safety Filters
Yuchen Yang, Bo Hui, Haolin Yuan, Neil Gong, Yinzhi Cao
arXiv 2023. [Paper]
20 May 2023

Late-Constraint Diffusion Guidance for Controllable Image Synthesis
Chang Liu, Dong Liu
arXiv 2023. [Paper] [Project] [Github]
19 May 2023

Any-to-Any Generation via Composable Diffusion
Zineng Tang, Ziyi Yang, Chenguang Zhu, Michael Zeng, Mohit Bansal
arXiv 2023. [Paper] [Project] [Github]
19 May 2023

Text2NeRF: Text-Driven 3D Scene Generation with Neural Radiance Fields
Jingbo Zhang, Xiaoyu Li, Ziyu Wan, Can Wang, Jing Liao
arXiv 2023. [Paper]
19 May 2023

Brain Captioning: Decoding human brain activity into images and text
Matteo Ferrante, Furkan Ozcelik, Tommaso Boccato, Rufin VanRullen, Nicola Toschi
arXiv 2023. [Paper]
19 May 2023

Efficient Cross-Lingual Transfer for Chinese Stable Diffusion with Images as Pivots
Jinyi Hu, Xu Han, Xiaoyuan Yi, Yutong Chen, Wenhao Li, Zhiyuan Liu, Maosong Sun
arXiv 2023. [Paper]
19 May 2023

Discriminative Diffusion Models as Few-shot Vision and Language Learners
Xuehai He, Weixi Feng, Tsu-Jui Fu, Varun Jampani, Arjun Akula, Pradyumna Narayana, Sugato Basu, William Yang Wang, Xin Eric Wang
arXiv 2023. [Paper]
18 May 2023

Zero-Day Backdoor Attack against Text-to-Image Diffusion Models via Personalization
Yihao Huang, Qing Guo, Felix Juefei-Xu
arXiv 2023. [Paper]
18 May 2023

AIwriting: Relations Between Image Generation and Digital Writing
Scott Rettberg, Talan Memmott, Jill Walker Rettberg, Jason Nelson, Patrick Lichty
ISEA 2023. [Paper]
18 May 2023

TextDiffuser: Diffusion Models as Text Painters
Jingye Chen, Yupan Huang, Tengchao Lv, Lei Cui, Qifeng Chen, Furu Wei
arXiv 2023. [Paper]
18 May 2023

VideoFactory: Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation
Wenjing Wang, Huan Yang, Zixi Tuo, Huiguo He, Junchen Zhu, Jianlong Fu, Jiaying Liu
arXiv 2023. [Paper]
18 May 2023

LDM3D: Latent Diffusion Model for 3D
Gabriela Ben Melech Stan, Diana Wofk, Scottie Fox, Alex Redden, Will Saxton, Jean Yu, Estelle Aflalo, Shao-Yen Tseng, Fabio Nonato, Matthias Muller, Vasudev Lal
arXiv 2023. [Paper]
18 May 2023

X-IQE: eXplainable Image Quality Evaluation for Text-to-Image Generation with Visual Large Language Models
Yixiong Chen
arXiv 2023. [Paper] [Github]
18 May 2023

Inspecting the Geographical Representativeness of Images from Text-to-Image Models
Abhipsa Basu, R. Venkatesh Babu, Danish Pruthi
arXiv 2023. [Paper]
18 May 2023

Preserve Your Own Correlation: A Noise Prior for Video Diffusion Models
Songwei Ge, Seungjun Nah, Guilin Liu, Tyler Poon, Andrew Tao, Bryan Catanzaro, David Jacobs, Jia-Bin Huang, Ming-Yu Liu, Yogesh Balaji
arXiv 2023. [Paper] [Project]
17 May 2023

AMD: Autoregressive Motion Diffusion
Bo Han, Hao Peng, Minjing Dong, Chang Xu, Yi Ren, Yixuan Shen, Yuheng Li
arXiv 2023. [Paper]
16 May 2023

Generating coherent comic with rich story using ChatGPT and Stable Diffusion
Ze Jin, Zorina Song
arXiv 2023. [Paper]
16 May 2023

Make-An-Animation: Large-Scale Text-conditional 3D Human Motion Generation
Samaneh Azadi, Akbar Shah, Thomas Hayes, Devi Parikh, Sonal Gupta
arXiv 2023. [Paper] [Project]
16 May 2023

Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts
Yuyang Zhao, Enze Xie, Lanqing Hong, Zhenguo Li, Gim Hee Lee
arXiv 2023. [Paper] [Project] [Github]
15 May 2023

Common Diffusion Noise Schedules and Sample Steps are Flawed
Shanchuan Lin, Bingchen Liu, Jiashi Li, Xiao Yang
arXiv 2023. [Paper]
15 May 2023

Interactive Fashion Content Generation Using LLMs and Latent Diffusion Models
Krishna Sri Ipsit Mantri, Nevasini Sasikumar
arXiv 2023. [Paper]
15 May 2023

Null-text Guidance in Diffusion Models is Secretly a Cartoon-style Creator
Jing Zhao, Heliang Zheng, Chaoyue Wang, Long Lan, Wanrong Huang, Wenjing Yang
arXiv 2023. [Paper] [Project] [Github]
11 May 2023

iEdit: Localised Text-guided Image Editing with Weak Supervision
Rumeysa Bodur, Erhan Gundogdu, Binod Bhattarai, Tae-Kyun Kim, Michael Donoser, Loris Bazzani
arXiv 2023. [Paper]
10 May 2023

SUR-adapter: Enhancing Text-to-Image Pre-trained Diffusion Models with Large Language Models
Shanshan Zhong, Zhongzhan Huang, Wushao Wen, Jinghui Qin, Liang Lin
arXiv 2023. [Paper] [Github]
9 May 2023

Style-A-Video: Agile Diffusion for Arbitrary Text-based Video Style Transfer
Nisha Huang, Yuxin Zhang, Weiming Dong
arXiv 2023. [Paper]
9 May 2023

DiffuseStyleGesture: Stylized Audio-Driven Co-Speech Gesture Generation with Diffusion Models
Sicheng Yang, Zhiyong Wu, Minglei Li, Zhensong Zhang, Lei Hao, Weihong Bao, Ming Cheng, Long Xiao
IJCAI 2023. [Paper] [Github]
8 May 2023

IIITD-20K: Dense captioning for Text-Image ReID
A V Subramanyam, Niranjan Sundararajan, Vibhu Dubey, Brejesh Lall
arXiv 2023. [Paper]
8 May 2023

ReGeneration Learning of Diffusion Models with Rich Prompts for Zero-Shot Image Translation
Yupei Lin, Sen Zhang, Xiaojun Yang, Xiao Wang, Yukai Shi
arXiv 2023. [Paper] [Project]
8 May 2023

Prompt Tuning Inversion for Text-Driven Image Editing Using Diffusion Models
Wenkai Dong, Song Xue, Xiaoyue Duan, Shumin Han
arXiv 2023. [Paper]
8 May 2023

Text-to-Image Diffusion Models can be Easily Backdoored through Multimodal Data Poisoning
Shengfang Zhai, Yinpeng Dong, Qingni Shen, Shi Pu, Yuejian Fang, Hang Su
arXiv 2023. [Paper]
7 May 2023

AADiff: Audio-Aligned Video Synthesis with Text-to-Image Diffusion
Seungwoo Lee, Chaerin Kong, Donghyeon Jeon, Nojun Kwak
arXiv 2023. [Paper]
6 May 2023

Data Curation for Image Captioning with Text-to-Image Generative Models
Wenyan Li, Jonas F. Lotz, Chen Qiu, Desmond Elliott
arXiv 2023. [Paper]
5 May 2023

DisenBooth: Identity-Preserving Disentangled Tuning for Subject-Driven Text-to-Image Generation
Hong Chen, Yipeng Zhang, Xin Wang, Xuguang Duan, Yuwei Zhou, Wenwu Zhu
arXiv 2023. [Paper] [Project]
5 May 2023

Guided Image Synthesis via Initial Image Editing in Diffusion Model
Jiafeng Mao, Xueting Wang, Kiyoharu Aizawa
arXiv 2023. [Paper]
5 May 2023

Diffusion Explainer: Visual Explanation for Text-to-image Stable Diffusion
Seongmin Lee, Benjamin Hoover, Hendrik Strobelt, Zijie J. Wang, ShengYun Peng, Austin Wright, Kevin Li, Haekyu Park, Haoyang Yang, Duen Horng Chau
arXiv 2023. [Paper] [Project]
4 May 2023

Multimodal-driven Talking Face Generation, Face Swapping, Diffusion Model
Chao Xu, Shaoting Zhu, Junwei Zhu, Tianxin Huang, Jiangning Zhang, Ying Tai, Yong Liu
arXiv 2023. [Paper]
4 May 2023

Multimodal Data Augmentation for Image Captioning using Diffusion Models
Changrong Xiao, Sean Xin Xu, Kunpeng Zhang
arXiv 2023. [Paper]
3 May 2023

In-Context Learning Unlocked for Diffusion Models
Zhendong Wang, Yifan Jiang, Yadong Lu, Yelong Shen, Pengcheng He, Weizhu Chen, Zhangyang Wang, Mingyuan Zhou
arXiv 2023. [Paper] [Project] [Github]
1 May 2023

SceneGenie: Scene Graph Guided Diffusion Models for Image Synthesis
Azade Farshad, Yousef Yeganeh, Yu Chi, Chengzhi Shen, Björn Ommer, Nassir Navab
arXiv 2023. [Paper]
28 Apr 2023

It is all about where you start: Text-to-image generation with seed selection
Dvir Samuel, Rami Ben-Ari, Simon Raviv, Nir Darshan, Gal Chechik
arXiv 2023. [Paper]
27 Apr 2023

Edit Everything: A Text-Guided Generative System for Images Editing
Defeng Xie, Ruichen Wang, Jian Ma, Chen Chen, Haonan Lu, Dong Yang, Fobo Shi, Xiaodong Lin
arXiv 2023. [Paper] [Github]
27 Apr 2023

Training-Free Location-Aware Text-to-Image Synthesis
Jiafeng Mao, Xueting Wang
arXiv 2023. [Paper]
26 Apr 2023

TextMesh: Generation of Realistic 3D Meshes From Text Prompts
Christina Tsalicoglou, Fabian Manhardt, Alessio Tonioni, Michael Niemeyer, Federico Tombari
arXiv 2023. [Paper]
24 Apr 2023

Using Text-to-Image Generation for Architectural Design Ideation
Ville Paananen, Jonas Oppenlaender, Aku Visuri
arXiv 2023. [Paper]
20 Apr 2023

Anything-3D: Towards Single-view Anything Reconstruction in the Wild
Qiuhong Shen, Xingyi Yang, Xinchao Wang
arXiv 2023. [Paper] [Github]
19 Apr 2023

UPGPT: Universal Diffusion Model for Person Image Generation, Editing and Pose Transfer
Soon Yau Cheong, Armin Mustafa, Andrew Gilbert
ICCV Workshop 2023. [Paper] [Github]
18 Apr 2023

TTIDA: Controllable Generative Data Augmentation via Text-to-Text and Text-to-Image Models
Yuwei Yin, Jean Kaddour, Xiang Zhang, Yixin Nie, Zhenguang Liu, Lingpeng Kong, Qi Liu
arXiv 2023. [Paper]
18 Apr 2023

Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models
Andreas Blattmann, Robin Rombach, Huan Ling, Tim Dockhorn, Seung Wook Kim, Sanja Fidler, Karsten Kreis
CVPR 2023. [Paper] [Project]
18 Apr 2023

Text2Performer: Text-Driven Human Video Generation
Yuming Jiang, Shuai Yang, Tong Liang Koh, Wayne Wu, Chen Change Loy, Ziwei Liu
arXiv 2023. [Paper] [Project]
17 Apr 2023

Latent-Shift: Latent Diffusion with Temporal Shift for Efficient Text-to-Video Generation
Jie An, Songyang Zhang, Harry Yang, Sonal Gupta, Jia-Bin Huang, Jiebo Luo, Xi Yin
arXiv 2023. [Paper] [Project]
17 Apr 2023

MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and Editing
Mingdeng Cao, Xintao Wang, Zhongang Qi, Ying Shan, Xiaohu Qie, Yinqiang Zheng
arXiv 2023. [Paper] [Github]
17 Apr 2023

Text-Conditional Contextualized Avatars For Zero-Shot Personalization
Samaneh Azadi, Thomas Hayes, Akbar Shah, Guan Pang, Devi Parikh, Sonal Gupta
arXiv 2023. [Paper]
14 Apr 2023

Delta Denoising Score
Amir Hertz, Kfir Aberman, Daniel Cohen-Or
arXiv 2023. [Paper] [Project]
14 Apr 2023

Expressive Text-to-Image Generation with Rich Text
Songwei Ge, Taesung Park, Jun-Yan Zhu, Jia-Bin Huang
arXiv 2023. [Paper] [Project] [Github]
13 Apr 2023

Soundini: Sound-Guided Diffusion for Natural Video Editing
Seung Hyun Lee, Sieun Kim, Innfarn Yoo, Feng Yang, Donghyeon Cho, Youngseo Kim, Huiwen Chang, Jinkyu Kim, Sangpil Kim
arXiv 2023. [Paper] [Project]
13 Apr 2023

Improving Diffusion Models for Scene Text Editing with Dual Encoders
Jiabao Ji, Guanhua Zhang, Zhaowen Wang, Bairu Hou, Zhifei Zhang, Brian Price, Shiyu Chang
arXiv 2023. [Paper] [Github]
12 Apr 2023

An Edit Friendly DDPM Noise Space: Inversion and Manipulations
Inbar Huberman-Spiegelglas, Vladimir Kulikov, Tomer Michaeli
arXiv 2023. [Paper]
12 Apr 2023

Continual Diffusion: Continual Customization of Text-to-Image Diffusion with C-LoRA
James Seale Smith, Yen-Chang Hsu, Lingyu Zhang, Ting Hua, Zsolt Kira, Yilin Shen, Hongxia Jin
arXiv 2023. [Paper] [Project]
12 Apr 2023

HRS-Bench: Holistic, Reliable and Scalable Benchmark for Text-to-Image Models
Eslam Mohamed Bakr, Pengzhan Sun, Xiaoqian Shen, Faizan Farooq Khan, Li Erran Li, Mohamed Elhoseiny
arXiv 2023. [Paper] [Project]
11 Apr 2023

Re-imagine the Negative Prompt Algorithm: Transform 2D Diffusion into 3D, alleviate Janus problem and Beyond
Mohammadreza Armandpour, Huangjie Zheng, Ali Sadeghian, Amir Sadeghian, Mingyuan Zhou
arXiv 2023. [Paper]
11 Apr 2023

Towards Real-time Text-driven Image Manipulation with Unconditional Diffusion Models
Nikita Starodubcev, Dmitry Baranchuk, Valentin Khrulkov, Artem Babenko
arXiv 2023. [Paper]
10 Apr 2023

HumanSD: A Native Skeleton-Guided Diffusion Model for Human Image Generation
Xuan Ju, Ailing Zeng, Chenchen Zhao, Jianan Wang, Lei Zhang, Qiang Xu
arXiv 2023. [Paper] [Github]
9 Apr 2023

Harnessing the Spatial-Temporal Attention of Diffusion Models for High-Fidelity Text-to-Image Synthesis
Qiucheng Wu, Yujian Liu, Handong Zhao, Trung Bui, Zhe Lin, Yang Zhang, Shiyu Chang
arXiv 2023. [Paper] [Github]
7 Apr 2023

Zero-shot Generative Model Adaptation via Image-specific Prompt Learning
Jiayi Guo, Chaofei Wang, You Wu, Eric Zhang, Kai Wang, Xingqian Xu, Shiji Song, Humphrey Shi, Gao Huang
CVPR 2023. [Paper] [Github]
6 Apr 2023

Training-Free Layout Control with Cross-Attention Guidance
Minghao Chen, Iro Laina, Andrea Vedaldi
arXiv 2023. [Paper] [Project] [Github]
6 Apr 2023

Benchmarking Robustness to Text-Guided Corruptions
Mohammadreza Mofayezi, Yasamin Medghalchi
arXiv 2023. [Paper]
6 Apr 2023

DITTO-NeRF: Diffusion-based Iterative Text To Omni-directional 3D Model
Hoigi Seo, Hayeon Kim, Gwanghyun Kim, Se Young Chun
arXiv 2023. [Paper] [Project]
6 Apr 2023

Taming Encoder for Zero Fine-tuning Image Customization with Text-to-Image Diffusion Models
Xuhui Jia, Yang Zhao, Kelvin C.K. Chan, Yandong Li, Han Zhang, Boqing Gong, Tingbo Hou, Huisheng Wang, Yu-Chuan Su
arXiv 2023. [Paper]
5 Apr 2023

A Diffusion-based Method for Multi-turn Compositional Image Generation
Chao Wang, Xiaoyu Yang, Jinmiao Huang, Kevin Ferreira
arXiv 2023. [Paper]
5 Apr 2023

viz2viz: Prompt-driven stylized visualization generation using a diffusion model
Jiaqi Wu, John Joon Young Chung, Eytan Adar
arXiv 2023. [Paper]
4 Apr 2023

Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing
Alberto Baldrati, Davide Morelli, Giuseppe Cartella, Marcella Cornia, Marco Bertini, Rita Cucchiara
arXiv 2023. [Paper]
4 Apr 2023

PODIA-3D: Domain Adaptation of 3D Generative Model Across Large Domain Gap Using Pose-Preserved Text-to-Image Diffusion
Gwanghyun Kim, Ji Ha Jang, Se Young Chun
arXiv 2023. [Paper] [Project]
4 Apr 2023

Text-Conditioned Sampling Framework for Text-to-Image Generation with Masked Generative Models
Jaewoong Lee, Sangwon Jang, Jaehyeong Jo, Jaehong Yoon, Yunji Kim, Jin-Hwa Kim, Jung-Woo Ha, Sung Ju Hwang
arXiv 2023. [Paper]
4 Apr 2023

ReMoDiffuse: Retrieval-Augmented Motion Diffusion Model
Mingyuan Zhang, Xinying Guo, Liang Pan, Zhongang Cai, Fangzhou Hong, Huirong Li, Lei Yang, Ziwei Liu
arXiv 2023. [Paper] [Project] [Github]
3 Apr 2023

DreamAvatar: Text-and-Shape Guided 3D Human Avatar Generation via Diffusion Models
Yukang Cao, Yan-Pei Cao, Kai Han, Ying Shan, Kwan-Yee K. Wong
arXiv 2023. [Paper]
3 Apr 2023

DreamFace: Progressive Generation of Animatable 3D Faces under Text Guidance
Longwen Zhang, Qiwei Qiu, Hongyang Lin, Qixuan Zhang, Cheng Shi, Wei Yang, Ye Shi, Sibei Yang, Lan Xu, Jingyi Yu
arXiv 2023. [Paper] [Project]
1 Apr 2023

GlyphDraw: Learning to Draw Chinese Characters in Image Synthesis Models Coherently
Jian Ma, Mingjun Zhao, Chen Chen, Ruichen Wang, Di Niu, Haonan Lu, Xiaodong Lin
arXiv 2023. [Paper] [Project]
31 Mar 2023

AvatarCraft: Transforming Text into Neural Human Avatars with Parameterized Shape and Pose Control
Ruixiang Jiang, Can Wang, Jingbo Zhang, Menglei Chai, Mingming He, Dongdong Chen, Jing Liao
arXiv 2023. [Paper] [Project] [Github]
30 Mar 2023

PAIR-Diffusion: Object-Level Image Editing with Structure-and-Appearance Paired Diffusion Models
Vidit Goel, Elia Peruzzo, Yifan Jiang, Dejia Xu, Nicu Sebe, Trevor Darrell, Zhangyang Wang, Humphrey Shi
arXiv 2023. [Paper] [Github]
30 Mar 2023

Social Biases through the Text-to-Image Generation Lens
Ranjita Naik, Besmira Nushi
arXiv 2023. [Paper]
30 Mar 2023

Forget-Me-Not: Learning to Forget in Text-to-Image Diffusion Models
Eric Zhang, Kai Wang, Xingqian Xu, Zhangyang Wang, Humphrey Shi
arXiv 2023. [Paper] [Github]
30 Mar 2023

DiffCollage: Parallel Generation of Large Content with Diffusion Models
Qinsheng Zhang, Jiaming Song, Xun Huang, Yongxin Chen, Ming-Yu Liu
CVPR 2023. [Paper] [Project]
30 Mar 2023

Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models
Wen Wang, Kangyang Xie, Zide Liu, Hao Chen, Yue Cao, Xinlong Wang, Chunhua Shen
arXiv 2023. [Paper]
30 Mar 2023

Discriminative Class Tokens for Text-to-Image Diffusion Models
Idan Schwartz, Vésteinn Snæbjarnarson, Sagie Benaim, Hila Chefer, Ryan Cotterell, Lior Wolf, Serge Belongie
arXiv 2023. [Paper]
30 Mar 2023

DAE-Talker: High Fidelity Speech-Driven Talking Face Generation with Diffusion Autoencoder
Chenpng Du, Qi Chen, Tianyu He, Xu Tan, Xie Chen, Kai Yu, Sheng Zhao, Jiang Bian
arXiv 2023. [Paper]
30 Mar 2023

LayoutDiffusion: Controllable Diffusion Model for Layout-to-image Generation
Guangcong Zheng, Xianpan Zhou, Xuewei Li, Zhongang Qi, Ying Shan, Xi Li
CVPR 2023. [Paper] [Github]
30 Mar 2023

4D Facial Expression Diffusion Model
Kaifeng Zou, Sylvain Faisan, Boyang Yu, Sébastien Valette, Hyewon Seo
arXiv 2023. [Paper] [Github]
29 Mar 2023

MDP: A Generalized Framework for Text-Guided Image Editing by Manipulating the Diffusion Path
Qian Wang, Biao Zhang, Michael Birsak, Peter Wonka
arXiv 2023. [Paper] [Github]
29 Mar 2023

Instruct 3D-to-3D: Text Instruction Guided 3D-to-3D conversion
Hiromichi Kamata, Yuiko Sakuma, Akio Hayakawa, Masato Ishii, Takuya Narihira
arXiv 2023. [Paper] [Github]
28 Mar 2023

StyleDiffusion: Prompt-Embedding Inversion for Text-Based Editing
Senmao Li, Joost van de Weijer, Taihang Hu, Fahad Shahbaz Khan, Qibin Hou, Yaxing Wang, Jian Yang
arXiv 2023. [Paper]
28 Mar 2023

Seer: Language Instructed Video Prediction with Latent Diffusion Models
Xianfan Gu, Chuan Wen, Jiaming Song, Yang Gao
CVPR Workshop 2023. [Paper]
27 Mar 2023

Debiasing Scores and Prompts of 2D Diffusion for Robust Text-to-3D Generation
Susung Hong, Donghoon Ahn, Seungryong Kim
arXiv 2023. [Paper]
27 Mar 2023

Anti-DreamBooth: Protecting users from personalized text-to-image synthesis
Thanh Van Le, Hao Phung, Thuan Hoang Nguyen, Quan Dao, Ngoc Tran, Anh Tran
SIGGRAPH 2023. [Paper] [Github]
27 Mar 2023

GestureDiffuCLIP: Gesture Diffusion Model with CLIP Latents
Tenglong Ao, Zeyi Zhang, Libin Liu
arXiv 2023. [Paper]
26 Mar 2023

Better Aligning Text-to-Image Models with Human Preference
Xiaoshi Wu, Keqiang Sun, Feng Zhu, Rui Zhao, Hongsheng Li
arXiv 2023. [Paper] [Github]
25 Mar 2023

ISS++: Image as Stepping Stone for Text-Guided 3D Shape Generation
Zhengzhe Liu, Peng Dai, Ruihui Li, Xiaojuan Qi, Chi-Wing Fu
ICLR 2023. [Paper]
24 Mar 2023

DiffuScene: Scene Graph Denoising Diffusion Probabilistic Model for Generative Indoor Scene Synthesis
Jiapeng Tang, Yinyu Nie, Lev Markhasin, Angela Dai, Justus Thies, Matthias Nießner
arXiv 2023. [Paper] [Project]
24 Mar 2023

CompoNeRF: Text-guided Multi-object Compositional NeRF with Editable 3D Scene Layout
Yiqi Lin, Haotian Bai, Sijia Li, Haonan Lu, Xiaodong Lin, Hui Xiong, Lin Wang
arXiv 2023. [Paper] [Project]
24 Mar 2023

Fantasia3D: Disentangling Geometry and Appearance for High-quality Text-to-3D Content Creation
Rui Chen, Yongwei Chen, Ningxin Jiao, Kui Jia
arXiv 2023. [Paper]
24 Mar 2023

ReVersion: Diffusion-Based Relation Inversion from Images
Ziqi Huang, Tianxing Wu, Yuming Jiang, Kelvin C.K. Chan, Ziwei Liu
arXiv 2023. [Paper] [Project] [Github] 23 Mar 2023

Ablating Concepts in Text-to-Image Diffusion Models
Nupur Kumari, Bingliang Zhang, Sheng-Yu Wang, Eli Shechtman, Richard Zhang, Jun-Yan Zhu
arXiv 2023. [Paper] [Project] [Github]
23 Mar 2023

Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators
Levon Khachatryan, Andranik Movsisyan, Vahram Tadevosyan, Roberto Henschel, Zhangyang Wang, Shant Navasardyan, Humphrey Shi
arXiv 2023. [Paper] [Github]
23 Mar 2023

MagicFusion: Boosting Text-to-Image Generation Performance by Fusing Diffusion Models
Jing Zhao, Heliang Zheng, Chaoyue Wang, Long Lan, Wenjing Yang
arXiv 2023. [Paper] [Project] [Github]
23 Mar 2023

Pix2Video: Video Editing using Image Diffusion
Duygu Ceylan, Chun-Hao Paul Huang, Niloy J. Mitra
arXiv 2023. [Paper] [Project]
22 Mar 2023

Instruct-NeRF2NeRF: Editing 3D Scenes with Instructions
Ayaan Haque, Matthew Tancik, Alexei A. Efros, Aleksander Holynski, Angjoo Kanazawa
arXiv 2023. [Paper] [Project]
22 Mar 2023

SALAD: Part-Level Latent Diffusion for 3D Shape Generation and Manipulation
Juil Koo, Seungwoo Yoo, Minh Hieu Nguyen, Minhyuk Sung
arXiv 2023. [Paper] [Project]
21 Mar 2023

Vox-E: Text-guided Voxel Editing of 3D Objects
Etai Sella, Gal Fiebelman, Peter Hedman, Hadar Averbuch-Elor
arXiv 2023. [Paper] [Project]
21 Mar 2023

CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion
Geonmo Gu, Sanghyuk Chun, Wonjae Kim, HeeJae Jun, Yoohoon Kang, Sangdoo Yun
arXiv 2023. [Paper]
21 Mar 2023

3D-CLFusion: Fast Text-to-3D Rendering with Contrastive Latent Diffusion
Yu-Jhe Li, Kris Kitani
arXiv 2023. [Paper]
21 Mar 2023

Text2Tex: Text-driven Texture Synthesis via Diffusion Models
Dave Zhenyu Chen, Yawar Siddiqui, Hsin-Ying Lee, Sergey Tulyakov, Matthias Nießner
arXiv 2023. [Paper] [Project]
20 Mar 2023

Localizing Object-level Shape Variations with Text-to-Image Diffusion Models
Or Patashnik, Daniel Garibi, Idan Azuri, Hadar Averbuch-Elor, Daniel Cohen-Or
arXiv 2023. [Paper] [Project]
20 Mar 2023

SVDiff: Compact Parameter Space for Diffusion Fine-Tuning
Ligong Han, Yinxiao Li, Han Zhang, Peyman Milanfar, Dimitris Metaxas, Feng Yang
arXiv 2023. [Paper]
20 Mar 2023

Discovering Interpretable Directions in the Semantic Latent Space of Diffusion Models
René Haas, Inbar Huberman-Spiegelglas, Rotem Mulayoff, Tomer Michaeli
arXiv 2023. [Paper]
20 Mar 2023

SKED: Sketch-guided Text-based 3D Editing
Aryan Mikaeili, Or Perel, Daniel Cohen-Or, Ali Mahdavi-Amiri
arxiv 2023. [Paper]
19 Mar 2023

DialogPaint: A Dialog-based Image Editing Model
Jingxuan Wei, Shiyu Wu, Xin Jiang, Yequan Wang
arXiv 2023. [Paper]
17 Mar 2023

GlueGen: Plug and Play Multi-modal Encoders for X-to-image Generation
Can Qin, Ning Yu, Chen Xing, Shu Zhang, Zeyuan Chen, Stefano Ermon, Yun Fu, Caiming Xiong, Ran Xu
arXiv 2023. [Paper]
17 Mar 2023

DiffusionRet: Generative Text-Video Retrieval with Diffusion Model
Peng Jin, Hao Li, Zesen Cheng, Kehan Li, Xiangyang Ji, Chang Liu, Li Yuan, Jie Chen
arXiv 2023. [Paper]
17 Mar 2023

FreeDoM: Training-Free Energy-Guided Conditional Diffusion Model
Jiwen Yu, Yinhuai Wang, Chen Zhao, Bernard Ghanem, Jian Zhang
arXiv 2023. [Paper] [Github]
17 Mar 2023

Unified Multi-Modal Latent Diffusion for Joint Subject and Text Conditional Image Generation
Yiyang Ma, Huan Yang, Wenjing Wang, Jianlong Fu, Jiaying Liu
arXiv 2023. [Paper]
16 Mar 2023

FateZero: Fusing Attentions for Zero-shot Text-based Video Editing
Chenyang Qi, Xiaodong Cun, Yong Zhang, Chenyang Lei, Xintao Wang, Ying Shan, Qifeng Chen
arXiv 2023. [Paper] [Project] [Github]
16 Mar 2023

HIVE: Harnessing Human Feedback for Instructional Visual Editing
Shu Zhang, Xinyi Yang, Yihao Feng, Can Qin, Chia-Chih Chen, Ning Yu, Zeyuan Chen, Huan Wang, Silvio Savarese, Stefano Ermon, Caiming Xiong, Ran Xu
arXiv 2023. [Paper]
16 Mar 2023

P+: Extended Textual Conditioning in Text-to-Image Generation
Andrey Voynov, Qinghao Chu, Daniel Cohen-Or, Kfir Aberman
arXiv 2023. [Paper] [Project]
16 Mar 2023

Highly Personalized Text Embedding for Image Manipulation by Stable Diffusion
Inhwa Han, Serin Yang, Taesung Kwon, Jong Chul Ye
arXiv 2023. [Paper]
15 Mar 2023

Aerial Diffusion: Text Guided Ground-to-Aerial View Translation from a Single Image using Diffusion Models
Divya Kothandaraman, Tianyi Zhou, Ming Lin, Dinesh Manocha
arXiv 2023. [Paper] [Github]
15 Mar 2023

Zero-Shot Contrastive Loss for Text-Guided Diffusion Image Style Transfer
Serin Yang, Hyunmin Hwang, Jong Chul Ye
arXiv 2023. [Paper]
15 Mar 2023

Edit-A-Video: Single Video Editing with Object-Aware Consistency
Chaehun Shin, Heeseung Kim, Che Hyun Lee, Sang-gil Lee, Sungroh Yoon
arXiv 2023. [Paper] [Project]
14 Mar 2023

Editing Implicit Assumptions in Text-to-Image Diffusion Models
Hadas Orgad, Bahjat Kawar, Yonatan Belinkov
arXiv 2023. [Paper] [Project] [Github]
14 Mar 2023

Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D Generation
Junyoung Seo, Wooseok Jang, Min-Seop Kwak, Jaehoon Ko, Hyeonsu Kim, Junho Kim, Jin-Hwa Kim, Jiyoung Lee, Seungryong Kim
arXiv 2023. [Paper]
14 Mar 2023

Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models
Chenfei Wu, Shengming Yin, Weizhen Qi, Xiaodong Wang, Zecheng Tang, Nan Duan
arXiv 2023. [Paper] [Github]
8 Mar 2023

Video-P2P: Video Editing with Cross-attention Control
Shaoteng Liu, Yuechen Zhang, Wenbo Li, Zhe Lin, Jiaya Jia
arXiv 2023. [Paper] [Project]
8 Mar 2023

Erasing Concepts from Diffusion Models
Rohit Gandikota, Joanna Materzynska, Jaden Fiotto-Kaufman, David Bau
arXiv 2023. [Paper] [Project] [Github]
13 Mar 2023

One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale
Fan Bao, Shen Nie, Kaiwen Xue, Chongxuan Li, Shi Pu, Yaole Wang, Gang Yue, Yue Cao, Hang Su, Jun Zhu
arXiv 2023. [Paper] [Github]
12 Mar 2023

Cones: Concept Neurons in Diffusion Models for Customized Generation
Zhiheng Liu, Ruili Feng, Kai Zhu, Yifei Zhang, Kecheng Zheng, Yu Liu, Deli Zhao, Jingren Zhou, Yang Cao
arXiv 2023. [Paper]
9 Mar 2023

A Prompt Log Analysis of Text-to-Image Generation Systems
Yutong Xie, Zhaoying Pan, Jinge Ma, Jie Luo, Qiaozhu Mei
arXiv 2023. [Paper]
8 Mar 2023

Zeroth-Order Optimization Meets Human Feedback: Provable Learning via Ranking Oracles
Zhiwei Tang, Dmitry Rybin, Tsung-Hui Chang
arXiv 2023. [Paper] [Github]
7 Mar 2023

Unleashing Text-to-Image Diffusion Models for Visual Perception
Wenliang Zhao, Yongming Rao, Zuyan Liu, Benlin Liu, Jie Zhou, Jiwen Lu
arXiv 2023. [Paper] [Github]
3 Mar 2023

Collage Diffusion
Vishnu Sarukkai, Linden Li, Arden Ma, Christopher Ré, Kayvon Fatahalian
arXiv 2023. [Paper]
1 Mar 2023

Towards Enhanced Controllability of Diffusion Models
Wonwoong Cho, Hareesh Ravi, Midhun Harikumar, Vinh Khuc, Krishna Kumar Singh, Jingwan Lu, David I. Inouye, Ajinkya Kale
arXiv 2023. [Paper]
28 Feb 2023

Directed Diffusion: Direct Control of Object Placement through Attention Guidance
Wan-Duo Kurt Ma, J.P. Lewis, W. Bastiaan Kleijn, Thomas Leung
arXiv 2023. [Paper]
25 Feb 2023

Modulating Pretrained Diffusion Models for Multimodal Image Synthesis
Cusuh Ham, James Hays, Jingwan Lu, Krishna Kumar Singh, Zhifei Zhang, Tobias Hinz
arXiv 2023. [Paper]
24 Feb 2023

Region-Aware Diffusion for Zero-shot Text-driven Image Editing
Nisha Huang, Fan Tang, Weiming Dong, Tong-Yee Lee, Changsheng Xu
arXiv 2023. [Paper] [Github]
23 Feb 2023

Controlled and Conditional Text to Image Generation with Diffusion Prior
Pranav Aggarwal, Hareesh Ravi, Naveen Marri, Sachin Kelkar, Fengbin Chen, Vinh Khuc, Midhun Harikumar, Ritiz Tambi, Sudharshan Reddy Kakumanu, Purvak Lapsiya, Alvin Ghouas, Sarah Saber, Malavika Ramprasad, Baldo Faieta, Ajinkya Kale
arXiv 2023. [Paper]
23 Feb 2023

Reduce, Reuse, Recycle: Compositional Generation with Energy-Based Diffusion Models and MCMC
Yilun Du, Conor Durkan, Robin Strudel, Joshua B. Tenenbaum, Sander Dieleman, Rob Fergus, Jascha Sohl-Dickstein, Arnaud Doucet, Will Grathwohl
arXiv 2023. [Paper] [Project]
22 Feb 2023

Learning 3D Photography Videos via Self-supervised Diffusion on Single Images
Xiaodong Wang, Chenfei Wu, Shengming Yin, Minheng Ni, Jianfeng Wang, Linjie Li, Zhengyuan Yang, Fan Yang, Lijuan Wang, Zicheng Liu, Yuejian Fang, Nan Duan
arXiv 2023. [Paper]
21 Feb 2023

Exploring the Representation Manifolds of Stable Diffusion Through the Lens of Intrinsic Dimension
Henry Kvinge, Davis Brown, Charles Godfrey
arXiv 2023. [Paper]
16 Feb 2023

Text-driven Visual Synthesis with Latent Diffusion Prior
Ting-Hsuan Liao, Songwei Ge, Yiran Xu, Yao-Chih Lee, Badour AlBahar, Jia-Bin Huang
arXiv 2023. [Paper] [Project]
16 Feb 2023

T2I-Adapter: Learning Adapters to Dig out More Controllable Ability for Text-to-Image Diffusion Models
Chong Mou, Xintao Wang, Liangbin Xie, Jian Zhang, Zhongang Qi, Ying Shan, Xiaohu Qie
arXiv 2023. [Paper] [Github]
16 Feb 2023

MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation
Omer Bar-Tal, Lior Yariv, Yaron Lipman, Tali Dekel
arXiv 2023. [Paper] Project] [Github]
16 Feb 2023

Boundary Guided Mixing Trajectory for Semantic Control with Diffusion Models
Ye Zhu, Yu Wu, Zhiwei Deng, Olga Russakovsky, Yan Yan
arXiv 2023. [Paper]
16 Feb 2023

Dataset Interfaces: Diagnosing Model Failures Using Controllable Counterfactual Generation
Joshua Vendrow, Saachi Jain, Logan Engstrom, Aleksander Madry
arXiv 2023. [Paper] [Github]
15 Feb 2023

PRedItOR: Text Guided Image Editing with Diffusion Prior
Hareesh Ravi, Sachin Kelkar, Midhun Harikumar, Ajinkya Kale
arXiv 2023. [Paper]
15 Feb 2023

Text-Guided Scene Sketch-to-Photo Synthesis
AprilPyone MaungMaung, Makoto Shing, Kentaro Mitsui, Kei Sawada, Fumio Okura
arXiv 2023. [Paper]
14 Feb 2023

Universal Guidance for Diffusion Models
Arpit Bansal, Hong-Min Chu, Avi Schwarzschild, Soumyadip Sengupta, Micah Goldblum, Jonas Geiping, Tom Goldstein
arXiv 2023. [Paper] [Github]
14 Feb 2023

Adding Conditional Control to Text-to-Image Diffusion Models
Lvmin Zhang, Maneesh Agrawala
arXiv 2023. [Paper] [Github]
10 Feb 2023

Analyzing Multimodal Objectives Through the Lens of Generative Diffusion Guidance
Chaerin Kong, Nojun Kwak
arXiv 2023. [Paper]
10 Feb 2023

Is This Loss Informative? Speeding Up Textual Inversion with Deterministic Objective Evaluation
Anton Voronov, Mikhail Khoroshikh, Artem Babenko, Max Ryabinin
arXiv 2023. [Paper]
9 Feb 2023

Q-Diffusion: Quantizing Diffusion Models
Xiuyu Li, Long Lian, Yijiang Liu, Huanrui Yang, Zhen Dong, Daniel Kang, Shanghang Zhang, Kurt Keutzer
arXiv 2023. [Paper] [Github]
8 Feb 2023

GLAZE: Protecting Artists from Style Mimicry by Text-to-Image Models
Shawn Shan, Jenna Cryan, Emily Wenger, Haitao Zheng, Rana Hanocka, Ben Y. Zhao
arXiv 2023. [Paper]
8 Feb 2023

Zero-shot Generation of Coherent Storybook from Plain Text Story using Diffusion Models
Hyeonho Jeong, Gihyun Kwon, Jong Chul Ye
arXiv 2023. [Paper]
8 Feb 2023

Fair Diffusion: Instructing Text-to-Image Generation Models on Fairness
Felix Friedrich, Patrick Schramowski, Manuel Brack, Lukas Struppek, Dominik Hintersdorf, Sasha Luccioni, Kristian Kersting
arXiv 2023. [Paper]
7 Feb 2023

Hard Prompts Made Easy: Gradient-Based Discrete Optimization for Prompt Tuning and Discovery
Yuxin Wen, Neel Jain, John Kirchenbauer, Micah Goldblum, Jonas Geiping, Tom Goldstein
arXiv 2023. [Paper] [Github]
7 Feb 2023

Zero-shot Image-to-Image Translation
Gaurav Parmar, Krishna Kumar Singh, Richard Zhang, Yijun Li, Jingwan Lu, Jun-Yan Zhu
arXiv 2023. [Paper]
6 Feb 2023

Structure and Content-Guided Video Synthesis with Diffusion Models
Patrick Esser, Johnathan Chiu, Parmida Atighehchian, Jonathan Granskog, Anastasis Germanidis
arXiv 2023. [Paper] [Project]
6 Feb 2023

Mixture of Diffusers for scene composition and high resolution image generation
Álvaro Barbero Jiménez
arXiv 2023. [Paper] [Github]
5 Feb 2023

ReDi: Efficient Learning-Free Diffusion Inference via Trajectory Retrieval
Kexun Zhang, Xianjun Yang, William Yang Wang, Lei Li
arXiv 2023. [Paper]
5 Feb 2023

Eliminating Prior Bias for Semantic Image Editing via Dual-Cycle Diffusion
Zuopeng Yang, Tianshu Chu, Xin Lin, Erdun Gao, Daqing Liu, Jie Yang, Chaoyue Wang
arXiv 2023. [Paper]
5 Feb 2023

Semantic-Guided Image Augmentation with Pre-trained Models
Bohan Li, Xinghao Wang, Xiao Xu, Yutai Hou, Yunlong Feng, Feng Wang, Wanxiang Che
SIGGRAPH 2023. [Paper] [Project]
4 Feb 2023

TEXTure: Text-Guided Texturing of 3D Shapes
Elad Richardson, Gal Metzer, Yuval Alaluf, Raja Giryes, Daniel Cohen-Or
arXiv 2023. [Paper] [Project] [Github]
3 Feb 2023

Dreamix: Video Diffusion Models are General Video Editors
Eyal Molad, Eliahu Horwitz, Dani Valevski, Alex Rav Acha, Yossi Matias, Yael Pritch, Yaniv Leviathan, Yedid Hoshen
arXiv 2023. [Paper] [Project]
2 Feb 2023

Trash to Treasure: Using text-to-image models to inform the design of physical artefacts
Amy Smith, Hope Schroeder, Ziv Epstein, Michael Cook, Simon Colton, Andrew Lippman
AAAI 2023. [Paper]
1 Feb 2023

Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models
Hila Chefer, Yuval Alaluf, Yael Vinker, Lior Wolf, Daniel Cohen-Or
SIGGRAPH 2023. [Paper] [Project] [Github]
31 Jan 2023

Zero3D: Semantic-Driven Multi-Category 3D Shape Generation
Bo Han, Yitong Liu, Yixuan Shen
arXiv 2023. [Paper]
31 Jan 2023

Shape-aware Text-driven Layered Video Editing
Yao-Chih Lee, Ji-Ze Genevieve Jang, Yi-Ting Chen, Elizabeth Qiu, Jia-Bin Huang
arXiv 2023. [Paper] [Project]
30 Jan 2023

PromptMix: Text-to-image diffusion models enhance the performance of lightweight networks
Arian Bakhtiarnia, Qi Zhang, Alexandros Iosifidis
arXiv 2023. [Paper] [Github]
30 Jan 2023

GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis
Ming Tao, Bing-Kun Bao, Hao Tang, Changsheng Xu
CVPR 2023. [Paper] [Github]
30 Jan 2023

SEGA: Instructing Diffusion using Semantic Dimensions
Manuel Brack, Felix Friedrich, Dominik Hintersdorf, Lukas Struppek, Patrick Schramowski, Kristian Kersting
arXiv 2023. [Paper]
28 Jan 2023

Towards Equitable Representation in Text-to-Image Synthesis Models with the Cross-Cultural Understanding Benchmark (CCUB) Dataset
Zhixuan Liu, Youeun Shin, Beverley-Claire Okogwu, Youngsik Yun, Lia Coleman, Peter Schaldenbrand, Jihie Kim, Jean Oh
arXiv 2023. [Paper]
28 Jan 2023

Text-To-4D Dynamic Scene Generation
Uriel Singer, Shelly Sheynin, Adam Polyak, Oron Ashual, Iurii Makarov, Filippos Kokkinos, Naman Goyal, Andrea Vedaldi, Devi Parikh, Justin Johnson, Yaniv Taigman
arXiv 2023. [Paper]
26 Jan 2023

Guiding Text-to-Image Diffusion Model Towards Grounded Generation
Ziyi Li, Qinye Zhou, Xiaoyun Zhang, Ya Zhang, Yanfeng Wang, Weidi Xie
arXiv 2023. [Paper] [Project]
12 Jan 2023

Speech Driven Video Editing via an Audio-Conditioned Diffusion Model
Dan Bigioi, Shubhajit Basak, Hugh Jordan, Rachel McDonnell, Peter Corcoran
arXiv 2023. [Paper] [Project] [Github]
10 Jan 2023

Visual Story Generation Based on Emotion and Keywords
Yuetian Chen, Ruohua Li, Bowen Shi, Peiru Liu, Mei Si
AIIDE INT 2022. [Paper]
7 Jan 2023

DiffTalk: Crafting Diffusion Models for Generalized Talking Head Synthesis
Shuai Shen, Wenliang Zhao, Zibin Meng, Wanhua Li, Zheng Zhu, Jie Zhou, Jiwen Lu
arXiv 2023. [Paper]
10 Jan 2023

Speech Driven Video Editing via an Audio-Conditioned Diffusion Model
Dan Bigioi, Shubhajit Basak, Hugh Jordan, Rachel McDonnell, Peter Corcoran
arXiv 2023. [Paper]
10 Jan 2023

Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation
Michał Stypułkowski, Konstantinos Vougioukas, Sen He, Maciej Zięba, Stavros Petridis, Maja Pantic
arXiv 2023. [Paper] [Project]
6 Jan 2023

Muse: Text-To-Image Generation via Masked Generative Transformers
Huiwen Chang, Han Zhang, Jarred Barber, AJ Maschinot, Jose Lezama, Lu Jiang, Ming-Hsuan Yang, Kevin Murphy, William T. Freeman, Michael Rubinstein, Yuanzhen Li, Dilip Krishnan
arXiv 2023. [Paper] [Project]
2 Jan 2023

Dream3D: Zero-Shot Text-to-3D Synthesis Using 3D Shape Prior and Text-to-Image Diffusion Models
Jiale Xu, Xintao Wang, Weihao Cheng, Yan-Pei Cao, Ying Shan, Xiaohu Qie, Shenghua Gao
CVPR 2023. [Paper] [Project]
28 Dec 2022

Exploring Vision Transformers as Diffusion Learners
He Cao, Jianan Wang, Tianhe Ren, Xianbiao Qi, Yihao Chen, Yuan Yao, Lei Zhang
arXiv 2022. [Paper]
28 Dec 2022

Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
Jay Zhangjie Wu, Yixiao Ge, Xintao Wang, Weixian Lei, Yuchao Gu, Wynne Hsu, Ying Shan, Xiaohu Qie, Mike Zheng Shou
arXiv 2022. [Paper] [Project]
22 Dec 2022

Contrastive Language-Vision AI Models Pretrained on Web-Scraped Multimodal Data Exhibit Sexual Objectification Bias
Robert Wolfe, Yiwei Yang, Bill Howe, Aylin Caliskan
arXiv 2022. [Paper]
21 Dec 2022

Optimizing Prompts for Text-to-Image Generation
Yaru Hao, Zewen Chi, Li Dong, Furu Wei
arXiv 2022. [Paper] [Project] [Github]
19 Dec 2022

Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models
Qiucheng Wu, Yujian Liu, Handong Zhao, Ajinkya Kale, Trung Bui, Tong Yu, Zhe Lin, Yang Zhang, Shiyu Chang
arXiv 2022. [Paper] [Github]
16 Dec 2022

TeTIm-Eval: a novel curated evaluation data set for comparing text-to-image models
Federico A. Galatolo, Mario G. C. A. Cimino, Edoardo Cogotti
arXiv 2022. [Paper]
15 Dec 2022

The Infinite Index: Information Retrieval on Generative Text-To-Image Models
Niklas Deckers, Maik Fröbe, Johannes Kiesel, Gianluca Pandolfo, Christopher Schröder, Benno Stein, Martin Potthast
CHIIR 2023. [Paper]
14 Dec 2022

LidarCLIP or: How I Learned to Talk to Point Clouds
Georg Hess, Adam Tonderski, Christoffer Petersson, Lennart Svensson, Kalle Åström
arXiv 2022. [Paper] [Github]
13 Dec 2022

Imagen Editor and EditBench: Advancing and Evaluating Text-Guided Image Inpainting
Su Wang, Chitwan Saharia, Ceslee Montgomery, Jordi Pont-Tuset, Shai Noy, Stefano Pellegrini, Yasumasa Onoe, Sarah Laszlo, David J. Fleet, Radu Soricut, Jason Baldridge, Mohammad Norouzi, Peter Anderson, William Chan
CVPR 2023. [Paper]
13 Dec 2022

The Stable Artist: Steering Semantics in Diffusion Latent Space
Manuel Brack, Patrick Schramowski, Felix Friedrich, Dominik Hintersdorf, Kristian Kersting
arXiv 2022. [Paper]
12 Dec 2022

SmartBrush: Text and Shape Guided Object Inpainting with Diffusion Model
Shaoan Xie, Zhifei Zhang, Zhe Lin, Tobias Hinz, Kun Zhang
arXiv 2022. [Paper]
9 Dec 2022

Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis
Weixi Feng, Xuehai He, Tsu-Jui Fu, Varun Jampani, Arjun Akula, Pradyumna Narayana, Sugato Basu, Xin Eric Wang, William Yang Wang
ICLR 2023. [Paper] [Github]
9 Dec 2022

MoFusion: A Framework for Denoising-Diffusion-based Motion Synthesis
Rishabh Dabral, Muhammad Hamza Mughal, Vladislav Golyanik, Christian Theobalt
arXiv 2022. [Paper] [Project]
8 Dec 2022

SDFusion: Multimodal 3D Shape Completion, Reconstruction, and Generation
Yen-Chi Cheng, Hsin-Ying Lee, Sergey Tulyakov, Alexander Schwing, Liangyan Gui
arXiv 2022. [Paper] [Project]
8 Dec 2022

SINE: SINgle Image Editing with Text-to-Image Diffusion Models
Zhixing Zhang, Ligong Han, Arnab Ghosh, Dimitris Metaxas, Jian Ren
arXiv 2022. [Paper] [Project] [Github]
8 Dec 2022

Multi-Concept Customization of Text-to-Image Diffusion
Nupur Kumari, Bingliang Zhang, Richard Zhang, Eli Shechtman, Jun-Yan Zhu
arXiv 2022. [Paper] [Project]
8 Dec 2022

Diffusion Guided Domain Adaptation of Image Generators
Kunpeng Song, Ligong Han, Bingchen Liu, Dimitris Metaxas, Ahmed Elgammal
arXiv 2022. [Paper] [Project]
8 Dec 2022

Executing your Commands via Motion Diffusion in Latent Space
Xin Chen, Biao Jiang, Wen Liu, Zilong Huang, Bin Fu, Tao Chen, Jingyi Yu, Gang Yu
arXiv 2022. [Paper] [Project]
8 Dec 2022

Talking Head Generation with Probabilistic Audio-to-Visual Diffusion Priors
Zhentao Yu, Zixin Yin, Deyu Zhou, Duomin Wang, Finn Wong, Baoyuan Wang
arXiv 2022. [Paper] [Project]
7 Dec 2022

Magic: Multi Art Genre Intelligent Choreography Dataset and Network for 3D Dance Generation
Ronghui Li, Junfan Zhao, Yachao Zhang, Mingyang Su, Zeping Ren, Han Zhang, Xiu Li
arXiv 2022. [Paper]
7 Dec 2022

Judge, Localize, and Edit: Ensuring Visual Commonsense Morality for Text-to-Image Generation
Seongbeom Park, Suhong Moon, Jinkyu Kim
arXiv 2022. [Paper]
7 Dec 2022

NeRDi: Single-View NeRF Synthesis with Language-Guided Diffusion as General Image Priors
Congyue Deng, Chiyu “Max’’ Jiang, Charles R. Qi, Xinchen Yan, Yin Zhou, Leonidas Guibas, Dragomir Anguelov
arXiv 2022. [Paper]
6 Dec 2022

Semantic-Conditional Diffusion Networks for Image Captioning
Jianjie Luo, Yehao Li, Yingwei Pan, Ting Yao, Jianlin Feng, Hongyang Chao, Tao Mei
CVPR 2023. [Paper] [Github]
6 Dec 2022

Diffusion-SDF: Text-to-Shape via Voxelized Diffusion
Muheng Li, Yueqi Duan, Jie Zhou, Jiwen Lu
CVPR 2023. [Paper] [Project] [Github]
6 Dec 2022

ADIR: Adaptive Diffusion for Image Reconstruction
Shady Abu-Hussein, Tom Tirer, Raja Giryes
arXiv 2022. [Paper] [Project]
6 Dec 2022

M-VADER: A Model for Diffusion with Multimodal Context
Samuel Weinbach, Marco Bellagente, Constantin Eichenberg, Andrew Dai, Robert Baldock, Souradeep Nanda, Björn Deiseroth, Koen Oostermeijer, Hannah Teufel, Andres Felipe Cruz-Salinas
arXiv 2022. [Paper]
6 Dec 2022

Diffusion Video Autoencoders: Toward Temporally Consistent Face Video Editing via Disentangled Video Encoding
Gyeongman Kim, Hajin Shim, Hyunsu Kim, Yunjey Choi, Junho Kim, Eunho Yang
CVPR 2023. [Paper] [Project] [Github]
6 Dec 2022

Unite and Conquer: Cross Dataset Multimodal Synthesis using Diffusion Models
Nithin Gopalakrishnan Nair, Wele Gedara Chaminda Bandara, Vishal M. Patel
arXiv 2022. [Paper] [Project]
1 Dec 2022

Shape-Guided Diffusion with Inside-Outside Attention
Dong Huk Park, Grace Luo, Clayton Toste, Samaneh Azadi, Xihui Liu, Maka Karalashvili, Anna Rohrbach, Trevor Darrell
arXiv 2022. [Paper] [Project]
1 Dec 2022

SinDDM: A Single Image Denoising Diffusion Model
Vladimir Kulikov, Shahar Yadin, Matan Kleiner, Tomer Michaeli
arXiv 2022. [Paper] [Project]
29 Nov 2022

DATID-3D: Diversity-Preserved Domain Adaptation Using Text-to-Image Diffusion for 3D Generative Model
Gwanghyun Kim, Se Young Chun
CVPR 2023. [Paper] [Github]
29 Nov 2022

Refined Semantic Enhancement towards Frequency Diffusion for Video Captioning
Xian Zhong, Zipeng Li, Shuqin Chen, Kui Jiang, Chen Chen, Mang Ye
arXiv 2022. [Paper] [Github]
28 Nov 2022

Unified Discrete Diffusion for Simultaneous Vision-Language Generation
Minghui Hu, Chuanxia Zheng, Heliang Zheng, Tat-Jen Cham, Chaoyue Wang, Zuopeng Yang, Dacheng Tao, Ponnuthurai N. Suganthan
arXiv 2022. [Paper]
27 Nov 2022

3DDesigner: Towards Photorealistic 3D Object Generation and Editing with Text-guided Diffusion Models
Gang Li, Heliang Zheng, Chaoyue Wang, Chang Li, Changwen Zheng, Dacheng Tao
arXiv 2022. [Paper]
25 Nov 2022

SpaText: Spatio-Textual Representation for Controllable Image Generation
Omri Avrahami, Thomas Hayes, Oran Gafni, Sonal Gupta, Yaniv Taigman, Devi Parikh, Dani Lischinski, Ohad Fried, Xi Yin
CVPR 2023. [Paper] [Project]
25 Nov 2022

Sketch-Guided Text-to-Image Diffusion Models
Andrey Voynov, Kfir Aberman, Daniel Cohen-Or
arXiv 2022. [Paper] [Project]
24 Nov 2022

Shifted Diffusion for Text-to-image Generation
Yufan Zhou, Bingchen Liu, Yizhe Zhu, Xiao Yang, Changyou Chen, Jinhui Xu
CVPR 2023. [Paper]
24 Nov 2022

Make-A-Story: Visual Memory Conditioned Consistent Story Generation
Tanzila Rahman, Hsin-Ying Lee, Jian Ren, Sergey Tulyakov, Shweta Mahajan, Leonid Sigal
CVPR 2023. [Paper]
23 Nov 2022

Schrödinger’s Bat: Diffusion Models Sometimes Generate Polysemous Words in Superposition
Jennifer C. White, Ryan Cotterell
arXiv 2022. [Paper]
23 Nov 2022

EDICT: Exact Diffusion Inversion via Coupled Transformations
Bram Wallace, Akash Gokul, Nikhil Naik
arXiv 2022. [Paper] [Github]
22 Nov 2022

Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation
Narek Tumanyan, Michal Geyer, Shai Bagon, Tali Dekel
CVPR 2023. [Paper] [Github]
22 Nov 2022

Human Evaluation of Text-to-Image Models on a Multi-Task Benchmark
Vitali Petsiuk, Alexander E. Siemenn, Saisamrit Surbehera, Zad Chin, Keith Tyser, Gregory Hunter, Arvind Raghavan, Yann Hicke, Bryan A. Plummer, Ori Kerret, Tonio Buonassisi, Kate Saenko, Armando Solar-Lezama, Iddo Drori
NeurIPS Workshop 2022. [Paper]
22 Nov 2022

SinDiffusion: Learning a Diffusion Model from a Single Natural Image
Weilun Wang, Jianmin Bao, Wengang Zhou, Dongdong Chen, Dong Chen, Lu Yuan, Houqiang Li
arXiv 2022. [Paper] [Github]
22 Nov 2022

SinFusion: Training Diffusion Models on a Single Image or Video
Yaniv Nikankin, Niv Haim, Michal Irani
arXiv 2022. [Paper] [Github]
21 Nov 2022

Exploring Discrete Diffusion Models for Image Captioning
Zixin Zhu, Yixuan Wei, Jianfeng Wang, Zhe Gan, Zheng Zhang, Le Wang, Gang Hua, Lijuan Wang, Zicheng Liu, Han Hu
arXiv 2022. [Paper] [Github]
21 Nov 2022

Investigating Prompt Engineering in Diffusion Models
Sam Witteveen, Martin Andrews
NeurIPS Workshop 2022. [Paper]
21 Nov 2022

VectorFusion: Text-to-SVG by Abstracting Pixel-Based Diffusion Models
Ajay Jain, Amber Xie, Pieter Abbeel
arXiv 2022. [Paper] [Project]
21 Nov 2022

Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models
Xichen Pan, Pengda Qin, Yuhong Li, Hui Xue, Wenhu Chen
arXiv 2022. [Paper] [Github]
20 Nov 2022

DiffStyler: Controllable Dual Diffusion for Text-Driven Image Stylization
Nisha Huang, Yuxin Zhang, Fan Tang, Chongyang Ma, Haibin Huang, Yong Zhang, Weiming Dong, Changsheng Xu
arXiv 2022. [Paper]
19 Nov 2022

Magic3D: High-Resolution Text-to-3D Content Creation
Chen-Hsuan Lin, Jun Gao, Luming Tang, Towaki Takikawa, Xiaohui Zeng, Xun Huang, Karsten Kreis, Sanja Fidler, Ming-Yu Liu, Tsung-Yi Lin
CVPR 2023. [Paper] [Project]
18 Nov 2022

Invariant Learning via Diffusion Dreamed Distribution Shifts
Priyatham Kattakinda, Alexander Levine, Soheil Feizi
arXiv 2022. [Paper]
18 Nov 2022

Null-text Inversion for Editing Real Images using Guided Diffusion Models
Ron Mokady, Amir Hertz, Kfir Aberman, Yael Pritch, Daniel Cohen-Or
arXiv 2022. [Paper]
17 Nov 2022

InstructPix2Pix: Learning to Follow Image Editing Instructions
Tim Brooks, Aleksander Holynski, Alexei A. Efros
CVPR 2023. [Paper] [Project] [Github]
17 Nov 2022

Versatile Diffusion: Text, Images and Variations All in One Diffusion Model
Xingqian Xu, Zhangyang Wang, Eric Zhang, Kai Wang, Humphrey Shi
arXiv 2022. [Paper] [Github]
15 Nov 2022

Direct Inversion: Optimization-Free Text-Driven Real Image Editing with Diffusion Models
Adham Elarabawy, Harish Kamath, Samuel Denton
arXiv 2022. [Paper]
15 Nov 2022

Arbitrary Style Guidance for Enhanced Diffusion-Based Text-to-Image Generation
Zhihong Pan, Xin Zhou, Hao Tian
WACV 2023. [Paper]
14 Nov 2022

Safe Latent Diffusion: Mitigating Inappropriate Degeneration in Diffusion Models
Patrick Schramowski, Manuel Brack, Björn Deiseroth, Kristian Kersting
CVPR 2023. [Paper] [Github]
9 Nov 2022

Rickrolling the Artist: Injecting Invisible Backdoors into Text-Guided Image Generation Models
Lukas Struppek, Dominik Hintersdorf, Kristian Kersting
arXiv 2022. [Paper] [Github]
4 Nov 2022

eDiffi: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers
Yogesh Balaji, Seungjun Nah, Xun Huang, Arash Vahdat, Jiaming Song, Karsten Kreis, Miika Aittala, Timo Aila, Samuli Laine, Bryan Catanzaro, Tero Karras, Ming-Yu Liu
arXiv 2022. [Paper] [Github]
2 Nov 2022

MagicMix: Semantic Mixing with Diffusion Models
Jun Hao Liew, Hanshu Yan, Daquan Zhou, Jiashi Feng
arXiv 2022. [Paper] [Project]
28 Oct 2022

UPainting: Unified Text-to-Image Diffusion Generation with Cross-modal Guidance
Wei Li, Xue Xu, Xinyan Xiao, Jiachen Liu, Hu Yang, Guohao Li, Zhanpeng Wang, Zhifan Feng, Qiaoqiao She, Yajuan Lyu, Hua Wu
arXiv 2022. [Paper]
28 Oct 2022

How well can Text-to-Image Generative Models understand Ethical Natural Language Interventions?
Hritik Bansal, Da Yin, Masoud Monajatipoor, Kai-Wei Chang
EMNLP 2022. [Paper] [Github]
27 Oct 2022

ERNIE-ViLG 2.0: Improving Text-to-Image Diffusion Model with Knowledge-Enhanced Mixture-of-Denoising-Experts
Zhida Feng, Zhenyu Zhang, Xintong Yu, Yewei Fang, Lanxin Li, Xuyi Chen, Yuxiang Lu, Jiaxiang Liu, Weichong Yin, Shikun Feng, Yu Sun, Hao Tian, Hua Wu, Haifeng Wang
CVPR 2023. [Paper]
27 Oct 2022

DiffusionDB: A Large-scale Prompt Gallery Dataset for Text-to-Image Generative Models
Zijie J. Wang, Evan Montoya, David Munechika, Haoyang Yang, Benjamin Hoover, Duen Horng Chau
arXiv 2022. [Paper] [Project] [Github]
26 Oct 2022

Lafite2: Few-shot Text-to-Image Generation
Yufan Zhou, Chunyuan Li, Changyou Chen, Jianfeng Gao, Jinhui Xu
arXiv 2022. [Paper]
25 Oct 2022

High-Resolution Image Editing via Multi-Stage Blended Diffusion
Johannes Ackermann, Minjun Li
NeurIPS Workshop 2022. [Paper] [Github]
24 Oct 2022

Conditional Diffusion with Less Explicit Guidance via Model Predictive Control
Max W. Shen, Ehsan Hajiramezanali, Gabriele Scalia, Alex Tseng, Nathaniel Diamant, Tommaso Biancalani, Andreas Loukas
arXiv 2022. [Paper]
21 Oct 2022

A Visual Tour Of Current Challenges In Multimodal Language Models
Shashank Sonkar, Naiming Liu, Richard G. Baraniuk
arXiv 2022. [Paper]
22 Oct 2022

DiffEdit: Diffusion-based semantic image editing with mask guidance
Guillaume Couairon, Jakob Verbeek, Holger Schwenk, Matthieu Cord
ICLR 2023. [Paper]
20 Oct 2022

Diffusion Models already have a Semantic Latent Space
Mingi Kwon, Jaeseok Jeong, Youngjung Uh
ICLR 2023. [Paper] [Project]
20 Oct 2022

UniTune: Text-Driven Image Editing by Fine Tuning an Image Generation Model on a Single Image
Dani Valevski, Matan Kalman, Yossi Matias, Yaniv Leviathan
arXiv 2022. [Paper]
18 Oct 2022

Swinv2-Imagen: Hierarchical Vision Transformer Diffusion Models for Text-to-Image Generation
Ruijun Li, Weihua Li, Yi Yang, Hanyu Wei, Jianhua Jiang, Quan Bai
arXiv 2022. [Paper]
18 Oct 2022

Imagic: Text-Based Real Image Editing with Diffusion Models
Bahjat Kawar, Shiran Zada, Oran Lang, Omer Tov, Huiwen Chang, Tali Dekel, Inbar Mosseri, Michal Irani
CVPR 2023. [Paper] [Project]
17 Oct 2022

Leveraging Off-the-shelf Diffusion Model for Multi-attribute Fashion Image Manipulation
Chaerin Kong, DongHyeon Jeon, Ohjoon Kwon, Nojun Kwak
WACV 2022. [Paper]
12 Oct 2022

Unifying Diffusion Models’ Latent Space, with Applications to CycleDiffusion and Guidance
Chen Henry Wu, Fernando De la Torre
arXiv 2022. [Paper] [Github-1] [Github-2]
11 Oct 2022

Imagen Video: High Definition Video Generation with Diffusion Models
Jonathan Ho, William Chan, Chitwan Saharia, Jay Whang, Ruiqi Gao, Alexey Gritsenko, Diederik P. Kingma, Ben Poole, Mohammad Norouzi, David J. Fleet, Tim Salimans
arXiv 2022. [Paper]
5 Oct 2022

DALL-E-Bot: Introducing Web-Scale Diffusion Models to Robotics
Ivan Kapelyukh, Vitalis Vosylius, Edward Johns
IEEE RA-L 2022. [Paper]
5 Oct 2022

LDEdit: Towards Generalized Text Guided Image Manipulation via Latent Diffusion Models
Paramanand Chandramouli, Kanchana Vaishnavi Gandikota
BMVC 2022. [Paper]
5 Oct 2022

clip2latent: Text driven sampling of a pre-trained StyleGAN using denoising diffusion and CLIP
Justin N. M. Pinkney, Chuan Li
BMVC 2022. [Paper] [Github]
5 Oct 2022

Membership Inference Attacks Against Text-to-image Generation Models
Yixin Wu, Ning Yu, Zheng Li, Michael Backes, Yang Zhang
arXiv 2022. [Paper]
3 Oct 2022

Make-A-Video: Text-to-Video Generation without Text-Video Data
Uriel Singer, Adam Polyak, Thomas Hayes, Xi Yin, Jie An, Songyang Zhang, Qiyuan Hu, Harry Yang, Oron Ashual, Oran Gafni, Devi Parikh, Sonal Gupta, Yaniv Taigman
arXiv 2022. [Paper]
29 Sep 2022

DreamFusion: Text-to-3D using 2D Diffusion
Ben Poole, Ajay Jain, Jonathan T. Barron, Ben Mildenhall
arXiv 2022. [Paper] [Github]
29 Sep 2022

Re-Imagen: Retrieval-Augmented Text-to-Image Generator
Wenhu Chen, Hexiang Hu, Chitwan Saharia, William W. Cohen
arXiv 2022. [Paper]
29 Sep 2022

Creative Painting with Latent Diffusion Models
Xianchao Wu
arXiv 2022. [Paper]
29 Sep 2022

Draw Your Art Dream: Diverse Digital Art Synthesis with Multimodal Guided Diffusion
Nisha Huang, Fan Tang, Weiming Dong, Changsheng Xu
ACM MM 2022. [Paper] [Github]
27 Sep 2022

Personalizing Text-to-Image Generation via Aesthetic Gradients
Victor Gallego
NeurIPS Workshop 2022. [Paper] [Github]
25 Sep 2022

Best Prompts for Text-to-Image Models and How to Find Them
Nikita Pavlichenko, Dmitry Ustalov
NeurIPS Workshop 2022. [Paper]
23 Sep 2022

The Biased Artist: Exploiting Cultural Biases via Homoglyphs in Text-Guided Image Generation Models
Lukas Struppek, Dominik Hintersdorf, Kristian Kersting
arXiv 2022. [Paper] [Github]
19 Sep 2022

Generative Visual Prompt: Unifying Distributional Control of Pre-Trained Generative Models
Chen Henry Wu, Saman Motamed, Shaunak Srivastava, Fernando De la Torre
NeurIPS 2022. [Paper] [Github]
14 Sep 2022

ISS: Image as Stepping Stone for Text-Guided 3D Shape Generation
Zhengzhe Liu, Peng Dai, Ruihui Li, Xiaojuan Qi, Chi-Wing Fu
ICLR 2023. [Paper] [Github]
9 Sep 2022

DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
Nataniel Ruiz, Yuanzhen Li, Varun Jampani, Yael Pritch, Michael Rubinstein, Kfir Aberman
CVPR 2023. [Paper] [Project] [Github]
25 Aug 2022

Text-Guided Synthesis of Artistic Images with Retrieval-Augmented Diffusion Models
Robin Rombach, Andreas Blattmann, Björn Ommer
arXiv 2022. [Paper] [Github]
26 Jul 2022

Discrete Contrastive Diffusion for Cross-Modal and Conditional Generation
Ye Zhu, Yu Wu, Kyle Olszewski, Jian Ren, Sergey Tulyakov, Yan Yan
ICLR 2023. [Paper] [Github]
15 Jun 2022

Blended Latent Diffusion
Omri Avrahami, Ohad Fried, Dani Lischinski
ACM 2022. [Paper] [Project] [Github]
6 Jun 2022

Compositional Visual Generation with Composable Diffusion Models
Nan Liu, Shuang Li, Yilun Du, Antonio Torralba, Joshua B. Tenenbaum
ECCV 2022. [Paper] [Project] [Github]
3 Jun 2022

DiVAE: Photorealistic Images Synthesis with Denoising Diffusion Decoder
Jie Shi, Chenfei Wu, Jian Liang, Xiang Liu, Nan Duan
arXiv 2022. [Paper]
1 Jun 2022

Improved Vector Quantized Diffusion Models
Zhicong Tang, Shuyang Gu, Jianmin Bao, Dong Chen, Fang Wen
arXiv 2022. [Paper] [Github]
31 May 2022

Text2Human: Text-Driven Controllable Human Image Generation
Yuming Jiang, Shuai Yang, Haonan Qiu, Wayne Wu, Chen Change Loy, Ziwei Liu
ACM 2022. [Paper] [Github]
31 May 2022

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Chitwan Saharia, William Chan, Saurabh Saxena, Lala Li, Jay Whang, Emily Denton, Seyed Kamyar Seyed Ghasemipour, Burcu Karagol Ayan, S. Sara Mahdavi, Rapha Gontijo Lopes, Tim Salimans, Jonathan Ho, David J Fleet, Mohammad Norouzi
NeurIPS 2022. [Paper] [Github]
23 May 2022

Retrieval-Augmented Diffusion Models
Andreas Blattmann, Robin Rombach, Kaan Oktay, Björn Ommer
NeurIPS 2022. [Paper] [Github]
25 Apr 2022

Hierarchical Text-Conditional Image Generation with CLIP Latents
Aditya Ramesh, Prafulla Dhariwal, Alex Nichol, Casey Chu, Mark Chen
arXiv 2022. [Paper] [Github]
13 Apr 2022

KNN-Diffusion: Image Generation via Large-Scale Retrieval
Oron Ashual, Shelly Sheynin, Adam Polyak, Uriel Singer, Oran Gafni, Eliya Nachmani, Yaniv Taigman
ICLR 2023. [Paper]
6 Apr 2022

High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, Björn Ommer
CVPR 2022. [Paper] [Github]
20 Dec 2021

More Control for Free! Image Synthesis with Semantic Diffusion Guidance
Xihui Liu, Dong Huk Park, Samaneh Azadi, Gong Zhang, Arman Chopikyan, Yuxiao Hu, Humphrey Shi, Anna Rohrbach, Trevor Darrell
WACV 2021. [Paper] [Project]
10 Dec 2021

Vector Quantized Diffusion Model for Text-to-Image Synthesis
Shuyang Gu, Dong Chen, Jianmin Bao, Fang Wen, Bo Zhang, Dongdong Chen, Lu Yuan, Baining Guo
CVPR 2022. [Paper] [Github]
29 Nov 2021

Blended Diffusion for Text-driven Editing of Natural Images
Omri Avrahami, Dani Lischinski, Ohad Fried
CVPR 2022. [Paper] [Project] [Github]
29 Nov 2021

Tackling the Generative Learning Trilemma with Denoising Diffusion GANs
Zhisheng Xiao, Karsten Kreis, Arash Vahdat
ICLR 2022 (Spotlight). [Paper] [Project]
15 Dec 2021

DiffusionCLIP: Text-guided Image Manipulation Using Diffusion Models
Gwanghyun Kim, Jong Chul Ye
CVPR 2022. [Paper] [Github]
6 Oct 2021

3D Vision

Text-to-3D with Classifier Score Distillation
Xin Yu, Yuan-Chen Guo, Yangguang Li, Ding Liang, Song-Hai Zhang, Xiaojuan Qi
arXiv 2023. [Paper]
30 Oct 2023

Controllable Group Choreography using Contrastive Diffusion
Nhat Le, Tuong Do, Khoa Do, Hien Nguyen, Erman Tjiputra, Quang D. Tran, Anh Nguyen
ACM ToG 2023. [Paper]
29 Oct 2023

SE(3) Diffusion Model-based Point Cloud Registration for Robust 6D Object Pose Estimation
Haobo Jiang, Mathieu Salzmann, Zheng Dang, Jin Xie, Jian Yang
arXiv 2023. [Paper]
26 Oct 2023

6-DoF Stability Field via Diffusion Models
Takuma Yoneda, Tianchong Jiang, Gregory Shakhnarovich, Matthew R. Walter
arXiv 2023. [Paper]
26 Oct 2023

DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
Jingxiang Sun, Bo Zhang, Ruizhi Shao, Lizhen Wang, Wen Liu, Zhenda Xie, Yebin Liu
arXiv 2023. [Paper]
25 Oct 2023

DiffRef3D: A Diffusion-based Proposal Refinement Framework for 3D Object Detection
Se-Ho Kim, Inyong Koo, Inyoung Lee, Byeongjun Park, Changick Kim
arXiv 2023. [Paper]
25 Oct 2023

iNVS: Repurposing Diffusion Inpainters for Novel View Synthesis
Yash Kant, Aliaksandr Siarohin, Michael Vasilkovsky, Riza Alp Guler, Jian Ren, Sergey Tulyakov, Igor Gilitschenski
SIGGRAPH ASIA 2023. [Paper] [Project]
24 Oct 2023

Wonder3D: Single Image to 3D using Cross-Domain Diffusion
Xiaoxiao Long, Yuan-Chen Guo, Cheng Lin, Yuan Liu, Zhiyang Dou, Lingjie Liu, Yuexin Ma, Song-Hai Zhang, Marc Habermann, Christian Theobalt, Wenping Wang
arXiv 2023. [Paper]
23 Oct 2023

MAS: Multi-view Ancestral Sampling for 3D motion generation using 2D diffusion
Roy Kapon, Guy Tevet, Daniel Cohen-Or, Amit H. Bermano
arXiv 2023. [Paper]
23 Oct 2023

High-Quality 3D Face Reconstruction with Affine Convolutional Networks
Zhiqian Lin, Jiangke Lin, Lincheng Li, Yi Yuan, Zhengxia Zou
arXiv 2023. [Paper]
22 Oct 2023

TexFusion: Synthesizing 3D Textures with Text-Guided Image Diffusion Models
Tianshi Cao, Karsten Kreis, Sanja Fidler, Nicholas Sharp, Kangxue Yin
arXiv 2023. [Paper]
20 Oct 2023

Conditional Generative Modeling for Images, 3D Animations, and Video
Vikram Voleti
arXiv 2023. [Paper]
19 Oct 2023

TapMo: Shape-aware Motion Generation of Skeleton-free Characters
Jiaxu Zhang, Shaoli Huang, Zhigang Tu, Xin Chen, Xiaohang Zhan, Gang Yu, Ying Shan
arXiv 2023. [Paper]
19 Oct 2023

Enhancing High-Resolution 3D Generation through Pixel-wise Gradient Clipping
Zijie Pan, Jiachen Lu, Xiatian Zhu, Li Zhang
arXiv 2023. [Paper]
19 Oct 2023

Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts
Xinhua Cheng, Tianyu Yang, Jianan Wang, Yu Li, Lei Zhang, Jian Zhang, Li Yuan
arXiv 2023. [Paper]
18 Oct 2023

3D Structure-guided Network for Tooth Alignment in 2D Photograph
Yulong Dou, Lanzhuju Mei, Dinggang Shen, Zhiming Cui
arXiv 2023. [Paper]
17 Oct 2023

DynVideo-E: Harnessing Dynamic NeRF for Large-Scale Motion- and View-Change Human-Centric Video Editing
Jia-Wei Liu, Yan-Pei Cao, Jay Zhangjie Wu, Weijia Mao, Yuchao Gu, Rui Zhao, Jussi Keppo, Ying Shan, Mike Zheng Shou
arXiv 2023. [Paper]
16 Oct 2023

ConsistNet: Enforcing 3D Consistency for Multi-view Images Diffusion
Jiayu Yang, Ziang Cheng, Yunfei Duan, Pan Ji, Hongdong Li
arXiv 2023. [Paper]
16 Oct 2023

PaintHuman: Towards High-fidelity Text-to-3D Human Texturing via Denoised Score Distillation
Jianhui Yu, Hao Zhu, Liming Jiang, Chen Change Loy, Weidong Cai, Wayne Wu
arXiv 2023. [Paper]
14 Oct 2023

OmniControl: Control Any Joint at Any Time for Human Motion Generation
Yiming Xie, Varun Jampani, Lei Zhong, Deqing Sun, Huaizu Jiang
arXiv 2023. [Paper] [Project]
12 Oct 2023

Consistent123: Improve Consistency for One Image to 3D Object Synthesis
Haohan Weng, Tianyu Yang, Jianan Wang, Yu Li, Tong Zhang, C. L. Philip Chen, Lei Zhang
arXiv 2023. [Paper] [Project]
12 Oct 2023

What Does Stable Diffusion Know about the 3D Scene?
Guanqi Zhan, Chuanxia Zheng, Weidi Xie, Andrew Zisserman
arXiv 2023. [Paper]
10 Oct 2023

HiFi-123: Towards High-fidelity One Image to 3D Content Generation
Wangbo Yu, Li Yuan, Yan-Pei Cao, Xiangjun Gao, Xiaoyu Li, Long Quan, Ying Shan, Yonghong Tian
arXiv 2023. [Paper]
10 Oct 2023

IPDreamer: Appearance-Controllable 3D Object Generation with Image Prompts
Bohan Zeng, Shanglin Li, Yutang Feng, Hong Li, Sicheng Gao, Jiaming Liu, Huaxia Li, Xu Tang, Jianzhuang Liu, Baochang Zhang
arXiv 2023. [Paper]
9 Oct 2023

DragD3D: Vertex-based Editing for Realistic Mesh Deformations using 2D Diffusion Priors
Tianhao Xie, Eugene Belilovsky, Sudhir Mudur, Tiberiu Popa
arXiv 2023. [Paper]
6 Oct 2023

Ctrl-Room: Controllable Text-to-3D Room Meshes Generation with Layout Constraints
Chuan Fang, Xiaotao Hu, Kunming Luo, Ping Tan
arXiv 2023. [Paper]
5 Oct 2023

FreeReg: Image-to-Point Cloud Registration Leveraging Pretrained Diffusion Models and Monocular Depth Estimators
Haiping Wang, Yuan Liu, Bing Wang, Yujing Sun, Zhen Dong, Wenping Wang, Bisheng Yang
arXiv 2023. [Paper]
5 Oct 2023

Consistent-1-to-3: Consistent Image to 3D View Synthesis via Geometry-aware Diffusion Models
Jianglong Ye, Peng Wang, Kejie Li, Yichun Shi, Heng Wang
arXiv 2023. [Paper] [Project]
4 Oct 2023

Efficient-3DiM: Learning a Generalizable Single-image Novel-view Synthesizer in One Day
Yifan Jiang, Hao Tang, Jen-Hao Rick Chang, Liangchen Song, Zhangyang Wang, Liangliang Cao
arXiv 2023. [Paper]
4 Oct 2023

T$^3$Bench: Benchmarking Current Progress in Text-to-3D Generation
Yuze He, Yushi Bai, Matthieu Lin, Wang Zhao, Yubin Hu, Jenny Sheng, Ran Yi, Juanzi Li, Yong-Jin Liu
arXiv 2023. [Paper] [Project] [Github]
4 Oct 2023

ED-NeRF: Efficient Text-Guided Editing of 3D Scene using Latent Space NeRF
Jangho Park, Gihyun Kwon, Jong Chul Ye
arXiv 2023. [Paper]
4 Oct 2023

MagicDrive: Street View Generation with Diverse 3D Geometry Control
Ruiyuan Gao, Kai Chen, Enze Xie, Lanqing Hong, Zhenguo Li, Dit-Yan Yeung, Qiang Xu
arXiv 2023. [Paper] [Project]
4 Oct 2023

SweetDreamer: Aligning Geometric Priors in 2D Diffusion for Consistent Text-to-3D
Weiyu Li, Rui Chen, Xuelin Chen, Ping Tan
arXiv 2023. [Paper] [Project]
4 Oct 2023

Hierarchical Generation of Human-Object Interactions with Diffusion Probabilistic Models
Huaijin Pi, Sida Peng, Minghui Yang, Xiaowei Zhou, Hujun Bao
arXiv 2023. [Paper] [Project] [Github]
3 Oct 2023

HumanNorm: Learning Normal Diffusion Model for High-quality and Realistic 3D Human Generation
Xin Huang, Ruizhi Shao, Qi Zhang, Hongwen Zhang, Ying Feng, Yebin Liu, Qing Wang
arXiv 2023. [Paper] [Project]
2 Oct 2023

Diffusion Posterior Illumination for Ambiguity-aware Inverse Rendering
Linjie Lyu, Ayush Tewari, Marc Habermann, Shunsuke Saito, Michael Zollhöfer, Thomas Leimkühler, Christian Theobalt
arXiv 2023. [Paper]
30 Sep 2023

EPiC-ly Fast Particle Cloud Generation with Flow-Matching and Diffusion
Erik Buhmann, Cedric Ewen, Darius A. Faroughy, Tobias Golling, Gregor Kasieczka, Matthew Leigh, Guillaume Quétant, John Andrew Raine, Debajyoti Sengupta, David Shih
arXiv 2023. [Paper]
29 Sep 2023

Consistent123: One Image to Highly Consistent 3D Asset Using Case-Aware Diffusion Priors
Yukang Lin, Haonan Han, Chaoqun Gong, Zunnan Xu, Yachao Zhang, Xiu Li
arXiv 2023. [Paper]
29 Sep 2023

Object Motion Guided Human Motion Synthesis
Jiaman Li, Jiajun Wu, C. Karen Liu
arXiv 2023. [Paper]
28 Sep 2023

ITEM3D: Illumination-Aware Directional Texture Editing for 3D Models
Shengqi Liu, Zhuo Chen, Jingnan Gao, Yichao Yan, Wenhan Zhu, Xiaobo Li, Ke Gao, Jiangjing Lyu, Xiaokang Yang
arXiv 2023. [Paper]
26 Sep 2023

Light Field Diffusion for Single-View Novel View Synthesis
Yifeng Xiong, Haoyu Ma, Shanlin Sun, Kun Han, Xiaohui Xie
arXiv 2023. [Paper]
20 Sep 2023

Latent Diffusion Models for Structural Component Design
Ethan Herron, Jaydeep Rade, Anushrut Jignasu, Baskar Ganapathysubramanian, Aditya Balu, Soumik Sarkar, Adarsh Krishnamurthy
arXiv 2023. [Paper]
20 Sep 2023

FaceDiffuser: Speech-Driven 3D Facial Animation Synthesis Using Diffusion
Stefan Stan, Kazi Injamamul Haque, Zerrin Yumak
arXiv 2023. [Paper]
20 Sep 2023

TwinTex: Geometry-aware Texture Generation for Abstracted 3D Architectural Models
Weidan Xiong, Hongqian Zhang, Botao Peng, Ziyu Hu, Yongli Wu, Jianwei Guo, Hui Huang
SIGGRAPH ASIA 2023. [Paper]
20 Sep 2023

Language-Conditioned Affordance-Pose Detection in 3D Point Clouds
Toan Nguyen, Minh Nhat Vu, Baoru Huang, Tuan Van Vo, Vy Truong, Ngan Le, Thieu Vo, Bac Le, Anh Nguyen
arXiv 2023. [Paper]
19 Sep 2023

Large Intestine 3D Shape Refinement Using Point Diffusion Models for Digital Phantom Generation
Kaouther Mouheb, Mobina Ghojogh Nejad, Lavsen Dahal, Ehsan Samei, W. Paul Segars, Joseph Y. Lo
arXiv 2023. [Paper]
15 Sep 2023

Unsupervised Disentangling of Facial Representations with 3D-aware Latent Diffusion Models
Ruian He, Zhen Xing, Weimin Tan, Bo Yan
arXiv 2023. [Paper]
15 Sep 2023

M3Dsynth: A dataset of medical 3D images with AI-generated local manipulations
Giada Zingarini, Davide Cozzolino, Riccardo Corvi, Giovanni Poggi, Luisa Verdoliva
arXiv 2023. [Paper]
14 Sep 2023

Large-Vocabulary 3D Diffusion Model with Transformer
Ziang Cao, Fangzhou Hong, Tong Wu, Liang Pan, Ziwei Liu
arXiv 2023. [Paper] [Project] [Github]
14 Sep 2023

UnifiedGesture: A Unified Gesture Synthesis Model for Multiple Skeletons
Sicheng Yang, Zilin Wang, Zhiyong Wu, Minglei Li, Zhensong Zhang, Qiaochu Huang, Lei Hao, Songcen Xu, Xiaofei Wu, changpeng yang, Zonghong Dai
ACM MM 2023. [Paper]
13 Sep 2023

Fg-T2M: Fine-Grained Text-Driven Human Motion Generation via Diffusion Model
Yin Wang, Zhiying Leng, Frederick W. B. Li, Shun-Cheng Wu, Xiaohui Liang
ICCV 2023. [Paper]
12 Sep 2023

SyncDreamer: Generating Multiview-consistent Images from a Single-view Image
Yuan Liu, Cheng Lin, Zijiao Zeng, Xiaoxiao Long, Lingjie Liu, Taku Komura, Wenping Wang
arXiv 2023. [Paper] [Project] [Github]
7 Sep 2023

SADIR: Shape-Aware Diffusion Models for 3D Image Reconstruction
Nivetha Jayakumar, Tonmoy Hossain, Miaomiao Zhang
arXiv 2023. [Paper]
6 Sep 2023

MCM: Multi-condition Motion Synthesis Framework for Multi-scenario
Zeyu Ling, Bo Han, Yongkang Wong, Mohan Kangkanhalli, Weidong Geng
arXiv 2023. [Paper]
6 Sep 2023

DiverseMotion: Towards Diverse Human Motion Generation via Discrete Diffusion
Yunhong Lou, Linchao Zhu, Yaxiong Wang, Xiaohan Wang, Yi Yang
AAAI 2024. [Paper]
4 Sep 2023

BuilDiff: 3D Building Shape Generation using Single-Image Conditional Point Cloud Diffusion Models
Yao Wei, George Vosselman, Michael Ying Yang
ICCV Workshop 2023. [Paper]
31 Aug 2023

MVDream: Multi-view Diffusion for 3D Generation
Yichun Shi, Peng Wang, Jianglong Ye, Mai Long, Kejie Li, Xiao Yang
arXiv 2023. [Paper]
31 Aug 2023

Diffusion Inertial Poser: Human Motion Reconstruction from Arbitrary Sparse IMU Configurations
Tom Van Wouwe, Seunghwan Lee, Antoine Falisse, Scott Delp, C. Karen Liu
arXiv 2023. [Paper]
31 Aug 2023

InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion
Sirui Xu, Zhengyuan Li, Yu-Xiong Wang, Liang-Yan Gui
ICCV 2023. [Paper] [Project] [Github]
31 Aug 2023

Priority-Centric Human Motion Generation in Discrete Latent Space
Hanyang Kong, Kehong Gong, Dongze Lian, Michael Bi Mi, Xinchao Wang
arXiv 2023. [Paper]
28 Aug 2023

HoloFusion: Towards Photo-realistic 3D Generative Modeling
Animesh Karnewar, Niloy J. Mitra, Andrea Vedaldi, David Novotny
ICCV 2023. [Paper] [Project]
28 Aug 2023

Unaligned 2D to 3D Translation with Conditional Vector-Quantized Code Diffusion using Transformers
Abril Corona-Figueroa, Sam Bond-Taylor, Neelanjan Bhowmik, Yona Falinie A. Gaus, Toby P. Breckon, Hubert P. H. Shum, Chris G. Willcocks
ICCV 2023. [Paper]
27 Aug 2023

Sparse3D: Distilling Multiview-Consistent Diffusion for Object Reconstruction from Sparse Views
Zi-Xin Zou, Weihao Cheng, Yan-Pei Cao, Shi-Sheng Huang, Ying Shan, Song-Hai Zhang
arXiv 2023. [Paper]
27 Aug 2023

Multi-plane denoising diffusion-based dimensionality expansion for 2D-to-3D reconstruction of microstructures with harmonized sampling
Kang-Hyun Lee, Gun Jin Yun
arXiv 2023. [Paper]
27 Aug 2023

The DiffuseStyleGesture+ entry to the GENEA Challenge 2023
Sicheng Yang, Haiwei Xue, Zhensong Zhang, Minglei Li, Zhiyong Wu, Xiaofei Wu, Songcen Xu, Zonghong Dai
ICMI 2023. [Paper] [Github]
26 Aug 2023

Distribution-Aligned Diffusion for Human Mesh Recovery
Lin Geng Foo, Jia Gong, Hossein Rahmani, Jun Liu
ICCV 2023. [Paper] [Project]
25 Aug 2023

EfficientDreamer: High-Fidelity and Robust 3D Creation via Orthogonal-view Diffusion Prior
Minda Zhao, Chaoyi Zhao, Xinyue Liang, Lincheng Li, Zeng Zhao, Zhipeng Hu, Changjie Fan, Xin Yu
arXiv 2023. [Paper]
25 Aug 2023

DF-3DFace: One-to-Many Speech Synchronized 3D Face Animation with Diffusion
Se Jin Park, Joanna Hong, Minsu Kim, Yong Man Ro
arXiv 2023. [Paper]
23 Aug 2023

LongDanceDiff: Long-term Dance Generation with Conditional Diffusion Model
Siqi Yang, Zejun Yang, Zhisheng Wang
arXiv 2023. [Paper]
23 Aug 2023

IT3D: Improved Text-to-3D Generation with Explicit View Synthesis
Yiwen Chen, Chi Zhang, Xiaofeng Yang, Zhongang Cai, Gang Yu, Lei Yang, Guosheng Lin
arXiv 2023. [Paper] [Github]
22 Aug 2023

Texture Generation on 3D Meshes with Point-UV Diffusion
Xin Yu, Peng Dai, Wenbo Li, Lan Ma, Zhengzhe Liu, Xiaojuan Qi
ICCV 2023. [Paper]
21 Aug 2023

Physics-Guided Human Motion Capture with Pose Probability Modeling
Jingyi Ju, Buzhen Huang, Chen Zhu, Zhihao Li, Yangang Wang
IJCAI 2023. [Paper] [Github]
19 Aug 2023

Unsupervised 3D Pose Estimation with Non-Rigid Structure-from-Motion Modeling
Haorui Ji, Hui Deng, Yuchao Dai, Hongdong Li
arXiv 2023. [Paper]
18 Aug 2023

MATLABER: Material-Aware Text-to-3D via LAtent BRDF auto-EncodeR
Xudong Xu, Zhaoyang Lyu, Xingang Pan, Bo Dai
arXiv 2023. [Paper] [Project]
18 Aug 2023

O^2-Recon: Completing 3D Reconstruction of Occluded Objects in the Scene with a Pre-trained 2D Diffusion Model
Yubin Hu, Sheng Ye, Wang Zhao, Matthieu Lin, Yuze He, Yu-Hui Wen, Ying He, Yong-Jin Liu
arXiv 2023. [Paper]
18 Aug 2023

Denoising Diffusion for 3D Hand Pose Estimation from Images
Maksym Ivashechkin, Oscar Mendez, Richard Bowden
arXiv 2023. [Paper]
18 Aug 2023

PoSynDA: Multi-Hypothesis Pose Synthesis Domain Adaptation for Robust 3D Human Pose Estimation
Hanbing Liu, Jun-Yan He, Zhi-Qi Cheng, Wangmeng Xiang, Qize Yang, Wenhao Chai, Gaoang Wang, Xu Bao, Bin Luo, Yifeng Geng, Xuansong Xie
ACM MM 2023. [Paper] [Github]
18 Aug 2023

Guide3D: Create 3D Avatars from Text and Image Guidance
Yukang Cao, Yan-Pei Cao, Kai Han, Ying Shan, Kwan-Yee K. Wong
arXiv 2023. [Paper]
18 Aug 2023

DMCVR: Morphology-Guided Diffusion Model for 3D Cardiac Volume Reconstruction
Xiaoxiao He, Chaowei Tan, Ligong Han, Bo Liu, Leon Axel, Kang Li, Dimitris N. Metaxas
MICCAI 2023. [Paper] [Github]
18 Aug 2023

HumanLiff: Layer-wise 3D Human Generation with Diffusion Model
Shoukang Hu, Fangzhou Hong, Tao Hu, Liang Pan, Haiyi Mei, Weiye Xiao, Lei Yang, Ziwei Liu
arXiv 2023. [Paper] [Project]
18 Aug 2023

Watch Your Steps: Local Image and Scene Editing by Text Instructions
Ashkan Mirzaei, Tristan Aumentado-Armstrong, Marcus A. Brubaker, Jonathan Kelly, Alex Levinshtein, Konstantinos G. Derpanis, Igor Gilitschenski
arXiv 2023. [Paper] [Project]
17 Aug 2023

TeCH: Text-guided Reconstruction of Lifelike Clothed Humans
Yangyi Huang, Hongwei Yi, Yuliang Xiu, Tingting Liao, Jiaxiang Tang, Deng Cai, Justus Thies
arXiv 2023. [Paper] [Project] [Github]
16 Aug 2023

CCD-3DR: Consistent Conditioning in Diffusion for Single-Image 3D Reconstruction
Yan Di, Chenyangguang Zhang, Pengyuan Wang, Guangyao Zhai, Ruida Zhang, Fabian Manhardt, Benjamin Busam, Xiangyang Ji, Federico Tombari
arXiv 2023. [Paper]
15 Aug 2023

Dancing Avatar: Pose and Text-Guided Human Motion Videos Synthesis with Image Diffusion Model
Bosheng Qin, Wentao Ye, Qifan Yu, Siliang Tang, Yueting Zhuang
arXiv 2023. [Paper]
15 Aug 2023

3D Scene Diffusion Guidance using Scene Graphs
Mohammad Naanaa, Katharina Schmid, Yinyu Nie
arXiv 2023. [Paper]
8 Aug 2023

Cloth2Tex: A Customized Cloth Texture Generation Pipeline for 3D Virtual Try-On
Daiheng Gao, Xu Chen, Xindi Zhang, Qi Wang, Ke Sun, Bang Zhang, Liefeng Bo, Qixing Huang
arXiv 2023. [Paper]
8 Aug 2023

AvatarVerse: High-quality & Stable 3D Avatar Creation from Text and Pose
Huichao Zhang, Bowen Chen, Hao Yang, Liao Qu, Xu Wang, Li Chen, Chao Long, Feida Zhu, Kang Du, Min Zheng
arXiv 2023. [Paper] [Project]
7 Aug 2023

Generative Approach for Probabilistic Human Mesh Recovery using Diffusion Models
Hanbyel Cho, Junmo Kim
ICCV Workshop 2023. [Paper] [Github]
5 Aug 2023

DiffDance: Cascaded Human Motion Diffusion Model for Dance Generation
Qiaosong Qi, Le Zhuo, Aixi Zhang, Yue Liao, Fei Fang, Si Liu, Shuicheng Yan
ACM MM 2023. [Paper]
5 Aug 2023

Sketch and Text Guided Diffusion Model for Colored Point Cloud Generation
Zijie Wu, Yaonan Wang, Mingtao Feng, He Xie, Ajmal Mian
arXiv 2023. [Paper]
5 Aug 2023

On the Transition from Neural Representation to Symbolic Knowledge
Junyan Cheng, Peter Chin
arXiv 2023. [Paper]
3 Aug 2023

Synthesizing Long-Term Human Motions with Diffusion Models via Coherent Sampling
Zhao Yang, Bing Su, Ji-Rong Wen
ACM MM 2023. [Paper] [Github]
3 Aug 2023

HD-Fusion: Detailed Text-to-3D Generation Leveraging Multiple Noise Estimation
Jinbo Wu, Xiaobo Gao, Xing Liu, Zhengyang Shen, Chen Zhao, Haocheng Feng, Jingtuo Liu, Errui Ding
arXiv 2023. [Paper]
30 Jul 2023

TransFusion: A Practical and Effective Transformer-based Diffusion Model for 3D Human Motion Prediction
Sibo Tian, Minghui Zheng, Xiao Liang
arXiv 2023. [Paper]
30 Jul 2023

TEDi: Temporally-Entangled Diffusion for Long-Term Motion Synthesis
Zihan Zhang, Richard Liu, Kfir Aberman, Rana Hanocka
arXiv 2023. [Paper]
27 Jul 2023

Points-to-3D: Bridging the Gap between Sparse Points and Shape-Controllable Text-to-3D Generation
Chaohui Yu, Qiang Zhou, Jingliang Li, Zhe Zhang, Zhibin Wang, Fan Wang
arXiv 2023. [Paper]
26 Jul 2023

Fake It Without Making It: Conditioned Face Generation for Accurate 3D Face Shape Estimation
Will Rowan, Patrik Huber, Nick Pears, Andrew Keeling
arXiv 2023. [Paper]
25 Jul 2023

NIFTY: Neural Object Interaction Fields for Guided Human Motion Synthesis
Nilesh Kulkarni, Davis Rempe, Kyle Genova, Abhijit Kundu, Justin Johnson, David Fouhey, Leonidas Guibas
arXiv 2023. [Paper] [Project]
14 Jul 2023

AvatarFusion: Zero-shot Generation of Clothing-Decoupled 3D Avatars Using 2D Diffusion
Shuo Huang, Zongxin Yang, Liangting Li, Yi Yang, Jia Jia
arXiv 2023. [Paper]
13 Jul 2023

Articulated 3D Head Avatar Generation using Text-to-Image Diffusion Models
Alexander W. Bergman, Wang Yifan, Gordon Wetzstein
arXiv 2023. [Paper]
10 Jul 2023

Back to Optimization: Diffusion-based Zero-Shot 3D Human Pose Estimation
Zhongyu Jiang, Zhuoran Zhou, Lei Li, Wenhao Chai, Cheng-Yen Yang, Jenq-Neng Hwang
arXiv 2023. [Paper]
7 Jul 2023

AutoDecoding Latent 3D Diffusion Models
Evangelos Ntavelis, Aliaksandr Siarohin, Kyle Olszewski, Chaoyang Wang, Luc Van Gool, Sergey Tulyakov
arXiv 2023. [Paper]
7 Jul 2023

SVDM: Single-View Diffusion Model for Pseudo-Stereo 3D Object Detection
Yuguang Shi
arXiv 2023. [Paper]
5 Jul 2023

DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation
Shentong Mo, Enze Xie, Ruihang Chu, Lewei Yao, Lanqing Hong, Matthias Nießner, Zhenguo Li
arXiv 2023. [Paper]
4 Jul 2023

Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors
Guocheng Qian, Jinjie Mai, Abdullah Hamdi, Jian Ren, Aliaksandr Siarohin, Bing Li, Hsin-Ying Lee, Ivan Skorokhodov, Peter Wonka, Sergey Tulyakov, Bernard Ghanem
arXiv 2023. [Paper] [Project]
30 Jun 2023

Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation
Zibo Zhao, Wen Liu, Xin Chen, Xianfang Zeng, Rui Wang, Pei Cheng, Bin Fu, Tao Chen, Gang Yu, Shenghua Gao
arXiv 2023. [Paper]
29 Jun 2023

DiffComplete: Diffusion-based Generative 3D Shape Completion
Ruihang Chu, Enze Xie, Shentong Mo, Zhenguo Li, Matthias Nießner, Chi-Wing Fu, Jiaya Jia
arXiv 2023. [Paper]
28 Jun 2023

DreamTime: An Improved Optimization Strategy for Text-to-3D Content Creation
Yukun Huang, Jianan Wang, Yukai Shi, Xianbiao Qi, Zheng-Jun Zha, Lei Zhang
arXiv 2023. [Paper]
21 Jun 2023

EMoG: Synthesizing Emotive Co-speech 3D Gesture with Diffusion Model
Lianying Yin, Yijun Wang, Tianyu He, Jinming Liu, Wei Zhao, Bohan Li, Xin Jin, Jianxin Lin
arXiv 2023. [Paper]
20 Jun 2023

Point-Cloud Completion with Pretrained Text-to-image Diffusion Models
Yoni Kasten, Ohad Rahamim, Gal Chechik
arXiv 2023. [Paper]
18 Jun 2023

AvatarBooth: High-Quality and Customizable 3D Human Avatar Generation
Yifei Zeng, Yuanxun Lu, Xinya Ji, Yao Yao, Hao Zhu, Xun Cao
arXiv 2023. [Paper]
16 Jun 2023

Edit-DiffNeRF: Editing 3D Neural Radiance Fields using 2D Diffusion Model
Lu Yu, Wei Xiang, Kang Han
arXiv 2023. [Paper]
15 Jun 2023

Adding 3D Geometry Control to Diffusion Models
Wufei Ma, Qihao Liu, Jiahao Wang, Angtian Wang, Yaoyao Liu, Adam Kortylewski, Alan Yuille
arXiv 2023. [Paper]
13 Jun 2023

Viewset Diffusion: (0-)Image-Conditioned 3D Generative Models from 2D Data
Stanislaw Szymanowicz, Christian Rupprecht, Andrea Vedaldi
arXiv 2023. [Paper]
13 Jun 2023

3D molecule generation by denoising voxel grids
Pedro O. Pinheiro, Joshua Rackers, Joseph Kleinhenz, Michael Maser, Omar Mahmood, Andrew Martin Watkins, Stephen Ra, Vishnu Sresht, Saeed Saremi
arXiv 2023. [Paper]
13 Jun 2023

InstructP2P: Learning to Edit 3D Point Clouds with Text Instructions
Jiale Xu, Xintao Wang, Yan-Pei Cao, Weihao Cheng, Ying Shan, Shenghua Gao
arXiv 2023. [Paper]
12 Jun 2023

RePaint-NeRF: NeRF Editting via Semantic Masks and Diffusion Models
Xingchen Zhou, Ying He, F. Richard Yu, Jianqiang Li, You Li
arXiv 2023. [Paper]
9 Jun 2023

Stochastic Multi-Person 3D Motion Forecasting
Sirui Xu, Yu-Xiong Wang, Liang-Yan Gui
arXiv 2023. [Paper]
8 Jun 2023

ARTIC3D: Learning Robust Articulated 3D Shapes from Noisy Web Image Collections
Chun-Han Yao, Amit Raj, Wei-Chih Hung, Yuanzhen Li, Michael Rubinstein, Ming-Hsuan Yang, Varun Jampani
arXiv 2023. [Paper]
7 Jun 2023

Synthesizing realistic sand assemblies with denoising diffusion in latent space
Nikolaos N. Vlassis, WaiChing Sun, Khalid A. Alshibli, Richard A. Regueiro
arXiv 2023. [Paper]
7 Jun 2023

AvatarStudio: Text-driven Editing of 3D Dynamic Human Head Avatars
Mohit Mendiratta, Xingang Pan, Mohamed Elgharib, Kartik Teotia, Mallikarjun B R, Ayush Tewari, Vladislav Golyanik, Adam Kortylewski, Christian Theobalt
arXiv 2023. [Paper]
1 Jun 2023

DiffRoom: Diffusion-based High-Quality 3D Room Reconstruction and Generation
Xiaoliang Ju, Zhaoyang Huang, Yijin Li, Guofeng Zhang, Yu Qiao, Hongsheng Li
arXiv 2023. [Paper]
1 Jun 2023

Controllable Motion Diffusion Model
Yi Shi, Jingbo Wang, Xuekun Jiang, Bo Dai
arXiv 2023. [Paper] Project]
1 Jun 2023

FDNeRF: Semantics-Driven Face Reconstruction, Prompt Editing and Relighting with Diffusion Models
Hao Zhang, Yanbo Xu, Tianyuan Dai, Yu-Wing, Tai Chi-Keung Tang
arXiv 2023. [Paper]
1 Jun 2023

Learning Explicit Contact for Implicit Reconstruction of Hand-held Objects from Monocular Images
Junxing Hu, Hongwen Zhang, Zerui Chen, Mengcheng Li, Yunlong Wang, Yebin Liu, Zhenan Sun
arXiv 2023. [Paper] [Project]
31 May 2023

StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity 3D Avatar Generation
Chi Zhang, Yiwen Chen, Yijun Fu, Zhenglin Zhou, Gang YU, Billzb Wang, Bin Fu, Tao Chen, Guosheng Lin, Chunhua Shen
arXiv 2023. [Paper]
30 May 2023

HiFA: High-fidelity Text-to-3D with Advanced Diffusion Guidance
Junzhe Zhu, Peiye Zhuang
arXiv 2023. [Paper]
30 May 2023

Conditional Diffusion Models for Semantic 3D Medical Image Synthesis
Zolnamar Dorjsembe, Hsing-Kuo Pao, Sodtavilan Odonchimed, Furen Xiao
arXiv 2023. [Paper]
29 May 2023

ZeroAvatar: Zero-shot 3D Avatar Generation from a Single Image
Zhenzhen Weng, Zeyu Wang, Serena Yeung
arXiv 2023. [Paper]
25 May 2023

NAP: Neural 3D Articulation Prior
Jiahui Lei, Congyue Deng, Bokui Shen, Leonidas Guibas, Kostas Daniilidis
arXiv 2023. [Paper] [Project]
25 May 2023

CommonScenes: Generating Commonsense 3D Indoor Scenes with Scene Graphs
Guangyao Zhai, Evin Pınar Örnek, Shun-Cheng Wu, Yan Di, Federico Tombari, Nassir Navab, Benjamin Busam
arXiv 2023. [Paper]
25 May 2023

ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation
Zhengyi Wang, Cheng Lu, Yikai Wang, Fan Bao, Chongxuan Li, Hang Su, Jun Zhu
arXiv 2023. [Paper] [Project]
25 May 2023

DiffCLIP: Leveraging Stable Diffusion for Language Grounded 3D Classification
Sitian Shen, Zilin Zhu, Linqian Fan, Harry Zhang, Xinxiao Wu
arXiv 2023. [Paper]
25 May 2023

Confronting Ambiguity in 6D Object Pose Estimation via Score-Based Diffusion on SE(3)
Tsu-Ching Hsiao, Hao-Wei Chen, Hsuan-Kung Yang, Chun-Yi Lee
arXiv 2023. [Paper]
25 May 2023

Deceptive-NeRF: Enhancing NeRF Reconstruction using Pseudo-Observations from Diffusion Models
Xinhang Liu, Shiu-hong Kao, Jiaben Chen, Yu-Wing Tai, Chi-Keung Tang
arXiv 2023. [Paper]
24 May 2023

Manifold Diffusion Fields
Ahmed A. Elhag, Joshua M. Susskind, Miguel Angel Bautista
arXiv 2023. [Paper]
24 May 2023

Sin3DM: Learning a Diffusion Model from a Single 3D Textured Shape
Rundi Wu, Ruoshi Liu, Carl Vondrick, Changxi Zheng
arXiv 2023. [Paper] [Project] [Github]
24 May 2023

Understanding Text-driven Motion Synthesis with Keyframe Collaboration via Diffusion Models
Dong Wei, Xiaoning Sun, Huaijiang Sun, Bin Li, Shengxiang Hu, Weiqing Li, Jianfeng Lu
arXiv 2023. [Paper]
23 May 2023

DiffHand: End-to-End Hand Mesh Reconstruction via Diffusion Models
Lijun Li, Li’an Zhuo, Bang Zhang, Liefeng Bo, Chen Chen
arXiv 2023. [Paper]
23 May 2023

GMD: Controllable Human Motion Synthesis via Guided Diffusion Models
Korrawe Karunratanakul, Konpat Preechakul, Supasorn Suwajanakorn, Siyu Tang
arXiv 2023. [Paper] [Project]
21 May 2023

Towards Globally Consistent Stochastic Human Motion Prediction via Motion Diffusion
Jiarui Sun, Girish Chowdhary
arXiv 2023. [Paper]
21 May 2023

Few-shot 3D Shape Generation
Jingyuan Zhu, Huimin Ma, Jiansheng Chen, Jian Yuan
arXiv 2023. [Paper]
19 May 2023

Chupa: Carving 3D Clothed Humans from Skinned Shape Priors using 2D Diffusion Probabilistic Models
Byungjun Kim, Patrick Kwon, Kwangho Lee, Myunggi Lee, Sookwan Han, Daesik Kim, Hanbyul Joo
arXiv 2023. [Paper] [Project]
19 May 2023

Text2NeRF: Text-Driven 3D Scene Generation with Neural Radiance Fields
Jingbo Zhang, Xiaoyu Li, Ziyu Wan, Can Wang, Jing Liao
arXiv 2023. [Paper]
19 May 2023

RoomDreamer: Text-Driven 3D Indoor Scene Synthesis with Coherent Geometry and Texture
Liangchen Song, Liangliang Cao, Hongyu Xu, Kai Kang, Feng Tang, Junsong Yuan, Yang Zhao
arXiv 2023. [Paper]
18 May 2023

LDM3D: Latent Diffusion Model for 3D
Gabriela Ben Melech Stan, Diana Wofk, Scottie Fox, Alex Redden, Will Saxton, Jean Yu, Estelle Aflalo, Shao-Yen Tseng, Fabio Nonato, Matthias Muller, Vasudev Lal
arXiv 2023. [Paper]
18 May 2023

Make-An-Animation: Large-Scale Text-conditional 3D Human Motion Generation
Samaneh Azadi, Akbar Shah, Thomas Hayes, Devi Parikh, Sonal Gupta
arXiv 2023. [Paper] [Project]
16 May 2023

FitMe: Deep Photorealistic 3D Morphable Model Avatars
Alexandros Lattas, Stylianos Moschoglou, Stylianos Ploumpis, Baris Gecer, Jiankang Deng, Stefanos Zafeiriou
CVPR 2023. [Paper] [Project]
16 May 2023

AMD: Autoregressive Motion Diffusion
Bo Han, Hao Peng, Minjing Dong, Chang Xu, Yi Ren, Yixuan Shen, Yuheng Li
arXiv 2023. [Paper]
16 May 2023

Text-guided High-definition Consistency Texture Model
Zhibin Tang, Tiantong He
arXiv 2023. [Paper]
10 May 2023

Relightify: Relightable 3D Faces from a Single Image via Diffusion Models
Foivos Paraperas Papantoniou, Alexandros Lattas, Stylianos Moschoglou, Stefanos Zafeiriou
arXiv 2023. [Paper] [Project]
10 May 2023

CaloClouds: Fast Geometry-Independent Highly-Granular Calorimeter Simulation
Erik Buhmann, Sascha Diefenbacher, Engin Eren, Frank Gaede, Gregor Kasieczka, Anatolii Korol, William Korcari, Katja Krüger, Peter McKeown
arXiv 2023. [Paper]
8 May 2023

Locally Attentional SDF Diffusion for Controllable 3D Shape Generation
Xin-Yang Zheng, Hao Pan, Peng-Shuai Wang, Xin Tong, Yang Liu, Heung-Yeung Shum
SIGGRAPH 2023. [Paper]
8 May 2023

DiffFacto: Controllable Part-Based 3D Point Cloud Generation with Cross Diffusion
Kiyohiro Nakayama, Mikaela Angelina Uy, Jiahui Huang, Shi-Min Hu, Ke Li, Leonidas J Guibas
arXiv 2023. [Paper] [Github]
4 May 2023

Shap-E: Generating Conditional 3D Implicit Functions
Heewoo Jun, Alex Nichol
arXiv 2023. [Paper] [Github] 3 May 2023

ContactArt: Learning 3D Interaction Priors for Category-level Articulated Object and Hand Poses Estimation
Zehao Zhu, Jiashun Wang, Yuzhe Qin, Deqing Sun, Varun Jampani, Xiaolong Wang
arXiv 2023. [Paper] [Project]
2 May 2023

DreamPaint: Few-Shot Inpainting of E-Commerce Items for Virtual Try-On without 3D Modeling
Mehmet Saygin Seyfioglu, Karim Bouyarmane, Suren Kumar, Amir Tavanaei, Ismail B. Tutar
arXiv 2023. [Paper]
2 May 2023

Learning a Diffusion Prior for NeRFs
Guandao Yang, Abhijit Kundu, Leonidas J. Guibas, Jonathan T. Barron, Ben Poole
ICLR Workshop 2023. [Paper]
27 Apr 2023

TextMesh: Generation of Realistic 3D Meshes From Text Prompts
Christina Tsalicoglou, Fabian Manhardt, Alessio Tonioni, Michael Niemeyer, Federico Tombari
arXiv 2023. [Paper]
24 Apr 2023

Nerfbusters: Removing Ghostly Artifacts from Casually Captured NeRFs
Frederik Warburg, Ethan Weber, Matthew Tancik, Aleksander Holynski, Angjoo Kanazawa
arXiv 2023. [Paper] [Project] [Github]
20 Apr 2023

Farm3D: Learning Articulated 3D Animals by Distilling 2D Diffusion
Tomas Jakab, Ruining Li, Shangzhe Wu, Christian Rupprecht, Andrea Vedaldi
arXiv 2023. [Paper] [Project]
20 Apr 2023

Anything-3D: Towards Single-view Anything Reconstruction in the Wild
Qiuhong Shen, Xingyi Yang, Xinchao Wang
arXiv 2023. [Paper]
19 Apr 2023

Avatars Grow Legs: Generating Smooth Human Motion from Sparse Tracking Inputs with Diffusion Model
Yuming Du, Robin Kips, Albert Pumarola, Sebastian Starke, Ali Thabet, Artsiom Sanakoyeu
CVPR 2023. [Paper] [Project] [Github]
17 Apr 2023

Towards Controllable Diffusion Models via Reward-Guided Exploration
Hengtong Zhang, Tingyang Xu
arXiv 2023. [Paper]
14 Apr 2023

Learning Controllable 3D Diffusion Models from Single-view Images
Jiatao Gu, Qingzhe Gao, Shuangfei Zhai, Baoquan Chen, Lingjie Liu, Josh Susskind
arXiv 2023. [Paper] [Project]
13 Apr 2023

Single-Stage Diffusion NeRF: A Unified Approach to 3D Generation and Reconstruction
Hansheng Chen, Jiatao Gu, Anpei Chen, Wei Tian, Zhuowen Tu, Lingjie Liu, Hao Su
arXiv 2023. [Paper] [Project]
13 Apr 2023

Probabilistic Human Mesh Recovery in 3D Scenes from Egocentric Views
Siwei Zhang, Qianli Ma, Yan Zhang, Sadegh Aliakbarian, Darren Cosker, Siyu Tang
arXiv 2023. [Paper] [Project]
12 Apr 2023

InterGen: Diffusion-based Multi-human Motion Generation under Complex Interactions
Han Liang, Wenqian Zhang, Wenxuan Li, Jingyi Yu, Lan Xu
arXiv 2023. [Paper] [Github]
12 Apr 2023

Probabilistic Human Mesh Recovery in 3D Scenes from Egocentric Views
Siwei Zhang, Qianli Ma, Yan Zhang, Sadegh Aliakbarian, Darren Cosker, Siyu Tang
arXiv 2023. [Paper] [Project]
12 Apr 2023

Re-imagine the Negative Prompt Algorithm: Transform 2D Diffusion into 3D, alleviate Janus problem and Beyond
Mohammadreza Armandpour, Huangjie Zheng, Ali Sadeghian, Amir Sadeghian, Mingyuan Zhou
arXiv 2023. [Paper] [Project]
11 Apr 2023

NeRF applied to satellite imagery for surface reconstruction
Federico Semeraro, Yi Zhang, Wenying Wu, Patrick Carroll
arXiv 2023. [Paper] [Github]
9 Apr 2023

DITTO-NeRF: Diffusion-based Iterative Text To Omni-directional 3D Model
Hoigi Seo, Hayeon Kim, Gwanghyun Kim, Se Young Chun
arXiv 2023. [Paper] [Project]
6 Apr 2023

Generative Novel View Synthesis with 3D-Aware Diffusion Models
Eric R. Chan, Koki Nagano, Matthew A. Chan, Alexander W. Bergman, Jeong Joon Park, Axel Levy, Miika Aittala, Shalini De Mello, Tero Karras, Gordon Wetzstein
arXiv 2023. [Paper] [Project]
5 Apr 2023

Trace and Pace: Controllable Pedestrian Animation via Guided Trajectory Diffusion
Davis Rempe, Zhengyi Luo, Xue Bin Peng, Ye Yuan, Kris Kitani, Karsten Kreis, Sanja Fidler, Or Litany
CVPR 2023. [Paper] [Github]
4 Apr 2023

PODIA-3D: Domain Adaptation of 3D Generative Model Across Large Domain Gap Using Pose-Preserved Text-to-Image Diffusion
Gwanghyun Kim, Ji Ha Jang, Se Young Chun
arXiv 2023. [Paper] [Project]
4 Apr 2023

ReMoDiffuse: Retrieval-Augmented Motion Diffusion Model
Mingyuan Zhang, Xinying Guo, Liang Pan, Zhongang Cai, Fangzhou Hong, Huirong Li, Lei Yang, Ziwei Liu
arXiv 2023. [Paper] [Project] [Github]
3 Apr 2023

Controllable Motion Synthesis and Reconstruction with Autoregressive Diffusion Models
Wenjie Yin, Ruibo Tu, Hang Yin, Danica Kragic, Hedvig Kjellström, Mårten Björkman
arXiv 2023. [Paper]
3 Apr 2023

DreamAvatar: Text-and-Shape Guided 3D Human Avatar Generation via Diffusion Models
Yukang Cao, Yan-Pei Cao, Kai Han, Ying Shan, Kwan-Yee K. Wong
arXiv 2023. [Paper]
3 Apr 2023

DreamFace: Progressive Generation of Animatable 3D Faces under Text Guidance
Longwen Zhang, Qiwei Qiu, Hongyang Lin, Qixuan Zhang, Cheng Shi, Wei Yang, Ye Shi, Sibei Yang, Lan Xu, Jingyi Yu
arXiv 2023. [Paper] [Project]
1 Apr 2023

AvatarCraft: Transforming Text into Neural Human Avatars with Parameterized Shape and Pose Control
Ruixiang Jiang, Can Wang, Jingbo Zhang, Menglei Chai, Mingming He, Dongdong Chen, Jing Liao
arXiv 2023. [Paper] [Project] [Github]
30 Mar 2023

HOLODIFFUSION: Training a 3D Diffusion Model using 2D Images
Animesh Karnewar, Andrea Vedaldi, David Novotny, Niloy Mitra
CVPR 2023. [Paper] [Project]
29 Mar 2023

4D Facial Expression Diffusion Model
Kaifeng Zou, Sylvain Faisan, Boyang Yu, Sébastien Valette, Hyewon Seo
arXiv 2023. [Paper] [Github]
29 Mar 2023

Instruct 3D-to-3D: Text Instruction Guided 3D-to-3D conversion
Hiromichi Kamata, Yuiko Sakuma, Akio Hayakawa, Masato Ishii, Takuya Narihira
arXiv 2023. [Paper] [Project] [Github]
28 Mar 2023

Novel View Synthesis of Humans using Differentiable Rendering
Guillaume Rochette, Chris Russell, Richard Bowden
IEEE T-BIOM 2023. [Paper] [Github]
28 Mar 2023

Debiasing Scores and Prompts of 2D Diffusion for Robust Text-to-3D Generation
Susung Hong, Donghoon Ahn, Seungryong Kim
CVPR Workshop 2023. [Paper]
27 Mar 2023

Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior
Junshu Tang, Tengfei Wang, Bo Zhang, Ting Zhang, Ran Yi, Lizhuang Ma, Dong Chen
arXiv 2023. [Paper] [Project] [Github]
24 Mar 2023

ISS++: Image as Stepping Stone for Text-Guided 3D Shape Generation
Zhengzhe Liu, Peng Dai, Ruihui Li, Xiaojuan Qi, Chi-Wing Fu
ICLR 2023. [Paper]
24 Mar 2023

CompoNeRF: Text-guided Multi-object Compositional NeRF with Editable 3D Scene Layout
Yiqi Lin, Haotian Bai, Sijia Li, Haonan Lu, Xiaodong Lin, Hui Xiong, Lin Wang
arXiv 2023. [Paper] [Project]
24 Mar 2023

Fantasia3D: Disentangling Geometry and Appearance for High-quality Text-to-3D Content Creation
Rui Chen, Yongwei Chen, Ningxin Jiao, Kui Jia
arXiv 2023. [Paper] [Project] [Github]
24 Mar 2023

DDT: A Diffusion-Driven Transformer-based Framework for Human Mesh Recovery from a Video
Ce Zheng, Guo-Jun Qi, Chen Chen
arXiv 2023. [Paper]
23 Mar 2023

Instruct-NeRF2NeRF: Editing 3D Scenes with Instructions
Ayaan Haque, Matthew Tancik, Alexei A. Efros, Aleksander Holynski, Angjoo Kanazawa
arXiv 2023. [Paper] [Project]
22 Mar 2023

FeatureNeRF: Learning Generalizable NeRFs by Distilling Foundation Models
Jianglong Ye, Naiyan Wang, Xiaolong Wang
arXiv 2023. [Paper] [Project]
22 Mar 2023

Vox-E: Text-guided Voxel Editing of 3D Objects
Etai Sella, Gal Fiebelman, Peter Hedman, Hadar Averbuch-Elor
arXiv 2023. [Paper] [Project]
21 Mar 2023

Compositional 3D Scene Generation using Locally Conditioned Diffusion
Ryan Po, Gordon Wetzstein
arXiv 2023. [Paper] [Github]
21 Mar 2023

Diffusion-Based 3D Human Pose Estimation with Multi-Hypothesis Aggregation
Wenkang Shan, Zhenhua Liu, Xinfeng Zhang, Zhao Wang, Kai Han, Shanshe Wang, Siwei Ma, Wen Gao
arXiv 2023. [Paper] [Github]
21 Mar 2023

3D-CLFusion: Fast Text-to-3D Rendering with Contrastive Latent Diffusion
Yu-Jhe Li, Kris Kitani
arXiv 2023. [Paper]
21 Mar 2023

Affordance Diffusion: Synthesizing Hand-Object Interactions
Yufei Ye, Xueting Li, Abhinav Gupta, Shalini De Mello, Stan Birchfield, Jiaming Song, Shubham Tulsiani, Sifei Liu
CVPR 2023. [Paper] [Project]
21 Mar 2023

SALAD: Part-Level Latent Diffusion for 3D Shape Generation and Manipulation
Juil Koo, Seungwoo Yoo, Minh Hieu Nguyen, Minhyuk Sung
arXiv 2023. [Paper] [Project]
21 Mar 2023

Learning a 3D Morphable Face Reflectance Model from Low-cost Data
Yuxuan Han, Zhibo Wang, Feng Xu
CVPR 2023. [Paper] [Project]
21 Mar 2023

Text2Tex: Text-driven Texture Synthesis via Diffusion Models
Dave Zhenyu Chen, Yawar Siddiqui, Hsin-Ying Lee, Sergey Tulyakov, Matthias Nießner
arXiv 2023. [Paper] [Project]
20 Mar 2023

Zero-1-to-3: Zero-shot One Image to 3D Object
Ruoshi Liu, Rundi Wu, Basile Van Hoorick, Pavel Tokmakov, Sergey Zakharov, Carl Vondrick
arXiv 2023. [Paper] [Project] [Github]
20 Mar 2023

SKED: Sketch-guided Text-based 3D Editing
Aryan Mikaeili, Or Perel, Daniel Cohen-Or, Ali Mahdavi-Amiri
arxiv 2023. [Paper]
19 Mar 2023

3DQD: Generalized Deep 3D Shape Prior via Part-Discretized Diffusion Process
Yuhan Li, Yishun Dou, Xuanhong Chen, Bingbing Ni, Yilin Sun, Yutian Liu, Fuzhen Wang
CVPR 2023. [Paper] [Github]
18 Mar 2023

Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation
Lingting Zhu, Xian Liu, Xuanyu Liu, Rui Qian, Ziwei Liu, Lequan Yu
CVPR 2023. [Paper] [Github]
16 Mar 2023

Diffusion-HPC: Generating Synthetic Images with Realistic Humans
Zhenzhen Weng, Laura Bravo-Sánchez, Serena Yeung
arXiv 2023. [Paper] [Github]
16 Mar 2023

DINAR: Diffusion Inpainting of Neural Textures for One-Shot Human Avatars
David Svitov, Dmitrii Gudkov, Renat Bashirov, Victor Lempitsky
arXiv 2023. [Paper]
16 Mar 2023

Improving 3D Imaging with Pre-Trained Perpendicular 2D Diffusion Models
Suhyeon Lee, Hyungjin Chung, Minyoung Park, Jonghyuk Park, Wi-Sun Ryu, Jong Chul Ye
arXiv 2023. [Paper]
15 Mar 2023

Controllable Mesh Generation Through Sparse Latent Point Diffusion Models
Zhaoyang Lyu, Jinyi Wang, Yuwei An, Ya Zhang, Dahua Lin, Bo Dai
CVPR 2023. [Paper] [Project]
14 Mar 2023

MeshDiffusion: Score-based Generative 3D Mesh Modeling
Zhen Liu, Yao Feng, Michael J. Black, Derek Nowrouzezahrai, Liam Paull, Weiyang Liu
ICLR 2023. [Paper] [Project] [Github]
14 Mar 2023

Point Cloud Diffusion Models for Automatic Implant Generation
Paul Friedrich, Julia Wolleb, Florentin Bieder, Florian M. Thieringer, Philippe C. Cattin
arXiv 2023. [Paper]
14 Mar 2023

Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D Generation
Junyoung Seo, Wooseok Jang, Min-Seop Kwak, Jaehoon Ko, Hyeonsu Kim, Junho Kim, Jin-Hwa Kim, Jiyoung Lee, Seungryong Kim
arXiv 2023. [Paper] [Github]
14 Mar 2023

GECCO: Geometrically-Conditioned Point Diffusion Models
Michał J. Tyszkiewicz, Pascal Fua, Eduard Trulls
arXiv 2023. [Paper]
10 Mar 2023

3DGen: Triplane Latent Diffusion for Textured Mesh Generation
Anchit Gupta, Wenhan Xiong, Yixin Nie, Ian Jones, Barlas Oğuz
arXiv 2023. [Paper]
9 Mar 2023

Human Motion Diffusion as a Generative Prior
Yonatan Shafir, Guy Tevet, Roy Kapon, Amit H. Bermano
arXiv 2023. [Paper]
2 Mar 2023

Can We Use Diffusion Probabilistic Models for 3D Motion Prediction?
Hyemin Ahn, Esteve Valls Mascaro, Dongheui Lee
ICRA 2023. [Paper] [Project] [Github]
28 Feb 2023

DiffusioNeRF: Regularizing Neural Radiance Fields with Denoising Diffusion Models
Jamie Wynn, Daniyar Turmukhambetov
CVPR 2023. [Paper] [Github] [Github]
23 Feb 2023

PC2: Projection-Conditioned Point Cloud Diffusion for Single-Image 3D Reconstruction
Luke Melas-Kyriazi, Christian Rupprecht, Andrea Vedaldi
arXiv 2023. [Paper] Project]
23 Feb 2023

NerfDiff: Single-image View Synthesis with NeRF-guided Distillation from 3D-aware Diffusion
Jiatao Gu, Alex Trevithick, Kai-En Lin, Josh Susskind, Christian Theobalt, Lingjie Liu, Ravi Ramamoorthi
ICML 2023. [Paper] [Github]
20 Feb 2023

SinMDM: Single Motion Diffusion
Sigal Raab, Inbal Leibovitch, Guy Tevet, Moab Arar, Amit H. Bermano, Daniel Cohen-Or
arXiv 2023. [Paper] [Project] [Github]
12 Feb 2023

3D Colored Shape Reconstruction from a Single RGB Image through Diffusion
Bo Li, Xiaolin Wei, Fengwei Chen, Bin Liu
arXiv 2023. [Paper]
11 Feb 2023

HumanMAC: Masked Motion Completion for Human Motion Prediction
Ling-Hao Chen, Jiawei Zhang, Yewen Li, Yiren Pang, Xiaobo Xia, Tongliang Liu
arXiv 2023. [Paper] [Project] [Github]
7 Feb 2023

TEXTure: Text-Guided Texturing of 3D Shapes
Elad Richardson, Gal Metzer, Yuval Alaluf, Raja Giryes, Daniel Cohen-Or
arXiv 2023. [Paper] [Project] [Github]
3 Feb 2023

Zero3D: Semantic-Driven Multi-Category 3D Shape Generation
Bo Han, Yitong Liu, Yixuan Shen
arXiv 2023. [Paper]
31 Jan 2023

Neural Wavelet-domain Diffusion for 3D Shape Generation, Inversion, and Manipulation
Jingyu Hu, Ka-Hei Hui, Zhengzhe Liu, Ruihui Li, Chi-Wing Fu
SIGGRAPH ASIA 2023. [Paper] [Github]
1 Feb 2023

3DShape2VecSet: A 3D Shape Representation for Neural Fields and Generative Diffusion Models
Biao Zhang, Jiapeng Tang, Matthias Niessner, Peter Wonka
SIGGRAPH 2023. [Paper] [Github] [Github]
26 Jan 2023

DiffMotion: Speech-Driven Gesture Synthesis Using Denoising Diffusion Model
Fan Zhang, Naye Ji, Fuxing Gao, Yongping Li
arXiv 2023. [Paper]
24 Jan 2023

Bipartite Graph Diffusion Model for Human Interaction Generation
Baptiste Chopin, Hao Tang, Mohamed Daoudi
arXiv 2023. [Paper]
24 Jan 2023

Diffusion-based Generation, Optimization, and Planning in 3D Scenes
Siyuan Huang, Zan Wang, Puhao Li, Baoxiong Jia, Tengyu Liu, Yixin Zhu, Wei Liang, Song-Chun Zhu
arXiv 2023. [Paper] [Project] [Github]
15 Jan 2023

Modiff: Action-Conditioned 3D Motion Generation with Denoising Diffusion Probabilistic Models
Mengyi Zhao, Mengyuan Liu, Bin Ren, Shuling Dai, Nicu Sebe
arXiv 2023. [Paper]
10 Jan 2023

Diffusion Probabilistic Models for Scene-Scale 3D Categorical Data
Jumin Lee, Woobin Im, Sebin Lee, Sung-Eui Yoon
arXiv 2023. [Paper] [Github]
2 Jan 2023

Dream3D: Zero-Shot Text-to-3D Synthesis Using 3D Shape Prior and Text-to-Image Diffusion Models
Jiale Xu, Xintao Wang, Weihao Cheng, Yan-Pei Cao, Ying Shan, Xiaohu Qie, Shenghua Gao
CVPR 2023. [Paper] [Project]
28 Dec 2022

Point-E: A System for Generating 3D Point Clouds from Complex Prompts
Alex Nichol, Heewoo Jun, Prafulla Dhariwal, Pamela Mishkin, Mark Chen
arXiv 2022. [Paper] [Github]
16 Dec 2022

Real-Time Rendering of Arbitrary Surface Geometries using Learnt Transfer
Sirikonda Dhawal, Aakash KT, P.J. Narayanan
ICVGIP 2022. [Paper]
19 Dec 2022

Unifying Human Motion Synthesis and Style Transfer with Denoising Diffusion Probabilistic Models
Ziyi Chang, Edmund J. C. Findlay, Haozheng Zhang, Hubert P. H. Shum
arXiv 2022. [Paper]
16 Dec 2022

Rodin: A Generative Model for Sculpting 3D Digital Avatars Using Diffusion
Tengfei Wang, Bo Zhang, Ting Zhang, Shuyang Gu, Jianmin Bao, Tadas Baltrusaitis, Jingjing Shen, Dong Chen, Fang Wen, Qifeng Chen, Baining Guo
arXiv 2022. [Paper] [Project]
12 Dec 2022

Generative Scene Synthesis via Incremental View Inpainting using RGBD Diffusion Models
Jiabao Lei, Jiapeng Tang, Kui Jia
CVPR 2023. [Paper] [Project] [Github]
12 Dec 2022

Ego-Body Pose Estimation via Ego-Head Pose Estimation
Jiaman Li, C. Karen Liu, Jiajun Wu
CVPR 2023. [Paper]
9 Dec 2022

MoFusion: A Framework for Denoising-Diffusion-based Motion Synthesis
Rishabh Dabral, Muhammad Hamza Mughal, Vladislav Golyanik, Christian Theobalt
CVPR 2023. [Paper] [Project]
8 Dec 2022

SDFusion: Multimodal 3D Shape Completion, Reconstruction, and Generation
Yen-Chi Cheng, Hsin-Ying Lee, Sergey Tulyakov, Alexander Schwing, Liangyan Gui
CVPR 2023. [Paper] [Project]
8 Dec 2022

Executing your Commands via Motion Diffusion in Latent Space
Xin Chen, Biao Jiang, Wen Liu, Zilong Huang, Bin Fu, Tao Chen, Jingyi Yu, Gang Yu
CVPR 2023. [Paper] [Project] [Github]
8 Dec 2022

Magic: Multi Art Genre Intelligent Choreography Dataset and Network for 3D Dance Generation
Ronghui Li, Junfan Zhao, Yachao Zhang, Mingyang Su, Zeping Ren, Han Zhang, Xiu Li
arXiv 2022. [Paper]
7 Dec 2022

NeRDi: Single-View NeRF Synthesis with Language-Guided Diffusion as General Image Priors
Congyue Deng, Chiyu “Max’’ Jiang, Charles R. Qi, Xinchen Yan, Yin Zhou, Leonidas Guibas, Dragomir Anguelov
arXiv 2022. [Paper]
6 Dec 2022

Diffusion-SDF: Text-to-Shape via Voxelized Diffusion
Muheng Li, Yueqi Duan, Jie Zhou, Jiwen Lu
CVPR 2023. [Paper] [Github]
6 Dec 2022

Pretrained Diffusion Models for Unified Human Motion Synthesis
Jianxin Ma, Shuai Bai, Chang Zhou
arXiv 2022. [Paper] [Project]
6 Dec 2022

DiffuPose: Monocular 3D Human Pose Estimation via Denoising Diffusion Probabilistic Model
Jeongjun Choi, Dongseok Shim, H. Jin Kim
arXiv 2022. [Paper]
6 Dec 2022

PhysDiff: Physics-Guided Human Motion Diffusion Model
Ye Yuan, Jiaming Song, Umar Iqbal, Arash Vahdat, Jan Kautz
arXiv 2022. [Paper] [Project]
5 Dec 2022

Fast Point Cloud Generation with Straight Flows
Lemeng Wu, Dilin Wang, Chengyue Gong, Xingchao Liu, Yunyang Xiong, Rakesh Ranjan, Raghuraman Krishnamoorthi, Vikas Chandra, Qiang Liu
arXiv 2022. [Paper]
4 Dec 2022

DiffRF: Rendering-Guided 3D Radiance Field Diffusion
Norman Müller, Yawar Siddiqui, Lorenzo Porzi, Samuel Rota Bulò, Peter Kontschieder, Matthias Nießner
CVPR 2023. [Paper] [Project]
2 Dec 2022

3D-LDM: Neural Implicit 3D Shape Generation with Latent Diffusion Models
Gimin Nam, Mariem Khlifi, Andrew Rodriguez, Alberto Tono, Linqi Zhou, Paul Guerrero
arXiv 2022. [Paper]
1 Dec 2022

Score Jacobian Chaining: Lifting Pretrained 2D Diffusion Models for 3D Generation
Haochen Wang, Xiaodan Du, Jiahao Li, Raymond A. Yeh, Greg Shakhnarovich
CVPR 2023. [Paper] [Project]
1 Dec 2022

SparseFusion: Distilling View-conditioned Diffusion for 3D Reconstruction
Zhizhuo Zhou, Shubham Tulsiani
CVPR 2023. [Paper] [Project] [Github]
1 Dec 2022

3D Neural Field Generation using Triplane Diffusion
J. Ryan Shue, Eric Ryan Chan, Ryan Po, Zachary Ankner, Jiajun Wu, Gordon Wetzstein
arXiv 2022. [Paper] [Project]
30 Nov 2022

DiffPose: Toward More Reliable 3D Pose Estimation
Jia Gong, Lin Geng Foo, Zhipeng Fan, Qiuhong Ke, Hossein Rahmani, Jun Liu
CVPR 2023. [Paper] [Github]
30 Nov 2022

DiffPose: Multi-hypothesis Human Pose Estimation using Diffusion models
Karl Holmquist, Bastian Wandt
arXiv 2022. [Paper] [Github]
29 Nov 2022

DATID-3D: Diversity-Preserved Domain Adaptation Using Text-to-Image Diffusion for 3D Generative Model
Gwanghyun Kim, Se Young Chun
CVPR 2023. [Paper] [Github]
29 Nov 2022

NeuralLift-360: Lifting An In-the-wild 2D Photo to A 3D Object with 360° Views
Dejia Xu, Yifan Jiang, Peihao Wang, Zhiwen Fan, Yi Wang, Zhangyang Wang
arXiv 2022. [Paper] [Project] [Github]
29 Nov 2022

Ada3Diff: Defending against 3D Adversarial Point Clouds via Adaptive Diffusion
Kui Zhang, Hang Zhou, Jie Zhang, Qidong Huang, Weiming Zhang, Nenghai Yu
arXiv 2022. [Paper]
29 Nov 2022

UDE: A Unified Driving Engine for Human Motion Generation
Zixiang Zhou, Baoyuan Wang
arXiv 2022. [Paper] [Project] [Github]
29 Nov 2022

3DDesigner: Towards Photorealistic 3D Object Generation and Editing with Text-guided Diffusion Models
Gang Li, Heliang Zheng, Chaoyue Wang, Chang Li, Changwen Zheng, Dacheng Tao
arXiv 2022. [Paper]
25 Nov 2022

DiffusionSDF: Conditional Generative Modeling of Signed Distance Functions
Gene Chou, Yuval Bahat, Felix Heide
arXiv 2022. [Paper] [Github]
24 Nov 2022

Tetrahedral Diffusion Models for 3D Shape Generation
Nikolai Kalischek, Torben Peters, Jan D. Wegner, Konrad Schindler
arXiv 2022. [Paper]
23 Nov 2022

IC3D: Image-Conditioned 3D Diffusion for Shape Generation
Cristian Sbrolli, Paolo Cudrano, Matteo Frosi, Matteo Matteucci
arXiv 2022. [Paper]
20 Nov 2022

Listen, denoise, action! Audio-driven motion synthesis with diffusion models
Simon Alexanderson, Rajmund Nagy, Jonas Beskow, Gustav Eje Henter
arXiv 2022. [Paper]
17 Nov 2022

RenderDiffusion: Image Diffusion for 3D Reconstruction, Inpainting and Generation
Titas Anciukevičius, Zexiang Xu, Matthew Fisher, Paul Henderson, Hakan Bilen, Niloy J. Mitra, Paul Guerrero
CVPR 2023. [Paper] [Github]
17 Nov 2022

Latent-NeRF for Shape-Guided Generation of 3D Shapes and Textures
Gal Metzer, Elad Richardson, Or Patashnik, Raja Giryes, Daniel Cohen-Or
arXiv 2022. [Paper] [Github]
14 Nov 2022

ReFu: Refine and Fuse the Unobserved View for Detail-Preserving Single-Image 3D Human Reconstruction
Gyumin Shim, Minsoo Lee, Jaegul Choo
ACM 2022. [Paper]
9 Nov 2022

StructDiffusion: Object-Centric Diffusion for Semantic Rearrangement of Novel Objects
Weiyu Liu, Tucker Hermans, Sonia Chernova, Chris Paxton
RSS 2023. [Paper]
8 Nov 2022

Diffusion Motion: Generate Text-Guided 3D Human Motion by Diffusion Model
Zhiyuan Ren, Zhihong Pan, Xin Zhou, Le Kang
ICASSP 2023. [Paper]
22 Oct 2022

LION: Latent Point Diffusion Models for 3D Shape Generation
Xiaohui Zeng, Arash Vahdat, Francis Williams, Zan Gojcic, Or Litany, Sanja Fidler, Karsten Kreis
NeurIPS 2022. [Paper] [Project]
12 Oct 2022

Human Joint Kinematics Diffusion-Refinement for Stochastic Motion Prediction
Dong Wei, Huaijiang Sun, Bin Li, Jianfeng Lu, Weiqing Li, Xiaoning Sun, Shengxiang Hu
AAAI 2023. [Paper]
12 Oct 2022

A generic diffusion-based approach for 3D human pose prediction in the wild
Saeed Saadatnejad, Ali Rasekh, Mohammadreza Mofayezi, Yasamin Medghalchi, Sara Rajabzadeh, Taylor Mordan, Alexandre Alahi
ICRA 2023. [Paper]
11 Oct 2022

Novel View Synthesis with Diffusion Models
Daniel Watson, William Chan, Ricardo Martin-Brualla, Jonathan Ho, Andrea Tagliasacchi, Mohammad Norouzi
ICLR 2023. [Paper]
6 Oct 2022

Neural Volumetric Mesh Generator
Yan Zheng, Lemeng Wu, Xingchao Liu, Zhen Chen, Qiang Liu, Qixing Huang
NeurIPS Workshop 2022. [Paper]
6 Oct 2022

Denoising Diffusion Probabilistic Models for Styled Walking Synthesis
Edmund J. C. Findlay, Haozheng Zhang, Ziyi Chang, Hubert P. H. Shum
ICLR 2023. [Paper]
29 Sep 2022

Human Motion Diffusion Model
Guy Tevet, Sigal Raab, Brian Gordon, Yonatan Shafir, Amit H. Bermano, Daniel Cohen-Or
arXiv 2022. [Paper] [Project]
29 Sep 2022

ISS: Image as Stepping Stone for Text-Guided 3D Shape Generation
Zhengzhe Liu, Peng Dai, Ruihui Li, Xiaojuan Qi, Chi-Wing Fu
ICLR 2023. [Paper] [Github]
9 Sep 2022

SE(3)-DiffusionFields: Learning cost functions for joint grasp and motion optimization through diffusion
Julen Urain, Niklas Funk, Georgia Chalvatzaki, Jan Peters
arXiv 2022. [Paper] [Github]
8 Sep 2022

First Hitting Diffusion Models for Generating Manifold, Graph and Categorical Data
Mao Ye, Lemeng Wu, Qiang Liu
NeruIPS 2022. [Paper]
2 Sep 2022

FLAME: Free-form Language-based Motion Synthesis & Editing
Jihoon Kim, Jiseob Kim, Sungjoon Choi
AAAI 2023. [Paper]
1 Sep 2022

Let us Build Bridges: Understanding and Extending Diffusion Generative Models
Xingchao Liu, Lemeng Wu, Mao Ye, Qiang Liu
NeurIPS Workshop 2022. [Paper]
31 Aug 2022

MotionDiffuse: Text-Driven Human Motion Generation with Diffusion Model
Mingyuan Zhang, Zhongang Cai, Liang Pan, Fangzhou Hong, Xinying Guo, Lei Yang, Ziwei Liu
arXiv 2022. [Paper] [Project]
31 Aug 2022

A Diffusion Model Predicts 3D Shapes from 2D Microscopy Images
Dominik J. E. Waibel, Ernst Röell, Bastian Rieck, Raja Giryes, Carsten Marr
arXiv 2022. [Paper]
30 Aug 2022

PointDP: Diffusion-driven Purification against Adversarial Attacks on 3D Point Cloud Recognition
Jiachen Sun, Weili Nie, Zhiding Yu, Z. Morley Mao, Chaowei Xiao
arXiv 2022. [Paper]
21 Aug 2022

A Conditional Point Diffusion-Refinement Paradigm for 3D Point Cloud Completion
Zhaoyang Lyu, Zhifeng Kong, Xudong Xu, Liang Pan, Dahua Lin
ICLR 2022. [Paper] [Github]
7 Dec 2021

Score-Based Point Cloud Denoising
Shitong Luo, Wei Hu
ICCV 2021. [Paper] [Github]
23 Jul 2021

DiffuStereo: High Quality Human Reconstruction via Diffusion-based Stereo Using Sparse Cameras
Ruizhi Shao, Zerong Zheng, Hongwen Zhang, Jingxiang Sun, Yebin Liu
ECCV 2022. [Paper] [Project] [Github]
16 Jul 2022

3D Shape Generation and Completion through Point-Voxel Diffusion
Linqi Zhou, Yilun Du, Jiajun Wu
ICCV 2021. [Paper] [Project]
8 Apr 2021

Diffusion Probabilistic Models for 3D Point Cloud Generation
Shitong Luo, Wei Hu
CVPR 2021. [Paper] [Github]
2 Mar 2021

Adversarial Attack

Generated Distributions Are All You Need for Membership Inference Attacks Against Generative Models
Minxing Zhang, Ning Yu, Rui Wen, Michael Backes, Yang Zhang
arXiv 2023. [Paper]
30 Oct 2023

Adversarial Examples Are Not Real Features
Ang Li, Yifei Wang, Yiwen Guo, Yisen Wang
NeurIPS 2023. [Paper]
29 Oct 2023

Purify++: Improving Diffusion-Purification with Advanced Diffusion Models and Control of Randomness
Boya Zhang, Weijian Luo, Zhihua Zhang
arXiv 2023. [Paper]
28 Oct 2023

Energy-Based Models for Anomaly Detection: A Manifold Diffusion Recovery Approach
Sangwoong Yoon, Young-Uk Jin, Yung-Kyun Noh, Frank C. Park
arXiv 2023. [Paper]
28 Oct 2023

Model Selection of Anomaly Detectors in the Absence of Labeled Validation Data
Clement Fung, Chen Qiu, Aodong Li, Maja Rudolph
arXiv 2023. [Paper]
16 Oct 2023

Boosting Black-box Attack to Deep Neural Networks with Conditional Diffusion Models
Renyang Liu, Wei Zhou, Tianwei Zhang, Kangjie Chen, Jun Zhao, Kwok-Yan Lam
arXiv 2023. [Paper]
11 Oct 2023

Investigating the Adversarial Robustness of Density Estimation Using the Probability Flow ODE
Marius Arvinte, Cory Cornelius, Jason Martin, Nageen Himayat
arXiv 2023. [Paper]
10 Oct 2023

Understanding and Improving Adversarial Attacks on Latent Diffusion Model
Boyang Zheng, Chumeng Liang, Xiaoyu Wu, Yan Liu
arXiv 2023. [Paper]
7 Oct 2023

Semantic Adversarial Attacks via Diffusion Models
Chenan Wang, Jinhao Duan, Chaowei Xiao, Edward Kim, Matthew Stamm, Kaidi Xu
arXiv 2023. [Paper]
14 Sep 2023

Catch You Everything Everywhere: Guarding Textual Inversion via Concept Watermarking
Weitao Feng, Jiyan He, Jie Zhang, Tianwei Zhang, Wenbo Zhou, Weiming Zhang, Nenghai Yu
arXiv 2023. [Paper]
12 Sep 2023

Diff-Privacy: Diffusion-based Face Privacy Protection
Xiao He, Mingrui Zhu, Dongxin Chen, Nannan Wang, Xinbo Gao
arXiv 2023. [Paper]
11 Sep 2023

DiffDefense: Defending against Adversarial Attacks via Diffusion Models
Hondamunige Prasanna Silva, Lorenzo Seidenari, Alberto Del Bimbo
arXiv 2023. [Paper] [Github]
7 Sep 2023

My Art My Choice: Adversarial Protection Against Unruly AI
Anthony Rhodes, Ram Bhagat, Umur Aybars Ciftci, Ilke Demir
arXiv 2023. [Paper]
6 Sep 2023

Improving Visual Quality and Transferability of Adversarial Attacks on Face Recognition Simultaneously with Adversarial Restoration
Fengfan Zhou
arXiv 2023. [Paper]
4 Sep 2023

Intriguing Properties of Diffusion Models: A Large-Scale Dataset for Evaluating Natural Attack Capability in Text-to-Image Generative Models
Takami Sato, Justin Yue, Nanze Chen, Ningfei Wang, Qi Alfred Chen
arXiv 2023. [Paper]
30 Aug 2023

DiffSmooth: Certifiably Robust Learning via Diffusion Models and Local Smoothing
Jiawei Zhang, Zhongzhu Chen, Huan Zhang, Chaowei Xiao, Bo Li
USENIX Security 2023. [Paper]
28 Aug 2023

A Probabilistic Fluctuation based Membership Inference Attack for Diffusion Models
Wenjie Fu, Huandong Wang, Chen Gao, Guanghua Liu, Yong Li, Tao Jiang
arXiv 2023. [Paper]
23 Aug 2023

White-box Membership Inference Attacks against Diffusion Models
Yan Pang, Tianhao Wang, Xuhui Kang, Mengdi Huai, Yang Zhang
arXiv 2023. [Paper]
11 Aug 2023

BAGM: A Backdoor Attack for Manipulating Text-to-Image Generative Models
Jordan Vice, Naveed Akhtar, Richard Hartley, Ajmal Mian
arXiv 2023. [Paper] [Github] [Dataset]
31 Jul 2023

Universal Adversarial Defense in Remote Sensing Based on Pre-trained Denoising Diffusion Models
Weikang Yu, Yonghao Xu, Pedram Ghamisi
arXiv 2023. [Paper]
31 Jul 2023

AdvDiff: Generating Unrestricted Adversarial Examples using Diffusion Models
Xuelong Dai, Kaisheng Liang, Bin Xiao
arXiv 2023. [Paper]
24 Jul 2023

Enhancing Adversarial Robustness via Score-Based Optimization
Boya Zhang, Weijian Luo, Zhihua Zhang
arXiv 2023. [Paper]
10 Jul 2023

DIFFender: Diffusion-Based Adversarial Defense against Patch Attacks in the Physical World
Caixin Kang, Yinpeng Dong, Zhengyi Wang, Shouwei Ruan, Hang Su, Xingxing Wei
arXiv 2023. [Paper]
15 Jun 2023

An Efficient Membership Inference Attack for the Diffusion Model by Proximal Initialization
Fei Kong, Jinhao Duan, RuiPeng Ma, Hengtao Shen, Xiaofeng Zhu, Xiaoshuang Shi, Kaidi Xu
arXiv 2023. [Paper]
26 May 2023

Diffusion-Based Adversarial Sample Generation for Improved Stealthiness and Controllability
Haotian Xue, Alexandre Araujo, Bin Hu, Yongxin Chen
arXiv 2023. [Paper] [Github]
25 May 2023

Differentially Private Latent Diffusion Models
Saiyue Lyu, Margarita Vinaroz, Michael F. Liu, Mijung Park
arXiv 2023. [Paper]
25 May 2023

Latent Magic: An Investigation into Adversarial Examples Crafted in the Semantic Latent Space
BoYang Zheng
arXiv 2023. [Paper]
22 May 2023

Mist: Towards Improved Adversarial Examples for Diffusion Models
Chumeng Liang, Xiaoyu Wu
arXiv 2023. [Paper]
22 May 2023

Content-based Unrestricted Adversarial Attack
Zhaoyu Chen, Bo Li, Shuang Wu, Kaixun Jiang, Shouhong Ding, Wenqiang Zhang
arXiv 2023. [Paper]
18 May 2023

Zero-Day Backdoor Attack against Text-to-Image Diffusion Models via Personalization
Yihao Huang, Qing Guo, Felix Juefei-Xu
arXiv 2023. [Paper]
18 May 2023

Raising the Bar for Certified Adversarial Robustness with Diffusion Models
Thomas Altstidl, David Dobre, Björn Eskofier, Gauthier Gidel, Leo Schwinn
arXiv 2023. [Paper]
17 May 2023

Diffusion Models for Imperceptible and Transferable Adversarial Attack
Jianqi Chen, Hao Chen, Keyan Chen, Yilan Zhang, Zhengxia Zou, Zhenwei Shi
arXiv 2023. [Paper] [Github]
14 May 2023

On enhancing the robustness of Vision Transformers: Defensive Diffusion
Raza Imam, Muhammad Huzaifa, Mohammed El-Amine Azz
arXiv 2023. [Paper] [Github]
14 May 2023

Generative Steganography Diffusion
Ping Wei, Qing Zhou, Zichi Wang, Zhenxing Qian, Xinpeng Zhang, Sheng Li
arXiv 2023. [Paper]
5 May 2023

A Pilot Study of Query-Free Adversarial Attack against Stable Diffusion
Haomin Zhuang, Yihua Zhang, Sijia Liu
CVPR Workshop 2023. [Paper]
3 Apr 2023

Black-box Backdoor Defense via Zero-shot Image Purification
Yucheng Shi, Mengnan Du, Xuansheng Wu, Zihan Guan, Ninghao Liu
arXiv 2023. [Paper]
21 Mar 2023

Adversarial Counterfactual Visual Explanations
Guillaume Jeanneret, Loïc Simon, Frédéric Jurie
CVPR 2023. [Paper] [Github]
17 Mar 2023

Robust Evaluation of Diffusion-Based Adversarial Purification
Minjong Lee, Dongwoo Kim
ICLR 2023. [Paper]
16 Mar 2023

The Devil’s Advocate: Shattering the Illusion of Unexploitable Data using Diffusion Models
Hadi M. Dolatabadi, Sarah Erfani, Christopher Leckie
arXiv 2023. [Paper]
15 Mar 2023

TrojDiff: Trojan Attacks on Diffusion Models with Diverse Targets
Weixin Chen, Dawn Song, Bo Li
CVPR 2023. [Paper] [Github]
10 Mar 2023

Generative Model-Based Attack on Learnable Image Encryption for Privacy-Preserving Deep Learning
AprilPyone MaungMaung, Hitoshi Kiya
arXiv 2023. [Paper]
9 Mar 2023

Differentially Private Diffusion Models Generate Useful Synthetic Images
Sahra Ghalebikesabi, Leonard Berrada, Sven Gowal, Ira Ktena, Robert Stanforth, Jamie Hayes, Soham De, Samuel L. Smith, Olivia Wiles, Borja Balle
arXiv 2023. [Paper]
27 Feb 2023

Data Forensics in Diffusion Models: A Systematic Analysis of Membership Privacy
Derui Zhu, Dingfan Chen, Jens Grossklags, Mario Fritz
arXiv 2023. [Paper]
15 Feb 2023

Raising the Cost of Malicious AI-Powered Image Editing
Hadi Salman, Alaa Khaddaj, Guillaume Leclerc, Andrew Ilyas, Aleksander Madry
arXiv 2023. [Paper] [Github]
13 Feb 2023

Adversarial Example Does Good: Preventing Painting Imitation from Diffusion Models via Adversarial Examples
Chumeng Liang, Xiaoyu Wu, Yang Hua, Jiaru Zhang, Yiming Xue, Tao Song, Zhengui Xue, Ruhui Ma, Haibing Guan
arXiv 2023. [Paper]
9 Feb 2023

Better Diffusion Models Further Improve Adversarial Training
Zekai Wang, Tianyu Pang, Chao Du, Min Lin, Weiwei Liu, Shuicheng Yan
arXiv 2023. [Paper] [Github]
9 Feb 2023

Membership Inference Attacks against Diffusion Models
Tomoya Matsumoto, Takayuki Miura, Naoto Yanai
arXiv 2023. [Paper]
7 Feb 2023

MorDIFF: Recognition Vulnerability and Attack Detectability of Face Morphing Attacks Created by Diffusion Autoencoders
Naser Damer, Meiling Fang, Patrick Siebke, Jan Niklas Kolf, Marco Huber, Fadi Boutros
IWBF 2023. [Paper] [Github]
3 Feb 2023

Extracting Training Data from Diffusion Models
Nicholas Carlini, Jamie Hayes, Milad Nasr, Matthew Jagielski, Vikash Sehwag, Florian Tramèr, Borja Balle, Daphne Ippolito, Eric Wallace
arXiv 2023. [Paper]
2 Feb 2023

Are Diffusion Models Vulnerable to Membership Inference Attacks?
Jinhao Duan, Fei Kong, Shiqi Wang, Xiaoshuang Shi, Kaidi Xu
arXiv 2023. [Paper]
2 Feb 2023

Salient Conditional Diffusion for Defending Against Backdoor Attacks
Brandon B. May, N. Joseph Tatro, Piyush Kumar, Nathan Shnidman
ICLR Workshop 2023. [Paper]
31 Jan 2023

Extracting Training Data from Diffusion Models
Nicholas Carlini, Jamie Hayes, Milad Nasr, Matthew Jagielski, Vikash Sehwag, Florian Tramèr, Borja Balle, Daphne Ippolito, Eric Wallace
arXiv 2023. [Paper]
30 Jan 2023

Membership Inference of Diffusion Models
Hailong Hu, Jun Pang
arXiv 2023. [Paper]
24 Jan 2023

Denoising Diffusion Probabilistic Models as a Defense against Adversarial Attacks
Lars Lien Ankile, Anna Midgley, Sebastian Weisshaar
arXiv 2023. [Paper] [Github]
17 Jan 2023

Fight Fire With Fire: Reversing Skin Adversarial Examples by Multiscale Diffusive and Denoising Aggregation Mechanism
Yongwei Wang, Yuan Li, Zhiqi Shen
arXiv 2022. [Paper]
22 Aug 2022

DensePure: Understanding Diffusion Models towards Adversarial Robustness
Chaowei Xiao, Zhongzhu Chen, Kun Jin, Jiongxiao Wang, Weili Nie, Mingyan Liu, Anima Anandkumar, Bo Li, Dawn Song
NeurIPS 2022. [Paper]
1 Nov 2022

Improving Adversarial Robustness by Contrastive Guided Diffusion Process
Yidong Ouyang, Liyan Xie, Guang Cheng
arXiv 2022. [Paper]
18 Oct 2022

Differentially Private Diffusion Models
Tim Dockhorn, Tianshi Cao, Arash Vahdat, Karsten Kreis
arXiv 2022. [Paper] [Project]
18 Oct 2022

PointDP: Diffusion-driven Purification against Adversarial Attacks on 3D Point Cloud Recognition
Jiachen Sun, Weili Nie, Zhiding Yu, Z. Morley Mao, Chaowei Xiao
arXiv 2022. [Paper]
21 Aug 2022

Threat Model-Agnostic Adversarial Defense using Diffusion Models
Tsachi Blau, Roy Ganz, Bahjat Kawar, Alex Bronstein, Michael Elad
arXiv 2022. [Paper] [Github]
17 Jul 2022

Back to the Source: Diffusion-Driven Test-Time Adaptation
Jin Gao, Jialing Zhang, Xihui Liu, Trevor Darrell, Evan Shelhamer, Dequan Wang
arXiv 2022. [Paper] [Github]
7 Jul 2022

Guided Diffusion Model for Adversarial Purification from Random Noise
Quanlin Wu, Hang Ye, Yuntian Gu
arXiv 2022. [Paper]
17 Jun 2022

(Certified!!) Adversarial Robustness for Free!
Nicholas Carlini, Florian Tramer, Krishnamurthy (Dj)Dvijotham, J. Zico Kolter
ICLR 2023. [Paper]
21 Jun 2022

Guided Diffusion Model for Adversarial Purification
Jinyi Wang, Zhaoyang Lyu, Dahua Lin, Bo Dai, Hongfei Fu
ICML 2022. [Paper] [Github]
30 May 2022

Diffusion Models for Adversarial Purification
Weili Nie, Brandon Guo, Yujia Huang, Chaowei Xiao, Arash Vahdat, Anima Anandkumar
ICML 2022. [Paper] [Project] [Github]
16 May 2022

TFDPM: Attack detection for cyber-physical systems with diffusion probabilistic models
Tijin Yan, Tong Zhou, Yufeng Zhan, Yuanqing Xia
Elsveier Knowledge-Based Systems 2021. [Paper]
20 Dec 2021

Adversarial purification with Score-based generative models
Jongmin Yoon, Sung Ju Hwang, Juho Lee
ICML 2021. [Paper] [Github]
11 Jun 2021

Miscellany

SD4Match: Learning to Prompt Stable Diffusion Model for Semantic Matching
Xinghui Li, Jingyi Lu, Kai Han, Victor Prisacariu
arXiv 2023. [Paper]
26 Oct 2023

Dolfin: Diffusion Layout Transformers without Autoencoder
Yilin Wang, Zeyuan Chen, Liangjun Zhong, Zheng Ding, Zhizhou Sha, Zhuowen Tu
arXiv 2023. [Paper]
25 Oct 2023

Removing Dust from CMB Observations with Diffusion Models
David Heurtel-Depeiges, Blakesley Burkhart, Ruben Ohana, Bruno Régaldo-Saint Blancard
arXiv 2023. [Paper]
25 Oct 2023

On the Inherent Privacy Properties of Discrete Denoising Diffusion Models
Rongzhe Wei, Eleonora Kreačić, Haoyu Wang, Haoteng Yin, Eli Chien, Vamsi K. Potluru, Pan Li
arXiv 2023. [Paper]
24 Oct 2023

Unified High-binding Watermark for Unconditional Image Generation Models
Ruinan Ma, Yu-an Tan, Shangbo Wu, Tian Chen, Yajie Wang, Yuanzhang Li
arXiv 2023. [Paper]
14 Oct 2023

Monsters in the Dark: Sanitizing Hidden Threats with Diffusion Models
Preston K. Robinette, Daniel Moyer, Taylor T. Johnson
arXiv 2023. [Paper]
10 Oct 2023

EasyPhoto: Your Smart AI Photo Generator
Ziheng Wu, Jiaqi Xu, Xinyi Zou, Kunzhe Huang, Xing Shi, Jun Huang
arXiv 2023. [Paper]
7 Oct 2023

SeeDS: Semantic Separable Diffusion Synthesizer for Zero-shot Food Detection
Pengfei Zhou, Weiqing Min, Yang Zhang, Jiajun Song, Ying Jin, Shuqiang Jiang
arXiv 2023. [Paper]
7 Oct 2023

VI-Diff: Unpaired Visible-Infrared Translation Diffusion Model for Single Modality Labeled Visible-Infrared Person Re-identification
Han Huang, Yan Huang, Liang Wang
arXiv 2023. [Paper]
6 Oct 2023

Leveraging Diffusion Disentangled Representations to Mitigate Shortcuts in Underspecified Visual Tasks
Luca Scimeca, Alexander Rubinstein, Armand Mihai Nicolicioiu, Damien Teney, Yoshua Bengio
arXiv 2023. [Paper]
3 Oct 2023

Mirror Diffusion Models for Constrained and Watermarked Generation
Guan-Horng Liu, Tianrong Chen, Evangelos A. Theodorou, Molei Tao
arXiv 2023. [Paper]
2 Oct 2023

SatDM: Synthesizing Realistic Satellite Image with Semantic Layout Conditioning using Diffusion Models
Orkhan Baghirli, Hamid Askarov, Imran Ibrahimli, Ismat Bakhishov, Nabi Nabiyev
arXiv 2023. [Paper]
28 Sep 2023

Compositional Sculpting of Iterative Generative Processes
Timur Garipov, Sebastiaan De Peuter, Ge Yang, Vikas Garg, Samuel Kaski, Tommi Jaakkola
arXiv 2023. [Paper]
28 Sep 2023

Exploiting the Signal-Leak Bias in Diffusion Models
Martin Nicolas Everaert, Athanasios Fitsios, Marco Bocchio, Sami Arpa, Sabine Süsstrunk, Radhakrishna Achanta
arXiv 2023. [Paper]
27 Sep 2023

Diffusion-based Holistic Texture Rectification and Synthesis
Guoqing Hao, Satoshi Iizuka, Kensho Hara, Edgar Simo-Serra, Hirokatsu Kataoka, Kazuhiro Fukui
arXiv 2023. [Paper]
26 Sep 2023

On quantifying and improving realism of images generated with diffusion
Yunzhuo Chen, Naveed Akhtar, Nur Al Hasan Haldar, Ajmal Mian
arXiv 2023. [Paper]
26 Sep 2023

Assessing the capacity of a denoising diffusion probabilistic model to reproduce spatial context
Rucha Deshpande, Muzaffer Özbey, Hua Li, Mark A. Anastasio, Frank J. Brooks
arXiv 2023. [Paper]
19 Sep 2023

AutoDiffusion: Training-Free Optimization of Time Steps and Architectures for Automated Diffusion Model Acceleration
Lijiang Li, Huixia Li, Xiawu Zheng, Jie Wu, Xuefeng Xiao, Rui Wang, Min Zheng, Xin Pan, Fei Chao, Rongrong Ji
arXiv 2023. [Paper]
19 Sep 2023

LiDAR Data Synthesis with Denoising Diffusion Probabilistic Models
Kazuto Nakashima, Ryo Kurazume
arXiv 2023. [Paper] [Github]
17 Sep 2023

Detail Reinforcement Diffusion Model: Augmentation Fine-Grained Visual Categorization in Few-Shot Conditions
Tianxu Wu, Shuo Ye, Shuhuang Chen, Qinmu Peng, Xinge You
arXiv 2023. [Paper]
15 Sep 2023

Boosting Unsupervised Contrastive Learning Using Diffusion-Based Data Augmentation From Scratch
Zelin Zang, Hao Luo, Kai Wang, Panpan Zhang, Fan Wang, Stan. Z Li, Yang You
arXiv 2023. [Paper]
10 Sep 2023

Unbiased Face Synthesis With Diffusion Models: Are We There Yet?
Harrison Rosenberg, Shimaa Ahmed, Guruprasad V Ramesh, Ramya Korlakai Vinayak, Kassem Fawaz
arXiv 2023. [Paper]
13 Sep 2023

Mitigate Replication and Copying in Diffusion Models with Generalized Caption and Dual Fusion Enhancement
Chenghao Li, Dake Chen, Yuke Zhang, Peter A. Beerel
arXiv 2023. [Paper]
13 Sep 2023

Diffusion-Guided Reconstruction of Everyday Hand-Object Interaction Clips
Yufei Ye, Poorvi Hebbar, Abhinav Gupta, Shubham Tulsiani
arXiv 2023. [Paper]
11 Sep 2023

Decoding visual brain representations from electroencephalography through Knowledge Distillation and latent diffusion models
Matteo Ferrante, Tommaso Boccato, Stefano Bargione, Nicola Toschi
arXiv 2023. [Paper]
8 Sep 2023

Diffusion on the Probability Simplex
Griffin Floto, Thorsteinn Jonsson, Mihai Nica, Scott Sanner, Eric Zhengyu Zhu
ICML Workshop 2023. [Paper]
5 Sep 2023

Robustness and Generalizability of Deepfake Detection: A Study with Diffusion Models
Haixu Song, Shiyu Huang, Yinpeng Dong, Wei-Wei Tu
arXiv 2023. [Paper] [Github]
5 Sep 2023

ControlMat: A Controlled Generative Approach to Material Capture
Giuseppe Vecchio, Rosalie Martin, Arthur Roullier, Adrien Kaiser, Romain Rouffet, Valentin Deschaintre, Tamy Boubekeur
arXiv 2023. [Paper]
4 Sep 2023

Softmax Bias Correction for Quantized Generative Models
Nilesh Prasad Pandey, Marios Fournarakis, Chirag Patel, Markus Nagel
arXiv 2023. [Paper]
4 Sep 2023

DiffHPE: Robust, Coherent 3D Human Pose Lifting with Diffusion
Cédric Rommel, Eduardo Valle, Mickaël Chen, Souhaiel Khalfaoui, Renaud Marlet, Matthieu Cord, Patrick Pérez
arXiv 2023. [Paper]
4 Sep 2023

RSDiff: Remote Sensing Image Generation from Text Using Diffusion Model
Ahmad Sebaq, Mohamed ElHelw
arXiv 2023. [Paper]
3 Sep 2023

Diffusion Model with Clustering-based Conditioning for Food Image Generation
Yue Han, Jiangpeng He, Mridul Gupta, Edward J. Delp, Fengqing Zhu
MADiMa 2023. [Paper]
1 Sep 2023

Generate Your Own Scotland: Satellite Image Generation Conditioned on Maps
Miguel Espinosa, Elliot J. Crowley
arXiv 2023. [Paper] [Github]
31 Aug 2023

Diffusion Models for Interferometric Satellite Aperture Radar
Alexandre Tuel, Thomas Kerdreux, Claudia Hulbert, Bertrand Rouet-Leduc
arXiv 2023. [Paper]
31 Aug 2023

MFR-Net: Multi-faceted Responsive Listening Head Generation via Denoising Diffusion Model
Jin Liu, Xi Wang, Xiaomeng Fu, Yesheng Chai, Cai Yu, Jiao Dai, Jizhong Han
ACM MM 2023. [Paper]
31 Aug 2023

SignDiff: Learning Diffusion Models for American Sign Language Production
Sen Fang, Chunyu Sui, Xuedong Zhang, Yapeng Tian
arXiv 2023. [Paper]
30 Aug 2023

DiffuVolume: Diffusion Model for Volume based Stereo Matching
Dian Zheng, Xiao-Ming Wu, Zuhao Liu, Jingke Meng, Wei-shi Zheng
arXiv 2023. [Paper]
30 Aug 2023

Feature Attention Network (FA-Net): A Deep-Learning Based Approach for Underwater Single Image Enhancement
Muhammad Hamza, Ammar Hawbani, Sami Ul Rehman, Xingfu Wang, Liang Zhao
arXiv 2023. [Paper]
30 Aug 2023

Total Selfie: Generating Full-Body Selfies
Bowei Chen, Brian Curless, Ira Kemelmacher-Shlizerman, Steve Seitz
arXiv 2023. [Paper] [Project]
28 Aug 2023

Unsupervised Domain Adaptation via Domain-Adaptive Diffusion
Duo Peng, Qiuhong Ke, Yinjie Lei, Jun Liu
arXiv 2023. [Paper]
26 Aug 2023

SDeMorph: Towards Better Facial De-morphing from Single Morph
Nitish Shukla
arXiv 2023. [Paper]
22 Aug 2023

Hey That’s Mine Imperceptible Watermarks are Preserved in Diffusion Generated Outputs
Luke Ditria, Tom Drummond
arXiv 2023. [Paper]
22 Aug 2023

MatFuse: Controllable Material Generation with Diffusion Models
Giuseppe Vecchio, Renato Sortino, Simone Palazzo, Concetto Spampinato
arXiv 2023. [Paper]
22 Aug 2023

ControlCom: Controllable Image Composition using Diffusion Model
Bo Zhang, Yuxuan Duan, Jun Lan, Yan Hong, Huijia Zhu, Weiqiang Wang, Li Niu
arXiv 2023. [Paper]
19 Aug 2023

DUAW: Data-free Universal Adversarial Watermark against Stable Diffusion Customization
Xiaoyu Ye, Hao Huang, Jiaqi An, Yongtao Wang
arXiv 2023. [Paper]
19 Aug 2023

Diff-CAPTCHA: An Image-based CAPTCHA with Security Enhanced by Denoising Diffusion Model
Ran Jiang, Sanfeng Zhang, Linfeng Liu, Yanbing Peng
arXiv 2023. [Paper]
16 Aug 2023

U-Turn Diffusion
Hamidreza Behjoo, Michael Chertkov
arXiv 2023. [Paper]
14 Aug 2023

Diffusion-based Visual Counterfactual Explanations – Towards Systematic Quantitative Evaluation
Philipp Vaeth, Alexander M. Fruehwald, Benjamin Paassen, Magda Gregorova
arXiv 2023. [Paper]
11 Aug 2023

DiffCR: A Fast Conditional Diffusion Framework for Cloud Removal from Optical Satellite Images
Xuechao Zou, Kai Li, Junliang Xing, Yu Zhang, Shiying Wang, Lei Jin, Pin Tao
arXiv 2023. [Paper]
8 Aug 2023

Towards Personalized Prompt-Model Retrieval for Generative Recommendation
Yuanhe Guo, Haoming Liu, Hongyi Wen
arXiv 2023. [Paper] [Github]
4 Aug 2023

Training Data Protection with Compositional Diffusion Models
Aditya Golatkar, Alessandro Achille, Ashwin Swaminathan, Stefano Soatto
arXiv 2023. [Paper]
2 Aug 2023

Contrast-augmented Diffusion Model with Fine-grained Sequence Alignment for Markup-to-Image Generation
Guojin Zhong, Jin Yuan, Pan Wang, Kailun Yang, Weili Guan, Zhiyong Li
ACM MM 2023. [Paper]
2 Aug 2023

RGB-D-Fusion: Image Conditioned Depth Diffusion of Humanoid Subjects
Sascha Kirch, Valeria Olyunina, Jan Ondřej, Rafael Pagés, Sergio Martin, Clara Pérez-Molina
arXiv 2023. [Paper]
29 Jul 2023

Not with my name! Inferring artists’ names of input strings employed by Diffusion Models
Roberto Leotta, Oliver Giudice, Luca Guarnera, Sebastiano Battiato
arXiv 2023. [Paper] [Github]
25 Jul 2023

Data-free Black-box Attack based on Diffusion Model
Mingwen Shao, Lingzhuang Meng, Yuanjian Qiao, Lixu Zhang, Wangmeng Zuo
arXiv 2023. [Paper]
24 Jul 2023

Diffusion Models for Probabilistic Deconvolution of Galaxy Images
Zhiwei Xue, Yuhang Li, Yash Patel, Jeffrey Regier
arXiv 2023. [Paper] [Github]
20 Jul 2023

BSDM: Background Suppression Diffusion Model for Hyperspectral Anomaly Detection
Jitao Ma, Weiying Xie, Yunsong Li, Leyuan Fang
arXiv 2023. [Paper]
19 Jul 2023

Unstoppable Attack: Label-Only Model Inversion via Conditional Diffusion Model
Rongke Liu
arXiv 2023. [Paper]
17 Jul 2023

LafitE: Latent Diffusion Model with Feature Editing for Unsupervised Multi-class Anomaly Detection
Haonan Yin, Guanlong Jiao, Qianhui Wu, Borje F. Karlsson, Biqing Huang, Chin Yew Lin
arXiv 2023. [Paper]
16 Jul 2023

Improved Flood Insights: Diffusion-Based SAR to EO Image Translation
Minseok Seo, Youngtack Oh, Doyi Kim, Dongmin Kang, Yeji Choi
arXiv 2023. [Paper]
14 Jul 2023

Exposing the Fake: Effective Diffusion-Generated Images Detection
Ruipeng Ma, Jinhao Duan, Fei Kong, Xiaoshuang Shi, Kaidi Xu
ICML 2023. [Paper]
12 Jul 2023

On the Vulnerability of DeepFake Detectors to Attacks Generated by Denoising Diffusion Models
Marija Ivanovska, Vitomir Štruc
arXiv 2023. [Paper]
11 Jul 2023

Unsupervised 3D out-of-distribution detection with latent diffusion models
Mark S. Graham, Walter Hugo Lopez Pinaya, Paul Wright, Petru-Daniel Tudosiu, Yee H. Mah, James T. Teo, H. Rolf Jäger, David Werring, Parashkev Nachev, Sebastien Ourselin, M. Jorge Cardoso
arXiv 2023. [Paper] [Github]
7 Jul 2023

Hyperspectral and Multispectral Image Fusion Using the Conditional Denoising Diffusion Probabilistic Model
Shuaikai Shi, Lijun Zhang, Jie Chen
arXiv 2023. [Paper]
7 Jul 2023

Censored Sampling of Diffusion Models Using 3 Minutes of Human Feedback
TaeHo Yoon, Kibeom Myoung, Keon Lee, Jaewoong Cho, Albert No, Ernest K. Ryu
arXiv 2023. [Paper] [Github]
6 Jul 2023

Detecting Images Generated by Deep Diffusion Models using their Local Intrinsic Dimensionality
Peter Lorenz, Ricard Durall, Janis Keuper
arXiv 2023. [Paper]
5 Jul 2023

Diffusion Models for Computational Design at the Example of Floor Plans
Joern Ploennigs, Markus Berger
arXiv 2023. [Paper]
5 Jul 2023

RADiff: Controllable Diffusion Models for Radio Astronomical Maps Generation
Renato Sortino, Thomas Cecconello, Andrea DeMarco, Giuseppe Fiameni, Andrea Pilzer, Andrew M. Hopkins, Daniel Magro, Simone Riggi, Eva Sciacca, Adriano Ingallinera, Cristobal Bordiu, Filomena Bufano, Concetto Spampinato
arXiv 2023. [Paper]
5 Jul 2023

TomatoDIFF: On-plant Tomato Segmentation with Denoising Diffusion Models
Marija Ivanovska, Vitomir Struc, Janez Pers
arXiv 2023. [Paper]
3 Jul 2023

Squeezing Large-Scale Diffusion Models for Mobile
Jiwoong Choi, Minkyu Kim, Daehyun Ahn, Taesu Kim, Yulhwa Kim, Dongwon Jo, Hyesung Jeon, Jae-Joon Kim, Hyungjun Kim
arXiv 2023. [Paper]
3 Jul 2023

Class-Incremental Learning using Diffusion Model for Distillation and Replay
Quentin Jodelet, Xin Liu, Yin Jun Phua, Tsuyoshi Murata
arXiv 2023. [Paper]
30 Jun 2023

ID-Pose: Sparse-view Camera Pose Estimation by Inverting Diffusion Models
Weihao Cheng, Yan-Pei Cao, Ying Shan
arXiv 2023. [Paper]
29 Jun 2023

Learning Structure-Guided Diffusion Model for 2D Human Pose Estimation
Zhongwei Qiu, Qiansheng Yang, Jian Wang, Xiyu Wang, Chang Xu, Dongmei Fu, Kun Yao, Junyu Han, Errui Ding, Jingdong Wang
arXiv 2023. [Paper]
29 Jun 2023

DiffusionSTR: Diffusion Model for Scene Text Recognition
Masato Fujitake
arXiv 2023. [Paper]
29 Jun 2023

Filtered-Guided Diffusion: Fast Filter Guidance for Black-Box Diffusion Models
Zeqi Gu, Abe Davis
arXiv 2023. [Paper] [Github]
29 Jun 2023

ID-Pose: Sparse-view Camera Pose Estimation by Inverting Diffusion Models
Weihao Cheng, Yan-Pei Cao, Ying Shan
arXiv 2023. [Paper]
29 Jun 2023

Face Morphing Attack Detection with Denoising Diffusion Probabilistic Models
Marija Ivanovska, Vitomir Štruc
arXiv 2023. [Paper]
27 Jun 2023

PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment
Jianyuan Wang, Christian Rupprecht, David Novotny
arXiv 2023. [Paper]
27 Jun 2023

Fuzzy-Conditioned Diffusion and Diffusion Projection Attention Applied to Facial Image Correction
Majed El Helou
arXiv 2023. [Paper]
26 Jun 2023

Towards More Realistic Membership Inference Attacks on Large Diffusion Models
Jan Dubiński, Antoni Kowalczuk, Stanisław Pawlak, Przemysław Rokita, Tomasz Trzciński, Paweł Morawiecki
arXiv 2023. [Paper]
22 Jun 2023

DiffWA: Diffusion Models for Watermark Attack
Xinyu Li
arXiv 2023. [Paper]
22 Jun 2023

Improving visual image reconstruction from human brain activity using latent diffusion models via multiple decoded inputs
Yu Takagi, Shinji Nishimoto
arXiv 2023. [Paper]
20 Jun 2023

Diffusion model based data generation for partial differential equations
Rucha Apte, Sheel Nidhan, Rishikesh Ranade, Jay Pathak
arXiv 2023. [Paper]
19 Jun 2023

GenPose: Generative Category-level Object Pose Estimation via Diffusion Models
Jiyao Zhang, Mingdong Wu, Hao Dong
arXiv 2023. [Paper]
18 Jun 2023

Drag-guided diffusion models for vehicle image generation
Nikos Arechiga, Frank Permenter, Binyang Song, Chenyang Yuan
arXiv 2023. [Paper]
16 Jun 2023

R2-Diff: Denoising by diffusion as a refinement of retrieved motion for image-based motion prediction
Takeru Oba, Norimichi Ukita
arXiv 2023. [Paper]
15 Jun 2023

On the Robustness of Latent Diffusion Models
Jianping Zhang, Zhuoer Xu, Shiwen Cui, Changhua Meng, Weibin Wu, Michael R. Lyu
arXiv 2023. [Paper]
14 Jun 2023

VillanDiffusion: A Unified Backdoor Attack Framework for Diffusion Models
Sheng-Yen Chou, Pin-Yu Chen, Tsung-Yi Ho
arXiv 2023. [Paper]
12 Jun 2023

Boosting GUI Prototyping with Diffusion Models
Jialiang Wei, Anne-Lise Courbis, Thomas Lambolais, Binbin Xu, Pierre Louis Bernard, Gérard Dray
arXiv 2023. [Paper]
9 Jun 2023

Extraction and Recovery of Spatio-Temporal Structure in Latent Dynamics Alignment with Diffusion Model
Yule Wang, Zijing Wu, Chengrui Li, Anqi Wu
arXiv 2023. [Paper] [Github]
9 Jun 2023

Beyond Surface Statistics: Scene Representations in a Latent Diffusion Model
Yida Chen, Fernanda Viégas, Martin Wattenberg
arXiv 2023. [Paper]
9 Jun 2023

PriSampler: Mitigating Property Inference of Diffusion Models
Hailong Hu, Jun Pang
arXiv 2023. [Paper]
8 Jun 2023

Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models
George Stein, Jesse C. Cresswell, Rasa Hosseinzadeh, Yi Sui, Brendan Leigh Ross, Valentin Villecroze, Zhaoyan Liu, Anthony L. Caterini, J. Eric T. Taylor, Gabriel Loaiza-Ganem
arXiv 2023. [Paper] [Github]
7 Jun 2023

Phoenix: A Federated Generative Diffusion Model
Fiona Victoria Stanley Jothiraj, Afra Mashhadi
arXiv 2023. [Paper]
7 Jun 2023

High-dimensional and Permutation Invariant Anomaly Detection
Vinicius Mikuni, Benjamin Nachman
arXiv 2023. [Paper]
6 Jun 2023

Emergent Correspondence from Image Diffusion
Luming Tang, Menglin Jia, Qianqian Wang, Cheng Perng Phoo, Bharath Hariharan
arXiv 2023. [Paper]
6 Jun 2023

Phoenix: A Federated Generative Diffusion Model
Fiona Victoria Stanley Jothiraj, Afra Mashhadi
arXiv 2023. [Paper]
7 Jun 2023

Change Diffusion: Change Detection Map Generation Based on Difference-Feature Guided DDPM
Yihan Wen, Jialu Sui, Xianping Ma, Wendi Liang, Xiaokang Zhang, Man-On Pun
arXiv 2023. [Paper]
6 Jun 2023

Towards Visual Foundational Models of Physical Scenes
Chethan Parameshwara, Alessandro Achille, Matthew Trager, Xiaolong Li, Jiawei Mo, Matthew Trager, Ashwin Swaminathan, CJ Taylor, Dheera Venkatraman, Xiaohan Fei, Stefano Soatto
arXiv 2023. [Paper]
6 Jun 2023

Protecting the Intellectual Property of Diffusion Models by the Watermark Diffusion Process
Sen Peng, Yufei Chen, Cong Wang, Xiaohua Jia
arXiv 2023. [Paper]
6 Jun 2023

Emergent Correspondence from Image Diffusion
Luming Tang, Menglin Jia, Qianqian Wang, Cheng Perng Phoo, Bharath Hariharan
arXiv 2023. [Paper] [Project]
6 Jun 2023

Enhance Diffusion to Improve Robust Generalization
Jianhui Sun, Sanchit Sinha, Aidong Zhang
arXiv 2023. [Paper]
5 Jun 2023

Training Data Attribution for Diffusion Models
Zheng Dai, David K Gifford
arXiv 2023. [Paper]
3 Jun 2023

Deep Classifier Mimicry without Data Access
Steven Braun, Martin Mundt, Kristian Kersting
arXiv 2023. [Paper]
3 Jun 2023

Quantifying Sample Anonymity in Score-Based Generative Models with Adversarial Fingerprinting
Mischa Dombrowski, Bernhard Kainz
arXiv 2023. [Paper]
2 Jun 2023

Generative Autoencoders as Watermark Attackers: Analyses of Vulnerabilities and Threats
Xuandong Zhao, Kexun Zhang, Yu-Xiang Wang, Lei Li
arXiv 2023. [Paper]
2 Jun 2023

PolyDiffuse: Polygonal Shape Reconstruction via Guided Set Diffusion Models
Jiacheng Chen, Ruizhi Deng, Yasutaka Furukawa
arXiv 2023. [Paper]
2 Jun 2023

Unlearnable Examples for Diffusion Models: Protect Data from Unauthorized Exploitation
Zhengyue Zhao, Jinhao Duan, Xing Hu, Kaidi Xu, Chenan Wang, Rui Zhang, Zidong Du, Qi Guo, Yunji Chen
arXiv 2023. [Paper]
2 Jun 2023

Robust Backdoor Attack with Visible, Semantic, Sample-Specific, and Compatible Triggers
Ruotong Wang, Hongrui Chen, Zihao Zhu, Li Liu, Yong Zhang, Yanbo Fan, Baoyuan Wu
arXiv 2023. [Paper]
1 Jun 2023

Tree-Ring Watermarks: Fingerprints for Diffusion Images that are Invisible and Robust
Yuxin Wen, John Kirchenbauer, Jonas Geiping, Tom Goldstein
arXiv 2023. [Paper] [Github]
31 May 2023

GANDiffFace: Controllable Generation of Synthetic Datasets for Face Recognition with Realistic Variations
Pietro Melzi, Christian Rathgeb, Ruben Tolosana, Ruben Vera-Rodriguez, Dominik Lawatsch, Florian Domin, Maxim Schaubert
arXiv 2023. [Paper]
31 May 2023

Improving Handwritten OCR with Training Samples Generated by Glyph Conditional Denoising Diffusion Probabilistic Model
Haisong Ding, Bozhi Luan, Dongnan Gui, Kai Chen, Qiang Huo
arXiv 2023. [Paper]
31 May 2023

Label-Retrieval-Augmented Diffusion Models for Learning from Noisy Labels
Jian Chen, Ruiyi Zhang, Tong Yu, Rohan Sharma, Zhiqiang Xu, Tong Sun, Changyou Chen
arXiv 2023. [Paper] [Github]
31 May 2023

DiffMatch: Diffusion Model for Dense Matching
Jisu Nam, Gyuseong Lee, Sunwoo Kim, Hyeonsu Kim, Hyoungwon Cho, Seyeon Kim, Seungryong Kim
arXiv 2023. [Paper] [Project]
30 May 2023

Calliffusion: Chinese Calligraphy Generation and Style Transfer with Diffusion Modeling
Qisheng Liao, Gus Xia, Zhinuo Wang
arXiv 2023. [Paper]
30 May 2023

Diffusion-Stego: Training-free Diffusion Generative Steganography via Message Projection
Daegyu Kim, Chaehun Shin, Jooyoung Choi, Dahuin Jung, Sungroh Yoon
arXiv 2023. [Paper]
30 May 2023

On Diffusion Modeling for Anomaly Detection
Victor Livernoche, Vineet Jain, Yashar Hezaveh, Siamak Ravanbakhsh
arXiv 2023. [Paper]
29 May 2023

Aligning Optimization Trajectories with Diffusion Models for Constrained Design Generation
Giorgio Giannone, Akash Srivastava, Ole Winther, Faez Ahmed
arXiv 2023. [Paper]
29 May 2023

Generating Driving Scenes with Diffusion
Ethan Pronovost, Kai Wang, Nick Roy
arXiv 2023. [Paper]
29 May 2023

Generative Diffusion for 3D Turbulent Flows
Marten Lienen, Jan Hansen-Palmus, David Lüdke, Stephan Günnemann
arXiv 2023. [Paper]
29 May 2023

GlyphControl: Glyph Conditional Control for Visual Text Generation
Yukang Yang, Dongnan Gui, Yuhui Yuan, Haisong Ding, Han Hu, Kai Chen
arXiv 2023. [Paper] [Github]
29 May 2023

High-Fidelity Image Compression with Score-based Generative Models
Emiel Hoogeboom, Eirikur Agustsson, Fabian Mentzer, Luca Versari, George Toderici, Lucas Theis
arXiv 2023. [Paper]
26 May 2023

CamoDiffusion: Camouflaged Object Detection via Conditional Diffusion Models
Zhongxi Chen, Ke Sun, Xianming Lin, Rongrong Ji
arXiv 2023. [Paper]
29 May 2023

DiffusionNAG: Task-guided Neural Architecture Generation with Diffusion Models
Sohyun An, Hayeon Lee, Jaehyeong Jo, Seanie Lee, Sung Ju Hwang
arXiv 2023. [Paper]
26 May 2023

CRoSS: Diffusion Model Makes Controllable, Robust and Secure Image Steganography
Jiwen Yu, Xuanyu Zhang, Youmin Xu, Jian Zhang
arXiv 2023. [Paper] [Github]
26 May 2023

DiffusionShield: A Watermark for Copyright Protection against Generative Diffusion Models
Yingqian Cui, Jie Ren, Han Xu, Pengfei He, Hui Liu, Lichao Sun, Jiliang Tang
arXiv 2023. [Paper]
25 May 2023

Realistic Noise Synthesis with Diffusion Models
Qi Wu, Mingyan Han, Ting Jiang, Haoqiang Fan, Bing Zeng, Shuaicheng Liu
arXiv 2023. [Paper]
23 May 2023

Anomaly Detection with Conditioned Denoising Diffusion Models
Arian Mousakhan, Thomas Brox, Jawad Tayyub
arXiv 2023. [Paper]
25 May 2023

Anomaly Detection in Satellite Videos using Diffusion Models
Akash Awasthi, Son Ly, Jaer Nizam, Samira Zare, Videet Mehta, Safwan Ahmed, Keshav Shah, Ramakrishna Nemani, Saurabh Prasad, Hien Van Nguyen
arXiv 2023. [Paper]
25 May 2023

Knowledge Diffusion for Distillation
Tao Huang, Yuan Zhang, Mingkai Zheng, Shan You, Fei Wang, Chen Qian, Chang Xu
arXiv 2023. [Paper] [Github]
25 May 2023

Zero-shot Generation of Training Data with Denoising Diffusion Probabilistic Model for Handwritten Chinese Character Recognition
Dongnan Gui, Kai Chen, Haisong Ding, Qiang Huo
arXiv 2023. [Paper]
25 May 2023

Unsupervised Semantic Correspondence Using Stable Diffusion
Eric Hedlin, Gopal Sharma, Shweta Mahajan, Hossam Isack, Abhishek Kar, Andrea Tagliasacchi, Kwang Moo Yi
arXiv 2023. [Paper]
24 May 2023

A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence
Junyi Zhang, Charles Herrmann, Junhwa Hur, Luisa Polania Cabrera, Varun Jampani, Deqing Sun, Ming-Hsuan Yang
arXiv 2023. [Paper] [Project]
24 May 2023

Diffusion Hyperfeatures: Searching Through Time and Space for Semantic Correspondence
Grace Luo, Lisa Dunlap, Dong Huk Park, Aleksander Holynski, Trevor Darrell
arXiv 2023. [Paper] [Project]
23 May 2023

DiffProtect: Generate Adversarial Examples with Diffusion Models for Facial Privacy Protection
Jiang Liu, Chun Pong Lau, Rama Chellappa
arXiv 2023. [Paper]
23 May 2023

GSURE-Based Diffusion Model Training with Corrupted Data
Bahjat Kawar, Noam Elata, Tomer Michaeli, Michael Elad
arXiv 2023. [Paper] [Github]
22 May 2023

Watermarking Diffusion Model
Yugeng Liu, Zheng Li, Michael Backes, Yun Shen, Yang Zhang
arXiv 2023. [Paper]
21 May 2023

DiffUCD:Unsupervised Hyperspectral Image Change Detection with Semantic Correlation Diffusion Model
Xiangrong Zhang, Shunli Tian, Guanchun Wang, Huiyu Zhou, Licheng Jiao
arXiv 2023. [Paper]
21 May 2023

Incomplete Multi-view Clustering via Diffusion Completion
Sifan Fang
arXiv 2023. [Paper]
19 May 2023

SlotDiffusion: Object-Centric Generative Modeling with Diffusion Models
Ziyi Wu, Jingyu Hu, Wuyue Lu, Igor Gilitschenski, Animesh Garg
arXiv 2023. [Paper]
18 May 2023

Selective Guidance: Are All the Denoising Steps of Guided Diffusion Important?
Pareesa Ameneh Golnari, Zhewei Yao, Yuxiong He
arXiv 2023. [Paper]
16 May 2023

A Method for Training-free Person Image Picture Generation
Tianyu Chen
ICOAI 2023. [Paper]
16 May 2023

Constructing a personalized AI assistant for shear wall layout using Stable Diffusion
Lufeng Wang, Jiepeng Liu, Guozhong Cheng, En Liu, Wei Chen
arXiv 2023. [Paper]
18 May 2023

DiffUTE: Universal Text Editing Diffusion Model
Chen, Haoxing, Xu, Zhuoer, Gu, Zhangxuan, Lan, Jun, Zheng, Xing, Li, Yaohui, Meng, Changhua, Zhu, Huijia, Wang, Weiqiang
arXiv 2023. [Paper] [Github]
18 May 2023

CDDM: Channel Denoising Diffusion Models for Wireless Communications
Tong Wu, Zhiyong Chen, Dazhi He, Liang Qian, Yin Xu, Meixia Tao, Wenjun Zhang
arXiv 2023. [Paper]
16 May 2023

Unlearnable Examples Give a False Sense of Security: Piercing through Unexploitable Data with Learnable Examples
Wan Jiang, Yunfeng Diao, He Wang, Jianxin Sun, Meng Wang, Richang Hong
arXiv 2023. [Paper]
16 May 2023

Diffusion Dataset Generation: Towards Closing the Sim2Real Gap for Pedestrian Detection
Andrew Farley, Mohsen Zand, Michael Greenspan
CRV 2023. [Paper]
16 May 2023

A Reproducible Extraction of Training Images from Diffusion Models
Ryan Webster
arXiv 2023. [Paper] [Github]
15 May 2023

Laughing Matters: Introducing Laughing-Face Generation using Diffusion Models
Antoni Bigata Casademunt, Rodrigo Mira, Nikita Drobyshev, Konstantinos Vougioukas, Stavros Petridis, Maja Pantic
arXiv 2023. [Paper]
15 May 2023

Manipulating Visually-aware Federated Recommender Systems and Its Countermeasures
Wei Yuan, Shilong Yuan, Chaoqun Yang, Quoc Viet Hung Nguyen, Hongzhi Yin
arXiv 2023. [Paper]
14 May 2023

Undercover Deepfakes: Detecting Fake Segments in Videos
Sanjay Saha, Rashindrie Perera, Sachith Seneviratne, Tamasha Malepathirana, Sanka Rasnayaka, Deshani Geethika, Terence Sim, Saman Halgamuge
arXiv 2023. [Paper] [Github]
11 May 2023

Comprehensive Dataset of Synthetic and Manipulated Overhead Imagery for Development and Evaluation of Forensic Tools
Brandon B. May, Kirill Trapeznikov, Shengbang Fang, Matthew C. Stamm
arXiv 2023. [Paper]
9 May 2023

DifFIQA: Face Image Quality Assessment Using Denoising Diffusion Probabilistic Models
Žiga Babnik, Peter Peer, Vitomir Štruc
arXiv 2023. [Paper]
9 May 2023

Text-to-Image Diffusion Models can be Easily Backdoored through Multimodal Data Poisoning
Shengfang Zhai, Yinpeng Dong, Qingni Shen, Shi Pu, Yuejian Fang, Hang Su
arXiv 2023. [Paper]
7 May 2023

Exploring One-shot Semi-supervised Federated Learning with A Pre-trained Diffusion Model
Mingzhao Yang, Shangchao Su, Bin Li, Xiangyang Xue
arXiv 2023. [Paper]
6 May 2023

Towards Prompt-robust Face Privacy Protection via Adversarial Decoupling Augmentation Framework
Ruijia Wu, Yuhang Wang, Huafeng Shi, Zhipeng Yu, Yichao Wu, Ding Liang
arXiv 2023. [Paper] 6 May 2023

Conditional Diffusion Feature Refinement for Continuous Sign Language Recognition
Leming Guo, Wanli Xue, Qing Guo, Yuxi Zhou, Tiantian Yuan, Shengyong Chen
arXiv 2023. [Paper]
5 May 2023

LayoutDM: Transformer-based Diffusion Model for Layout Generation
Shang Chai, Liansheng Zhuang, Fengying Yan
CVPR 2023. [Paper]
4 May 2023

Long-Term Rhythmic Video Soundtracker
Jiashuo Yu, Yaohui Wang, Xinyuan Chen, Xiao Sun, Yu Qiao
ICML 2023. [Paper] [Github]
2 May 2023

Putting People in Their Place: Affordance-Aware Human Insertion into Scenes
Sumith Kulal, Tim Brooks, Alex Aiken, Jiajun Wu, Jimei Yang, Jingwan Lu, Alexei A. Efros, Krishna Kumar Singh
CVPR 2023. [Paper] [Project] [Github]
27 Apr 2023

Single-View Height Estimation with Conditional Diffusion Probabilistic Models
Isaac Corley, Peyman Najafirad
arXiv 2023. [Paper]
26 Apr 2023

Diffusion Probabilistic Model Based Accurate and High-Degree-of-Freedom Metasurface Inverse Design
Zezhou Zhang, Chuanchuan Yang, Yifeng Qin, Hao Feng, Jiqiang Feng, Hongbin Li
arXiv 2023. [Paper]
25 Apr 2023

Improving Synthetically Generated Image Detection in Cross-Concept Settings
Pantelis Dogoulis, Giorgos Kordopatis-Zilos, Ioannis Kompatsiaris, Symeon Papadopoulos
arXiv 2023. [Paper]
24 Apr 2023

Fast Diffusion Probabilistic Model Sampling through the lens of Backward Error Analysis
Yansong Gao, Zhihong Pan, Xin Zhou, Le Kang, Pratik Chaudhari
arXiv 2023. [Paper]
22 Apr 2023

Speed Is All You Need: On-Device Acceleration of Large Diffusion Models via GPU-Aware Optimizations
Yu-Hui Chen, Raman Sarokin, Juhyun Lee, Jiuqiang Tang, Chuo-Ling Chang, Andrei Kulik, Matthias Grundmann
CVPR 2023 Workshop. [Paper]
21 Apr 2023

A data augmentation perspective on diffusion models and retrieval
Max F. Burg, Florian Wenzel, Dominik Zietlow, Max Horn, Osama Makansi, Francesco Locatello, Chris Russell
arXiv 2023. [Paper]
20 Apr 2023

Diffusion models with location-scale noise
Alexia Jolicoeur-Martineau, Kilian Fatras, Ke Li, Tal Kachman
arXiv 2023. [Paper]
12 Apr 2023

Exploring Diffusion Models for Unsupervised Video Anomaly Detection
Anil Osman Tur, Nicola Dall’Asen, Cigdem Beyan, Elisa Ricci
arXiv 2023. [Paper]
12 Apr 2023

CamDiff: Camouflage Image Augmentation via Diffusion Model
Xue-Jing Luo, Shuo Wang, Zongwei Wu, Christos Sakaridis, Yun Cheng, Deng-Ping Fan, Luc Van Gool
arXiv 2023. [Paper] [Github]
11 Apr 2023

DDRF: Denoising Diffusion Model for Remote Sensing Image Fusion
ZiHan Cao, ShiQi Cao, Xiao Wu, JunMing Hou, Ran Ran, Liang-Jian Deng
arXiv 2023. [Paper]
10 Apr 2023

CCLAP: Controllable Chinese Landscape Painting Generation via Latent Diffusion Model
Zhongqi Wang, Jie Zhang, Zhilong Ji, Jinfeng Bai, Shiguang Shan
arXiv 2023. [Paper]
9 Apr 2023

ChiroDiff: Modelling chirographic data with Diffusion Models
Ayan Das, Yongxin Yang, Timothy Hospedales, Tao Xiang, Yi-Zhe Song
ICLR 2023. [Paper] [Project]
7 Apr 2023

RoSteALS: Robust Steganography using Autoencoder Latent Space
Tu Bui, Shruti Agarwal, Ning Yu, John Collomosse
CVPR Workshop 2023. [Paper] [Github]
6 Apr 2023

JPEG Compressed Images Can Bypass Protections Against AI Editing
Pedro Sandoval-Segura, Jonas Geiping, Tom Goldstein
arXiv 2023. [Paper]
5 Apr 2023

Learning to Read Braille: Bridging the Tactile Reality Gap with Diffusion Models
Carolina Higuera, Byron Boots, Mustafa Mukadam
arXiv 2023. [Paper]
3 Apr 2023

Textile Pattern Generation Using Diffusion Models
Halil Faruk Karagoz, Gulcin Baykal, Irem Arikan Eksi, Gozde Unal
ITFC 2023. [Paper]
2 Apr 2023

Parents and Children: Distinguishing Multimodal DeepFakes from Natural Images
Roberto Amoroso, Davide Morelli, Marcella Cornia, Lorenzo Baraldi, Alberto Del Bimbo, Rita Cucchiara
ACM 2023. [Paper]
2 Apr 2023

NeuroDAVIS: A neural network model for data visualization
Chayan Maitra, Dibyendu B. Seal, Rajat K. De
arXiv 2023. [Paper]
1 Apr 2023

Diffusion Action Segmentation
Daochang Liu, Qiyue Li, AnhDung Dinh, Tingting Jiang, Mubarak Shah, Chang Xu
arXiv 2023. [Paper]
31 Mar 2023

One-shot Unsupervised Domain Adaptation with Personalized Diffusion Models
Yasser Benigmim, Subhankar Roy, Slim Essid, Vicky Kalogeiton, Stéphane Lathuilière
arXiv 2023. [Paper]
31 Mar 2023

DDP: Diffusion Model for Dense Visual Prediction
Yuanfeng Ji, Zhe Chen, Enze Xie, Lanqing Hong, Xihui Liu, Zhaoqiang Liu, Tong Lu, Zhenguo Li, Ping Luo
arXiv 2023. [Paper]
30 Mar 2023

WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models
Konstantina Nikolaidou, George Retsinas, Vincent Christlein, Mathias Seuret, Giorgos Sfikas, Elisa Barney Smith, Hamam Mokayed, Marcus Liwicki
arXiv 2023. [Paper]
29 Mar 2023

Visual Chain-of-Thought Diffusion Models
William Harvey, Frank Wood
arXiv 2023. [Paper]
28 Mar 2023

DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion
Sauradip Nag, Xiatian Zhu, Jiankang Deng, Yi-Zhe Song, Tao Xiang
arXiv 2023. [Paper]
27 Mar 2023

The Stable Signature: Rooting Watermarks in Latent Diffusion Models
Pierre Fernandez, Guillaume Couairon, Hervé Jégou, Matthijs Douze, Teddy Furon
arXiv 2023. [Paper] [Project]
27 Mar 2023

Exploring Continual Learning of Diffusion Models
Michał Zając, Kamil Deja, Anna Kuzina, Jakub M. Tomczak, Tomasz Trzciński, Florian Shkurti, Piotr Miłoś
arXiv 2023. [Paper]
27 Mar 2023

Freestyle Layout-to-Image Synthesis
Han Xue, Zhiwu Huang, Qianru Sun, Li Song, Wenjun Zhang
CVPR 2023. [Paper]
25 Mar 2023

Controllable Inversion of Black-Box Face-Recognition Models via Diffusion
Manuel Kansy, Anton Raël, Graziana Mignone, Jacek Naruniec, Christopher Schroers, Markus Gross, Romann M. Weber
arXiv 2023. [Paper]
23 Mar 2023

End-to-End Diffusion Latent Optimization Improves Classifier Guidance
Bram Wallace, Akash Gokul, Stefano Ermon, Nikhil Naik
arXiv 2023. [Paper]
23 Mar 2023

DiffPattern: Layout Pattern Generation via Discrete Diffusion
Zixiao Wang, Yunheng Shen, Wenqian Zhao, Yang Bai, Guojin Chen, Farzan Farnia, Bei Yu
DAC 2023. [Paper]
23 Mar 2023

Diffuse-Denoise-Count: Accurate Crowd-Counting with Diffusion Models
Yasiru Ranasinghe, Nithin Gopalakrishnan Nair, Wele Gedara Chaminda Bandara, Vishal M. Patel
arXiv 2023. [Paper]
22 Mar 2023

LayoutDiffusion: Improving Graphic Layout Generation by Discrete Diffusion Probabilistic Models
Junyi Zhang, Jiaqi Guo, Shizhao Sun, Jian-Guang Lou, Dongmei Zhang
arXiv 2023. [Paper]
21 Mar 2023

Positional Diffusion: Ordering Unordered Sets with Diffusion Probabilistic Models
Francesco Giuliari, Gianluca Scarpellini, Stuart James, Yiming Wang, Alessio Del Bue
arXiv 2023. [Paper] [Project]
20 Mar 2023

Leapfrog Diffusion Model for Stochastic Trajectory Prediction
Weibo Mao, Chenxin Xu, Qi Zhu, Siheng Chen, Yanfeng Wang
CVPR 2023. [Paper] [Github]
20 Mar 2023

Pluralistic Aging Diffusion Autoencoder
Peipei Li, Rui Wang, Huaibo Huang, Ran He, Zhaofeng He
arXiv 2023. [Paper]
20 Mar 2023

AnimeDiffusion: Anime Face Line Drawing Colorization via Diffusion Models
Yu Cao, Xiangqiao Meng, P.Y. Mok, Xueting Liu, Tong-Yee Lee, Ping Li
arxiv 2023. [Paper]
20 Mar 2023

Diffusion-based Document Layout Generation
Liu He, Yijuan Lu, John Corring, Dinei Florencio, Cha Zhang
arXiv 2023. [Paper]
19 Mar 2023

On the De-duplication of LAION-2B
Ryan Webster, Julien Rabin, Loic Simon, Frederic Jurie
arXiv 2023. [Paper]
17 Mar 2023

A Recipe for Watermarking Diffusion Models
Yunqing Zhao, Tianyu Pang, Chao Du, Xiao Yang, Ngai-Man Cheung, Min Lin
arXiv 2023. [Paper] [Github]
17 Mar 2023

DIRE for Diffusion-Generated Image Detection
Zhendong Wang, Jianmin Bao, Wengang Zhou, Weilun Wang, Hezhen Hu, Hong Chen, Houqiang Li
arXiv 2023. [Paper] [Github]
16 Mar 2023

DS-Fusion: Artistic Typography via Discriminated and Stylized Diffusion
Maham Tanveer, Yizhi Wang, Ali Mahdavi-Amiri, Hao Zhang
arXiv 2023. [Paper] [Project]
16 Mar 2023

DiffusionAD: Denoising Diffusion for Anomaly Detection
Hui Zhang, Zheng Wang, Zuxuan Wu, Yu-Gang Jiang
arXiv 2023. [Paper]
15 Mar 2023

LayoutDM: Discrete Diffusion Model for Controllable Layout Generation
Naoto Inoue, Kotaro Kikuchi, Edgar Simo-Serra, Mayu Otani, Kota Yamaguchi
CVPR 2023. [Paper] [Project] [Github]
14 Mar 2023

Parallel Vertex Diffusion for Unified Visual Grounding
Zesen Cheng, Kehan Li, Peng Jin, Xiangyang Ji, Li Yuan, Chang Liu, Jie Chen
arXiv 2023. [Paper]
13 Mar 2023

DDFM: Denoising Diffusion Model for Multi-Modality Image Fusion
Zixiang Zhao, Haowen Bai, Yuanzhi Zhu, Jiangshe Zhang, Shuang Xu, Yulun Zhang, Kai Zhang, Deyu Meng, Radu Timofte, Luc Van Gool
arXiv 2023. [Paper]
13 Mar 2023

Detecting Images Generated by Diffusers
Davide Alessandro Coccomini, Andrea Esuli, Fabrizio Falchi, Claudio Gennaro, Giuseppe Amato
arXiv 2023. [Paper]
9 Mar 2023

Unifying Layout Generation with a Decoupled Diffusion Model
Mude Hui, Zhizheng Zhang, Xiaoyi Zhang, Wenxuan Xie, Yuwang Wang, Yan Lu
CVPR 2023. [Paper]
9 Mar 2023

DLT: Conditioned layout generation with Joint Discrete-Continuous Diffusion Layout Transformer
Elad Levi, Eli Brosh, Mykola Mykhailych, Meir Perez
ICCV 2023. [Paper]
7 Mar 2023

Diffusion in the Dark: A Diffusion Model for Low-Light Text Recognition
Cindy M. Nguyen, Eric R. Chan, Alexander W. Bergman, Gordon Wetzstein
arXiv 2023. [Paper] [Project]
7 Mar 2023

Word-As-Image for Semantic Typography
Shir Iluz, Yael Vinker, Amir Hertz, Daniel Berio, Daniel Cohen-Or, Ariel Shamir
SIGGRAPH 2023. [Paper] [Project]
3 Mar 2023

Makeup Extraction of 3D Representation via Illumination-Aware Image Decomposition
Xingchao Yang, Takafumi Taketomi, Yoshihiro Kanamori
Eurographics 2023. [Paper]
26 Feb 2023

Monocular Depth Estimation using Diffusion Models
Saurabh Saxena, Abhishek Kar, Mohammad Norouzi, David J. Fleet
arXiv 2023. [Paper] [Github]
28 Feb 2023

Spatial-temporal Transformer-guided Diffusion based Data Augmentation for Efficient Skeleton-based Action Recognition
Yifan Jiang, Han Chen, Hanseok Ko
arXiv 2023. [Paper]
26 Feb 2023

LDFA: Latent Diffusion Face Anonymization for Self-driving Applications
Marvin Klemp, Kevin Rösch, Royden Wagner, Jannik Quehl, Martin Lauer
arXiv 2023. [Paper]
17 Feb 2023

Road Redesign Technique Achieving Enhanced Road Safety by Inpainting with a Diffusion Model
Sumit Mishra, Medhavi Mishra, Taeyoung Kim, Dongsoo Har
arXiv 2023. [Paper]
15 Feb 2023

Effective Data Augmentation With Diffusion Models
Brandon Trabucco, Kyle Doherty, Max Gurinas, Ruslan Salakhutdinov
arXiv 2023. [Paper] [Project]
7 Feb 2023

Learning End-to-End Channel Coding with Diffusion Models
Muah Kim, Rick Fritschek, Rafael F. Schaefer
WSA/SCC 2023. [Paper]
3 Feb 2023

Extracting Training Data from Diffusion Models
Nicholas Carlini, Jamie Hayes, Milad Nasr, Matthew Jagielski, Vikash Sehwag, Florian Tramèr, Borja Balle, Daphne Ippolito, Eric Wallace
arXiv 2023. [Paper]
2 Feb 2023

Diffusion Models for High-Resolution Solar Forecasts
Yusuke Hatanaka, Yannik Glaser, Geoff Galgon, Giuseppe Torri, Peter Sadowski
arxiv 2023. [Paper]
1 Feb 2023

A Denoising Diffusion Model for Fluid Field Prediction
Gefan Yang, Stefan Sommer
arXiv 2023. [Paper]
27 Jan 2023

Diffusion Models as Artists: Are we Closing the Gap between Humans and Machines?
Victor Boutin, Thomas Fel, Lakshya Singhal, Rishav Mukherji, Akash Nagaraj, Julien Colin, Thomas Serre
arXiv 2023. [Paper]
27 Jan 2023

PLay: Parametrically Conditioned Layout Generation using Latent Diffusion
Chin-Yi Cheng, Forrest Huang, Gang Li, Yang Li
arXiv 2023. [Paper]
27 Jan 2023

LEGO-Net: Learning Regular Rearrangements of Objects in Rooms
Qiuhong Anna Wei, Sijie Ding, Jeong Joon Park, Rahul Sajnani, Adrien Poulenard, Srinath Sridhar, Leonidas Guibas
CVPR 2023. [Paper] [Project]
23 Jan 2023

Dif-Fusion: Towards High Color Fidelity in Infrared and Visible Image Fusion with Diffusion Models
Jun Yue, Leyuan Fang, Shaobo Xia, Yue Deng, Jiayi Ma
arXiv 2023. [Paper]
19 Jan 2023

Neural Image Compression with a Diffusion-Based Decoder
Noor Fathima Goose, Jens Petersen, Auke Wiggers, Tianlin Xu, Guillaume Sautière
arXiv 2023. [Paper]
13 Jan 2023

Diffusion Models For Stronger Face Morphing Attacks
Zander Blasingame, Chen Liu
arXiv 2023. [Paper]
10 Jan 2023

AI Art in Architecture
Joern Ploennigs, Markus Berger
arXiv 2022. [Paper]
19 Dec 2022

Diffusing Surrogate Dreams of Video Scenes to Predict Video Memorability
Lorin Sweeney, Graham Healy, Alan F. Smeaton
MediaEval Workshop 2022. [Paper]
19 Dec 2022

Diff-Font: Diffusion Model for Robust One-Shot Font Generation
Haibin He, Xinyuan Chen, Chaoyue Wang, Juhua Liu, Bo Du, Dacheng Tao, Yu Qiao
arXiv 2022. [Paper]
12 Dec 2022

How to Backdoor Diffusion Models?
Sheng-Yen Chou, Pin-Yu Chen, Tsung-Yi Ho
CVPR 2023. [Paper]
11 Dec 2022

Diffusion Art or Digital Forgery? Investigating Data Replication in Diffusion Models
Gowthami Somepalli, Vasu Singla, Micah Goldblum, Jonas Geiping, Tom Goldstein
CVPR 2023. [Paper] [Github]
7 Dec 2022

ObjectStitch: Generative Object Compositing
Yizhi Song, Zhifei Zhang, Zhe Lin, Scott Cohen, Brian Price, Jianming Zhang, Soo Ye Kim, Daniel Aliaga
arXiv 2022. [Paper]
2 Dec 2022

Post-training Quantization on Diffusion Models
Yuzhang Shang, Zhihang Yuan, Bin Xie, Bingzhe Wu, Yan Yan
arXiv 2022. [Paper] [Github]
28 Nov 2022

Diffusion Probabilistic Model Made Slim
Xingyi Yang, Daquan Zhou, Jiashi Feng, Xinchao Wang
CVPR 2023. [Paper]
27 Nov 2022

BeLFusion: Latent Diffusion for Behavior-Driven Human Motion Prediction
German Barquero, Sergio Escalera, Cristina Palmero
arXiv 2022. [Paper] [Project] [Github]
25 Nov 2022

JigsawPlan: Room Layout Jigsaw Puzzle Extreme Structure from Motion using Diffusion Models
Sepidehsadat Hosseini, Mohammad Amin Shabani, Saghar Irandoust, Yasutaka Furukawa
arXiv 2022. [Paper] [Project]
24 Nov 2022

HouseDiffusion: Vector Floorplan Generation via a Diffusion Model with Discrete and Continuous Denoising
Mohammad Amin Shabani, Sepidehsadat Hosseini, Yasutaka Furukawa
arXiv 2022. [Paper] [Project]
23 Nov 2022

Can denoising diffusion probabilistic models generate realistic astrophysical fields?
Nayantara Mudur, Douglas P. Finkbeiner
NeurIPS Workshop 2022. [Paper]
22 Nov 2022

DiffDreamer: Consistent Single-view Perpetual View Generation with Conditional Diffusion Models
Shengqu Cai, Eric Ryan Chan, Songyou Peng, Mohamad Shahbazi, Anton Obukhov, Luc Van Gool, Gordon Wetzstein
arXiv 2022. [Paper] [Project]
22 Nov 2022

CaDM: Codec-aware Diffusion Modeling for Neural-enhanced Video Streaming
Qihua Zhou, Ruibin Li, Song Guo, Yi Liu, Jingcai Guo, Zhenda Xu
arXiv 2022. [Paper]
15 Nov 2022

Extreme Generative Image Compression by Learning Text Embedding from Diffusion Models
Zhihong Pan, Xin Zhou, Hao Tian
arXiv 2022. [Paper]
14 Nov 2022

Evaluating a Synthetic Image Dataset Generated with Stable Diffusion
Andreas Stöckl
arXiv 2022. [Paper]
3 Nov 2022

On the detection of synthetic images generated by diffusion models
Riccardo Corvi, Davide Cozzolino, Giada Zingarini, Giovanni Poggi, Koki Nagano, Luisa Verdoliva
arXiv 2022. [Paper] [Github]
1 Nov 2022

DOLPH: Diffusion Models for Phase Retrieval
Shirin Shoushtari, Jiaming Liu, Ulugbek S. Kamilov
arXiv 2022. [Paper]
1 Nov 2022

Towards the Detection of Diffusion Model Deepfakes
Jonas Ricker, Simon Damm, Thorsten Holz, Asja Fischer
arXiv 2022. [Paper]
26 Oct 2022

Deep Data Augmentation for Weed Recognition Enhancement: A Diffusion Probabilistic Model and Transfer Learning Based Approach
Dong Chen, Xinda Qi, Yu Zheng, Yuzhen Lu, Zhaojian Li
arXiv 2022. [Paper] [Github]
18 Oct 2022

DE-FAKE: Detection and Attribution of Fake Images Generated by Text-to-Image Diffusion Models
Zeyang Sha, Zheng Li, Ning Yu, Yang Zhang
arXiv 2022. [Paper]
13 Oct 2022

Markup-to-Image Diffusion Models with Scheduled Sampling
Yuntian Deng, Noriyuki Kojima, Alexander M. Rush
ICLR 2023. [Paper]
11 Oct 2022

What the DAAM: Interpreting Stable Diffusion Using Cross Attention
Raphael Tang, Akshat Pandey, Zhiying Jiang, Gefei Yang, Karun Kumar, Jimmy Lin, Ferhan Ture
arXiv 2022. [Paper] [Github]
10 Oct 2022

CLIP-Diffusion-LM: Apply Diffusion Model on Image Captioning
Shitong Xu
arXiv 2022. [Paper] [Github]
10 Oct 2022

Diffusion Models Beat GANs on Topology Optimization
François Mazé, Faez Ahmed
AAAI 2022. [Paper] [Project] [Github]
20 Aug 2022

Vector Quantized Diffusion Model with CodeUnet for Text-to-Sign Pose Sequences Generation
Pan Xie, Qipeng Zhang, Zexian Li, Hao Tang, Yao Du, Xiaohui Hu
arXiv 2022. [Paper]
19 Aug 2022

Deep Diffusion Models for Seismic Processing
Ricard Durall, Ammar Ghanim, Mario Fernandez, Norman Ettrich, Janis Keuper
arXiv 2022. [Paper]
21 Jul 2022

Stochastic Trajectory Prediction via Motion Indeterminacy Diffusion
Tianpei Gu, Guangyi Chen, Junlong Li, Chunze Lin, Yongming Rao, Jie Zhou, Jiwen Lu
CVPR 2022. [Paper] [Github]
25 Mar 2022

Audio

Generation

JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation
Yao Yao, Peike Li, Boyu Chen, Alex Wang
arXiv 2023. [Paper]
29 Oct 2023

Energy-Based Models For Speech Synthesis
Wanli Sun, Zehai Tu, Anton Ragni
arXiv 2023. [Paper]
19 Oct 2023

Generation or Replication: Auscultating Audio Latent Diffusion Models
Dimitrios Bralios, Gordon Wichern, François G. Germain, Zexu Pan, Sameer Khurana, Chiori Hori, Jonathan Le Roux
arXiv 2023. [Paper]
16 Oct 2023

U-Style: Cascading U-nets with Multi-level Speaker and Style Modeling for Zero-Shot Voice Cloning
Tao Li, Zhichao Wang, Xinfa Zhu, Jian Cong, Qiao Tian, Yuping Wang, Lei Xie
arXiv 2023. [Paper]
6 Oct 2023

DiffAR: Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation
Roi Benita, Michael Elad, Joseph Keshet
arXiv 2023. [Paper]
2 Oct 2023

High-Fidelity Speech Synthesis with Minimal Supervision: All Using Diffusion Models
Chunyu Qiang, Hao Li, Yixin Tian, Yi Zhao, Ying Zhang, Longbiao Wang, Jianwu Dang
arXiv 2023. [Paper]
27 Sep 2023

Invisible Watermarking for Audio Generation Diffusion Models
Xirong Cao, Xiang Li, Divyesh Jadav, Yanzhao Wu, Zhehui Chen, Chen Zeng, Wenqi Wei
arXiv 2023. [Paper]
22 Sep 2023

Performance Conditioning for Diffusion-Based Multi-Instrument Music Synthesis
Ben Maman, Johannes Zeitler, Meinard Müller, Amit H. Bermano
arXiv 2023. [Paper]
21 Sep 2023

Speeding Up Speech Synthesis In Diffusion Models By Reducing Data Distribution Recovery Steps Via Content Transfer
Peter Ochieng
arXiv 2023. [Paper]
18 Sep 2023

Audio Generation with Multiple Conditional Diffusion Model
Zhifang Guo, Jianguo Mao, Rui Tao, Long Yan, Kazushige Ouchi, Hong Liu, Xiangdong Wang
AAAI 2024. [Paper] [Project]
23 Aug 2023

AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining
Haohe Liu, Qiao Tian, Yi Yuan, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Yuping Wang, Wenwu Wang, Yuxuan Wang, Mark D. Plumbley
arXiv 2023. [Paper]
10 Aug 2023

DiffProsody: Diffusion-based Latent Prosody Generation for Expressive Speech Synthesis with Prosody Conditional Adversarial Training
Hyung-Seok Oh, Sang-Hoon Lee, Seong-Whan Lee
arXiv 2023. [Paper]
31 Jul 2023

Progressive distillation diffusion for raw music generation
Svetlana Pavlova
arXiv 2023. [Paper]
20 Jul 2023

UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data
Heeseung Kim, Sungwon Kim, Jiheum Yeom, Sungroh Yoon
arXiv 2023. [Paper]
28 Jun 2023

Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis
Shivam Mehta, Siyang Wang, Simon Alexanderson, Jonas Beskow, Éva Székely, Gustav Eje Henter
arXiv 2023. [Paper]
15 Jun 2023

HiddenSinger: High-Quality Singing Voice Synthesis via Neural Audio Codec and Latent Diffusion Models
Ji-Sang Hwang, Sang-Hoon Lee, Seong-Whan Lee
arXiv 2023. [Paper]
12 Jun 2023

Boosting Fast and High-Quality Speech Synthesis with Linear Diffusion
Haogeng Liu, Tao Wang, Jie Cao, Ran He, Jianhua Tao
arXiv 2023. [Paper]
9 Jun 2023

EmoMix: Emotion Mixing via Diffusion Models for Emotional Speech Synthesis
Haobin Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao
InterSpeech 2023. [Paper]
1 Jun 2023

Efficient Neural Music Generation
Max W. Y. Lam, Qiao Tian, Tang Li, Zongyu Yin, Siyuan Feng, Ming Tu, Yuliang Ji, Rui Xia, Mingbo Ma, Xuchen Song, Jitong Chen, Yuping Wang, Yuxuan Wang
arXiv 2023. [Paper] [Github]
25 May 2023

Generating symbolic music using diffusion models
Lilac Atassi
arXiv 2023. [Paper]
15 Mar 2023

DiffuseRoll: Multi-track multi-category music generation based on diffusion model
Hongfei Wang
arXiv 2023. [Paper]
14 Mar 2023

Multi-Source Diffusion Models for Simultaneous Music Generation and Separation
Giorgio Mariani, Irene Tallini, Emilian Postolache, Michele Mancusi, Luca Cosmo, Emanuele Rodolà
arXiv 2023. [Paper] [Project]
4 Feb 2023

MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
Ludan Ruan, Yiyang Ma, Huan Yang, Huiguo He, Bei Liu, Jianlong Fu, Nicholas Jing Yuan, Qin Jin, Baining Guo
CVPR 2023. [Paper] [Github]
19 Dec 2022

SDMuse: Stochastic Differential Music Editing and Generation via Hybrid Representation
Chen Zhang, Yi Ren, Kejun Zhang, Shuicheng Yan
arXiv 2022. [Paper] [Project]
1 Nov 2022

Full-band General Audio Synthesis with Score-based Diffusion
Santiago Pascual, Gautam Bhattacharya, Chunghsin Yeh, Jordi Pons, Joan Serrà
arXiv 2022. [Paper]
26 Oct 2022

Hierarchical Diffusion Models for Singing Voice Neural Vocoder
Naoya Takahashi, Mayank Kumar, Singh, Yuki Mitsufuji
arXiv 2022. [Paper]
14 Oct 2022

Mandarin Singing Voice Synthesis with Denoising Diffusion Probabilistic Wasserstein GAN
Yin-Ping Cho, Yu Tsao, Hsin-Min Wang, Yi-Wen Liu
arXiv 2022. [Paper] [Project]
21 Sep 2022

DDSP-based Singing Vocoders: A New Subtractive-based Synthesizer and A Comprehensive Evaluation
Da-Yi Wu, Wen-Yi Hsiao, Fu-Rong Yang, Oscar Friedman, Warren Jackson, Scott Bruzenak, Yi-Wen Liu, Yi-Hsuan Yang
ISMIR 2022. [Paper] [Github]
9 Aug 2022

ProDiff: Progressive Fast Diffusion Model For High-Quality Text-to-Speech
Rongjie Huang, Zhou Zhao, Huadai Liu, Jinglin Liu, Chenye Cui, Yi Ren
ACM Multimedia 2022. [Paper] [Project]
13 Jul 2022

CARD: Classification and Regression Diffusion Models
Xizewen Han, Huangjie Zheng, Mingyuan Zhou
NeurIPS 2022. [Paper] [Github]
15 Jun 2022

Adversarial Audio Synthesis with Complex-valued Polynomial Networks
Yongtao Wu, Grigorios G Chrysos, Volkan Cevher
ICML workshop 2022. [Paper]
14 Jun 2022

Multi-instrument Music Synthesis with Spectrogram Diffusion
Curtis Hawthorne, Ian Simon, Adam Roberts, Neil Zeghidour, Josh Gardner, Ethan Manilow, Jesse Engel
ISMIR 2022. [Paper]
11 Jun 2022

BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for Binaural Audio Synthesis
Yichong Leng, Zehua Chen, Junliang Guo, Haohe Liu, Jiawei Chen, Xu Tan, Danilo Mandic, Lei He, Xiang-Yang Li, Tao Qin, Sheng Zhao, Tie-Yan Liu
NeurIPS 2022. [Paper] [Github]
30 May 2022

FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis
Rongjie Huang, Max W. Y. Lam, Jun Wang, Dan Su, Dong Yu, Yi Ren, Zhou Zhao
IJCAI 2022. [Paper] [Project] [Github]
21 Apr 2022

SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral Shaping
Yuma Koizumi, Heiga Zen, Kohei Yatabe, Nanxin Chen, Michiel Bacchiani
Interspeech 2022. [Paper]
31 Mar 2022

BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis
Max W. Y. Lam, Jun Wang, Dan Su, Dong Yu
ICLR 2022. [Paper] [Github]
25 Mar 2022

ItôWave: Itô Stochastic Differential Equation Is All You Need For Wave Generation
Shoule Wu, Ziqiang Shi
CoRR 2022. [Paper] [Project]
29 Jan 2022

Itô-Taylor Sampling Scheme for Denoising Diffusion Probabilistic Models using Ideal Derivatives
Hideyuki Tachibana, Mocho Go, Muneyoshi Inahara, Yotaro Katayama, Yotaro Watanabe
arXiv 2021. [Paper]
26 Dec 2021

Denoising Diffusion Gamma Models
Eliya Nachmani, Robin San Roman, Lior Wolf
arXiv 2021. [Paper]
10 Oct 2021

Variational Diffusion Models
Diederik P. Kingma, Tim Salimans, Ben Poole, Jonathan Ho
NeurIPS 2021. [Paper] [Github]
1 Jul 2021

CRASH: Raw Audio Score-based Generative Modeling for Controllable High-resolution Drum Sound Synthesis
Simon Rouard, Gaëtan Hadjeres
ISMIR 2021. [Paper] [Project]
14 Jun 2021

PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Driven Adaptive Prior
Sang-gil Lee, Heeseung Kim, Chaehun Shin, Xu Tan, Chang Liu, Qi Meng, Tao Qin, Wei Chen, Sungroh Yoon, Tie-Yan Liu
ICLR 2022. [Paper] [Project]
11 Jun 2021

ItôTTS and ItôWave: Linear Stochastic Differential Equation Is All You Need For Audio Generation
Shoule Wu, Ziqiang Shi
arXiv 2022. [Paper] [Project]
17 May 2021

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
Jinglin Liu, Chengxi Li, Yi Ren, Feiyang Chen, Peng Liu, Zhou Zhao
AAAI 2022. [Paper] [Project] [Github]
6 May 2021

Symbolic Music Generation with Diffusion Models
Gautam Mittal, Jesse Engel, Curtis Hawthorne, Ian Simon
ISMIR 2021. [Paper] [Github]
30 Mar 2021

DiffWave: A Versatile Diffusion Model for Audio Synthesis
Zhifeng Kong, Wei Ping, Jiaji Huang, Kexin Zhao, Bryan Catanzaro
ICLR 2021. [Paper] [Github]
21 Sep 2020

WaveGrad: Estimating Gradients for Waveform Generation
Nanxin Chen, Yu Zhang, Heiga Zen, Ron J. Weiss, Mohammad Norouzi, William Chan
ICLR 2021. [Paper] [Project] [Github]
2 Sep 2020

Conversion

DiffS2UT: A Semantic Preserving Diffusion Model for Textless Direct Speech-to-Speech Translation
Yongxin Zhu, Zhujin Gao, Xinyuan Zhou, Zhongyi Ye, Linli Xu
arXiv 2023. [Paper]
26 Oct 2023

Leveraging Content-based Features from Multiple Acoustic Models for Singing Voice Conversion
Xueyao Zhang, Yicheng Gu, Haopeng Chen, Zihao Fang, Lexiao Zou, Liumeng Xue, Zhizheng Wu
arXiv 2023. [Paper]
17 Oct 2023

PromptVC: Flexible Stylistic Voice Conversion in Latent Space Driven by Natural Language Prompts
Jixun Yao, Yuguang Yang, Yi Lei, Ziqian Ning, Yanni Hu, Yu Pan, Jingjing Yin, Hongbin Zhou, Heng Lu, Lei Xie
arXiv 2023. [Paper]
17 Sep 2023

EMOCONV-DIFF: Diffusion-based Speech Emotion Conversion for Non-parallel and In-the-wild Data
Navin Raj Prabhu, Bunlong Lay, Simon Welker, Nale Lehmann-Willenbrock, Timo Gerkmann
arXiv 2023. [Paper]
14 Sep 2023

Highly Controllable Diffusion-based Any-to-Any Voice Conversion Model with Frame-level Prosody Feature
Kyungguen Byun, Sunkuk Moon, Erik Visser
arXiv 2023. [Paper]
6 Sep 2023

Stylebook: Content-Dependent Speaking Style Modeling for Any-to-Any Voice Conversion using Only Speech Data
Hyungseob Lim, Kyungguen Byun, Sunkuk Moon, Erik Visser
arXiv 2023. [Paper]
6 Sep 2023

Voice Conversion with Denoising Diffusion Probabilistic GAN Models
Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao
ADMA 2023. [Paper]
28 Aug 2023

DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for Verified Robust Voice Conversion
Ha-Yeong Choi, Sang-Hoon Lee, Seong-Whan Lee
arXiv 2023. [Paper] [Project]
25 May 2023

Duplex Diffusion Models Improve Speech-to-Speech Translation
Xianchao Wu
ACL 2023. [Paper]
22 May 2023

DiffSVC: A Diffusion Probabilistic Model for Singing Voice Conversion
Songxiang Liu, Yuewen Cao, Dan Su, Helen Meng
IEEE 2021. [Paper] [Github]
28 May 2021

Diffusion-Based Voice Conversion with Fast Maximum Likelihood Sampling Scheme
Vadim Popov, Ivan Vovk, Vladimir Gogoryan, Tasnima Sadekova, Mikhail Kudinov, Jiansheng Wei
ICLR 2022. [Paper] [Project]
28 Sep 2021

Enhancement

uSee: Unified Speech Enhancement and Editing with Conditional Diffusion Models
Muqiao Yang, Chunlei Zhang, Yong Xu, Zhongweiyang Xu, Heming Wang, Bhiksha Raj, Dong Yu
arXiv 2023. [Paper]
2 Oct 2023

Diffusion-based speech enhancement with a weighted generative-supervised learning loss
Jean-Eudes Ayilo, Mostafa Sadeghi, Romain Serizel
arXiv 2023. [Paper]
19 Sep 2023

Unsupervised speech enhancement with diffusion-based generative models
Berné Nortier, Mostafa Sadeghi, Romain Serizel
arXiv 2023. [Paper] [Github]
19 Sep 2023

Single and Few-step Diffusion for Generative Speech Enhancement
Bunlong Lay, Jean-Marie Lemercier, Julius Richter, Timo Gerkmann
arXiv 2023. [Paper]
18 Sep 2023

AudioSR: Versatile Audio Super-resolution at Scale
Haohe Liu, Ke Chen, Qiao Tian, Wenwu Wang, Mark D. Plumbley
arXiv 2023. [Paper] [Github]
13 Sep 2023

VRDMG: Vocal Restoration via Diffusion Posterior Sampling with Multiple Guidance
Carlos Hernandez-Olivan, Koichi Saito, Naoki Murata, Chieh-Hsin Lai, Marco A. Martínez-Ramirez, Wei-Hsiang Liao, Yuki Mitsufuji
arXiv 2023. [Paper] [Project]
13 Sep 2023

NADiffuSE: Noise-aware Diffusion-based Model for Speech Enhancement
Wen Wang, Dongchao Yang, Qichen Ye, Bowen Cao, Yuexian Zou
arXiv 2023. [Paper] [Project]
3 Sep 2023

Noise-aware Speech Enhancement using Diffusion Probabilistic Model
Yuchen Hu, Chen Chen, Ruizhe Li, Qiushi Zhu, Eng Siong Chng
arXiv 2023. [Paper]
16 Jul 2023

Self-supervised learning with diffusion-based multichannel speech enhancement for speaker verification under noisy conditions
Sandipana Dowerah, Ajinkya Kulkarni, Romain Serizel, Denis Jouvet
arXiv 2023. [Paper]
5 Jul 2023

Diffusion Posterior Sampling for Informed Single-Channel Dereverberation
Jean-Marie Lemercier, Simon Welker, Timo Gerkmann
arXiv 2023. [Paper]
21 Jun 2023

Variance-Preserving-Based Interpolation Diffusion Models for Speech Enhancement
Zilu Guo, Jun Du, Chin-Hui Lee, Yu Gao, Wenbin Zhang
arXiv 2023. [Paper]
14 Jun 2023

UnDiff: Unsupervised Voice Restoration with Unconditional Diffusion Model
Anastasiia Iashchenko, Pavel Andreev, Ivan Shchekotov, Nicholas Babaev, Dmitry Vetrov
Interspeech 2023. [Paper]
1 Jun 2023

SE-Bridge: Speech Enhancement with Consistent Brownian Bridge
Zhibin Qiu, Mengfan Fu, Fuchun Sun, Gulila Altenbek, Hao Huang
arXiv 2023. [Paper]
23 May 2023

Diffusion-Based Speech Enhancement with Joint Generative and Predictive Decoders
Hao Shi, Kazuki Shimada, Masato Hirano, Takashi Shibuya, Yuichiro Koyama, Zhi Zhong, Shusuke Takahashi, Tatsuya Kawahara, Yuki Mitsufuji
arXiv 2023. [Paper]
18 May 2023

Speech Signal Improvement Using Causal Generative Diffusion Models
Julius Richter, Simon Welker, Jean-Marie Lemercier, Bunlong Lay, Tal Peer, Timo Gerkmann
ICASSP 2023. [Paper]
15 Mar 2023

Reducing the Prior Mismatch of Stochastic Differential Equations for Diffusion-based Speech Enhancement
Bunlong Lay, Simon Welker, Julius Richter, Timo Gerkmann
arXiv 2023. [Paper]
28 Feb 2023

Metric-oriented Speech Enhancement using Diffusion Probabilistic Model
Chen Chen, Yuchen Hu, Weiwei Weng, Eng Siong Chng
arXiv 2023. [Paper]
23 Feb 2023

StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation
Jean-Marie Lemercier, Julius Richter, Simon Welker, Timo Gerkmann
ICASSP 2023. [Paper]
22 Dec 2022

Unsupervised vocal dereverberation with diffusion-based generative models
Koichi Saito, Naoki Murata, Toshimitsu Uesaka, Chieh-Hsin Lai, Yuhta Takida, Takao Fukui, Yuki Mitsufuji
ICASSP 2023. [Paper]
8 Nov 2022

DiffPhase: Generative Diffusion-based STFT Phase Retrieval
Tal Peer, Simon Welker, Timo Gerkmann
ICASSP 2023. [Paper]
8 Nov 2022

Cold Diffusion for Speech Enhancement
Hao Yen, François G. Germain, Gordon Wichern, Jonathan Le Roux
ICASSP 2023. [Paper]
4 Nov 2022

Analysing Diffusion-based Generative Approaches versus Discriminative Approaches for Speech Restoration
Jean-Marie Lemercier, Julius Richter, Simon Welker, Timo Gerkmann
Interspeech 2022. [Paper] [Project] [Github]
4 Nov 2022

SRTNet: Time Domain Speech Enhancement Via Stochastic Refinement
Zhibin Qiu, Mengfan Fu, Yinfeng Yu, LiLi Yin, Fuchun Sun, Hao Huang
ICASSP 2022. [Paper] [Github]
30 Oct 2022

A Versatile Diffusion-based Generative Refiner for Speech Enhancement
Ryosuke Sawata, Naoki Murata, Yuhta Takida, Toshimitsu Uesaka, Takashi Shibuya, Shusuke Takahashi, Yuki Mitsufuji
ICASSP 2023. [Paper]
27 Oct 2022

Conditioning and Sampling in Variational Diffusion Models for Speech Super-resolution
Chin-Yun Yu, Sung-Lin Yeh, György Fazekas, Hao Tang
ICASSP 2023. [Paper] [Project] [Github]
27 Oct 2022

Solving Audio Inverse Problems with a Diffusion Model
Eloi Moliner, Jaakko Lehtinen, Vesa Välimäki
ICASSP 2023. [Paper]
27 Oct 2022

Speech Enhancement and Dereverberation with Diffusion-based Generative Models
Julius Richter, Simon Welker, Jean-Marie Lemercier, Bunlong Lay, Timo Gerkmann
arXiv 2022. [Paper] [Project] [Github]
11 Aug 2022

NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates
Seungu Han, Junhyeok Lee
Interspeech 2022. [Paper] [Project]
17 Jun 2022

Universal Speech Enhancement with Score-based Diffusion
Joan Serrà, Santiago Pascual, Jordi Pons, R. Oguz Araz, Davide Scaini
arXiv 2022. [Paper]
7 Jun 2022

Speech Enhancement with Score-Based Generative Models in the Complex STFT Domain
Simon Welker, Julius Richter, Timo Gerkmann
InterSpeech 2022. [Paper] [Github]
31 Mar 2022

Conditional Diffusion Probabilistic Model for Speech Enhancement
Yen-Ju Lu, Zhong-Qiu Wang, Shinji Watanabe, Alexander Richard, Cheng Yu, Yu Tsao
IEEE 2022. [Paper] [Github]
10 Feb 2022

A Study on Speech Enhancement Based on Diffusion Probabilistic Model
Yen-Ju Lu, Yu Tsao, Shinji Watanabe
APSIPA 2021. [Paper]
25 Jul 2021

Restoring degraded speech via a modified diffusion model
Jianwei Zhang, Suren Jayasuriya, Visar Berisha
Interspeech 2021. [Paper]
22 Apr 2021

NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling
Junhyeok Lee, Seungu Han
Interspeech 2021. [Paper] [Project] [Github]
6 Apr 2021

Separation

VoiceExtender: Short-utterance Text-independent Speaker Verification with Guided Diffusion Model
Yayun He, Zuheng Kang, Jianzong Wang, Junqing Peng, Jing Xiao
arXiv 2023. [Paper]
7 Oct 2023

DPM-TSE: A Diffusion Probabilistic Model for Target Sound Extraction
Jiarui Hai, Helin Wang, Dongchao Yang, Karan Thakkar, Najim Dehak, Mounya Elhilali
arXiv 2023. [Paper]
6 Oct 2023

Diffusion Conditional Expectation Model for Efficient and Robust Target Speech Extraction
Leying Zhang, Yao Qian, Linfeng Yu, Heming Wang, Xinkai Wang, Hemin Yang, Long Zhou, Shujie Liu, Yanmin Qian, Michael Zeng
arXiv 2023. [Paper]
25 Sep 2023

Diffusion-based Signal Refiner for Speech Separation
Masato Hirano, Kazuki Shimada, Yuichiro Koyama, Shusuke Takahashi, Yuki Mitsufuji
arXiv 2023. [Paper]
10 May 2023

Multi-Source Diffusion Models for Simultaneous Music Generation and Separation
Giorgio Mariani, Irene Tallini, Emilian Postolache, Michele Mancusi, Luca Cosmo, Emanuele Rodolà
arXiv 2023. [Paper] [Project]
4 Feb 2023

Separate And Diffuse: Using a Pretrained Diffusion Model for Improving Source Separation
Shahar Lutati, Eliya Nachmani, Lior Wolf
arXiv 2023. [Paper]
25 Jan 2023

Diffusion-based Generative Speech Source Separation
Robin Scheibler, Youna Ji, Soo-Whan Chung, Jaeuk Byun, Soyeon Choe, Min-Seok Choi
ICASSP 2023. [Paper]
31 Oct 2022

Instrument Separation of Symbolic Music by Explicitly Guided Diffusion Model
Sangjun Han, Hyeongrae Ihm, DaeHan Ahn, Woohyung Lim
NeurIPS Workshop 2022. [Paper]
5 Sep 2022

Text-to-Speech

Style Description based Text-to-Speech with Conditional Prosodic Layer Normalization based Diffusion GAN
Neeraj Kumar, Ankur Narang, Brejesh Lall
arXiv 2023. [Paper]
27 Oct 2023

VoiceLDM: Text-to-Speech with Environmental Context
Yeonghyeon Lee, Inmo Yeon, Juhan Nam, Joon Son Chung
arXiv 2023. [Paper] [Project] [Github]
24 Sep 2023

DurIAN-E: Duration Informed Attention Network For Expressive Text-to-Speech Synthesis
Yu Gu, Yianrao Bian, Guangzhi Lei, Chao Weng, Dan Su
arXiv 2023. [Paper]
22 Sep 2023

DiCLET-TTS: Diffusion Model based Cross-lingual Emotion Transfer for Text-to-Speech – A Study between English and Mandarin
Tao Li, Chenxu Hu, Jian Cong, Xinfa Zhu, Jingbei Li, Qiao Tian, Yuping Wang, Lei Xie
TASLP 2023. [Paper]
2 Sep 2023

LightGrad: Lightweight Diffusion Probabilistic Model for Text-to-Speech
Jie Chen, Xingchen Song, Zhendong Peng, Binbin Zhang, Fuping Pan, Zhiyong Wu
ICASSP 2023. [Paper]
31 Aug 2023

Multi-GradSpeech: Towards Diffusion-based Multi-Speaker Text-to-speech Using Consistent Diffusion Models
Heyang Xue, Shuai Guo, Pengcheng Zhu, Mengxiao Bi
arXiv 2023. [Paper] [Project]
21 Aug 2023

JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models
Peike Li, Boyu Chen, Yao Yao, Yikai Wang, Allen Wang, Alex Wang
arXiv 2023. [Paper] [Project]
9 Aug 2023

MusicLDM: Enhancing Novelty in Text-to-Music Generation Using Beat-Synchronous Mixup Strategies
Ke Chen, Yusong Wu, Haohe Liu, Marianna Nezhurina, Taylor Berg-Kirkpatrick, Shlomo Dubnov
arXiv 2023. [Paper] [Project]
3 Aug 2023

Adversarial Training of Denoising Diffusion Model Using Dual Discriminators for High-Fidelity Multi-Speaker TTS
Myeongjin Ko, Yong-Hoon Choi
arXiv 2023. [Paper]
3 Aug 2023

Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech
Guangyan Zhang, Thomas Merritt, Manuel Sam Ribeiro, Biel Tura-Vecino, Kayoko Yanagisawa, Kamil Pokora, Abdelhamid Ezzerg, Sebastian Cygert, Ammar Abbas, Piotr Bilinski, Roberto Barra-Chicote, Daniel Korzekwa, Jaime Lorenzo-Trueba
arXiv 2023. [Paper]
31 Jul 2023

Minimally-Supervised Speech Synthesis with Conditional Diffusion Model and Language Model: A Comparative Study of Semantic Coding
Chunyu Qiang, Hao Li, Hao Ni, He Qu, Ruibo Fu, Tao Wang, Longbiao Wang, Jianwu Dang
arXiv 2023. [Paper]
28 Jul 2023

Text-Driven Foley Sound Generation With Latent Diffusion Model
Yi Yuan, Haohe Liu, Xubo Liu, Xiyuan Kang, Peipei Wu, Mark D. Plumbley, Wenwu Wang
arXiv 2023. [Paper]
17 Jun 2023

CLIPSonic: Text-to-Audio Synthesis with Unlabeled Videos and Pretrained Language-Vision Models
Hao-Wen Dong, Xiaoyu Liu, Jordi Pons, Gautam Bhattacharya, Santiago Pascual, Joan Serrà, Taylor Berg-Kirkpatrick, Julian McAuley
arXiv 2023. [Paper]
16 Jun 2023

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Yinghao Aaron Li, Cong Han, Vinay S. Raghavan, Gavin Mischler, Nima Mesgarani
arXiv 2023. [Paper]
13 Jun 2023

UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding
Chenpeng Du, Yiwei Guo, Feiyu Shen, Zhijun Liu, Zheng Liang, Xie Chen, Shuai Wang, Hui Zhang, Kai Yu
arXiv 2023. [Paper]
13 Jun 2023

Interpretable Style Transfer for Text-to-Speech with ControlVAE and Diffusion Bridge
Wenhao Guan, Tao Li, Yishuang Li, Hukai Huang, Qingyang Hong, Lin Li
Interspeech 2023. [Paper]
7 Jun 2023

Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias
Ziyue Jiang, Yi Ren, Zhenhui Ye, Jinglin Liu, Chen Zhang, Qian Yang, Shengpeng Ji, Rongjie Huang, Chunfeng Wang, Xiang Yin, Zejun Ma, Zhou Zhao
arXiv 2023. [Paper] [Github]
6 Jun 2023

Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation
Jiawei Huang, Yi Ren, Rongjie Huang, Dongchao Yang, Zhenhui Ye, Chen Zhang, Jinglin Liu, Xiang Yin, Zejun Ma, Zhou Zhao
arXiv 2023. [Paper]
29 May 2023

ZET-Speech: Zero-shot adaptive Emotion-controllable Text-to-Speech Synthesis with Diffusion and Style-based Models
Minki Kang, Wooseok Han, Sung Ju Hwang, Eunho Yang
arXiv 2023. [Paper]
23 May 2023

U-DiT TTS: U-Diffusion Vision Transformer for Text-to-Speech
Xin Jing, Yi Chang, Zijiang Yang, Jiangjian Xie, Andreas Triantafyllopoulos, Bjoern W. Schuller
arXiv 2023. [Paper] [Project]
22 May 2023

DiffAVA: Personalized Text-to-Audio Generation with Visual Alignment
Shentong Mo, Jing Shi, Yapeng Tian
arXiv 2023. [Paper]
22 May 2023

ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer
Huadai Liu, Rongjie Huang, Xuan Lin, Wenqiang Xu, Maozong Zheng, Hong Chen, Jinzheng He, Zhou Zhao
arXiv 2023. [Paper] [Project]
22 May 2023

RMSSinger: Realistic-Music-Score based Singing Voice Synthesis
Jinzheng He, Jinglin Liu, Zhenhui Ye, Rongjie Huang, Chenye Cui, Huadai Liu, Zhou Zhao
ACL 2023. [Paper] [Project]
18 May 2023

CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model
Zhen Ye, Wei Xue, Xu Tan, Jie Chen, Qifeng Liu, Yike Guo
arXiv 2023. [Paper] [Github]
11 May 2023

Text-to-Audio Generation using Instruction-Tuned LLM and Latent Diffusion Model
Deepanway Ghosal, Navonil Majumder, Ambuj Mehrish, Soujanya Poria
arXiv 2023. [Paper] [Project] [Github]
24 Apr 2023

DiffVoice: Text-to-Speech with Latent Diffusion
Zhijun Liu, Yiwei Guo, Kai Yu
ICASSP 2023. [Paper]
23 Apr 2023

An investigation into the adaptability of a diffusion-based TTS model
Haolin Chen, Philip N. Garner
arXiv 2023. [Paper]
3 Mar 2023

Imaginary Voice: Face-styled Diffusion Model for Text-to-Speech
Jiyoung Lee, Joon Son Chung, Soo-Whan Chung
ICASSP 2023. [Paper]
27 Feb 2023

ERNIE-Music: Text-to-Waveform Music Generation with Diffusion Models
Pengfei Zhu, Chao Pang, Shuohuan Wang, Yekun Chai, Yu Sun, Hao Tian, Hua Wu
arXiv 2023. [Paper]
9 Feb 2023

Noise2Music: Text-conditioned Music Generation with Diffusion Models
Qingqing Huang, Daniel S. Park, Tao Wang, Timo I. Denk, Andy Ly, Nanxin Chen, Zhengdong Zhang, Zhishuai Zhang, Jiahui Yu, Christian Frank, Jesse Engel, Quoc V. Le, William Chan, Wei Han
arXiv 2023. [Paper] [Project]
8 Feb 2023

Moûsai: Text-to-Music Generation with Long-Context Latent Diffusion
Flavio Schneider, Zhijing Jin, Bernhard Schölkopf
arXiv 2023. [Paper] [Project] [Github]
27 Jan 2023

InstructTTS: Modelling Expressive TTS in Discrete Latent Space with Natural Language Style Prompt
Dongchao Yang, Songxiang Liu, Rongjie Huang, Guangzhi Lei, Chao Weng, Helen Meng, Dong Yu
arXiv 2023. [Paper] [Project]
31 Jan 2023

Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models
Rongjie Huang, Jiawei Huang, Dongchao Yang, Yi Ren, Luping Liu, Mingze Li, Zhenhui Ye, Jinglin Liu, Xiang Yin, Zhou Zhao
arXiv 2023. [Paper] [Project]
30 Jan 2023

AudioLDM: Text-to-Audio Generation with Latent Diffusion Models
Haohe Liu, Zehua Chen, Yi Yuan, Xinhao Mei, Xubo Liu, Danilo Mandic, Wenwu Wang, Mark D. Plumbley
arXiv 2023. [Paper] [Project] [Github]
29 Jan 2023

ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech
Zehua Chen, Yihan Wu, Yichong Leng, Jiawei Chen, Haohe Liu, Xu Tan, Yang Cui, Ke Wang, Lei He, Sheng Zhao, Jiang Bian, Danilo Mandic
arXiv 2022. [Paper] [Project]
30 Dec 2022

Text-to-speech synthesis based on latent variable conversion using diffusion probabilistic model and variational autoencoder
Yusuke Yasuda, Tomoki Toda
ICASSP 2023. [Paper]
16 Dec 2022

Any-speaker Adaptive Text-To-Speech Synthesis with Diffusion Models
Minki Kang, Dongchan Min, Sung Ju Hwang
ICASSP 2023. [Paper] [Project]
17 Nov 2022

EmoDiff: Intensity Controllable Emotional Text-to-Speech with Soft-Label Guidance
Yiwei Guo, Chenpeng Du, Xie Chen, Kai Yu
ICASSP 2023. [Paper] [Project]
17 Nov 2022

NoreSpeech: Knowledge Distillation based Conditional Diffusion Model for Noise-robust Expressive TTS
Dongchao Yang, Songxiang Liu, Jianwei Yu, Helin Wang, Chao Weng, Yuexian Zou
ICASSP 2023. [Paper]
4 Nov 2022

WaveFit: An Iterative and Non-autoregressive Neural Vocoder based on Fixed-Point Iteration
Yuma Koizumi, Kohei Yatabe, Heiga Zen, Michiel Bacchiani
IEEE SLT 2023. [Paper] [Project]
3 Oct 2022

Diffsound: Discrete Diffusion Model for Text-to-sound Generation
Dongchao Yang, Jianwei Yu, Helin Wang, Wen Wang, Chao Weng, Yuexian Zou, Dong Yu
TASLP 2022. [Paper] [Project]
20 Jul 2022

Zero-Shot Voice Conditioning for Denoising Diffusion TTS Models
Alon Levkovitch, Eliya Nachmani, Lior Wolf
Interspeech 2022. [Paper] [Project]
5 Jun 2022

Guided-TTS 2: A Diffusion Model for High-quality Adaptive Text-to-Speech with Untranscribed Data
Sungwon Kim, Heeseung Kim, Sungroh Yoon
arXiv 2022. [Paper] [Project]
30 May 2022

InferGrad: Improving Diffusion Models for Vocoder by Considering Inference in Training
Zehua Chen, Xu Tan, Ke Wang, Shifeng Pan, Danilo Mandic, Lei He, Sheng Zhao
ICASSP 2022. [Paper]
8 Feb 2022

DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
Songxiang Liu, Dan Su, Dong Yu
arXiv 2022. [Paper] [Github]
28 Jan 2022

Guided-TTS:Text-to-Speech with Untranscribed Speech
Heeseung Kim, Sungwon Kim, Sungroh Yoon
ICML 2021. [Paper]
30 Nov 2021

EdiTTS: Score-based Editing for Controllable Text-to-Speech
Jaesung Tae, Hyeongju Kim, Taesu Kim
Interspeech 2022. [Paper] [Project] [Github]
6 Oct 2021

WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
Nanxin Chen, Yu Zhang, Heiga Zen, Ron J. Weiss, Mohammad Norouzi, Najim Dehak, William Chan
Interspeech 2021. [Paper] [Project] [Github] [Github2]
17 Jun 2021

Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech
Vadim Popov, Ivan Vovk, Vladimir Gogoryan, Tasnima Sadekova, Mikhail Kudinov
ICML 2021. [Paper] [Project] [Github]
13 May 2021

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
Jinglin Liu, Chengxi Li, Yi Ren, Feiyang Chen, Peng Liu, Zhou Zhao
AAAI 2022. [Paper] [Project] [Github]
6 May 2021

Diff-TTS: A Denoising Diffusion Model for Text-to-Speech
Myeonghun Jeong, Hyeongju Kim, Sung Jun Cheon, Byoung Jin Choi, Nam Soo Kim
Interspeech 2021. [Paper]
3 Apr 2021

Miscellany

Diffusion-Based Adversarial Purification for Speaker Verification
Yibo Bai, Xiao-Lei Zhang
arXiv 2023. [Paper]
22 Oct 2023

LD4MRec: Simplifying and Powering Diffusion Model for Multimedia Recommendation
Penghang Yu, Zhiyi Tan, Guanming Lu, Bing-Kun Bao
arXiv 2023. [Paper]
27 Sep 2023

Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Probabilistic Models
Ju-ho Kim, Jungwoo Heo, Hyun-seo Shin, Chan-yeong Lim, Ha-Jin Yu
arXiv 2023. [Paper] [Github]
14 Sep 2023

InstructME: An Instruction Guided Music Edit And Remix Framework with Latent Diffusion Models
Bing Han, Junyu Dai, Xuchen Song, Weituo Hao, Xinyan He, Dong Guo, Jitong Chen, Yuxuan Wang, Yanmin Qian
arXiv 2023. [Paper] [Project]
28 Aug 2023

DiffSED: Sound Event Detection with Denoising Diffusion
Swapnil Bhosale, Sauradip Nag, Diptesh Kanojia, Jiankang Deng, Xiatian Zhu
arXiv 2023. [Paper]
14 Aug 2023

Target Speech Extraction with Conditional Diffusion Model
Naoyuki Kamo, Marc Delcroix, Tomohiro Nakatani
Interspeech 2023. [Paper]
8 Aug 2023

UniBriVL: Robust Universal Representation and Generation of Audio Driven Diffusion Models
Sen Fang, Bowen Gao, Yangjian Wu, Jingwen Cai, Teik Toe Teoh
arXiv 2023. [Paper] [Github]
29 Jul 2023

Zero-Shot Blind Audio Bandwidth Extension
Eloi Moliner, Filip Elvander, Vesa Välimäki
arXiv 2023. [Paper]
2 Jun 2023

Diverse and Expressive Speech Prosody Prediction with Denoising Diffusion Probabilistic Model
Xiang Li, Songxiang Liu, Max W. Y. Lam, Zhiyong Wu, Chao Weng, Helen Meng
Interspeech 2023. [Paper]
26 May 2023

Diffusion-Based Audio Inpainting
Eloi Moliner, Vesa Välimäki
arXiv 2023. [Paper]
24 May 2023

FluentSpeech: Stutter-Oriented Automatic Speech Editing with Context-Aware Diffusion Models
Ziyue Jiang, Qian Yang, Jialong Zuo, Zhenhui Ye, Rongjie Huang, Yi Ren, Zhou Zhao
arXiv 2023. [Paper] [Github]
23 May 2023

A Preliminary Study on Augmenting Speech Emotion Recognition using a Diffusion Model
Ibrahim Malik, Siddique Latif, Raja Jurdak, Björn Schuller
arXiv 2023. [Paper]
19 May 2023

AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models
Yuancheng Wang, Zeqian Ju, Xu Tan, Lei He, Zhizheng Wu, Jiang Bian, Sheng Zhao
arXiv 2023. [Paper] [Project]
3 Apr 2023

Data Augmentation for Environmental Sound Classification Using Diffusion Probabilistic Model with Top-k Selection Discriminator
Yunhao Chen, Yunjie Zhu, Zihui Yan, Jianlu Shen, Zhen Ren, Yifan Huang
arXiv 2023. [Paper] [Github]
27 Mar 2023

Enhancing Unsupervised Speech Recognition with Diffusion GANs
Xianchao Wu
ICASSP 2023. [Paper]
23 Mar 2023

Defending against Adversarial Audio via Diffusion Model
Shutong Wu, Jiongxiao Wang, Wei Ping, Weili Nie, Chaowei Xiao
ICLR 2023. [Paper] [Github]
2 Mar 2023

TransFusion: Transcribing Speech with Multinomial Diffusion
Matthew Baas, Kevin Eloff, Herman Kamper
SACAIR 2022. [Paper] [Github]
14 Oct 2022

Natural Language

Discrete Diffusion Language Modeling by Estimating the Ratios of the Data Distribution
Aaron Lou, Chenlin Meng, Stefano Ermon
arXiv 2023. [Paper]
25 Oct 2023

ScanDL: A Diffusion Model for Generating Synthetic Scanpaths on Texts
Lena S. Bolliger, David R. Reich, Patrick Haller, Deborah N. Jakobi, Paul Prasse, Lena A. Jäger
arXiv 2023. [Paper] [Github]
24 Oct 2023

DeTiME: Diffusion-Enhanced Topic Modeling using Encoder-decoder based LLM
Weijie Xu, Wenxiang Hu, Fanyou Wu, Srinivasan Sengamedu
arXiv 2023. [Paper]
23 Oct 2023

InfoDiffusion: Information Entropy Aware Diffusion Process for Non-Autoregressive Text Generation
Renzhi Wang, Jing Li, Piji Li
arXiv 2023. [Paper]
18 Oct 2023

DiffuSeq-v2: Bridging Discrete and Continuous Text Spaces for Accelerated Seq2Seq Diffusion Models
Shansan Gong, Mukai Li, Jiangtao Feng, Zhiyong Wu, Lingpeng Kong
arXiv 2023. [Paper] [Github]
9 Oct 2023

ParaGuide: Guided Diffusion Paraphrasers for Plug-and-Play Textual Style Transfer
Zachary Horvitz, Ajay Patel, Chris Callison-Burch, Zhou Yu, Kathleen McKeown
arXiv 2023. [Paper]
29 Aug 2023

Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning
Jiasheng Ye, Zaixiang Zheng, Yu Bao, Lihua Qian, Quanquan Gu
arXiv 2023. [Paper] [Github]
23 Aug 2023

Enhancing Phrase Representation by Information Bottleneck Guided Text Diffusion Process for Keyphrase Extraction
Yuanzhen Luo, Qingyu Zhou, Feng Zhou
arXiv 2023. [Paper]
17 Aug 2023

How Does Diffusion Influence Pretrained Language Models on Out-of-Distribution Data?
Huazheng Wang, Daixuan Cheng, Haifeng Sun, Jingyu Wang, Qi Qi, Jianxin Liao, Jing Wang, Cong Liu
ECAI 2023. [Paper]
26 Jul 2023

XDLM: Cross-lingual Diffusion Language Model for Machine Translation
Linyao Chen, Aosong Feng, Boming Yang, Zihui Li
arXiv 2023. [Paper]
25 Jul 2023

DiffuDetox: A Mixed Diffusion Model for Text Detoxification
Griffin Floto, Mohammad Mahdi Abdollah Pour, Parsa Farinneya, Zhenwei Tang, Ali Pesaranghader, Manasa Bharadwaj, Scott Sanner
arXiv 2023. [Paper]
14 Jun 2023

PoetryDiffusion: Towards Joint Semantic and Metrical Manipulation in Poetry Generation
Zhiyuan Hu, Chumin Liu, Yue Feng, Bryan Hooi
arXiv 2023. [Paper]
14 Jun 2023

PLANNER: Generating Diversified Paragraph via Latent Language Diffusion Model
Yizhe Zhang, Jiatao Gu, Zhuofeng Wu, Shuangfei Zhai, Josh Susskind, Navdeep Jaitly
arXiv 2023. [Paper]
5 Jun 2023

DiffusEmp: A Diffusion Model-Based Framework with Multi-Grained Control for Empathetic Response Generation
Guanqun Bi, Lei Shen, Yanan Cao, Meng Chen, Yuqiang Xie, Zheng Lin, Xiaodong He
arXiv 2023. [Paper]
2 Jun 2023

Perturbation-Assisted Sample Synthesis: A Novel Approach for Uncertainty Quantification
Yifei Liu, Rex Shen, Xiaotong Shen
arXiv 2023. [Paper]
30 May 2023

Likelihood-Based Diffusion Language Models
Ishaan Gulrajani, Tatsunori B. Hashimoto
arXiv 2023. [Paper] [Github]
30 May 2023

Learning to Imagine: Visually-Augmented Natural Language Generation
Tianyi Tang, Yushuo Chen, Yifan Du, Junyi Li, Wayne Xin Zhao, Ji-Rong Wen
ACL 2023. [Paper]
26 May 2023

Decomposing the Enigma: Subgoal-based Demonstration Learning for Formal Theorem Proving
Xueliang Zhao, Wenda Li, Lingpeng Kong
arXiv 2023. [Paper]
25 May 2023

Dior-CVAE: Diffusion Priors in Variational Dialog Generation
Tianyu Yang, Thy Thy Tran, Iryna Gurevych
arXiv 2023. [Paper]
24 May 2023

SSD-2: Scaling and Inference-time Fusion of Diffusion Language Models
Xiaochuang Han, Sachin Kumar, Yulia Tsvetkov, Marjan Ghazvininejad
arXiv 2023. [Paper]
24 May 2023

DiffusionNER: Boundary Diffusion for Named Entity Recognition
Yongliang Shen, Kaitao Song, Xu Tan, Dongsheng Li, Weiming Lu, Yueting Zhuang
ACL 2023. [Paper]
22 May 2023

DiffCap: Exploring Continuous Diffusion on Image Captioning
Yufeng He, Zefan Cai, Xu Gan, Baobao Chang
arXiv 2023. [Paper]
20 May 2023

DiffuSIA: A Spiral Interaction Architecture for Encoder-Decoder Text Diffusion
Chao-Hong Tan, Jia-Chen Gu, Zhen-Hua Ling
arXiv 2023. [Paper]
19 May 2023

Democratized Diffusion Language Model
Nikita Balagansky, Daniil Gavrilov
arXiv 2023. [Paper]
18 May 2023

AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation
Tong Wu, Zhihao Fan, Xiao Liu, Yeyun Gong, Yelong Shen, Jian Jiao, Hai-Tao Zheng, Juntao Li, Zhongyu Wei, Jian Guo, Nan Duan, Weizhu Chen
arXiv 2023. [Paper]
16 May 2023

TESS: Text-to-Text Self-Conditioned Simplex Diffusion
Rabeeh Karimi Mahabadi, Jaesung Tae, Hamish Ivison, James Henderson, Iz Beltagy, Matthew E. Peters, Arman Cohan
arXiv 2023. [Paper]
15 May 2023

Large Language Models Need Holistically Thought in Medical Conversational QA
Yixuan Weng, Bin Li, Fei Xia, Minjun Zhu, Bin Sun, Shizhu He, Kang Liu, Jun Zhao
arXiv 2023. [Paper] [Github]
9 May 2023

Diffusion Theory as a Scalpel: Detecting and Purifying Poisonous Dimensions in Pre-trained Language Models Caused by Backdoor or Bias
Zhiyuan Zhang, Deli Chen, Hao Zhou, Fandong Meng, Jie Zhou, Xu Sun
arXiv 2023. [Paper]
8 May 2023

Can Diffusion Model Achieve Better Performance in Text Generation? Bridging the Gap between Training and Inference!
Zecheng Tang, Pinzheng Wang, Keyan Zhou, Juntao Li, Ziqiang Cao, Min Zhang
ACL 2023. [Paper] [Github]
8 May 2023

Diffusion-NAT: Self-Prompting Discrete Diffusion for Non-Autoregressive Text Generation
Kun Zhou, Yifan Li, Wayne Xin Zhao, Ji-Rong Wen
arXiv 2023. [Paper]
6 May 2023

DiffuSum: Generation Enhanced Extractive Summarization with Diffusion
Haopeng Zhang, Xiao Liu, Jiawei Zhang
ACL 2023. [Paper]
2 May 2023

RenderDiffusion: Text Generation as Image Generation
Junyi Li, Wayne Xin Zhao, Jian-Yun Nie, Ji-Rong Wen
arXiv 2023. [Paper]
25 Apr 2023

TTIDA: Controllable Generative Data Augmentation via Text-to-Text and Text-to-Image Models
Yuwei Yin, Jean Kaddour, Xiang Zhang, Yixin Nie, Zhenguang Liu, Lingpeng Kong, Qi Liu
arXiv 2023. [Paper]
18 Apr 2023

A Cheaper and Better Diffusion Language Model with Soft-Masked Noise
Jiaao Chen, Aston Zhang, Mu Li, Alex Smola, Diyi Yang
arXiv 2023. [Paper] [Github]
10 Apr 2023

DINOISER: Diffused Conditional Sequence Learning by Manipulating Noises
Jiasheng Ye, Zaixiang Zheng, Yu Bao, Lihua Qian, Mingxuan Wang
arXiv 2023. [Paper]
20 Feb 2023

A Reparameterized Discrete Diffusion Model for Text Generation
Lin Zheng, Jianbo Yuan, Lei Yu, Lingpeng Kong
arXiv 2023. [Paper] [Github]
11 Feb 2023

Long Horizon Temperature Scaling
Andy Shih, Dorsa Sadigh, Stefano Ermon
arXiv 2023. [Paper] [Github]
7 Feb 2023

DiffusER: Diffusion via Edit-based Reconstruction
Machel Reid, Vincent Josua Hellendoorn, Graham Neubig
ICLR 2023. [Paper]
2 Feb 2023

GENIE: Large Scale Pre-training for Text Generation with Diffusion Model
Zhenghao Lin, Yeyun Gong, Yelong Shen, Tong Wu, Zhihao Fan, Chen Lin, Weizhu Chen, Nan Duan
arXiv 2022. [Paper]
22 Dec 2022

Diff-Glat: Diffusion Glancing Transformer for Parallel Sequence to Sequence Learning
Lihua Qian, Mingxuan Wang, Yang Liu, Hao Zhou
arXiv 2022. [Paper]
20 Dec 2022

SeqDiffuSeq: Text Diffusion with Encoder-Decoder Transformers
Hongyi Yuan, Zheng Yuan, Chuanqi Tan, Fei Huang, Songfang Huang
arXiv 2022. [Paper]
20 Dec 2022

Latent Diffusion for Language Generation
Justin Lovelace, Varsha Kishore, Chao Wan, Eliot Shekhtman, Kilian Weinberger
arXiv 2022. [Paper]
19 Dec 2022

Difformer: Empowering Diffusion Model on Embedding Space for Text Generation
Zhujin Gao, Junliang Guo, Xu Tan, Yongxin Zhu, Fang Zhang, Jiang Bian, Linli Xu
arXiv 2022. [Paper]
19 Dec 2022

DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models
Zhengfu He, Tianxiang Sun, Kuanning Wang, Xuanjing Huang, Xipeng Qiu
arXiv 2022. [Paper] [Github]
28 Nov 2022

Continuous diffusion for categorical data
Sander Dieleman, Laurent Sartran, Arman Roshannai, Nikolay Savinov, Yaroslav Ganin, Pierre H. Richemond, Arnaud Doucet, Robin Strudel, Chris Dyer, Conor Durkan, Curtis Hawthorne, Rémi Leblond, Will Grathwohl, Jonas Adler
arXiv 2022. [Paper]
28 Nov 2022

Self-conditioned Embedding Diffusion for Text Generation
Robin Strudel, Corentin Tallec, Florent Altché, Yilun Du, Yaroslav Ganin, Arthur Mensch, Will Grathwohl, Nikolay Savinov, Sander Dieleman, Laurent Sifre, Rémi Leblond
arXiv 2022. [Paper]
8 Nov 2022

DiffusER: Discrete Diffusion via Edit-based Reconstruction
Machel Reid, Vincent J. Hellendoorn, Graham Neubig
ICLR 2023. [Paper]
30 Oct 2022

SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control
Xiaochuang Han, Sachin Kumar, Yulia Tsvetkov
arXiv 2022. [Paper] [Github]
31 Oct 2022

DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models
Shansan Gong, Mukai Li, Jiangtao Feng, Zhiyong Wu, LingPeng Kong
ICLR 2023. [Paper]
17 Oct 2022

Composable Text Controls in Latent Space with ODEs
Guangyi Liu, Zeyu Feng, Yuan Gao, Zichao Yang, Xiaodan Liang, Junwei Bao, Xiaodong He, Shuguang Cui, Zhen Li, Zhiting Hu
arXiv 2022. [Paper] [Github]
1 Aug 2022

Structured Denoising Diffusion Models in Discrete State-Spaces
Jacob Austin, Daniel D. Johnson, Jonathan Ho, Daniel Tarlow, Rianne van den Berg
NeurIPS 2021. [Paper]
7 Jul 2021

Latent Diffusion Energy-Based Model for Interpretable Text Modeling
Peiyu Yu, Sirui Xie, Xiaojian Ma, Baoxiong Jia, Bo Pang, Ruigi Gao, Yixin Zhu, Song-Chun Zhu, Ying Nian Wu
ICML 2022. [Paper] [Github]
13 Jun 2022

Diffusion-LM Improves Controllable Text Generation
Xiang Lisa Li, John Thickstun, Ishaan Gulrajani, Percy Liang, Tatsunori B. Hashimoto
NeurIPS 2022. [Paper] [Github]
27 May 2022

Step-unrolled Denoising Autoencoders for Text Generation
Nikolay Savinov, Junyoung Chung, Mikolaj Binkowski, Erich Elsen, Aaron van den Oord
ICLR 2022. [Paper] [Github]
13 Dec 2021

Zero-Shot Translation using Diffusion Models
Eliya Nachmani, Shaked Dovrat
arXiv 2021. [Paper]
2 Nov 2021

Argmax Flows and Multinomial Diffusion: Learning Categorical Distributions
Emiel Hoogeboom, Didrik Nielsen, Priyank Jaini, Patrick Forré, Max Welling
NeurIPS 2021. [Paper]
10 Feb 2021

Tabular and Time Series

Generation

AutoDiff: combining Auto-encoder and Diffusion model for tabular data synthesizing
Namjoon Suh, Xiaofeng Lin, Din-Yin Hsieh, Merhdad Honarkhah, Guang Cheng
arXiv 2023. [Paper]
24 Oct 2023

Fast and Reliable Generation of EHR Time Series via Diffusion Models
Muhang Tian, Bernie Chen, Allan Guo, Shiyi Jiang, Anru R. Zhang
arXiv 2023. [Paper]
23 Oct 2023

Mixed-Type Tabular Data Synthesis with Score-based Diffusion in Latent Space
Hengrui Zhang, Jiani Zhang, Balasubramaniam Srinivasan, Zhengyuan Shen, Xiao Qin, Christos Faloutsos, Huzefa Rangwala, George Karypis
arXiv 2023. [Paper]
14 Oct 2023

NetDiffus: Network Traffic Generation by Diffusion Models through Time-Series Imaging
Nirhoshan Sivaroopan, Dumindu Bandara, Chamara Madarasingha, Guilluame Jourjon, Anura Jayasumana, Kanchana Thilakarathna
arXiv 2023. [Paper]
23 Sep 2023

Generating and Imputing Tabular Data via Diffusion and Flow-based Gradient-Boosted Trees
Alexia Jolicoeur-Martineau, Kilian Fatras, Tal Kachman
arXiv 2023. [Paper]
18 Sep 2023

FinDiff: Diffusion Models for Financial Tabular Data Generation
Timur Sattarov, Marco Schreyer, Damian Borth
arXiv 2023. [Paper]
4 Sep 2023

Conditioning Score-Based Generative Models by Neuro-Symbolic Constraints
Davide Scassola, Sebastiano Saccani, Ginevra Carbone, Luca Bortolussi
arXiv 2023. [Paper]
31 Aug 2023

From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion
Robin San Roman, Yossi Adi, Antoine Deleforge, Romain Serizel, Gabriel Synnaeve, Alexandre Défossez
arXiv 2023. [Paper]
2 Aug 2023

TransFusion: Generating Long, High Fidelity Time Series using Diffusion Models with Transformers
Md Fahim Sikder, Resmi Ramachandranpillai, Fredrik Heintz
arXiv 2023. [Paper]
24 Jul 2023

On the Constrained Time-Series Generation Problem
Andrea Coletta, Sriram Gopalakrishan, Daniel Borrajo, Svitlana Vyetrenko
arXiv 2023. [Paper]
4 Jul 2023

DiffECG: A Generalized Probabilistic Diffusion Model for ECG Signals Synthesis
Nour Neifar, Achraf Ben-Hamadou, Afef Mdhaffar, Mohamed Jmaiel
arXiv 2023. [Paper]
2 Jun 2023

CoDi: Co-evolving Contrastive Diffusion Models for Mixed-type Tabular Synthesis
Chaejeong Lee, Jayoung Kim, Noseong Park
ICML 2023. [Paper]
25 Apr 2023

Customized Load Profiles Synthesis for Electricity Customers Based on Conditional Diffusion Models
Zhenyi Wang, Hongcai Zhang
arXiv 2023. [Paper]
24 Apr 2023

Synthetic Health-related Longitudinal Data with Mixed-type Variables Generated using Diffusion Models
Nicholas I-Hsien Kuo, Louisa Jorm, Sebastiano Barbieri
arXiv 2023. [Paper]
22 Mar 2023

Diffusing Gaussian Mixtures for Generating Categorical Data
Florence Regol, Mark Coates
arXiv 2023. [Paper]
8 Mar 2023

EHRDiff: Exploring Realistic EHR Synthesis with Diffusion Models
Hongyi Yuan, Songchi Zhou, Sheng Yu
arXiv 2023. [Paper] [Github]
10 Mar 2023

Synthesizing Mixed-type Electronic Health Records using Diffusion Models
Taha Ceritli, Ghadeer O. Ghosheh, Vinod Kumar Chauhan, Tingting Zhu, Andrew P. Creagh, David A. Clifton
arXiv 2023. [Paper]
28 Feb 2023

MedDiff: Generating Electronic Health Records using Accelerated Denoising Diffusion Model
Huan He, Shifan Zhao, Yuanzhe Xi, Joyce C Ho
arXiv 2023. [Paper]
8 Feb 2023

Diffusion-based Conditional ECG Generation with Structured State Space Models
Juan Miguel Lopez Alcaraz, Nils Strodthoff
arXiv 2023. [Paper]
19 Jan 2023

TabDDPM: Modelling Tabular Data with Diffusion Models
Akim Kotelnikov, Dmitry Baranchuk, Ivan Rubachev, Artem Babenko
arXiv 2022. [Paper] [Github]
30 Sep 2022

Forecasting

Interacting Diffusion Processes for Event Sequence Forecasting
Mai Zeng, Florence Regol, Mark Coates
arXiv 2023. [Paper]
26 Oct 2023

Diffusion Variational Autoencoder for Tackling Stochasticity in Multi-Step Regression Stock Price Prediction
Kelvin J. L. Koa, Yunshan Ma, Ritchie Ng, Tat-Seng Chua
CIKM 2023. [Paper]
18 Aug 2023

Predict, Refine, Synthesize: Self-Guiding Diffusion Models for Probabilistic Time Series Forecasting
Marcel Kollovieh, Abdul Fatir Ansari, Michael Bohlke-Schneider, Jasper Zschiegner, Hao Wang, Yuyang Wang
arXiv 2023. [Paper]
21 Jul 2023

Data Augmentation for Seizure Prediction with Generative Diffusion Model
Kai Shu, Yuchang Zhao, Le Wu, Aiping Liu, Ruobing Qian, Xun Chen
arXiv 2023. [Paper]
14 Jun 2023

Non-autoregressive Conditional Diffusion Models for Time Series Prediction
Lifeng Shen, James Kwok
arXiv 2023. [Paper]
8 Jun 2023

DYffusion: A Dynamics-informed Diffusion Model for Spatiotemporal Forecasting
Salva Rühling Cachay, Bo Zhao, Hailey James, Rose Yu
arXiv 2023. [Paper]
3 Jun 2023

DiffLoad: Uncertainty Quantification in Load Forecasting with Diffusion Model
Zhixian Wang, Qingsong Wen, Chaoli Zhang, Liang Sun, Yi Wang
arXiv 2023. [Paper]
31 May 2023

TDSTF: Transformer-based Diffusion probabilistic model for Sparse Time series Forecasting
Ping Chang, Huayu Li, Stuart F. Quan, Janet Roveda, Ao Li
arXiv 2023. [Paper] [Github]
16 Jan 2023

Generative Time Series Forecasting with Diffusion, Denoise, and Disentanglement
Yan Li, Xinjiang Lu, Yaqing Wang, Dejing Dou
NeurIPS 2022. [Paper] [Github]
8 Jan 2023

Denoising diffusion probabilistic models for probabilistic energy forecasting
Esteban Hernandez, Jonathan Dumas
Powertech 2022. [Paper]
6 Dec 2022

Modeling Temporal Data as Continuous Functions with Process Diffusion
Marin Biloš, Kashif Rasul, Anderson Schneider, Yuriy Nevmyvaka, Stephan Günnemann
arXiv 2022. [Paper]
4 Nov 2022

Diffusion-based Time Series Imputation and Forecasting with Structured State Space Models
Juan Miguel Lopez Alcaraz, Nils Strodthoff
TMLR 2022. [Paper] [Github]
19 Aug 2022

CARD: Classification and Regression Diffusion Models
Xizewen Han, Huangjie Zheng, Mingyuan Zhou
NeurIPS 2022. [Paper]
15 Jun 2022

ScoreGrad: Multivariate Probabilistic Time Series Forecasting with Continuous Energy-based Generative Models
Tijin Yan, Hongwei Zhang, Tong Zhou, Yufeng Zhan, Yuanqing Xia
arXiv 2021. [Paper] [Github]
18 Jun 2021

Autoregressive Denoising Diffusion Models for Multivariate Probabilistic Time Series Forecasting
Kashif Rasul, Calvin Seward, Ingmar Schuster, Roland Vollgraf
ICLR 2021. [Paper] [Github]
2 Feb 2021

Imputation

Improving Diffusion Models for ECG Imputation with an Augmented Template Prior
Alexander Jenkins, Zehua Chen, Fu Siong Ng, Danilo Mandic
arXiv 2023. [Paper]
24 Oct 2023

sasdim: self-adaptive noise scaling diffusion model for spatial time series imputation
Shunyang Zhang, Senzhang Wang, Xianzhen Tan, Ruochen Liu, Jian Zhang, Jianxin Wang
arXiv 2023. [Paper]
5 Sep 2023

Diffusion-based Time Series Data Imputation for Microsoft 365
Fangkai Yang, Wenjie Yin, Lu Wang, Tianci Li, Pu Zhao, Bo Liu, Paul Wang, Bo Qiao, Yudong Liu, Mårten Björkman, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang
arXiv 2023. [Paper]
3 Aug 2023

PriSTI: A Conditional Diffusion Framework for Spatiotemporal Imputation
Mingzhe Liu, Han Huang, Hao Feng, Leilei Sun, Bowen Du, Yanjie Fu
ICDE 2023. [Paper] [Github]
20 Feb 2023

Modeling Temporal Data as Continuous Functions with Process Diffusion
Marin Biloš, Kashif Rasul, Anderson Schneider, Yuriy Nevmyvaka, Stephan Günnemann
arXiv 2022. [Paper]
4 Nov 2022

Diffusion models for missing value imputation in tabular data
Shuhan Zheng, Nontawat Charoenphakdee
NeurIPS 2022. [Paper]
31 Oct 2022

Neural Markov Controlled SDE: Stochastic Optimization for Continuous-Time Data
Sung Woo Park, Kyungjae Lee, Junseok Kwon
ICLR 2022. [Paper]
29 Sep 2021

Diffusion-based Time Series Imputation and Forecasting with Structured State Space Models
Juan Miguel Lopez Alcaraz, Nils Strodthoff
TMLR 2022. [Paper] [Github]
19 Aug 2022

CSDI: Conditional Score-based Diffusion Models for Probabilistic Time Series Imputation
Yusuke Tashiro, Jiaming Song, Yang Song, Stefano Ermon
NeurIPS 2021. [Paper] [Github]
7 Jul 2021

Miscellany

DDMT: Denoising Diffusion Mask Transformer Models for Multivariate Time Series Anomaly Detection
Chaocheng Yang, Tingyin Wang, Xuanhui Yan
arXiv 2023. [Paper]
13 Oct 2023

NetDiffusion: Network Data Augmentation Through Protocol-Constrained Traffic Generation
Xi Jiang, Shinan Liu, Aaron Gember-Jacobson, Arjun Nitin Bhagoji, Paul Schmitt, Francesco Bronzino, Nick Feamster
arXiv 2023. [Paper]
12 Oct 2023

Latent Diffusion Model for DNA Sequence Generation
Zehui Li, Yuhao Ni, Tim August B. Huygelen, Akashaditya Das, Guoxuan Xia, Guy-Bart Stan, Yiren Zhao
arXiv 2023. [Paper]
9 Oct 2023

MedDiffusion: Boosting Health Risk Prediction via Diffusion-based Data Augmentation
Yuan Zhong, Suhan Cui, Jiaqi Wang, Xiaochen Wang, Ziyi Yin, Yaqing Wang, Houping Xiao, Mengdi Huai, Ting Wang, Fenglong Ma
arXiv 2023. [Paper]
4 Oct 2023

Diffusion Augmentation for Sequential Recommendation
Qidong Liu, Fan Yan, Xiangyu Zhao, Zhaocheng Du, Huifeng Guo, Ruiming Tang, Feng Tian
CIKM 2023. [Paper] [Github]
22 Sep 2023

Generating tabular datasets under differential privacy
Gianluca Truda
arXiv 2023. [Paper]
28 Aug 2023

Region-Disentangled Diffusion Model for High-Fidelity PPG-to-ECG Translation
Debaditya Shome, Pritam Sarkar, Ali Etemad
arXiv 2023. [Paper]
25 Aug 2023

Discrete Conditional Diffusion for Reranking in Recommendation
Xiao Lin, Xiaokai Chen, Chenyang Wang, Hantao Shu, Linfeng Song, Biao Li, Peng jiang
arXiv 2023. [Paper]
14 Aug 2023

Diffusion Model in Causal Inference with Unmeasured Confounders
Tatsuhiro Shimizu
arXiv 2023. [Paper] [Github]
7 Aug 2023

Diff-E: Diffusion-based Learning for Decoding Imagined Speech EEG
Soowon Kim, Young-Eun Lee, Seo-Hyun Lee, Seong-Whan Lee
arXiv 2023. [Paper]
26 Jul 2023

TabADM: Unsupervised Tabular Anomaly Detection with Diffusion Models
Guy Zamberg, Moshe Salhov, Ofir Lindenbaum, Amir Averbuch
arXiv 2023. [Paper]
23 Jul 2023

DiTTO: Diffusion-inspired Temporal Transformer Operator
Oded Ovadia, Eli Turkel, Adar Kahana, George Em Karniadakis
arXiv 2023. [Paper]
18 Jul 2023

ImDiffusion: Imputed Diffusion Models for Multivariate Time Series Anomaly Detection
Yuhang Chen, Chaoyun Zhang, Minghua Ma, Yudong Liu, Ruomeng Ding, Bowen Li, Shilin He, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang
arXiv 2023. [Paper]
3 Jul 2023

MissDiff: Training Diffusion Models on Tabular Data with Missing Values
Yidong Ouyang, Liyan Xie, Chongxuan Li, Guang Cheng
arXiv 2023. [Paper]
2 Jul 2023

RecFusion: A Binomial Diffusion Process for 1D Data for Recommendation
Gabriel Bénédict, Olivier Jeunen, Samuele Papa, Samarth Bhargav, Daan Odijk, Maarten de Rijke
arXiv 2023. [Paper]
15 Jun 2023

Unsupervised Statistical Feature-Guided Diffusion Model for Sensor-based Human Activity Recognition
Si Zuo, Vitor Fortes Rey, Sungho Suh, Stephan Sigg, Paul Lukowicz
arXiv 2023. [Paper]
30 May 2023

Domain-Specific Denoising Diffusion Probabilistic Models for Brain Dynamics
Yiqun Duan, Jinzhao Zhou, Zhen Wang, Yu-Cheng Chang, Yu-Kai Wang, Chin-Teng Lin
arXiv 2023. [Paper]
7 May 2023

Conditional Denoising Diffusion for Sequential Recommendation
Yu Wang, Zhiwei Liu, Liangwei Yang, Philip S. Yu
arXiv 2023. [Paper]
22 Apr 2023

Diffusion Recommender Model
Wenjie Wang, Yiyan Xu, Fuli Feng, Xinyu Lin, Xiangnan He, Tat-Seng Chua
SIGIR 2023. [Paper]
11 Apr 2023

Sequential Recommendation with Diffusion Models
Hanwen Du, Huanhuan Yuan, Zhen Huang, Pengpeng Zhao, Xiaofang Zhou
arXiv 2023. [Paper]
10 Apr 2023

DiffuRec: A Diffusion Model for Sequential Recommendation
Zihao Li, Aixin Sun, Chenliang Li
arXiv 2023. [Paper]
3 Apr 2023

EEG Synthetic Data Generation Using Probabilistic Diffusion Models
Giulio Tosato, Cesare M. Dalbagno, Francesco Fumagalli
Synapsium 2023. [Paper]
6 Mar 2023

DeScoD-ECG: Deep Score-Based Diffusion Model for ECG Baseline Wander and Noise Removal
Huayu Li, Gregory Ditzler, Janet Roveda, Ao Li
IEEE JBHI 2023. [Paper]
31 Jul 2022

Recommendation via Collaborative Diffusion Generative Model
Joojo Walker, Ting Zhong, Fengli Zhang, Qiang Gao, Fan Zhou
KSEM 2022. [Paper]
19 Jul 2022

Graph

Generation

D4Explainer: In-Distribution GNN Explanations via Discrete Denoising Diffusion
Jialin Chen, Shirley Wu, Abhijit Gupta, Rex Ying
arXiv 2023. [Paper]
30 Oct 2023``

Towards Unifying Diffusion Models for Probabilistic Spatio-Temporal Graph Learning
Junfeng Hu, Xu Liu, Zhencheng Fan, Yuxuan Liang, Roger Zimmermann
arXiv 2023. [Paper]
26 Oct 2023

EDGE++: Improved Training and Sampling of EDGE
Mingyang Wu, Xiaohui Chen, Li-Ping Liu
arXiv 2023. [Paper]
22 Oct 2023

GraphMaker: Can Diffusion Models Generate Large Attributed Graphs?
Mufei Li, Eleonora Kreačić, Vamsi K. Potluru, Pan Li
arXiv 2023. [Paper]
20 Oct 2023

Autoregressive Diffusion Model for Graph Generation
Lingkai Kong, Jiaming Cui, Haotian Sun, Yuchen Zhuang, B. Aditya Prakash, Chao Zhang
arXiv 2023. [Paper]
17 Jul 2023

Geometric Neural Diffusion Processes
Emile Mathieu, Vincent Dutordoir, Michael J. Hutchinson, Valentin De Bortoli, Yee Whye Teh, Richard E. Turner
arXiv 2023. [Paper]
11 Jul 2023

SwinGNN: Rethinking Permutation Invariance in Diffusion Models for Graph Generation
Qi Yan, Zhengyang Liang, Yang Song, Renjie Liao, Lele Wang
arXiv 2023. [Paper] [Github]
4 Jul 2023

SaGess: Sampling Graph Denoising Diffusion Model for Scalable Graph Generation
Stratis Limnios, Praveen Selvaraj, Mihai Cucuringu, Carsten Maple, Gesine Reinert, Andrew Elliott
arXiv 2023. [Paper]
29 Jun 2023

Complexity-aware Large Scale Origin-Destination Network Generation via Diffusion Model
Can Rong, Jingtao Ding, Zhicheng Liu, Yong Li
arXiv 2023. [Paper]
8 Jun 2023

A Diffusion Model for Event Skeleton Generation
Fangqi Zhu, Lin Zhang, Jun Gao, Bing Qin, Ruifeng Xu, Haiqin Yang
arXiv 2023. [Paper] [Github]
27 May 2023

Confidence-Based Feature Imputation for Graphs with Partially Known Features
Daeho Um, Jiwoong Park, Seulki Park, Jin Young Choi
ICLR 2023. [Paper] [Github]
26 May 2023

Spatio-temporal Diffusion Point Processes
Yuan Yuan, Jingtao Ding, Chenyang Shao, Depeng Jin, Yong Li
arXiv 2023. [Paper] [Github]
21 May 2023

Efficient and Degree-Guided Graph Generation via Discrete Diffusion Modeling
Xiaohui Chen, Jiaxing He, Xu Han, Li-Ping Liu
ICML 2023. [Paper]
6 May 2023

A 2D Graph-Based Generative Approach For Exploring Transition States Using Diffusion Model
Seonghwan Kim, Jeheon Woo, Woo Youn Kim
arXiv 2023. [Paper]
20 Apr 2023

Two-stage Denoising Diffusion Model for Source Localization in Graph Inverse Problems
Bosong Huang, Weihao Yu, Ruzhong Xie, Jing Xiao, Jin Huang
arXiv 2023. [Paper]
18 Apr 2023

Diffusion Probabilistic Models for Graph-Structured Prediction
Hyosoon Jang, Sangwoo Mo, Sungsoo Ahn
arXiv 2023. [Paper]
21 Feb 2023

GraphGUIDE: interpretable and controllable conditional graph generation with discrete Bernoulli diffusion
Alex M. Tseng, Nathaniel Diamant, Tommaso Biancalani, Gabriele Scalia
arXiv 2023. [Paper]
7 Feb 2023

Graph Generation with Destination-Driven Diffusion Mixture
Jaehyeong Jo, Dongki Kim, Sung Ju Hwang
arXiv 2023. [Paper]
7 Feb 2023

DiffSTG: Probabilistic Spatio-Temporal Graph Forecasting with Denoising Diffusion Models
Haomin Wen, Youfang Lin, Yutong Xia, Huaiyu Wan, Roger Zimmermann, Yuxuan Liang
arXiv 2023. [Paper]
31 Jan 2023

DIFFormer: Scalable (Graph) Transformers Induced by Energy Constrained Diffusion
Qitian Wu, Chenxiao Yang, Wentao Zhao, Yixuan He, David Wipf, Junchi Yan
ICLR 2023. [Paper]
23 Jan 2023

GraphGDP: Generative Diffusion Processes for Permutation Invariant Graph Generation
Han Huang, Leilei Sun, Bowen Du, Yanjie Fu, Weifeng Lv
IEEE ICDM 2022. [Paper] [Github]
4 Dec 2022

NVDiff: Graph Generation through the Diffusion of Node Vectors
Cristian Sbrolli, Paolo Cudrano, Matteo Frosi, Matteo Matteucci
arXiv 2022. [Paper]
20 Nov 2022

Fast Graph Generative Model via Spectral Diffusion
Tianze Luo, Zhanfeng Mo, Sinno Jialin Pan
arXiv 2022. [Paper]
16 Nov 2022

Diffusion Models for Graphs Benefit From Discrete State Spaces
Kilian Konstantin Haefeli, Karolis Martinkus, Nathanaël Perraudin, Roger Wattenhofer
NeurIPS Workshop 2022. [Paper]
4 Oct 2022

DiGress: Discrete Denoising diffusion for graph generation
Clement Vignac, Igor Krawczuk, Antoine Siraudin, Bohan Wang, Volkan Cevher, Pascal Frossard
ICLR 2023. [Paper]
29 Sep 2022

Permutation Invariant Graph Generation via Score-Based Generative Modeling
Chenhao Niu, Yang Song, Jiaming Song, Shengjia Zhao, Aditya Grover, Stefano Ermon
AISTATS 2021. [Paper] [Github]
2 Mar 2020

Molecular and Material Generation

Discriminator Guidance for Autoregressive Diffusion Models
Filip Ekström Kelvinius, Fredrik Lindsten
arXiv 2023. [Paper]
24 Oct 2023

Reflection-Equivariant Diffusion for 3D Structure Determination from Isotopologue Rotational Spectra in Natural Abundance
Austin Cheng, Alston Lo, Santiago Miret, Brooks Pate, Alán Aspuru-Guzik
arXiv 2023. [Paper]
17 Oct 2023

MOFDiff: Coarse-grained Diffusion for Metal-Organic Framework Design
Xiang Fu, Tian Xie, Andrew S. Rosen, Tommi Jaakkola, Jake Smith
arXiv 2023. [Paper]
16 Oct 2023

ForceGen: End-to-end de novo protein generation based on nonlinear mechanical unfolding responses using a protein language diffusion model
Bo Ni, David L. Kaplan, Markus J. Buehler
arXiv 2023. [Paper]
16 Oct 2023

Latent Conservative Objective Models for Data-Driven Crystal Structure Prediction
Han Qi, Xinyang Geng, Stefano Rando, Iku Ohama, Aviral Kumar, Sergey Levine
arXiv 2023. [Paper]
16 Oct 2023

On Accelerating Diffusion-based Molecular Conformation Generation in SE(3)-invariant Space
Zihan Zhou, Ruiying Liu, Tianshu Yu
arXiv 2023. [Paper]
7 Oct 2023

Ophiuchus: Scalable Modeling of Protein Structures through Hierarchical Coarse-graining SO(3)-Equivariant Autoencoders
Allan dos Santos Costa, Ilan Mitnikov, Mario Geiger, Manvitha Ponnapati, Tess Smidt, Joseph Jacobson
arXiv 2023. [Paper]
4 Oct 2023

SE(3)-Stochastic Flow Matching for Protein Backbone Generation
Avishek Joey Bose, Tara Akhound-Sadegh, Kilian Fatras, Guillaume Huguet, Jarrid Rector-Brooks, Cheng-Hao Liu, Andrei Cristian Nica, Maksym Korablyov, Michael Bronstein, Alexander Tong
arXiv 2023. [Paper]
3 Oct 2023

Backdiff: a diffusion model for generalized transferable protein backmapping
Yikai Liu, Ming Chen, Guang Lin
arXiv 2023. [Paper]
3 Oct 2023

Score dynamics: scaling molecular dynamics with picosecond timesteps via conditional diffusion model
Tim Hsu, Babak Sadigh, Vasily Bulatov, Fei Zhou
arXiv 2023. [Paper]
2 Oct 2023

Generative Design of inorganic compounds using deep diffusion language models
Rongzhi Dong, Nihang Fu, dirisuriya M. D. Siriwardane, Jianjun Hu
arXiv 2023. [Paper]
30 Sep 2023

Navigating the Design Space of Equivariant Diffusion-Based Generative Models for De Novo 3D Molecule Generation
Tuan Le, Julian Cremer, Frank Noé, Djork-Arné Clevert, Kristof Schütt
arXiv 2023. [Paper]
29 Sep 2023

Shape-conditioned 3D Molecule Generation via Equivariant Diffusion Models
Ziqi Chen, Bo Peng, Srinivasan Parthasarathy, Xia Ning
arXiv 2023. [Paper]
23 Aug 2023

Leveraging Side Information for Ligand Conformation Generation using Diffusion-Based Approaches
Jiamin Wu, He Cao, Yuan Yao
arXiv 2023. [Paper]
2 Aug 2023

Crystal Structure Prediction by Joint Equivariant Diffusion
Rui Jiao, Wenbing Huang, Peijia Lin, Jiaqi Han, Pin Chen, Yutong Lu, Yang Liu
arXiv 2023. [Paper]
30 Jul 2023

AbDiffuser: Full-Atom Generation of In-Vitro Functioning Antibodies
Karolis Martinkus, Jan Ludwiczak, Kyunghyun Cho, Wei-Ching Liang, Julien Lafrance-Vanasse, Isidro Hotzel, Arvind Rajpal, Yan Wu, Richard Bonneau, Vladimir Gligorijevic, Andreas Loukas
arXiv 2023. [Paper]
28 Jul 2023

DiAMoNDBack: Diffusion-denoising Autoregressive Model for Non-Deterministic Backmapping of C{\alpha} Protein Traces
Michael S. Jones, Kirill Shmilovich, Andrew L. Ferguson
arXiv 2023. [Paper]
23 Jul 2023

Geometric Constraints in Probabilistic Manifolds: A Bridge from Molecular Dynamics to Structured Diffusion Processes
Justin Diamond, Markus Lill
ICML Workshop 2023. [Paper]
10 Jul 2023

Towards Symmetry-Aware Generation of Periodic Materials
Youzhi Luo, Chengkai Liu, Shuiwang Ji
arXiv 2023. [Paper]
6 Jul 2023

Variational Autoencoding Molecular Graphs with Denoising Diffusion Probabilistic Model
Daiki Koge, Naoaki Ono, Shigehiko Kanaya
arXiv 2023. [Paper]
2 Jul 2023

Practical and Asymptotically Exact Conditional Sampling in Diffusion Models
Luhuan Wu, Brian L. Trippe, Christian A. Naesseth, David M. Blei, John P. Cunningham
arXiv 2023. [Paper] [Github]
30 Jun 2023

Graph Denoising Diffusion for Inverse Protein Folding
Kai Yi, Bingxin Zhou, Yiqing Shen, Pietro Liò, Yu Guang Wang
arXiv 2023. [Paper]
29 Jun 2023

DiffDTM: A conditional structure-free framework for bioactive molecules generation targeted for dual proteins
Lei Huang, Zheng Yuan, Huihui Yan, Rong Sheng, Linjing Liu, Fuzhou Wang, Weidun Xie, Nanjun Chen, Fei Huang, Songfang Huang, Ka-Chun Wong, Yaoyun Zhang
arXiv 2023. [Paper]
24 Jun 2023

Hyperbolic Graph Diffusion Model for Molecule Generation
Lingfeng Wen, Xian Wei
arXiv 2023. [Paper]
13 Jun 2023

3D molecule generation by denoising voxel grids
Pedro O. Pinheiro, Joshua Rackers, Joseph Kleinhenz, Michael Maser, Omar Mahmood, Andrew Martin Watkins, Stephen Ra, Vishnu Sresht, Saeed Saremi
arXiv 2023. [Paper]
13 Jun 2023

DiffPack: A Torsional Diffusion Model for Autoregressive Protein Side-Chain Packing
Yangtian Zhan, Zuobai Zhang, Bozitao Zhong, Sanchit Misra, Jian Tang
arXiv 2023. [Paper]
1 Jun 2023

Protein Design with Guided Discrete Diffusion
Nate Gruver, Samuel Stanton, Nathan C. Frey, Tim G. J. Rudner, Isidro Hotzel, Julien Lafrance-Vanasse, Arvind Rajpal, Kyunghyun Cho, Andrew Gordon Wilson
arXiv 2023. [Paper]
31 May 2023

RINGER: Rapid Conformer Generation for Macrocycles with Sequence-Conditioned Internal Coordinate Diffusion
Colin A. Grambow, Hayley Weir, Nathaniel L. Diamant, Alex M. Tseng, Tommaso Biancalani, Gabriele Scalia, Kangway V. Chuang
arXiv 2023. [Paper]
30 May 2023

Trans-Dimensional Generative Modeling via Jump Diffusion Models
Andrew Campbell, William Harvey, Christian Weilbach, Valentin De Bortoli, Tom Rainforth, Arnaud Doucet
arXiv 2023. [Paper]
25 May 2023

Learning Joint 2D & 3D Diffusion Models for Complete Molecule Generation
Han Huang, Leilei Sun, Bowen Du, Weifeng Lv
arXiv 2023. [Paper] [Github]
21 May 2023

MolDiff: Addressing the Atom-Bond Inconsistency Problem in 3D Molecule Diffusion Generation
Xingang Peng, Jiaqi Guan, Qiang Liu, Jianzhu Ma
ICML 2023. [Paper]
11 May 2023

Coarse-to-Fine: a Hierarchical Diffusion Model for Molecule Generation in 3D
Bo Qiang, Yuxuan Song, Minkai Xu, Jingjing Gong, Bowen Gao, Hao Zhou, Weiying Ma, Yanyan Lan
arXiv 2023. [Paper]
5 May 2023

A Latent Diffusion Model for Protein Structure Generation
Cong Fu, Keqiang Yan, Limei Wang, Wing Yee Au, Michael McThrow, Tao Komikado, Koji Maruhashi, Kanji Uchino, Xiaoning Qian, Shuiwang Ji
arXiv 2023. [Paper]
6 May 2023

Geometric Latent Diffusion Models for 3D Molecule Generation
Minkai Xu, Alexander Powers, Ron Dror, Stefano Ermon, Jure Leskovec
ICML 2023. [Paper]
2 May 2023

MUDiff: Unified Diffusion for Complete Molecule Generation
Chenqing Hua, Sitao Luan, Minkai Xu, Rex Ying, Jie Fu, Stefano Ermon, Doina Precup
arXiv 2023. [Paper]
28 Apr 2023

Towards Controllable Diffusion Models via Reward-Guided Exploration
Hengtong Zhang, Tingyang Xu
arXiv 2023. [Paper]
14 Apr 2023

Accurate transition state generation with an object-aware equivariant elementary reaction diffusion model
Chenru Duan, Yuanqi Du, Haojun Jia, Heather J. Kulik
arXiv 2023. [Paper]
12 Apr 2023

DiffDock-PP: Rigid Protein-Protein Docking with Diffusion Models
Mohamed Amine Ketata, Cedrik Laue, Ruslan Mammadov, Hannes Stärk, Menghua Wu, Gabriele Corso, Céline Marquet, Regina Barzilay, Tommi S. Jaakkola
ICLR 2023. [Paper]
8 Apr 2023

3D Equivariant Diffusion for Target-Aware Molecule Generation and Affinity Prediction
Jiaqi Guan, Wesley Wei Qian, Xingang Peng, Yufeng Su, Jian Peng, Jianzhu Ma
ICLR 2023. [Paper]
6 Mar 2023

EigenFold: Generative Protein Structure Prediction with Diffusion Models
Bowen Jing, Ezra Erives, Peter Pao-Huang, Gabriele Corso, Bonnie Berger, Tommi Jaakkola
ICLR Workshop 2023. [Paper] [Github]
5 Apr 2023

Denoising diffusion algorithm for inverse design of microstructures with fine-tuned nonlinear material properties
Nikolaos N. Vlassis, WaiChing Sun
arXiv 2023. [Paper]
24 Feb 2023

Aligned Diffusion Schrödinger Bridges
Vignesh Ram Somnath, Matteo Pariset, Ya-Ping Hsieh, Maria Rodriguez Martinez, Andreas Krause, Charlotte Bunne
arXiv 2023. [Paper]
22 Feb 2023

MiDi: Mixed Graph and 3D Denoising Diffusion for Molecule Generation
Clement Vignac, Nagham Osman, Laura Toni, Pascal Frossard
arXiv 2023. [Paper]
17 Feb 2023

Geometry-Complete Diffusion for 3D Molecule Generation
Alex Morehead, Jianlin Cheng
ICML Workshop 2023. [Paper] [Github]
8 Feb 2023

SE(3) diffusion model with application to protein backbone generation
Jason Yim, Brian L. Trippe, Valentin De Bortoli, Emile Mathieu, Arnaud Doucet, Regina Barzilay, Tommi Jaakkola
arXiv 2023. [Paper]
5 Feb 2023

Data-Efficient Protein 3D Geometric Pretraining via Refinement of Diffused Protein Structure Decoy
Yufei Huang, Lirong Wu, Haitao Lin, Jiangbin Zheng, Ge Wang, Stan Z. Li
arXiv 2023. [Paper]
5 Feb 2023

Two for One: Diffusion Models and Force Fields for Coarse-Grained Molecular Dynamics
Marloes Arts, Victor Garcia Satorras, Chin-Wei Huang, Daniel Zuegner, Marco Federici, Cecilia Clementi, Frank Noé, Robert Pinsler, Rianne van den Berg
arXiv 2023. [Paper]
1 Feb 2023

Generating Novel, Designable, and Diverse Protein Structures by Equivariantly Diffusing Oriented Residue Clouds
Yeqing Lin, Mohammed AlQuraishi
arXiv 2023. [Paper]
29 Jan 2023

Physics-Inspired Protein Encoder Pre-Training via Siamese Sequence-Structure Diffusion Trajectory Prediction
Zuobai Zhang, Minghao Xu, Aurélie Lozano, Vijil Chenthamarakshan, Payel Das, Jian Tang
arXiv 2023. [Paper]
28 Jan 2023

DiffSDS: A language diffusion model for protein backbone inpainting under geometric conditions and constraints
Zhangyang Gao, Cheng Tan, Stan Z. Li
arXiv 2023. [Paper]
22 Jan 2023

DiffBP: Generative Diffusion of 3D Molecules for Target Protein Binding
Haitao Lin, Yufei Huang, Meng Liu, Xuanjing Li, Shuiwang Ji, Stan Z. Li
arXiv 2022. [Paper]
21 Nov 2022

ParticleGrid: Enabling Deep Learning using 3D Representation of Materials
Shehtab Zaman, Ethan Ferguson, Cecile Pereira, Denis Akhiyarov, Mauricio Araya-Polo, Kenneth Chiu
IEEE eScience 2022. [Paper]
15 Nov 2022

Structure-based Drug Design with Equivariant Diffusion Models
Arne Schneuing, Yuanqi Du, Charles Harris, Arian Jamasb, Ilia Igashov, Weitao Du, Tom Blundell, Pietro Lió, Carla Gomes, Max Welling, Michael Bronstein, Bruno Correia
arXiv 2022. [Paper]
24 Oct 2022

Protein Sequence and Structure Co-Design with Equivariant Translation
Chence Shi, Chuanrui Wang, Jiarui Lu, Bozitao Zhong, Jian Tang
ICLR 2023. [Paper]
17 Oct 2022

Equivariant 3D-Conditional Diffusion Models for Molecular Linker Design
Ilia Igashov, Hannes Stärk, Clément Vignac, Victor Garcia Satorras, Pascal Frossard, Max Welling, Michael Bronstein, Bruno Correia
arXiv 2022. [Paper]
11 Oct 2022

State-specific protein-ligand complex structure prediction with a multi-scale deep generative model
Zhuoran Qiao, Weili Nie, Arash Vahdat, Thomas F. Miller III, Anima Anandkumar
NeurIPS Workshop 2022. [Paper]
30 Sep 2022

Equivariant Energy-Guided SDE for Inverse Molecular Design
Fan Bao, Min Zhao, Zhongkai Hao, Peiyao Li, Chongxuan Li, Jun Zhu
ICLR 2023. [Paper]
30 Sep 2022

Protein structure generation via folding diffusion
Kevin E. Wu, Kevin K. Yang, Rianne van den Berg, James Y. Zou, Alex X. Lu, Ava P. Amini
arXiv 2022. [Paper]
30 Sep 2022

MDM: Molecular Diffusion Model for 3D Molecule Generation
Lei Huang, Hengtong Zhang, Tingyang Xu, Ka-Chun Wong
AAAI 2023. [Paper]
13 Sep 2022

Diffusion-based Molecule Generation with Informative Prior Bridges
Lemeng Wu, Chengyue Gong, Xingchao Liu, Mao Ye, Qiang Liu
NeurIPS 2022. [Paper]
2 Sep 2022

Antigen-Specific Antibody Design and Optimization with Diffusion-Based Generative Models
Shitong Luo, Yufeng Su, Xingang Peng, Sheng Wang, Jian Peng, Jianzhu Ma
BioRXiv 2022. [Paper]
11 Jul 2022

Data-driven discovery of novel 2D materials by deep generative models
Peder Lyngby, Kristian Sommer Thygesen
NPJ 2022. [Paper]
24 Jun 2022

Score-based Generative Models for Calorimeter Shower Simulation
Vinicius Mikuni, Benjamin Nachman
arXiv 2022. [Paper]
17 Jun 2022

Diffusion probabilistic modeling of protein backbones in 3D for the motif-scaffolding problem
Brian L. Trippe, Jason Yim, Doug Tischer, Tamara Broderick, David Baker, Regina Barzilay, Tommi Jaakkola
CoRR 2022. [Paper]
8 Jun 2022

Torsional Diffusion for Molecular Conformer Generation
Bowen Jing, Gabriele Corso, Regina Barzilay, Tommi S. Jaakkola
ICLR Workshop 2022. [Paper] [Github]
1 Jun 2022

Protein Structure and Sequence Generation with Equivariant Denoising Diffusion Probabilistic Models
Namrata Anand, Tudor Achim
arXiv 2022. [Paper] [Project] [Github]
26 May 2022

A Score-based Geometric Model for Molecular Dynamics Simulations
Fang Wu, Qiang Zhang, Xurui Jin, Yinghui Jiang, Stan Z. Li
CoRR 2022. [Paper]
19 Apr 2022

Equivariant Diffusion for Molecule Generation in 3D
Emiel Hoogeboom, Victor Garcia Satorras, Clément Vignac, Max Welling
ICML 2022. [Paper] [Github]
31 Mar 2022

GeoDiff: a Geometric Diffusion Model for Molecular Conformation Generation
Minkai Xu, Lantao Yu, Yang Song, Chence Shi, Stefano Ermon, Jian Tang
ICLR 2022. [Paper] [Github]
6 Mar 2022

Crystal Diffusion Variational Autoencoder for Periodic Material Generation
Tian Xie, Xiang Fu, Octavian-Eugen Ganea, Regina Barzilay, Tommi Jaakkola
NeurIPS 2021. [Paper] [Github]
12 Oct 2021

Predicting Molecular Conformation via Dynamic Graph Score Matching
Shitong Luo, Chence Shi, Minkai Xu, Jian Tang
NeurIPS 2021. [Paper]
22 May 2021

Reinforcement Learning

Refining Diffusion Planner for Reliable Behavior Synthesis by Automatic Detection of Infeasible Plans
Kyowoon Lee, Seongun Kim, Jaesik Choi
arXiv 2023. [Paper]
30 Oct 2023

Cold Diffusion on the Replay Buffer: Learning to Plan from Known Good States
Zidan Wang, Takeru Oba, Takuma Yoneda, Rui Shen, Matthew Walter, Bradly C. Stadie
arXiv 2023. [Paper]
21 Oct 2023

Denoising Heat-inspired Diffusion with Insulators for Collision Free Motion Planning
Junwoo Chang, Hyunwoo Ryu, Jiwoo Kim, Soochul Yoo, Joohwan Seo, Nikhil Prakash, Jongeun Choi, Roberto Horowitz
arXiv 2023. [Paper]
19 Oct 2023

Adaptive Online Replanning with Diffusion Models
Siyuan Zhou, Yilun Du, Shun Zhang, Mengdi Xu, Yikang Shen, Wei Xiao, Dit-Yan Yeung, Chuang Gan
arXiv 2023. [Paper]
14 Oct 2023

DiPPeR: Diffusion-based 2D Path Planner applied on Legged Robots
Jianwei Liu, Maria Stamatopoulou, Dimitrios Kanoulas
arXiv 2023. [Paper]
11 Oct 2023

Score Regularized Policy Optimization through Diffusion Behavior
Huayu Chen, Cheng Lu, Zhengyi Wang, Hang Su, Jun Zhu
arXiv 2023. [Paper] [Github]
11 Oct 2023

DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement Learning
Longxiang He, Linrui Zhang, Junbo Tan, Xueqian Wang
arXiv 2023. [Paper]
9 Oct 2023

Diffusion Generative Flow Samplers: Improving learning signals through partial trajectory optimization
Dinghuai Zhang, Ricky Tian Qi Chen, Cheng-Hao Liu, Aaron Courville, Yoshua Bengio
arXiv 2023. [Paper]
4 Oct 2023

Learning to Reach Goals via Diffusion
Vineet Jain, Siamak Ravanbakhsh
arXiv 2023. [Paper]
4 Oct 2023

AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model
Zibin Dong, Yifu Yuan, Jianye Hao, Fei Ni, Yao Mu, Yan Zheng, Yujing Hu, Tangjie Lv, Changjie Fan, Zhipeng Hu
arXiv 2023. [Paper] [Project] [Github]
3 Oct 2023

Efficient Planning with Latent Diffusion
Wenhao Li
arXiv 2023. [Paper]
30 Sep 2023

Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning
Zihan Ding, Chi Jin
arXiv 2023. [Paper]
29 Sep 2023

Maximum Diffusion Reinforcement Learning
Thomas A. Berrueta, Allison Pinosky, Todd D. Murphey
arXiv 2023. [Paper] [Github]
26 Sep 2023

EDMP: Ensemble-of-costs-guided Diffusion for Motion Planning
Kallol Saha, Vishal Mandadi, Jayaram Reddy, Ajit Srikanth, Aditya Agarwal, Bipasha Sen, Arun Singh, Madhava Krishna
arXiv 2023. [Paper]
20 Sep 2023

Reasoning with Latent Diffusion in Offline Reinforcement Learning
Siddarth Venkatraman, Shivesh Khaitan, Ravi Tej Akella, John Dolan, Jeff Schneider, Glen Berseth
arXiv 2023. [Paper]
12 Sep 2023

Compositional Diffusion-Based Continuous Constraint Solvers
Zhutian Yang, Jiayuan Mao, Yilun Du, Jiajun Wu, Joshua B. Tenenbaum, Tomás Lozano-Pérez, Leslie Pack Kaelbling
CoRL 2023. [Paper] [Project]
2 Sep 2023

GNFactor: Multi-Task Real Robot Learning with Generalizable Neural Feature Fields
Yanjie Ze, Ge Yan, Yueh-Hua Wu, Annabella Macaluso, Yuying Ge, Jianglong Ye, Nicklas Hansen, Li Erran Li, Xiaolong Wang
CoRL 2023. [Paper] [Project] [Github]
31 Aug 2023

Beyond Deep Reinforcement Learning: A Tutorial on Generative Diffusion Models in Network Optimization
Hongyang Du, Ruichen Zhang, Yinqiu Liu, Jiacheng Wang, Yijing Lin, Zonghang Li, Dusit Niyato, Jiawen Kang, Zehui Xiong, Shuguang Cui, Bo Ai, Haibo Zhou, Dong In Kim
arXiv 2023. [Paper]
10 Aug 2023

Motion Planning Diffusion: Learning and Planning of Robot Motions with Diffusion Models
Joao Carvalho, An T. Le, Mark Baierl, Dorothea Koert, Jan Peters
arXiv 2023. [Paper]
3 Aug 2023

Reward-Directed Conditional Diffusion: Provable Distribution Estimation and Reward Improvement
Hui Yuan, Kaixuan Huang, Chengzhuo Ni, Minshuo Chen, Mengdi Wang
arXiv 2023. [Paper]
13 Jul 2023

Diffusion Based Multi-Agent Adversarial Tracking
Sean Ye, Manisha Natarajan, Zixuan Wu, Matthew Gombolay
arXiv 2023. [Paper]
12 Jul 2023

Shelving, Stacking, Hanging: Relational Pose Diffusion for Multi-modal Rearrangement
Anthony Simeonov, Ankit Goyal, Lucas Manuelli, Lin Yen-Chen, Alina Sarmiento, Alberto Rodriguez, Pulkit Agrawal, Dieter Fox
arXiv 2023. [Paper] [Project] [Github]
10 Jul 2023

Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning
Suzan Ece Ada, Erhan Oztop, Emre Ugur
arXiv 2023. [Paper]
10 Jul 2023

Crossway Diffusion: Improving Diffusion-based Visuomotor Policy via Self-supervised Learning
Xiang Li, Varun Belagali, Jinghuan Shang, Michael S. Ryoo
arXiv 2023. [Paper]
4 Jul 2023

Beyond Conservatism: Diffusion Policies in Offline Multi-agent Reinforcement Learning
Zhuoran Li, Ling Pan, Longbo Huang
arXiv 2023. [Paper]
4 Jul 2023

Trajectory Generation, Control, and Safety with Denoising Diffusion Probabilistic Models
Nicolò Botteghi, Federico Califano, Mannes Poel, Christoph Brune
arXiv 2023. [Paper]
27 Jun 2023

Fighting Uncertainty with Gradients: Offline Reinforcement Learning via Diffusion Score Matching
H. J. Terry Suh, Glen Chou, Hongkai Dai, Lujie Yang, Abhishek Gupta, Russ Tedrake
arXiv 2023. [Paper]
24 Jun 2023

DiMSam: Diffusion Models as Samplers for Task and Motion Planning under Partial Observability
Xiaolin Fang, Caelan Reed Garrett, Clemens Eppner, Tomás Lozano-Pérez, Leslie Pack Kaelbling, Dieter Fox
arXiv 2023. [Paper]
22 Jun 2023

Reward Shaping via Diffusion Process in Reinforcement Learning
Peeyush Kumar
arXiv 2023. [Paper]
20 Jun 2023

Value function estimation using conditional diffusion models for control
Bogdan Mazoure, Walter Talbott, Miguel Angel Bautista, Devon Hjelm, Alexander Toshev, Josh Susskind
arXiv 2023. [Paper]
9 Jun 2023

Instructed Diffuser with Temporal Condition Guidance for Offline Reinforcement Learning
Jifeng Hu, Yanchao Sun, Sili Huang, SiYuan Guo, Hechang Chen, Li Shen, Lichao Sun, Yi Chang, Dacheng Tao
arXiv 2023. [Paper]
8 Jun 2023

Professional Basketball Player Behavior Synthesis via Planning with Diffusion
Xiusi Chen, Wei-Yao Wang, Ziniu Hu, Curtis Chou, Lam Hoang, Kun Jin, Mingyan Liu, P. Jeffrey Brantingham, Wei Wang
arXiv 2023. [Paper]
7 Jun 2023

MotionDiffuser: Controllable Multi-Agent Motion Prediction using Diffusion
Chiyu Max Jiang, Andre Cornman, Cheolho Park, Ben Sapp, Yin Zhou, Dragomir Anguelov
arXiv 2023. [Paper]
5 Jun 2023

Extracting Reward Functions from Diffusion Models
Felipe Nuti, Tim Franzmeyer, João F. Henriques
arXiv 2023. [Paper]
1 Jun 2023

Reconstructing Graph Diffusion History from a Single Snapshot
Ruizhong Qiu, Dingsu Wang, Lei Ying, H. Vincent Poor, Yifang Zhang, Hanghang Tong
arXiv 2023. [Paper]
1 Jun 2023

SafeDiffuser: Safe Planning with Diffusion Probabilistic Models
Wei Xiao, Tsun-Hsuan Wang, Chuang Gan, Daniela Rus
arXiv 2023. [Paper] Project]
31 May 2023

Efficient Diffusion Policies for Offline Reinforcement Learning
Bingyi Kang, Xiao Ma, Chao Du, Tianyu Pang, Shuicheng Yan
arXiv 2023. [Paper] [Github]
31 May 2023

MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL
Fei Ni, Jianye Hao, Yao Mu, Yifu Yuan, Yan Zheng, Bin Wang, Zhixuan Liang
arXiv 2023. [Paper]
31 May 2023

Generating Behaviorally Diverse Policies with Latent Diffusion Models
Shashank Hegde, Sumeet Batra, K. R. Zentner, Gaurav S. Sukhatme
arXiv 2023. [Paper] [Project]
30 May 2023

Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning
Haoran He, Chenjia Bai, Kang Xu, Zhuoran Yang, Weinan Zhang, Dong Wang, Bin Zhao, Xuelong Li
arXiv 2023. [Paper]
29 May 2023

MADiff: Offline Multi-agent Learning with Diffusion Models
Zhengbang Zhu, Minghuan Liu, Liyuan Mao, Bingyi Kang, Minkai Xu, Yong Yu, Stefano Ermon, Weinan Zhang
arXiv 2023. [Paper]
27 May 2023

Policy Representation via Diffusion Probability Model for Reinforcement Learning
Long Yang, Zhixiong Huang, Fenghao Lei, Yucun Zhong, Yiming Yang, Cong Fang, Shiting Wen, Binbin Zhou, Zhouchen Lin
arXiv 2023. [Paper] [Github]
22 May 2023

Diffusion Co-Policy for Synergistic Human-Robot Collaborative Tasks
Eley Ng, Ziang Liu, Monroe Kennedy III
arXiv 2023. [Paper]
20 May 2023

Contrastive Energy Prediction for Exact Energy-Guided Diffusion Sampling in Offline Reinforcement Learning
Cheng Lu, Huayu Chen, Jianfei Chen, Hang Su, Chongxuan Li, Jun Zhu
ICML 2023. [Paper] [Github]
25 Apr 2023

Goal-Conditioned Imitation Learning using Score-based Diffusion Policies
Moritz Reuss, Maximilian Li, Xiaogang Jia, Rudolf Lioutikov
arXiv 2023. [Paper]
5 Apr 2023

PDPP:Projected Diffusion for Procedure Planning in Instructional Videos
Hanlin Wang, Yilu Wu, Sheng Guo, Limin Wang
CVPR 2023. [Paper]
26 Mar 2023

EDGI: Equivariant Diffusion for Planning with Embodied Agents
Johann Brehmer, Joey Bose, Pim de Haan, Taco Cohen
ICLR Workshop 2023. [Paper]
22 Mar 2023

Multi-Agent Adversarial Training Using Diffusion Learning
Ying Cao, Elsa Rizk, Stefan Vlaski, Ali H. Sayed
arXiv 2023. [Paper]
3 Mar 2023

Diffusion Model-Augmented Behavioral Cloning
Hsiang-Chun Wang, Shang-Fu Chen, Shao-Hua Sun
arXiv 2023. [Paper]
26 Feb 2023

To the Noise and Back: Diffusion for Shared Autonomy
Takuma Yoneda, Luzhe Sun, Bradly Stadie, Ge Yang, Matthew Walter
arXiv 2023. [Paper] [Github]
23 Feb 2023

AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners
Zhixuan Liang, Yao Mu, Mingyu Ding, Fei Ni, Masayoshi Tomizuka, Ping Luo
arXiv 2023. [Paper]
3 Feb 2023

Imitating Human Behaviour with Diffusion Models
Tim Pearce, Tabish Rashid, Anssi Kanervisto, Dave Bignell, Mingfei Sun, Raluca Georgescu, Sergio Valcarcel Macua, Shan Zheng Tan, Ida Momennejad, Katja Hofmann, Sam Devlin
ICLR 2023. [Paper]
25 Jan 2023

Is Conditional Generative Modeling all you need for Decision-Making?
Anurag Ajay, Yilun Du, Abhi Gupta, Joshua Tenenbaum, Tommi Jaakkola, Pulkit Agrawal
ICLR 2023. [Paper]
28 Nov 2022

LAD: Language Augmented Diffusion for Reinforcement Learning
Edwin Zhang, Yujie Lu, William Wang, Amy Zhang
NeurIPS Workshop 2022. [Paper]
27 Oct 2022

Diffusion Policies as an Expressive Policy Class for Offline Reinforcement Learning
Zhendong Wang, Jonathan J Hunt, Mingyuan Zhou
ICLR 2023. [Paper] [Github]
12 Oct 2022

Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling
Huayu Chen, Cheng Lu, Chengyang Ying, Hang Su, Jun Zhu
ICLR 2023. [Paper]
29 Sep 2022

Planning with Diffusion for Flexible Behavior Synthesis
Michael Janner, Yilun Du, Joshua B. Tenenbaum, Sergey Levine
ICML 2022. [Paper] [Github]
20 May 2022

Theory

Generative Fractional Diffusion Models
Gabriel Nobis, Marco Aversa, Maximilian Springenberg, Michael Detzel, Stefano Ermon, Shinichi Nakajima, Roderick Murray-Smith, Sebastian Lapuschkin, Christoph Knochenhauer, Luis Oala, Wojciech Samek
arXiv 2023. [Paper]
26 Oct 2023

Generative Modeling on Manifolds Through Mixture of Riemannian Diffusion Processes
Jaehyeong Jo, Sung Ju Hwang
arXiv 2023. [Paper]
11 Oct 2023

Generalized Schrödinger Bridge Matching
Guan-Horng Liu, Yaron Lipman, Maximilian Nickel, Brian Karrer, Evangelos A. Theodorou, Ricky T. Q. Chen
arXiv 2023. [Paper]
3 Oct 2023

On the Stability of Iterative Retraining of Generative Models on their own Data
Quentin Bertrand, Avishek Joey Bose, Alexandre Duplessis, Marco Jiralerspong, Gauthier Gidel
arXiv 2023. [Paper]
30 Sep 2023

In search of dispersed memories: Generative diffusion models are associative memory networks
Luca Ambrogioni
arXiv 2023. [Paper]
29 Sep 2023

Leveraging Optimization for Adaptive Attacks on Image Watermarks
Nils Lukas, Abdulrahman Diaa, Lucas Fenaux, Florian Kerschbaum
arXiv 2023. [Paper]
29 Sep 2023

Deep Networks as Denoising Algorithms: Sample-Efficient Learning of Diffusion Models in High-Dimensional Graphical Models
Song Mei, Yuchen Wu
arXiv 2023. [Paper]
20 Sep 2023

Generative Hyperelasticity with Physics-Informed Probabilistic Diffusion Fields
Vahidullah Tac, Manuel K Rausch, Ilias Bilionis, Francisco Sahli Costabal, Adrian Buganza Tepole
arXiv 2023. [Paper]
11 Sep 2023

SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models
Shuchen Xue, Mingyang Yi, Weijian Luo, Shifeng Zhang, Jiacheng Sun, Zhenguo Li, Zhi-Ming Ma
arXiv 2023. [Paper]
10 Sep 2023

An Ensemble Score Filter for Tracking High-Dimensional Nonlinear Dynamical Systems
Feng Bao, Zezhong Zhang, Guannan Zhang
arXiv 2023. [Paper]
2 Sep 2023

Sampling with flows, diffusion and autoregressive neural networks: A spin-glass perspective
Davide Ghio, Yatin Dandi, Florent Krzakala, Lenka Zdeborová
arXiv 2023. [Paper]
27 Aug 2023

Renormalizing Diffusion Models
Jordan Cotler, Semon Rezchikov
arXiv 2023. [Paper]
23 Aug 2023

Improving Generative Model-based Unfolding with Schrödinger Bridges
Sascha Diefenbacher, Guan-Horng Liu, Vinicius Mikuni, Benjamin Nachman, Weili Nie
arXiv 2023. [Paper] [Github]
23 Aug 2023

Convergence guarantee for consistency models
Junlong Lyu, Zhitang Chen, Shoubo Feng
arXiv 2023. [Paper]
22 Aug 2023

Mirror Diffusion Models
Jaesung Tae
arXiv 2023. [Paper]
11 Aug 2023

Do Diffusion Models Suffer Error Propagation? Theoretical Analysis and Consistency Regularization
Yangming Li, Zhaozhi Qian, Mihaela van der Schaar
arXiv 2023. [Paper]
9 Aug 2023

Linear Convergence Bounds for Diffusion Models via Stochastic Localization
Joe Benton, Valentin De Bortoli, Arnaud Doucet, George Deligiannidis
arXiv 2023. [Paper]
7 Aug 2023

Imitating Complex Trajectories: Bridging Low-Level Stability and High-Level Behavior
Adam Block, Daniel Pfrommer, Max Simchowitz
arXiv 2023. [Paper]
27 Jul 2023

MCMC-Correction of Score-Based Diffusion Models for Model Composition
Anders Sjöberg, Jakob Lindqvist, Magnus Önnheim, Mats Jirstrand, Lennart Svensson
arXiv 2023. [Paper] [Github]
26 Jul 2023

Synthetic Lagrangian Turbulence by Generative Diffusion Models
Tianyi Li, Luca Biferale, Fabio Bonaccorso, Martino Andrea Scarpolini, Michele Buzzicotti
arXiv 2023. [Paper] [Github]
17 Jul 2023

Metropolis Sampling for Constrained Diffusion Models
Nic Fishman, Leo Klarner, Emile Mathieu, Michael Hutchinson, Valentin de Bortoli
arXiv 2023. [Paper]
11 Jul 2023

Simulation-free Schrödinger bridges via score and flow matching
Alexander Tong, Nikolay Malkin, Kilian Fatras, Lazar Atanackovic, Yanlei Zhang, Guillaume Huguet, Guy Wolf, Yoshua Bengio
arXiv 2023. [Paper]
7 Jul 2023

DiffFlow: A Unified SDE Framework for Score-Based Diffusion Models and Generative Adversarial Networks
Jingwei Zhang, Han Shi, Jincheng Yu, Enze Xie, Zhenguo Li
arXiv 2023. [Paper]
5 Jul 2023

Monte Carlo Sampling without Isoperimetry: A Reverse Diffusion Approach
Xunpeng Huang, Hanze Dong, Yifan Hao, Yian Ma, Tong Zhang
arXiv 2023. [Paper]
5 Jul 2023

Improved sampling via learned diffusions
Lorenz Richter, Julius Berner, Guan-Horng Liu
ICML Workshop 2023. [Paper]
3 Jul 2023

Learning Mixtures of Gaussians Using the DDPM Objective
Kulin Shah, Sitan Chen, Adam Klivans
arXiv 2023. [Paper]
3 Jul 2023

Transport, Variational Inference and Diffusions: with Applications to Annealed Flows and Schrödinger Bridges
Francisco Vargas, Nikolas Nüsken
arXiv 2023. [Paper]
3 Jul 2023

Towards Faster Non-Asymptotic Convergence for Diffusion-Based Generative Models
Gen Li, Yuting Wei, Yuxin Chen, Yuejie Chi
arXiv 2023. [Paper]
15 Jun 2023

Diffusion Models for Black-Box Optimization
Siddarth Krishnamoorthy, Satvik Mehul Mashkaria, Aditya Grover
ICML 2023. [Paper]
12 Jun 2023

Variational Gaussian Process Diffusion Processes
Prakhar Verma, Vincent Adam, Arno Solin
arXiv 2023. [Paper]
3 Jun 2023

Exploring the Optimal Choice for Generative Processes in Diffusion Models: Ordinary vs Stochastic Differential Equations
Yu Cao, Jingrun Chen, Yixin Luo, Xiang Zhou
arXiv 2023. [Paper]
3 Jun 2023

On the Equivalence of Consistency-Type Models: Consistency Models, Consistent Diffusion Models, and Fokker-Planck Regularization
Chieh-Hsin Lai, Yuhta Takida, Toshimitsu Uesaka, Naoki Murata, Yuki Mitsufuji, Stefano Ermon
arXiv 2023. [Paper]
1 Jun 2023

Deep Stochastic Mechanics
Elena Orlova, Aleksei Ustimenko, Ruoxi Jiang, Peter Y. Lu, Rebecca Willett
arXiv 2023. [Paper]
31 May 2023

Conditional score-based diffusion models for Bayesian inference in infinite dimensions
Lorenzo Baldassari, Ali Siahkoohi, Josselin Garnier, Knut Solna, Maarten V. de Hoop
arXiv 2023. [Paper]
28 May 2023

Error Bounds for Flow Matching Methods
Joe Benton, George Deligiannidis, Arnaud Doucet
arXiv 2023. [Paper]
26 May 2023

Tree-Based Diffusion Schrödinger Bridge with Applications to Wasserstein Barycenters
Maxence Noble, Valentin De Bortoli, Arnaud Doucet, Alain Durmus
arXiv 2023. [Paper]
26 May 2023

Debias Coarsely, Sample Conditionally: Statistical Downscaling through Optimal Transport and Probabilistic Diffusion Models
Zhong Yi Wan, Ricardo Baptista, Yi-fan Chen, John Anderson, Anudhyan Boral, Fei Sha, Leonardo Zepeda-Núñez
arXiv 2023. [Paper]
24 May 2023

Optimal Linear Subspace Search: Learning to Construct Fast and High-Quality Schedulers for Diffusion Models
Zhongjie Duan, Chengyu Wang, Cen Chen, Jun Huang, Weining Qian
arXiv 2023. [Paper]
24 May 2023

SEEDS: Exponential SDE Solvers for Fast High-Quality Sampling from Diffusion Models
Martin Gonzalez, Nelson Fernandez, Thuy Tran, Elies Gherbi, Hatem Hajri, Nader Masmoudi
arXiv 2023. [Paper]
23 May 2023

Improved Convergence of Score-Based Diffusion Models via Prediction-Correction
Francesco Pedrotti, Jan Maas, Marco Mondelli
arXiv 2023. [Paper]
23 May 2023

Expressiveness Remarks for Denoising Diffusion Models and Samplers
Francisco Vargas, Teodora Reu, Anna Kerekes
arXiv 2023. [Paper]
16 May 2023

The Score-Difference Flow for Implicit Generative Modeling
Romann M. Weber
arXiv 2023. [Paper]
25 Apr 2023

Diffusion Models for Constrained Domains
Nic Fishman, Leo Klarner, Valentin De Bortoli, Emile Mathieu, Michael Hutchinson
arXiv 2023. [Paper]
11 Apr 2023

Diffusion Bridge Mixture Transports, Schrödinger Bridge Problems and Generative Modeling
Stefano Peluchetti
arXiv 2023. [Paper] 3 Apr 2023

Efficient Sampling of Stochastic Differential Equations with Positive Semi-Definite Models
Anant Raj, Umut Şimşekli, Alessandro Rudi
arXiv 2023. [Paper]
30 Mar 2023

Diffusion Schrödinger Bridge Matching
Yuyang Shi, Valentin De Bortoli, Andrew Campbell, Arnaud Doucet
arXiv 2023. [[Paper(https://arxiv.org/abs/2303.16852)]
29 Mar 2023

Restoration-Degradation Beyond Linear Diffusions: A Non-Asymptotic Analysis For DDIM-Type Samplers
Sitan Chen, Giannis Daras, Alexandros G. Dimakis
arXiv 2023. [Paper]
6 Mar 2023

Diffusion Models are Minimax Optimal Distribution Estimators
Kazusato Oko, Shunta Akiyama, Taiji Suzuki
ICLR Workshop 2023. [Paper]
3 Mar 2023

Understanding the Diffusion Objective as a Weighted Integral of ELBOs
Diederik P. Kingma, Ruiqi Gao
arXiv 2023. [Paper]
1 Mar 2023

Continuous-Time Functional Diffusion Processes
Giulio Franzese, Simone Rossi, Dario Rossi, Markus Heinonen, Maurizio Filippone, Pietro Michiardi
arXiv 2023. [Paper]
1 Mar 2023

Denoising Diffusion Samplers
Francisco Vargas, Will Grathwohl, Arnaud Doucet
ICLR 2023. [Paper]
27 Feb 2023

Infinite-Dimensional Diffusion Models for Function Spaces
Jakiw Pidstrigach, Youssef Marzouk, Sebastian Reich, Sven Wang
arXiv 2023. [Paper]
20 Feb 2023

Score-based Diffusion Models in Function Space
Jae Hyun Lim, Nikola B. Kovachki, Ricardo Baptista, Christopher Beckham, Kamyar Azizzadenesheli, Jean Kossaifi, Vikram Voleti, Jiaming Song, Karsten Kreis, Jan Kautz, Christopher Pal, Arash Vahdat, Anima Anandkumar
arXiv 2023. [Paper]
14 Feb 2023

Score Approximation, Estimation and Distribution Recovery of Diffusion Models on Low-Dimensional Data
Minshuo Chen, Kaixuan Huang, Tuo Zhao, Mengdi Wang
arXiv 2023. [Paper]
14 Feb 2023

Stochastic Modified Flows, Mean-Field Limits and Dynamics of Stochastic Gradient Descent
Benjamin Gess, Sebastian Kassing, Vitalii Konarovskyi
JMLR 2023. [Paper]
14 Feb 2023

Monte Carlo Neural Operator for Learning PDEs via Probabilistic Representation
Rui Zhang, Qi Meng, Rongchan Zhu, Yue Wang, Wenlei Shi, Shihua Zhang, Zhi-Ming Ma, Tie-Yan Liu
arXiv 2023. [Paper]
10 Feb 2023

Example-Based Sampling with Diffusion Models
Bastien Doignies, Nicolas Bonneel, David Coeurjolly, Julie Digne, Loïs Paulin, Jean-Claude Iehl, Victor Ostromoukhov
arXiv 2023. [Paper]
10 Feb 2023

Information-Theoretic Diffusion
Xianghao Kong, Rob Brekelmans, Greg Ver Steeg
ICLR 2023. [Paper] [Github]
7 Feb 2023

Conditional Flow Matching: Simulation-Free Dynamic Optimal Transport
Alexander Tong, Nikolay Malkin, Guillaume Huguet, Yanlei Zhang, Jarrid Rector-Brooks, Kilian Fatras, Guy Wolf, Yoshua Bengio
arXiv 2023. [Paper] [Github]
1 Feb 2023

Transport with Support: Data-Conditional Diffusion Bridges
Ella Tamir, Martin Trapp, Arno Solin
arXiv 2023. [Paper]
31 Jan 2023

Understanding and contextualising diffusion models
Stefano Scotta, Alberto Messina
arXiv 2023. [Paper]
26 Jan 2023

On the Mathematics of Diffusion Models
David McAllester
arXiv 2023. [Paper]
25 Jan 2023

Understanding the diffusion models by conditional expectations
Yubin Lu, Zhongjian Wang, Guillaume Bal
arXiv 2023. [Paper]
19 Jan 2023

Thompson Sampling with Diffusion Generative Prior
Yu-Guan Hsieh, Shiva Prasad Kasiviswanathan, Branislav Kveton, Patrick Blöbaum
arXiv 2023. [Paper]
12 Jan 2023

Your diffusion model secretly knows the dimension of the data manifold
Georgios Batzolis, Jan Stanczuk, Carola-Bibiane Schönlieb
arXiv 2022. [Paper]
23 Dec 2022

Score-based Generative Modeling Secretly Minimizes the Wasserstein Distance
Dohyun Kwon, Ying Fan, Kangwook Lee
NeurIPS 2022. [Paper] [Github]
13 Dec 2022

Nonlinear controllability and function representation by neural stochastic differential equations
Tanya Veeravalli, Maxim Raginsky
arXiv 2022. [Paper]
1 Dec 2022

Diffusion Generative Models in Infinite Dimensions
Gavin Kerrigan, Justin Ley, Padhraic Smyth
AISTATS 2023. [Paper]
1 Dec 2022

Neural Langevin Dynamics: towards interpretable Neural Stochastic Differential Equations
Simon M. Koop, Mark A. Peletier, Jacobus W. Portegies, Vlado Menkovski
arXiv 2022. [Paper]
17 Nov 2022

Improved Analysis of Score-based Generative Modeling: User-Friendly Bounds under Minimal Smoothness Assumptions
Hongrui Chen, Holden Lee, Jianfeng Lu
arXiv 2022. [Paper]
3 Nov 2022

Categorical SDEs with Simplex Diffusion
Pierre H. Richemond, Sander Dieleman, Arnaud Doucet
arXiv 2022. [Paper]
26 Oct 2022

Diffusion Models for Causal Discovery via Topological Ordering
Pedro Sanchez, Xiao Liu, Alison Q O’Neil, Sotirios A. Tsaftaris
ICLR 2023. [Paper] [Github]
12 Oct 2022

Regularizing Score-based Models with Score Fokker-Planck Equations
Chieh-Hsin Lai, Yuhta Takida, Naoki Murata, Toshimitsu Uesaka, Yuki Mitsufuji, Stefano Ermon
NeurIPS Workshop 2022. [Paper]
9 Oct 2022

Sequential Neural Score Estimation: Likelihood-Free Inference with Conditional Score Based Diffusion Models
Louis Sharrock, Jack Simons, Song Liu, Mark Beaumont
arXiv 2022. [Paper]
10 Oct 2022

Analyzing Diffusion as Serial Reproduction
Raja Marjieh, Ilia Sucholutsky, Thomas A. Langlois, Nori Jacoby, Thomas L. Griffiths
arXiv 2022. [Paper]
29 Sep 2022

Convergence of score-based generative modeling for general data distributions
Holden Lee, Jianfeng Lu, Yixin Tan
NeurIPS Workshop 2022. [Paper]
26 Sep 2022

Sampling is as easy as learning the score: theory for diffusion models with minimal data assumptions
Sitan Chen, Sinho Chewi, Jerry Li, Yuanzhi Li, Adil Salim, Anru R. Zhang
arXiv 2022. [Paper]
22 Sep 2022

Riemannian Diffusion Models
Chin-Wei Huang, Milad Aghajohari, Avishek Joey Bose, Prakash Panangaden, Aaron Courville
NeurIPS 2022. [Paper]
16 Aug 2022

Convergence of denoising diffusion models under the manifold hypothesis
Valentin De Bortoli
TMLR 2022. [Paper]
10 Aug 2022

Neural Diffusion Processes
Vincent Dutordoir, Alan Saul, Zoubin Ghahramani, Fergus Simpson
arXiv 2022. [Paper]
8 Jun 2022

Theory and Algorithms for Diffusion Processes on Riemannian Manifolds
Bowen Jing, Gabriele Corso, Jeffrey Chang, Regina Barzilay, Tommi Jaakkola
arXiv 2022. [Paper] [Github]
1 Jun 2022

Riemannian Score-Based Generative Modeling
Valentin De Bortoli, Emile Mathieu, Michael Hutchinson, James Thornton, Yee Whye Teh, Arnaud Doucet
NeurIPS 2022. [Paper]
6 Feb 2022

Interpreting diffusion score matching using normalizing flow
Wenbo Gong, Yingzhen Li
ICML Workshop 2021. [Paper]
21 Jul 2021

A Connection Between Score Matching and Denoising Autoencoders
Pascal Vincent
Neural Computation 2011. [Paper]
7 Jul 2011

Bayesian Learning via Stochastic Gradient Langevin Dynamics
Max Welling, Yee Whye Teh
ICML 2011. [Paper] [Github]
20 Apr 2011

Applications

Denoising Diffusion Probabilistic Models for Hardware-Impaired Communication Systems: Towards Wireless Generative AI
Mehdi Letafati, Samad Ali, Matti Latva-aho
arXiv 2023. [Paper]
30 Oct 2023

Asymmetric Diffusion Based Channel-Adaptive Secure Wireless Semantic Communications
Xintian Ren, Jun Wu, Hansong Xu, Qianqian Pan
arXiv 2023. [Paper]
30 Oct 2023

CodeFusion: A Pre-trained Diffusion Model for Code Generation
Mukul Singh, José Cambronero, Sumit Gulwani, Vu Le, Carina Negreanu, Gust Verbruggen
arXiv 2023. [Paper]
26 Oct 2023

Causal Modeling with Stationary Diffusions
Lars Lorch, Andreas Krause, Bernhard Schölkopf
arXiv 2023. [Paper]
26 Oct 2023

DICE: Diverse Diffusion Model with Scoring for Trajectory Prediction
Younwoo Choi, Ray Coden Mercurius, Soheil Mohamad Alizadeh Shabestary, Amir Rasouli
arXiv 2023. [Paper]
23 Oct 2023

Composer Style-specific Symbolic Music Generation Using Vector Quantized Discrete Diffusion Models
Jincheng Zhang, Jingjing Tang, Charalampos Saitis, György Fazekas
arXiv 2023. [Paper]
21 Oct 2023

Fast Diffusion GAN Model for Symbolic Music Generation Controlled by Emotions
Jincheng Zhang, György Fazekas, Charalampos Saitis
arXiv 2023. [Paper]
21 Oct 2023

Enhancing ML model accuracy for Digital VLSI circuits using diffusion models: A study on synthetic data generation
Prasha Srivastava, Pawan Kumar, Zia Abbas
arXiv 2023. [Paper]
15 Oct 2023

MINDE: Mutual Information Neural Diffusion Estimation
Giulio Franzese, Mustapha Bounoua, Pietro Michiardi
arXiv 2023. [Paper]
13 Oct 2023

WiGenAI: The Symphony of Wireless and Generative AI via Diffusion Models
Mehdi Letafati, Samad Ali, Matti Latva-aho
arXiv 2023. [Paper]
11 Oct 2023

Stochastic Super-resolution of Cosmological Simulations with Denoising Diffusion Models
Andreas Schanz, Florian List, Oliver Hahn
arXiv 2023. [Paper]
10 Oct 2023

Generative quantum machine learning via denoising diffusion probabilistic models
Bingzhi Zhang, Peng Xu, Xiaohui Chen, Quntao Zhuang
arXiv 2023. [Paper]
9 Oct 2023

Forecasting Tropical Cyclones with Cascaded Diffusion Models
Pritthijit Nath, Pancham Shukla, César Quilodrán-Casas
arXiv 2023. [Paper]
2 Oct 2023

Seal2Real: Prompt Prior Learning on Diffusion Model for Unsupervised Document Seal Data Generation and Realisation
Jiancheng Huang, Yifan Liu, Yi Huang, Shifeng Chen
arXiv 2023. [Paper]
1 Oct 2023

High Perceptual Quality Wireless Image Delivery with Denoising Diffusion Models
Selim F. Yilmaz, Xueyan Niu, Bo Bai, Wei Han, Lei Deng, Deniz Gunduz
arXiv 2023. [Paper]
27 Sep 2023

AntiBARTy Diffusion for Property Guided Antibody Design
Jordan Venderley
arXiv 2023. [Paper]
22 Sep 2023

A Diffusion-Model of Joint Interactive Navigation
Matthew Niedoba, Jonathan Wilder Lavington, Yunpeng Liu, Vasileios Lioutas, Justice Sefas, Xiaoxuan Liang, Dylan Green, Setareh Dabiri, Berend Zwartsenberg, Adam Scibior, Frank Wood
NeurIPS 2023. [Paper]
21 Sep 2023

Learning End-to-End Channel Coding with Diffusion Models
Muah Kim, Rick Fritschek, Rafael F. Schaefer
arXiv 2023. [Paper]
19 Sep 2023

Towards Generative Modeling of Urban Flow through Knowledge-enhanced Denoising Diffusion
Zhilun Zhou, Jingtao Ding, Yu Liu, Depeng Jin, Yong Li
arXiv 2023. [Paper]
19 Sep 2023

CDDM: Channel Denoising Diffusion Models for Wireless Semantic Communications
Tong Wu, Zhiyong Chen, Dazhi He, Liang Qian, Yin Xu, Meixia Tao, Wenjun Zhang
arXiv 2023. [Paper]
16 Sep 2023

Probabilistic Constellation Shaping With Denoising Diffusion Probabilistic Models: A Novel Approach
Mehdi Letafati, Samad Ali, Matti Latva-aho
arXiv 2023. [Paper]
15 Sep 2023

Denoising Diffusion Probabilistic Models for Hardware-Impaired Communications
Mehdi Letafati, Samad Ali, Matti Latva-aho
arXiv 2023. [Paper]
15 Sep 2023

Predicting the Radiation Field of Molecular Clouds using Denoising Diffusion Probabilistic Models
Duo Xu, Stella Offner, Robert Gutermuth, Michael Grudic, David Guszejnov, Philip Hopkins
arXiv 2023. [Paper]
11 Sep 2023

Discrete Denoising Diffusion Approach to Integer Factorization
Karlis Freivalds, Emils Ozolins, Guntis Barzdins
ICANN 2023. [Paper] [Github]
11 Sep 2023

Diffusion Generative Inverse Design
Marin Vlastelica, Tatiana López-Guevara, Kelsey Allen, Peter Battaglia, Arnaud Doucet, Kimberley Stachenfeld
ICML Workshop 2023. [Paper]
5 Sep 2023

Turbulent Flow Simulation using Autoregressive Conditional Diffusion Models
Georg Kohl, Li-Wei Chen, Nils Thuerey
arXiv 2023. [Paper]
4 Sep 2023

Quantum-Noise-driven Generative Diffusion Models
Marco Parigi, Stefano Martina, Filippo Caruso
arXiv 2023. [Paper]
23 Aug 2023

A Hybrid Wireless Image Transmission Scheme with Diffusion
Xueyan Niu, Xu Wang, Deniz Gündüz, Bo Bai, Weichao Chen, Guohua Zhou
arXiv 2023. [Paper]
16 Aug 2023

Maat: Performance Metric Anomaly Anticipation for Cloud Services with Conditional Diffusion
Cheryl Lee, Tianyi Yang, Zhuangbin Chen, Yuxin Su, Michael R. Lyu
ASE 2023. [Paper]
15 Aug 2023

Precipitation nowcasting with generative diffusion models
Andrea Asperti, Fabio Merizzi, Alberto Paparella, Giorgio Pedrazzi, Matteo Angelinelli, Stefano Colamonaco
arXiv 2023. [Paper]
13 Aug 2023

Generating observation guided ensembles for data assimilation with denoising diffusion probabilistic model
Yuuichi Asahi, Yuta Hasegawa, Naoyuki Onodera, Takashi Shimokawabe, Hayato Shiba, Yasuhiro Idomura
ICML Workshop 2023. [Paper]
13 Aug 2023

EquiDiff: A Conditional Equivariant Diffusion Model For Trajectory Prediction
Kehua Chen, Xianda Chen, Zihan Yu, Meixin Zhu, Hai Yang
arXiv 2023. [Paper]
12 Aug 2023

Accelerating Diffusion-based Combinatorial Optimization Solvers by Progressive Distillation
Junwei Huang, Zhiqing Sun, Yiming Yang
ICML Workshop 2023. [Paper]
12 Aug 2023

Generative Diffusion Models for Radio Wireless Channel Modelling and Sampling
Ushnish Sengupta, Chinkuo Jao, Alberto Bernacchia, Sattar Vakili, Da-shan Shiu
IEEE GCC 2023. [Paper]
10 Aug 2023

Diffusion probabilistic models enhance variational autoencoder for crystal structure generative modeling
Teerachote Pakornchote, Natthaphon Choomphon-anomakhun, Sorrjit Arrerut, Chayanon Atthapak, Sakarn Khamkaeo, Thiparat Chotibut, Thiti Bovornratanaraks
arXiv 2023. [Paper]
4 Aug 2023

Don’t be so negative! Score-based Generative Modeling with Oracle-assisted Guidance
Saeid Naderiparizi, Xiaoxuan Liang, Berend Zwartsenberg, Frank Wood
arXiv 2023. [Paper]
31 Jul 2023

An Effective LSTM-DDPM Scheme for Energy Theft Detection and Forecasting in Smart Grid
Xun Yuan, Yang Yang, Arwa Alromih, Prosanta Gope, Biplab Sikdar
arXiv 2023. [Paper]
30 Jul 2023

Polyffusion: A Diffusion Model for Polyphonic Score Generation with Internal and External Controls
Lejun Min, Junyan Jiang, Gus Xia, Jingwei Zhao
ISMIR 2023. [Paper]
19 Jul 2023

PreDiff: Precipitation Nowcasting with Latent Diffusion Models
Zhihan Gao, Xingjian Shi, Boran Han, Hao Wang, Xiaoyong Jin, Danielle Maddix, Yi Zhu, Mu Li, Yuyang Wang
arXiv 2023. [Paper]
19 Jul 2023

SEMI-DiffusionInst: A Diffusion Model Based Approach for Semiconductor Defect Classification and Segmentation
Vic De Ridder, Bappaditya Dey, Sandip Halder, Bartel Van Waeyenberge
arXiv 2023. [Paper]
17 Jul 2023

Neuro-symbolic Empowered Denoising Diffusion Probabilistic Models for Real-time Anomaly Detection in Industry 4.0
Luigi Capogrosso, Alessio Mascolini, Federico Girella, Geri Skenderi, Sebastiano Gaiardelli, Nicola Dall’Ora, Francesco Ponzio, Enrico Fraccaroli, Santa Di Cataldo, Sara Vinco, Enrico Macii, Franco Fummi, Marco Cristani
arXiv 2023. [Paper]
13 Jul 2023

PC-Droid: Faster diffusion and improved quality for particle cloud generation
Matthew Leigh, Debajyoti Sengupta, John Andrew Raine, Guillaume Quétant, Tobias Golling
arXiv 2023. [Paper]
13 Jul 2023

Score-based Source Separation with Applications to Digital Communication Signals
Tejas Jayashankar, Gary C. F. Lee, Alejandro Lancho, Amir Weiss, Yury Polyanskiy, Gregory W. Wornell
arXiv 2023. [Paper]
26 Jun 2023

SEEDS: Emulation of Weather Forecast Ensembles with Diffusion Models
Lizao Li, Rob Carver, Ignacio Lopez-Gomez, Fei Sha, John Anderson
arXiv 2023. [Paper]
24 Jun 2023

A prior regularized full waveform inversion using generative diffusion models
Fu Wang, Xinquan Huang, Tariq Alkhalifah
arXiv 2023. [Paper]
22 Jun 2023

HumanDiffusion: diffusion model using perceptual gradients
Yota Ueda, Shinnosuke Takamichi, Yuki Saito, Norihiro Takamune, Hiroshi Saruwatari
arXiv 2023. [Paper]
21 Jun 2023

Ambigram Generation by A Diffusion Model
Takahiro Shirakawa, Seiichi Uchida
arXiv 2023. [Paper]
21 Jun 2023

Unbalanced Diffusion Schrödinger Bridge
Matteo Pariset, Ya-Ping Hsieh, Charlotte Bunne, Andreas Krause, Valentin De Bortoli
arXiv 2023. [Paper]
15 Jun 2023

User-defined Event Sampling and Uncertainty Quantification in Diffusion Models for Physical Dynamical Systems
Marc Finzi, Anudhyan Boral, Andrew Gordon Wilson, Fei Sha, Leonardo Zepeda-Núñez
arXiv 2023. [Paper]
13 Jun 2023

Latent Dynamical Implicit Diffusion Processes
Mohammad R. Rezaei
arXiv 2023. [Paper]
12 Jun 2023

Professional Basketball Player Behavior Synthesis via Planning with Diffusion
Xiusi Chen, Wei-Yao Wang, Ziniu Hu, Curtis Chou, Lam Hoang, Kun Jin, Mingyan Liu, P. Jeffrey Brantingham, Wei Wang
arXiv 2023. [Paper]
7 Jun 2023

High-dimensional and Permutation Invariant Anomaly Detection
Vinicius Mikuni, Benjamin Nachman
arXiv 2023. [Paper]
6 Jun 2023

SwinRDM: Integrate SwinRNN with Diffusion Model towards High-Resolution and High-Quality Weather Forecasting
Lei Chen, Fei Du, Yuan Hu, Fan Wang, Zhibin Wang
arXiv 2023. [Paper]
5 Jun 2023

Inverse-design of nonlinear mechanical metamaterials via video denoising diffusion models
Jan-Hendrik Bastek, Dennis M. Kochmann
arXiv 2023. [Paper]
31 May 2023

A Score-Based Model for Learning Neural Wavefunctions
Xuan Zhang, Shenglong Xu, Shuiwang Ji
arXiv 2023. [Paper]
25 May 2023

Dirichlet Diffusion Score Model for Biological Sequence Generation
Pavel Avdeyev, Chenlai Shi, Yuhao Tan, Kseniia Dudnyk, Jian Zhou
ICML 2023 [Paper] [Github]
18 May 2023

GETMusic: Generating Any Music Tracks with a Unified Representation and Diffusion Framework
Ang Lv, Xu Tan, Peiling Lu, Wei Ye, Shikun Zhang, Jiang Bian, Rui Yan
arXiv 2023. [Paper] [Github]
18 May 2023

End-To-End Latent Variational Diffusion Models for Inverse Problems in High Energy Physics
Alexander Shmakov, Kevin Greif, Michael Fenton, Aishik Ghosh, Pierre Baldi, Daniel Whiteson
arXiv 2023. [Paper]
17 May 2023

Discrete Diffusion Probabilistic Models for Symbolic Music Generation
Matthias Plasser, Silvan Peter, Gerhard Widmer
arXiv 2023. [Paper] [Github]
16 May 2023

AWFSD: Accelerated Wirtinger Flow with Score-based Diffusion Image Prior for Poisson-Gaussian Holographic Phase Retrieval
Zongyu Li, Jason Hu, Xiaojian Xu, Liyue Shen, Jeffrey A. Fessler
arXiv 2023. [Paper]
12 May 2023

LatentPINNs: Generative physics-informed neural networks via a latent representation learning
Mohammad H. Taufik, Tariq Alkhalifah
arXiv 2023. [Paper]
11 May 2023

Latent diffusion models for generative precipitation nowcasting with accurate uncertainty quantification
Jussi Leinonen, Ulrich Hamann, Daniele Nerini, Urs Germann, Gabriele Franch
arXiv 2023. [Paper]
25 Apr 2023

DiffESM: Conditional Emulation of Earth System Models with Diffusion Models
Seth Bassetti, Brian Hutchinson, Claudia Tebaldi, Ben Kravitz
ICLR 2023. [Paper]
23 Apr 2023

Diffusion Model for GPS Trajectory Generation
Yuanshao Zhu, Yongchao Ye, Xiangyu Zhao, James J.Q. Yu
arXiv 2023. [Paper]
23 Apr 2023

On Accelerating Diffusion-Based Sampling Process via Improved Integration Approximation
Guoqiang Zhang, Niwa Kenta, W. Bastiaan Kleijn
arXiv 2023. [Paper]
22 Apr 2023

iPINNs: Incremental learning for Physics-informed neural networks
Aleksandr Dekhovich, Marcel H.F. Sluiter, David M.J. Tax, Miguel A. Bessa
arXiv 2023. [Paper]
10 Apr 2023

Denoising Diffusion Probabilistic Models to Predict the Density of Molecular Clouds
Duo Xu, Jonathan C. Tan, Chia-Jung Hsu, Ye Zhu
ApJ 2023. [Paper]
4 Apr 2023

Deep Generative Model and Its Applications in Efficient Wireless Network Management: A Tutorial and Case Study
Yinqiu Liu, Hongyang Du, Dusit Niyato, Jiawen Kang, Zehui Xiong, Dong In Kim, Abbas Jamalipour
arXiv 2023. [Paper]
30 Mar 2023

AI-Generated 6G Internet Design: A Diffusion Model-based Learning Approach
Yudong Huang, Minrui Xu, Xinyuan Zhang, Dusit Niyato, Zehui Xiong, Shuo Wang, Tao Huang
arXiv 2023. [Paper]
24 Mar 2023

Generative AI-aided Optimization for AI-Generated Content (AIGC) Services in Edge Networks
Hongyang Du, Zonghang Li, Dusit Niyato, Jiawen Kang, Zehui Xiong, Huawei Huang, Shiwen Mao
arXiv 2023. [Paper] [Github]
23 Mar 2023

Stable Bias: Analyzing Societal Representations in Diffusion Models
Alexandra Sasha Luccioni, Christopher Akiki, Margaret Mitchell, Yacine Jernite
arXiv 2023. [Paper]
20 Mar 2023

PC-JeDi: Diffusion for Particle Cloud Generation in High Energy Physics
Matthew Leigh, Debajyoti Sengupta, Guillaume Quétant, John Andrew Raine, Knut Zoch, Tobias Golling
arXiv 2023. [Paper]
9 Mar 2023

Generating Initial Conditions for Ensemble Data Assimilation of Large-Eddy Simulations with Latent Diffusion Models
Alex Rybchuk, Malik Hassanaly, Nicholas Hamilton, Paula Doubrawa, Mitchell J. Fulton, Luis A. Martínez-Tossas
arXiv 2023. [Paper]
1 Mar 2023

ReorientDiff: Diffusion Model based Reorientation for Object Manipulation
Utkarsh A. Mishra, Yongxin Chen
arXiv 2023. [Paper] [Project]
28 Feb 2023

Interventional and Counterfactual Inference with Diffusion Models
Patrick Chao, Patrick Blöbaum, Shiva Prasad Kasiviswanathan
arXiv 2023. [Paper]
2 Feb 2023

Denoising Diffusion for Sampling SAT Solutions
Karlis Freivalds, Sergejs Kozlovics
NeurIPS Workshop 2022. [Paper]
30 Nov 2022

Generating astronomical spectra from photometry with conditional diffusion models
Lars Doorenbos, Stefano Cavuoti, Giuseppe Longo, Massimo Brescia, Raphael Sznitman, Pablo Márquez-Neila
NeurIPS Workshop 2022. [Paper] [Github]
10 Nov 2022

Graphically Structured Diffusion Models
Christian Weilbach, William Harvey, Frank Wood
arXiv 2022. [Paper]
20 Oct 2022

Denoising Diffusion Error Correction Codes
Yoni Choukroun, Lior Wolf
arXiv 2022. [Paper]
16 Sep 2022

Deep Diffusion Models for Robust Channel Estimation
Marius Arvinte, Jonathan I Tamir
arXiv 2021. [Paper] [Github]
16 Nov 2021

Diffusion models for Handwriting Generation
Troy Luhman, Eric Luhman
arXiv 2020. [Paper] [Github]
13 Nov 2020