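CVPR 2024, the latest top computer vision conference, features a large number of CV papers built on generative AI (AIGC), with diffusion models the most prominent. Beyond direct generation, diffusion models are also widely applied to low-level and high-level vision tasks. This post collects and organizes 100+ AIGC and diffusion-model papers from CVPR 2024 across 40+ research directions, all sorted by category.
To get the paper collection, sorted by category into folders, follow the WeChat public account 【机器学习与AI生成创作】 and reply CVPR2024 in its message box.
This post is the list-only version. The detailed version, titled "CVPR 2024 | 绝了!!最新 diffusion 扩散模型梳理!100+篇论文、40+研究方向!" (roughly, "CVPR 2024 | The latest diffusion model roundup: 100+ papers, 40+ research directions"), is much longer. Compiling it took real effort, and the later sections are the most interesting. If you find it useful, please share it.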
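Contents: diffusion model application directions
- 1. Diffusion model improvements
- 2. Controllable text-to-image generation
- 3. Style transfer
- 4. Human image generation
- 5. Image super-resolution
- 6. Image restoration
- 7. Object tracking
- 8. Object detection
- 9. Keypoint detection
- 10. Deepfake detection
- 11. Anomaly detection
- 12. Matting / segmentation
- 13. Image compression
- 14. Video understanding
- 15. Video generation
- 16. Listening head generation
- 17. Digital human generation
- 18. Novel view synthesis
- 19. 3D
- 20. Image inpainting
- 21. Sketch
- 22. Copyright and privacy
- 23. Data augmentation
- 24. Medical imaging
- 25. Traffic and driving
- 26. Speech
- 27. Pose estimation
- 28. Graphs
- 29. Action detection / generation
- 30. Robot planning / decision making
- 31. Visual storytelling / story generation
- 32. Causal attribution
- 33. Privacy protection / adversarial estimation
- 34. Diffusion model improvements (supplement)
- 35. Interactive controllable generation
- 36. Image restoration (supplement)
- 37. Domain adaptation / transfer learning
- 38. Hand interaction
- 39. Camouflage detection
- 40. Multi-task learning
- 41. Trajectory prediction
- 42. Scene generation
- 43. 3D / flow estimation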
I. Diffusion model improvements
1. Accelerating Diffusion Sampling with Optimized Time Steps
2. DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
3. Balancing Act: Distribution-Guided Debiasing in Diffusion Models
4. Few-shot Learner Parameterization by Diffusion Time-steps
5. Structure-Guided Adversarial Training of Diffusion Models
6. Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models
7. Boosting Diffusion Models with Moving Average Sampling in Frequency Domain
8. Towards Memorization-Free Diffusion Models
9. SD-DiT: Unleashing the Power of Self-supervised Discrimination in Diffusion Transformer
II. Controllable text-to-image generation
10. ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models
11. NoiseCollage: A Layout-Aware Text-to-Image Diffusion Model Based on Noise Cropping and Merging
12. Discriminative Probing and Tuning for Text-to-Image Generation
13. Dysen-VDM: Empowering Dynamics-aware Text-to-Video Diffusion with LLMs
14. Face2Diffusion for Fast and Editable Face Personalization
15. LeftRefill: Filling Right Canvas based on Left Reference through Generalized Text-to-Image Diffusion Model
16. InteractDiffusion: Interaction Control in Text-to-Image Diffusion Models
17. MACE: Mass Concept Erasure in Diffusion Models
18. MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis
19. One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications
20. FlashEval: Towards Fast and Accurate Evaluation of Text-to-image Diffusion Generative Models
III. Style transfer
21. DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations
22. Deformable One-shot Face Stylization via DINO Semantic Guidance
23. One-Shot Structure-Aware Stylized Image Synthesis
IV. Human image generation
24. Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis
25. High-fidelity Person-centric Subject-to-Image Synthesis
26. Towards Effective Usage of Human-Centric Priors in Diffusion Models for Text-based Human Image Generation
27. A Unified and Interpretable Emotion Representation and Expression Generation
28. CosmicMan: A Text-to-Image Foundation Model for Humans
29. DiffHuman: Probabilistic Photorealistic 3D Reconstruction of Humans
30. Texture-Preserving Diffusion Models for High-Fidelity Virtual Try-On
V. Image super-resolution
31. Arbitrary-Scale Image Generation and Upsampling using Latent Diffusion Model and Implicit Neural Decoder
32. Diffusion-based Blind Text Image Super-Resolution
33. Text-guided Explorable Image Super-resolution
34. Building Bridges across Spatial and Temporal Resolutions: Reference-Based Super-Resolution via Change Priors and Conditional Diffusion Model
VI. Image restoration
35. Boosting Image Restoration via Priors from Pre-trained Models
36. Image Restoration by Denoising Diffusion Models with Iteratively Preconditioned Guidance
37. Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks
38. Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model
39. Shadow Generation for Composite Image Using Diffusion Model
VII. Object tracking
40. Delving into the Trajectory Long-tail Distribution for Muti-object Tracking
VIII. Object detection
41. SAFDNet: A Simple and Effective Network for Fully Sparse 3D Object Detection
42. DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception
43. SDDGR: Stable Diffusion-based Deep Generative Replay for Class Incremental Object Detection
IX. Keypoint detection
44. Pose-Guided Self-Training with Two-Stage Clustering for Unsupervised Landmark Discovery
X. Deepfake detection
45. Latent Reconstruction Error Based Method for Diffusion-Generated Image Detection
XI. Anomaly detection
46. RealNet: A Feature Selection Network with Realistic Synthetic Anomaly for Anomaly Detection
XII. Matting / segmentation
47. In-Context Matting
XIII. Image compression
48. Laplacian-guided Entropy Model in Neural Codec with Blur-dissipated Synthesis
XIV. Video understanding
49. Abductive Ego-View Accident Video Understanding for Safe Driving Perception
XV. Video generation
50. FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
51. Grid Diffusion Models for Text-to-Video Generation
52. TRIP: Temporal Residual Learning with Image Noise Prior for Image-to-Video Diffusion Models
53. Co-Speech Gesture Video Generation via Motion-Decoupled Diffusion Model
54. Video Interpolation With Diffusion Models
XVI. Listening head generation
55. CustomListener: Text-guided Responsive Interaction for User-friendly Listening Head Generation
XVII. Digital human generation
56. Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework
XVIII. Novel view synthesis
57. EscherNet: A Generative Model for Scalable View Synthesis
XIX. 3D
58. Bayesian Diffusion Models for 3D Shape Reconstruction
59. DreamControl: Control-Based Text-to-3D Generation with 3D Self-Prior
60. DanceCamera3D: 3D Camera Movement Synthesis with Music and Dance
61. DiffuScene: Denoising Diffusion Models for Generative Indoor Scene Synthesis
62. IPoD: Implicit Field Learning with Point Diffusion for Generalizable 3D Object Reconstruction from Single RGB-D Images
63. Move as You Say, Interact as You Can: Language-guided Human Motion Generation with Scene Affordance
64. MicroDiffusion: Implicit Representation-Guided Diffusion for 3D Reconstruction from Limited 2D Microscopy Projections
65. Sculpt3D: Multi-View Consistent Text-to-3D Generation with Sparse 3D Prior
66. Score-Guided Diffusion for 3D Human Recovery
67. Towards Realistic Scene Generation with LiDAR Diffusion Models
68. VP3D: Unleashing 2D Visual Prompt for Text-to-3D Generation
XX. Image inpainting
69. Structure Matters: Tackling the Semantic Discrepancy in Diffusion Models for Image Inpainting
XXI. Sketch
70. It’s All About Your Sketch: Democratising Sketch Control in Diffusion Models
71. Text-to-Image Diffusion Models are Great Sketch-Photo Matchmakers
XXII. Copyright and privacy
72. CGI-DM: Digital Copyright Authentication for Diffusion Models via Contrasting Gradient Inversion
73. CPR: Retrieval Augmented Generation for Copyright Protection
XXIII. Data augmentation
74. SatSynth: Augmenting Image-Mask Pairs through Diffusion Models for Aerial Semantic Segmentation
75. ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object
XXIV. Medical imaging
76. MedM2G: Unifying Medical Multi-Modal Generation via Cross-Guided Diffusion with Visual Invariant
XXV. Traffic and driving
77. Controllable Safety-Critical Closed-loop Traffic Simulation via Guided Diffusion
78. Generalized Predictive Model for Autonomous Driving
XXVI. Speech
79. FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models
80. ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis
81. Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners
XXVII. Pose estimation
82. Object Pose Estimation via the Aggregation of Diffusion Features
XXVIII. Graphs
83. DiffAssemble: A Unified Graph-Diffusion Model for 2D and 3D Reassembly
XXIX. Action detection / generation
84. Action Detection via an Image Diffusion Process
85. Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives
86. OMG: Towards Open-vocabulary Motion Generation via Mixture of Controllers
XXX. Robot planning / decision making
87. SkillDiffuser: Interpretable Hierarchical Planning via Skill Abstractions in Diffusion-Based Task Execution
XXXI. Visual storytelling / story generation
88. Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models
XXXII. Causal attribution
89. ProMark: Proactive Diffusion Watermarking for Causal Attribution
XXXIII. Privacy protection / adversarial estimation
90. Robust Imperceptible Perturbation against Diffusion Models
XXXIV. Diffusion model improvements (supplement)
91. Condition-Aware Neural Network for Controlled Image Generation
XXXV. Interactive controllable generation
92. Drag Your Noise: Interactive Point-based Editing via Diffusion Semantic Propagation
XXXVI. Image restoration (supplement)
93. Generating Content for HDR Deghosting from Frequency View
XXXVII. Domain adaptation / transfer learning
94. Unknown Prompt, the only Lacuna: Unveiling CLIP’s Potential for Open Domain Generalization
XXXVIII. Hand interaction
95. Text2HOI: Text-guided 3D Motion Generation for Hand-Object Interaction
96. InterHandGen: Two-Hand Interaction Generation via Cascaded Reverse Diffusion
XXXIX. Camouflage detection
97. LAKE-RED: Camouflaged Images Generation by Latent Background Knowledge Retrieval-Augmented Diffusion
XL. Multi-task learning
98. DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data
XLI. Trajectory prediction
99. SingularTrajectory: Universal Trajectory Predictor Using Diffusion Model
XLII. Scene generation
100. SemCity: Semantic Scene Generation with Triplane Diffusion
XLIII. 3D / flow estimation
101. DifFlow3D: Toward Robust Uncertainty-Aware Scene Flow Estimation with Iterative Diffusion-Based Refinement