Mengping's Personal Blogs and Thoughts on Generative Models & Multimodal Learning
Personal blogs and research notes from Mengping Yang
Image Generative Models
Decomposing 2D visual generative models: modularization, key technologies, limitations, and potential works
In this blog, I discussed existing visual generation models by decomposing key components for training and developing 2D image generative models including Diffusion Models, GANs, and visual auto-regressive models.
Neural Information Processing Systems (NeurIPS), 2024
[Paper] [Project Page] [Code]
Video Generative Models
Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance
Kuan Heng Lin*, Sicheng Mo*, Ben Klingher, Fangzhou Mu, Bolei Zhou
Neural Information Processing Systems (NeurIPS), 2024
[Paper] [Project Page] [Code]
Multimodal Models
Improving GANs with A Dynamic Discriminator
Ceyuan Yang*, Yujun Shen*, Yinghao Xu, Deli Zhao, Bo Dai, Bolei Zhou
Neural Information Processing Systems (NeurIPS), 2022
[Paper] [Project Page] [Code]
Large Language Models
Improving GANs with A Dynamic Discriminator
Ceyuan Yang*, Yujun Shen*, Yinghao Xu, Deli Zhao, Bo Dai, Bolei Zhou
Neural Information Processing Systems (NeurIPS), 2022
[Paper] [Project Page] [Code]
Misk & Resources
Exploring and Exploiting Interpretable Semantics in GANs
Bolei Zhou
Interpretable Machine Learning for Computer Vision (CVPR'20 Tutorial)
[Video (YouTube)] [Video (bilibili)] [Slides]
Credit
Individual Researcher
Mengping Yang
               

The website template was adapted from Genforce, thanks a lot for the amazing repo!