- ๐ Iโm a first-year master student at IIGROUP in Tsinghua University, supervised by Prof. Yujiu Yang.
- ๐ฑ I am currently working closely with Dr. Ling Yang and Prof. Bin Cui from DAIR Lab in Peking University.
- ๐ญ My research interests lie in Controllable Text-to-Image Generation, Text-to-Video Generation and Multimodal Large Language Models.
Tsinghua University
-
Tsinghua University
- Beijing
-
04:37
(UTC +08:00) - https://cominclip.github.io/
Pinned Loading
-
YangLing0818/RPG-DiffusionMaster
YangLing0818/RPG-DiffusionMaster Public[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
-
YangLing0818/IterComp
YangLing0818/IterComp PublicIterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation
-
YangLing0818/RealCompo
YangLing0818/RealCompo Public[NeurIPS 2024] RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models
-
mini-sora/minisora
mini-sora/minisora PublicMiniSora: A community aims to explore the implementation path and future development direction of Sora.
-
BoxDiff-XL
BoxDiff-XL PublicExtend BoxDiff to SDXL (SDXL-based layout-to-image generation)
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.