Ph.D. student at Renmin University of China (since fall 2023, @ML-GSAI). I'm interested in scalability and optimization in deep learning.
- Renmin University of China
- Beijing, China
- https://chen-yu-zheng.github.io/
Pinned
- ML-GSAI/Scaling-Diffusion-Transformers-muP (Public): [NeurIPS 2025] Official implementation for our paper "Scaling Diffusion Transformers Efficiently via μP".
- ML-GSAI/Revisiting-Dis-vs-Gen-Classifiers (Public): Official implementation for "Revisiting Discriminative vs. Generative Classifiers: Theory and Implications".
- ML-GSAI/Understanding-GDA (Public): Towards understanding modern generative data augmentation techniques.
- ML-GSAI/MesaOpt-AR-Transformer (Public): Official implementation for NeurIPS 2024 paper "On Mesa-Optimization in Autoregressively Trained Transformers: Emergence and Capability".
