Skip to content

JosephTiTan/ParaUni

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 

Repository files navigation

ParaUni: Enhance Generation in Unified Multimodal Model with Reinforcement-driven Hierarchical Parallel Information Interaction

Overview

We propose ParaUni. It extracts features from variants VLM's layers in a Parallel way for comprehensive information interaction and retains a flexible separation architecture to enhance generation in Unified multimodal model. Concretely, visual features from all VLM's layers are fed in parallel into a Layer Integration Module (LIM), which efficiently integrates fine-grained details and semantic abstractions and provides the fused representation as a condition to the diffusion model.

Result

Citation

Thanks to the developers of OpenUni for their excellent work. Our code is adapted from OpenUni and Flow-GRPO. If our work assists your research, feel free to give us a star ⭐ or cite us using:

@article{tan2025parauni,
  title={ParaUni: Enhance Generation in Unified Multimodal Model with Reinforcement-driven Hierarchical Parallel Information Interaction},
  author={Tan, Jiangtong and Liu, Lin and Huanng, Jie and Zhang, Xiaopeng and Tian, Qi and Zhao, Feng},
  journal={arXiv preprint arXiv:2512.05422},
  year={2025}
}

About

Official code of the paper: ParaUni: Enhance Generation in Unified Multimodal Model with Reinforcement-driven Hierarchical Parallel Information Interaction

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors