DPO-LLPS

This repository contains the implementation code for our research paper, [DPO-LLPS: Biologically-informed hierarchical transfer learning Strategy for Designing Phase Separation–Driving Proteins]

Introduction

The framework combines hierarchical transfer learning and generative modeling for LLPS protein design. Localization fine-tuning encodes compartment-specific “chemical grammar,” while DPO captures LLPS-driving “molecular grammar.” The model generates novel proteins with targeted localization, tunable phase behavior, and validated condensate stability, enabling programmable and mechanistically interpretable LLPS design.

Installation

Create and activate conda environment

conda env create -f env.yml
conda activate DPO-LLPS

Usage

Model Checkpoints and data

Model checkpoints and complete datasets are available on Zenodo.

Train

To train a model with the default configuration (e.g., DPO-LLPS), simply run:

python train_llps_dpo_multigpu.py

Inference

To perform inference with the default configuration (e.g., DPO-LLPS), run:

python LLPS-DPO-inference.py

Data process

The processing procedures for all datasets can be found in the data_process folder.

Citation

If you find the models useful in your research, please cite our paper.

Contact

If you have any question, please feel free to email us (yangyangzhang@zju.edu.cn).

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
data		data
data_process		data_process
mapping_files		mapping_files
.gitignore		.gitignore
LLPS-DPO-inference.py		LLPS-DPO-inference.py
LLPS-SFT-inference.py		LLPS-SFT-inference.py
README.md		README.md
RNA-DPP-inference.py		RNA-DPP-inference.py
env.yml		env.yml
image.png		image.png
location-inference.py		location-inference.py
pytorch_transformer.py		pytorch_transformer.py
train_condesate_location_multigpu.py		train_condesate_location_multigpu.py
train_llps_dpo_multigpu.py		train_llps_dpo_multigpu.py
train_llps_sft_multigpu.py		train_llps_sft_multigpu.py
train_subcellular_location_multigpu.py		train_subcellular_location_multigpu.py
trian_rna_dpo_multigpu.py		trian_rna_dpo_multigpu.py
untils.py		untils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DPO-LLPS

Introduction

Installation

Create and activate conda environment

Usage

Model Checkpoints and data

Train

Inference

Data process

Citation

Contact

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

DPO-LLPS

Introduction

Installation

Create and activate conda environment

Usage

Model Checkpoints and data

Train

Inference

Data process

Citation

Contact

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages