PXDesign

(Figure: PXDesign wet-lab results.)

📘 Project Page     📄 Technical Report     🧬 Web Server

Twitter Slack Wechat Email


📣📣📣 We're hiring!
Positions in Beijing 🇨🇳 and Seattle 🇺🇸
👉 Join us »


Introduction

PXDesign is a model suite for de novo protein-binder design: a diffusion generator (PXDesign-d) paired with Protenix and AF2-IG confidence models for selection. Across seven targets tested, PXDesign achieved nanomolar hit rates of 17-82% on six.

PXDesign Web Server (✅ Highly Recommended)

PXDesign involves complex models and custom CUDA kernels. We strongly recommend the Web Server as the fastest, most stable, and most user-friendly way to use PXDesign:

  • Zero Setup: No installation, no GPU required, no environment debugging.
  • Proven Utility: Since launch (2025-09), the server has supported numerous researchers and external collaborators in successfully identifying binders for wet-lab validation.
  • Aligned with Paper: Runs the exact pipeline used in our Technical Report.
  • Free Access & Extra Quota: The server is free. Users with wet-lab plans can apply for generous additional quota (>90% approval rate within a week).

Click here to access PXDesign Web Server.

If you want to run PXDesign locally, continue to the sections below.


⚡️ Installation & Setup

Before running PXDesign, you need to set up the environment and download necessary weights.

1. Install PXDesign

Choose one of the following methods to install.

Option 1: Docker

Step 1. Build the Docker Image

docker build -t pxdesign -f Dockerfile .

Step 2. Start the Container

docker run -it --gpus all pxdesign bash

Step 3. Install PXDesign in the Container

Inside the container:

git clone https://github.com/bytedance/PXDesign.git
cd PXDesign
pip install --upgrade pip
pip install -e ./PXDesignBench
pip install -e .
Option 2: Conda

Run the script install.sh to set up an environment and install all dependencies.

What the installer will do

  1. Create a dedicated conda / mamba / micromamba environment
  2. Install PyTorch matching your specified CUDA version
  3. Install Protenix
  4. Install PXDesignBench (bundled in this repo)
  5. Install PXDesign
  6. Run basic import sanity checks

Supported options

--env <name>           Environment name (default: pxdesign)
--pkg_manager <tool>   conda | mamba | micromamba (default: conda)
--cuda-version <ver>   CUDA version string, e.g. 12.1, 12.2, 12.4
                        Required. Must be >= 12.1.

Example:

bash -x install.sh --env pxdesign --pkg_manager conda --cuda-version 12.1

Run pxdesign pipeline --help to ensure the installation is successful.

2. First-Time Downloads

Required Step: Manual Downloads

Please run the script with:

bash download_tool_weights.sh

to download the model weights of the external evaluation tools (e.g., AF2, MPNN) and the CCD cache.

By default, the CCD cache is downloaded to ${project_root}/release_data/ccd_cache. You may override this location via the environment variable: export PROTENIX_DATA_ROOT_DIR=/custom/path/to/ccd_cache.

Automatic Downloads (No Action Required)

On the first run, PXDesign will automatically download the following model checkpoints when needed. You don't need to download them manually; just wait for the automatic download to complete.

  • PXDesign diffusion checkpoint
  • Protenix checkpoints:
    • base
    • mini
    • mini_tmpl

Default checkpoint path: ./release_data/checkpoint, which can be overridden via --load_checkpoint_dir (defined in pxdesign.configs.configs_infer.py).


🚀 Quick Start

We will walk you through a complete design task in 3 steps. For more advanced features and configurations, we highly recommend following the Detailed Usage Guide section.

1. Prepare Input

Create a <task_name>.yaml file to specify the design task. You can copy the template below or use the demo file ./examples/PDL1_quick_start.yaml.

target:
  file: "./examples/5o45.cif"  # Path to target structure
  chains:
    A:
      crop: ["1-116"]          # Region to keep
      hotspots: [40, 99, 107]  # Interface residues
      msa: "./examples/msa/PDL1/0" # Path to pre-computed MSA (Recommended)

binder_length: 80 # Binder length (residues). Also supports a range string like "80-100".

To customize the input YAML file, we highly recommend following the Preparing Customized Inputs section.

2. Running PXDesign with One Command Line

⚠️ First Run Notice: The initial run involves model downloading and kernel compilation. Please expect a one-time delay; subsequent runs will be faster.

pxdesign pipeline \
  --preset extended \
  -i <YAML_FILE> \
  -o <OUT_DIR> \
  --N_sample <NUM_SAMPLES> \
  --dtype bf16 \
  --use_fast_ln True \
  --use_deepspeed_evo_attention True

Example:

pxdesign pipeline \
  --preset extended \
  -i ./examples/PDL1_quick_start.yaml \
  -o ./examples/test_run \
  --N_sample 10 \
  --dtype bf16 \
  --use_fast_ln True \
  --use_deepspeed_evo_attention True

For runs on modern GPUs (e.g., A100/H100), we recommend the BF16 precision and kernel optimizations by setting --dtype bf16 --use_fast_ln True --use_deepspeed_evo_attention True.
If you are running on older GPUs (e.g., V100), you may set --dtype fp32 --use_deepspeed_evo_attention False.

3. Check Results

Key results are aggregated in the <out_dir>/results/<task_name>/ folder (or results_v2/, results_v3/, ... if you finalize again in the same <out_dir>). Go to this folder and open summary.csv to see your ranked binders. PXDesign runs are resumable: re-run the same command with the same <out_dir> and it will only do missing work (and skip already-completed stages when possible).
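
Once the run finishes, you can also take a quick programmatic look at the ranked table. A minimal sketch, assuming pandas is available (it is not bundled with PXDesign) and using the same placeholder paths as above:

import pandas as pd

df = pd.read_csv("<out_dir>/results/<task_name>/summary.csv")
print(f"{len(df)} designs scored")
print(df.head())  # rows are the ranked binders with their scores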


📖 Detailed Usage Guide

This section covers the details of Input Preparation, Running Modes, and Result Interpretation.

1. Preparing Customized Inputs

Input Configuration (YAML)

Configuration is defined in a simple YAML file. Below is a complete example with explanations for each field.

# ---------------- Basic Settings ----------------
binder_length: 100        # Binder length (residues)
# binder_length: "80-120" # OR a range: model samples ONE length per run from [min, max]

# ---------------- Target Settings ----------------
target:
  file: <your_target>.cif   # Path to structure (CIF or PDB)
  chains:
    A:  # Settings for Chain A
      
      # 1. CROP: Define regions to keep (Standard Residue Indexing)
      # - Keep full chain: Remove this field
      # - Continuous: ["1-100"]
      # - Discontinuous: ["1-50", "80-100"]
      crop: ["1-100"] 

      # 2. HOTSPOTS: Guide diffusion to specific interface residues
      hotspots: [10, 11, 45, 46]

      # 3. MSA: Required for 'Extended' mode (Protenix evaluation)
      # - Option A: Path to pre-computed .a3m directory (Recommended)
      # - Option B: Remove field to auto-search online (Slower)
      msa: <your_chainA_msa_dir>/
    B:  # Settings for Chain B
      crop: ["1-50", "80-100"]
      hotspots: [46]
      msa: <your_chainB_msa_dir>/
    C: "all" # Include the full chain C

✅ Validating Your Input YAML

We provide two tools to ensure your input is correct. We recommend running both before starting expensive jobs.

1. Syntax Check (Fast)

Check for missing fields or YAML format errors.

pxdesign check-input --yaml <YAML_FILE>

2. Visual Verification (Recommended for Crop and Hotspots)

To confirm your crop and hotspots point to the correct residues, generate a debug structure using:

pxdesign parse-target --yaml <YAML_FILE> -o <debug_dir>

This will create <debug_dir>/<task_name>_parsed_target.cif and <debug_dir>/<task_name>_parsed_target.pml files.

  1. Open the *_parsed_target.cif file in a molecular viewer such as PyMOL or Mol* (Molstar) and verify the index alignment: select the residue indices you defined in your YAML (e.g., hotspots: [40] or crop: ["1-50"]) and check that they match the expected residues.

(Figure: PyMOL visualization of the parsed target.)

  2. Alternatively, download the entire <debug_dir> locally and open the accompanying <task_name>_parsed_target.pml script in PyMOL for guided inspection:
  • Cropped regions are colored blue
  • Hotspot residues are shown as pink sticks
  • All other residues are colored grey

About Structure File Format

PXDesign supports both mmCIF (.cif) and PDB (.pdb) formats for target.file.

If a .pdb file is provided,

  • PXDesign will convert it to mmCIF before processing.
  • During this conversion, PXDesign performs basic sanity checks, and chain IDs and residue IDs may be reassigned.
  • When specifying crop or hotspots in the input YAML, you may continue to use PDB-style residue numbering (auth_seq_id). PXDesign will automatically map these indices to the canonical mmCIF residue index.

Strongly recommended: Provide mmCIF (.cif) files directly to avoid unintended chain ID / residue ID changes during conversion.


About Residue Indexing

PXDesign uses the canonical mmCIF residue index (label_seq_id), which is 1-based and strictly sequential.

✅ Correct index: label_seq_id. Used internally by PXDesign.
❌ Avoid: auth_seq_id. May contain gaps or insertion codes (e.g., 27A).

How to verify the correct index?

  • Option 1: Built-in Visual Verification (Recommended). Use the built-in visual tool (pxdesign parse-target) to confirm that your crop and hotspots point to the correct residues (see Visual Verification above).
  • Option 2: Molstar Viewer. Open your .cif file in the Molstar Viewer. Hover over your target residue and look at the status bar in the bottom right. Ensure you use the number labeled Sequence ID (which corresponds to label_seq_id), NOT the one labeled Auth ID.
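
If you prefer to verify the indexing programmatically, the sketch below uses the third-party gemmi library (pip install gemmi; not part of PXDesign) to print label_seq_id next to auth_seq_id for one chain of the quick-start target:

import gemmi

st = gemmi.read_structure("./examples/5o45.cif")  # quick-start target
chain = st[0]["A"]                                # model 0, chain A
for res in chain:
    # label_seq may be None for files that lack label_seq_id (e.g., converted PDBs)
    print(res.name, "label_seq_id:", res.label_seq, "auth_seq_id:", res.seqid.num)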

About Target MSA

When running the full pipeline in Extended mode, PXDesign will:

  • Require MSA for each target chain specified in the YAML configuration.
  • Search for MSAs automatically if not provided.

We strongly recommend pre-computing MSAs for each target chain. PXDesign provides a convenience command to automatically populate target-chain MSA paths in your YAML configuration:

pxdesign prepare-msa --yaml <input.yaml>

By default, this command generates MSAs via the ColabFold MMseqs2 server (equivalent to --msa-mode mmseqs). This writes per-chain MSA directories (each containing pairing.a3m and non_pairing.a3m) and injects them into your YAML under target.chains[*].msa.

To use the Protenix search pipeline instead, run:

pxdesign prepare-msa --yaml <input.yaml> --msa-mode protenix

This command will:

  • Parse the target structure (PDB or CIF) specified in target.file
  • Identify the target chains defined under target.chains
  • Populate MSAs from Protenix search (--msa-mode protenix) or generate them via ColabFold (--msa-mode mmseqs, default)
  • Inject the corresponding MSA directories into:
target:
  chains:
    <CHAIN_ID>:
      msa: <path_to_precomputed_msa>
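
After preparation, you can quickly confirm that each chain points at a usable MSA directory. Below is a minimal sketch in Python (PyYAML assumed, not part of PXDesign) that checks for the pairing.a3m and non_pairing.a3m files described above; the <input.yaml> path is a placeholder.

from pathlib import Path
import yaml

cfg = yaml.safe_load(open("<input.yaml>"))
for chain_id, chain_cfg in cfg["target"]["chains"].items():
    if not isinstance(chain_cfg, dict) or "msa" not in chain_cfg:
        print(f"chain {chain_id}: no pre-computed MSA (will be searched automatically)")
        continue
    msa_dir = Path(chain_cfg["msa"])
    missing = [f for f in ("pairing.a3m", "non_pairing.a3m") if not (msa_dir / f).exists()]
    print(f"chain {chain_id}: {msa_dir}", "OK" if not missing else f"missing {missing}")
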
Why Does PXDesign Use MSA?

MSA provides evolutionary constraints that are critical for reliable structure prediction and confidence estimation.

In PXDesign, MSAs are not required for the diffusion-based generation stage itself, but they play an essential role during the filtering stage in Extended mode:

  • Protenix relies heavily on target MSAs to correctly fold the target protein and to assess the quality of the designed binder–target complex.
  • As a result, the confidence metrics used for ranking in Extended mode (e.g., ipTM, pAE) are strongly dependent on the availability and quality of the target MSA.
  • Without a target MSA, these confidence scores become significantly less reliable, which directly impacts ranking quality.
Why must the MSA correspond to the full-length sequence?

Even if you crop the target structure for design purposes, the MSA must always be generated on the full-length target sequence. This is because:

  • PXDesign uses the canonical mmCIF label_seq_id, which is defined with respect to the full-length sequence.
  • Cropping only affects which residues are kept during design, but does not redefine residue indices.

2. Running PXDesign

⚡ Quick Start: Recommended Configuration

For most production runs on modern GPUs (e.g., A100/H100), we recommend the Full Pipeline Extended Mode with BF16 precision and kernel optimizations.

pxdesign pipeline --preset extended \
 -i <YAML_FILE> \
 -o <out_dir> \
 --N_sample <N_samples> \
 --dtype bf16 \
 --use_fast_ln True \
 --use_deepspeed_evo_attention True

Running PXDesign efficiently involves making three key decisions:

Step 1: Select Running Mode

Choose the mode that best fits your needs:

Option 1: Full Pipeline (Extended Mode)

Recommended for:
🔹 Running the full pipeline with full evaluation (AF2 + Protenix).
🔹 Collecting high-quality candidates for wet-lab validation.
🔹 Matching the pipeline used in our Technical Report.

Command:
pxdesign pipeline --preset extended -i <YAML_FILE> -o <out_dir> --N_sample <num_samples> --dtype bf16 --use_fast_ln True --use_deepspeed_evo_attention True

Option 2: Full Pipeline (Preview Mode)

Recommended for:
🔹 Getting a quick preview of PXDesign results.
🔹 Estimating the difficulty level of your design task.
🔹 Trying different crop or hotspot settings to see their effect and adjusting them accordingly.

Command:
pxdesign pipeline --preset preview -i <YAML_FILE> -o <out_dir> --N_sample <num_samples>

Option 3: Generation Only

Recommended for:
🔹 Cases where you only need the raw backbone structures (this mode does not provide confidence metrics or ranking).

Command:
pxdesign infer -i <YAML_FILE> -o <out_dir> --N_sample <num_samples>

Step 2: Determine Sample Size (--N_sample)

The number of samples depends on your goal and the difficulty of the design task.

  • 🐞 Debugging: Run a small batch (e.g., --N_sample 10) to verify your input files and configuration.

  • 🚀 Production: We recommend generating 10,000+ designs and aiming for 10-100 designs that pass both the Protenix and AF2-IG filters. The harder the target, the more samples you may need to find a high-confidence binder.

Runtime Estimation

(Figure: runtime per design vs. protein length.)

The plot illustrates how the expected runtime on an NVIDIA L20 GPU (seconds per design) varies with protein length across the Extended, Preview, and Infer modes. This information can be used to estimate the total duration of your job.

Note: Actual runtime may vary based on GPU model and system load.
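
As a back-of-the-envelope calculation, multiply the per-design time you read off the plot by your --N_sample. The sketch below uses a made-up per-design time purely for illustration:

seconds_per_design = 120   # hypothetical value; read the real one off the plot for your mode and lengths
n_sample = 10_000
n_gpus = 1
gpu_hours = n_sample * seconds_per_design / 3600
print(f"~{gpu_hours:.0f} GPU-hours, i.e. ~{gpu_hours / n_gpus / 24:.1f} days on {n_gpus} GPU(s)")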

Step 3: Optimize Performance

PXDesign exposes several runtime-level knobs that control numerical precision and kernel implementations. These options primarily affect performance and memory usage.

  • --dtype (default: bf16): Controls numerical precision.
    🔹 bf16 is recommended on modern GPUs (e.g., A100/H100) for faster speed and lower memory.
    🔹 fp32 may be preferred for debugging or on older hardware.
  • --use_fast_ln (default: True): Controls whether to use optimized LayerNorm kernels.
    🔹 Generally recommended to keep enabled.
  • --use_deepspeed_evo_attention (default: False): Enables DeepSpeed Evo attention kernels (Protenix only).
    🔹 This kernel is only used by the Protenix filter.
    🔹 NVIDIA CUTLASS (v3.5.1) is required and is expected at ${CUTLASS_PATH:-$HOME/cutlass}.

Step 4: Monitor long runs (progress + heartbeat)

PXDesign often generates many designs inside a single diffusion forward pass (e.g., --N_sample 10000), so stdout can be sparse for long stretches.

To make long-running jobs easier to monitor, PXDesign writes:

  • Periodic diffusion progress logs (time-throttled; defaults to ~30s)
  • A heartbeat JSON at <out_dir>/status.json (rank-0 aggregate) and <out_dir>/status_rank{rank}.json (per-rank)

Useful environment variables:

  • PXDESIGN_HEARTBEAT_INTERVAL: heartbeat write interval in seconds (default: 15)
  • PXDESIGN_PROGRESS_INTERVAL: progress log interval in seconds (default: 30)
  • PXDESIGN_STEP_HEARTBEAT_INTERVAL: liveness heartbeat interval during diffusion steps in seconds (default: 30)
  • To disable heartbeat writing: set PXDESIGN_STATUS_DIR="" before running

CLI flags (recommended):

  • --pxdesign_progress_interval <sec>: sets PXDESIGN_PROGRESS_INTERVAL (default: 30, 0 disables)
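
For a simple live view of the heartbeat, you can poll status.json. Below is a minimal Python sketch; the exact JSON schema is not documented here, so it just pretty-prints whatever the file currently contains:

import json
import time
from pathlib import Path

status_path = Path("<out_dir>") / "status.json"

while True:
    if status_path.exists():
        print(json.dumps(json.loads(status_path.read_text()), indent=2))
    else:
        print("status.json not written yet ...")
    time.sleep(30)  # roughly matches the default heartbeat/progress intervals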

Step 5 (Optional): Streaming dump + resume for ultra-long jobs

For very long runs, it can be useful to dump designs incrementally and support resume after interruption. PXDesign can stream diffusion outputs incrementally to disk (atomic writes):

  • Streaming dump is enabled by default in the pxdesign CLI.
    • Disable with --no_stream_dump
    • If running python -m pxdesign.runner.inference directly, set PXDESIGN_STREAM_DUMP=1 to enable it.
  • Diffusion CIFs are written under:
    • <out_dir>/runs/run_XXX/diffusion/structures/<task_name>/<task_name>_sample_000123.cif
  • On restart (same <out_dir>), PXDesign scans existing *_sample_*.cif files and only generates missing indices.
    • This is world-size agnostic: you can resume with a different number of GPUs (world_size) and PXDesign will fill missing design indices.

Notes:

  • Streaming dump can slightly reduce throughput (it writes CIFs during inference), but it prevents catastrophic “all progress lost” failures.
  • CIF writes are atomic (.tmp then rename) to avoid corrupted partial artifacts.
  • The v2 pipeline stores resume metadata in <out_dir>/pipeline_state.json (and caches a normalized input in <out_dir>/pipeline_input.json). Keep these files intact when resuming.
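
To check how far a streaming-dump run has progressed (for example, before resuming it), you can count the CIFs already written using the path pattern above. A minimal Python sketch with placeholder paths:

from pathlib import Path

out_dir = Path("<out_dir>")
task_name = "<task_name>"
done = sorted(out_dir.glob(f"runs/run_*/diffusion/structures/{task_name}/{task_name}_sample_*.cif"))
print(f"{len(done)} designs dumped so far")
if done:
    print("latest:", done[-1].name)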

3. Outputs & Results

Upon completion, all key results are aggregated in the results folder. If you run finalization again in the same <out_dir>, PXDesign will create a new snapshot folder (results_v2, results_v3, ...) rather than overwriting the previous one.

3.1 File Structure

The primary results are located in results/<task_name>/. Raw diffusion/eval artifacts are organized per-run under runs/run_XXX/.

<out_dir>/
│
├── config.yaml
├── pipeline_input.json
├── pipeline_state.json
├── status.json
├── status_rank0.json ... status_rankN.json
│
├── runs/
│   ├── run_000/
│   │   ├── diffusion/structures/<task_name>/<task_name>_sample_000000.cif
│   │   ├── eval/<task_name>/sample_level_output.csv
│   │   └── final/{final_state.json,run_all_summary.csv,run_filtered_summary.csv}
│   └── run_001/...
│
├── results/
│   ├── manifest.json
│   ├── analysis/
│   │   ├── overall/{all_summary.csv,filtered_summary.csv}
│   │   └── by_run/run_000/{run_all_summary.csv,run_filtered_summary.csv}
│   └── <task_name>/             <-- PRIMARY RESULTS FOLDER
│       ├── summary.csv                 <-- Start here: ranked binders with scores
│       ├── task_info.json              <-- Meta info for this results snapshot
│       ├── plots/server_xx_mode.png    <-- Diagnostic plot
│       └── structures/
│           ├── orig/                   <-- Designed backbones (from diffusion outputs)
│           ├── af2/                    <-- AF2-docked structures (if pass_af2)
│           └── ptx/                    <-- Protenix-docked structures (if pass_ptx)
└── results_v2/ ... (created if you finalize again)

Note: All output CIF files generated by PXDesign use deterministically re-assigned chain IDs. Condition (target) chains are re-labeled according to the order specified in target.chains (e.g., A0, B0, C0, ...), and the binder chain is always placed as the final chain in the output structure. Currently, only a single binder chain is supported.
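
To confirm the chain ordering of a given output, you can list its chains. A minimal sketch using the third-party gemmi library (not part of PXDesign); <design>.cif is a placeholder for any structure in the output folders:

import gemmi

st = gemmi.read_structure("<out_dir>/results/<task_name>/structures/orig/<design>.cif")
for chain in st[0]:
    # the last chain printed is the designed binder; the others are the re-labeled target chains
    print(chain.name, len(chain), "residues")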

If you want to re-run the final ranking/selection (generate a new results_vN/ snapshot) without re-running diffusion/eval, you can run: pxdesign rank --dump_dir <out_dir>

3.2 Understanding Metrics & Filters (summary.csv)

The summary.csv file contains validation scores from two independent structure prediction pipelines: AF2-IG and Protenix.

  • Key Status Columns: The most practical way to filter your designs is using the "Success" columns (i.e., AF2-IG-success, AF2-IG-easy-success, Protenix-success, Protenix-basic-success). These boolean indicators (True/False) tell you if a design passed a specific quality filter.
  • Headers starting with af2_: Derived from AF2-IG validation.
  • Headers starting with ptx_: Derived from Protenix validation.

Table 1: Thresholds for "Success" Filters. The table below specifies criteria for each filter.

Filter Name      Confidence Thresholds (Score Quality)      Structure Thresholds (Geometry)
AF2-IG-easy      ipAE < 10.85, ipTM > 0.5, pLDDT > 0.8      binder bound/unbound RMSD < 3.5 Å
AF2-IG           ipAE < 7.0, pLDDT > 0.9                    binder RMSD < 1.5 Å
Protenix-basic   binder ipTM > 0.8, binder pTM > 0.8        complex RMSD < 2.5 Å
Protenix         binder ipTM > 0.85, binder pTM > 0.88      complex RMSD < 2.5 Å

Note: The AF2-IG-easy filter uses the thresholds proposed by BindCraft. Other filters were established based on our internal benchmarking on the Cao et al. dataset. For detailed analysis and definitions of individual metrics (e.g., ipAE, ipTM), please refer to our Technical Report.
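
To apply these filters programmatically, you can combine the boolean "Success" columns directly. A minimal sketch with pandas (the filter column names are taken from the description above; double-check them against your actual summary.csv header):

import pandas as pd

df = pd.read_csv("<out_dir>/results/<task_name>/summary.csv")

strict = df[df["Protenix-success"] & df["AF2-IG-success"]]
relaxed = df[df["Protenix-basic-success"] & df["AF2-IG-success"]]

print(f"{len(strict)} designs pass both strict filters; {len(relaxed)} pass the relaxed combination")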

3.3 Interpreting Task Difficulty (server_xx_mode.png)

This diagnostic plot positions your current task against known benchmarks to estimate difficulty. (Example below: A dot on the left indicates a "Hard" target with a low passing rate; a dot on the right indicates an "Easy" target. Note that classifications of "Hard" and "Easy" depend on the specific filter applied.)

(Figure: task difficulty plot.)

Guidelines for interpretation:

  • If your job is on the "Hard" side: Consider adjusting input settings (e.g., revising hotspot location or binder length) to improve success rates. Alternatively, relax the filter criteria (e.g., use Protenix-basic instead of the strict Protenix filter).
  • If your job is on the "Easy" side: You can strictly rely on the high-confidence filters.
  • Our experience: For challenging targets in our validated experiments (e.g., VEGFA, TrkA, SC2RBD, and TNF-α), we utilized the Protenix-basic filter to preserve diversity. For all other targets, the strict Protenix filter was sufficient. We consistently utilized the strict AF2-IG filter over the AF2-IG-easy filter.

Pick Designs for Wet-Lab Validation

If your goal is to synthesize and test the designs experimentally, we recommend this 4-step workflow:

  • 1️⃣ Generation: Run Extended Mode (as multiple jobs) with different seeds and/or binder lengths. We recommend generating 10,000+ designs and aiming for 10-100 designs that pass both the Protenix and AF2-IG filters.

  • 2️⃣ Filtering: Select candidates based on the task difficulty observed.

    • General/Easy Targets: Prioritize designs that pass both strict filters (Protenix + AF2-IG).
    • If strict filters yield few results, relax the criteria.
  • 3️⃣ Clustering (Promote Diversity, Optional): Cluster your filtered candidates by structure (e.g., using Foldseek or TM-align) to group similar binding modes together.

  • 4️⃣ Final Ranking: Within each cluster, rank designs by Protenix ipTM and select the top representatives from each cluster for wet-lab testing.
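
As an illustration of steps 3-4, the sketch below picks the top-ranked design per structural cluster with pandas. The "cluster" column is assumed to come from an external clustering tool (e.g., Foldseek or TM-align) merged into your filtered table, and "ptx_iptm" is a hypothetical name for the Protenix ipTM column; adjust both to your data:

import pandas as pd

df = pd.read_csv("filtered_candidates.csv")       # hypothetical: designs that passed your filters
picks = (
    df.sort_values("ptx_iptm", ascending=False)   # hypothetical column name for Protenix ipTM
      .groupby("cluster", as_index=False)
      .head(1)                                    # keep the top-ranked design in each cluster
)
picks.to_csv("wetlab_picks.csv", index=False)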


Acknowledgements & Citations

We explicitly thank the members who contributed to the release of this repository: Jiaqi Guan, Jinyuan Sun, Zhaolong Li, and Xinshi Chen. We also deeply appreciate the contributions of the open-source community. This project stands on the shoulders of giants.

Codebase Acknowledgements:

This repository (specifically within PXDesignBench) uses the ColabDesign codebase in part of its filtering modules. We thank Dr. Sergey Ovchinnikov and contributors for their outstanding integration of ProteinMPNN and AF2-IG interfaces, which accelerated our development.

Methodological Foundations:

If you use this repository, please cite our preprint. Additionally, our pipeline heavily relies on Protenix for structure prediction and confidence estimation, and integrates AF2-IG and ProteinMPNN for filtering and sequence design. We strongly encourage citing these original papers to respect the methods used.

/* ================== PXDesign & Protenix ================== */
@article{ren2025pxdesign,
  title={PXDesign: Fast, Modular, and Accurate De Novo Design of Protein Binders},
  author={Ren, Milong and Sun, Jinyuan and Guan, Jiaqi and Liu, Cong and Gong, Chengyue and Wang, Yuzhe and Wang, Lan and Cai, Qixu and Chen, Xinshi and Xiao, Wenzhi},
  journal={bioRxiv},
  pages={2025--08},
  year={2025},
  publisher={Cold Spring Harbor Laboratory}
}

@article{bytedance2025protenix,
  title={Protenix - Advancing Structure Prediction Through a Comprehensive AlphaFold3 Reproduction},
  author={ByteDance AML AI4Science Team and Chen, Xinshi and Zhang, Yuxuan and Lu, Chan and Ma, Wenzhi and Guan, Jiaqi and Gong, Chengyue and Yang, Jincai and Zhang, Hanyu and Zhang, Ke and Wu, Shenghao and Zhou, Kuangqi and Yang, Yanping and Liu, Zhenyu and Wang, Lan and Shi, Bo and Shi, Shaochen and Xiao, Wenzhi},
  year={2025},
  journal={bioRxiv},
  publisher={Cold Spring Harbor Laboratory},
  doi={10.1101/2025.01.08.631967}
}

/* ================== ProteinMPNN ================== */
@article{dauparas2022robust,
  title={Robust deep learning--based protein sequence design using ProteinMPNN},
  author={Dauparas, Justas and Anishchenko, Ivan and Bennett, Nathaniel and Bai, Hua and Ragotte, Robert J and Milles, Lukas F and Wicky, Basile IM and Courbet, Alexis and de Haas, Rob J and Bethel, Neville and others},
  journal={Science},
  volume={378},
  number={6615},
  pages={49--56},
  year={2022},
  publisher={American Association for the Advancement of Science}
}
/* ================== AF2-IG ================== */
@article{bennett2023improving,
  title={Improving de novo protein binder design with deep learning},
  author={Bennett, Nathaniel R and Coventry, Brian and Goreshnik, Inna and Huang, Buwei and Allen, Aza and Vafeados, Dionne and Peng, Ying Po and Dauparas, Justas and Baek, Minkyung and Stewart, Lance and others},
  journal={Nature Communications},
  volume={14},
  number={1},
  pages={2625},
  year={2023},
  publisher={Nature Publishing Group UK London}
}

Contributing

We welcome contributions from the community to help improve PXDesign.

Please follow the Contributing Guide.


Code of Conduct

We are committed to fostering a welcoming and inclusive environment.
Please review our Code of Conduct for guidelines on how to participate respectfully.

Security

If you discover a potential security issue in this project, or think you may have discovered a security issue, please notify ByteDance Security via our security center or by email at sec@bytedance.com.

Please do not create a public GitHub issue.

License

This project is licensed under the Apache 2.0 License.
It is free for both academic research and commercial use.

Contact Us

We welcome inquiries and collaboration opportunities for advanced applications of our model, such as developing new features, fine-tuning for specific use cases, and more.

📧 Please contact us at: ai4s-bio@bytedance.com

Join Us

We're expanding the Protenix team at ByteDance Seed! We’re looking for talented individuals in machine learning and computational biology/chemistry (“Computational Biology/Chemistry” covers structural biology, computational biology, computational chemistry, drug discovery, and more). Opportunities are available in both Beijing and Seattle, across internships, new grad roles, and experienced full-time positions.

Outstanding applicants will be considered for ByteDance’s Top Seed Talent Program — with enhanced support.

📍 Beijing, China

  • Full-Time: Protein Design Scientist (Experienced)
  • Full-Time: Computational Biology / Chemistry (Experienced, New Grad)
  • Full-Time: Machine Learning (Experienced, New Grad)
  • Internship: Computational Biology / Chemistry
  • Internship: Machine Learning

📍 Seattle, US

  • Full-Time: Computational Biology / Chemistry (Experienced, New Grad)
  • Full-Time: Machine Learning (Experienced, New Grad)
  • Internship: Computational Biology / Chemistry
  • Internship: Machine Learning
