MolAct: An Agentic RL Framework for Molecular Editing and Property Optimization(Work in progress)

MolAct is an Agentic RL framework that trains LLMs to design molecules through a multi-turn "Think-Tool-Observation" loop. By leveraging GRPO and a two-stage training paradigm—mastering basic editing before tackling complex property optimization—MolAct learns to autonomously invoke chemical tools to ensure every modification is both physically valid and property-optimized.

📰 Latest News

2025-12-24: 📄 Our paper is now available on arXiv!
2025-12-23: 🎉 We released the inference code, training datasets, and pre-trained models for MolAct! You can now run inference with our pre-trained models on molecular editing and optimization tasks.

🚀 Quick Start

Installation

Step 1: Clone the Repository

git clone https://github.com/little1d/MolAct.git
cd MolAct

Step 2: Install AgentFly Framework

Please refer to the official AgentFly installation guide for setup instructions.

Step 3: Install ChemCoTBench Evaluation Dependencies

For running evaluations on ChemCoTBench, install additional dependencies:

pip install python-Levenshtein nltk rouge-score selfies scikit-learn

Download NLTK data (required for evaluation metrics):

python -c "import nltk; nltk.download('wordnet', quiet=True); nltk.download('punkt', quiet=True)"

Inference

We have open-sourced our training datasets and pre-trained models on HuggingFace. Please download the models before running inference.

Molecular Editing (MolEdit)

Run inference on molecular editing tasks using the provided script:

bash scripts/1\ run_mol_edit_inference.sh \
    [MODEL_DIR] [BENCH_DIR] [OUT_DIR]

Recommended approach: Instead of passing parameters via command line, you can directly edit the script file scripts/1 run_mol_edit_inference.sh to modify the default paths (MODEL_DIR, BENCH_DIR, OUT_DIR) and other parameters (MAX_NEW_TOKENS, TEMP, TOP_P, BACKEND). This is more convenient and recommended.

The script will process all subtasks (add, delete, sub) automatically and generate JSON result files in the outputs/ directory. Each result file contains the model's reasoning trajectory (thought process, tool calls, and observations) along with the final predictions, which can be used for subsequent evaluation.

Molecular Optimization (MolOpt)

Run inference on molecular optimization tasks using the provided script:

bash scripts/3\ run_mol_opt_inference.sh \
    [MODEL_DIR] [BENCH_DIR] [OUT_DIR]

Recommended approach: Instead of passing parameters via command line, you can directly edit the script file scripts/3 run_mol_opt_inference.sh to modify the default paths (MODEL_DIR, BENCH_DIR, OUT_DIR) and other parameters (MAX_NEW_TOKENS, TEMP, TOP_P). This is more convenient and recommended.

The script will process all subtasks (logp, drd, jnk, gsk, qed, solubility) automatically and generate JSON result files in the outputs/ directory. Each result file contains the model's reasoning trajectory (thought process, tool calls, and observations) along with the final predictions, which can be used for subsequent evaluation.

Note: When running inference scripts from the root directory, the TDC library will automatically download oracle files to the oracle/ directory.

Evaluation

Evaluate results on ChemCoTBench using the provided scripts. The evaluation uses the ChemCoTBench evaluation framework.

Recommended approach: Instead of passing parameters via command line, directly edit the script files to modify the default paths:

scripts/2 eval_mol_edit_chemcotbench.sh for molecular editing tasks
scripts/4 eval_mol_opt_chemcotbench.sh for molecular optimization tasks

The scripts accept three parameters:

BENCH_DIR: Path to ChemCoTBench baseline_and_eval directory (contains evaluation scripts)
PRED_DIR: Directory containing prediction JSON files generated from the inference step above
OUT_DIR: Output directory for evaluation results

# For molecular editing tasks
bash scripts/2\ eval_mol_edit_chemcotbench.sh \
    [BENCH_DIR] [PRED_DIR] [OUT_DIR]

# For molecular optimization tasks
bash scripts/4\ eval_mol_opt_chemcotbench.sh \
    [BENCH_DIR] [PRED_DIR] [OUT_DIR]

The scripts will automatically evaluate all subtasks (add/delete/sub for editing, logp/drd/jnk/gsk/qed/solubility for optimization) and save results in the output directory.

Example results: We provide example inference and evaluation results in the outputs/ directory for reference. You can check these files to understand the expected output format.

🛠️ Tools

The agents use a comprehensive set of chemistry tools for molecular manipulation and property calculation:

Molecular Validation: Validate SMILES strings
Property Calculation: Calculate molecular properties (logP, QED, solubility, etc.)
Molecular Editing: Functional group transformations (addition, deletion, substitution) implemented via SMARTS-based pattern matching and modification
Scaffold Analysis: Murcko scaffold extraction and similarity metrics
Oracle Scoring: Protein activation scoring for optimization tasks (DRD-2, JNK-3, GSK-3β)

📊 Benchmark

We evaluate on ChemCoTBench for both molecular editing and optimization tasks. ChemCoTBench is a comprehensive benchmark for step-wise reasoning on complex chemical problems.

Supported Tasks

Molecular Editing: Add, delete, and substitute functional groups
Molecular Optimization: Optimize for physicochemical properties (QED, LogP, Solubility) and protein activation (DRD-2, JNK-3, GSK-3β)

Results

🎬 Demo

The following shows an example of model training on molecular editing task using Qwen-2.5-7B as the base model on 4 H200 hardware.

demo.mp4

📝 Citation

If you use MolAct in your research, please cite:

@article{molact2025,
  title={MolAct: An Agentic RL Framework for Molecular Editing and Property Optimization},
  author={Zhuo Yang and Yeyun Chen and Jiaqing Xie and Ben Gao and Shuaike Shen and Wanhao Liu and Liujia Yang and Beilun Wang and Tianfan Fu and Yuqiang Li},
  year={2025},
  eprint={2512.20135},
  archivePrefix={arXiv},
  primaryClass={cs.AI},
  url={https://arxiv.org/abs/2512.20135}
}

🙏 Acknowledgments

We would like to thank the following projects and frameworks that made this work possible:

SwanLab for experiment tracking and visualization
ChemCoTBench for providing comprehensive benchmarks for molecular editing and optimization tasks
AgentFly for the agentic RL framework infrastructure

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
ChemCoTBench		ChemCoTBench
ChemCoTBench_benchmark		ChemCoTBench_benchmark
agentfly		agentfly
assets		assets
oracle		oracle
outputs		outputs
scripts		scripts
.gitignore		.gitignore
README.md		README.md
agentfly_install.sh		agentfly_install.sh
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MolAct: An Agentic RL Framework for Molecular Editing and Property Optimization(Work in progress)

📰 Latest News

🚀 Quick Start

Installation

Step 1: Clone the Repository

Step 2: Install AgentFly Framework

Step 3: Install ChemCoTBench Evaluation Dependencies

Inference

Molecular Editing (MolEdit)

Molecular Optimization (MolOpt)

Evaluation

🛠️ Tools

📊 Benchmark

Supported Tasks

Results

🎬 Demo

📝 Citation

🙏 Acknowledgments

About

Uh oh!

Releases

Packages

Languages

little1d/MolAct

Folders and files

Latest commit

History

Repository files navigation

MolAct: An Agentic RL Framework for Molecular Editing and Property Optimization(Work in progress)

📰 Latest News

🚀 Quick Start

Installation

Step 1: Clone the Repository

Step 2: Install AgentFly Framework

Step 3: Install ChemCoTBench Evaluation Dependencies

Inference

Molecular Editing (MolEdit)

Molecular Optimization (MolOpt)

Evaluation

🛠️ Tools

📊 Benchmark

Supported Tasks

Results

🎬 Demo

📝 Citation

🙏 Acknowledgments

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages