MMomentA

Fast, Transferable Multipole Moment-based Charge Assignment

A machine learning framework that learns multipole moment-based charge fitting (MPFIT) for polarizable force fields, combining quantum chemistry calculations (Psi4/GDMA) with graph neural network architectures.

Overview

MMomentA provides a complete pipeline for:

QM Charge Calculation (Phase 1): Clean, modular MPFIT/RESP/AM1-BCC computation
ML Data Preparation (Phase 2): Feature extraction and dataset creation
Model Training (Phase 3): Graph neural networks for charge prediction

Key Features

✅ Modular architecture separating QM, data processing, and ML
✅ Parallel processing with proper Psi4 process isolation
✅ Multipole moment integration for improved accuracy
✅ Scaffold-based splitting for transferability testing
✅ Comprehensive metrics and evaluation tools
✅ Simple, interpretable code with no try/except masking failures

Installation

cd MMomentA
pip install -e .

Dependencies

OpenFF Toolkit
OpenFF Recharge (Psi4, GDMA)
PyTorch
DGL (Deep Graph Library)
NumPy, Pandas, Joblib

Quick Start

1. Compute MPFIT Charges

from openff.toolkit.topology import Molecule
from MMomentA.qm import MPFITCalculator, GDMAConfig

molecule = Molecule.from_smiles("CCO")
molecule.generate_conformers(n_conformers=1)

calculator = MPFITCalculator(GDMAConfig(method="hf", basis="6-31G*"))
result = calculator.compute(molecule)

print(f"Charges: {result.charges}")
print(f"Multipole moments shape: {result.multipole_moments.shape}")

2. Prepare ML Dataset

python scripts/prepare_dataset.py \
    --input spice.oeb \
    --output spice_mpfit.h5 \
    --max-molecules 1000 \
    --method hf \
    --basis "6-31G*" \
    --n-jobs 8

3. Train Model

# Baseline model (no multipoles)
python scripts/train_spice.py \
    --dataset spice_mpfit.h5 \
    --output-dir runs/baseline \
    --feature-units 117 \
    --no-multipoles

# Model with multipole moments
python scripts/train_spice.py \
    --dataset spice_mpfit.h5 \
    --output-dir runs/multipoles \
    --feature-units 198 \
    --n-epochs 1000

Project Structure

MMomentA/
├── mmomenta/
│   ├── qm/              # QM calculators (MPFIT, RESP, AM1-BCC)
│   ├── data/            # Data processing and loaders
│   ├── comparison/      # Analysis and benchmarking
│   ├── models/          # Neural network architectures
│   ├── training/        # Training loops and evaluation
│   └── utils/           # Utilities (logging, I/O)
├── scripts/             # Command-line scripts
│   ├── compute_mpfit_charges.py
│   ├── compare_charge_methods.py
│   ├── prepare_dataset.py
│   ├── train_spice.py
│   └── train_zinc.py
├── examples/            # Usage examples
└── README_PHASE*.md     # Detailed documentation

Documentation

Detailed documentation for each phase:

Phase 1: MPFIT Data Generation - QM calculations and comparison framework
Phase 2: ML Data Preparation - Feature extraction and dataset creation
Phase 3: Model Training - Neural network training and evaluation

Examples

Batch Processing

from MMomentA.qm import MPFITCalculator, RESPCalculator
from MMomentA.data import batch_process_molecules

calculators = {
    "MPFIT": MPFITCalculator(),
    "RESP": RESPCalculator()
}

results = batch_process_molecules(
    molecules,
    calculators=calculators,
    n_jobs=4
)

Model Training

from MMomentA.data import load_dataset, create_split_datasets
from MMomentA.models import ChargeModel, ModelConfig
from MMomentA.training import Trainer, TrainingConfig

# Load dataset
molecule_data_list, metadata = load_dataset("spice_mpfit.h5")

# Create splits
split_data = create_split_datasets(
    molecule_data_list, train_idx, val_idx, test_idx,
    include_multipoles=True
)

# Train model
model = ChargeModel(ModelConfig(feature_units=198, depth=4, width=128))
trainer = Trainer(model, TrainingConfig(n_epochs=1000))

results = trainer.train(
    split_data.get_train_loader(128),
    split_data.get_val_loader(128),
    split_data.get_test_loader(128)
)

print(f"Test RMSE: {results['test_metrics']['val_rmse']:.5f}")

Design Principles

Following your requirements:

No try/except statements - code fails explicitly when behavior is incorrect
No hard-coded constants - all parameters configurable
Simple, interpretable - easy for users to understand
Modular architecture - clean separation of concerns
Comprehensive logging - structured logging instead of print statements

Citation

If you use MMomentA in your research, please cite:

@software{mmomenta2025,
  author = {Parmar, Shehan M.},
  title = {MMomentA: Multipole Moment-based Charge Assignment},
  year = {2025},
  url = {https://github.com/...}
}

References

OpenFF Recharge: https://github.com/openforcefield/openff-recharge
espaloma-charge: https://github.com/choderalab/espaloma_charge
DGL: https://www.dgl.ai/

Copyright

Acknowledgements

Project based on the Computational Molecular Science Python Cookiecutter version 1.11.

Name		Name	Last commit message	Last commit date
Latest commit History 123 Commits
.github		.github
MMomentA		MMomentA
cluster-independent		cluster-independent
devtools		devtools
docs		docs
examples		examples
figures		figures
perlmutter		perlmutter
phoenix		phoenix
scripts		scripts
.codecov.yml		.codecov.yml
.gitattributes		.gitattributes
.gitignore		.gitignore
.readthedocs.yaml		.readthedocs.yaml
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
README_PHASE1.md		README_PHASE1.md
README_PHASE2.md		README_PHASE2.md
README_PHASE3.md		README_PHASE3.md
pyproject.toml		pyproject.toml
setup.cfg		setup.cfg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

MMomentA

Overview

Key Features

Installation

Dependencies

Quick Start

1. Compute MPFIT Charges

2. Prepare ML Dataset

3. Train Model

Project Structure

Documentation

Examples

Batch Processing

Model Training

Design Principles

Citation

References

Copyright

Acknowledgements

About

Uh oh!

Releases

Packages

Languages

License

shehan807/MMomentA

Folders and files

Latest commit

History

Repository files navigation

MMomentA

Overview

Key Features

Installation

Dependencies

Quick Start

1. Compute MPFIT Charges

2. Prepare ML Dataset

3. Train Model

Project Structure

Documentation

Examples

Batch Processing

Model Training

Design Principles

Citation

References

Copyright

Acknowledgements

About

Resources

License

Code of conduct

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages