You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@@ -44,6 +48,16 @@ The model is an autoregressive transformer for the space group conditioned cryst
44
48
45
49
We only consider symmetry inequivalent atoms. The remaining atoms are restored based on the space group and Wyckoff letter information. Note that there is a natural alphabetical ordering for the Wyckoff letters, starting with 'a' for a position with the site-symmetry group of maximal order and ending with the highest letter for the general position. The sampling procedure starts from higher symmetry sites (with smaller multiplicities) and then goes on to lower symmetry ones (with larger multiplicities). Only for the cases where discrete Wyckoff letters can not fully determine the structure, one needs to further consider factional coordinates in the loss or sampling.
46
50
51
+
## Status
52
+
53
+
Major milestones are summarized below.
54
+
- v0.4.2 : Add implementation of direct preference optimization.
55
+
- v0.4.1 : Replace the absolute positional embedding with the Rotary Positional Embedding (RoPE).
We release the weights of the model trained on the MP-20 dataset. More details can be seen in the [model](./model/README.md) folder.
105
+
We release the weights of the model trained on the MP-20 dataset and Alex-20 dataset. More details can be seen in the [model](./model/README.md) folder.
92
106
93
107
## How to run
94
108
@@ -163,10 +177,55 @@ Note that the training, test, and generated datasets should contain the structur
163
177
164
178
More details about the post-processing can be seen in the [scripts](./scripts/README.md) folder.
-`restore_path`: the path to the pre-trained model weights
197
+
-`valid_path`: the path to the validation dataset
198
+
-`test_path`: the path to the test dataset. The space group distribution will be loaded from this dataset and used for the sampling in the reinforcement learning fine-tuning
199
+
-`reward`: the reward function to use, `ehull` means the energy above the convex hull
200
+
-`convex_path`: the path to the convex hull data, which is used to calculate the $E_{hull}$. Only used when the reward is `ehull`
201
+
-`mlff_model`: the machine learning force field model to predict the total energy. We support [`orb`](https://github.com/orbital-materials/orb-models) and [`MACE`](https://github.com/ACEsuit/mace) models for the $E_{hull}$ reward
202
+
-`mlff_path`: the path to load the checkpoint of the machine learning force field model
203
+
204
+
### Dielectric FoM Reward
205
+
206
+
```bash
207
+
train_ppo --folder ./data/\
208
+
--restore_path YOUR_PATH\
209
+
--valid_path YOUR_PATH/alex_20/val.csv\
210
+
--test_path YOUR_PATH/alex_20/train.csv\
211
+
--reward dielectric\
212
+
--mlff_model matgl\
213
+
--mlff_path YOUR_PATH/model1,YOUR_PATH/model2
214
+
```
215
+
216
+
-`folder`: the folder to save the model and logs
217
+
-`restore_path`: the path to the pre-trained model weights
218
+
-`valid_path`: the path to the validation dataset
219
+
-`test_path`: the path to the test dataset. The space group distribution will be loaded from this dataset and used for the sampling in the reinforcement learning fine-tuning
220
+
-`reward`: the reward function to use, `dielectric` means the dielectric figure of merit (FoM), which is the product of the total dielectric constant and the band gap
221
+
-`mlff_model`: the machine learning force field model to predict the total energy. We only support models in [`matgl`](https://github.com/materialsvirtuallab/matgl) for the dielectric reward
222
+
-`mlff_path`: the path to load the checkpoint of the machine learning force field model. Note that you need to provide the model paths for the total dielectric constant and band gap, separated by the `,`
223
+
224
+
166
225
## How to cite
167
226
168
227
```bibtex
169
-
@misc{cao2024space,
228
+
@article{cao2024space,
170
229
title={Space Group Informed Transformer for Crystalline Materials Generation},
171
230
author={Zhendong Cao and Xiaoshan Luo and Jian Lv and Lei Wang},
172
231
year={2024},
@@ -176,4 +235,16 @@ More details about the post-processing can be seen in the [scripts](./scripts/RE
176
235
}
177
236
```
178
237
238
+
```bibtex
239
+
@article{cao2025crystalformerrl,
240
+
title={CrystalFormer-RL: Reinforcement Fine-Tuning for Materials Design},
241
+
author={Zhendong Cao and Lei Wang},
242
+
year={2025},
243
+
eprint={2504.02367},
244
+
archivePrefix={arXiv},
245
+
primaryClass={cond-mat.mtrl-sci},
246
+
url={https://arxiv.org/abs/2504.02367},
247
+
}
248
+
```
249
+
179
250
**Note**: This project is unrelated to https://github.com/omron-sinicx/crystalformer with the same name.
0 commit comments