Thank you for sharing your excellent research and providing the codebase.
I am currently attempting to reproduce the results from your paper, but I have encountered some difficulties.
Specifically, I am trying to reproduce the MS-COCO2017 FID-30k result of 28.02 reported in Table 1 for N=3 and R=60.
I have tested various options using the COCO_caption_prompts_30k.txt file you shared, but the resulting FID scores are not as good as expected.
Could you kindly share the exact command or options used to run the PixArt-alpha-ToCa/scripts/inference_ddp.py script to achieve this result?
I appreciate your time and assistance, and I look forward to your guidance.