Regarding Reproducing Table 1 (N=3, R=60) Results from Your Paper (Pixart-alpha, MS-COC02017 FID-30K)

Thank you for sharing your excellent research and providing the codebase.
I am currently attempting to reproduce the results from your paper, but I have encountered some difficulties.

Specifically, I am trying to reproduce the MS-COCO2017 FID-30k result of 28.02 reported in Table 1 for N=3 and R=60.
I have tested various options using the [COCO_caption_prompts_30k.txt](https://github.com/Shenyi-Z/ToCa/blob/main/COCO_caption_prompts_30k.txt) file you shared, but the resulting FID scores are not as good as expected.

Could you kindly share the exact command or options used to run the PixArt-alpha-ToCa/scripts/inference_ddp.py script to achieve this result?

I appreciate your time and assistance, and I look forward to your guidance.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Regarding Reproducing Table 1 (N=3, R=60) Results from Your Paper (Pixart-alpha, MS-COC02017 FID-30K) #8

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Regarding Reproducing Table 1 (N=3, R=60) Results from Your Paper (Pixart-alpha, MS-COC02017 FID-30K) #8

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions