Hi maintainers,
Thanks for the good work. In the paper, you wrote "previous SOTA method DM [63] needs 28,000 GPU hours to distill ImageNet-1K with 60% data processing." May I wonder how did you get this 28,000 hours (approx. 1166.7 days) result?
Thanks!