Skip to content

FAD的三通道输入和LFS的灰度图输入 #20

@leeguandong

Description

@leeguandong

我比较疑惑的是,在FAD中的输入是3通道图做DCT,
x_freq = self._DCT_all @ x @ self._DCT_all_T # [N, 3, 299, 299]

但是在LFS时,却要先转成灰度图:
x_gray = 0.299x[:,0,:,:] + 0.587x[:,1,:,:] + 0.114*x[:,2,:,:]
x = x_gray.unsqueeze(1)

x = (x + 1.) * 122.5

文章中,两块应该都是RGB输入,请问为什么LFS时要灰度图输入啊??

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions