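Hello, I noticed that the MCC part of the code does not implement the Straight-Through Estimator (STE) described in the paper; instead, the loss is computed directly with softmax: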

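import torch
import torch.nn as nn

# recalls_and_ndcgs_for_ks is assumed to be defined elsewhere in the repository.

class Reconstruct:
    def __init__(self):
        self.ce = nn.CrossEntropyLoss(label_smoothing=0.2)

    def compute(self, token_prediction_prob, tokens):
        # Count positions where the top-1 prediction equals the target token.
        hits = torch.sum(torch.argmax(token_prediction_prob, dim=-1) == tokens)
        # Flatten to (num_positions, vocab_size) for the metric and the loss.
        flat_pred = token_prediction_prob.view(-1, token_prediction_prob.shape[-1])
        NDCG10 = recalls_and_ndcgs_for_ks(flat_pred, tokens.reshape(-1, 1), 10)
        # nn.CrossEntropyLoss applies log-softmax internally; no STE is involved here.
        reconstruct_loss = self.ce(flat_pred, tokens.view(-1))
        return reconstruct_loss, hits, NDCG10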
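
For comparison, here is a minimal sketch of what a generic Straight-Through Estimator usually looks like in PyTorch. It is not taken from the paper or from this repository: the function name `straight_through_one_hot`, the tensor shapes, and the toy `weights` are all hypothetical, and the paper's exact formulation may differ. The forward pass emits a hard one-hot vector of the argmax, while the backward pass uses the gradient of the softmax.

```python
import torch
import torch.nn.functional as F


def straight_through_one_hot(logits: torch.Tensor) -> torch.Tensor:
    """Hypothetical helper: hard one-hot forward, softmax gradient backward."""
    soft = F.softmax(logits, dim=-1)
    index = soft.argmax(dim=-1, keepdim=True)
    hard = torch.zeros_like(soft).scatter_(-1, index, 1.0)
    # (soft - soft.detach()) is exactly zero in value, so the output equals
    # `hard`, but autograd still sees the dependence on `soft`.
    return hard + (soft - soft.detach())


# Toy usage with assumed shapes: (batch, sequence, vocabulary).
torch.manual_seed(0)
logits = torch.randn(2, 4, 6, requires_grad=True)
one_hot = straight_through_one_hot(logits)

# Any downstream loss that consumes the discrete tokens; random weights here.
weights = torch.randn(6)
loss = (one_hot * weights).sum()
loss.backward()  # gradients reach `logits` through the softmax path
```

The `compute` method above never produces such a hard sample: it feeds the predictions directly into `nn.CrossEntropyLoss`, which is the discrepancy this issue is asking about.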

Issue title: The MCC part of the code does not implement the Straight-Through Estimator (STE) described in the paper; the loss is computed directly with softmax #7