Training problem #13

@charlesXu86

Hello, during training I run into a problem in the get_action method:
actions, visits = zip(*actions_visits)
print(actions, visits)
probs = softmax(1.0 / temperature * np.log(visits)) # + 1e-10
At this step, visits contains many zeros, which causes an error when np.log is computed. How should this be resolved?
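
One possible workaround, hinted at by the commented-out `# + 1e-10` in the line above, is to add a small constant to the visit counts before taking the logarithm. NumPy's `np.log(0)` returns `-inf` and emits a RuntimeWarning, and if every count is zero the softmax can then produce NaN. The sketch below is only an illustration under assumptions: the `softmax` helper, the function name `visit_probs`, and the `eps` parameter are hypothetical and may not match the project's own code.

```python
import numpy as np

def softmax(x):
    # Hypothetical helper: subtract the max before exponentiating
    # so that large values do not overflow.
    x = np.asarray(x, dtype=np.float64)
    e = np.exp(x - np.max(x))
    return e / e.sum()

def visit_probs(visits, temperature=1.0, eps=1e-10):
    # Assumption: eps is a small constant that keeps np.log finite
    # for moves with zero visits; such moves get a near-zero probability.
    visits = np.asarray(visits, dtype=np.float64)
    return softmax(1.0 / temperature * np.log(visits + eps))

# Example with zero visit counts for two of the four moves.
probs = visit_probs([0, 3, 0, 7], temperature=1.0)
print(probs)
```

Since softmax(log(v) / T) is proportional to v^(1/T), moves with zero visits end up with a probability close to zero while the remaining probabilities stay proportional to the visit counts, which is probably the intended behavior here.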
