Difference between train and test

Hi Anton,

I am not clear about what we are using as input for value network in train and test phases.

In train phase we are using both public beliefs as inputs. For example in poker we use ranges for each agent. This ranges are vectors with mostly non-zero numbers in most of cases.
But in test phase we know our exact infostate. And for example in poker our range contains all zeros except one hand with 1. At the same time our opponent's range is still a vector with mostly non-zero numbers.

And my question is:
Is it ok to train with input which filled with non-zeros, but test with input with half of zeros (our range)?

Or maybe we should sample hole cards for each train iteration and therefore use as input hero range as all zeros except one and opponent range as full distribution between all possible hands?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Difference between train and test #39

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Difference between train and test #39

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions