First NN implementation attempt #2
Open
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Summary of the experiments with this implementation:
The problem is that the model does not learn - loss on the validation data remains high even with crazy number of epochs (like +20 000 which is abnormally high number and takes a long time to train). I tried with different learning rates and schedulers to somehow generalize to new data, but no results.
Also tried with different model complexities (1-5 hidden layers with varying neuron amounts) but the results did not improve. The story remains the same: training loss gets small but the validation does not.
The best outcome I achieved was 3 layers with 64 neurons passing through a few thousand epochs after which the validation stopped converging and overfitting became a problem (and early stopping was applied). The learning progression is attached as an image. The evaluation fails to beat the random pivot picker: