feat: Implement adaptive learning rate scheduling by Anroshka · Pull Request #9 · Anroshka/snake-ai

Anroshka · 2025-06-10T03:38:10Z

I've added an adaptive learning rate scheduler (ReduceLROnPlateau) to the QLearningAgent. This scheduler monitors the average score and reduces the learning rate if the score plateaus, potentially leading to improved training stability and performance.

Changes include:

Modified QLearningAgent in model.py to initialize and step the ReduceLROnPlateau scheduler.
Updated MultiAgentDQN in model.py to propagate the scheduler step to all individual agents.
Modified the training loop in train_multi.py to call the scheduler step at the end of each episode using the average score.
Updated save_model and load_model in QLearningAgent to save and load the scheduler's state, ensuring continuity in training and backward compatibility with older model files.

Description

Please include a summary of the changes and which issue is fixed. Please also include relevant motivation and context.

Fixes # (issue)

Type of change

Please delete options that are not relevant.

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
Documentation update

How Has This Been Tested?

Please describe the tests that you ran to verify your changes. Provide instructions so we can reproduce.

Test A
Test B

Checklist:

My code follows the style guidelines of this project
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
My changes generate no new warnings
Any dependent changes have been merged and published in downstream modules

I've added an adaptive learning rate scheduler (`ReduceLROnPlateau`) to the QLearningAgent. This scheduler monitors the average score and reduces the learning rate if the score plateaus, potentially leading to improved training stability and performance. Changes include: - Modified `QLearningAgent` in `model.py` to initialize and step the `ReduceLROnPlateau` scheduler. - Updated `MultiAgentDQN` in `model.py` to propagate the scheduler step to all individual agents. - Modified the training loop in `train_multi.py` to call the scheduler step at the end of each episode using the average score. - Updated `save_model` and `load_model` in `QLearningAgent` to save and load the scheduler's state, ensuring continuity in training and backward compatibility with older model files.

deepsource-io · 2025-06-10T03:38:35Z

Here's the code health analysis summary for commits 34c3480..07903b7. View details on DeepSource ↗.

Analysis Summary

Analyzer	Status	Summary	Link
Python	❌ Failure	❗ 4 occurences introduced	View Check ↗

💡 If you’re a repository administrator, you can configure the quality gates from the settings.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Implement adaptive learning rate scheduling#9

feat: Implement adaptive learning rate scheduling#9
Anroshka wants to merge 1 commit intomainfrom
feat/adaptive-lr

Anroshka commented Jun 10, 2025 •

edited

Loading

Uh oh!

deepsource-io bot commented Jun 10, 2025

Analysis Summary

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Anroshka commented Jun 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Type of change

How Has This Been Tested?

Checklist:

Uh oh!

deepsource-io bot commented Jun 10, 2025

Analysis Summary

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Anroshka commented Jun 10, 2025 •

edited

Loading