Hello, and thank you for sharing your work and providing the G2RL implementation in the POGEMA environment.
I tried increasing the number of training iterations (num_episodes) and the replay buffer size, but the results remain similar to those in the provided notebooks, and the success values collected via results['done'].append(scalars['done']) stay consistently low.
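For concreteness, this is roughly how I scaled things up (num_episodes comes from the notebook; the replay-buffer key name and the exact values below are just illustrative of what I tried, not the repo's actual config):

```python
# Illustrative sketch of the hyperparameters I varied.
# "num_episodes" is from the notebook; "replay_buffer_size" is my guess
# at the corresponding config key, and all values are examples.
baseline = {
    "num_episodes": 1_000,
    "replay_buffer_size": 50_000,
}

# I multiplied both settings by roughly these factors.
scaled_up = {
    "num_episodes": baseline["num_episodes"] * 10,
    "replay_buffer_size": baseline["replay_buffer_size"] * 4,
}

print(scaled_up)  # the configuration I retrained with
```

Even with these larger settings, the evaluation metrics were essentially unchanged from the notebook defaults.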

Are there specific parameters or settings that need to be adjusted to significantly improve the model's performance? Any guidance on where adjustments would be most effective would be greatly appreciated.