[VLM Multi-turn] delete fsdp #498
base: main
Summary of Changes
Hello @gxlvera, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!
This pull request streamlines the
Highlights
Code Review
This pull request successfully removes the FSDP training backend from the geo3k VLM multi-turn example, leaving Megatron-LM as the sole backend. The changes are straightforward and mainly involve deleting FSDP-related code and configurations. I've provided a few suggestions to further simplify the code by removing now-redundant logic and to address a minor style issue.
I am having trouble creating individual review comments, so my feedback is listed here.
examples/geo3k_vlm_multi_turn/README.md (33)
Following the suggestion to hardcode the TRAIN_BACKEND in run_geo3k_vlm_multi_turn.py, this environment variable declaration becomes obsolete for this example and can be removed to simplify the setup instructions.
examples/geo3k_vlm_multi_turn/run_geo3k_vlm_multi_turn.py (18-19)
Since FSDP support is being removed and only the 'megatron' backend is supported, you can simplify this by making TRAIN_BACKEND a constant. This eliminates the need to read from an environment variable and perform an assertion, making the code cleaner and more direct.
TRAIN_BACKEND = "megatron"
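To make the suggestion concrete, here is a minimal sketch of the two shapes being compared. The actual contents of lines 18-19 are not shown in this thread, so the "before" function, the environment variable name, and the `SUPPORTED_BACKENDS` tuple are assumptions for illustration only.

```python
import os
from typing import Mapping

# Hypothetical: after this PR only one backend remains.
SUPPORTED_BACKENDS = ("megatron",)


def backend_from_env(env: Mapping[str, str] = os.environ) -> str:
    """Assumed 'before' shape: read the backend from the environment and validate it."""
    # The variable name "TRAIN_BACKEND" is an assumption, not taken from the diff.
    backend = env.get("TRAIN_BACKEND", "megatron").lower()
    assert backend in SUPPORTED_BACKENDS, f"unsupported backend: {backend!r}"
    return backend


# Suggested 'after' shape: a plain constant, with no lookup and no assertion.
TRAIN_BACKEND = "megatron"
```

With a single supported value, the lookup and the assertion can only ever produce "megatron" or fail, which is why a constant is equivalent for every valid configuration.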
examples/geo3k_vlm_multi_turn/run_geo3k_vlm_multi_turn.py (145-147)
With the removal of this else block, the if TRAIN_BACKEND == 'megatron': condition at line 135 is always true. Consider refactoring by removing the if statement and un-indenting its body to simplify the code.
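The refactor described here can be sketched as follows. The function names and the flag strings are hypothetical placeholders, not the script's real arguments; the point is only that removing an always-true condition leaves behavior unchanged.

```python
from typing import List

TRAIN_BACKEND = "megatron"


def build_args_before() -> List[str]:
    """Assumed current shape: a branch whose else block no longer exists."""
    args = ["--hypothetical-common-flag"]
    if TRAIN_BACKEND == "megatron":  # always true once FSDP is removed
        args.append("--hypothetical-megatron-flag")
    return args


def build_args_after() -> List[str]:
    """Suggested shape: the condition is dropped and its body is un-indented."""
    args = ["--hypothetical-common-flag"]
    args.append("--hypothetical-megatron-flag")
    return args
```

Both functions return the same list, so the change is purely a readability improvement.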
examples/geo3k_vlm_multi_turn/run_geo3k_vlm_multi_turn.py (162)
The newline character at the end of the file has been removed. According to PEP 8, files should end with a single newline. Please add it back for consistency.
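A quick way to check for this locally is sketched below. The helper names are made up for illustration, and the code assumes LF line endings; it is not part of the repository's tooling.

```python
def ends_with_single_newline(data: bytes) -> bool:
    """Return True if the content ends with exactly one LF newline."""
    return data.endswith(b"\n") and not data.endswith(b"\n\n")


def fix_trailing_newline(data: bytes) -> bytes:
    """Normalize the end of the content to exactly one LF newline."""
    if not data:
        return data  # leave empty files untouched
    return data.rstrip(b"\n") + b"\n"
```

For example, applying `fix_trailing_newline` to the bytes of `run_geo3k_vlm_multi_turn.py` and writing the result back would restore the final newline.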
    if __name__ == "__main__":
        prepare()
    -    execute()
    +    execute()
Why change this?
This PR deletes the FSDP-related content from the geo3k VLM multi-turn example and keeps only Megatron-LM as the training backend.