feat: deprecate Qwen1.5 and gemma; and replace them with Qwen2.5 and gemma2#23
feat: deprecate Qwen1.5 and gemma; and replace them with Qwen2.5 and gemma2#23vatsalkshah wants to merge 1 commit intoFLock-io:mainfrom
Conversation
WalkthroughThe pull request includes updates to the Changes
Possibly related PRs
Poem
📜 Recent review detailsConfiguration used: CodeRabbit UI 📒 Files selected for processing (3)
🔥 Files not summarized due to errors (1)
🧰 Additional context used🔇 Additional comments (7)
Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media? 🪧 TipsChatThere are 3 ways to chat with CodeRabbit:
Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments. CodeRabbit Commands (Invoked using PR comments)
Other keywords and placeholders
CodeRabbit Configuration File (
|
| "hg_repo_id": "Qwen/Qwen1.5-1.8B-Chat", | ||
| "base_model": "qwen1.5", | ||
| "hg_repo_id": "Qwen/Qwen2.5-1.5B, | ||
| "base_model": "qwen2.5", |
There was a problem hiding this comment.
for base_model we are still using qwen1.5 since they stay the same
| "Qwen/Qwen1.5-7B": "qwen1.5", | ||
| "google/gemma-2b": "gemma", | ||
| "google/gemma-7b": "gemma", | ||
| "Qwen/Qwen2.5-0.5B": "qwen2.5", |
deprecate Qwen1.5 and gemma; and replace them with Qwen2.5 and gemma2
Summary by CodeRabbit
New Features
Documentation
README.mdto include updated model descriptions and identifiers.training_args.yamlto align with the new model configurations and parameters.