Skip to content

Conversation

@ved1beta
Copy link
Contributor

@ved1beta ved1beta commented Apr 3, 2025

No description provided.

@ved1beta ved1beta changed the title layer norm , multiquery , weight trying TODO Fix LayerNorm, Multi-Query Attention, and Weight-Tying in OLMo to HF Conversion Script Apr 3, 2025
@ved1beta ved1beta marked this pull request as draft April 3, 2025 19:35
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant