Skip to content

Request for MLX Compatibility & Conversion Guide #16

@yourappleintelligence

Description

@yourappleintelligence

I’m interested in running the NAMO-R1 model on Apple Silicon using MLX (Machine Learning for Apple Devices). Since MLX is optimized for Apple hardware and supports transformer-based models, I’d like to convert NAMO-R1 into an MLX-compatible format. However, I need clarification on a few things:

Questions & Challenges:

1. Model Format & Conversion:

  • The current model is in .safetensors, but MLX requires NumPy (.npz).
  • Could you provide guidance on extracting and converting weights into a format MLX can use?

2. Understanding namo.ve (Vision Encoder?)

  • I noticed namo.llm (the language model) and namo.ve (possibly a vision encoder?).
  • Could you clarify if NAMO-R1 is a multimodal model? If so, are there specific dependencies required for image inputs?

3. Defining Model Architecture in MLX

  • MLX requires an explicit model definition (nn.Module in mlx.nn).
  • Do you have details on the exact architecture (hidden sizes, number of layers, activation functions) to help reproduce it in MLX?

Request:
Would it be possible to provide a model architecture description, weight extraction guide, or any hints for making NAMO-R1 run on MLX? I’d love to contribute a conversion script once I get a better understanding!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions