Request for MLX Compatibility & Conversion Guide

I’m interested in running the NAMO-R1 model on Apple Silicon using MLX (Machine Learning for Apple Devices). Since MLX is optimized for Apple hardware and supports transformer-based models, I’d like to convert NAMO-R1 into an MLX-compatible format. However, I need clarification on a few things:

**Questions & Challenges:**

**1. Model Format & Conversion:**
- The current model is in .safetensors, but MLX requires NumPy (.npz).
- Could you provide guidance on extracting and converting weights into a format MLX can use?

**2. Understanding namo.ve (Vision Encoder?)**
- I noticed namo.llm (the language model) and namo.ve (possibly a vision encoder?).
- Could you clarify if NAMO-R1 is a multimodal model? If so, are there specific dependencies required for image inputs?

**3. Defining Model Architecture in MLX**
- MLX requires an explicit model definition (nn.Module in mlx.nn).
- Do you have details on the exact architecture (hidden sizes, number of layers, activation functions) to help reproduce it in MLX?

**Request:**
Would it be possible to provide a model architecture description, weight extraction guide, or any hints for making NAMO-R1 run on MLX? I’d love to contribute a conversion script once I get a better understanding!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Request for MLX Compatibility & Conversion Guide #16

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Request for MLX Compatibility & Conversion Guide #16

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions