
zenlm/ml-fastvlm



FastVLM

Fast vision-language model architecture research. Part of the Zen LM ecosystem.

Overview

FastVLM explores efficient architectures for vision-language models, focusing on reducing computational overhead while maintaining strong multimodal understanding.

Features

  • Efficient vision-language model architecture
  • Reduced computational overhead compared with standard VLMs
  • Strong multimodal understanding
  • Research reference implementation

Related

  • zen-vl — Zen vision-language models
  • jin — Multimodal understanding framework
  • Zen LM — Full model family

License

See LICENSE file.

Part of the Zen LM ecosystem by Hanzo AI

About

This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" (CVPR 2025).

Languages

  • Python 81.6%
  • Swift 17.1%
  • Shell 1.2%
  • Other 0.1%