FastVLM

Fast vision-language model architecture research. Part of the Zen LM ecosystem.

Overview

FastVLM explores efficient architectures for vision-language models, focusing on reducing computational overhead while maintaining strong multimodal understanding.

Features

Efficient vision-language model architecture
Reduced computational overhead vs standard VLMs
Strong multimodal understanding
Research reference implementation

License

See LICENSE file.

Part of the Zen LM ecosystem by Hanzo AI

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
app		app
docs		docs
llava		llava
model_export		model_export
.gitignore		.gitignore
ACKNOWLEDGEMENTS		ACKNOWLEDGEMENTS
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
LICENSE_MODEL		LICENSE_MODEL
LLM.md		LLM.md
README.md		README.md
get_models.sh		get_models.sh
predict.py		predict.py
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FastVLM

Overview

Features

Related

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

FastVLM

Overview

Features

Related

License

About

Resources

License

Code of conduct

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages