Open-source vision stack with stereo camera hardware, GPU processing, and AI agent for training video classifiers.
-
Updated
Oct 19, 2025 - Python
Open-source vision stack with stereo camera hardware, GPU processing, and AI agent for training video classifiers.
Masked Multi-Component Gated Decomposition Architecture
A physics-based video search engine using Meta's V-JEPA 2 world model to find videos with similar motion dynamics.
Locally-Hosted Media Gallery App with AI Similarity Search
Can the V-JEPA2 model be used as a world model?
🎥 Discover similar motion dynamics in videos with MotionMatch, a physics-based search engine leveraging Meta's V-JEPA 2 for efficient retrieval.
🎥 Enhance video–text alignment using V-DeClip's advanced MCGD architecture for precise, semantically decomposed video embeddings.
Add a description, image, and links to the vjepa topic page so that developers can more easily learn about it.
To associate your repository with the vjepa topic, visit your repo's landing page and select "manage topics."