video-llm
Here are 7 public repositories matching this topic...
PISCO: Precise Video Instance Insertion with Sparse Control
-
Updated
Feb 13, 2026 - Python
[NeurIPS 2025] 𝓡𝓣𝓥-𝓑𝓮𝓷𝓬𝓱: Benchmarking MLLM Continuous Perception, Understanding and Reasoning through Real-Time Video.
-
Updated
Jan 15, 2026 - Python
(NeurIPS 2024 Spotlight) TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment
-
Updated
Sep 27, 2024 - Python
Paper list of Video LLM hallucination. Welcome to Star and Contribute!
-
Updated
Feb 18, 2026
[Arxiv 2509.14199] DENSE VIDEO UNDERSTANDING WITH GATED RESIDUAL TOKENIZATION
-
Updated
Sep 21, 2025 - Python
D-ORCA: Dialogue-Centric Optimization for Robust Audio-Visual Captioning
-
Updated
Feb 11, 2026 - Python
Improve this page
Add a description, image, and links to the video-llm topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the video-llm topic, visit your repo's landing page and select "manage topics."