Skip to content
@CASIA-IVA-Lab

CASIA-IVA-Lab

Popular repositories Loading

  1. DANet DANet Public

    Dual Attention Network for Scene Segmentation (CVPR2019)

    Python 2.5k 484

  2. VALOR VALOR Public

    [TPAMI2024] Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset

    Python 305 18

  3. VAST VAST Public

    [NIPS2023] Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset

    Jupyter Notebook 296 18

  4. MRES MRES Public

    This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentation", accepted by CVPR 2024.

    72

  5. ChatBridge ChatBridge Public

    ChatBridge, an approach to learning a unified multimodal model to interpret, correlate, and reason about various modalities without relying on all combinations of paired data.

    Python 54 1

  6. VideoNIAH VideoNIAH Public

    VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs

    Python 53 1

Repositories

Showing 10 of 14 repositories
  • VRoPE Public

    [EMNLP 2025 Main] Official implementation of VRoPE: Rotary Position Embedding for Video Large Language Models.

    CASIA-IVA-Lab/VRoPE’s past year of commit activity
    Python 27 0 0 0 Updated Nov 18, 2025
  • PrefixGrouper Public

    An efficient GRPO training util.

    CASIA-IVA-Lab/PrefixGrouper’s past year of commit activity
    Python 50 MIT 2 0 0 Updated Jun 13, 2025
  • VideoNIAH Public

    VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs

    CASIA-IVA-Lab/VideoNIAH’s past year of commit activity
    Python 53 1 4 0 Updated Mar 9, 2025
  • COSA Public

    [ICLR2024] Codes and Models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model

    CASIA-IVA-Lab/COSA’s past year of commit activity
    Python 43 MIT 3 3 0 Updated Dec 25, 2024
  • VALOR Public

    [TPAMI2024] Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset

    CASIA-IVA-Lab/VALOR’s past year of commit activity
    Python 305 MIT 18 7 0 Updated Dec 25, 2024
  • DANet Public

    Dual Attention Network for Scene Segmentation (CVPR2019)

    CASIA-IVA-Lab/DANet’s past year of commit activity
    Python 2,451 MIT 484 61 1 Updated Dec 23, 2024
  • ChatSearch Public

    ChatSearch: a Dataset and a Generative Retrieval Model for General Conversational Image Retrieval

    CASIA-IVA-Lab/ChatSearch’s past year of commit activity
    6 0 0 0 Updated Oct 24, 2024
  • MRES Public

    This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentation", accepted by CVPR 2024.

    CASIA-IVA-Lab/MRES’s past year of commit activity
    72 Apache-2.0 0 5 0 Updated Jun 3, 2024
  • SC-Tune Public

    Official code for CVPR 2024 paper, "SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models"

    CASIA-IVA-Lab/SC-Tune’s past year of commit activity
    Python 16 MIT 1 1 0 Updated Apr 22, 2024
  • VAST Public

    [NIPS2023] Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset

    CASIA-IVA-Lab/VAST’s past year of commit activity
    Jupyter Notebook 296 MIT 18 22 0 Updated Mar 14, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…