TensorRT SAM3 (C++ Inference)

This is a TensorRT-based SAM3 inference repository (C++ implementation). It currently implements image preprocessing, image encoding, text encoding, decoder decoding, and post-processing processes, supporting multi-text prompt inference for images.

Key Features:

Uses TensorRT engine
C++ + CUDA implementation of preprocessing/post-processing kernels, suitable for efficient GPU operation
Supports mask/box output based on text prompts and geometric bounding boxes
Utilizes batching and memory reuse to simultaneously recognize multiple text prompt categories
Draw boxes on image A, recognize on image B

ONNX Model and TensorRT Model Export

Refer to the repository below to export ONNX models
https://github.com/jamjamjon/usls.git
Address of already exported ONNX models
https://huggingface.co/tangliyang/onnx_model_store

Vision Encode Model Quantization

Refer to the repository below to perform int8 quantization on the SAM3 vision encode model https://github.com/NVIDIA/Model-Optimizer/tree/main/examples/windows/onnx_ptq/sam2

Environment

Server
ubuntu 24.04
GPU NVIDIA GeForce RTX 4090
Image
nvcr.io/nvidia/tensorrt:25.10-py3

Recognition Results

Multi-word Text Prompts Can simultaneously recognize multiple categories

Geometric Prompts

Mixed Prompts

Prompt boxes on image A, recognition on image B

Speed

Around 50ms

Build and Run

cmake .. -DCMAKE_PREFIX_PATH="$(python3 -m pybind11 --cmakedir)"
make -j$(nproc)

web UI

References

https://github.com/jamjamjon/usls.git

License and Contributions

This repository is an example for personal/research use, welcome issues.

Name		Name	Last commit message	Last commit date
Latest commit History 44 Commits
src		src
workspace		workspace
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
Dockerfile		Dockerfile
README.md		README.md
README_ZH.md		README_ZH.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TensorRT SAM3 (C++ Inference)

Key Features:

ONNX Model and TensorRT Model Export

Vision Encode Model Quantization

Environment

Recognition Results

Speed

Build and Run

web UI

References

License and Contributions

About

Uh oh!

Contributors 1

Languages

Folders and files

Latest commit

History

Repository files navigation

TensorRT SAM3 (C++ Inference)

Key Features:

ONNX Model and TensorRT Model Export

Vision Encode Model Quantization

Environment

Recognition Results

Speed

Build and Run

web UI

References

License and Contributions

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Contributors 1

Languages