This project is an example of an AI inference server that can scale out using Nvidia Triton. The blockchain-based payment system is a bonus.
Demo video: LINK
Nvidia Triton is an attractive project: it supports a wide variety of AI models while delivering fast inference. From a scale-out perspective, however, it is not ideal on its own. This project implements a very simple example that addresses the scale-out issue on top of Nvidia Triton.
The system consists of the following services:

| Service | Description |
|---|---|
| Frontend + Backend Server | Provides server code to verify actual end-to-end operation. |
| Gateway | The entry point for user requests and responses. |
| Scheduler | Decides which Triton node should handle each request. This project uses a simple round-robin policy; see the sketch after this table. |
| Triton Node | Consists of two parts: a Triton Server, which performs inference using AI models, and a Manager, which manages the Triton server, announces the node with health-check messages, and forwards requests to the Triton server. |
| Health Checker | Monitors Triton nodes. Continuously maintains a list of live nodes and provides it to the Scheduler (see the sketch after this table). |
| Blockchain-Based Payment System | A payment system built on a private Ethereum network. |
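As a rough illustration of how the Health Checker's node list and the Scheduler's round-robin selection fit together, here is a minimal Go sketch. All type and function names are our own assumptions for illustration, not the project's actual code:

```go
package scheduler

import (
	"sort"
	"sync"
	"sync/atomic"
	"time"
)

// TritonNode is what a Manager reports about itself in a health-check message.
type TritonNode struct {
	Addr     string    // address of the node's Manager
	LastSeen time.Time // time of the most recent health-check message
}

// Registry is the Health Checker's view of the Triton nodes.
type Registry struct {
	mu    sync.Mutex
	nodes map[string]TritonNode
}

func NewRegistry() *Registry {
	return &Registry{nodes: make(map[string]TritonNode)}
}

// Heartbeat records a health-check message from a Manager.
func (r *Registry) Heartbeat(addr string) {
	r.mu.Lock()
	defer r.mu.Unlock()
	r.nodes[addr] = TritonNode{Addr: addr, LastSeen: time.Now()}
}

// Live returns the nodes seen within the timeout; this is the list the
// Health Checker hands to the Scheduler.
func (r *Registry) Live(timeout time.Duration) []TritonNode {
	r.mu.Lock()
	defer r.mu.Unlock()
	live := make([]TritonNode, 0, len(r.nodes))
	for _, n := range r.nodes {
		if time.Since(n.LastSeen) < timeout {
			live = append(live, n)
		}
	}
	// Sort for a stable order, so the round robin cycles predictably.
	sort.Slice(live, func(i, j int) bool { return live[i].Addr < live[j].Addr })
	return live
}

// RoundRobin is the Scheduler's node-selection policy.
type RoundRobin struct{ next uint64 }

// Pick returns the next live node in rotation, or false if none are live.
func (rr *RoundRobin) Pick(nodes []TritonNode) (TritonNode, bool) {
	if len(nodes) == 0 {
		return TritonNode{}, false
	}
	i := atomic.AddUint64(&rr.next, 1) % uint64(len(nodes))
	return nodes[i], true
}
```

In the real services the heartbeats and node list travel over the network; here they are reduced to method calls to show the data flow.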
- If you run into permission issues, prepend `sudo` to the commands.
- Server addresses can be configured in `setting.json` or in the shell scripts; a hypothetical example is shown after the commands below.
- Run each of the following blocks from the repository root, each in its own terminal.
```bash
# Frontend + Backend server
cd backend
bash quick_start.sh
```

```bash
# Gateway
cd service/gateway
bash quick_start.sh
```

```bash
# Scheduler
cd service/scheduler
bash quick_start.sh
```

```bash
# Health Checker
cd service/health-checker
bash quick_start.sh
```
```bash
# Token Manager
cd ethereum/token-manager
bash quick_start.sh
```

```bash
# Private Ethereum network
cd ethereum
bash start_ethereum.sh
```
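Once `start_ethereum.sh` is running, you can sanity-check the private network with go-ethereum's `ethclient`. A minimal sketch; the RPC endpoint and the account address below are assumptions, so adjust them to your own configuration:

```go
package main

import (
	"context"
	"fmt"
	"log"

	"github.com/ethereum/go-ethereum/common"
	"github.com/ethereum/go-ethereum/ethclient"
)

func main() {
	// Assumption: the private network exposes HTTP-RPC on geth's default port.
	client, err := ethclient.Dial("http://127.0.0.1:8545")
	if err != nil {
		log.Fatalf("dial private network: %v", err)
	}

	chainID, err := client.ChainID(context.Background())
	if err != nil {
		log.Fatalf("query chain ID: %v", err)
	}
	fmt.Println("connected, chain ID:", chainID)

	// Hypothetical account; replace with one funded in your genesis block.
	addr := common.HexToAddress("0x0000000000000000000000000000000000000000")
	balance, err := client.BalanceAt(context.Background(), addr, nil) // nil = latest block
	if err != nil {
		log.Fatalf("query balance: %v", err)
	}
	fmt.Println("balance (wei):", balance)
}
```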
```bash
# Triton node: Manager
cd gpu-node/manager
bash quick_start.sh
```

```bash
# Triton node: Triton server
cd gpu-node
bash start_triton.sh
```
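For reference, `setting.json` might look roughly like the following. Every key here is a guess for illustration only, so consult the actual file shipped with each service:

```json
{
  "gateway_addr": "0.0.0.0:8080",
  "scheduler_addr": "127.0.0.1:9000",
  "health_checker_addr": "127.0.0.1:9100",
  "triton_manager_addr": "127.0.0.1:9200",
  "ethereum_rpc": "http://127.0.0.1:8545"
}
```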
- If you run into any problems, please post them in 'Issues'.
| Repository | Description | URL |
|---|---|---|
| triton-inference-server | Connected to the Manager to serve AI models using Nvidia's Triton. This project uses version 23.12-py3. | LINK |
| go-ethereum | Used to build a private network for the blockchain-based payment system. This project uses version 1.13.15. | LINK |