dadagust/GenAI-Inno-Sama

GenAI-Inno-Sama

This project focuses on creating an advanced neuro avatar using Large Language Model (LLM) technology. By interpreting vocal input and analyzing emotional cues, the system will deliver engaging, context-aware responses while exhibiting animated motions synchronized to the conversation’s mood.

Key Components and Workflow

1. Server side

  • Purpose: Runs the LLM and the TTS voice model.
    • Receives text from the client and returns a .wav file containing the spoken reply.
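
The server's text-in, .wav-out flow can be sketched as follows. This is only an illustration using the Python standard library: handle_text, pcm_to_wav, echo_llm and silent_tts are hypothetical names, and the two stand-in functions take the place of the real LLM and TTS model, which the repository's server code provides.

```python
import io
import wave
from typing import Callable


def pcm_to_wav(pcm: bytes, sample_rate: int = 22050) -> bytes:
    """Wrap raw 16-bit mono PCM samples in a WAV container."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(1)   # mono
        wav.setsampwidth(2)   # 16-bit samples
        wav.setframerate(sample_rate)
        wav.writeframes(pcm)
    return buf.getvalue()


def handle_text(
    text: str,
    generate_reply: Callable[[str], str],
    synthesize: Callable[[str], bytes],
) -> bytes:
    """Text in, WAV bytes out: LLM reply first, then speech synthesis."""
    reply = generate_reply(text)
    pcm = synthesize(reply)
    return pcm_to_wav(pcm)


# Hypothetical stand-ins so the sketch runs without any model installed.
def echo_llm(prompt: str) -> str:
    return f"You said: {prompt}"


def silent_tts(reply: str) -> bytes:
    # 220 silent 16-bit samples per character (about 10 ms at 22050 Hz).
    return b"\x00\x00" * (220 * len(reply))


wav_bytes = handle_text("hello", echo_llm, silent_tts)
```

Passing the LLM and TTS steps in as callables keeps the request handler independent of whichever models the server actually loads.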

2. Client side

  • Purpose: Runs the Vosk model to transcribe the user's audio into text.
    • Connects to the server, sends it the transcribed text, and plays back the .wav file the server returns.
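
A minimal sketch of the client side, assuming the server accepts the text as a plain HTTP POST body and answers with raw .wav bytes. The transport, the server_url parameter, and the function names are assumptions made for illustration; the repository may use a different protocol. Playback of the saved file is left out.

```python
import json
import urllib.request
import wave


def text_from_result(result_json: str) -> str:
    """Pull the recognised text out of a Vosk result, which is a JSON string."""
    return json.loads(result_json).get("text", "").strip()


def transcribe(wav_path: str, model_dir: str) -> str:
    """Run Vosk over a 16-bit mono WAV file and return the recognised text."""
    from vosk import KaldiRecognizer, Model  # third-party: pip install vosk

    parts = []
    with wave.open(wav_path, "rb") as wav:
        recognizer = KaldiRecognizer(Model(model_dir), wav.getframerate())
        while True:
            chunk = wav.readframes(4000)
            if not chunk:
                break
            # AcceptWaveform returns True when an utterance has ended.
            if recognizer.AcceptWaveform(chunk):
                parts.append(text_from_result(recognizer.Result()))
    parts.append(text_from_result(recognizer.FinalResult()))
    return " ".join(part for part in parts if part)


def fetch_voice(text: str, server_url: str, out_path: str = "reply.wav") -> str:
    """POST the text to the server (assumed endpoint) and save the returned WAV."""
    request = urllib.request.Request(
        server_url,
        data=text.encode("utf-8"),
        headers={"Content-Type": "text/plain; charset=utf-8"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response, open(out_path, "wb") as out:
        out.write(response.read())
    return out_path
```

With a Vosk model directory downloaded, fetch_voice(transcribe("question.wav", "model"), server_url) would produce reply.wav, where both file names and the model path are placeholders.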

3. Ollama

  • Purpose: Creates and serves a model from the .gguf file.
  • Steps:
    • Run ollama serve to start an Ollama server for the model.
    • Run ollama create innosama -f ModelFile to create the innosama model from the ModelFile.
    • The server code handles the rest.
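
Once ollama serve is running and the innosama model exists, server code can query it over Ollama's local REST API. The sketch below assumes Ollama's default address (localhost:11434) and a non-streaming request; build_request and ask are hypothetical helper names, not functions from this repository.

```python
import json
import urllib.request

# Assumption: Ollama is listening on its default host and port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(prompt: str, model: str = "innosama") -> urllib.request.Request:
    """Build a non-streaming generate request for the given model."""
    body = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask(prompt: str, model: str = "innosama") -> str:
    """Send the prompt to Ollama and return the generated text."""
    with urllib.request.urlopen(build_request(prompt, model)) as response:
        return json.loads(response.read())["response"]
```

Calling ask("Hello!") requires the Ollama server to be running. With "stream" set to False, the reply arrives as a single JSON object whose "response" field holds the text.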
