Skip to content

AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)

Notifications You must be signed in to change notification settings

OpenReplicant/ProtoReplicant

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Alli - OpenReplicant Alpha v0.1

Image

THIS IS AN EARLY POC/PROTOTYPE - FIGURE IT OUT ON YOUR OWN, OR DON'T. EXPERIMENTAL!

The immediate goal is to provide a practical framework for multi-model AI agents. Using a web browser as the execution environment more than simply a thin client. Microservices are called for heavy-lifting, some models & functions run in-browser. To change avatars just drag and drop a VRM file onto the page. Ready for broadcast!

Initial code is a simple pipeline to animate a 3D character with voice. VAD -> STT -> LLM -> TTS -> VRM

You speak to it through your microphone, it transcribes you, sends that to Kobold Horde (pygmalion-6b default) and the returned text gets synthesized by coqui and played in the browser - which moves the mouth/face.

LLM prompt template for character & chatlogs stored as persistent object. This should be into an interface for import/export of the user/bot object.

Internal architecture and components, as well as embodiments will be expanded on.

RUN: (using CUDA. omit those options for CPU)

  • pip install -r requirements.txt
  • npm i -g http-server' (if you have nodeJS, or use any HTTP server on last step)
  • python stt/main.py -m tiny -c cuda
  • python tts/server.py --port 5002 --use_cuda True (--help for how to lookup model names and specify them using --model_name --vocoder_name)
  • http-server vrm-ui
  • (if you want to use an https://aihorde.net API key, insert it near the top of vrm-ui/llm-msg.js)

Based on projects: (thank you devs!)

About

AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages