DAVE - Your Digital Assistant with Voice Empowerment

DAVE is a digital voice assistant built using OpenAI's powerful technologies including Whisper ASR API, GPT-4 Chat models, and Text-To-Speech (TTS) APIs. With DAVE, you can smoothly interact with a virtual assistant that listens to you attentively, processes the information with a remarkable understanding, and articulates a reply, all made perceivable with intuitive transition states.

Getting Started

As DAVE is just a static html page, some javascript, and a couple images, running and hosting it is easy. If you're a python developer you can use something like python3 -m http.server 8080, otherwise there are a variety of options from VSCode extensions like Live Server to full NGINX deployments.

A demo deployment of DAVE is additionally hosted as a static site here here using DigitalOcean's App Platform.

OpenAI API

In order to use DAVE, an OpenAI account is required with the access to the following model endpoints.

Transcriptions /v1/audio/transcriptions
Chat /v1/chat/completions
Speech /v1/audio/speech

Basic Configuration

For basic usage of DAVE, the only configuration required is for you to provide your OpenAI API key within the page url apiKey parameter.

e.g. ?apiKey=sk-SsMzb...z07l

Advanced Configuration

For more advanced usage, DAVE allows the voice, chat and text-to-speech models to be configured additionally using url parameters. It is important to note that for the models, such as GPT-4, it will depend on your level of API access.

Below are the list of availible configuration options (as of 2023/11/08):

Voice Models (URL Param 'voiceModel'):
- alloy
- echo
- fable
- onyx (default)
- nova
- shimmer
Chat Models (URL Param 'chatModel'):
- gpt-3.5-turbo (default)
- gpt-3.5-turbo-0301
- gpt-3.5-turbo-0613
- gpt-3.5-turbo-1106
- gpt-3.5-turbo-16k
- gpt-3.5-turbo-16k-0613
- gpt-3.5-turbo-instruct
- gpt-3.5-turbo-instruct-0914
- gpt-4
- gpt-4-0314
- gpt-4-0613
- gpt-4-1106-preview
Text-to-Speech Models (URL Param 'textToSpeechModel'):
- tts-1 (default)
- tts-1-hd
- tts-1-1106
- tts-1-hd-1106
Text-to-Speech Models (URL Param 'textToSpeechModel'):
- You are a helpful voice assistant. (default)
- custom
Conversation Mode (URL Param 'conversationMode'):
- true (default)
- false

Note: These configuration options are additionally displayed within the console log for convenient reference.

Docker

If you wish to run DAVE using docker, you can use the following command to build and deploy as a docker image.

docker build -t dave .

docker run -d \
    --name dave \
    -p 8080:80 \
    dave

Future Enhancements

Automatically cut off recording after n seconds of silence.

Acknowledgments

The development of DAVE was inspired by the excellent work of Romain Huet, who demo'd a similar (albeit more complex) application at the 2023 OpenAI DevDay. His excellent presentation is availible to watch here on YouTube.

License

The code and documentation in this project are released under the GPLv3 License.

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
fonts		fonts
icons		icons
js		js
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
index.html		index.html

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DAVE - Your Digital Assistant with Voice Empowerment

Getting Started

OpenAI API

Basic Configuration

Advanced Configuration

Docker

Future Enhancements

Acknowledgments

License

About

Languages

License

BadgerHobbs/Dave

Folders and files

Latest commit

History

Repository files navigation

DAVE - Your Digital Assistant with Voice Empowerment

Getting Started

OpenAI API

Basic Configuration

Advanced Configuration

Docker

Future Enhancements

Acknowledgments

License

About

Resources

License

Stars

Watchers

Forks

Languages