Start TTS generation as soon as 1 sentence worth of tokens are generated. #3343

chrishart0 · 2024-06-21T02:09:36Z

Is your feature request related to a problem? Please describe.
The new audio generation features are awesome, but audio could generation much closer to real time. As it stands, text does not get sent to the audio generation API until the full response has been received. Preventing anything like a real time chat experience.

Describe the solution you'd like
Since the TTS functionality already sends 1 sentence at a time to get generated, we should send the first sentence out to the audio API as soon as it has streamed in. That way, we can get audio back withing seconds of the message being sent instead of waiting for the whole response to come back.

Describe alternatives you've considered
None

Additional context

Technical Notes

There are two places audio is generated which will need to be touched. The regular chat interface and the call interface. We will need to figure out how to tap into stream and, I suppose use the same sentence chunker which is already written chunking text to send to the TTS API for chunking the stream. I've only glanced at the code but this doesn't sound too hard.

open-webui locked and limited conversation to collaborators Jun 21, 2024

tjbck converted this issue into discussion #3345 Jun 21, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

This issue was moved to a discussion.

Start TTS generation as soon as 1 sentence worth of tokens are generated. #3343

Start TTS generation as soon as 1 sentence worth of tokens are generated. #3343

chrishart0 commented Jun 21, 2024 •

edited

Loading

This issue was moved to a discussion.

This issue was moved to a discussion.

Start TTS generation as soon as 1 sentence worth of tokens are generated. #3343

Start TTS generation as soon as 1 sentence worth of tokens are generated. #3343

Comments

chrishart0 commented Jun 21, 2024 • edited Loading

This issue was moved to a discussion.

chrishart0 commented Jun 21, 2024 •

edited

Loading