se_extractor.py crashes on line 28: "segments, info = model.transcribe(audio_path, beam_size=5, word_timestamps=True)" #323

PK268 · 2024-10-22T09:53:29Z

Platform: Windows 11
Python Version: 3.9.0

After following the tutorial here for installing OpenVoice on Windows. I cannot run demo_part3 successfully featuring OpenVoice V2. Narrowing down the issue, I found that the program crashes when trying to transcribe my audio clip in the se_extractor.py line 28. Everything works fine when using V1, granted, only after a couple workarounds. I'd be happy to transcribe the audio myself, is that planned to become a feature?

This is one of my first github issues so let me know if you need more info

dosubot · 2024-10-22T09:54:38Z

Hello @PK268! I'm here to help you with any bugs, questions, or contributions you have. Let's work together to solve the issue you're facing.

I wasn't able to find specific information about the 'transcribe' function in OpenVoice V2 or any known issues related to it. However, you might want to check if all dependencies are correctly installed and compatible with your setup. The dependencies for OpenVoice V2 are compatible with Python 3.9.0, so that shouldn't be an issue ^[1]. If you managed to get V1 working with some workarounds, it might be worth trying similar approaches for V2. Unfortunately, I don't have information on whether manual transcription is planned as a feature.

To continue talking to Dosu, mention @dosu.

Blaizzy · 2024-10-30T16:04:12Z

Try changing the whisper model from medium to tiny.

It worked for me :)

dosubot bot added the bug Something isn't working label Oct 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

se_extractor.py crashes on line 28: "segments, info = model.transcribe(audio_path, beam_size=5, word_timestamps=True)" #323

se_extractor.py crashes on line 28: "segments, info = model.transcribe(audio_path, beam_size=5, word_timestamps=True)" #323

PK268 commented Oct 22, 2024

dosubot bot commented Oct 22, 2024

Blaizzy commented Oct 30, 2024 •

edited

Loading

se_extractor.py crashes on line 28: "segments, info = model.transcribe(audio_path, beam_size=5, word_timestamps=True)" #323

se_extractor.py crashes on line 28: "segments, info = model.transcribe(audio_path, beam_size=5, word_timestamps=True)" #323

Comments

PK268 commented Oct 22, 2024

dosubot bot commented Oct 22, 2024

Blaizzy commented Oct 30, 2024 • edited Loading

Blaizzy commented Oct 30, 2024 •

edited

Loading