Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

se_extractor.py crashes on line 28: "segments, info = model.transcribe(audio_path, beam_size=5, word_timestamps=True)" #323

Open
PK268 opened this issue Oct 22, 2024 · 2 comments
Labels
bug Something isn't working

Comments

@PK268
Copy link

PK268 commented Oct 22, 2024

Platform: Windows 11
Python Version: 3.9.0

After following the tutorial here for installing OpenVoice on Windows. I cannot run demo_part3 successfully featuring OpenVoice V2. Narrowing down the issue, I found that the program crashes when trying to transcribe my audio clip in the se_extractor.py line 28. Everything works fine when using V1, granted, only after a couple workarounds. I'd be happy to transcribe the audio myself, is that planned to become a feature?

This is one of my first github issues so let me know if you need more info

Copy link

dosubot bot commented Oct 22, 2024

Hello @PK268! I'm here to help you with any bugs, questions, or contributions you have. Let's work together to solve the issue you're facing.

I wasn't able to find specific information about the 'transcribe' function in OpenVoice V2 or any known issues related to it. However, you might want to check if all dependencies are correctly installed and compatible with your setup. The dependencies for OpenVoice V2 are compatible with Python 3.9.0, so that shouldn't be an issue [1]. If you managed to get V1 working with some workarounds, it might be worth trying similar approaches for V2. Unfortunately, I don't have information on whether manual transcription is planned as a feature.

To continue talking to Dosu, mention @dosu.

@dosubot dosubot bot added the bug Something isn't working label Oct 22, 2024
@Blaizzy
Copy link

Blaizzy commented Oct 30, 2024

Try changing the whisper model from medium to tiny.

It worked for me :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants