Subtitler

Automatically subtitle any video spoken in any language to a language of your choice using AI.

Models used:

OpenAI whisper C port - for audio-to-text
Facebook M2M10 - for translation

Tools used:

ffmpeg

Please don't forget to star the repository if you find it useful or educational!

Before:

soldier.mp4

After (in Romanian - model_type=medium, language_model_type=base):

soldier_subtitled.mp4

Setup

Install using pip.

pip install gptsubtitler

Install ffmpeg:

# Ubuntu or Debian
sudo apt update && sudo apt install ffmpeg

# MacOS
brew install ffmpeg

# Windows using Chocolatey https://chocolatey.org/
choco install ffmpeg

Quick guide

Example usage for adding subtitles and translating them in Romanian:

Command line:

gptsubtitler soldier.mp4 --source_language en --target_language ro --captioning_model_type medium --language_model_type base

Or in Python

from gptsubtitler import Transcriber

# I strongly recommend using the "medium" model_type.
Transcriber.transcribe("soldier.mp4", source_language="en", target_language="ro", captioning_model_type="medium", language_model_type="base")

You can also use the Translator class from translator.py if you just want to translate some text.

Example usage for translating from English to Romanian:

from gptsubtitler import Translator

print(Translator.translate("Hi!", target_language="ro", source_language="en"))

If you have generated a .srt file and just want to add subtitles:

from gptsubtitler import create_video_with_subtitles
create_video_with_subtitles("video.mp4", "output.srt", "video_subtitled.mp4")

Options

Args:
    video_file (str): Path to video file.

    output_video_file (str, optional): Path to output video file. Defaults to video_file_subtitled.

    output_subtitle_file (str, optional): Path to output SRT file. Defaults to "output.srt".

    source_language (str, optional): Source language for translation. Defaults to en.

    target_language (str, optional): Target language for translation. Defaults to None.

    captioning_model_type (str, optional): Model type. Defaults to "base".

    language_model_type (str, optional): Language model type. Defaults to "base".

    model_dir (str, optional): Path to model directory. Defaults to None.

Available options for captioning_model_type (the audio to text model):

tiny
base - default
small
medium
large

Available options for language_model_type (the language translator model):

base - default
large

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
gptsubtitler		gptsubtitler
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Subtitler

Setup

Quick guide

Options

About

Releases

Packages

Languages

License

extremq/gptsubtitler

Folders and files

Latest commit

History

Repository files navigation

Subtitler

Setup

Quick guide

Options

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages