Feel free to create pull requests, but do not commit subtitles !
To create a visualization :
- Extracts the subtitles using FFMPEG to the VTT format, due to obvious copyright problems, they can't be on the repository.
- Preprocess the image using a graphical tool to create a mask.
- Black: Word cloud space
- White: Kept as is from the image
- Grey value: Discarded from the visualization
- From this mask and the words obtained from the subtitles, the script uses nltk to remove stop words, wordcloud to create a visualization and a bit of numpy image math's.
- Neon Genesis Evangelion
- Cowboy Bebop
- Darling in the Franxx
- Mirai Nikki
- Death Note
- Steins;Gate
- One-Punch Man
- Bocchi the Rock!
- Chainsaw Man
Data used:
- English subtitles from : Neon Genesis Evangelion (1995)
- Original image
Reddit posts : r/dataisbeautiful r/evangelion
Creation date: 20210122
Data used:
- English subtitles from : Cowboy Bebop (1998)
- Original image
Reddit posts: r/dataisbeautiful / r/cowboybebop
Creation date: 20210509
Data used:
- English subtitles from : Darling in the Franxx (2018)
- Original image
Creation date: 20211115
Data used:
- English subtitles from : Mirai Nikki (2011)
- Original image
Creation date: 20220304
Data used:
- English subtitles from : Death Note (2006)
- Original image
Creation date: 20220822
Data used:
- English subtitles from : Steins;Gate (2009)
- Original image
Creation date: 20230720
Data used:
- English subtitles from : One-Punch Man season 1 (2015)
- Original image
Creation date: 20230801
Data used:
- English subtitles from : Bocchi the Rock season 1 (2022)
- Original image
Creation date: 20241103
Data used:
- English subtitles from : Chainsaw Man season 1 (2022)
- Original image
Creation date: 20241103