OTAvatar_processing

This repository provides tools for preprocessing videos from the HDTF dataset used in the paper

Environment Setup

The environment setup is still being refined. For now, please refer to the README file in each subdirectory for guidance. Once everything is finalized, all requirements will be collected in a requirements.txt file in the root directory. In the meantime, you can create a conda environment via

conda create --name otavatar_processing python=3.9

Step 0: Video Download

This section is adopted from HDTF

Step 1: Background Removal

This section is adopted from MODNet

Step 2: Video Cropping

This section is in the FaceCrop folder, adopted from FOMM. The primary objective is to zoom in on the facial regions and discard any irrelevant background.
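As a rough illustration of what "zooming in on the facial region" can look like, the sketch below computes a square crop box around a detected face box, enlarges it by a margin, and keeps it inside the frame. The function name `expand_bbox`, the `scale` parameter, and the clamping policy are assumptions made for this example; they are not taken from the FaceCrop scripts or from FOMM.

```python
def expand_bbox(bbox, scale, frame_w, frame_h):
    """Return a square crop box around `bbox`, enlarged by `scale`.

    `bbox` is (left, top, right, bottom) in pixels. The result is clamped
    so that it stays inside a frame of size frame_w x frame_h.
    Hypothetical helper for illustration only.
    """
    left, top, right, bottom = bbox
    cx = (left + right) / 2.0
    cy = (top + bottom) / 2.0

    # Use the longer face side so the crop is square, then add the margin.
    side = max(right - left, bottom - top) * scale
    side = min(side, frame_w, frame_h)  # the crop cannot exceed the frame
    half = side / 2.0

    # Shift the center if needed so the box does not leave the frame.
    cx = min(max(cx, half), frame_w - half)
    cy = min(max(cy, half), frame_h - half)

    return (
        int(round(cx - half)),
        int(round(cy - half)),
        int(round(cx + half)),
        int(round(cy + half)),
    )


# A 200x200 face box in a 1280x720 frame, enlarged by a factor of 2.
crop_box = expand_bbox((400, 200, 600, 400), 2.0, 1280, 720)
```

A frame stored as a NumPy array could then be cropped with `frame[top:bottom, left:right]` using the returned coordinates.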

Step 3: Landmark Detection

This section is adopted from PIRenderer

3DMM Extraction

Step 4: Coarse Stage

This section is adopted from PIRenderer

Step 5: Fine Stage

This section is adopted from ADNeRF

Q&A

Why is Background Removal performed on the uncropped videos?

When we tried to remove the background from cropped videos, we observed that the matting results were unstable between frames. This is because the matting model was trained on images in which the head occupies only a small portion of the image, which is precisely what raw video captures. We therefore remove the background first and crop the face afterwards, which led to more encouraging results.
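The ordering described above can be sketched as a small per-frame loop in which matting always sees the full frame and cropping is applied to the matted result. The names `remove_background_then_crop`, `matte_fn`, and `crop_fn` are placeholders introduced for this example; in practice they would stand for the MODNet-based and FaceCrop steps, whose actual interfaces are not shown here.

```python
def remove_background_then_crop(frames, matte_fn, crop_fn):
    """Apply matting to each full frame first, then crop the matted frame."""
    processed = []
    for frame in frames:
        matted = matte_fn(frame)           # matting sees the raw, uncropped frame
        processed.append(crop_fn(matted))  # cropping happens only afterwards
    return processed


# Toy stand-ins: a "frame" is a list of pixel rows, odd values play the
# role of background, and the crop keeps the last two columns.
toy_frames = [[[1, 2, 3], [4, 5, 6]]]
toy_matte = lambda f: [[p if p % 2 == 0 else 0 for p in row] for row in f]
toy_crop = lambda f: [row[1:] for row in f]

result = remove_background_then_crop(toy_frames, toy_matte, toy_crop)
```

Keeping the two steps as separate callables makes the order explicit and lets either step be swapped out without touching the other.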

Why is Video Cropping different from FOMM?

Initially, we planned to use the [FOMM](https://github.com/AliaksandrSiarohin/video-preprocessing) preprocessing, which has been widely used by prior talking-face methods. However, we had difficulty obtaining a stable and accurate head pose during the fine stage, because the head occupied too much of the image. As a result, we made some minor adjustments to the cropping.
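To make "the head occupied too much of the image" concrete, the sketch below measures the fraction of a crop that the face box covers. It is only an illustration: the text does not say what the adjustments were, so the wider crop used here is an assumed example, and `head_fraction` is a hypothetical helper rather than part of this repository.

```python
def head_fraction(face_bbox, crop_box):
    """Return the share of the crop area covered by the face box.

    Both boxes are (left, top, right, bottom) in pixels, and the face box
    is assumed to lie entirely inside the crop box.
    """
    fl, ft, fr, fb = face_bbox
    cl, ct, cr, cb = crop_box
    face_area = (fr - fl) * (fb - ft)
    crop_area = (cr - cl) * (cb - ct)
    return face_area / crop_area


face = (400, 200, 600, 400)

# A tight crop: the face fills the whole image.
tight = head_fraction(face, (400, 200, 600, 400))

# An assumed wider crop with twice the side length: the face fills a quarter.
wide = head_fraction(face, (300, 100, 700, 500))
```

A check like this can be run on a few frames to compare how much context different crop settings leave around the head.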

Acknowledgements

We thank HDTF, FOMM, ADNeRF, PIRenderer, and MODNet for providing their processing scripts.
