AD-NeRF-Research

Set up environment:

Install MiniConda

MINICONDA_INSTALLER_SCRIPT=Miniconda3-4.5.4-Linux-x86_64.sh

MINICONDA_PREFIX=/usr/local

wget https://repo.continuum.io/miniconda/$MINICONDA_INSTALLER_SCRIPT

chmod +x $MINICONDA_INSTALLER_SCRIPT

./$MINICONDA_INSTALLER_SCRIPT -b -f -p $MINICONDA_PREFIX

Update MiniConda

conda install --channel defaults conda python=3.6 --yes

conda update --channel defaults --all --yes

Install MiniConda featuretools from conda-forge

conda install --channel conda-forge featuretools --yes

Clone Original Research Paper Code

git clone https://github.com/YudongGuo/AD-NeRF.git

Create Environment

source <path_of_conda.sh>

conda env create -f ./drive/MyDrive/AD-NeRF/environment.yml

Install pytorch & pytorch3d

pip install torch

git clone https://github.com/facebookresearch/pytorch3d.git

source <path_of_conda.sh> && conda init bash

conda activate adnerf

cd pytorch3d && pip install -e .

Upload this file to ./AD-NeRF/data_util/face_tracking/3DMM

https://drive.google.com/file/d/1PL8vCIRVg1FtyXW5JKJ4hPibl0uawq26/view?usp=sharing

Execute following command to convert BFM

cd ./AD-NeRF/data_util/face_tracking/

python convert_BFM.py

Training model

################## Training video must be in 25fps, mp4 format with encoding accepted by NeRF ##################

Preprocess video with audio for Training

cd <Path_to_AD-NeRF_folder>

source <path_of_conda.sh> && conda init bash && conda activate adnerf

conda info --envs

bash process_data.sh <VideoFileName.mp4>

Training Head

cd <Path_to_AD-NeRF_folder>

source <path_of_conda.sh> && conda init bash && conda activate adnerf

python ./NeRFs/HeadNeRF/run_nerf.py --config ./dataset//HeadNeRF_config.txt

Copy latest Trained Head.tar file into ./dataset//logs/<Name_com>

cd <Path_to_AD-NeRF_folder>

source <path_of_conda.sh> && conda init bash && conda activate adnerf

python ./NeRFs/TorsoNeRF/run_nerf.py --config ./dataset//TorsoNeRF_config.txt

Generate audiofile.npy

source <path_of_conda.sh> && conda init bash && conda activate adnerf

python3 ./data_util/deepspeech_features/extract_ds_features.py --input=<Path_to_Audio_File.wav>

Render Video

cd <Path_to_AD-NeRF_folder>

source /usr/local/etc/profile.d/conda.sh && conda init bash && conda activate adnerf

python ./NeRFs/TorsoNeRF/run_nerf.py --config ./dataset/Obama/TorsoNeRFTest_config.txt --aud_file=<Path_to_AudioFile.npy> --test_size=-1

Pretrained Model

100k Head: https://drive.google.com/file/d/1-hUY2MfQRLYEtxse04NePn8B_z5lyl3b/view?usp=sharing

200k Head: https://drive.google.com/file/d/1VPPcJY3SJAPRW3Sc-oOABpBxI5gAvK3O/view?usp=sharing

60k Torso: https://drive.google.com/file/d/1VPPcJY3SJAPRW3Sc-oOABpBxI5gAvK3O/view?usp=sharing

120k Torso: https://drive.google.com/file/d/1Xlto3nTZPK4qtnXG_3FkeAhMxchgKmdv/view?usp=sharing

Pretrained Obama Model: https://drive.google.com/drive/folders/1-FnRx8jO8Plat821Z4oXtFhJ-C9ZXZSI?usp=sharing

TorsoNeRFTest_config: https://drive.google.com/file/d/1WpReGOzv-TX53EDSVP8sjbTJnwwk7_iA/view?usp=sharing

TorsoNeRF_config: https://drive.google.com/file/d/1Wsk3SMC0AO0Cyuh_xhFA3dgoqqeO0Svq/view?usp=sharing

HeadNeRF_config: https://drive.google.com/file/d/1WypqiN9c8WQnwnPY878bfw_2ZtGHkY6q/view?usp=sharing

transforms_val: https://drive.google.com/file/d/1X2H2Sa7YcY5IGmz30DDUIlL1zRkbgTl3/view?usp=sharing

transforms_train: https://drive.google.com/file/d/1X4Mcqq5f6PIwdcK2c4MVeLvwJJx7zB4Q/view?usp=sharing

Citation

@inproceedings{guo2021adnerf, title={AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis}, author={Yudong Guo and Keyu Chen and Sen Liang and Yongjin Liu and Hujun Bao and Juyong Zhang}, booktitle={IEEE/CVF International Conference on Computer Vision (ICCV)}, year={2021}

Acknowledgement

We use DeepSpeech for audio feature extraction. The NeRF model is implemented based on NeRF-pytorch.

geogyu / ad-nerf-research Goto Github PK

ad-nerf-research's Introduction

AD-NeRF-Research

Install MiniConda

Update MiniConda

Install MiniConda featuretools from conda-forge

Clone Original Research Paper Code

Create Environment

Install pytorch & pytorch3d

Upload this file to ./AD-NeRF/data_util/face_tracking/3DMM

Execute following command to convert BFM

Training model

Preprocess video with audio for Training

Training Head

Copy latest Trained Head.tar file into ./dataset//logs/<Name_com>

Generate audiofile.npy

Render Video

Pretrained Model

Citation

Acknowledgement

ad-nerf-research's People

Contributors

Stargazers

Watchers

Forkers

ad-nerf-research's Issues

Recommend Projects

Recommend Topics

Recommend Org