Real-Time View Interpolation Network (VINet)

This is the official repository of the VINet module from "A Multi-user Oriented Live Free-viewpoint Video Streaming System Based On View Interpolation" (ICME 2022).

[ArXiv] | [Demo]

Installation

```
git clone https://github.com/Eric-chuan/VINet
cd VINet
conda env create -f environments.yml
conda activate VINet
```

Usage

Interpolate the intermediate view between two views

Organize your multi-view image folder <img_dir> as follows:

├──── IMG_DIR/
    ├──── 00.png
    ├──── 01.png
    ├──── ...
    ├──── 11.png

Run the script:

```
python inference.py --img_dir image_folder --out_dir output_folder --exp=1 --gpu_idx=0
```

Output:

├──── OUT_DIR/
    ├──── 00.png
        ├──── inter_view1.png
    ├──── 01.png
    ├──── ...
    ├──── 10.png
        ├──── inter_view10.png
    ├──── 11.png
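
The layout above suggests that one interpolated view is produced between each pair of neighbouring input views. A minimal sketch of that pairing, assuming the views are PNG files whose sorted filenames follow camera order; `adjacent_pairs` is a hypothetical helper for illustration, not part of `inference.py`:

```python
from pathlib import Path


def adjacent_pairs(img_dir):
    """Return (left, right) filename pairs for neighbouring views.

    Assumption: sorting the PNG filenames reproduces the camera order.
    """
    views = sorted(p.name for p in Path(img_dir).glob("*.png"))
    # zip the list with itself shifted by one to pair each view with its neighbour
    return list(zip(views, views[1:]))
```

With the twelve views `00.png` to `11.png` this gives eleven pairs, one per gap between cameras.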

You can increase --exp to get a denser set of interpolated viewpoints.
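
How `--exp` scales the output is not spelled out here. If it behaves like the recursive option in similar interpolation tools, where each level inserts one midpoint between every neighbouring pair (an assumption; check `inference.py` for the actual behaviour), the view count grows as in this sketch. `views_after_interpolation` is a hypothetical helper:

```python
def views_after_interpolation(num_views, exp):
    """Total number of views after `exp` rounds of midpoint insertion.

    Assumption: each round adds one new view between every neighbouring pair,
    so n views become 2 * n - 1.
    """
    total = num_views
    for _ in range(exp):
        total = 2 * total - 1
    return total
```

Under this assumption, twelve input views with `--exp=1` give 23 views in total, which matches the output tree above (12 originals plus 11 interpolated).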

Custom dataset

Prepare your own multi-view synchronized videos

Organize the videos as follows:

├──── YOUR_DIR/
    ├──── raw_videos/
        ├──── 00.mp4
        ├──── ...
        ├──── 11.mp4

Convert your videos to frames:
```
python extract_videos.py
```
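
For readers who want to see what this step involves, here is a rough sketch that builds one ffmpeg command per video. It is not the code of `extract_videos.py`; the `frames/<view>/` output layout, the five-digit frame names, and both function names are assumptions made for illustration, and it requires ffmpeg on the PATH when actually run.

```python
import subprocess
from pathlib import Path


def frame_extraction_commands(root):
    """Build one ffmpeg command per video under <root>/raw_videos.

    Assumption: frames of 00.mp4 are written to <root>/frames/00/00000.png, ...
    """
    root = Path(root)
    commands = []
    for video in sorted((root / "raw_videos").glob("*.mp4")):
        out_pattern = root / "frames" / video.stem / "%05d.png"
        commands.append(
            ["ffmpeg", "-i", str(video), "-start_number", "0", str(out_pattern)]
        )
    return commands


def extract_all(root):
    """Create the output folders and run ffmpeg on every video."""
    for cmd in frame_extraction_commands(root):
        Path(cmd[-1]).parent.mkdir(parents=True, exist_ok=True)
        subprocess.run(cmd, check=True)
```

Calling `extract_all("YOUR_DIR")` would then write one folder of frames per camera.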
Organize your frames into triplets and compress them into npz format.
Doing this by hand is cumbersome, but the provided script performs both steps:
```
python process-vimeo90k.py
```
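
The triplet idea can be illustrated as follows. This sketch assumes a triplet is three neighbouring cameras at the same timestamp, with the middle view serving as the interpolation target; the provided script may define triplets and archive keys differently. The key names `left`, `middle`, `right` and both function names are hypothetical.

```python
import numpy as np


def view_triplets(num_views):
    """Index triplets (left, middle, right) over neighbouring cameras."""
    return [(i, i + 1, i + 2) for i in range(num_views - 2)]


def save_triplet(path, left, middle, right):
    """Store one triplet of image arrays as a compressed .npz archive."""
    np.savez_compressed(path, left=left, middle=middle, right=right)
```

For twelve cameras, `view_triplets(12)` yields ten triplets, from `(0, 1, 2)` to `(9, 10, 11)`.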

Train

Run the training script with the following options (defined in parse_args):

```
python -m torch.distributed.launch --nproc_per_node=2 train.py --world_size=2 --epoch=100 --batch_size=32
```

Results

Citation

@article{hu2021multi,
  title={A Multi-user Oriented Live Free-viewpoint Video Streaming System Based On View Interpolation},
  author={Hu, Jingchuan and Guo, Shuai and Dong, Yu and Zhou, Kai and Xu, Jun and Song, Li},
  journal={arXiv preprint arXiv:2112.10603},
  year={2021}
}

vinet's Issues

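Questions about dataset processing

Hello. According to the paper, the cameras capture data at 1920x1080. Are the images fed to the network for training 224x224 crops taken at random from images of that resolution? Adjacent cameras are about 5 degrees apart, so during training does the model interpolate the middle view from left and right images that are 10 degrees apart? The paper says the network is trained in two stages; what learning rate is used for the second stage? Thank you.

Is there open-source code or a model available for testing?

Hello. I saw the free-viewpoint demo you generated with this algorithm, and the results look very good, especially in terms of speed! I am doing related research and would like to compare against your model on real-world data. Is there open-source code or a released model?
I would also like to ask about the camera layout: I see that your cameras are placed 5° apart. Was this angle chosen after repeated testing? If the angle is increased further, will it affect the final result?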