Files
ECCV2022-RIFE/README.md

163 lines
7.3 KiB
Markdown
Raw Normal View History

2021-05-17 14:51:02 +08:00
# RIFE - Real Time Video Interpolation
2021-12-10 13:27:43 +08:00
## [arXiv](https://arxiv.org/abs/2011.06294) | [YouTube](https://www.youtube.com/results?search_query=rife+interpolation&sp=CAM%253D) | [BiliBili](https://search.bilibili.com/all?keyword=SVFI&order=stow&duration=0&tids_1=0) | [Colab](https://colab.research.google.com/github/hzwer/arXiv2020-RIFE/blob/main/Colab_demo.ipynb) | [Tutorial](https://www.youtube.com/watch?v=gf_on-dbwyU&feature=emb_title)
2020-11-14 14:05:02 +08:00
2021-12-08 18:02:21 +08:00
16X interpolation results from two input images:
![Demo](./demo/I2_slomo_clipped.gif)
![Demo](./demo/D2_slomo_clipped.gif)
2021-03-04 17:14:22 +08:00
## Introduction
2021-12-16 12:33:14 +08:00
This project is the implement of [RIFE: Real-Time Intermediate Flow Estimation for Video Frame Interpolation](https://arxiv.org/abs/2011.06294). Currently, our model can run 30+FPS for 2X 720p interpolation on a 2080Ti GPU. It supports arbitrary-timestep interpolation between a pair of images.
2020-11-13 18:24:04 +08:00
2021-06-17 11:39:31 +08:00
## Software
2021-12-09 14:24:44 +08:00
[Flowframes](https://nmkd.itch.io/flowframes) | [Squirrel-RIFE(中文)](https://github.com/YiWeiHuang-stack/Squirrel-Video-Frame-Interpolation) | [Waifu2x-Extension-GUI](https://github.com/AaronFeng753/Waifu2x-Extension-GUI) | [Autodesk Flame](https://vimeo.com/505942142) | [SVP](https://www.svp-team.com/wiki/RIFE_AI_interpolation)
2021-03-10 15:11:31 +08:00
2021-12-09 14:01:21 +08:00
[RIFE-App(Paid)](https://grisk.itch.io/rife-app) | [Steam-VFI(Paid)](https://store.steampowered.com/app/1692080/SVFI/)
We are not responsible for and participating in the development of above software. According to the open source license, we respect the commercial behavior of other developers.
2021-12-09 14:24:44 +08:00
[VapourSynth-RIFE](https://github.com/HolyWu/vs-rife) | [RIFE-ncnn-vulkan](https://github.com/nihui/rife-ncnn-vulkan) | [VapourSynth-RIFE-ncnn-Vulkan](https://github.com/HomeOfVapourSynthEvolution/VapourSynth-RIFE-ncnn-Vulkan)
2021-12-16 12:28:41 +08:00
2021-12-16 12:33:14 +08:00
If you are a developer, welcome to follow [Practical-RIFE](https://github.com/hzwer/Practical-RIFE), which aims to make RIFE more practical for users by adding various features and design new models with faster speed.
2021-06-17 11:39:31 +08:00
## CLI Usage
2020-11-26 09:49:42 +08:00
### Installation
2020-11-12 20:27:43 +08:00
```
2020-11-26 09:49:42 +08:00
git clone git@github.com:hzwer/arXiv2020-RIFE.git
cd arXiv2020-RIFE
2020-12-02 19:05:01 +08:00
pip3 install -r requirements.txt
2020-11-12 20:27:43 +08:00
```
2021-06-17 18:02:06 +08:00
* Download the pretrained **HD** models from [here](https://drive.google.com/file/d/1APIzVeI-4ZZCEuIRE1m6WYfSCaOsi_7_/view?usp=sharing). (百度网盘链接:https://pan.baidu.com/share/init?surl=u6Q7-i4Hu4Vx9_5BJibPPA 密码:hfk3把压缩包解开后放在 train_log/\*)
2020-11-13 01:44:56 +08:00
2021-05-15 17:42:04 +08:00
* Unzip and move the pretrained parameters to train_log/\*
2020-11-12 23:39:28 +08:00
2021-05-15 22:32:20 +08:00
* This model is not reported by our paper, for our paper model please refer to [evaluation](https://github.com/hzwer/arXiv2020-RIFE#evaluation).
2020-11-26 09:49:42 +08:00
### Run
2020-11-26 11:34:11 +08:00
**Video Frame Interpolation**
2020-12-02 18:57:12 +08:00
You can use our [demo video](https://drive.google.com/file/d/1i3xlKb7ax7Y70khcTcuePi6E7crO_dFc/view?usp=sharing) or your own video.
2020-11-16 05:27:06 +08:00
```
2020-12-02 18:55:14 +08:00
python3 inference_video.py --exp=1 --video=video.mp4
2020-11-16 05:27:06 +08:00
```
2020-11-23 15:05:29 +08:00
(generate video_2X_xxfps.mp4)
2020-11-16 05:27:06 +08:00
```
2020-12-02 18:55:14 +08:00
python3 inference_video.py --exp=2 --video=video.mp4
2020-11-16 05:27:06 +08:00
```
2020-11-23 15:05:29 +08:00
(for 4X interpolation)
2020-11-16 05:27:06 +08:00
```
2021-02-27 17:00:53 +08:00
python3 inference_video.py --exp=1 --video=video.mp4 --scale=0.5
2020-12-13 23:55:04 +08:00
```
2021-02-27 17:48:21 +08:00
(If your video has very high resolution such as 4K, we recommend set --scale=0.5 (default 1.0). If you generate disordered pattern on your videos, try set --scale=2.0. This parameter control the process resolution for optical flow model.)
2020-12-13 23:55:04 +08:00
```
2020-12-03 18:04:13 +08:00
python3 inference_video.py --exp=2 --img=input/
```
(to read video from pngs, like input/0.png ... input/612.png, ensure that the png names are numbers)
```
2020-12-02 18:55:14 +08:00
python3 inference_video.py --exp=2 --video=video.mp4 --fps=60
2020-11-16 05:27:06 +08:00
```
2020-11-26 11:34:11 +08:00
(add slomo effect, the audio will be removed)
2020-11-18 11:29:08 +08:00
```
2020-12-02 18:55:14 +08:00
python3 inference_video.py --video=video.mp4 --montage --png
2020-11-17 19:19:21 +08:00
```
2021-11-10 12:15:51 +08:00
(if you want to montage the origin video and save the png format output)
2020-11-16 05:29:36 +08:00
2020-11-13 23:28:32 +08:00
**Image Interpolation**
2020-11-12 23:39:28 +08:00
```
2020-12-02 18:55:14 +08:00
python3 inference_img.py --img img0.png img1.png --exp=4
2020-11-16 21:37:23 +08:00
```
2020-11-14 02:54:24 +08:00
(2^4=16X interpolation results)
2020-11-16 21:37:23 +08:00
After that, you can use pngs to generate mp4:
```
2020-12-02 18:55:14 +08:00
ffmpeg -r 10 -f image2 -i output/img%d.png -s 448x256 -c:v libx264 -pix_fmt yuv420p output/slomo.mp4 -q:v 0 -q:a 0
2020-11-16 21:37:23 +08:00
```
You can also use pngs to generate gif:
```
2020-12-02 18:55:14 +08:00
ffmpeg -r 10 -f image2 -i output/img%d.png -s 448x256 -vf "split[s0][s1];[s0]palettegen=stats_mode=single[p];[s1][p]paletteuse=new=1" output/slomo.gif
2020-11-13 23:28:32 +08:00
```
2021-01-17 15:24:57 +01:00
### Run in docker
Place the pre-trained models in `train_log/\*.pkl` (as above)
2021-01-17 15:24:57 +01:00
Building the container:
```
docker build -t rife -f docker/Dockerfile .
```
Running the container:
```
docker run --rm -it -v $PWD:/host rife:latest inference_video --exp=1 --video=untitled.mp4 --output=untitled_rife.mp4
```
```
docker run --rm -it -v $PWD:/host rife:latest inference_img --img img0.png img1.png --exp=4
2021-01-17 15:24:57 +01:00
```
2021-01-17 16:11:24 +01:00
Using gpu acceleration (requires proper gpu drivers for docker):
```
docker run --rm -it --gpus all -v /dev/dri:/dev/dri -v $PWD:/host rife:latest inference_video --exp=1 --video=untitled.mp4 --output=untitled_rife.mp4
```
2020-11-16 05:27:06 +08:00
## Evaluation
2021-11-15 23:41:04 +08:00
Download [RIFE model](https://drive.google.com/file/d/1h42aGYPNJn2q8j_GVkS_yDu__G_UZ2GX/view?usp=sharing) or [RIFE_m model](https://drive.google.com/file/d/147XVsDXBfJPlyct2jfo9kpbL944mNeZr/view?usp=sharing) reported by our paper.
2021-03-04 12:33:56 +08:00
**UCF101**: Download [UCF101 dataset](https://liuziwei7.github.io/projects/VoxelFlow) at ./UCF101/ucf101_interp_ours/
2020-11-14 14:56:13 +08:00
2020-11-26 19:11:17 +08:00
**Vimeo90K**: Download [Vimeo90K dataset](http://toflow.csail.mit.edu/) at ./vimeo_interp_test
2020-11-26 19:12:36 +08:00
2020-11-26 19:11:17 +08:00
**MiddleBury**: Download [MiddleBury OTHER dataset](https://vision.middlebury.edu/flow/data/) at ./other-data and ./other-gt-interp
2021-03-04 12:33:56 +08:00
2021-04-13 16:36:35 +08:00
**HD**: Download [HD dataset](https://github.com/baowenbo/MEMC-Net) at ./HD_dataset. We also provide a [google drive download link](https://drive.google.com/file/d/1iHaLoR2g1-FLgr9MEv51NH_KQYMYz-FA/view?usp=sharing).
2020-11-14 17:02:45 +08:00
```
2021-05-07 11:27:46 +08:00
# RIFE
2021-03-04 12:33:56 +08:00
python3 benchmark/UCF101.py
2021-08-13 16:34:41 +08:00
# "PSNR: 35.282 SSIM: 0.9688"
2020-12-02 18:55:14 +08:00
python3 benchmark/Vimeo90K.py
2021-08-13 16:34:41 +08:00
# "PSNR: 35.615 SSIM: 0.9779"
2021-03-09 17:52:10 +08:00
python3 benchmark/MiddleBury_Other.py
2021-08-13 16:49:25 +08:00
# "IE: 1.956"
2021-03-04 12:33:56 +08:00
python3 benchmark/HD.py
2021-08-13 16:34:41 +08:00
# "PSNR: 32.14"
2021-11-15 23:41:04 +08:00
# RIFE_m
python3 benchmark/HD_multi_4X.py
2021-11-15 23:44:09 +08:00
# "PSNR: 22.96(544*1280), 31.87(720p), 34.25(1080p)"
2020-11-14 17:02:45 +08:00
```
2020-12-02 19:12:48 +08:00
## Training and Reproduction
2021-08-13 16:36:05 +08:00
Download [Vimeo90K dataset](http://toflow.csail.mit.edu/).
2020-12-02 18:40:39 +08:00
We use 16 CPUs, 4 GPUs and 20G memory for training:
```
2020-12-14 11:50:43 +08:00
python3 -m torch.distributed.launch --nproc_per_node=4 train.py --world_size=4
2020-12-02 18:40:39 +08:00
```
2020-11-13 11:16:57 +08:00
## Citation
2020-11-13 23:28:32 +08:00
2020-11-13 10:27:30 +08:00
```
@article{huang2020rife,
title={RIFE: Real-Time Intermediate Flow Estimation for Video Frame Interpolation},
author={Huang, Zhewei and Zhang, Tianyuan and Heng, Wen and Shi, Boxin and Zhou, Shuchang},
journal={arXiv preprint arXiv:2011.06294},
year={2020}
}
```
## Reference
2020-12-02 19:00:32 +08:00
2020-11-13 11:16:57 +08:00
Optical Flow:
2020-11-26 11:34:11 +08:00
[ARFlow](https://github.com/lliuz/ARFlow) [pytorch-liteflownet](https://github.com/sniklaus/pytorch-liteflownet) [RAFT](https://github.com/princeton-vl/RAFT) [pytorch-PWCNet](https://github.com/sniklaus/pytorch-pwc)
2020-11-13 11:16:57 +08:00
2020-12-02 19:02:47 +08:00
Video Interpolation:
2021-03-09 10:59:38 +08:00
[DVF](https://github.com/lxx1991/pytorch-voxel-flow) [TOflow](https://github.com/Coldog2333/pytoflow) [SepConv](https://github.com/sniklaus/sepconv-slomo) [DAIN](https://github.com/baowenbo/DAIN) [CAIN](https://github.com/myungsub/CAIN) [MEMC-Net](https://github.com/baowenbo/MEMC-Net) [SoftSplat](https://github.com/sniklaus/softmax-splatting) [BMBC](https://github.com/JunHeum/BMBC) [EDSC](https://github.com/Xianhang/EDSC-pytorch) [EQVI](https://github.com/lyh-18/EQVI)
2021-02-12 19:23:29 +08:00
2021-03-04 17:14:22 +08:00
## Sponsor
2021-12-16 12:33:14 +08:00
Many thanks to [Grisk](https://grisk.itch.io/rife-app).
2021-03-04 17:14:22 +08:00
2021-02-12 19:23:29 +08:00
感谢支持 Paypal Sponsor: https://www.paypal.com/paypalme/hzwer
<img width="160" alt="image" src="https://cdn.luogu.com.cn/upload/image_hosting/5h3609p1.png"><img width="160" alt="image" src="https://cdn.luogu.com.cn/upload/image_hosting/yi3kcwnw.png">