iVSR FFmpeg plugin - iVSR SDK based

The folder ivsr_ffmpeg_plugin enables model inference using FFmpeg with iVSR SDK as backend. It provides additional ivsr backend for the DNN interface called by the dnn_processing filter.
The patches included in patches folder are specifically for FFmpeg n7.1.

How to run inference with FFmpeg-plugin

To run inference with iVSR SDK, you need to specify ivsr as the backend for the dnn_processing filter. Here is an example of how to do it: dnn_processing=dnn_backend=ivsr.
Additionally, there are other parameters that you can use. These parameters are listed in the table below:

AVOption name	Description	Default value	Recommended value(s)
dnn_backend	DNN backend framework name	native	ivsr
model	path to model file	NULL	Available full path of the released model files
input	input name of the model	NULL	input
output	output name of the model	NULL	output
device	device for inference task	CPU	CPU or GPU
model_type	type for models	0	0 for Enhanced BasicVSR, 1 for SVP models, 2 for Enhanced EDSR, 3 for one CUSTOM VSR, 4 for TSENet
normalize_factor	factor for normalization	1.0	255.0 for Enhanced EDSR, 1.0 for other models supported in current version
num_streams	number of execution streams for the throughput mode (now valid only for GPU devices).	1	use `benchmark_app` (a tool provided by OpenVINO Toolkit), to get the appropriate value for the best throughput
extension	extension lib file full path, required for loading Enhanced BasicVSR model
op_xml	custom op xml file full path, required for loading Enhanced BasicVSR model
nif	number of input frames in batch sent to the DNN backend	1	3 for Enhanced BasicVSR, 1 for other models supported in current version
nireq	number of request	0	use the default setting or set it to match the number of cpu cores

Here are some examples of FFmpeg command lines to run inference with the supported models using the ivsr backend.

Command sample to run Enhanced BasicVSR inference, the input pixel format supported by the model is rgb24.

cd <iVSR project path>/ivsr_ffmpeg_plugin/ffmpeg
./ffmpeg -i <your test video> -vf format=rgb24,dnn_processing=dnn_backend=ivsr:model=<basic_vsr_model.xml>:input=input:output=output:nif=3:device=<CPU or GPU>:extension=<iVSR project path>/ivsr_ov/based_on_openvino_2022.3/openvino/bin/intel64/Release/libcustom_extension.so:op_xml=<iVSR project path>/ivsr_ov/based_on_openvino_2022.3/openvino/flow_warp_cl_kernel/flow_warp.xml test_out.mp4

Please note that for the Enhanced BasicVSR model, you need to set the extension and op_xml options (with backend_configs) in the command line. After applying OpenVINO's patches and building OpenVINO, the extension lib file is located in <OpenVINO folder>/openvino/bin/intel64/Release/libcustom_extension.so, and the op xml file is located in <OpenVINO folder>/openvino/flow_warp_cl_kernel/flow_warp.xml.

Command sample to run SVP models inference. If the supported input pixel format of the model variance is rgb24, set the preceeding format as is to avoid unnecessary layout conversion:

cd <iVSR project path>/ivsr_ffmpeg_plugin/ffmpeg
./ffmpeg -i <your test video> -vf format=rgb24,dnn_processing=dnn_backend=ivsr:model=<svp_model.xml>:input=input:output=output:nif=1:device=<CPU or GPU>:model_type=1 -pix_fmt yuv420p test_out.mp4

If the model variance supports Y-input, set the preceeding format as YUV:

./ffmpeg -i <your test video> -vf format=yuv420p,dnn_processing=dnn_backend=ivsr:model=<svp_model.xml>:input=input:output=output:nif=1:device=<CPU or GPU>:model_type=1 -pix_fmt yuv420p test_out.mp4

Command sample to run Enhanced EDSR inference, the input pixel format supported by the model is rgb24.

cd <iVSR project path>/ivsr_ffmpeg_plugin/ffmpeg
./ffmpeg -i <your test video> -vf format=rgb24,dnn_processing=dnn_backend=ivsr:model=<edsr_model.xml>:input=input:output=output:nif=1:device=<CPU or GPU>:model_type=2:normalize_factor=255.0 -pix_fmt yuv420p test_out.mp4

Command sample to run CUSTOM VSR inference. Note the input pixel format supported by this model is yuv420p, and its input shape is [1, (Y channel)1, H, W], output shape is [1, 1, 2xH, 2xW].

cd <iVSR project path>/ivsr_ffmpeg_plugin/ffmpeg
./ffmpeg -i <your test video> -vf format=yuv420p,dnn_processing=dnn_backend=ivsr:model=<customvsr_model.xml>:input=input:output=output:nif=1:nireq=1:device=CPU:model_type=3 -pix_fmt yuv420p test_out.mp4

Command sample to run TSENet model, the input pixel format supported by the model is rgb24.

cd <iVSR project path>/ivsr_ffmpeg_plugin/ffmpeg
./ffmpeg -i <your test video> -vf format=rgb24,dnn_processing=dnn_backend=ivsr:model=<tsenet_model.xml>:input=input:output=output:nif=1:device=<CPU or GPU>:model_type=4 -pix_fmt yuv420p test_out.mp4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

iVSR FFmpeg plugin - iVSR SDK based

How to run inference with FFmpeg-plugin

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

iVSR FFmpeg plugin - iVSR SDK based

How to run inference with FFmpeg-plugin