Git Product home page Git Product logo

3dpsd-vqa's Introduction

Power of Tempospatially Unified Spectral Density for Perceptual Video Quality Assessment

This is the MATLAB implementation of our paper:

Mohammed A. Aabed, Gukyeong Kwon, and Ghassan AlRegib, "Power of tempospatially unified spectral density for perceptual video quality assessment," 2017 IEEE International Conference on Multimedia and Expo (ICME), Hong Kong, 2017, pp. 1476-1481.
[arXiv] [IEEE]

This paper is recognized as a Finalist of the World's FIRST 10K Best Paper Award (top 3% paper) @ ICME 2017.

Abstract

We propose a perceptual video quality assessment (PVQA) metric for distorted videos by analyzing the power spectral density (PSD) of a group of pictures. This is an estimation approach that relies on the changes in video dynamic calculated in the frequency domain and are primarily caused by distortion. We obtain a feature map by processing a 3D PSD tensor obtained from a set of distorted frames. This is a full-reference tempospatial approach that considers both temporal and spatial PSD characteristics. This makes it ubiquitously suitable for videos with varying motion patterns and spatial contents. Our technique does not make any assumptions on the coding conditions, streaming conditions or distortion. This approach is also computationally inexpensive which makes it feasible for real-time and practical implementations. We validate our proposed metric by testing it on a variety of distorted sequences from PVQA databases. The results show that our metric estimates the perceptual quality at the sequence level accurately. We report the correlation coefficients with the differential mean opinion scores (DMOS) reported in the databases. The results show high and competitive correlations compared with the state of the art techniques.

Temporally Unified Power Spectral Density

We develop a temporally unified power spectral density feature and show that the feature can capture different types, levels, and scene characteristics in a systematic way.

Computational Efficiency

The proposed method utilizes 3D PSD which can be simply calcuated using 3D DFT. We show the computatial efficiency of proposed method by comarpaing the computation time required to estimate the quality score of a video with 120 frames. Below are the information about PC that we used for the simulation and the comparision of required computation time from proposed method and other VQA metrics.

  • PC information: CORETM i7-6700K CPU @4.00 GHz, 32.0 GB RAM, MATLAB R2015 (b)
Metric VIF NQM Proposed
Time [sec] 255.729 59.490 15.030

Usage & Output Example

Run main.m and below is an example of output:

...
3D PSD calculation (tensor 13 / 15)... 
3D PSD calculation (tensor 14 / 15)... 
3D PSD calculation (tensor 15 / 15)... 
Video quality score: 0.101

Citation

If you find our paper and repository useful, please consider citing our paper:

@inproceedings{aabed2017power,
  title={Power of tempospatially unified spectral density for perceptual video quality assessment},
  author={Aabed, Mohammed A and Kwon, Gukyeong and AlRegib, Ghassan},
  booktitle={Multimedia and Expo (ICME), 2017 IEEE International Conference on},
  pages={1476--1481},
  year={2017},
  organization={IEEE}
}

3dpsd-vqa's People

Contributors

gukyeongkwon avatar

Stargazers

 avatar

Watchers

 avatar  avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.