Abstract
Accurately estimating visual quality as perceived by humans is crucial for modern multimedia systems and, given how easily humans do it, a surprisingly difficult task for computers. The complexity constraints imposed by real-time applications make the problem even more challenging. This paper studies the application of a neural network-based spatial model of distortion sensitivity to the quality prediction of spatio-temporal videos. We propose a simple yet effective adaptation of the loss function to cope with saturation effects in human quality ratings. This adaptation drastically decreases the number of training iterations needed for networks to replicate psychophysical human responses. Our experimental results show that the prediction performance of the spatio-temporal PSNR improves significantly when it is compensated for spatial distortion sensitivity, while the advantage of low complexity is maintained.
Original language | English |
---|---|
Title of host publication | 2019 IEEE 29th International Workshop on Machine Learning for Signal Processing, MLSP 2019 |
Publisher | IEEE Computer Society |
ISBN (Electronic) | 9781728108247 |
Publication status | Published - 2019 Oct |
Event | 29th IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2019 - Pittsburgh, United States; Duration: 2019 Oct 13 → 2019 Oct 16 |
Publication series
Name | IEEE International Workshop on Machine Learning for Signal Processing, MLSP |
---|---|
Volume | 2019-October |
ISSN (Print) | 2161-0363 |
ISSN (Electronic) | 2161-0371 |
Conference
Conference | 29th IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2019 |
---|---|
Country/Territory | United States |
City | Pittsburgh |
Period | 2019 Oct 13 → 2019 Oct 16 |
Bibliographical note
Publisher Copyright: © 2019 IEEE.
Keywords
- Visual perception
- distortion sensitivity
- neural network
- video compression
- video quality
ASJC Scopus subject areas
- Human-Computer Interaction
- Signal Processing