A Technical Report for Visual Attention Estimation in HMD Challenge

Chun Tsao, Po Chyi Su

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

The 360° video, also known as omnidirectional video (ODV), immersive video or spherical video, has become increasing popular and drawn great research attention. To achieve the effectiveness of 360° videos, it is quite important to understand how human perceive and interact with 360° videos so that efficient techniques for their encoding, transmission, and rendering can be developed. The Visual Attention Estimation in HMD (Head Mounted Display) is a competition to encourage contestants to design lightweight models for predicting human eye attention in 360°videos. The models need to not only achieve high accuracy but also have outstanding performance on HMD devices. We employ the approach of ATSal [1] and fine-tune the expert models with the dataset provided in this event to achieve 0.840 AUC-J, 0.476 CC, 3.206 KLC, 1.478 NSS, 0.412 SIM, currently ranked the 1st place in the qualification competition leaderboard.

Original languageEnglish
Title of host publicationProceedings - 2021 4th IEEE International Conference on Artificial Intelligence and Virtual Reality, AIVR 2021
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages143-144
Number of pages2
ISBN (Electronic)9781665432252
DOIs
StatePublished - 2021
Event4th IEEE International Conference on Artificial Intelligence and Virtual Reality, AIVR 2021 - Taichung, Taiwan
Duration: 15 Nov 202117 Nov 2021

Publication series

NameProceedings - 2021 4th IEEE International Conference on Artificial Intelligence and Virtual Reality, AIVR 2021

Conference

Conference4th IEEE International Conference on Artificial Intelligence and Virtual Reality, AIVR 2021
Country/TerritoryTaiwan
CityTaichung
Period15/11/2117/11/21

Keywords

  • 360°videos
  • HMD
  • Panoramic videos
  • Video attention estimation

Fingerprint

Dive into the research topics of 'A Technical Report for Visual Attention Estimation in HMD Challenge'. Together they form a unique fingerprint.

Cite this