SaltiNet: scan-path prediction on 360 degree images using saliency volumes

التفاصيل البيبلوغرافية
العنوان: SaltiNet: scan-path prediction on 360 degree images using saliency volumes
المؤلفون: Kevin McGuinness, Noel E. O'Connor, Xavier Giro-i-Nieto, Marc Assens
المساهمون: Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions, Universitat Politècnica de Catalunya. GPI - Grup de Processament d'Imatge i Vídeo
المصدر: Recercat. Dipósit de la Recerca de Catalunya
instname
ICCV Workshops
UPCommons. Portal del coneixement obert de la UPC
Universitat Politècnica de Catalunya (UPC)
بيانات النشر: IEEE Press, 2018.
سنة النشر: 2018
مصطلحات موضوعية: FOS: Computer and information sciences, Artificial intelligence, scanpath, Computer science, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Image processing, 02 engineering and technology, 010501 environmental sciences, 01 natural sciences, Convolutional neural network, computer vision, Neural networks (Computer science), Imatges -- Processament -- Tècniques digitals, Salience (neuroscience), Machine learning, Informàtica::Intel·ligència artificial::Representació del coneixement [Àrees temàtiques de la UPC], convolutional neural networks, 0202 electrical engineering, electronic engineering, information engineering, Xarxes neuronals (Informàtica), Artificial vision, 0105 earth and related environmental sciences, Image processing--Digital techniques, Artificial neural network, business.industry, Deep learning, Intel·ligència artificial, Visió per ordinador, saliency prediction, deep learning, Pattern recognition, Enginyeria de la telecomunicació::Processament del senyal::Processament de la imatge i del senyal vídeo [Àrees temàtiques de la UPC], Visió artificial (Robòtica), Multimedia (cs.MM), Visualization, Eye tracking, 020201 artificial intelligence & image processing, eye gaze, business, Computer Science - Multimedia
الوصف: We introduce SaltiNet, a deep neural network for scanpath prediction trained on 360-degree images. The model is based on a temporal-aware novel representation of saliency information named the saliency volume. The first part of the network consists of a model trained to generate saliency volumes, whose parameters are fit by back-propagation computed from a binary cross entropy (BCE) loss over downsampled versions of the saliency volumes. Sampling strategies over these volumes are used to generate scanpaths over the 360-degree images. Our experiments show the advantages of using saliency volumes, and how they can be used for related tasks. Our source code and trained models available at https://github.com/massens/saliency-360salient-2017Test.
Winner of the Best Scan-path Award at the Salient360!: Visual attention modeling for 360 degrees Images Grand Challenge of ICME 2017. Presented at the ICCV 2017 Workshop on Egocentric Perception, Interaction and Computing (EPIC)
وصف الملف: application/pdf
اللغة: English
الوصول الحر: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::05c25d1b7b2c5b685a7fbc68b3872656Test
https://hdl.handle.net/2117/114891Test
حقوق: OPEN
رقم الانضمام: edsair.doi.dedup.....05c25d1b7b2c5b685a7fbc68b3872656
قاعدة البيانات: OpenAIRE