We introduce PathGAN, a deep neural network for visual scanpath prediction trained on adversarial examples. A visual scanpath is defined as the sequence of fixation points over an image defined by a human observer with its gaze. PathGAN is composed of two parts, the generator and the discriminator. Both parts extract features from images using off-the-shelf networks, and train recurrent layers to generate or discriminate scanpaths accordingly. In scanpath prediction, the stochastic nature of the data makes it very difficult to generate realistic predictions using supervised learning strategies, but we adopt adversarial training as a suitable alternative. Our experiments prove how PathGAN improves the state of the art of visual scanpath prediction on the iSUN and Salient360! datasets.
Ireland ->
Dublin City University ->
Publication Type = Conference or Workshop Item
Ireland ->
Dublin City University ->
Subject = Computer Science: Artificial intelligence
Ireland ->
Dublin City University ->
Subject = Computer Science: Image processing
Ireland ->
Dublin City University ->
Status = Published
Ireland ->
Dublin City University ->
DCU Faculties and Centres = DCU Faculties and Schools: Faculty of Engineering and Computing: School of Electronic Engineering
Ireland ->
Dublin City University ->
DCU Faculties and Centres = Research Initiatives and Centres: INSIGHT Centre for Data Analytics
Ireland ->
Dublin City University ->
Subject = Computer Science: Machine learning
Noel E. O'Connor,
Kevin McGuinness,
Xavier Giro-i-Nieto,
Marc Assens