Mandatory Fields

Authors

Mark Marsden, Eva Mohedano, Kevin McGuinness, Andrea Calafell, Xavier Giró-i-Nieto, Noel O’Connor, Jiang Zhou, Lucas Azevedo, Tobias Daudert, Brian Davis, Manuela Hürlimann, Haithem Afli, Jinhua Du, Debasis Ganguly, Wei Li, Andy Way, and Alan Smeaton

Conference Title

TRECVid 2016

Title of Paper

Dublin City University and Partners’ Participation in the INS and VTT Tracks at TRECVid 2016

Year

2016

Month

November

Status

Published

Peer Reviewed

Times Cited

()

Optional Fields

Search Keyword

Editors

Start Page

End Page

Location

Gaithersburg, MD., US

Start Date

End Date

Abstract

Dublin City University participated with a consortium of colleagues from NUI Galway and Universitat Polit`ecnica de Catalunya in two tasks in TRECVid 2016, Instance Search (INS) and Video to Text (VTT). For the INS task we developed a framework consisting of face detection and representation and place detection and representation, with a user annotation of top-ranked videos. For the VTT task we ran 1,000 concept detectors from the VGG-16 deep CNN on 10 keyframes per video and submitted 4 runs for caption re-ranking, based on BM25, Fusion, word2vec and a fusion of baseline BM25 and word2vec. With the same pre-processing for caption generation we used an open source image-to-caption CNN-RNN toolkit NeuralTalk2 to generate a caption for each keyframe and combine them.

Funded By

URL

http://doras.dcu.ie/21484/1/TRECVid2016(7).pdf

DOI Link

Grant Details

Funding Body

Science Foundation Ireland (SFI)

Grant Details

13/RC/2106