Dublin City University and Partners’ Participation in the INS and VTT Tracks at TRECVid 2016
Publication Type:
Refereed Conference Meeting Proceeding
Abstract:
Dublin City University participated with a consortium of colleagues from NUI Galway and Universitat Polit`ecnica de Catalunya in two tasks in TRECVid 2016, Instance Search (INS) and Video to Text (VTT). For the INS task we developed a framework consisting of face detection and representation and place detection and representation, with a user annotation of top-ranked videos. For the VTT task we ran 1,000 concept detectors from the VGG-16 deep CNN on 10 keyframes per video and submitted 4 runs for caption re-ranking, based on BM25, Fusion, word2vec and a fusion of baseline BM25 and word2vec. With the same pre-processing for caption generation we used an open source image-to-caption CNN-RNN toolkit NeuralTalk2 to generate a caption for each keyframe and combine them.
Conference Name:
TRECVid
Proceedings:
Proceedings of TRECVid
Digital Object Identifer (DOI):
10.na
Publication Date:
14/11/2016
Conference Location:
United States of America
Research Group:
Institution:
Dublin City University (DCU)
Open access repository:
Yes