Video Stream Retrieval of Unseen Queries using Semantic Memory

Spencer Cappallo [email protected]

Thomas Mensink [email protected]

Cees G. M. Snoek [email protected]

Institute of Informatics, University of Amsterdam, Science Park 904, Amsterdam, The Netherlands

Search among live, user-broadcast videos is an under-addressed and increasingly relevant challenge. Every day, more content is shared via services like Meerkat, Periscope, and Twitch. As streaming video becomes more prevalent, retrieval systems must address the unique consequences of live video. In contrast to pre-recorded videos, live streams are frequently transmitted without any accompanying textual description, and even when text is available, there is no guarantee that it adequately describes the content of a live broadcast. For this reason, the video content itself must be used.

The full range of possible future search queries is unknowable, which motivates framing stream retrieval as a no-example retrieval problem, where visual examples of a query are assumed to be unavailable beforehand. We adapt existing approaches from the zero-shot classification community and rely on a word2vec semantic embedding to relate textual queries to pre-trained visual classifier confidence scores [2]. For a given query $q$, we score a stream with $\mathrm{score}(q, x_t) = s(q)^\top \phi(x_t)$, where $x_t$ is the vector of softmax scores of a deep neural network over a set of pre-trained classifiers $C$, $s(q)$ denotes the semantic similarity between $q$ and $C$ in the semantic embedding, and $\phi(x_t)$ encodes the classifier scores in a sparse manner.
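As an illustration, the sketch below computes $\mathrm{score}(q, x_t) = s(q)^\top \phi(x_t)$ for a single frame. It assumes precomputed word2vec vectors for the query and the classifier labels, cosine similarity for $s(q)$, and a simple top-$k$ sparsification for $\phi$; the similarity measure, the top_k value, and the function names are illustrative assumptions rather than the exact choices of the paper.

```python
import numpy as np

def semantic_similarity(query_vec, label_vecs):
    """s(q): similarity between the query embedding and each classifier-label
    embedding, here taken to be cosine similarity in the word2vec space."""
    q = query_vec / np.linalg.norm(query_vec)
    labels = label_vecs / np.linalg.norm(label_vecs, axis=1, keepdims=True)
    return labels @ q  # one similarity value per pre-trained classifier in C

def sparse_encode(softmax_scores, top_k=5):
    """phi(x_t): a sparse encoding of the classifier scores, here keeping only
    the top_k confidences and zeroing out the rest."""
    phi = np.zeros_like(softmax_scores)
    top = np.argsort(softmax_scores)[-top_k:]
    phi[top] = softmax_scores[top]
    return phi

def stream_score(query_vec, label_vecs, softmax_scores, top_k=5):
    """score(q, x_t) = s(q)^T phi(x_t) for one frame of one stream."""
    return semantic_similarity(query_vec, label_vecs) @ sparse_encode(softmax_scores, top_k)
```

At query time, each active stream can be scored in this way and the streams ranked by their scores.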

Traditional video tasks assume the whole video is available for analysis, a luxury that a streaming setting does not afford. Moreover, especially in longer streams, content can change significantly and abruptly throughout the broadcast, so a stream retrieval approach must provide an up-to-date representation of the stream content. We explore three ways to emphasize only recent stream content. Two of these methods, Mean Memory Pooling and Max Memory Pooling, perform pooling over a fixed window stretching from the past into the present. We introduce Memory Welling,

$$w(x_t) = \max\!\left(\frac{m-1}{m}\, w(x_{t-1}) + \frac{1}{m}\, x_t - \beta,\ 0\right),$$

where the current value of the well, $w$, builds on its previous state, diminished by a memory parameter $m$ and a constant leaking term $\beta$. Memory Wells emphasize recent, reliable content.
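A minimal sketch of this recurrence is given below, assuming $x_t$ is the per-frame vector of classifier softmax scores; the default values of m and beta are placeholders, not the settings used in our experiments.

```python
import numpy as np

class MemoryWell:
    """Memory Welling: w(x_t) = max(((m - 1) / m) * w(x_{t-1}) + x_t / m - beta, 0).
    The defaults for m and beta are illustrative placeholders."""

    def __init__(self, num_classifiers, m=30.0, beta=1e-3):
        self.m = m          # memory parameter: larger m retains past content longer
        self.beta = beta    # constant leaking term that drains weak, unreliable scores
        self.w = np.zeros(num_classifiers)

    def update(self, x_t):
        """Fold the newest frame's classifier scores x_t into the well state."""
        self.w = np.maximum((self.m - 1.0) / self.m * self.w + x_t / self.m - self.beta, 0.0)
        return self.w
```

One natural pairing, assumed here rather than stated above, is to score the well state against a query exactly as a single frame is scored, i.e. $s(q)^\top \phi(w(x_t))$, so that the ranking always reflects recent stream content.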

We test our approach and competitive baselines on the ActivityNet data set and a motivated subset of the FCVID data set, and synthesize two additional data sets of longer videos through concatenation of randomly chosen videos. Two tasks are identified and targeted: Instantaneous Retrieval of relevant video streams at a single moment, and Continuous Retrieval of streams relevant to a query over a long viewing session. Scoring metrics for both tasks are developed, and videos are evaluated in a simulated streaming setting. To test responsiveness to unseen queries, the test set queries are disjoint from those of the validation set. In both target tasks, and on all data sets, either Memory Welling or an adaptation of it, Max Memory Welling, performs the strongest. We further validate our approach through comparison to the state of the art on a traditional, non-streaming video task: Max Memory Welling demonstrates improvement over [1] on zero-shot event retrieval within the TRECVID MED 13 data set, using the setting described in [1].

[1] M. Jain, J. van Gemert, T. Mensink, and C. G. M. Snoek. Objects2action: Classifying and localizing actions without any video example. In ICCV, 2015.

[2] M. Norouzi, T. Mikolov, S. Bengio, Y. Singer, J. Shlens, A. Frome, G. S. Corrado, and J. Dean. Zero-shot learning by convex combination of semantic embeddings. In ICLR, 2014.