CPPP/UFMS at ImageCLEF 2014: Robot Vision Task

Rodrigo de Carvalho Gomes, Lucas Correia Ribas, Amaury Antônio de Castro Junior, Wesley Nunes Gonçalves

Federal University of Mato Grosso do Sul - Ponta Porã Campus
Rua Itibiré Vieira, s/n, CEP 79907-414, Ponta Porã - MS, Brazil
[email protected],
[email protected], {amaury.junior,wesley.goncalves}@ufms.br
Abstract. This paper describes the participation of the CPPP/UFMS group in the Robot Vision task. We applied the spatial pyramid matching method proposed by Lazebnik et al., which extends bag-of-visual-words to spatial pyramids by concatenating histograms of local features found in increasingly fine sub-regions. To form the visual vocabulary, k-means clustering was applied to a random subset of images from the training dataset. After that, the images are classified using a pyramid match kernel and the k-nearest neighbors classifier. The system has shown promising results, particularly for object recognition.

Key words: Scene recognition, object recognition, spatial pyramid matching
1 Introduction
In recent years, robotics has achieved important advances, such as intelligent industrial devices that are increasingly accurate and efficient. Despite these advances, most robots still represent the surrounding environment by means of a map with information about obstacles and free spaces. To handle more complex autonomous tasks, robots should be able to get a better understanding of images. In particular, the ability to identify scenes, such as office and kitchen, as well as objects, is an important step toward performing complex tasks [1]. Thus, place localization and object recognition become a fundamental part of image understanding for robot localization [2].

This paper presents the participation of our group in the 6th edition of the Robot Vision challenge1 [3, 4]. This challenge addresses the problem of semantic place classification and object recognition. For this task, the bag-of-visual-words approach (BOW) [5, 6] is one of the most promising approaches available. Although the approach has advantages, it also has one major drawback: the absence of spatial information. To overcome this drawback, a spatial pyramid
1 http://www.imageclef.org/2014/robot
framework combined with local feature extractors, such as the Scale Invariant Feature Transform (SIFT) [7] and Speeded Up Robust Features (SURF) [8], was proposed by Lazebnik et al. [9]. This method showed significantly improved performance on challenging scene categorization tasks.

The image recognition system used in our participation is based on the improved BOW and multi-class classifiers. Experimental results have shown that the system provides promising results, in particular for the object recognition task. Among four systems, the proposed system ranked second, using k = 400 clusters and M = 150 images for training the vocabulary.

The remainder of this paper is organized as follows. Section 2 presents the image recognition system used by our group in the Robot Vision challenge. The experiments and results are described in Section 3. Finally, conclusions and future work are discussed in Section 4.
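The core of the spatial pyramid idea can be sketched in a few lines: given a grid of visual-word indices (one per image patch), the image feature is the concatenation of word histograms computed over sub-regions at each pyramid level, weighted so that finer levels count more. The following is a minimal illustrative sketch (function name and the simple even split of the grid are our assumptions, not the authors' code):

```python
import numpy as np

def spatial_pyramid_histogram(words, n_words, levels=2):
    """Concatenate weighted visual-word histograms over increasingly
    fine sub-regions (minimal sketch of the spatial pyramid scheme).

    words   : 2D array of visual-word indices, one per grid patch.
    n_words : vocabulary size (number of k-means clusters).
    levels  : finest pyramid level L; level l has 2^l x 2^l cells.
    """
    h, w = words.shape
    feats = []
    for level in range(levels + 1):
        cells = 2 ** level
        # Level weights as in Lazebnik et al.: 1/2^L for level 0,
        # 1/2^(L-l+1) for level l > 0, so finer levels weigh more.
        if level == 0:
            weight = 1.0 / 2 ** levels
        else:
            weight = 1.0 / 2 ** (levels - level + 1)
        for i in range(cells):
            for j in range(cells):
                # Histogram of words falling inside this sub-region.
                block = words[i * h // cells:(i + 1) * h // cells,
                              j * w // cells:(j + 1) * w // cells]
                hist = np.bincount(block.ravel(), minlength=n_words)
                feats.append(weight * hist.astype(float))
    return np.concatenate(feats)
```

For L = 2 levels and a vocabulary of k words, this yields a feature of dimension (1 + 4 + 16) * k, matching the "concatenating histograms of increasingly fine sub-regions" description above.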
2 Image Recognition System
In this section, we describe the image recognition system used in the challenge. The system consists of three steps: i) feature extraction; ii) spatial pyramid matching; iii) classification. The following sections describe each step in detail.
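To make the final step concrete, classification with a pyramid match kernel and k-nearest neighbors can be sketched as follows. On concatenated, weighted pyramid histograms the pyramid match reduces to histogram intersection; the function names and the majority-vote tie handling below are illustrative assumptions, not the exact implementation:

```python
import numpy as np

def intersection_kernel(a, b):
    # Histogram intersection: sum of element-wise minima of two
    # (already weighted) pyramid histograms.
    return np.minimum(a, b).sum()

def knn_classify(query, train_feats, train_labels, k=1):
    """Label a query pyramid feature by majority vote among the k
    training features with the highest kernel similarity (sketch)."""
    sims = np.array([intersection_kernel(query, f) for f in train_feats])
    top = np.argsort(sims)[::-1][:k]          # k most similar neighbors
    labels, counts = np.unique(np.asarray(train_labels)[top],
                               return_counts=True)
    return labels[np.argmax(counts)]
```

With k = 1 this degenerates to nearest-neighbor matching under the intersection kernel, which is the simplest instantiation of step iii).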
2.1 Feature Extraction
In the feature extraction step, the system extracts SIFT descriptors from 16 × 16 pixel patches computed over a grid with a spacing of 8 pixels. For each patch i, a 128-dimensional descriptor is calculated, i.e., a vector ϕi ∈