1 However, the methods discussed here can be adapted to multi-class problems ... For a multi-class problem with Ncl classes, the confusion matrix will have Ncl² entries.
Graphical Methods for Classifier Performance Evaluation

Maria Carolina Monard and Gustavo E. A. P. A. Batista

University of São Paulo – USP
Institute of Mathematics and Computer Science – ICMC
Department of Computer Science and Statistics – SCE
Laboratory of Computational Intelligence – LABIC
P. O. Box 668, 13560-970, São Carlos, SP, Brazil
{gbatista, mcmonard}@icmc.usp.br
Abstract. Evaluating the performance of classifiers is not as trivial as it might seem at first glance. Even the most widely used methods, such as measuring accuracy or error rate on a test set, have severe limitations. Two of the most prominent limitations of these measures are that they do not consider misclassification costs and that they can be misleading when the classes have very different prior probabilities. In recent years, several researchers have proposed alternative methods to evaluate the performance of learning systems. Some of these methods are based on the graphical evaluation of classifiers. Usually, a graphical evaluation lets the user analyze the performance of a classifier under different scenarios, for instance with different misclassification costs, and select the classifier parameter settings that provide the best result. The objective of this paper is to survey some of the most widely used graphical methods for performance evaluation, which do not rely on precise class and cost distribution information.
1 Introduction
In supervised learning, a set of n training examples is given to an inducer. Each example Ei is a tuple (~xi, yi), where ~xi is a vector of m feature values and yi is the class value. The main objective in supervised learning is to induce a general mapping from the vectors ~x to the class values y. Thus, the inducer should build a model, y = f(~x), of an unknown function f, also known as the concept function, which predicts y values for previously unseen examples. However, in most cases, the number of examples used to induce a model is not sufficient to completely characterize the function f. In fact, inducers are usually only able to induce a function h that approximates f, i.e., h(~x) ≈ f(~x), where h is known as the hypothesis of the concept function f. For classification problems, the y values are drawn from a discrete set of classes C = {C1, C2, . . . , CNcl}, where Ncl is the number of classes. Given a set of training examples, the learning algorithm outputs a classifier such that, given a new unlabelled example, it accurately predicts the label y.
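The setting above can be made concrete with a short sketch. The following is only an illustration, assuming scikit-learn is available; the choice of DecisionTreeClassifier as the inducer, the synthetic data, and all variable names are illustrative assumptions, not part of the original paper.

```python
# Minimal sketch of the supervised learning setting: an inducer receives
# n labeled examples (x_i, y_i) and outputs a hypothesis h approximating
# the unknown concept function f. Assumes scikit-learn; data is synthetic.
import numpy as np
from sklearn.tree import DecisionTreeClassifier
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
n, m = 200, 5
X = rng.normal(size=(n, m))                 # feature vectors ~x_i with m features
y = (X[:, 0] + X[:, 1] > 0).astype(int)     # class labels y_i drawn from C = {0, 1}

# Hold out part of the data to play the role of previously unseen examples
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# The inducer builds a hypothesis h from the training examples
inducer = DecisionTreeClassifier()
h = inducer.fit(X_train, y_train)

# h predicts class values for the unseen examples
y_pred = h.predict(X_test)
print("accuracy on unseen examples:", np.mean(y_pred == y_test))
```

As the paper argues, the single accuracy figure printed at the end is exactly the kind of summary measure whose limitations motivate the graphical methods surveyed here.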
Assuming the vectors ~x correspond to points in an m-dimensional space,