ACTIVE LEARNING FOR SOUND EVENT CLASSIFICATION BY CLUSTERING UNLABELED DATA

Zhao Shuyang, Toni Heittola, Tuomas Virtanen
Department of Signal Processing, Tampere University of Technology, Finland

Background

Training a sound event classifier requires annotated recordings:

- Audio data is easy to collect.
- Annotation is time-consuming.

Idea: use abundant unlabeled data to make the annotation effort as effective as possible.

Evaluation

Dataset: UrbanSound 8K

- 8732 labeled sound segments.
- 10 sound event classes in open urban space.
- Cross-validation: 10-fold.

Setup

- Labels are produced by simulating a limited number of labeling responses (the labeling budget) according to the ground truth.
- The number of clusters is set to 1/4 of the number of unlabeled data points.
- The supervised learning setup (features and model) follows the UrbanSound SVM baseline. Features are statistics of the MFCCs within each segment: mean, median, variance, minimum, maximum, skewness, etc.
- The method is compared with reference methods: random sampling (baseline), certainty-based active learning (CRTAL), and semi-supervised learning (SSL).
- Experiments are repeated five times and the average performance is reported.

Proposed method

Medoid-based active learning (MAL): partition the data into clusters and annotate only the medoid segments, that is, the most central member of each cluster.

- Medoids are assured to span different local distributions.
- A labeled medoid can be used to derive predicted labels for the other members of its cluster.
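As a rough illustration of how labels could be derived from annotated medoids, the Python sketch below spends a simulated labeling budget on medoids and copies each medoid's label to the rest of its cluster. The function name `derive_labels`, the `oracle` mapping, and the data layout are hypothetical and chosen for illustration; this is not the authors' implementation. It also assumes that medoids are annotated in the order given and that clusters whose medoid was not annotated stay unlabeled, neither of which is stated on the poster.

```python
def derive_labels(assignments, medoids, oracle, budget):
    """Annotate up to `budget` medoids and propagate each label to its cluster.

    assignments: assignments[i] is the cluster index of segment i
    medoids:     medoids[c] is the segment index of the medoid of cluster c
    oracle:      mapping segment index -> ground-truth label (simulated annotator)
    budget:      maximum number of labeling responses
    Returns a dict: segment index -> (label, "annotated" or "predicted").
    """
    # Assumption: the budget is spent on medoids in the order they are listed.
    queried = medoids[:budget]
    annotated = {m: oracle[m] for m in queried}
    cluster_label = {c: oracle[m] for c, m in enumerate(queried)}

    labels = {}
    for seg, cluster in enumerate(assignments):
        if seg in annotated:
            # An annotated label takes precedence over a predicted one.
            labels[seg] = (annotated[seg], "annotated")
        elif cluster in cluster_label:
            labels[seg] = (cluster_label[cluster], "predicted")
        # Segments in clusters without an annotated medoid stay unlabeled.
    return labels


# Toy example: 6 segments in 3 clusters, budget of 2 labeling responses.
assignments = [0, 0, 1, 1, 2, 2]
medoids = [0, 2, 4]
oracle = {0: "siren", 1: "siren", 2: "dog_bark",
          3: "drilling", 4: "jackhammer", 5: "jackhammer"}
labels = derive_labels(assignments, medoids, oracle, budget=2)
```

In the toy example, segment 3 receives the predicted label of its medoid even though its ground truth differs, which shows the kind of label noise that propagation can introduce when a cluster is not pure.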

Figure 1: Overview of the proposed method. Medoid segments are marked with a red border. Annotated labels are filled with black and predicted labels are filled with grey.

Result

Figure 2: Classification accuracy as a function of labeling budget, simulated using an oracle annotator.

- Classification accuracy improves by 8% when the labeling budget is below 10% of the unlabeled data.
- The proposed method needs 50% to 60% less labeling budget than the best reference method to reach the same accuracy.

Conclusions

- The proposed method effectively saves labeling budget for sound event classification.
- Future: study different datasets, especially larger-scale ones.
- Future: study the weak annotator case, simulated using real human labeling responses.

Noticeable technical details

- In the clustering stage, each sound segment is represented by a single Gaussian based on its MFCC statistics. Segment-to-segment dissimilarity is measured by the symmetric KL divergence.
- Initialization of medoids is based on farthest-first traversal, starting from a random point.
- K-medoids clustering is based on the dissimilarity matrix.
- An annotated label overrules predicted labels on the same segment.
- The produced labels are used to construct training examples for supervised learning.
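The clustering stage named in the technical details (symmetric KL dissimilarity, farthest-first initialization, k-medoids on a dissimilarity matrix) can be sketched in plain Python as follows. This is a minimal sketch under stated assumptions, not the authors' code: it assumes diagonal-covariance Gaussians, defines the symmetric KL divergence as the sum of the two directed divergences, and uses a simple alternating assign/update k-medoids loop. All function names are hypothetical.

```python
import random


def symmetric_kl(mean_p, var_p, mean_q, var_q):
    """KL(p||q) + KL(q||p) for two diagonal Gaussians (assumed form)."""
    total = 0.0
    for mp, vp, mq, vq in zip(mean_p, var_p, mean_q, var_q):
        diff2 = (mp - mq) ** 2
        # The log-variance terms of the two directions cancel in the sum.
        total += 0.5 * (vp / vq + vq / vp - 2.0 + diff2 * (1.0 / vp + 1.0 / vq))
    return total


def farthest_first(dist, k, rng):
    """Random start, then repeatedly pick the point farthest from those chosen."""
    n = len(dist)
    chosen = [rng.randrange(n)]
    min_dist = list(dist[chosen[0]])  # distance to the nearest chosen point
    while len(chosen) < k:
        nxt = max(range(n), key=lambda i: min_dist[i])
        chosen.append(nxt)
        min_dist = [min(a, b) for a, b in zip(min_dist, dist[nxt])]
    return chosen


def assign_points(dist, medoids):
    """Index of the nearest medoid for every point."""
    return [min(range(len(medoids)), key=lambda c: dist[i][medoids[c]])
            for i in range(len(dist))]


def k_medoids(dist, k, rng, max_iter=100):
    """Alternate assignment and medoid update using only the matrix `dist`.

    Assumes k <= number of points and that distinct points have nonzero
    dissimilarity, so no cluster becomes empty.
    """
    medoids = farthest_first(dist, k, rng)
    for _ in range(max_iter):
        assign = assign_points(dist, medoids)
        new_medoids = []
        for c in range(k):
            members = [i for i, a in enumerate(assign) if a == c]
            # New medoid: member with the smallest total dissimilarity to its cluster.
            new_medoids.append(min(members, key=lambda i: sum(dist[i][j] for j in members)))
        if new_medoids == medoids:
            break
        medoids = new_medoids
    return medoids, assign_points(dist, medoids)


# Toy data: each segment is a (mean, variance) pair of a 2-D diagonal Gaussian.
segments = [
    ([0.0, 0.0], [1.0, 1.0]), ([0.2, -0.1], [1.1, 0.9]), ([0.1, 0.3], [0.9, 1.2]),
    ([5.0, 5.0], [1.0, 1.0]), ([5.2, 4.9], [1.2, 1.0]), ([4.8, 5.1], [0.8, 1.1]),
]
dist = [[symmetric_kl(*a, *b) for b in segments] for a in segments]
medoids, assign = k_medoids(dist, k=2, rng=random.Random(0))
```

The toy example uses k = 2 for readability; in the setup described on the poster, k would be a quarter of the number of unlabeled segments. Working from a precomputed dissimilarity matrix is what lets a non-Euclidean measure such as the symmetric KL divergence be used directly.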
