From concrete to virtual annotation mark-up language: the case of COMMOn-REFs

Renata Vieira, Caroline Gasperin, Rodrigo Goulart
PIPCA - Unisinos
São Leopoldo, Brazil
{renata,caroline,rodrigo}@exatas.unisinos.br

Susanne Salmon-Alt
ATILF-CNRS
Nancy, France
[email protected]

Abstract

This work presents the data model we adopted for annotating coreference. The model includes several levels of annotation, such as part-of-speech, syntax and discourse. We compare our encoding schemes with the abstract XML encoding currently being proposed as a standard. We also present our tool for coreference resolution, which operates on this data model.

1 Introduction

We have been working on corpus-based studies since 1997 (Vieira and Teufel, 1997; Poesio et al., 1997), with a focus on coreference. This work has involved annotation experiments, both manual and automatic, and their respective annotation schemes. To work on coreference we used information from a syntactically annotated corpus, the Penn Treebank. Our results (a corpus annotated with coreference links and a classification of coreference status) were encoded in Prolog. When we first adapted our tool for Portuguese (Rossi et al., 2001), we dealt with other tools and annotation formats. The resources built in these earlier works were difficult to share because each used its own information encoding.

Our current work in the COMMOn-REFs project (A computational model for processing referring expressions)1 involves Portuguese and French. We are using MMAX, a tool for multimodal annotation in XML (Müller and Strube, 2001), for manual annotation of coreference, and we are developing a tool for automatic coreference resolution. Our tool handles the XML encoding provided by MMAX together with syntactic information for Portuguese and French, also encoded in XML. In order to share the resources being built, we are relating our model to proposed standards.

In Section 2 we present the annotation formats we dealt with previously. Section 3 gives an overview of the work in COMMOn-REFs. Section 4 relates our current model to the recently proposed standards (Ide and Romary, 2002; Ide and Romary, 2003; Ide and Romary, 2001). Section 5 describes our tool for coreference resolution. Section 6 discusses the problems we face with our annotation model.

1 http://www.inf.unisinos.br/ renata/documents/commonrefsproposal.pdf. Research Grant CNPq - Brazil.

2 Previous work

Our first annotation schemes were Prolog lists of treebank sentences and their noun phrases (NPs), as shown in Figure 1. The lists were extracted from the Lisp lists of the Penn Treebank and were manipulated in our experiments on coreference annotation and resolution. The results of coreference annotation were lists of Prolog facts of the form dcc(Index1,Index2,Code), as shown in Figure 2. Index1 refers to the sequential numbering of definite descriptions; Index2 refers to the sequential numbering of noun phrases; and Code refers to their classification according to discourse status (Poesio and Vieira, 1998). For some of them there were also facts

Proceedings of the ACL 2003 Workshop on Linguistic Annotation: Getting the Model Right, pp. 6-13.

[S,[NP,the,squabbling,[PP,within,[NP,the,Organization,[PP,of,[NP,Petroleum,Exporting,Countries]]]]],[VP,seems,[PP,under,[NP,control]],[PP,for,now]].].
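To make the structure of the bracketed sentence list above concrete, the following is a minimal Python sketch (not the authors' Prolog tooling) that reads such a list into nested Python lists and enumerates its noun phrases. The function names `parse_bracketed`, `leaves` and `noun_phrases` are hypothetical, and the outermost-first, left-to-right numbering of NPs is an assumption; the paper does not state the order in which noun phrases were numbered.

```python
import re

# The sentence list as printed above (extraction whitespace is tolerated by the tokenizer).
FIGURE_1 = (
    "[S,[NP,the,squabbling,[PP,within,[NP,the,Organization,"
    "[PP,of,[NP,Petroleum,Exporting,Countries]]]]],"
    "[VP,seems,[PP,under,[NP,control]],[PP,for,now]].]."
)

def parse_bracketed(text):
    """Parse a Prolog-style bracketed list into nested Python lists of strings."""
    # A token is '[', ']', or a run of characters that is not a bracket, comma or space.
    tokens = re.findall(r"\[|\]|[^\[\],\s]+", text)
    pos = 0

    def parse_list():
        nonlocal pos
        assert tokens[pos] == "[", "a list must start with '['"
        pos += 1
        items = []
        while tokens[pos] != "]":
            if tokens[pos] == "[":
                items.append(parse_list())
            else:
                items.append(tokens[pos])
                pos += 1
        pos += 1  # consume the closing ']'
        return items

    # The final '.' that terminates the Prolog term is left unread.
    return parse_list()

def leaves(tree):
    """Return the words under a node, skipping the category label at index 0."""
    words = []
    for child in tree[1:]:
        if isinstance(child, list):
            words.extend(leaves(child))
        else:
            words.append(child)
    return words

def noun_phrases(tree):
    """Yield every NP subtree, outermost first and left to right (assumed order)."""
    if tree[0] == "NP":
        yield tree
    for child in tree[1:]:
        if isinstance(child, list):
            yield from noun_phrases(child)

tree = parse_bracketed(FIGURE_1)
nps = [" ".join(leaves(np)) for np in noun_phrases(tree)]
for index, np in enumerate(nps, start=1):
    print(index, np)
```

Under these assumptions the sentence yields four noun phrases, from the full subject NP down to "control"; an index of this kind is what the Index2 argument of a dcc fact would point at.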

[NP,Petroleum,Exporting,Countries].

STA:fcl
P:v-fin(’ser’ PR 3P IND) São
SC:adj(’remoto’ F P) remotas
SUBJ:np
=>N:art(’o’ F P) as
=H:n(’chance’ F P) chances
=N