Proceedings of the... - Association for Computational Linguistics
Recommend Documents
Dec 8, 2015 - interfaces, online services, training and training ...... This is quite a common tech- nique, and it has b
Dec 8, 2015 - ing testing is that we can use the same input with different added noise in ...... For example, the FreeBa
Selecting relevant text subsets from web-data for building topic specific language models. Abhinav Sethy .... domain sentences for this task available. We down-.
moval). 1 Introduction. Improving information retrieval (IR) through natu- ... cuss the query optimization algorithm followed by a section on ... This step is necessary to reduce the search space for the QA ... This was the IR engine of our choice in
turned (e.g., âLaura Bushâ) along with an icon indicating the type of the entity and the number of documents in which the entity appears. Clicking on the entity in ...
Jun 25, 2007 - USA. Tel: +1-570-476-8006. Fax: +1-570-476-0860 [email protected] ii ...... tailed query, might be âAsus laptop + POSITIVE. OPINIONSâ. Another possible ...... eration jukeboxes ( ipod , archos , dell , samsung ), it is not ...
Jun 28, 2007 - A multi-word expression (henceforth MWE) is usu- ally taken to be ...... Anthony P. Cowie. 1981. .... of the decisions made in order to come to a standard ..... from correct usage; (2) the extraction is in part based ..... for an even
Jun 23, 2007 - features describing the amount of parallelism be- tween conjuncts ..... test the quality of the model, we check for each given sentence which of ... these examples, why does it need selection restric- tions? ..... figure. After that, t
The natural language modalityâas opposed to directed dia- logâwas initially used mostly for call-routing, i.e. to route calls to the appropriate call center based.
Jun 28, 2007 - A Semantic Approach To Textual Entailment: System Evaluation ...... systems using some form of logical inferences (at ...... inventory for WordNet 2.1 released for Task #7 in ...... TF1,...,TFn that are designed to modify depen-.
Automated Quality Monitoring for Call Centers using Speech and NLP Technologies. G. Zweig, O. ...... cided upon, and automatically recording them could.
Jun 28, 2007 - Solomon Teferra Abate and Wolfgang Menzel . ..... singular of book. This seems fine until ... The 'magic' is that the bound variables n, c, and s ..... ac.uk/Ëspena/papers/GPFM.pdf, October 6. ...... [The Jordanian king Abdullah II].
need to carefully study how linguistic facts are rep- resented in the ... conversion process relating to the handling of bro- ken plurals at the ... it marks the value that is chosen during subsequent ... thographic tokens from which punctuation has
It is based on the observation that termino- logical multi-word groups reveal a con- ... the constant stream of new terms is increasingly get- .... Table 1: Frequency distribution for noun phrase term candi- ... bigrams), length 3 (word trigrams) and
Names of people, places, and organizations have unique linguistic properties, and ... studied domain, name matching is a promising area for innovation and for ...
biomedical domain (Blaschke et al., 2004; Kim et ... the limited availability of annotated material in most .... all possible names and moreover, in most domains,.
Jun 25, 2007 - about how the work could be presented best, and does not indicate a quality difference ...... tailed query, might be âAsus laptop + POSITIVE.
stantiation of this framework for email data, where ... email message, for instance, is connected via header .... its probability mass into small fractions over many.
multimillion pound six year research project funded by the EPSRC in the UK. AKT .... In order to flesh the concept relations, we need to identify relations such as ...
Patrick Paroubek**, Yves Schabes and Aravind K. Joshi ..... Patrick Martin automated the acquisition of some of syntactic lexicons .... A.R. , S. Seitz, and B. Smith.
Jun 24, 2011 - Jonathan K Kummerfeld, Mohit Bansal, David Burkett and Dan Klein . . . . . . . . . . . . . . . . . . . . . 102 ..... story from his days as a pioneering heart sur- geon back in ..... evolved over a period of almost five years, with dif
semrels) from a definition or example sentence for ... extraction of paradigmatic relations, in particular .... strategy is based on identifying syntagmatic relations.
MedSLT (MedSLT, 2005; Bouillon et al., 2005) is a unidirectional medical speech translation system for use in doctor-patient diagnosis dialogues, which.
Proceedings of the... - Association for Computational Linguistics
the available text efficiently, systems such as, (Veale and Way, 1997) and (Brown, 1999), convert the examples in the corpus into templates against which the new text can be matched. Thus, source-target sentence pairs are converted to source-target generalized template pairs. An example of such a pair is shown below:
Prior work has shown that generalization of data in an Example Based Machine Translation (EBMT) system, reduces the amount of pre-translated text required to achieve a certain level of accuracy (Brown, 2000). Several word clustering algorithms have been suggested to perform these generalizations, such as kMeans clustering or Group Average Clustering. The hypothesis is that better contextual clustering can lead to better translation accuracy with limited training data. In this paper, we use a form of spectral clustering to cluster words, and this is shown to result in as much as 29.08% improvement over the baseline EBMT system.