A Framework for Computational Processing of Spelling ... - LTRC

Recommend Documents

Address correspondence to Emilios Cambouropoulos, Department of Music Studies, ...... Unpublished doctoral dissertation, University of Edinburgh, Scotland.

A computational framework for microstructural

coefficients matrix and B the N Ã 1 vector representing the right-hand side. Eventually ...... Clebsch-Gordon coefficients, that is câ1â2â m1m2m â¡ â¨â1 â2 m1 ...

A Computational Framework for Influenza

Oct 7, 2010 - Copyright: Â© 2010 Cai et al. This is an ... The peak influenza season in the northern hemisphere is from. January to April every year. ... three influenza A pandemics occurred in 1918, 1957, and 1968, respectively [4,5]. ... temporal b

A computational model for processing of semi

The basic equations of continuum mechanics are the balance equations. .... the Avrami equation is a special case of the Schneider rate equations as shown.

A computational model for processing of semi

The basic equations of continuum mechanics are the balance equations. In their .... The very often used Avrami equation (22), which describes space filling in.

Spelling space: A computational testbed for phonological and ...

The situation can be likened to a tug-of-war of approximately equally strong parties, where the spelling is the rope, and morphology and phonology are pulling it, ...

OpenStructure: a flexible software framework for computational ...

Motivation: Developers of new methods in computational structural biology are often hampered ... workflow automation tools, e.g. Biskit (Grunberg et al., 2007) or.

Geometric algebra: a computational framework for ... - CiteSeerX

do this effectively, the basic concept of a vector as an element of a linear ... use geometric algebra to develop new algorithms for graphics; but we hope ...... is referred to the tutorial of [3], which introduces these differentiations using exampl

A Computational Framework For Academic Accreditation ... - wseas.us

Emphasis is put on technologies spanning (crude) data, information, refined ... Academic accreditation, Database management system (DBMS), Information ...

A Computational Framework for Context-aware ... - CiteSeerX

application framework that assists developers in implementing applications to run ...... A tablet PC running Android was chosen as the platform, and 2 use cases ...

A Computational Framework for Multiscale Structural ...

Sep 14, 2007 - localized 3-D thermo-mechanical fields at structural scale ... Belytschko T., Gracie R. and Ventura G. A review of extended/generalized.

A Computational Framework for Data Analysis using

scribed number of critical points by computing. Vk = arg max. MâM, |M|=k Ï(M). ..... Mathematics, PUC-Rio (2005), advised by HÃ©lio Lopes and Geovan Tavares.

Towards a Computational Framework for Modeling

May 28, 2018 - Heilsbronn, Germany) was used to amplify, acquire, and analyze pressure ..... complies with the assumptions of an incompressible, isothermal,. Newtonian and ..... integrated EM models was verified by standard PV loop analysis as shown

A Computational Framework for Total Variation

Mathematics Subject Classification (2000) 65J22 Â· 65K10 Â· 65F22. .... Our focus in this paper is on the use of the total variation (TV) regularization function ... The projection of a vector x â RN onto the feasible set Î© := {x â RN | x â¥ 0}

A Computational Framework for Interacting with ...

Mar 26, 2014 - Peppytide workflow and UI mockup-design in Cyborg . . . . . . 73. 4.4 ..... We have designed a web API (Application Programming. Interface) ...

A computational framework for the near

Spreadsheet risk is the danger that errors in a common business tool such as ..... Microsoft Excel) either by their own business analysts (whose programming ...

A multiphysics multiscale computational framework for ...

Jean-Marie Aerts and Prof. Hans Van Oosterwyck from KU Leuven, Prof. Davide Ruffoni from ULg and Prof. Eirini Velliou from University of Surrey for being part ...

A Computational Framework for HumanAgent Communication.pdf

(7) User: No, no, I will contact Adam myself ... the utterance âit's necessary to contact Adamâ is ..... Typologyâ, Special Issue 216 Searle with his Replies of.

A Transparency and Scaffolding Framework for Computational ...

experimental data and hypotheses, iv) translate theoretical variables to observable ones, ... scaffolding framework embedded within ABACUS. INQUIRY LEARNING ... [12] defined transparency as a property of some systems where the inner ...

A Computational Framework for Uncertainty Quantification ... - CiteSeerX

ing a state-of-the-art numerical weather prediction (NWP) model ... Preprint ANL/MCS-P1681-1009 .... forecasting capabilities of state-of-the-art NWP models?

A computational framework for modeling targets as

paradigm is to move away from the top-down information sharing which is prevalent ...... To emphasize the importance of information sharing, ... anime patrol in.

A Computational Argumentation Framework for Agent Societies

Jun 17, 2011 - In computing, large systems can be viewed or designed in terms of the services they offer and the entities that interact to provide or consume ...

(Rad-TRaP): A computational framework for

brief description of these features is provided in Table 2 in the âMethodsâ ..... EBRT were generated using the Varian Eclipseâ¢treatment planning software ..... RE, Madabhushi A. Novel pca-vip scheme for ranking mri protocols and identifying ..

BioMog: A Computational Framework for the De

Dec 5, 2013 - to correct false positives it is done by manual inspection). .... the network (.100 blocked metabolites) are considered uninformative and filtered out to improve ..... due to various attached cofactors (e.g., ACP and trna) for which ...

A Framework for Computational Processing of Spelling ... - LTRC

Download PDF

2 downloads 20 Views 7KB Size Report

Comment

Many languages like Hindi, especially those which have been used as ... of two different 'dialects of Hindi' and are considered major 'regional' novels in Hindi.

A Framework for Computational Processing of Spelling Variation Anil Kumar Singh Language Technologies Research Centre International Institute of Information Technology, Hyderabad, India

Abstract Many languages like Hindi, especially those which have been used as (spoken) link languages and don’t have a long history of standardization, allow variant spellings of the same words. One of the reasons for this is the influence of the writer’s first language or dialect. In this paper we present a computational framework for predicting spelling variations based on the speaker’s dialect. This method is mainly based on using a phonetic model of scripts. Such a method is especially suited for Brahmi origin scripts which are used for most major Indian languages. The same method can also be used for studying regional variation given some written corpora in the concerned dialects as well as for determining the standard or the normalized form given a variant. The phonetic model used here differs from others such models in that it has been defined in terms of mainly phonetic features (vowel, voiced, etc.) augmented with some orthographic features. Other models mostly try to map graphemes to phonemes directly. Our focus in the current work is on the framework for computational processing of phonological and spelling variation rather than on specific (statistical or rule based) techniques like hidden markov modelling (HMM). The idea was to come up with a framework under which we can experiment with various such techniques. Such a framework is needed for languages like Hindi because a lot of errors while trying to solve natural language processing (NLP) problems occur because of the non-standard spellings and dialectal or regional variations. The framework that we propose would consists of the following components: • A phonetic model of Indian language scripts, which includes a function for measuring the phonetic distance (or similarity) between two letters or words • A ‘trie’ based compilation tool for lexicon to enable fast and easy search • A syllable or akshar forming component, as the Indian language scripts we are considering are syllabic in nature, in addition to having phonetic properties • A decoder which uses the lexicon ‘trie’ and the distance function to determine the standard or normalized form corresponding to the spelling variations • A generator to generate possible spelling variations given a standard or normalized form • A tool for studying spelling (and indirectly, phonological) variation given annotated corpora The decoder can use techniques like Gaussian mixture based HMM combined with dynamic time warping algorithm, as is done in speech recognition systems. The generator can be rule based or could use some statistical approach. As far as the corpus is concerned, we have manually marked up two Hindi novels written by native speakers of two different ‘dialects of Hindi’ and are considered major ‘regional’ novels in Hindi. These novels contain a lot of ‘Hindi words’ with spellings which correspond to their phonological variants in those dialects. We have also marked up two major English novels by Indians which contain many Hindi words. These will be used for studying the spelling variations of Hindi words or Indian names in written English.