Phonetic Algorithms in R - Open Journals

Recommend Documents

r esearch r esearch - Loughborough University Library Open Journals

the bathroom in the Hotel Louis XIV and other object lessons Caplan (2005) writes, 'design is a process for making things right, for shaping what people need'.

Open Access Journals in Educational Technology - patrick r. lowenthal

Preprint: âOpen Access Journals in Educational Technology: Results of a Survey of Experienced. Usersâ to appear in the Australasian Journal of Educational ...

The phonetic variation of German /r

It has already been acknowledged in the literature that German uvular /r/ shows much ... Phonetic data are reported which imply that the reduction of uvular /r/ in ...

Researcher degrees of freedom in phonetic research - Open Science

researcher degrees of freedom and how they can lead to misinterpretations ... ing research on an isolated undocumented language and investigate the ... However, these tests have a natural false positive rate, i.e. given a p-value of 0.05, there.

biomedical journals in r

Oct 16, 2015 - MJMS â Macedonian Journal of Medical Sciences, International Medical Journal ... butions of the Macedonian Scientific Society of Bitola, Vox Medici, Social Medicine: Professional ... Key words: biomedical journals, publishing, Republ

R. Yarahmadi, et al. - About Open Academic Journals Index

Key words: Acidic sludge- Bitumen- SBS polymer- Used motor oil. INTRODUCTION. According to the database of Iranian industry organization, there are more ...

R. Yarahmadi, et al. - About Open Academic Journals Index

industries to bitumen using bentonite and SBS. Ahmad Jonidi ... Key words: Acidic sludge- Bitumen- SBS polymer- Used motor oil ...... waterproofing material.

eixport: An R package to export emissions to ... - Open Journals

Apr 25, 2018 - Also, eixport import functions from the package ncdf4 (Pierce 2017), .... Snyder, Michelle G, Akula Venkatram, David K Heist, Steven G Perry, ...

Generating CodeMeta Metadata for R Packages - Open Journals

The CodeMeta project defines a JSON-LD format (âJSON-Ld 1.1â 2017) for describing software metadata, based largely on schema.org terms. This metadata ...

highlightHTML: CSS Formatting of R Markdown ... - Open Journals

Jan 12, 2018 - DOI: 10.21105/joss.00185. Software. â¢ Review. â¢ Repository. â¢ Archive ... Authors of JOSS papers retain copyright and release the work un-.

TDAstats: R pipeline for computing persistent ... - Open Journals

Aug 8, 2018 - tus in medicine, and computer vision in deep learning. Dimension reduction, using meth ... The Ripser C++ library (Bauer, 2015), benchmarked.

qqman: an R package for visualizing GWAS results ... - Open Journals

May 19, 2018 - Turner, (2018). qqman: an R package for visualizing GWAS results ... Pruim, Randall J, Ryan P Welch, Serena Sanna, Tanya M Teslovich, ...

qicharts2: Quality Improvement Charts for R - Open Journals

May 17, 2018 - qicharts2 is an R package for making statistical process control (SPC) charts for contin- uous quality improvement. SPC charts are ...

The vtreat R package: a statistically sound data ... - Open Journals

Mar 12, 2018 - (Mount and Zumel (2018)) is an R data frame processor that prepares ... Explicit coding of categorical variable levels as indicator variables.

geneXplainR: An R interface for the geneXplain ... - Open Journals

Licence. Authors of JOSS papers retain copyright and release the work un- der a Creative Commons Attri- bution 4.0 International License. (CC-BY). Summary.

humanleague: a C++ microsynthesis package with R ... - Open Journals

May 3, 2018 - humanleague: a C++ microsynthesis package with R and python interfaces. Andrew P Smith1. 1 School of Geography and Leeds Institute for ...

quanteda: An R package for the quantitative analysis ... - Open Journals

Oct 6, 2018 - nipulating these or using them in further quantitative analysis. Using C++ and multi- ... Corpus management quanteda makes it easy to manage ...

Pymer4: Connecting R and Python for Linear Mixed ... - Open Journals

Nov 26, 2018 - To accomplish this pymer4 leverages the existing prowess of lme4, through the use of the rpy2 (Gautier, 2008) cross-langauge compatibility ...

hydroscoper: R interface to the Greek National Data ... - Open Journals

Mar 14, 2018 - and global change studies (Vafiadis, Tolikas, and Koutsoyiannis 1994). Also, these data can be used by researchers to forward their studies ...

In/formalization - UniCA Open Journals

based company Grab followed suit and released its own motorbike taxi ser- ... The expansion of ridesharing companies into this sec-. 2017 A .... Company man-.

In/formalization - UniCA Open Journals

Grassroots meanings of informality: Resistance, subsistence and survival in the Greek crisis context. 2017 A. â¸ NUAC. VOL. 6, NÂ° 2, DICEMBRE 2017: 51-55.

Institute of Phonetic Sciences - Phonetic Sciences, Amsterdam

explain poor performance on tasks of this study but are not necessarily related to reading achievement. ..... p

r eviews - Journals

the correct research tools is as important to achieving the research aims for technology education as researching the right topics'. This important point is made in ...

r esearch - Journals

schools. This made the foundation very weak because students need to be exposed to different technological concepts and careers at an early age. Despite a.

Phonetic Algorithms in R - Open Journals

Download PDF

3 downloads 0 Views 118KB Size Report

Comment

Feb 14, 2018 - For instance, âRobertâ and âRupertâ both have the Soundex value of. âR163,â suggesting they are pronounced almost identically. Because of ...

Phonetic Algorithms in R James P. Howard, II1 DOI: 10.21105/joss.00480

1 The Johns Hopkins University Applied Physics Laboratory

Software • Review • Repository • Archive

Summary

The phonics package provides implementations of several phonetic algorithms. Phonetic algorithms are used to encode a string based on how it is pronounced (Zobel and Dart 1996). The resultant code should provide functional matching between similarly pronounced names. For instance, “Robert” and “Rupert” both have the Soundex value of Licence “R163,” suggesting they are pronounced almost identically. Because of pronunciation difAuthors of JOSS papers retain ferences around the world, even across English, many different algorithms exist and serve copyright and release the work undifferent needs and populations. der a Creative Commons Attribution 4.0 International License The algorithms are typically used for name encoding and indexing, record linking between (CC-BY). unrelated databases, and spellchecking. In addition, they can be used as a proxy for string distance measurements. Included in this package are Soundex, Metaphone, and many others developed over the years, including published variants. Submitted: 15 November 2017 Published: 14 February 2018

Acknowledgements The author thanks Oliver Keyes for his contributions and improvements to the C++ implementations within this package. This work used the Extreme Science and Engineering Discovery Environment (XSEDE), which is supported by National Science Foundation grant number ACI-1548562 (Towns et al. 2014). In particular, it used the Comet system at the San Diego Supercomputing Center (SDSC) (Moore et al. 2014; S. M. Strande et al. 2017) through allocations TGDBS170012 and TG-ASC150024.

References Moore, Richard L., Chaitan Baru, Diane Baxter, Geoffrey C. Fox, Amit Majumdar, Phillip Papadopoulos, Wayne Pfeiffer, et al. 2014. “Gateways to Discovery: Cyberinfrastructure for the Long Tail of Science.” In Proceedings of the 2014 Annual Conference on Extreme Science and Engineering Discovery Environment, 39:1–39:8. XSEDE ’14. New York, NY, USA: ACM. https://doi.org/10.1145/2616498.2616540. Strande, Shawn M., Haisong Cai, Trevor Cooper, Karen Flammer, Christopher Irving, Gregor von Laszewski, Amit Majumdar, et al. 2017. “Comet: Tales from the Long Tail: Two Years in and 10,000 Users Later.” In Proceedings of the Practice and Experience in Advanced Research Computing 2017 on Sustainability, Success and Impact, 38:1–38:7. PEARC17. New York, NY, USA: ACM. https://doi.org/10.1145/3093338.3093383. Towns, John, Timothy Cockerill, Maytal Dahan, Ian Foster, Kelly Gaither, Andrew Grimshaw, Victor Hazlewood, et al. 2014. “XSEDE: Accelerating Scientific Discovery.” Computing in Science & Engineering 16 (5). IEEE:62–74.

Howard, (2018). Phonetic Algorithms in R. Journal of Open Source Software, 3(22), 480. https://doi.org/10.21105/joss.00480

1

Zobel, Justin, and Philip Dart. 1996. “Phonetic String Matching: Lessons from Information Retrieval.” In Proceedings of the 19th Annual International Acm Sigir Conference on Research and Development in Information Retrieval, 166–72. ACM.

Howard, (2018). Phonetic Algorithms in R. Journal of Open Source Software, 3(22), 480. https://doi.org/10.21105/joss.00480

2