Identification of confusable drug names: a new approach and evaluation methodology

Authors:
Grzegorz Kondrak;Bonnie Dorr
Affiliations:
University of Alberta Edmonton, Alberta, Canada;University of Maryland, College Park
Venue:
COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Year:
2004

Citing 8
Cited 10

Approximate string-matching with q-grams and maximal matches

Theoretical Computer Science - Selected papers of the Combinatorial Pattern Matching School
Finding approximate matches in large lexicons

Software—Practice & Experience
Phonetic string matching: lessons from information retrieval

SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
The String-to-String Correction Problem

Journal of the ACM (JACM)
Approximate String Matching

ACM Computing Surveys (CSUR)
Bitext maps and alignment via pattern recognition

Computational Linguistics
A new algorithm for the alignment of phonetic sequences

NAACL 2000 Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference
The SMART Retrieval System—Experiments in Automatic Document Processing

The SMART Retrieval System—Experiments in Automatic Document Processing

Methods for extracting and classifying pairs of cognates and false friends

Machine Translation
Automatic extraction of translations from web-based bilingual materials

Machine Translation
Automatic prediction of cognate orthography using support vector machines

ACL '07 Proceedings of the 45th Annual Meeting of the ACL: Student Research Workshop
Finding `Lucy in Disguise': The Misheard Lyric Matching Problem

AIRS '09 Proceedings of the 5th Asia Information Retrieval Symposium on Information Retrieval Technology
Name matching between Chinese and Roman scripts: machine complements human

NEWS '09 Proceedings of the 2009 Named Entities Workshop: Shared Task on Transliteration
Computing word similarity and identifying cognates with pair hidden Markov models

CONLL '05 Proceedings of the Ninth Conference on Computational Natural Language Learning
A knowledge-rich approach to measuring the similarity between Bulgarian and Russian words

MRTECEEL '09 Proceedings of the Workshop on Multilingual Resources, Technologies and Evaluation for Central and Eastern European Languages
Bootstrapping bilingual lexicons from comparable corpora for closely related languages

TSD'11 Proceedings of the 14th international conference on Text, speech and dialogue
Aligning the un-alignable -- a pilot study using a noisy corpus of nonstandardized, semi-parallel texts

CICLing'12 Proceedings of the 13th international conference on Computational Linguistics and Intelligent Text Processing - Volume Part II
Similarity patterns in words

EACL 2012 Proceedings of the EACL 2012 Joint Workshop of LINGVIS & UNCLH

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper addresses the mitigation of medical errors due to the confusion of sound-alike and look-alike drug names. Our approach involves application of two new methods---one based on orthographic similarity ("look-alike") and the other based on phonetic similarity ("sound-alike"). We present a new recall-based evaluation methodology for determining the effectiveness of different similarity measures on drug names. We show that the new orthographic measure (BI-SIM) outperforms other commonly used measures of similarity on a set containing both look-alike and sound-alike pairs, and that the feature-based phonetic approach (ALINE) outperforms orthographic approaches on a test set containing solely sound-alike confusion pairs. However, an approach that combines several different measures achieves the best results on both test sets.