Using the normalization for typographic errors in numerals

  • Authors:
  • Sachin N. Deshmukh;Suresh C. Mehrotra;Hardeep Singh

  • Affiliations:
  • Department of CS and IT, Dr B. A. M. University, Aurangabad, M.S., India;Department of CS and IT, Dr B. A. M. University, Aurangabad, M.S., India;Department of CSE, Guru Nanak Dev University, Amritsar, Punjab, India

  • Venue:
  • ICDEM'10 Proceedings of the Second international conference on Data Engineering and Management
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

For numerical record fields such as date and age, many types of error are likely to yield small numerical differences between observed and true values. If, for example, two different sources provide separate case reports related to the same incident, the dates of onset may not match perfectly but are more likely to differ by a few days than by several years. In order to tackle the variations in numbers a few methods are available. The paper proposes a new normalization technique useful for the numerical record. A Comparison of Distance with the Smith Waterman Distance shows significant increase in the weight by the present technique.