Applying natural language processing (NLP) based metadata extraction to automatically acquire user preferences

  • Authors:
  • Woojin Paik;Sibel Yilmazel;Eric Brown;Maryjane Poulin;Stephane Dubon;Christophe Amice

  • Affiliations:
  • solutions-united, inc., Syracuse, NY;solutions-united, inc., Syracuse, NY;solutions-united, inc., Syracuse, NY;solutions-united, inc., Syracuse, NY;solutions-united, inc., Syracuse, NY;solutions-united, inc., Syracuse, NY

  • Venue:
  • Proceedings of the 1st international conference on Knowledge capture
  • Year:
  • 2001

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes a metadata extraction technique based on natural language processing (NLP) which extracts personalized information from email communications between financial analysts and their clients. Personalized means connecting users with content in a personally meaningful way to create, grow, and retain online relationships. Personalization often results in the creation of user profiles that store individuals' preferences regarding goods or services offered by various e-commerce merchants. With the introduction of e-commerce, it has become more difficult to develop and maintain personalized information due to larger transaction volumes. is an NLP and Machine Learning (ML)-based automatic metadata extraction system designed to process textual data such as emails, discussion group postings, or chat group transcriptions. extracts both explicit and implicit metadata elements including proper names, numeric concepts, and topic/subject information. In addition, Speech Act Theory inspired metadata elements, which represent the message creators' intention, mood, and urgency are also extracted. In a typical dialogue between financial analysts and their clients, clients often discuss the items that they liked or have an interest. By extracting this information, constructs user profiles automatically. This system has been designed, implemented, and tested with real-world data. The overall accuracy and coverage of extracting explicit and implicit metadata is about 90%. In summary, the paper shows that an NLP-based metadata extraction system enables automatic user profiling with high effectiveness.