Mining uncertain sentences with multiple instance learning

  • Authors:
  • Feng Ji;Xipeng Qiu;Xuanjing Huang

  • Affiliations:
  • School of Computer Science and Technology, Fudan University, Shanghai, China;School of Computer Science and Technology, Fudan University, Shanghai, China;School of Computer Science and Technology, Fudan University, Shanghai, China

  • Venue:
  • ADMA'10 Proceedings of the 6th international conference on Advanced data mining and applications: Part I
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Distinguishing uncertain information from factual ones in online texts is of essential importance in information extraction, because uncertain information would mislead systems to find useless even fault information. In this paper, we propose a method for detecting uncertain sentences with multiple instance learning (MIL). Based on the basic assumption, we derive two new constraints for estimating the weight vector by defining a probability margin, which is used in an online learning algorithm known as Passive-Aggressive algorithm. To demonstrate the effectiveness of our method, we experiment on the biomedical corpus. Compared with an intuitive method with conventional single instance learning (SIL), our method provide higher performance by raising the performance from 79.07% up to 82.55%, over 3% improvement.