Bisociative literature mining by ensemble heuristics

  • Authors:
  • Matjaž Juršič;Bojan Cestnik;Tanja Urbančič;Nada Lavrač

  • Affiliations:
  • Jožef Stefan Institute, Ljubljana, Slovenia;Jožef Stefan Institute, Ljubljana, Slovenia, Temida d.o.o., Ljubljana, Slovenia;Jožef Stefan Institute, Ljubljana, Slovenia, University of Nova Gorica, Nova Gorica, Slovenia;Jožef Stefan Institute, Ljubljana, Slovenia, University of Nova Gorica, Nova Gorica, Slovenia

  • Venue:
  • Bisociative Knowledge Discovery
  • Year:
  • 2012

Abstract

In literature mining, identifying bridging concepts that link two diverse domains has been shown to be a promising approach for finding bisociations, that is, distinct yet unexplored cross-domain connections which could lead to new scientific discoveries. This chapter introduces the system CrossBee (on-line Cross-Context Bisociation Explorer), which implements a methodology that supports the search for hidden links connecting two different domains. The methodology is based on an ensemble of specially tailored text mining heuristics which assign each candidate bridging concept a bisociation score. Using this score, the user of the system can focus first on the most promising concepts, those with high bisociation scores. Besides improved bridging concept identification and ranking, CrossBee also provides various content presentations which further speed up the examination of bisociation hypotheses. These presentations include side-by-side document inspection, emphasis of interesting text fragments, and discovery of similar documents. The methodology is evaluated on two problems: the standard migraine-magnesium problem, well known in literature mining, and a more recent autism-calcineurin literature mining problem.
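
The idea of scoring candidate bridging concepts with an ensemble of heuristics can be illustrated with a minimal Python sketch. It is not CrossBee's implementation: the abstract does not specify the heuristics or how they are combined, so the two toy heuristics below (shared frequency and frequency balance), the averaging scheme, and the names `term_counts` and `rank_bridging_terms` are all assumptions made for illustration.

```python
from collections import Counter


def term_counts(docs):
    """Count whitespace-separated, lowercased terms across a list of documents."""
    counts = Counter()
    for doc in docs:
        counts.update(doc.lower().split())
    return counts


def rank_bridging_terms(docs_a, docs_b):
    """Rank terms shared by two domains with a toy heuristic ensemble.

    Assumption: both heuristics and the averaging step are illustrative
    stand-ins, not the heuristics used by CrossBee.
    """
    freq_a, freq_b = term_counts(docs_a), term_counts(docs_b)
    # Candidate bridging terms must occur in both domains.
    candidates = sorted(set(freq_a) & set(freq_b))
    if not candidates:
        return []

    heuristics = [
        # Hypothetical heuristic 1: how often the term occurs in the weaker domain.
        lambda t: min(freq_a[t], freq_b[t]),
        # Hypothetical heuristic 2: how evenly the term is spread over both domains.
        lambda t: min(freq_a[t], freq_b[t]) / max(freq_a[t], freq_b[t]),
    ]

    scores = {t: 0.0 for t in candidates}
    for heuristic in heuristics:
        raw = {t: heuristic(t) for t in candidates}
        top = max(raw.values()) or 1  # guard against division by zero
        for t in candidates:
            # Normalize each heuristic to [0, 1] so all contribute equally.
            scores[t] += raw[t] / top / len(heuristics)

    # Highest ensemble score first; ties are broken alphabetically.
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


if __name__ == "__main__":
    domain_a = ["migraine magnesium stress", "migraine stress serotonin"]
    domain_b = ["magnesium serotonin calcium", "magnesium calcium channel"]
    for term, score in rank_bridging_terms(domain_a, domain_b):
        print(f"{term}\t{score:.2f}")
```

Normalizing each heuristic before averaging keeps any single heuristic from dominating the combined score; a real system would likely use richer heuristics and a different combination rule.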