Contextually-mediated semantic similarity graphs for topic segmentation

  • Authors: Geetu Ambwani; Anthony R. Davis
  • Affiliations: StreamSage/Comcast, Washington, DC; StreamSage/Comcast, Washington, DC
  • Venue: TextGraphs-5 Proceedings of the 2010 Workshop on Graph-based Methods for Natural Language Processing
  • Year: 2010

Abstract

We present a representation of documents as directed, weighted graphs that model the range of influence of terms within the document as well as contextually determined semantic relatedness among terms. We then show the usefulness of this kind of representation in topic segmentation. Our boundary detection algorithm uses this graph to determine topical coherence and potential topic shifts, and does not require labeled data or training of parameters. We show that this method yields improved results on both concatenated pseudo-documents and closed captions for television programs.
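
The general idea of graph-based boundary detection can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not specify the graph construction or the scoring, so everything below is an assumption made for illustration. The function names (`build_graph`, `crossing_weight`, `segment`), the fixed `window` standing in for a term's range of influence, the 1/distance decay, and the exact-match `relatedness` default are all hypothetical. The sketch links each sentence to the sentences that follow it, sums the edge weight spanning each gap between sentences, and proposes a boundary wherever that sum is a strict local minimum.

```python
from collections import defaultdict


def build_graph(sentences, window=2, relatedness=None):
    """Return directed, weighted edges {(i, j): weight} with i < j.

    Assumption: a term in sentence i influences at most `window` following
    sentences, and its influence decays as 1 / (j - i).
    """
    if relatedness is None:
        # Hypothetical stand-in for semantic relatedness: exact term match.
        relatedness = lambda a, b: 1.0 if a == b else 0.0
    edges = defaultdict(float)
    last = len(sentences) - 1
    for i, source_terms in enumerate(sentences):
        for j in range(i + 1, min(i + window, last) + 1):
            decay = 1.0 / (j - i)
            for a in source_terms:
                for b in sentences[j]:
                    edges[(i, j)] += decay * relatedness(a, b)
    return edges


def crossing_weight(edges, n):
    """scores[g] = total weight of edges spanning the gap before sentence g."""
    scores = [0.0] * n  # scores[0] is unused: no gap precedes sentence 0
    for (i, j), weight in edges.items():
        for g in range(i + 1, j + 1):
            scores[g] += weight
    return scores


def segment(sentences, window=2):
    """Propose a boundary before sentence g where coherence is a strict local minimum."""
    n = len(sentences)
    scores = crossing_weight(build_graph(sentences, window), n)
    inf = float("inf")
    boundaries = []
    for g in range(1, n):
        left = scores[g - 1] if g > 1 else inf
        right = scores[g + 1] if g + 1 < n else inf
        if scores[g] < left and scores[g] < right:
            boundaries.append(g)
    return boundaries


# Toy pseudo-document: three sentences about pets, then three about finance.
doc = [
    ["cat", "dog"], ["dog", "pet"], ["cat", "pet"],
    ["stock", "market"], ["market", "bank"], ["stock", "bank"],
]
print(segment(doc))
```

On this toy input, no term is shared across the gap between the third and fourth sentences, so that gap has the lowest crossing weight. A relatedness function that scores related but non-identical terms could be passed to `build_graph` in place of the exact-match default.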