Discourse segmentation by human and automated means

Authors:
Rebecca J. Passonneau;Diane J. Litman
Affiliations:
Bellcore and Columbia University;AT&T Labs-Research
Venue:
Computational Linguistics
Year:
1997

Citing 32
Cited 50

Getting computers to talk like you and me

Getting computers to talk like you and me
Attention, intentions, and the structure of discourse

Computational Linguistics
Tense as discourse anaphor

Computational Linguistics - Special issue on tense and aspect
Computer systems that learn: classification and prediction methods from statistics, neural nets, machine learning, and expert systems

Computer systems that learn: classification and prediction methods from statistics, neural nets, machine learning, and expert systems
C4.5: programs for machine learning

C4.5: programs for machine learning
A problem for RST: the need for multi-level discourse analysis

Computational Linguistics
Classifying cue phrases in text and speech using machine learning

AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
Centering: a framework for modeling the local coherence of discourse

Computational Linguistics
Assessing agreement on classification tasks: the kappa statistic

Computational Linguistics
Limited attention and discourse structure

Computational Linguistics
Plan Recognition in Natural Language Dialogue

Plan Recognition in Natural Language Dialogue
TextTiling: A Quantitative Approach to Discourse

TextTiling: A Quantitative Approach to Discourse
The representation and use of focus in dialogue understanding.

The representation and use of focus in dialogue understanding.
Lexical cohesion computed by thesaural relations as an indicator of the structure of text

Computational Linguistics
Empirical studies on the disambiguation of cue phrases

Computational Linguistics
Planning text for advisory dialogues: capturing intentional and rhetorical information

Computational Linguistics
A computational theory of the function of clue words in argument understanding

ACL '84 Proceedings of the 10th International Conference on Computational Linguistics and 22nd annual meeting on Association for Computational Linguistics
The intonational structuring of discourse

ACL '86 Proceedings of the 24th annual meeting on Association for Computational Linguistics
Deterministic parsing of syntactic non-fluencies

ACL '83 Proceedings of the 21st annual meeting on Association for Computational Linguistics
Intention-based segmentation: human reliability and correlation with linguistic cues

ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
Text segmentation based on similarity between words

ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
Combining multiple knowledge sources for discourse segmentation

ACL '95 Proceedings of the 33rd annual meeting on Association for Computational Linguistics
Evaluating automated and manual acquisition of anaphora resolution strategies

ACL '95 Proceedings of the 33rd annual meeting on Association for Computational Linguistics
Investigating cue selection and placement in tutorial discourse

ACL '95 Proceedings of the 33rd annual meeting on Association for Computational Linguistics
Multi-paragraph segmentation of expository text

ACL '94 Proceedings of the 32nd annual meeting on Association for Computational Linguistics
An automatic method of finding topic boundaries

ACL '94 Proceedings of the 32nd annual meeting on Association for Computational Linguistics
Mixed initiative in dialogue: an investigation into discourse segmentation

ACL '90 Proceedings of the 28th annual meeting on Association for Computational Linguistics
A prosodic analysis of discourse segments in direction-giving monologues

ACL '96 Proceedings of the 34th annual meeting on Association for Computational Linguistics
Tense trees as the "fine structure" of discourse

ACL '92 Proceedings of the 30th annual meeting on Association for Computational Linguistics
Estimating upper and lower bounds on the performance of word-sense disambiguation programs

ACL '92 Proceedings of the 30th annual meeting on Association for Computational Linguistics
Cues and control in expert-client dialogues

ACL '88 Proceedings of the 26th annual meeting on Association for Computational Linguistics
Cue phrase classification using machine learning

Journal of Artificial Intelligence Research

Statistical Models for Text Segmentation

Machine Learning - Special issue on natural language learning
Learning-based Intrasentence Segmentation for Efficient Translation of Long Sentences

Machine Translation
Mixed-Initiative Interaction

IEEE Intelligent Systems
An evaluation method of words tendency depending on time-series variation and its improvements

Information Processing and Management: an International Journal
Automatic summarization of open-domain multiparty dialogues in diverse genres

Computational Linguistics - Summarization
The disambiguation of nominalizations

Computational Linguistics
Segmenting Conversations by Topic, Initiative, and Style

Information Retrieval Techniques for Speech Applications [this book is based on the workshop “Information Retrieval Techniques for Speech Applications”, held as part of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in New Orleans, USA, in September 2001].
The Role of Pause Occurrence and Pause Duration in the Signaling of Narrative Structure

PorTAL '02 Proceedings of the Third International Conference on Advances in Natural Language Processing
Integrating prosodic and lexical cues for automatic topic segmentation

Computational Linguistics
Evaluation of text coherence for electronic essay scoring systems

Natural Language Engineering
A bootstrapping approach for robust topic analysis

Natural Language Engineering
Resolving discourse deictic anaphora in dialogues

EACL '99 Proceedings of the ninth conference on European chapter of the Association for Computational Linguistics
PARADISE: a framework for evaluating spoken dialogue agents

ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Using collocations for topic segmentation and link detection

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Discourse segmentation of multi-party conversation

ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
An evaluation method of words tendency using decision tree

ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 2
The role of centering theory's rough-shift in the teaching and evaluation of writing skills

ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
Building a discourse-tagged corpus in the framework of Rhetorical Structure Theory

SIGDIAL '01 Proceedings of the Second SIGdial Workshop on Discourse and Dialogue - Volume 16
Reconciling initiative and discourse structure

SIGDIAL '01 Proceedings of the Second SIGdial Workshop on Discourse and Dialogue - Volume 16
Bridging the gap between dialogue management and dialogue models

SIGDIAL '02 Proceedings of the 3rd SIGdial workshop on Discourse and dialogue - Volume 2
An orthonormal basis for topic segmentation in tutorial dialogue

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Arkose: reusing informal information from online discussions

Proceedings of the 2007 international ACM conference on Supporting group work
Inter-coder agreement for computational linguistics

Computational Linguistics
A Model of Discourse Segmentation and Segment Title Assignment for Lecture Speech Indexing

IEICE - Transactions on Information and Systems
Split and Merge Based Story Segmentation in News Videos

ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
Unsupervised methods of topical text segmentation for Polish

ACL '07 Proceedings of the Workshop on Balto-Slavonic Natural Language Processing: Information Extraction and Enabling Technologies
Switching to real-time tasks in multi-tasking dialogue

COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Using a probabilistic model of discourse relations to investigate word order variation

DiscAnnotation '04 Proceedings of the 2004 ACL Workshop on Discourse Annotation
Computer-based support for patients with limited English

EAMT '03 Proceedings of the 7th International EAMT workshop on MT and other Language Technology Tools, Improving MT through other Language Technology Tools: Resources and Tools for Building MT
Story segmentation of brodcast news in English, Mandarin and Arabic

NAACL-Short '06 Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers
Cue phrase classification using machine learning

Journal of Artificial Intelligence Research
Evaluating interactive dialogue systems: extending component evaluation to integrated system evaluation

ISDS '97 Interactive Spoken Dialog Systems on Bringing Speech and NLP Together in Real Applications
An analysis of quantitative aspects in the evaluation of thematic segmentation algorithms

SigDIAL '06 Proceedings of the 7th SIGdial Workshop on Discourse and Dialogue
A syntactic and lexical-based discourse segmenter

ACLShort '09 Proceedings of the ACL-IJCNLP 2009 Conference Short Papers
Participant subjectivity and involvement as a basis for discourse segmentation

SIGDIAL '09 Proceedings of the SIGDIAL 2009 Conference: The 10th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Contrasting the interaction structure of an email and a telephone corpus: a machine learning approach to annotation of dialogue function units

SIGDIAL '09 Proceedings of the SIGDIAL 2009 Conference: The 10th Annual Meeting of the Special Interest Group on Discourse and Dialogue
A study on the reliability of two discourse segmentation models

PROPOR'03 Proceedings of the 6th international conference on Computational processing of the Portuguese language
Retrospective analysis of cross-culture communication

Proceedings of the 3rd international conference on Intercultural collaboration
Evaluating hierarchical discourse segmentation

HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Automatic identification of discourse markers in dialogues: An in-depth study of like and well

Computer Speech and Language
An investigation of interruptions and resumptions in multi-tasking dialogues

Computational Linguistics
Speech retrieval from unsegmented finnish audio using statistical morpheme-like units for segmentation, recognition, and retrieval

ACM Transactions on Speech and Language Processing (TSLP)
TV news story segmentation based on semantic coherence and content similarity

MMM'10 Proceedings of the 16th international conference on Advances in Multimedia Modeling
Contextual correlation based thread detection in short text message streams

Journal of Intelligent Information Systems
Automated detection of local coherence in short argumentative essays based on centering theory

CICLing'12 Proceedings of the 13th international conference on Computational Linguistics and Intelligent Text Processing - Volume Part I
Topical segmentation: a study of human performance and a new measure of quality

NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Two-part segmentation of text documents

Proceedings of the 21st ACM international conference on Information and knowledge management
A semi-automatic text-based semantic video annotation system for Turkish facilitating multilingual retrieval

Expert Systems with Applications: An International Journal
Unsupervised text segmentation using LDA and MCMC

AusDM '12 Proceedings of the Tenth Australasian Data Mining Conference - Volume 134
Topic segmentation and labeling in asynchronous conversations

Journal of Artificial Intelligence Research

Quantified Score

Hi-index	0.00

Visualization

Abstract

The need to model the relation between discourse structure and linguistic features of utterances is almost universally acknowledged in the literature on discourse. However, there is only weak consensus on what the units of discourse structure are, or the criteria for recognizing and generating them. We present quantitative results of a two-part study using a corpus of spontaneous, narrative monologues. The first part of our paper presents a method for empirically validating multitutterance units referred to as discourse segments. We report highly significant results of segmentations performed by naive subjects, where a commonsense notion of speaker intention is the segmentation criterion. In the second part of our study, data abstracted from the subjects' segmentations serve as a target for evaluating two sets of algorithms that use utterance features to perform segmentation. On the first algorithm set, we evaluate and compare the correlation of discourse segmentation with three types of linguistic cues (referential noun phrases, cue words, and pauses). We then develop a second set using two methods: error analysis and machine learning. Testing the new algorithms on a new data set shows that when multiple sources of linguistic knowledge are used concurrently, algorithm performance improves.