Edit detection and parsing for transcribed speech

Authors:
Eugene Charniak;Mark Johnson
Affiliations:
Brown University, Providence, RI;Brown University, Providence, RI
Venue:
NAACL '01 Proceedings of the second meeting of the North American Chapter of the Association for Computational Linguistics on Language technologies
Year:
2001

Citing 14
Cited 27

Improved boosting algorithms using confidence-rated predictions

COLT' 98 Proceedings of the eleventh annual conference on Computational learning theory
Learning to Parse Natural Language with Maximum Entropy Models

Machine Learning - Special issue on natural language learning
Discriminative Reranking for Natural Language Parsing

ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Head-driven statistical models for natural language parsing

Head-driven statistical models for natural language parsing
Speech repairs, intonational phrases, and discourse markers: modeling speakers' utterances in spoken dialogue

Computational Linguistics
A maximum-entropy-inspired parser

NAACL 2000 Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference
Three generative, lexicalised models for statistical parsing

ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Intonational boundaries, speech repairs and discourse markers: modeling spoken dialog

ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Deterministic parsing of syntactic non-fluencies

ACL '83 Proceedings of the 21st annual meeting on Association for Computational Linguistics
Statistical decision-tree models for parsing

ACL '95 Proceedings of the 33rd annual meeting on Association for Computational Linguistics
A new statistical parser based on bigram lexical dependencies

ACL '96 Proceedings of the 34th annual meeting on Association for Computational Linguistics
Integrating multiple knowledge sources for detection and correction of repairs in human-computer dialog

ACL '92 Proceedings of the 30th annual meeting on Association for Computational Linguistics
A syntactic framework for speech repairs and other disruptions

ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Statistical parsing with a context-free grammar and word statistics

AAAI'97/IAAI'97 Proceedings of the fourteenth national conference on artificial intelligence and ninth conference on Innovative applications of artificial intelligence

Robust garden path parsing

Natural Language Engineering
Parsing and disfluency placement

EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
Extracting clauses for spoken language understanding in conversational systems

EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
A TAG-based noisy channel model of speech repairs

ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
PCFGs with syntactic and prosodic indicators of speech repairs

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Learning the structure of task-driven human-human dialogs

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
A progressive feature selection algorithm for ultra large feature spaces

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Effective use of prosody in parsing conversational speech

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Where Do Parsing Errors Come From

TSD '08 Proceedings of the 11th international conference on Text, Speech and Dialogue
Syntactic complexity measures for detecting mild cognitive impairment

BioNLP '07 Proceedings of the Workshop on BioNLP 2007: Biological, Translational, and Clinical Language Processing
A look at parsing and its applications

AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Parsing conversational speech using enhanced segmentation

HLT-NAACL-Short '04 Proceedings of HLT-NAACL 2004: Short Papers
A lexically-driven algorithm for disfluency detection

HLT-NAACL-Short '04 Proceedings of HLT-NAACL 2004: Short Papers
Early deletion of fillers in processing conversational speech

NAACL-Short '06 Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers
Domain adaptation with artificial data for semantic parsing of speech

NAACL-Short '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers
Improved features and models for detecting edit disfluencies in transcribing spontaneous Mandarin speech

IEEE Transactions on Audio, Speech, and Language Processing
Exploring features for identifying edited regions in disfluent sentences

Parsing '05 Proceedings of the Ninth International Workshop on Parsing Technology
From extractive to abstractive meeting summaries: can it be done by sentence compression?

ACLShort '09 Proceedings of the ACL-IJCNLP 2009 Conference Short Papers
Wide-coverage parsing of speech transcripts

IWPT '09 Proceedings of the 11th International Conference on Parsing Technologies
Word buffering models for improved speech repair parsing

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
Symbolic-to-statistical hybridization: extending generation-heavy machine translation

Machine Translation
Self-training with products of latent variable grammars

EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
The impact of language models and loss functions on repair disfluency detection

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Book review: parsing schemata for practical text analysis carlos gómez rodríguez (university of a coruña) london: Imperial college press (mathematics, computing, language, and life series, edited by carlos martin-vide, volume 1), 2010, xiv+275 pp; hardbound, isbn 978-1-84816-560-1, $89.00

Computational Linguistics
Affirmative cue words in task-oriented dialogue

Computational Linguistics
Contextual maximum entropy model for edit disfluency detection of spontaneous speech

ISCSLP'06 Proceedings of the 5th international conference on Chinese Spoken Language Processing
CoNLL-2011 shared task: modeling unrestricted coreference in OntoNotes

CONLL Shared Task '11 Proceedings of the Fifteenth Conference on Computational Natural Language Learning: Shared Task

Quantified Score

Hi-index	0.00

Visualization

Abstract

We present a simple architecture for parsing transcribed speech in which an edited-word detector first removes such words from the sentence string, and then a standard statistical parser trained on transcribed speech parses the remaining words. The edit detector achieves a misclassification rate on edited words of 2.2%. (The NULL-model, which marks everything as not edited, has an error rate of 5.9%.) To evaluate our parsing results we introduce a new evaluation metric, the purpose of which is to make evaluation of a parse tree relatively indifferent to the exact tree position of EDITED nodes. By this metric the parser achieves 85.3% precision and 86.5% recall.