Inferring social nature of conversations from words: Experiments on a corpus of everyday telephone conversations

Authors:
Anthony Stark;Izhak Shafran;Jeffrey Kaye
Affiliations:
-;-;-
Venue:
Computer Speech and Language
Year:
2014

Citing 10
Cited 0

Latent dirichlet allocation

The Journal of Machine Learning Research
Enriching the knowledge sources used in a maximum entropy part-of-speech tagger

EMNLP '00 Proceedings of the 2000 Joint SIGDAT conference on Empirical methods in natural language processing and very large corpora: held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics - Volume 13
Extracting social meaning: identifying interactional style in spoken conversation

NAACL '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Detection and interpretation of opinion expressions in spoken surveys

IEEE Transactions on Audio, Speech, and Language Processing
Extractive speech summarization using shallow rhetorical structure modeling

IEEE Transactions on Audio, Speech, and Language Processing
SWITCHBOARD: telephone speech corpus for research and development

ICASSP'92 Proceedings of the 1992 IEEE international conference on Acoustics, speech and signal processing - Volume 1
LIBSVM: A library for support vector machines

ACM Transactions on Intelligent Systems and Technology (TIST)
Towards the detection of social dominance in dialogue

Speech Communication
A Framework for Automatic Human Emotion Classification Using Emotion Profiles

IEEE Transactions on Audio, Speech, and Language Processing
Stance classification using dialogic properties of persuasion

NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

Quantified Score

Hi-index	0.00

Visualization

Abstract

Language is being increasingly harnessed to not only create natural human-machine interfaces but also to infer social behaviors and interactions. In the same vein, we investigate a novel spoken language task, of inferring social relationships in two-party conversations: whether the two parties are related as family, strangers or are involved in business transactions. For our study, we created a corpus of all incoming and outgoing calls from a few homes over the span of a year. On this unique naturalistic corpus of everyday telephone conversations, which is unlike Switchboard or any other public domain corpora, we demonstrate that standard natural language processing techniques can achieve accuracies of about 88%, 82%, 74% and 80% in differentiating business from personal calls, family from non-family calls, familiar from unfamiliar calls and family from other personal calls respectively. Through a series of experiments with our classifiers, we characterize the properties of telephone conversations and find: (a) that 30 words of openings (beginnings) are sufficient to predict business from personal calls, which could potentially be exploited in designing context sensitive interfaces in smart phones; (b) our corpus-based analysis does not support Schegloff and Sack's manual analysis of exemplars in which they conclude that pre-closings differ significantly between business and personal calls - closing fared no better than a random segment; and (c) the distribution of different types of calls are stable over durations as short as 1-2 months. In summary, our results show that social relationships can be inferred automatically in two-party conversations with sufficient accuracy to support practical applications.