SpeakerSense: energy efficient unobtrusive speaker identification on mobile phones

Authors:
Hong Lu;A. J. Bernheim Brush;Bodhi Priyantha;Amy K. Karlson;Jie Liu
Affiliations:
Dept. of Computer Science, Dartmouth College, Hanover, NH and Microsoft Research, Redmond, WA;Microsoft Research, Redmond, WA;Microsoft Research, Redmond, WA;Microsoft Research, Redmond, WA;Microsoft Research, Redmond, WA
Venue:
Pervasive'11 Proceedings of the 9th international conference on Pervasive computing
Year:
2011

Abstract

Automatically identifying the person you are talking with through continuous audio sensing could enable many pervasive computing applications, from memory assistance to annotating life-logging data. However, several challenges, including energy efficiency and training data acquisition, must be addressed before unobtrusive audio sensing is practical on mobile devices. We built SpeakerSense, a speaker identification prototype that uses a heterogeneous multi-processor hardware architecture to split computation between a low-power processor and the phone's application processor, enabling continuous background sensing with minimal power requirements. Using SpeakerSense, we benchmarked several system parameters (sampling rate, GMM complexity, smoothing window size, and amount of training data needed) to identify thresholds that balance computation cost with performance. We also investigated channel compensation methods that make it feasible to acquire training data from phone calls, and an automatic segmentation method for training speaker models from one-to-one conversations.
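
The abstract names two of the benchmarked parameters, GMM complexity and smoothing window size, without giving details. As a rough illustration only, the sketch below shows one common way such a pipeline can be arranged: each feature frame is scored against a per-speaker Gaussian mixture model, and the per-frame decisions are then smoothed by a sliding majority vote. The diagonal covariances, the majority-vote rule, and all function and speaker names are assumptions made for this example; they are not taken from the paper and may differ from what SpeakerSense actually does.

```python
import math
from collections import Counter


def diag_gmm_loglik(x, weights, means, variances):
    """Log-likelihood of one feature vector under a diagonal-covariance GMM."""
    comp_logs = []
    for w, mu, var in zip(weights, means, variances):
        ll = math.log(w)
        for xi, mi, vi in zip(x, mu, var):
            ll += -0.5 * (math.log(2 * math.pi * vi) + (xi - mi) ** 2 / vi)
        comp_logs.append(ll)
    # Log-sum-exp over mixture components for numerical stability.
    m = max(comp_logs)
    return m + math.log(sum(math.exp(c - m) for c in comp_logs))


def classify_frame(x, models):
    """Return the speaker whose GMM gives the frame the highest likelihood.

    `models` maps a (hypothetical) speaker name to (weights, means, variances).
    """
    return max(models, key=lambda name: diag_gmm_loglik(x, *models[name]))


def smooth_labels(labels, window):
    """Majority vote over the most recent `window` per-frame labels."""
    smoothed = []
    for i in range(len(labels)):
        recent = labels[max(0, i - window + 1): i + 1]
        smoothed.append(Counter(recent).most_common(1)[0][0])
    return smoothed


if __name__ == "__main__":
    # Toy one-dimensional, single-component models for two made-up speakers.
    models = {
        "alice": ([1.0], [[0.0]], [[1.0]]),
        "bob": ([1.0], [[5.0]], [[1.0]]),
    }
    frames = [[0.2], [0.1], [4.7], [-0.3], [0.4]]
    raw = [classify_frame(f, models) for f in frames]
    print(raw)
    print(smooth_labels(raw, window=3))
```

In a sketch like this, adding mixture components raises the per-frame scoring cost, while a longer smoothing window trades responsiveness for stability. That is the kind of cost-versus-performance balance the abstract describes benchmarking.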