Cepstral domain segmental feature vector normalization for noise robust speech recognition
Speech Communication - Special issue on robust speech recognition
Comparison of different implementations of MFCC
Journal of Computer Science and Technology
Construction and Evaluation of a Robust Multifeature Speech/Music Discriminator
ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)-Volume 2 - Volume 2
iRemember: a personal, long-term memory prosthesis
Proceedings of the 3rd ACM workshop on Continuous archival and retrival of personal experences
Real-time discrimination of broadcast speech/music
ICASSP '96 Proceedings of the Acoustics, Speech, and Signal Processing, 1996. on Conference Proceedings., 1996 IEEE International Conference - Volume 02
Lifelogging memory appliance for people with episodic memory impairment
UbiComp '08 Proceedings of the 10th international conference on Ubiquitous computing
Proceedings of the 6th ACM conference on Embedded network sensor systems
SoundSense: scalable sound sensing for people-centric applications on mobile phones
Proceedings of the 7th international conference on Mobile systems, applications, and services
A framework of energy efficient mobile sensing for automatic user state recognition
Proceedings of the 7th international conference on Mobile systems, applications, and services
Darwin phones: the evolution of sensing and inference on mobile phones
Proceedings of the 8th international conference on Mobile systems, applications, and services
EmotionSense: a mobile phones based adaptive platform for experimental social psychology research
Proceedings of the 12th ACM international conference on Ubiquitous computing
LittleRock: Enabling Energy-Efficient Continuous Sensing on Mobile Phones
IEEE Pervasive Computing
SenseCam: a retrospective memory aid
UbiComp'06 Proceedings of the 8th international conference on Ubiquitous Computing
Context-based security: state of the art, open research topics and a case study
CASEMANS '11 Proceedings of the 5th ACM International Workshop on Context-Awareness for Self-Managing Systems
mConverse: inferring conversation episodes from respiratory measurements collected in the field
Proceedings of the 2nd Conference on Wireless Health
Progressive authentication: deciding when to authenticate on mobile phones
Security'12 Proceedings of the 21st USENIX conference on Security symposium
Improving energy efficiency of personal sensing applications with heterogeneous multi-processors
Proceedings of the 2012 ACM Conference on Ubiquitous Computing
StressSense: detecting stress in unconstrained acoustic environments using smartphones
Proceedings of the 2012 ACM Conference on Ubiquitous Computing
Low cost crowd counting using audio tones
Proceedings of the 10th ACM Conference on Embedded Network Sensor Systems
Proceedings of the 14th Workshop on Mobile Computing Systems and Applications
Auditeur: a mobile-cloud service platform for acoustic event detection on smartphones
Proceeding of the 11th annual international conference on Mobile systems, applications, and services
MoodScope: building a mood sensor from smartphone usage patterns
Proceeding of the 11th annual international conference on Mobile systems, applications, and services
SocioPhone: everyday face-to-face interaction monitoring platform using multi-phone sensor fusion
Proceeding of the 11th annual international conference on Mobile systems, applications, and services
Crowd++: unsupervised speaker count with smartphones
Proceedings of the 2013 ACM international joint conference on Pervasive and ubiquitous computing
Robust voice activity detection for social sensing
Proceedings of the 2013 ACM conference on Pervasive and ubiquitous computing adjunct publication
Proceedings of the 2013 ACM SIGPLAN international conference on Object oriented programming systems languages & applications
SocialWeaver: collaborative inference of human conversation networks using smartphones
Proceedings of the 11th ACM Conference on Embedded Networked Sensor Systems
Proceedings of the 11th ACM Conference on Embedded Networked Sensor Systems
Reduce the Number of Sensors: Sensing Acoustic Emissions to Estimate Appliance Energy Usage
Proceedings of the 5th ACM Workshop on Embedded Systems For Energy-Efficient Buildings
TalkBetter: family-driven mobile intervention care for children with language delay
Proceedings of the 17th ACM conference on Computer supported cooperative work & social computing
TIPS: context-aware implicit user identification using touch screen in uncontrolled environments
Proceedings of the 15th Workshop on Mobile Computing Systems and Applications
Local business ambience characterization through mobile audio sensing
Proceedings of the 23rd international conference on World wide web
Smartphone sensing offloading for efficiently supporting social sensing applications
Pervasive and Mobile Computing
Activity recognition for creatures of habit
Personal and Ubiquitous Computing
Hi-index | 0.00 |
Automatically identifying the person you are talking with using continuous audio sensing has the potential to enable many pervasive computing applications from memory assistance to annotating life logging data. However, a number of challenges, including energy efficiency and training data acquisition, must be addressed before unobtrusive audio sensing is practical on mobile devices. We built SpeakerSense, a speaker identification prototype that uses a heterogeneous multi-processor hardware architecture that splits computation between a low power processor and the phone's application processor to enable continuous background sensing with minimal power requirements. Using SpeakerSense, we benchmarked several system parameters (sampling rate, GMM complexity, smoothing window size, and amount of training data needed) to identify thresholds that balance computation cost with performance. We also investigated channel compensation methods that make it feasible to acquire training data from phone calls and an automatic segmentation method for training speaker models based on one-to-one conversations.