We propose a novel audio fingerprinting method that is highly robust to time-scale modification (TSM) and pitch shifting. Rather than simply employing spectral or tempo-related features, our system builds on computer-vision techniques. We transform each 1-D audio signal into a 2-D image and treat TSM and pitch shifting of the audio signal as stretching and translation of the corresponding image. Robust local descriptors are extracted from the image and matched against those of the reference audio signals. Experimental results show that our system is highly robust to various audio distortions, including the challenging cases of TSM and pitch shifting.
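The image view of TSM and pitch shifting can be illustrated with a small sketch. The abstract does not say which 1-D to 2-D transform is used, so the code below assumes a log-frequency magnitude spectrogram, a common choice in which a pitch shift by a fixed ratio moves energy by a fixed number of frequency bins. All names (`tone`, `log_freq_image`, `peak_bin`) and parameter values are hypothetical and chosen for illustration only. A longer tone at the same pitch stands in for time-stretched audio, and the local-descriptor extraction and matching stages are not shown.

```python
import math

def tone(freq_hz, duration_s, sample_rate=8000):
    """Synthesize a pure sine tone (a stand-in for real audio)."""
    n = int(duration_s * sample_rate)
    return [math.sin(2.0 * math.pi * freq_hz * i / sample_rate) for i in range(n)]

def log_freq_image(signal, sample_rate=8000, f_min=110.0, bins_per_octave=12,
                   n_bins=24, frame_len=1024, hop=512):
    """Return image[t][b]: magnitude at frame t and log-spaced frequency bin b.

    Assumption: a log-frequency spectrogram is used as the 2-D image; the
    source does not specify the transform.
    """
    # Hann window to reduce spectral leakage between neighbouring bins.
    window = [0.5 - 0.5 * math.cos(2.0 * math.pi * n / (frame_len - 1))
              for n in range(frame_len)]
    # One windowed cosine/sine kernel per geometrically spaced frequency.
    kernels = []
    for b in range(n_bins):
        w = 2.0 * math.pi * f_min * 2.0 ** (b / bins_per_octave) / sample_rate
        kernels.append(([window[n] * math.cos(w * n) for n in range(frame_len)],
                        [window[n] * math.sin(w * n) for n in range(frame_len)]))
    image = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = signal[start:start + frame_len]
        row = []
        for cos_k, sin_k in kernels:
            re = sum(x * c for x, c in zip(frame, cos_k))
            im = sum(x * s for x, s in zip(frame, sin_k))
            row.append(math.hypot(re, im))
        image.append(row)
    return image

def peak_bin(image):
    """Index of the frequency bin with the most energy summed over time."""
    n_bins = len(image[0])
    energy = [sum(row[b] for row in image) for b in range(n_bins)]
    return max(range(n_bins), key=energy.__getitem__)

if __name__ == "__main__":
    base = log_freq_image(tone(220.0, 0.5))
    # Pitch shift up by 3 semitones: energy moves along the frequency axis.
    shifted = log_freq_image(tone(220.0 * 2 ** (3 / 12), 0.5))
    # Same pitch, twice as long: the image grows along the time axis only.
    stretched = log_freq_image(tone(220.0, 1.0))
    print("peak bins:", peak_bin(base), peak_bin(shifted), peak_bin(stretched))
    print("frames:", len(base), len(stretched))
```

With 12 bins per octave, the 3-semitone shift moves the peak by three bins along the frequency axis, while the longer tone keeps its peak bin and only adds frames along the time axis. This is why descriptors that tolerate translation and stretching of the image are a natural fit for these distortions.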