Audio-Based copy detection in the large-scale internet videos
PCM'12 Proceedings of the 13th Pacific-Rim conference on Advances in Multimedia Information Processing
Content-based copy detection through multimodal feature representation and temporal pyramid matching
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
In this paper, we present a new feature extraction algorithm for fingerprint systems, referred to as weighted ASF (WASF). The feature is extracted based on an MPEG-7 descriptor, Audio Spectrum Flatness (ASF), and on the Human Auditory System (HAS). The algorithm also applies several effective filters and another MPEG-7 descriptor, Audio Signature (AS). It is tested under several audio distortions, including sampling rate change, noise addition, and speed change. For these distortions, the WASF algorithm achieves more than 90% discrimination. The MFCC feature and another MPEG-7 descriptor, Audio Spectrum Centroid (ASC), are also considered.
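The abstract does not spell out how WASF is computed, so the following is only a minimal sketch of the spectral flatness measure that underlies ASF: the ratio of the geometric mean to the arithmetic mean of the power spectrum within a frequency band. The function names, the Hann window, and the band edges are assumptions made for illustration. The HAS weighting, the filters, and the Audio Signature step mentioned above are not modeled.

```python
import numpy as np


def spectral_flatness(power, eps=1e-12):
    """Geometric mean divided by arithmetic mean of a power spectrum slice.

    Values near 1 indicate a noise-like (flat) spectrum; values near 0
    indicate a tonal (peaky) spectrum. `eps` avoids log(0) and is an
    illustrative choice, not a value from the paper.
    """
    power = np.asarray(power, dtype=float) + eps
    geometric_mean = np.exp(np.mean(np.log(power)))
    arithmetic_mean = np.mean(power)
    return geometric_mean / arithmetic_mean


def band_flatness(frame, sample_rate, band_edges_hz):
    """Per-band flatness for one audio frame (hypothetical helper).

    `band_edges_hz` is an ascending list of edges; band i covers
    [edges[i], edges[i + 1]). Empty bands are reported as 0.0.
    """
    frame = np.asarray(frame, dtype=float)
    windowed = frame * np.hanning(len(frame))
    power = np.abs(np.fft.rfft(windowed)) ** 2
    freqs = np.fft.rfftfreq(len(frame), d=1.0 / sample_rate)

    values = []
    for low, high in zip(band_edges_hz[:-1], band_edges_hz[1:]):
        mask = (freqs >= low) & (freqs < high)
        values.append(spectral_flatness(power[mask]) if mask.any() else 0.0)
    return np.array(values)
```

Calling `band_flatness(frame, 8000, [250, 500, 1000, 2000, 4000])` on a 1024-sample frame returns four values, one per band. A pure tone drives the flatness of the band containing it toward 0, while white noise yields markedly higher values in every band, which is the property a flatness-based fingerprint relies on.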