MM '09 Proceedings of the 17th ACM international conference on Multimedia
Multiple fundamental frequency estimation and polyphony inference of polyphonic music signals
IEEE Transactions on Audio, Speech, and Language Processing
Speech spectrum modeling for joint estimation of spectral envelope and fundamental frequency
IEEE Transactions on Audio, Speech, and Language Processing
Multipitch estimation of piano sounds using a new probabilistic spectral smoothness principle
IEEE Transactions on Audio, Speech, and Language Processing
Flexible harmonic temporal structure for modeling musical instrument
ICEC'10 Proceedings of the 9th international conference on Entertainment computing
Multiple fundamental frequency estimation by modeling spectral peaks and non-peak regions
IEEE Transactions on Audio, Speech, and Language Processing
Auxiliary-function-based independent component analysis for super-Gaussian sources
LVA/ICA'10 Proceedings of the 9th international conference on Latent variable analysis and signal separation
Single-channel speech separation based on long-short frame associated harmonic model
Digital Signal Processing
EvoCOMNET'10 Proceedings of the 2010 international conference on Applications of Evolutionary Computation - Volume Part II
Musical instrument identification based on new boosting algorithm with probabilistic decisions
CMMR'11 Proceedings of the 8th international conference on Speech, Sound and Music Processing: embracing research in India
Implement real-time polyphonic pitch detection and feedback system for the melodic instrument player
ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part IV
Journal of Intelligent Information Systems
Multi-pitch Streaming of Harmonic Sound Mixtures
IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP)
Automatic music transcription: challenges and future directions
Journal of Intelligent Information Systems
Hi-index | 0.00 |
This paper proposes a multipitch analyzer called the harmonic temporal structured clustering (HTC) method, that jointly estimates pitch, intensity, onset, duration, etc., of each underlying source in a multipitch audio signal. HTC decomposes the energy patterns diffused in time-frequency space, i.e., the power spectrum time series, into distinct clusters such that each has originated from a single source. The problem is equivalent to approximating the observed power spectrum time series by superimposed HTC source models, whose parameters are associated with the acoustic features that we wish to extract. The update equations of the HTC are explicitly derived by formulating the HTC source model with a Gaussian kernel representation. We verified through experiments the potential of the HTC method