The pyramid-technique: towards breaking the curse of dimensionality
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
When Is ''Nearest Neighbor'' Meaningful?
ICDT '99 Proceedings of the 7th International Conference on Database Theory
The Haar Wavelet Transform in the Time Series Similarity Paradigm
PKDD '99 Proceedings of the Third European Conference on Principles of Data Mining and Knowledge Discovery
Similarity Search in High Dimensions via Hashing
VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Distinctive Image Features from Scale-Invariant Keypoints
International Journal of Computer Vision
Locality-sensitive hashing scheme based on p-stable distributions
SCG '04 Proceedings of the twentieth annual symposium on Computational geometry
Automatic music video generation based on temporal pattern analysis
Proceedings of the 12th annual ACM international conference on Multimedia
Foundations of Multidimensional and Metric Data Structures (The Morgan Kaufmann Series in Computer Graphics and Geometric Modeling)
Automatic generation of personalized music sports video
Proceedings of the 13th annual ACM international conference on Multimedia
SIAM Journal on Discrete Mathematics
Emotion-based impressionism slideshow with automatic music accompaniment
Proceedings of the 15th international conference on Multimedia
Automatic Generation of Music Slide Show Using Personal Photos
ISM '08 Proceedings of the 2008 Tenth IEEE International Symposium on Multimedia
Music analysis, retrieval and synthesis of audio signals MARSYAS
MM '09 Proceedings of the 17th ACM international conference on Multimedia
Proceedings of the international conference on Multimedia
Overview of the MPEG-7 standard
IEEE Transactions on Circuits and Systems for Video Technology
IEEE Transactions on Circuits and Systems for Video Technology
Optimization-based automated home video editing system
IEEE Transactions on Circuits and Systems for Video Technology
PICASSO: automated soundtrack suggestion for multi-modal data
Proceedings of the 20th ACM international conference on Information and knowledge management
Knowledge-based music retrieval for places of interest
Proceedings of the second international ACM workshop on Music information retrieval with user-centered and multimodal strategies
MuseSync: standing on the shoulders of Hollywood
Proceedings of the 20th ACM international conference on Multimedia
Being picky: processing top-k queries with set-defined selections
Proceedings of the 21st ACM international conference on Information and knowledge management
SRbench--a benchmark for soundtrack recommendation systems
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Hi-index | 0.00 |
We study the problem of automatically assigning appropriate music pieces to a picture or, in general, series of pictures. This task, commonly referred to as soundtrack suggestion, is non-trivial as it requires a lot of human attention and a good deal of experience, with master pieces distinguished, e.g., with the Academy Award for Best Original Score. We put forward PICASSO to solve this task in a fully automated way. PICASSO makes use of genuine samples obtained from first-class contemporary movies. Hence, the training set can be arbitrarily large and is also inexpensive to obtain but still provides an excellent source of information. At query time, PICASSO employs a three-level algorithm. First, it selects for a given query image a ranking of the most similar screenshots taken, and subsequently, selects for each screenshot the most similar songs to the music played in the movie when the screenshot was taken. Last, it issues a top-K aggregation algorithm to find the overall best suitable songs available. We have created a large training set consisting of over 40,000 image/soundtrack samples obtained from 28 movies and evaluated the suitability of PICASSO by means of a user study.