Learning to extract symbolic knowledge from the World Wide Web
AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Introduction to Modern Information Retrieval
Introduction to Modern Information Retrieval
Learning Curved Multinomial Subfamilies for Natural Language Processing and Information Retrieval
ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Learning the Kernel Matrix with Semi-Definite Programming
ICML '02 Proceedings of the Nineteenth International Conference on Machine Learning
Text classification with kernels on the multinomial manifold
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Estimating and computing density based distance metrics
ICML '05 Proceedings of the 22nd international conference on Machine learning
Dirichlet component analysis: feature extraction for compositional data
Proceedings of the 25th international conference on Machine learning
Adaptive p-posterior mixture-model kernels for multiple instance learning
Proceedings of the 25th international conference on Machine learning
Proceedings of the 23rd international conference on World wide web
Hi-index | 0.00 |
We consider the problem of learning a Riemannian metric associated with a given differentiable manifold and a set of points. Our approach to the problem involves choosing a metric from a parametric family that is based on maximizing the inverse volume of a given dataset of points. From a statistical perspective, it is related to maximum likelihood under a model that assigns probabilities inversely proportional to the Riemannian volume element. We discuss in detail learninga metric on the multinomial simplex where the metric candidates are pull-back metrics of the Fisher information under a continuous group of transformations. When applied to documents, the resulting geodesics resemble, but outperform, the TFIDF cosine similarity measure in classification.