Semi-supervised metrics for textual data visualization

Authors:
Ángela Blanco;Manuel Martín-Merino
Affiliations:
Universidad Pontificia de Salamanca, Salamanca, Spain;Universidad Pontificia de Salamanca, Salamanca, Spain
Venue:
ICANN'07 Proceedings of the 17th international conference on Artificial neural networks
Year:
2007

Citing 12
Cited 0

Latent semantic indexing is an optimal special case of multidimensional scaling

SIGIR '92 Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval
Matrix computations (3rd ed.)

Matrix computations (3rd ed.)
Internet browsing and searching: user evaluations of category map and concept space techniques

Journal of the American Society for Information Science - Special topic issue: artificial intelligence techniques for emerging information systems applications
Matrices, Vector Spaces, and Information Retrieval

SIAM Review
A corpus-based approach to comparative evaluation of statistical term association measures

Journal of the American Society for Information Science and Technology
Modern Information Retrieval

Modern Information Retrieval
Learning to Classify Text Using Support Vector Machines: Methods, Theory and Algorithms

Learning to Classify Text Using Support Vector Machines: Methods, Theory and Algorithms
A Comparative Study on Feature Selection in Text Categorization

ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
On Using Partial Supervision for Text Categorization

IEEE Transactions on Knowledge and Data Engineering
A New Sammon Algorithm for Sparse Data Visualization

ICPR '04 Proceedings of the Pattern Recognition, 17th International Conference on (ICPR'04) Volume 1 - Volume 01
Learning from labeled and unlabeled data using a minimal number of queries

IEEE Transactions on Neural Networks
Artificial neural networks for feature extraction and multivariate data projection

IEEE Transactions on Neural Networks

Quantified Score

Hi-index	0.00

Visualization

Abstract

Multidimensional Scaling algorithms (MDS) are useful tools that help to discover high dimensional object relationships. They have been applied to a wide range of practical problems and particularly to the visualization of the semantic relations among documents or terms in textual databases. The MDS algorithms proposed in the literature often suffer from a low discriminant power due to its unsupervised nature and to the 'curse of dimensionality'. Fortunately, textual databases provide frequently a manually created classification for a subset of documents that may help to overcome this problem. In this paper we propose a semi-supervised version of the Torgerson MDS algorithm that takes advantage of this document classification to improve the discriminant power of the word maps generated. The algorithm has been applied to the visualization of term relationships. The experimental results show that the model proposed outperforms well known unsupervised alternatives.