The evolution of digital libraries and the Internet has dramatically transformed the processing, storage, and retrieval of information. Efforts to digitize text, images, video, and audio now consume a substantial portion of both academic and industrial activity. Even when there is no shortage of textual materials on a particular topic, procedures for indexing or extracting the knowledge or conceptual information contained in them can be lacking. Recently developed information retrieval technologies are based on the concept of a vector space. Data are modeled as a matrix, and a user's query of the database is represented as a vector. Relevant documents in the database are then identified via simple vector operations. Orthogonal factorizations of the matrix provide mechanisms for handling uncertainty in the database itself. The purpose of this paper is to show how such fundamental mathematical concepts from linear algebra can be used to manage and index large text collections.
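The vector space idea described above can be illustrated with a minimal sketch. It assumes a toy term-by-document matrix, cosine similarity as the "simple vector operation", and a truncated singular value decomposition (SVD) as one example of an orthogonal factorization. The terms, documents, query, and the helper names `cosine_scores` and `rank_k_approximation` are hypothetical and chosen only for illustration; they are not taken from the paper.

```python
import numpy as np

# Hypothetical toy term-by-document matrix: rows are terms, columns are documents.
terms = ["matrix", "vector", "retrieval", "baking"]
A = np.array([
    [1.0, 1.0, 0.0],   # "matrix" appears in documents 0 and 1
    [1.0, 0.0, 0.0],   # "vector" appears in document 0
    [0.0, 1.0, 0.0],   # "retrieval" appears in document 1
    [0.0, 0.0, 1.0],   # "baking" appears in document 2
])


def cosine_scores(M, q):
    """Cosine of the angle between the query q and each column of M."""
    denom = np.linalg.norm(M, axis=0) * np.linalg.norm(q)
    # Avoid dividing by zero for empty documents or an empty query.
    safe = np.where(denom == 0.0, 1.0, denom)
    return np.where(denom == 0.0, 0.0, (M.T @ q) / safe)


def rank_k_approximation(M, k):
    """Rank-k approximation of M built from its truncated SVD."""
    U, s, Vt = np.linalg.svd(M, full_matrices=False)
    return U[:, :k] @ np.diag(s[:k]) @ Vt[:k, :]


# Query vector over the same terms: the user asks about "matrix" and "vector".
q = np.array([1.0, 1.0, 0.0, 0.0])

scores = cosine_scores(A, q)     # similarity of the query to each document
ranking = np.argsort(-scores)    # document indices, most similar first

# Scoring against a lower-rank matrix is one way to smooth noise in the data.
A2 = rank_k_approximation(A, 2)
scores_rank2 = cosine_scores(A2, q)

print("scores:", scores)
print("ranking:", ranking)
```

In this sketch the document that contains both query terms scores highest and the unrelated document scores zero. The choice of rank is an assumption; in practice it would be tuned to the collection.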