Through the recent NTCIR workshops, patent retrieval has posed many challenging problems for the information retrieval community. Unlike newspaper articles, patent documents are very long and well structured. These characteristics make it necessary to reassess existing retrieval techniques, which have mainly been developed for short, unstructured documents such as newspaper articles. This study investigates cluster-based retrieval in the context of the invalidity search task of patent retrieval. Cluster-based retrieval assumes that clusters provide additional evidence for matching a user's information need. Thus far, cluster-based retrieval approaches have relied on automatically created clusters. Fortunately, all patents carry manually assigned cluster information in the form of International Patent Classification (IPC) codes. The IPC is a standard taxonomy for classifying patents and currently has about 69,000 nodes organized into a five-level hierarchy. Patent documents could therefore provide the best test bed for developing and evaluating cluster-based retrieval techniques. Experiments using the NTCIR-4 patent collection showed that the cluster-based language model can help improve on the cluster-less baseline language model.
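To make the idea of cluster evidence concrete, the following is a minimal sketch of one common way to build a cluster-based language model: the document model is interpolated with the model of its cluster (here, all patents sharing an international patent classification code), which is in turn smoothed with the collection model. The abstract does not give the exact formulation used in the study, so the interpolation scheme, the weights `lam` and `beta`, the function names, and the toy patents are all assumptions made for illustration.

```python
import math
from collections import Counter


def unigram(tokens):
    """Maximum-likelihood unigram model: term -> relative frequency."""
    counts = Counter(tokens)
    total = sum(counts.values())
    return {term: count / total for term, count in counts.items()}


def score_document(query, doc_tokens, cluster_tokens, collection_tokens,
                   lam=0.5, beta=0.3):
    """Log query likelihood under an assumed two-stage interpolation.

    p(w|d) = (1 - lam) * p_ml(w|doc)
             + lam * ((1 - beta) * p_ml(w|cluster) + beta * p_ml(w|collection))

    Setting beta=1.0 removes the cluster term and leaves a cluster-less
    baseline smoothed only with the collection model.
    """
    p_doc = unigram(doc_tokens)
    p_cluster = unigram(cluster_tokens)
    p_collection = unigram(collection_tokens)
    log_p = 0.0
    for term in query:
        background = ((1 - beta) * p_cluster.get(term, 0.0)
                      + beta * p_collection.get(term, 0.0))
        p = (1 - lam) * p_doc.get(term, 0.0) + lam * background
        if p == 0.0:
            return float("-inf")  # term unseen anywhere in the collection
        log_p += math.log(p)
    return log_p


# Toy patents (hypothetical): id -> (tokens, classification code).
patents = {
    "p1": (["battery", "electrode", "lithium"], "H01M"),
    "p2": (["battery", "charger", "circuit"], "H01M"),
    "p3": (["engine", "piston", "fuel"], "F02B"),
}

# Each cluster is the concatenation of all patents sharing a code.
clusters = {}
for tokens, code in patents.values():
    clusters.setdefault(code, []).extend(tokens)
collection = [t for tokens, _ in patents.values() for t in tokens]

query = ["lithium", "charger"]
cluster_scores = {
    pid: score_document(query, tokens, clusters[code], collection)
    for pid, (tokens, code) in patents.items()
}
baseline_scores = {
    pid: score_document(query, tokens, clusters[code], collection, beta=1.0)
    for pid, (tokens, code) in patents.items()
}
```

In this toy setting, patent `p1` never mentions "charger", but its cluster does, so the cluster term raises its score relative to the cluster-less baseline. That is the kind of additional evidence the cluster hypothesis predicts; whether it helps on real data depends on the collection and on how the weights are tuned.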