Cluster-based patent retrieval

Authors:
In-Su Kang;Seung-Hoon Na;Jungi Kim;Jong-Hyeok Lee
Affiliations:
Korea Institute of Science and Technology Information, Pohang University of Science and Technology (POSTECH), Advanced Information Technology Research Center (AITrc), Republic of Korea;Division of Electrical and Computer Engineering, Pohang University of Science and Technology (POSTECH), Advanced Information Technology Research Center (AITrc), Republic of Korea;Division of Electrical and Computer Engineering, Pohang University of Science and Technology (POSTECH), Advanced Information Technology Research Center (AITrc), Republic of Korea;Division of Electrical and Computer Engineering, Pohang University of Science and Technology (POSTECH), Advanced Information Technology Research Center (AITrc), Republic of Korea
Venue:
Information Processing and Management: an International Journal
Year:
2007

Abstract

Through the recent NTCIR workshops, patent retrieval has posed many challenging issues for the information retrieval community. Unlike newspaper articles, patent documents are very long and well structured. These characteristics make it necessary to reassess existing retrieval techniques, which have mainly been developed for short, unstructured documents such as newspaper articles. This study investigates cluster-based retrieval in the context of the invalidity search task of patent retrieval. Cluster-based retrieval assumes that clusters provide additional evidence for matching a user's information need. Thus far, cluster-based retrieval approaches have relied on automatically created clusters. Patents, however, all carry manually assigned cluster information in the form of International Patent Classification (IPC) codes. The IPC is a standard taxonomy for classifying patents and currently has about 69,000 nodes organized into a five-level hierarchy. Patent documents could therefore provide the best test bed for developing and evaluating cluster-based retrieval techniques. Experiments on the NTCIR-4 patent collection showed that the cluster-based language model can help improve on the cluster-less baseline language model.
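The abstract does not state the formula of the cluster-based language model, so the following is only a minimal sketch of one common form of the idea, not the authors' method: a query-likelihood score in which each document model is linearly interpolated with the model of its cluster and with the collection model. Treating a single IPC code per patent as the cluster, the function names, the toy data, and the weights `lam_cluster` and `lam_coll` are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict


def build_models(docs):
    """docs: {doc_id: (ipc_code, tokens)}.

    Returns term counts per document, per IPC cluster, and for the collection.
    """
    doc_tf = {}
    cluster_tf = defaultdict(Counter)
    coll_tf = Counter()
    for doc_id, (ipc, tokens) in docs.items():
        tf = Counter(tokens)
        doc_tf[doc_id] = tf
        cluster_tf[ipc].update(tf)
        coll_tf.update(tf)
    return doc_tf, cluster_tf, coll_tf


def ml(tf, term):
    """Maximum-likelihood estimate of P(term) from a bag of term counts."""
    total = sum(tf.values())
    return tf[term] / total if total else 0.0


def score(query, doc_id, docs, models, lam_cluster=0.3, lam_coll=0.2):
    """Log query likelihood under a document model smoothed with its
    cluster model and the collection model (weights are hypothetical)."""
    doc_tf, cluster_tf, coll_tf = models
    ipc = docs[doc_id][0]
    log_p = 0.0
    for term in query:
        if coll_tf[term] == 0:
            continue  # term unseen in the whole collection: skip it
        p = ((1.0 - lam_cluster - lam_coll) * ml(doc_tf[doc_id], term)
             + lam_cluster * ml(cluster_tf[ipc], term)
             + lam_coll * ml(coll_tf, term))
        log_p += math.log(p)
    return log_p


# Toy collection: two patents share one (made-up) IPC cluster, one does not.
docs = {
    "d1": ("G06F", ["query", "index", "retrieval"]),
    "d2": ("G06F", ["cluster", "retrieval", "model"]),
    "d3": ("A61K", ["drug", "compound", "dose"]),
}
models = build_models(docs)
ranking = sorted(docs, key=lambda d: score(["cluster"], d, docs, models),
                 reverse=True)
```

In this toy setup, `d1` does not contain the query term but shares a cluster with `d2`, which does, so the cluster component lifts `d1` above `d3`. That is the kind of additional evidence the abstract attributes to clusters; with `lam_cluster=0` the score reduces to a cluster-less baseline smoothed only by the collection.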