We propose an associative document retrieval method, in which a document is used as a query to search for other similar documents. Because a long document usually covers more than one topic, we first analyze the query document to extract multiple subtopic elements. For each subtopic element, a sub-query is produced and similar documents are retrieved, each with a relevance score. The relevance scores are weighted by the importance of the corresponding subtopic element and then integrated to determine the final set of relevant documents. To calculate subtopic importance, we evaluate the specificity of each query term using entropy, which measures how unevenly the occurrences of the term are distributed across the subtopic elements. We apply this method to invalidity patent search. Exploiting a structural characteristic of Japanese patent claims, we use features that distinguish the preamble from the essential portion of a query patent claim. To demonstrate the effectiveness of our method, we evaluated it experimentally on five years of patent documents.
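The weighting and integration steps described above can be illustrated with a minimal sketch. The abstract does not give the formulas, so everything specific here is an assumption made for illustration: the function names, normalizing entropy by its maximum, averaging term specificity to obtain a subtopic weight, and combining scores as a weighted sum. Subtopic elements are assumed to be already tokenized, and the per-subtopic retrieval scores are assumed to come from some existing search engine.

```python
import math


def term_entropy(term, subtopics):
    """Entropy (bits) of a term's occurrences across subtopic elements."""
    counts = [tokens.count(term) for tokens in subtopics]
    total = sum(counts)
    if total == 0:
        return 0.0
    probs = [c / total for c in counts if c > 0]
    return sum(-p * math.log2(p) for p in probs)


def subtopic_importance(subtopics):
    """Assumed weighting: a subtopic is more important when its terms are
    concentrated in few subtopic elements (low entropy, high specificity)."""
    n = len(subtopics)
    max_entropy = math.log2(n) if n > 1 else 1.0
    raw = []
    for tokens in subtopics:
        terms = set(tokens)
        if not terms:
            raw.append(0.0)
            continue
        # Specificity in [0, 1]: 1 means the term occurs in one element only.
        spec = [1.0 - term_entropy(t, subtopics) / max_entropy for t in terms]
        raw.append(sum(spec) / len(spec))
    total = sum(raw)
    if total == 0:
        return [1.0 / n] * n  # fall back to uniform weights
    return [r / total for r in raw]


def integrate_scores(per_subtopic_scores, weights):
    """Weighted sum of per-subtopic relevance scores, ranked best first."""
    final = {}
    for scores, weight in zip(per_subtopic_scores, weights):
        for doc_id, score in scores.items():
            final[doc_id] = final.get(doc_id, 0.0) + weight * score
    return sorted(final.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    # Hypothetical tokenized subtopic elements of a query document.
    subtopics = [["rotor", "shaft", "bearing"], ["rotor", "housing"]]
    weights = subtopic_importance(subtopics)
    # Hypothetical relevance scores returned for each sub-query.
    scores = [{"doc1": 0.9, "doc2": 0.4}, {"doc2": 0.8}]
    for doc_id, score in integrate_scores(scores, weights):
        print(doc_id, round(score, 3))
```

In this sketch a term that appears in every subtopic element contributes little to any element's weight, while a term confined to one element raises that element's weight. Other choices, such as weighting by term frequency or combining scores by rank rather than by sum, would fit the description in the abstract equally well.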