A new reference collection of patent documents for training and testing automated categorization systems is established and described in detail. This collection is tailored for automating the attribution of international patent classification codes to patent applications and is made publicly available for future research work. We report the results of applying a variety of machine learning algorithms to the automated categorization of English-language patent documents. This procedure involves a complex hierarchical taxonomy, within which we classify documents into 114 classes and 451 subclasses. Several measures of categorization success are described and evaluated. We investigate how best to resolve the training problems related to the attribution of multiple classification codes to each patent document.
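The two-level setup and the multiple-code problem described above can be illustrated with a small sketch. It is not the authors' evaluation code. It assumes standard IPC symbols such as "A01B 1/00", where the first three characters give the class and the first four the subclass. The names `ipc_levels`, `top_k_hit`, and `hit_rate` are hypothetical, and the "top-k hit" measure is only one plausible way to score a ranked list of predictions against a document that carries several codes.

```python
from typing import Iterable, List, Sequence, Set, Tuple


def ipc_levels(code: str) -> Tuple[str, str]:
    """Split an IPC symbol such as 'A01B 1/00' into (class, subclass).

    Assumes the standard IPC layout: section letter + two-digit class
    + subclass letter, e.g. 'A01' and 'A01B'.
    """
    symbol = code.strip().upper()
    if len(symbol) < 4:
        raise ValueError(f"IPC symbol too short: {code!r}")
    return symbol[:3], symbol[:4]


def top_k_hit(ranked: Sequence[str], true_codes: Iterable[str], k: int = 1) -> bool:
    """Hypothetical measure: True if any of the first k ranked
    predictions matches any of the document's codes."""
    truth: Set[str] = set(true_codes)
    return any(pred in truth for pred in ranked[:k])


def hit_rate(docs: List[Tuple[Sequence[str], Set[str]]], k: int = 1) -> float:
    """Fraction of documents with at least one correct code in the top k."""
    if not docs:
        raise ValueError("no documents to evaluate")
    hits = sum(top_k_hit(ranked, truth, k) for ranked, truth in docs)
    return hits / len(docs)


# Illustrative data only (not taken from the collection):
# each entry is (ranked subclass predictions, set of assigned subclasses).
example_docs = [
    (["A01B", "B60K"], {"B60K"}),
    (["H04L"], {"H04L", "G06F"}),
]
```

Scoring against the full set of assigned codes, rather than a single main code, is one way to avoid penalizing a classifier for predicting a legitimate secondary classification. Raising `k` relaxes the measure further.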