Automated Classification and Categorization of Mathematical Knowledge

Authors:
Radim Řehůřek;Petr Sojka
Affiliations:
Faculty of Informatics, Masaryk University, Brno, Czech Republic;Faculty of Informatics, Masaryk University, Brno, Czech Republic
Venue:
Proceedings of the 9th AISC international conference, the 15th Calculemus symposium, and the 7th international MKM conference on Intelligent Computer Mathematics
Year:
2008

Abstract

The Mathematics Subject Classification (MSC) is a common system used for categorizing mathematical papers and knowledge. We present results of machine learning of the MSC on full texts of papers in the mathematical digital libraries DML-CZ and NUMDAM. The F1-measure achieved on the classification task of top-level MSC categories exceeds 89%. We describe and evaluate our methods for measuring the similarity of papers in the digital library based on paper full texts.