Numerical recipes in C (2nd ed.): the art of scientific computing
Numerical recipes in C (2nd ed.): the art of scientific computing
The nature of statistical learning theory
The nature of statistical learning theory
A trainable document summarizer
SIGIR '95 Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval
Machine Learning
Enhanced hypertext categorization using hyperlinks
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Bringing order to the Web: automatically categorizing search results
Proceedings of the SIGCHI conference on Human Factors in Computing Systems
OCELOT: a system for summarizing Web pages
SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Extracting sentence segments for text summarization: a machine learning approach
SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Function-based object model towards website adaptation
Proceedings of the 10th international conference on World Wide Web
Seeing the whole in parts: text summarization for web browsing on handheld devices
Proceedings of the 10th international conference on World Wide Web
Generic text summarization using relevance measure and latent semantic analysis
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Summarization as feature selection for text categorization
Proceedings of the tenth international conference on Information and knowledge management
Machine learning in automated text categorization
ACM Computing Surveys (CSUR)
Using web structure for classifying and describing web pages
Proceedings of the 11th international conference on World Wide Web
Information Retrieval
Machine Learning
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features
ECML '98 Proceedings of the 10th European Conference on Machine Learning
A Comparative Study on Feature Selection in Text Categorization
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Transductive Inference for Text Classification using Support Vector Machines
ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
Automatic Textual Document Categorization Based on Generalized Instance Sets and a Metamodel
IEEE Transactions on Pattern Analysis and Machine Intelligence
Building a web thesaurus from web link structure
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Eliminating noisy information in Web pages for data mining
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Automatic text categorization using the importance of sentences
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
A text categorization based on summarization technique
RANLPIR '00 Proceedings of the ACL-2000 workshop on Recent advances in natural language processing and information retrieval: held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics - Volume 11
Web-page summarization using clickthrough data
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Knowing a web page by the company it keeps
CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
A Voting Method for the Classification of Web Pages
WI-IATW '06 Proceedings of the 2006 IEEE/WIC/ACM international conference on Web Intelligence and Intelligent Agent Technology
A Novel Partitioning-Based Clustering Method and Generic Document Summarization
WI-IATW '06 Proceedings of the 2006 IEEE/WIC/ACM international conference on Web Intelligence and Intelligent Agent Technology
Interest-based personalized search
ACM Transactions on Information Systems (TOIS)
Cross-document event clustering using knowledge mining from co-reference chains
Information Processing and Management: an International Journal - Special issue: AIRS2005: Information retrieval research in Asia
Automatic classification of web pages into bookmark categories
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Noise reduction through summarization for Web-page classification
Information Processing and Management: an International Journal
Just-in-time contextual advertising
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
PeRSSonal's core functionality evaluation: Enhancing text labeling through personalized summaries
Data & Knowledge Engineering
Improving relevance judgment of web search results with image excerpts
Proceedings of the 17th international conference on World Wide Web
Generating succinct titles for web URLs
Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Learning from multi-topic web documents for contextual advertisement
Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
A bottom-up approach for XML documents classification
IDEAS '08 Proceedings of the 2008 international symposium on Database engineering & applications
Web content summarization using social bookmarks: a new approach for social summarization
Proceedings of the 10th ACM workshop on Web information and data management
Web page classification: Features and algorithms
ACM Computing Surveys (CSUR)
Entity-Based Classification of Web Page in Search Engine
ICADL 08 Proceedings of the 11th International Conference on Asian Digital Libraries: Universal and Ubiquitous Access to Information
Subjectively Related Association Term Discovery towards Personalized Web Information Retrieval
WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Enhancing diversity, coverage and balance for summarization through structure learning
Proceedings of the 18th international conference on World wide web
Tree-Based Method for Classifying Websites Using Extended Hidden Markov Models
PAKDD '09 Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
Document summarization using conditional random fields
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Serving Comparative Shopping Links Non-invasively
WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Towards a graph-based user profile modeling for a session-based personalized search
Knowledge and Information Systems
Rules revisited: web page classification
CI '07 Proceedings of the Third IASTED International Conference on Computational Intelligence
Exploiting neighborhood knowledge for single document summarization and keyphrase extraction
ACM Transactions on Information Systems (TOIS)
Detecting visually similar Web pages: Application to phishing detection
ACM Transactions on Internet Technology (TOIT)
Query-topic focused web pages summarization
PRICAI'06 Proceedings of the 9th Pacific Rim international conference on Artificial intelligence
A schema for ontology-based concept definition and identification
International Journal of Computer Applications in Technology
A Comprehensive Study of Features and Algorithms for URL-Based Topic Classification
ACM Transactions on the Web (TWEB)
Normal distribution re-weighting for personalized web search
Canadian AI'11 Proceedings of the 24th Canadian conference on Advances in artificial intelligence
Web Page Summarization for Just-in-Time Contextual Advertising
ACM Transactions on Intelligent Systems and Technology (TIST)
A Luhn-Inspired Vector Re-weighting Approach for Improving Personalized Web Search
WI-IAT '11 Proceedings of the 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Volume 03
Cross document event clustering using knowledge mining from co-reference chains
AIRS'05 Proceedings of the Second Asia conference on Asia Information Retrieval Technology
Image description mining and hierarchical clustering on data records using HR-Tree
APWeb'06 Proceedings of the 8th Asia-Pacific Web conference on Frontiers of WWW Research and Development
Text summarisation in progress: a literature review
Artificial Intelligence Review
Mobile web profiling: a study of off-portal surfing habits of mobile users
UMAP'10 Proceedings of the 18th international conference on User Modeling, Adaptation, and Personalization
Cleaning web pages for effective web content mining
DEXA'06 Proceedings of the 17th international conference on Database and Expert Systems Applications
Internet public opinion hotspot detection research based on k-means algorithm
ICSI'10 Proceedings of the First international conference on Advances in Swarm Intelligence - Volume Part II
Information Processing and Management: an International Journal
PostRank: a new algorithm for incremental finding of persian blog representative words
Proceedings of the 2nd International Conference on Web Intelligence, Mining and Semantics
Proceedings of the CUBE International Information Technology Conference
Annotation and Auto-Scrolling for Web Page Overview in Mobile Web Browsing
International Journal of Handheld Computing Research
Rhetorics-based multi-document summarization
Expert Systems with Applications: An International Journal
Browse with a social web directory
Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
What's the deal?: identifying online bargains
AWC '13 Proceedings of the First Australasian Web Conference - Volume 144
Web Intelligence and Agent Systems
Hi-index | 0.00 |
Web-page classification is much more difficult than pure-text classification due to a large variety of noisy information embedded in Web pages. In this paper, we propose a new Web-page classification algorithm based on Web summarization for improving the accuracy. We first give empirical evidence that ideal Web-page summaries generated by human editors can indeed improve the performance of Web-page classification algorithms. We then propose a new Web summarization-based classification algorithm and evaluate it along with several other state-of-the-art text summarization algorithms on the LookSmart Web directory. Experimental results show that our proposed summarization-based classification algorithm achieves an approximately 8.8% improvement as compared to pure-text-based classification algorithm. We further introduce an ensemble classifier using the improved summarization algorithm and show that it achieves about 12.9% improvement over pure-text based methods.