Web Communities Defined by Web Page Content
WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 03
Learning to recognize webpage genres
Information Processing and Management: an International Journal
Web Site Description Based on Genres and Web Design Patterns
SOCINFO '09 Proceedings of the 2009 International Workshop on Social Informatics
Enhance web pages genre identification using neighboring pages
WISE'11 Proceedings of the 12th international conference on Web information system engineering
Testing a genre-enabled application: a preliminary assessment
FDIA'08 Proceedings of the 2nd BCS IRSG conference on Future Directions in Information Access
Genre analysis of structured e-mails for corpus profiling
IRSG'08 Proceedings of the 2008 BCS-IRSG conference on Corpus Profiling
Hi-index | 0.00 |
In this paper, we describe a set of experiments to examine the effect of various attributes of web genre on the automatic identification of the genre of web pages. Four different genres are used in the data set, namely, FAQ, News, E-Shopping and Personal Home Pages. The effects of the number of features used to represent the web pages (5, 20, or 100) as well as the types of attributes, content, form, functionality, singly and in various combinations are examined. The results indicate that fewer features produce better precision but more features produce better recall, and that attributes in combinations will always perform better than single attributes.