Cross-validation of neural network applications for automatic new topic identification
Journal of the American Society for Information Science and Technology
Users often pursue multiple topics during a single search session, and identifying the boundaries of search sessions is an important task. This study proposes using neural networks to define topic boundaries in search engine transaction logs and is part of ongoing research on automatic new topic identification. The objective is to determine the best set of parameters for neural networks designed to perform automatic new topic identification. Sample data logs from the FAST (currently owned by Yahoo) and Excite (currently owned by IAC Search & Media) search engines were analyzed. The findings show that neural networks are fairly successful in identifying topic continuations and shifts in search engine transaction logs. The choice of neural network structure depends on which performance measure matters most to the user: for a given performance measure, there is a set of neural network parameters that improves the performance of new topic identification. In addition, the threshold applied to the output level of the neural network is the most influential parameter for the performance of new topic identification.
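
The role of the output threshold can be illustrated with a minimal sketch. It is not the study's implementation: the output levels and ground-truth labels below are made up, the helper names `label_shifts` and `precision_recall` are hypothetical, and precision and recall of topic shifts are assumed here as example performance measures. The sketch only shows how moving the threshold changes which queries are marked as topic shifts, and therefore which measure is favored.

```python
from typing import List, Tuple


def label_shifts(scores: List[float], threshold: float) -> List[bool]:
    """Mark a query as a topic shift when the network output reaches the threshold."""
    return [s >= threshold for s in scores]


def precision_recall(predicted: List[bool], actual: List[bool]) -> Tuple[float, float]:
    """Precision and recall of the 'topic shift' class (assumed measures)."""
    tp = sum(p and a for p, a in zip(predicted, actual))
    fp = sum(p and not a for p, a in zip(predicted, actual))
    fn = sum((not p) and a for p, a in zip(predicted, actual))
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Made-up network output levels for six consecutive queries (illustration only).
scores = [0.10, 0.80, 0.40, 0.60, 0.20, 0.90]
# Made-up ground truth: True means the query starts a new topic.
actual = [False, True, False, False, False, True]

for threshold in (0.3, 0.7, 0.85):
    p, r = precision_recall(label_shifts(scores, threshold), actual)
    print(f"threshold={threshold:.2f} precision={p:.2f} recall={r:.2f}")
```

On this toy data a low threshold marks more queries as shifts (higher recall, lower precision), while a high threshold does the opposite, which is one way to read the statement that the preferred configuration depends on the performance measure of interest.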