SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
MySQL Database Design and Tuning (Developer's Library)
MySQL Database Design and Tuning (Developer's Library)
Blog search and mining in the business domain
Proceedings of the 2007 international workshop on Domain driven data mining
Optimize databases for health monitoring systems
Proceedings of the 1st international conference on PErvasive Technologies Related to Assistive Environments
High performance mysql, 2nd edition
High performance mysql, 2nd edition
Combining named entities and tags for novel sentence detection
Proceedings of the WSDM '09 Workshop on Exploiting Semantic Annotations in Information Retrieval
Sentence-Level Novelty Detection in English and Malay
PAKDD '09 Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
Design and development of a mobile peer-to-peer social networking application
Expert Systems with Applications: An International Journal
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3 - Volume 3
Detecting novel business blogs
ICICS'09 Proceedings of the 7th international conference on Information, communications and signal processing
Evaluation of novelty metrics for sentence-level novelty mining
Information Sciences: an International Journal
Detecting novel business blogs
ICICS'09 Proceedings of the 7th international conference on Information, communications and signal processing
An intelligent system for sentence retrieval and novelty mining
International Journal of Knowledge Engineering and Data Mining
Design of an intelligent novelty detection application
International Journal of Innovative Computing and Applications
Chinese categorization and novelty mining
PAKDD'11 Proceedings of the 15th Pacific-Asia conference on Advances in knowledge discovery and data mining - Volume Part II
International Journal of Advanced Pervasive and Ubiquitous Computing
Probabilistic Models for Social Media Mining
International Journal of Information Technology and Web Engineering
Adaptable Services for Novelty Mining
International Journal of Systems and Service-Oriented Engineering
Hi-index | 0.00 |
Research in the area of optimizing databases in any Database Management System (DBMS) has been evolving constantly. Today, programming languages are being integrated into database systems to help professional programmers develop software quickly to meet deadlines. Therefore, the design of a database must cater to both the needs of customers and the efficiency of database processes. In this paper, a database application, novelty detection, is used to detect new documents for readers who do not want redundant documents to be read again. This application needs a database to store history and current documents. The objective of this research is to optimize the database tables for up to 10 million records. The experiments are done on both sentence level and document level. In both levels, the investigation of data optimization and the use of proper indexing are conducted. In MYSQL, the MYSQL B-Tree index is used to speed up data selection. In addition, the use of EXPLAIN enables us to properly index the correct data column and to avoid redundant indexing. Optimizing data types are also investigated to ensure no extra work is done by MYSQL in selecting data. A technique known as batching is also introduced to speed up results insertion after novelty detection has been done. Overall, the combined optimization improved the speed by up to 90%. Therefore, we have successfully optimized the database for novelty detection, and the techniques have been integrated into a real-time novelty detection application.