XML documents are semistructured, and their structure is embedded in the tags. Although an XML document can be accompanied by a DTD that defines its structure, a DTD is not mandatory. Deriving a DTD for XML documents is difficult because DTDs use a different syntax from XML and because prior knowledge of the documents' structure is required. In this paper, we introduce DTD-Miner, an automatic structure-mining tool for XML documents. Through a Web-based interface, the user can submit a set of similarly structured XML documents, and the system automatically suggests a DTD. The user can then refine the generated DTD, reducing its complexity by relaxing some of the rules used by the system.
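
To make the idea of suggesting a DTD from sample documents concrete, here is a minimal sketch in Python. It is not DTD-Miner's algorithm, which the abstract does not describe. It is a deliberately naive stand-in that records which child elements and text each element was seen with, then emits the loosest content model that accepts all of them. The function name `infer_relaxed_dtd` and the sample documents are hypothetical, and attributes, namespaces, child order, and occurrence counts are all ignored.

```python
import xml.etree.ElementTree as ET


def infer_relaxed_dtd(xml_strings):
    """Suggest a very loose DTD for a set of similarly structured XML strings.

    Illustrative assumption only: every element with children gets a
    (a | b | ...)* content model, so order and cardinality are not captured.
    """
    children = {}  # element name -> set of child element names observed
    has_text = {}  # element name -> True if non-whitespace text was observed

    for xml in xml_strings:
        # iter() walks the tree in document order, starting at the root.
        for elem in ET.fromstring(xml).iter():
            kids = children.setdefault(elem.tag, set())
            has_text.setdefault(elem.tag, False)
            if elem.text and elem.text.strip():
                has_text[elem.tag] = True
            for child in elem:
                kids.add(child.tag)
                # Text after a child still belongs to the parent's content.
                if child.tail and child.tail.strip():
                    has_text[elem.tag] = True

    lines = []
    for name, kids in children.items():  # first-seen order
        names = " | ".join(sorted(kids))
        if kids and has_text[name]:
            model = f"(#PCDATA | {names})*"  # mixed content
        elif kids:
            model = f"({names})*"            # element-only content
        elif has_text[name]:
            model = "(#PCDATA)"              # text-only leaf
        else:
            model = "EMPTY"                  # never seen with content
        lines.append(f"<!ELEMENT {name} {model}>")
    return "\n".join(lines)


if __name__ == "__main__":
    docs = [
        "<book><title>A</title><author>B</author></book>",
        "<book><title>C</title><year>2004</year></book>",
    ]
    print(infer_relaxed_dtd(docs))
```

A content model of the form `(a | b)*` corresponds to the fully relaxed end of the trade-off the abstract mentions: it is simple and accepts every sample, but it says nothing about which children are required or in what order they appear. A stricter inference would have to track sequences and repetition per element.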