References to parts of structured documents rely on document structure to locate the fragment that is the reference target. XML has become an increasingly important language for structured documents, and one of its most important related languages is XPath, which selects fragments of XML documents. In this article we present a methodology, together with an application case, for automatically extracting and resolving references to fragments of structured documents. The approach combines structure manipulation with information extraction to improve the precision of the references that extraction tools produce. We take advantage of XML markup to determine the position within the structure at which each reference is found. The use of XPath for reference resolution is original: the resolution tool automatically builds XPath expressions. This proposal is inspired by, and has been implemented in, our work with legislative documents.
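To make the idea of building XPath expressions from extracted references concrete, here is a minimal Python sketch. It is not the tool described in the article. The element names (`law`, `article`, `paragraph`), the `num` attribute, the reference pattern, and the function names `build_xpath` and `resolve_references` are all assumptions chosen for illustration, and the standard library's `xml.etree.ElementTree` supports only a limited subset of XPath.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical legislative markup: element and attribute names are
# illustrative assumptions, not the schema used by the authors.
LAW = ET.fromstring("""
<law>
  <article num="1">
    <paragraph num="1">Scope of this law.</paragraph>
  </article>
  <article num="5">
    <paragraph num="1">General obligations.</paragraph>
    <paragraph num="2">Exceptions to the obligations.</paragraph>
  </article>
</law>
""")

# Assumed reference pattern: "article 5" or "paragraph 2 of article 5".
REFERENCE = re.compile(
    r"(?:paragraph\s+(?P<paragraph>\d+)\s+of\s+)?article\s+(?P<article>\d+)",
    re.IGNORECASE,
)


def build_xpath(match):
    """Turn one matched textual reference into an XPath expression."""
    path = f".//article[@num='{match.group('article')}']"
    if match.group("paragraph"):
        path += f"/paragraph[@num='{match.group('paragraph')}']"
    return path


def resolve_references(text, root):
    """Return (reference text, XPath, target element or None) per reference."""
    results = []
    for match in REFERENCE.finditer(text):
        xpath = build_xpath(match)
        results.append((match.group(0), xpath, root.find(xpath)))
    return results


if __name__ == "__main__":
    sentence = "As stated in paragraph 2 of article 5, see also Article 1."
    for ref, xpath, target in resolve_references(sentence, LAW):
        status = "unresolved" if target is None else f"<{target.tag}> found"
        print(f"{ref!r} -> {xpath} -> {status}")
```

Keeping extraction (the regular expression) separate from resolution (the generated XPath) means an unresolvable reference shows up as a `None` target instead of being silently dropped, which is one simple way to flag extracted references that do not correspond to any fragment.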