An approach for document fragment retrieval and its formatting issue in engineering information management

  • Authors:
  • Shaofeng Liu;Chris A. McMahon;Mansur J. Darlington;Steve J. Culley;Peter J. Wild

  • Affiliations:
  • Innovative Manufacturing Research Centre (IMRC), Department of Mechanical Engineering, University of Bath, Bath, UK;Innovative Manufacturing Research Centre (IMRC), Department of Mechanical Engineering, University of Bath, Bath, UK;Innovative Manufacturing Research Centre (IMRC), Department of Mechanical Engineering, University of Bath, Bath, UK;Innovative Manufacturing Research Centre (IMRC), Department of Mechanical Engineering, University of Bath, Bath, UK;Innovative Manufacturing Research Centre (IMRC), Department of Mechanical Engineering, University of Bath, Bath, UK

  • Venue:
  • ICCSA'06 Proceedings of the 2006 international conference on Computational Science and Its Applications - Volume Part II
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper discusses engineering document fragment mark-up supported by the use of the eXstensible Stylesheet Language – Formatting Objects (XLS-FO). XLS-FO can be used to convert the native format repre-sentation of such documents as Word, Excel and PDF into XML. Once in XML, documents fragments can be retrieved at will in response to a search query. In the paper the process of a document fragment retrieval – based on the authors’ decomposition scheme approach – has been modelled and the issue of converting documents into XML addressed. Additionally, the use of document templates is discussed as a means of ensuring that the transformed XML documents are compliant with the decomposition schemes. Automating the reformatting of documents into XML and the use of templates helps make implementation of a document-fragment approach to retrieval more resource efficient, so making its adoption in industry more practicable.