Data integration: a theoretical perspective
Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Querying Heterogeneous Information Sources Using Source Descriptions
VLDB '96 Proceedings of the 22th International Conference on Very Large Data Bases
Semantic characterizations of navigational XPath
ACM SIGMOD Record
Data integration: the teenage years
VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Articulating information needs in XML query languages
ACM Transactions on Information Systems (TOIS)
Learning deterministic regular expressions for the inference of schemas from XML data
Proceedings of the 17th international conference on World Wide Web
Non-interactive OCR post-correction for giga-scale digitization projects
CICLing'08 Proceedings of the 9th international conference on Computational linguistics and intelligent text processing
Digital weight watching: reconstruction of scanned documents
Proceedings of The Third Workshop on Analytics for Noisy Unstructured Text Data
Hi-index | 0.00 |
Meeting notes are documents which contain lots of structure. This structure is often implicit in layout and reserved words. On the other hand, since meetings tend to occur regularly and are repeated for long periods of time, this structure is often (semi-)formalized. This makes these documents suitable for automatic semantic annotation efforts. We describe the annotation we performed on the notes of more than 20 years of Dutch parliamentary debates. We annotated every word spoken in parliament with 1) the speaker, 2) her party at the time of speaking, 3) her role/function in parliament and 4) the iso-date. These annotations yield numerous new ways of searching, browsing, mining and summarizing these documents. Meetings are always too long, whence so are their verbatim notes. But of course they contain valuable information and notes have to be consulted from time to time. In this paper we show that semantic annotation can make finding things easier, and more fun.