On the use of spreading activation methods in automatic information
SIGIR '88 Proceedings of the 11th annual international ACM SIGIR conference on Research and development in information retrieval
An example-based mapping method for text categorization and retrieval
ACM Transactions on Information Systems (TOIS)
Automatic text structuring and summarization
Information Processing and Management: an International Journal - Special issue: methods and tools for the automatic construction of hypertext
Threading electronic mail: a preliminary study
Information Processing and Management: an International Journal - Special issue: methods and tools for the automatic construction of hypertext
Improved Boosting Algorithms Using Confidence-rated Predictions
Machine Learning - The Eleventh Annual Conference on computational Learning Theory
Data integration using similarity joins and a word-based information representation language
ACM Transactions on Information Systems (TOIS)
SimRank: a measure of structural-context similarity
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Language Modeling for Information Retrieval
Language Modeling for Information Retrieval
TextTiling: segmenting text into multi-paragraph subtopic passages
Computational Linguistics
An adaptive information retrieval system based on associative networks
APCCM '04 Proceedings of the first Asian-Pacific conference on Conceptual modelling - Volume 31
Learning random walk models for inducing word dependency distributions
ICML '04 Proceedings of the twenty-first international conference on Machine learning
Object-level ranking: bringing order to Web objects
WWW '05 Proceedings of the 14th international conference on World Wide Web
Ranking algorithms for named-entity extraction: boosting and the voted perceptron
ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
SimFusion: measuring similarity using unified relationship matrix
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
PageRank without hyperlinks: structural re-ranking using links induced by language models
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
On the collective classification of email "speech acts"
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Query expansion using random walk models
Proceedings of the 14th ACM international conference on Information and knowledge management
A Network Analysis Model for Disambiguation of Names in Lists
Computational & Mathematical Organization Theory
Multi-way distributional clustering via pairwise interactions
ICML '05 Proceedings of the 22nd international conference on Machine learning
eMailSift: Email Classification Based on Structure and Content
ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
Discriminative Reranking for Natural Language Parsing
Computational Linguistics
Extracting personal names from email: applying named entity recognition to informal text
HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Objectrank: authority-based keyword search in databases
VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Journal of Artificial Intelligence Research
Learning web page scores by error back-propagation
IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Web projections: learning from contextual subgraphs of the web
Proceedings of the 16th international conference on World Wide Web
Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Adaptive graphical approach to entity resolution
Proceedings of the 7th ACM/IEEE-CS joint conference on Digital libraries
Extending query translation to cross-language query expansion with markov chain models
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
A constraint-based probabilistic framework for name disambiguation
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Learning to rank typed graph walks: local and global approaches
Proceedings of the 9th WebKDD and 1st SNA-KDD 2007 workshop on Web mining and social network analysis
Adaptive community-based multimedia data retrieval in a distributed environment
Proceedings of the 2nd international conference on Ubiquitous information management and communication
Effective latent space graph-based re-ranking model with global consistency
Proceedings of the Second ACM International Conference on Web Search and Data Mining
idMesh: graph-based disambiguation of linked data
Proceedings of the 18th international conference on World wide web
Using Contextual Information to Improve Search in Email Archives
ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
A generalized Co-HITS algorithm and its application to bipartite graphs
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Social search and discovery using a unified approach
Proceedings of the 20th ACM conference on Hypertext and hypermedia
A graph-search framework for geneId ranking
BioNLP '06 Proceedings of the Workshop on Linking Natural Language Processing and Biology: Towards Deeper Biological Literature Analysis
Intelligent hybrid approach to false identity detection
Proceedings of the 12th International Conference on Artificial Intelligence and Law
Learning graph walk based similarity measures for parsed text
EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
WIT: web people search disambiguation using random walks
SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations
Named entity disambiguation by leveraging wikipedia semantic knowledge
Proceedings of the 18th ACM conference on Information and knowledge management
Web personal name disambiguation based on reference entity tables mined from the web
Proceedings of the eleventh international workshop on Web information and data management
A graph-search framework for GeneId ranking
LNLBioNLP '06 Proceedings of the HLT-NAACL BioNLP Workshop on Linking Natural Language and Biology
Semi-supervised OWA aggregation for link-based similarity evaluation and alias detection
FUZZ-IEEE'09 Proceedings of the 18th international conference on Fuzzy Systems
Making sense of archived e-mail: Exploring the Enron collection with NetLens
Journal of the American Society for Information Science and Technology
Self-tuning in graph-based reference disambiguation
DASFAA'07 Proceedings of the 12th international conference on Database systems for advanced applications
Automatically incorporating new sources in keyword search-based data integration
Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
Fast query execution for retrieval models based on path-constrained random walks
Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Learning to link entities with knowledge base
HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Structural semantic relatedness: a knowledge-based method to named entity disambiguation
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
ACM Transactions on Information Systems (TOIS)
Disclosing false identity through hybrid link analysis
Artificial Intelligence and Law
On Graph-Based Name Disambiguation
Journal of Data and Information Quality (JDIQ)
Using Markov chains to exploit word relationships in information retrieval
Large Scale Semantic Access to Content (Text, Image, Video, and Sound)
Potential role based entity matching for dataspaces search
WISE'10 Proceedings of the 11th international conference on Web information systems engineering
An effective web document clustering algorithm based on bisection and merge
Artificial Intelligence Review
Index design and query processing for graph conductance search
The VLDB Journal — The International Journal on Very Large Data Bases
LINDEN: linking named entities with knowledge base via semantic knowledge
Proceedings of the 21st international conference on World Wide Web
Named entity disambiguation in streaming data
ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
Threading machine generated email
Proceedings of the sixth ACM international conference on Web search and data mining
Domain-Independent Entity Coreference for Linking Ontology Instances
Journal of Data and Information Quality (JDIQ) - Special Issue on Entity Resolution
Query expansion using path-constrained random walks
Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
ASCOS: an asymmetric network structure COntext similarity measure
Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
Predicting relevant documents for enterprise communication contexts
Proceedings of the 8th International Conference on Ubiquitous Information Management and Communication
Name disambiguation in scientific cooperation network by exploiting user feedback
Artificial Intelligence Review
Hi-index | 0.00 |
Similarity measures for text have historically been an important tool for solving information retrieval problems. In many interesting settings, however, documents are often closely connected to other documents, as well as other non-textual objects: for instance, email messages are connected to other messages via header information. In this paper we consider extended similarity metrics for documents and other objects embedded in graphs, facilitated via a lazy graph walk. We provide a detailed instantiation of this framework for email data, where content, social networks and a timeline are integrated in a structural graph. The suggested framework is evaluated for two email-related problems: disambiguating names in email documents, and threading. We show that reranking schemes based on the graph-walk similarity measures often outperform baseline methods, and that further improvements can be obtained by use of appropriate learning methods.