Placing search in context: the concept revisited
ACM Transactions on Information Systems (TOIS)
Evaluating WordNet-based Measures of Lexical Semantic Relatedness
Computational Linguistics
Combining image captions and visual analysis for image concept classification
Proceedings of the 9th International Workshop on Multimedia Data Mining: held in conjunction with the ACM SIGKDD 2008
WikiRelate! computing semantic relatedness using wikipedia
AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Refining the most frequent sense baseline
DEW '09 Proceedings of the Workshop on Semantic Evaluations: Recent Achievements and Future Directions
Computing semantic relatedness using Wikipedia-based explicit semantic analysis
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
PIKM 2010: ACM workshop for ph.d. students in information and knowledge management
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Emerging multidisciplinary research across database management systems
ACM SIGMOD Record
Mining interests for user profiling in electronic conversations
Expert Systems with Applications: An International Journal
Hi-index | 0.00 |
The input for a Bag-of-Articles (BOA) classifier is a set of unlabeled entities - noun chunks and a set of target labeled entities - Wikipedia articles. The classifier locates Wikipedia articles that might define the unlabeled entity and performs disambiguation selecting one. Both unlabeled and labeled entity is represented with the proposed BOA term weight vector, which is created by aggregating term weight vectors of articles related to the Wikipedia article defining it. The label is assigned by choosing the closest labeled entity, also a BOA term weight vector, with cosine similarity. The paper formally defines the BOA entity representation and BOA-based entity classification and presents a partial software implementation. A BOA-based disambiguation algorithm is presented as a planned extension.