Word association norms, mutual information, and lexicography
Computational Linguistics
Formal ontology, common sense and cognitive science
International Journal of Human-Computer Studies - Special issue: the role of formal ontology in the information technology
CYC: a large-scale investment in knowledge infrastructure
Communications of the ACM
Communications of the ACM
Corpus-based Learning of Analogies and Semantic Relations
Machine Learning
Extracting and evaluating general world knowledge from the Brown corpus
HLT-NAACL-TEXTMEANING '03 Proceedings of the HLT-NAACL 2003 workshop on Text meaning - Volume 9
Verbosity: a game for collecting common-sense facts
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Computer
An introduction to ROC analysis
Pattern Recognition Letters - Special issue: ROC analysis in pattern recognition
Similarity of Semantic Relations
Computational Linguistics
Expressing implicit semantic relations without supervision
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Games with a purpose for social networking platforms
Proceedings of the 20th ACM conference on Hypertext and hypermedia
Digital Intuition: Applying Common Sense Using Dimensionality Reduction
IEEE Intelligent Systems
Studying databases of intentions: do search query logs capture knowledge about common human goals?
Proceedings of the fifth international conference on Knowledge capture
A uniform approach to analogies, synonyms, antonyms, and associations
COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Searching for common sense: populating Cyc™ from the web
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 3
Open information extraction from the web
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
BagPack: a general framework to represent semantic relations
GEMS '09 Proceedings of the Workshop on Geometrical Models of Natural Language Semantics
Evaluation of commonsense knowledge with Mechanical Turk
CSLDAMT '10 Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk
Computational Statistics & Data Analysis
Distributional memory: A general framework for corpus-based semantics
Computational Linguistics
WebChild: harvesting and organizing commonsense knowledge from the web
Proceedings of the 7th ACM international conference on Web search and data mining
Hi-index | 0.00 |
Text mining has been very successful in extracting huge amounts of commonsense knowledge from data, but the extracted knowledge tends to be extremely noisy. Manual construction of knowledge repositories, on the other hand, tends to produce high-quality data in very small amounts. We propose an architecture to combine the best of both worlds: A game with a purpose that induces humans to clean up data automatically extracted by text mining. First, a text miner trained on a set of known commonsense facts harvests many more candidate facts from corpora. Then, a simple slot-machine-with-a-purpose game presents these candidate facts to the players for verification by playing. As a result, a new dataset of high precision commonsense knowledge is created. This combined architecture is able to produce significantly better commonsense facts than the state-of-the-art text miner alone. Furthermore, we report that bootstrapping (i.e., training the text miner on the output of the game) improves the subsequent performance of the text miner.