Traditional relation extraction methods require pre-specified relations and relation-specific human-tagged examples. Bootstrapping systems significantly reduce the number of training examples required, but they usually rely on heuristics to combine a set of strict hard rules, which limits their ability to generalize and thus yields low recall. Furthermore, existing bootstrapping methods do not perform open information extraction (Open IE), which identifies various types of relations without requiring them to be specified in advance. In this paper, we propose a statistical extraction framework called Statistical Snowball (StatSnowball), a bootstrapping system that can perform both traditional relation extraction and Open IE. StatSnowball uses discriminative Markov logic networks (MLNs) and softens hard rules by learning their weights through maximum likelihood estimation. The MLN is a general model and can be configured to perform different levels of relation extraction. In StatSnowball, pattern selection is performed by solving an l1-norm penalized maximum likelihood estimation problem, which is supported by well-founded theory and efficient solvers. We extensively evaluate StatSnowball in different configurations on both a small but fully labeled data set and large-scale Web data. Empirical results show that, starting from a small number of seeds, StatSnowball achieves significantly higher recall without sacrificing high precision over the bootstrapping iterations, and that joint inference in the MLN can improve performance. Finally, StatSnowball is efficient, and we have developed a working entity relation search engine called Renlifang based on it.
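To make the idea of pattern selection by l1-norm penalized maximum likelihood more concrete, the following is a minimal sketch and not the StatSnowball implementation. It swaps the MLN for a plain logistic regression over binary pattern indicators and fits it by proximal gradient descent (ISTA); the patterns, the toy data, the function names, and the penalty and step-size values are all assumptions made for illustration. The point it shows is that the l1 penalty drives the weight of an uninformative pattern to exactly zero, so selection falls out of the estimation itself.

```python
import math

# Hypothetical candidate patterns (illustrative only, not from the paper).
PATTERNS = ["X is married to Y", "X and Y"]

# Toy data: each row marks which patterns matched a candidate entity pair.
# The first pattern fires exactly on the true relation instances; the second
# fires equally often on true and false instances, so it carries no signal.
ROWS = [(1, 1), (1, 0), (1, 1), (1, 0), (0, 1), (0, 0), (0, 1), (0, 0)]
LABELS = [1, 1, 1, 1, 0, 0, 0, 0]


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def soft_threshold(value, threshold):
    """Proximal operator of the l1 norm: shrink value toward zero."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0


def fit_l1_logistic(rows, labels, l1_penalty=0.1, step=0.5, iterations=500):
    """Minimize mean logistic loss + l1_penalty * ||w||_1 with ISTA."""
    n = len(rows)
    weights = [0.0] * len(rows[0])
    for _ in range(iterations):
        # Gradient of the mean negative log-likelihood.
        grad = [0.0] * len(weights)
        for x, y in zip(rows, labels):
            p = sigmoid(sum(w * xi for w, xi in zip(weights, x)))
            for j, xi in enumerate(x):
                grad[j] += (p - y) * xi / n
        # Gradient step followed by the l1 proximal (shrinkage) step.
        weights = [
            soft_threshold(w - step * g, step * l1_penalty)
            for w, g in zip(weights, grad)
        ]
    return weights


weights = fit_l1_logistic(ROWS, LABELS)
# Patterns whose weight was not shrunk to exactly zero are kept.
selected = [p for p, w in zip(PATTERNS, weights) if w != 0.0]
print(selected)
```

On this toy data only the first pattern keeps a nonzero weight. Raising `l1_penalty` prunes more aggressively, while lowering it lets weaker patterns survive, which is the trade-off between precision and recall that a selection step of this kind has to manage.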