The KnowItAll system aims to automate the tedious process of extracting large collections of facts (e.g., names of scientists or politicians) from the Web in an unsupervised, domain-independent, and scalable manner. The paper presents an overview of KnowItAll's novel architecture and design principles, emphasizing its distinctive ability to extract information without any hand-labeled training examples. In its first major run, KnowItAll extracted over 50,000 class instances, but the run also raised a challenge: how can we improve KnowItAll's recall and extraction rate without sacrificing precision? This paper presents three distinct ways to address this challenge and evaluates their performance. Pattern Learning learns domain-specific extraction rules, which enable additional extractions. Subclass Extraction automatically identifies subclasses in order to boost recall (e.g., "chemist" and "biologist" are identified as subclasses of "scientist"). List Extraction locates lists of class instances, learns a "wrapper" for each list, and extracts the elements of each list. Since each method bootstraps from KnowItAll's domain-independent methods, none of them requires hand-labeled training examples either. The paper reports on experiments, focused on building lists of named entities, that measure the relative efficacy of each method and demonstrate their synergy. In concert, our methods gave KnowItAll a 4-fold to 8-fold increase in recall at a precision of 0.90, and discovered over 10,000 cities missing from the Tipster Gazetteer.
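To make the idea of extraction without hand-labeled examples more concrete, here is a minimal sketch of a generic, class-independent text pattern applied to raw sentences. The abstract does not spell out KnowItAll's patterns, so the "such as" template, the function name `extract_instances`, and the capitalization filter are all illustrative assumptions, not the system's actual rules.

```python
import re

# Hypothetical helper: matches a run of capitalized words at the start of a string.
_CAPITALIZED_PHRASE = re.compile(r"(?:[A-Z][\w-]*\s*)+")

def extract_instances(text, class_plural):
    """Collect candidate instances from '<class_plural> such as A, B, and C'.

    This is an assumed, simplified pattern for illustration only.
    """
    pattern = re.compile(
        re.escape(class_plural) + r"\s+such as\s+([^.;]+)", re.IGNORECASE
    )
    candidates = []
    for match in pattern.finditer(text):
        # Split the enumerated list on commas and on the words "and" / "or".
        for part in re.split(r",|\band\b|\bor\b", match.group(1)):
            name_match = _CAPITALIZED_PHRASE.match(part.strip())
            # Keep only the leading capitalized phrase as a crude proper-noun filter.
            if name_match:
                candidates.append(name_match.group().strip())
    return candidates

if __name__ == "__main__":
    sample = "We visited cities such as Paris, Rome, and Berlin."
    print(extract_instances(sample, "cities"))
```

Because the pattern only takes the class name as a parameter, the same function can be pointed at "scientists" or "politicians" without any labeled data. A real system would still need some way to assess the precision of each candidate, which this sketch omits.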