Web-scale information extraction in KnowItAll: (preliminary results)

Authors:
Oren Etzioni;Michael Cafarella;Doug Downey;Stanley Kok;Ana-Maria Popescu;Tal Shaked;Stephen Soderland;Daniel S. Weld;Alexander Yates
Affiliations:
University of Washington, Seattle, WA;University of Washington, Seattle, WA;University of Washington, Seattle, WA;University of Washington, Seattle, WA;University of Washington, Seattle, WA;University of Washington, Seattle, WA;University of Washington, Seattle, WA;University of Washington, Seattle, WA;University of Washington, Seattle, WA
Venue:
Proceedings of the 13th international conference on World Wide Web
Year:
2004

Abstract

Manually querying search engines in order to accumulate a large body of factual information is a tedious, error-prone process of piecemeal search. Search engines retrieve and rank potentially relevant documents for human perusal, but do not extract facts, assess confidence, or fuse information from multiple documents. This paper introduces KnowItAll, a system that aims to automate the tedious process of extracting large collections of facts from the web in an autonomous, domain-independent, and scalable manner. The paper describes preliminary experiments in which an instance of KnowItAll, running for four days on a single machine, was able to automatically extract 54,753 facts. KnowItAll associates a probability with each fact enabling it to trade off precision and recall. The paper analyzes KnowItAll's architecture and reports on lessons learned for the design of large-scale information extraction systems.
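
The precision/recall trade-off mentioned in the abstract can be illustrated with a minimal sketch. It assumes that each extracted fact carries a probability and that a cutoff on that probability decides which facts are kept; the abstract does not say how KnowItAll computes or applies these probabilities. The fact strings, probabilities, correctness labels, and the `precision_recall` helper are all hypothetical and invented for illustration.

```python
from typing import List, Tuple

# Hypothetical extractions: (fact, assigned probability, is_correct).
# These values are invented for illustration; they are not KnowItAll output.
FACTS: List[Tuple[str, float, bool]] = [
    ("City(Seattle)", 0.95, True),
    ("City(Paris)", 0.90, True),
    ("City(Tokyo)", 0.70, True),
    ("City(Monday)", 0.60, False),
    ("City(Cairo)", 0.40, True),
    ("City(Home)", 0.20, False),
]


def precision_recall(
    facts: List[Tuple[str, float, bool]], threshold: float
) -> Tuple[float, float]:
    """Keep facts with probability >= threshold, then score the kept set."""
    kept = [f for f in facts if f[1] >= threshold]
    true_kept = sum(1 for f in kept if f[2])
    total_true = sum(1 for f in facts if f[2])
    # Convention (an assumption): an empty kept set has precision 1.0.
    precision = true_kept / len(kept) if kept else 1.0
    recall = true_kept / total_true if total_true else 0.0
    return precision, recall


if __name__ == "__main__":
    # A high threshold favors precision; a low threshold favors recall.
    for t in (0.9, 0.5, 0.1):
        p, r = precision_recall(FACTS, t)
        print(f"threshold={t:.1f} precision={p:.2f} recall={r:.2f}")
```

On this toy data, raising the threshold keeps fewer but more reliable facts, and lowering it recovers more correct facts while admitting more errors.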