Incorporating quality metrics in centralized/distributed information retrieval on the World Wide Web
SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Document quality models for web ad hoc retrieval
Proceedings of the 14th ACM international conference on Information and knowledge management
Beyond accuracy: what data quality means to data consumers
Journal of Management Information Systems
He says, she says: conflict and coordination in Wikipedia
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Cooperation and quality in wikipedia
Proceedings of the 2007 international symposium on Wikis
Measuring article quality in wikipedia: models and evaluation
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Finding high-quality content in social media
WSDM '08 Proceedings of the 2008 International Conference on Web Search and Data Mining
Information quality work organization in wikipedia
Journal of the American Society for Information Science and Technology
Size matters: word count as a measure of quality on wikipedia
Proceedings of the 17th international conference on World Wide Web
Building semantic kernels for text classification using wikipedia
Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Learning to link with wikipedia
Proceedings of the 17th ACM conference on Information and knowledge management
Overview and Framework for Data and Information Quality Research
Journal of Data and Information Quality (JDIQ)
User generated content: how good is it?
Proceedings of the 3rd workshop on Information credibility on the web
Proceedings of the 9th ACM/IEEE-CS joint conference on Digital libraries
Enhancing cluster labeling using wikipedia
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Wikibugs: using template messages in open content collections
Proceedings of the 5th International Symposium on Wikis and Open Collaboration
Evidence of quality of textual features on the web 2.0
Proceedings of the 18th ACM conference on Information and knowledge management
HICSS '10 Proceedings of the 2010 43rd Hawaii International Conference on System Sciences
Exploiting social context for review quality prediction
Proceedings of the 19th international conference on World wide web
Identifying featured articles in wikipedia: writing style matters
Proceedings of the 19th international conference on World wide web
A Wikipedia-based multilingual retrieval model
ECIR'08 Proceedings of the IR research, 30th European conference on Advances in information retrieval
A topic-specific web search system focusing on quality pages
ECDL'10 Proceedings of the 14th European conference on Research and advanced technology for digital libraries
Quality-biased ranking of web documents
Proceedings of the fourth ACM international conference on Web search and data mining
Towards automatic quality assurance in Wikipedia
Proceedings of the 20th international conference companion on World wide web
Insights into explicit semantic analysis
Proceedings of the 20th ACM international conference on Information and knowledge management
Detection of text quality flaws as a one-class classification problem
Proceedings of the 20th ACM international conference on Information and knowledge management
Predicting quality flaws in user-generated content: the case of wikipedia
SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Automatic Mapping of Wikipedia Templates for Fast Deployment of Localised DBpedia Datasets
Proceedings of the 13th International Conference on Knowledge Management and Knowledge Technologies
Proceedings of the 19th Brazilian symposium on Multimedia and the web
Hi-index | 0.00 |
The online encyclopedia Wikipedia is a successful example of the increasing popularity of user generated content on the Web. Despite its success, Wikipedia is often criticized for containing low-quality information, which is mainly attributed to its core policy of being open for editing by everyone. The identification of low-quality information is an important task since Wikipedia has become the primary source of knowledge for a huge number of people around the world. Previous research on quality assessment in Wikipedia either investigates only small samples of articles, or else focuses on single quality aspects, like accuracy or formality. This paper targets the investigation of quality flaws, and presents the first complete breakdown of Wikipedia's quality flaw structure. We conduct an extensive exploratory analysis, which reveals (1) the quality flaws that actually exist, (2) the distribution of flaws in Wikipedia, and (3) the extent of flawed content. An important finding is that more than one in four English Wikipedia articles contains at least one quality flaw, 70% of which concern article verifiability.