Video search reranking via information bottleneck principle
MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Exploring social dynamics in online media sharing
Proceedings of the 16th international conference on World Wide Web
Practical elimination of near-duplicates from web video search
Proceedings of the 15th international conference on Multimedia
I tube, you tube, everybody tubes: analyzing the world's largest user generated content video system
Proceedings of the 7th ACM SIGCOMM conference on Internet measurement
UQLIPS: a real-time near-duplicate video clip detection system
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Content based video matching using spatiotemporal volumes
Computer Vision and Image Understanding
Identifying user behavior in online social networks
Proceedings of the 1st Workshop on Social Network Systems
Challenges and techniques for effective and efficient similarity search in large video databases
Proceedings of the VLDB Endowment
Understanding video interactions in youtube
MM '08 Proceedings of the 16th ACM international conference on Multimedia
Broadcast yourself on YouTube: really?
HCC '08 Proceedings of the 3rd ACM international workshop on Human-centered computing
Near-duplicate detection for images and videos
LS-MMRM '09 Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining
Interactive near-duplicate video retrieval and detection
MM '09 Proceedings of the 17th ACM international conference on Multimedia
An efficient near-duplicate video shot detection method using shot-based interest points
IEEE Transactions on Multimedia
Near-Duplicate Video Detection Using Temporal Patterns of Semantic Concepts
ISM '09 Proceedings of the 2009 11th IEEE International Symposium on Multimedia
Stratification-based keyframe cliques for removal of near-duplicates in video search results
Proceedings of the international conference on Multimedia information retrieval
BASIL: effective near-duplicate image detection using gene sequence alignment
ECIR'2010 Proceedings of the 32nd European conference on Advances in Information Retrieval
The tube over time: characterizing popularity growth of youtube videos
Proceedings of the fourth ACM international conference on Web search and data mining
Barriers and bridges in the adoption of today's mobile phone contextual services
Proceedings of the 13th International Conference on Human Computer Interaction with Mobile Devices and Services
Hi-index | 0.00 |
Popular content in video sharing websites (e.g., YouTube) is usually replicated via identical copies or near-duplicates. These duplicates are usually studied because they pose a threat to site owners in terms of wasted disk space, or privacy infringements. Furthermore, this content might potentially hinder the users' experience in these websites. The research presented in this article focuses around the central argument that there is no agreement on the technical definition of what these near-duplicates are, and, more importantly, there is no strong evidence that users of video sharing websites would like this content to be removed. Most scholars define near-duplicate video clips (NDVC) by means of non-semantic features (e.g., different image/audio quality), while a few also include semantic features (i.e., different videos of similar content). However, it is unclear what features contribute to the human perception of near-duplicate videos. The findings of four large scale online surveys that were carried out in the context of our research confirm the relevance of both types of features. Some of our findings confirm the adopted definitions of NDVC whereas other findings are surprising: Near-duplicate videos with different image quality, audio quality, or with/without overlays were perceived as NDVC. However, the same could not be verified when videos differed by more than one of these features at the same time. With respect to semantics, it is yet unclear the exact role that it plays in relation to the features that make videos alike. From a user's perspective, participants preferred in most cases to see only one of the NDVC in the search results of a video search query and they were more tolerant to changes in the audio than in the video tracks. Based on all these findings, we propose a new user-centric NDVC definition and present implications for how duplicate content should be dealt with by video sharing Web sites.