Fast Text Caption Localization on Video Using Visual Rhythm
VISUAL '02 Proceedings of the 5th International Conference on Recent Advances in Visual Information Systems
Robust region-of-interest determination based on user attention model through visual rhythm analysis
IEEE Transactions on Circuits and Systems for Video Technology
Hi-index | 0.02 |
Recent publications regarding video shot change detection algorithms show that abrupt shot changes are detected fairly well. However, gradual shot changes including wipes and dissolves are often missed or falsely detected. A robust shot change detection system, therefore, must include a shot verification step to further enhance the overall system performance. In this paper, we introduce the concept of the visual rhythm, an abstraction of the video. The visual rhythm is a single image, a sub-sampled version of a full video in which the sampling is performed in a pre-determined and in a systematic fashion. It is basically a representation of the video, which includes the overall content of the video. But most importantly, the visual rhythm contains patterns or visual features that allow the viewer/operator to distinguish and classify many different types of video effects (edits and otherwise): cuts, wipes, dissolves, fades, camera motions, object motions, flashlights, zooms, etc. The different video effects manifest themselves as different patterns on the visual rhythm. Using the visual rhythm, it becomes possible, without sequentially playing the entire video, to find false positive shots as well as undetected shots. Thus, inclusion of the visual rhythm in the shot verification process will aid the operator to verify detected shots as well as to find undetected shots fast and efficiently. For this purpose we have developed a new tool, a shot verifier including the visual rhythm. The usefulness of the visual rhythm during the shot verification process will be presented.