TY - JOUR
T1 - Text-video completion using structure repair and texture propagation
AU - Tsai, Tsung Han
AU - Fang, Chih Lun
N1 - Funding Information:
Manuscript received March 01, 2010; revised October 29, 2010; accepted October 31, 2010. Date of publication November 11, 2010; date of current version January 19, 2011. This work was supported by the CIC and the National Science Council, Taiwan, R.O.C., under Grant NSC99-2220-E-008-003. The associate editor coordinating the review of this manuscript and approving it for publication was Prof. Nadia Magnenat-Thalmann.
PY - 2011/2
Y1 - 2011/2
N2 - Today, more superimposed text is embedded within videos. Usually some text is unnecessary. Thus, one requires an approach to remove the text and complete the video. However, few conventional approaches complete the video well due to the large-sized text, structure regions, and various types of videos. In response, this study designed a text-video completion algorithm that poses text-video completion as structure repair and texture propagation. To repair the structure regions, the structure interpolation uses the new model's rotated block matching to estimate the initial location of completed regions and later refine the coordinates of completed regions. The information in the neighboring frames then fills the structure regions. To complete the structure regions without tedious manual interaction, the structure extension utilizes the spline curve estimation. Afterwards, derivative propagation realizes the texture region completion. The experiment results are based on several real TV programs, where all of the text regions were completed with spatio-temporal consistency. Additionally, comparisons present that the performance of the proposed algorithm is superior to those of conventional approaches. Its advantages include the reduction of design complexity by only integrating the structure information in multi-frame and the demonstration of structure consistency for realistic videos.
AB - Today, more superimposed text is embedded within videos. Usually some text is unnecessary. Thus, one requires an approach to remove the text and complete the video. However, few conventional approaches complete the video well due to the large-sized text, structure regions, and various types of videos. In response, this study designed a text-video completion algorithm that poses text-video completion as structure repair and texture propagation. To repair the structure regions, the structure interpolation uses the new model's rotated block matching to estimate the initial location of completed regions and later refine the coordinates of completed regions. The information in the neighboring frames then fills the structure regions. To complete the structure regions without tedious manual interaction, the structure extension utilizes the spline curve estimation. Afterwards, derivative propagation realizes the texture region completion. The experiment results are based on several real TV programs, where all of the text regions were completed with spatio-temporal consistency. Additionally, comparisons present that the performance of the proposed algorithm is superior to those of conventional approaches. Its advantages include the reduction of design complexity by only integrating the structure information in multi-frame and the demonstration of structure consistency for realistic videos.
KW - Spatio-temporal consistency
KW - structure-texture completion
KW - text removal
KW - video completion
UR - http://www.scopus.com/inward/record.url?scp=78951476743&partnerID=8YFLogxK
U2 - 10.1109/TMM.2010.2091497
DO - 10.1109/TMM.2010.2091497
M3 - 期刊論文
AN - SCOPUS:78951476743
SN - 1520-9210
VL - 13
SP - 29
EP - 39
JO - IEEE Transactions on Multimedia
JF - IEEE Transactions on Multimedia
IS - 1
M1 - 5629374
ER -