Fig. 3
From: Chinese text-line detection from web videos with fully convolutional networks
Captions and their corresponding label maps: (a) and (c) are the English and Chinese caption areas extracted from one frame, and (b) and (d) are their corresponding labels.