Fig. 2
From: Chinese text-line detection from web videos with fully convolutional networks

Details of the fully convolutional network, where the first five convolutional units represent the 5 convolutional stages of PVANET or ResNet-50, and the batch normalization layers and the pooling layers are omitted in the figure for simplicity