Water surface object detection using panoramic vision based on improved single-shot multibox detector

EURASIP Journal on Advances in Signal Processing

Table 2 Backbone network parameters after modification

Name of convolutional layer	Input size	Kernel	Stride	Output size
Conv1_x	300*300	7*7, 64	2	15015064
Conv2_x	15015064	3*3 max pool	2	7575256
Conv2_x	15015064	\(\left[ {\begin{array}{{20}c} {11} & {64} \\ {33} & {64} \\ {11} & {256} \\ \end{array} } \right]\)*3	1	7575256
Conv3_x	7575256	\(\left[ {\begin{array}{{20}c} {11} & {128} \\ {33} & {128} \\ {11} & {512} \\ \end{array} } \right]\)*4	2	3838512
Conv4_x	3838512	\(\left[ {\begin{array}{{20}c} {11} & {256} \\ {33} & {256} \\ {11} & {1024} \\ \end{array} } \right]\)*6	1	38381024