Understanding AlexNet
References:
- Caffe AlexNet prototxt
- CS231n: Convolutional Neural Networks — Architectures, Convolution / Pooling Layers
- ImageNet Classification with Deep Convolutional Neural Networks
- Slides accompanying the paper
From CS231n, we know the following output-size formulas (K is the kernel size, P is the padding size, S is the stride).
For a convolutional layer:
$$W_{new} = \frac {W_{old} - K + 2P}{S} + 1$$
For the pooling layer (we don’t use padding):
$$W_{new} = \frac {W_{old} - K}{S} + 1$$
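These two formulas translate directly into a small helper (a sketch of my own, not code from the references; the helper names are made up):

```python
def conv_out(w, k, s=1, p=0):
    """Spatial output size of a conv layer: (W - K + 2P) / S + 1."""
    return (w - k + 2 * p) // s + 1

def pool_out(w, k, s):
    """Spatial output size of a pooling layer (no padding): (W - K) / S + 1."""
    return (w - k) // s + 1

# Conv1 of AlexNet: 227x227 input, K=11, S=4, P=0 -> 55x55
print(conv_out(227, 11, 4, 0))  # 55
# Pool1: 55x55 input, K=3, S=2 -> 27x27
print(pool_out(55, 3, 2))       # 27
```

Integer (floor) division is used, matching how frameworks round when the stride does not divide evenly.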
Preprocessing
Resize the image, subtract the mean, and take crops:
Image ----resize----> 256 x 256 ----crop----> 227 x 227 (the 224 stated in the original paper seems to be wrong)
The crops are the four corner patches and the center patch (the paper also mirrors each crop horizontally at test time, giving 10 views per image).
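The corner/center cropping step can be sketched with plain NumPy slicing (a minimal illustration, assuming an H x W x C array; mirroring is omitted):

```python
import numpy as np

def five_crops(img, crop=227):
    """Return the four corner crops and the center crop of an H x W x C image."""
    h, w = img.shape[:2]
    ch, cw = (h - crop) // 2, (w - crop) // 2
    return [
        img[:crop, :crop],                 # top-left corner
        img[:crop, w - crop:],             # top-right corner
        img[h - crop:, :crop],             # bottom-left corner
        img[h - crop:, w - crop:],         # bottom-right corner
        img[ch:ch + crop, cw:cw + crop],   # center
    ]

crops = five_crops(np.zeros((256, 256, 3)))
print(len(crops), crops[0].shape)  # 5 (227, 227, 3)
```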
Network Structure
| Layer | Input (size, channels) | Parameters | Output (size, channels) |
| --- | --- | --- | --- |
| Conv1 | 227 x 227, 3 | K=11, S=4, P=0, group=1 | 55 x 55, 96 |
| ReLU1 | | | |
| LRN1 | | | |
| Pool1 | 55 x 55, 96 | K=3, S=2 | 27 x 27, 96 |
| Conv2 | 27 x 27, 96 | K=5, S=1, P=2, group=2 | 27 x 27, 256 (group=2 splits the layer into two parts, 48->128 and 48->128; see the two-GPU streams in the paper's architecture figure) |
| ReLU2 | | | |
| LRN2 | | | |
| Pool2 | 27 x 27, 256 | K=3, S=2, P=0 | 13 x 13, 256 |
| Conv3 | 13 x 13, 256 | K=3, S=1, P=1, group=1 | 13 x 13, 384 (group=1, so the connections cross between the two streams in the figure) |
| ReLU3 | | | |
| Conv4 | 13 x 13, 384 | K=3, S=1, P=1, group=2 | 13 x 13, 384 (group=2, split again: 192->192 and 192->192) |
| ReLU4 | | | |
| Conv5 | 13 x 13, 384 | K=3, S=1, P=1, group=2 | 13 x 13, 256 (group=2, two parts: 192->128 and 192->128) |
| ReLU5 | | | |
| Pool5 | 13 x 13, 256 | K=3, S=2 | 6 x 6, 256 |
| FC6 | 6 x 6, 256 | fully connected | 4096 x 1 |
| ReLU6 | | | |
| Dropout6 | | p=0.5 (training phase only; see CS231n) | |
| FC7 | 4096 x 1 | fully connected | 4096 x 1 |
| ReLU7 | | | |
| Dropout7 | | p=0.5 (training phase only; see CS231n) | |
| FC8 | 4096 x 1 | fully connected | 1000 x 1 |
| Loss | 1000 x 1 | softmax loss | scalar |
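As a sanity check, the spatial sizes in the table can be reproduced layer by layer from the two formulas at the top of these notes (a self-contained sketch; helper names are my own):

```python
def conv_out(w, k, s, p):
    """(W - K + 2P) / S + 1"""
    return (w - k + 2 * p) // s + 1

def pool_out(w, k, s):
    """(W - K) / S + 1, no padding"""
    return (w - k) // s + 1

w = 227                                      # input after cropping
w = conv_out(w, 11, 4, 0); assert w == 55    # Conv1
w = pool_out(w, 3, 2);     assert w == 27    # Pool1
w = conv_out(w, 5, 1, 2);  assert w == 27    # Conv2
w = pool_out(w, 3, 2);     assert w == 13    # Pool2
w = conv_out(w, 3, 1, 1);  assert w == 13    # Conv3
w = conv_out(w, 3, 1, 1);  assert w == 13    # Conv4
w = conv_out(w, 3, 1, 1);  assert w == 13    # Conv5
w = pool_out(w, 3, 2);     assert w == 6     # Pool5
print(w * w * 256)  # 9216 values flattened into FC6
```

This also makes explicit why FC6 has 6 x 6 x 256 = 9216 inputs.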