当前位置：网站首页>Convolution neural network -- convolution layer

Convolution neural network -- convolution layer

2022-07-05 05:33:00 【Li Junfeng】

Convolution layer

In convolutional neural networks , The most important thing is convolution layer , It is responsible for extracting feature information from pictures .
First consider the simplest picture , A two-dimensional picture , Then the convolution kernel is also two-dimensional .

Convolution operation

It sounds tall , In fact, it is the sum of products .
for instance ：
Suppose the picture is like this

1	2	4
3	5	4

The convolution kernel is

6	1
4	4

Then the result of convolution operation is ：
$6\times 1+1\times 2 + 4\times 3 + 4\times 5 =40 \newline 6\times 2 + 1\times 4 + 4\times 5+4\times 4 =52$

fill

Through the simple example above , It's not hard to find out , After convolution , The size of the matrix changes . This will bring about a serious problem , After continuous convolution , The picture is getting smaller and smaller , So that the convolution operation cannot be carried out in the end .
To solve this problem , We hope that after convolution , The size of the picture does not change , At this time, you need to fill the corresponding value to the edge of the image .

Matrix size

How much to fill the edge of the picture , This is closely related to the size of convolution kernel . So is there a relationship between this size change . The answer is yes .
Might as well set $H, W$ Represents the length and height of the original matrix , $O H, O W$ Represents the length of the convolution result 、 high , $F H, F W$ Represents the length of convolution kernel 、 high , $P$ Indicates filling , $S$ Indicates the stride .
$=\frac{H + 2\cdot P - FH}{S}+1 \newline OW = \frac{W + 2\cdot P - FW}{S}+ 1$

Code implementation

The principle of convolution operation is very simple , And the code implementation is also very simple .
Through a few for Loop can easily solve , But there are more efficient ways .
We can change the numbers corresponding to the same convolution kernel in the picture into one line , Change the convolution kernel into a column . Then use Matrix multiplication , Convolution operation can be perfectly realized , And one matrix multiplication can calculate the result of the whole picture .
How to change it into one line , The code is given below .

def im2col(input_data, filter_h, filter_w, stride=1, pad=0):
    """ Parameters ---------- input_data :  from ( Data volume ,  passageway ,  high ,  Long ) Of 4 Input data made up of dimension array  filter_h :  The height of the filter  filter_w :  The length of the filter  stride :  Stride  pad :  fill  Returns ------- col : 2 Dimension group  """
    N, C, H, W = input_data.shape
    out_h = (H + 2*pad - filter_h)//stride + 1
    out_w = (W + 2*pad - filter_w)//stride + 1

    img = np.pad(input_data, [(0,0), (0,0), (pad, pad), (pad, pad)], 'constant')
    col = np.zeros((N, C, filter_h, filter_w, out_h, out_w))

    for y in range(filter_h):
        y_max = y + stride*out_h
        for x in range(filter_w):
            x_max = x + stride*out_w
            col[:, :, y, x, :, :] = img[:, :, y:y_max:stride, x:x_max:stride]

    col = col.transpose(0, 4, 5, 1, 2, 3).reshape(N*out_h*out_w, -1)
    return col


def col2im(col, input_shape, filter_h, filter_w, stride=1, pad=0):
    """ Parameters ---------- col : input_shape :  The shape of the input data （ example ：(10, 1, 28, 28)） filter_h : filter_w stride pad Returns ------- """
    N, C, H, W = input_shape
    out_h = (H + 2*pad - filter_h)//stride + 1
    out_w = (W + 2*pad - filter_w)//stride + 1
    col = col.reshape(N, out_h, out_w, C, filter_h, filter_w).transpose(0, 3, 4, 5, 1, 2)

    img = np.zeros((N, C, H + 2*pad + stride - 1, W + 2*pad + stride - 1))
    for y in range(filter_h):
        y_max = y + stride*out_h
        for x in range(filter_w):
            x_max = x + stride*out_w
            img[:, :, y:y_max:stride, x:x_max:stride] += col[:, :, y, x, :, :]

    return img[:, :, pad:H + pad, pad:W + pad]

原网站

版权声明
本文为[Li Junfeng]所创，转载请带上原文链接，感谢
https://yzsam.com/2022/186/202207050527288108.html