当前位置：网站首页>[pytorch pre training model modification, addition and deletion of specific layers]

[pytorch pre training model modification, addition and deletion of specific layers]

2022-07-05 11:48:00 【Network starry sky (LUOC)】

List of articles

One 、 The introduction
Two 、 Official model library
3、 ... and 、 Modify a specific layer
Four 、 Add or delete specific layers

One 、 The introduction

In the process of building a deep learning network , It is often necessary to modify the pre training model and add or delete specific layers .

torchvision.models It provides a variety of models to meet the choice of different tasks , So when building the network structure , There is no need to reproduce a network structure from scratch , Just modify it on the basis of the official library .

Two 、 Official model library

pytorch The provided model can be queried through the following links ：https://pytorch.org/vision/stable/models.html, Classification 、 Division 、 Object detection instance segmentation, key point detection and video classification 4 A classification , You can find the model you need .

Take the classification task as an example , What we use is resnet.torchvision.models Provides resnet18,resnet34,resnet50,resnet101,resnet152. The two columns on the right are where they are ImageNet Upper top1 Accuracy and top5 Accuracy.

Insert picture description here

Here we use resnet50 For example . The function is described as follows ：

Insert picture description here

import torchvision.models as models

def Net(nn.Module):
	def __init__(self, input_ch, num_class,pretrained=True):
		super(Net,self).__init__()
		self.model = models.resnet50(pretrained=pretrained)
	def forward(self,x):
		x = self.model(x)
		return x

such , We define a Net, This Net Is a pre training weight used resnet50.

3、 ... and 、 Modify a specific layer

In use , One of the problems we may often encounter is , The number of input channels is inconsistent with that of the first layer of the network . Here, the first floor needs to be checked conv Make changes . If we initialize a conv layer , And I want to use the weight of pre training , What do you do then ？ We can do this in the following ways .
resnet50 Of conv1 Weight dimension is [64,3,7,7], It means that the input image needs to be 3 passageway . Suppose the image we want to input is a grayscale image , that conv1 The number of input channels should be changed to 1.

Put the original nn.Conv2d(3, 64, kernel_size=(7,7), stride=(2,2), padding=(3,3), bias=False), Replace with nn.Conv2d(1, 64, kernel_size=(7,7), stride=(2,2), padding=(3,3), bias=False).

def Net(nn.Module):
	def __init__(self, input_ch, num_class,pretrained=True):
		super(Net,self).__init__()
		self.model = models.resnet50(pretrained=pretrained)
		conv1 = nn.Conv2d(1, 64, kernel_size=(7,7), stride=(2,2), padding=(3,3), bias=False) # new conv1 layer 
		self.model.conv1 = conv1 # Replace the original conv1
	def forward(self,x):
		x = self.model(x)
		return x

Follow the operation above , be conv1 The pre training weight of cannot be utilized . In order to make use of conv1 Pre training weight of , We walked along dim=1 Draw , Expand the average weight to new conv1 The weight dimension is consistent .

def Net(nn.Module):
	def __init__(self, input_ch, num_class,pretrained=True):
		super(Net,self).__init__()
		self.model = models.resnet50(pretrained=pretrained)
		conv1_weight = torch.mean(self.model.conv1.weight,dim=1,keepdim=True).repeat(1,input_ch,1,1)# Remove from conv1 Weight and average and expand 
		conv1 = nn.Conv2d(input_ch, 64, kernel_size=(7,7), stride=(2,2), padding=(3,3), bias=False) # new conv1 layer 
		model_dict = self.model.state_dict()# Get the pre training weight of the whole network 
		self.model.conv1 = conv1 # Replace the original conv1
		model_dict['conv1.weight'] = conv1_weight # take conv1 Replace the weight with a new one conv1 The weight 
		model_dict.update(model_dict)# Update the pre training weight of the whole network 
		self.model.load_state_dict(model_dict)# Load new pre training weights 
		
	def forward(self,x):
		x = self.model(x)
		return x

Four 、 Add or delete specific layers

We also often encounter the need to delete the last few layers of the network structure . Or to resnet50 For example . Suppose you want to complete a multi label classification task , To increase classifier.

import torchvision.models as models

class classifer(nn.Module):
	def __init__(self,in_ch,num_classes):
		super(classification_head,self).__init__()
		self.avgpool = nn.AdaptiveAvgPool2d(output_size=(1, 1))
		self.fc = nn.Linear(in_ch,num_classes)

	def forward(self, x):
		x = self.avgpool(x)
		x = torch.flatten(x, 1)
		x = self.fc(x)
		# import pdb;pdb.set_trace()
		return x

class Net(nn.Module):
	def __init__(self, input_ch, num_class,pretrained=True):
		super(Net,self).__init__()
		model = models.resnet50(pretrained=pretrained)
		self.backbone =  nn.Sequential(*list(model.children())[:-3])# Put the last layer4,Avgpool and Fully Connected Layer Remove 
		self.classification_head1 = nn.Sequential(*list(model.children())[-3],
										classifier(2048,3))
		self.classification_head2 = nn.Sequential(*list(model.children())[-3],
										classifier(2048,5))
										
	def forward(self,x):
		x = self.backbone(x)
		output1 = self.classification_head1(x)
		output2 = self.classification_head2(x)
		return [output1,putput2]

take layer4 Also from the backbone It is separated and belongs to two classifer In order to avoid the mutual interference between the two classification tasks , Keep only the lower levels 、 Feature extraction for the network part with high commonality , The higher-level network carries out .

原网站

版权声明
本文为[Network starry sky (LUOC)]所创，转载请带上原文链接，感谢
https://yzsam.com/2022/186/202207051134377604.html