Loading selected pre-trained parameters into a custom (modified) PyTorch model and freezing them

2022-06-26 05:34:00 【Little beaver flower made by Rua】

Parts of this article draw on https://zhuanlan.zhihu.com/p/34147880

One. This method is the more general one: load the pre-trained parameters according to the parameters of your own model, assigning values by matching name. Layers you have added to the original model have no entry in the checkpoint, so they are not loaded and keep their initial values.

import torch

dict_trained = torch.load(self.args.load_path, map_location=torch.device('cpu'))
dict_new = model.state_dict()
# 1. filter out unnecessary keys
dict_trained = {k: v for k, v in dict_trained.items() if k in dict_new}
# 2. overwrite entries in the existing state dict
dict_new.update(dict_trained)
# 3. load the merged state dict into the model
model.load_state_dict(dict_new)
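
The name-matching idea above can be tried on toy models. The following is a minimal sketch, not the author's code: the two `nn.Sequential` models are hypothetical stand-ins for the pre-trained and the modified network, and the extra shape check is an added assumption that guards against layers that share a name but have changed size.

```python
import torch
import torch.nn as nn

# Hypothetical stand-ins: the modified model appends two layers to the original.
pretrained = nn.Sequential(nn.Linear(4, 8), nn.ReLU(), nn.Linear(8, 2))
modified = nn.Sequential(
    nn.Linear(4, 8), nn.ReLU(), nn.Linear(8, 2), nn.ReLU(), nn.Linear(2, 3)
)

dict_trained = pretrained.state_dict()
dict_new = modified.state_dict()

# Keep only entries whose name exists in the new model and whose shape matches.
filtered = {
    k: v for k, v in dict_trained.items()
    if k in dict_new and v.shape == dict_new[k].shape
}
# Keys of the new model that received nothing from the checkpoint.
skipped = sorted(set(dict_new) - set(filtered))

dict_new.update(filtered)
modified.load_state_dict(dict_new)
print("not loaded:", skipped)
```

The `skipped` list is a quick way to confirm that only the layers you added were left at their initial values.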

Two. This method is a lot more complicated: you walk through the keys by position and make whatever changes you want. In my case, the model adds four layers, 'dense', 'unary_affine', 'binary_affine' and 'classifier', and j += 8 skips their weights and biases (four layers with one weight and one bias each). In addition, the 'crf' part of the original model's parameters is not loaded.

dict_trained = torch.load(self.args.load_path, map_location=torch.device('cpu'))
dict_new = self.model.state_dict().copy()
trained_list = list(dict_trained.keys())
new_list = list(dict_new.keys())
j = 0  # position in new_list; advances separately from i
no_load = {'dense', 'unary_affine', 'binary_affine', 'classifier'}
for i in range(len(trained_list)):
    # the 'crf' parameters of the pre-trained model are not loaded
    if 'crf' in trained_list[i]:
        continue
    # does position j point at one of the newly added (non-BERT) layers?
    flag = any(nd in new_list[j] and 'bert' not in new_list[j] for nd in no_load)
    if flag:
        j += 8  # skip the weights and biases of the no_load layers
    else:
        dict_new[new_list[j]] = dict_trained[trained_list[i]]
        if new_list[j] != trained_list[i]:
            print("i:{}, new state_dict: {}  trained state_dict: {} mismatch".format(i, new_list[j], trained_list[i]))
    j += 1  # the two key lists are not aligned, so j is advanced by hand
self.model.load_state_dict(dict_new)

Later, I learned that there is a simpler method:

When building your own model, if you only want the pre-trained parameters for the parts whose structure is unchanged, set the strict argument to False when loading. Its default value is True, which requires the keys of the pre-trained state dict to match the keys of your own network exactly; otherwise loading fails. Note that strict=False only tolerates missing or unexpected keys: a parameter with the same name but a different shape still raises an error. The implementation is as follows:

model.load_state_dict(torch.load(self.args.load_path), strict=False)
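
With strict=False, `load_state_dict` returns an object whose `missing_keys` and `unexpected_keys` fields list what was not matched, which is worth checking rather than ignoring. A minimal sketch, assuming two hypothetical toy classes `Small` (pre-trained) and `Bigger` (modified) that are not part of the original article:

```python
import torch
import torch.nn as nn

class Small(nn.Module):
    """Hypothetical pre-trained model with a 'crf' head."""
    def __init__(self):
        super().__init__()
        self.encoder = nn.Linear(4, 8)
        self.crf = nn.Linear(8, 2)

class Bigger(nn.Module):
    """Hypothetical modified model: same encoder, new 'dense' layer."""
    def __init__(self):
        super().__init__()
        self.encoder = nn.Linear(4, 8)
        self.dense = nn.Linear(8, 8)

pretrained, model = Small(), Bigger()

# Only keys present on both sides are copied; the rest are reported.
result = model.load_state_dict(pretrained.state_dict(), strict=False)
print("missing in checkpoint:", result.missing_keys)
print("unexpected in checkpoint:", result.unexpected_keys)
```

Here the shared `encoder` is loaded, `dense` is reported as missing and `crf` as unexpected.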

PS: If you hit an error, print the keys of your modified model's parameters and the keys of the loaded checkpoint, compare them, and fix whatever does not match.
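
One way to do that comparison is a small helper such as the one below. `compare_keys` is a hypothetical name introduced only for illustration, and the toy models are assumptions; the helper also reports same-name entries with different shapes, since those fail to load as well.

```python
import torch.nn as nn

def compare_keys(model, checkpoint):
    """Return (only_in_model, only_in_checkpoint, shape_mismatch) key lists."""
    model_sd = model.state_dict()
    only_model = [k for k in model_sd if k not in checkpoint]
    only_ckpt = [k for k in checkpoint if k not in model_sd]
    mismatch = [
        k for k in model_sd
        if k in checkpoint and model_sd[k].shape != checkpoint[k].shape
    ]
    return only_model, only_ckpt, mismatch

# Toy case: same key names, but the last layer has a different output size.
model = nn.Sequential(nn.Linear(4, 8), nn.Linear(8, 3))
checkpoint = nn.Sequential(nn.Linear(4, 8), nn.Linear(8, 2)).state_dict()

only_model, only_ckpt, mismatch = compare_keys(model, checkpoint)
print("only in model:", only_model)
print("only in checkpoint:", only_ckpt)
print("shape mismatch:", mismatch)
```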

Three. Freeze the parameters of these layers

In short, to freeze every parameter:

for p in model.parameters():
    p.requires_grad = False

There are many ways to freeze layers; the approach used here is the one that pairs with the loading methods above.
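
To freeze only the loaded layers and keep the added ones trainable, one option is to select parameters by name. This is a sketch under assumptions: the `Tagger` class and its layer names are hypothetical, with `bert` standing in for the part that received pre-trained weights.

```python
import torch.nn as nn

class Tagger(nn.Module):
    """Hypothetical model: 'bert' is pre-trained, 'classifier' is newly added."""
    def __init__(self):
        super().__init__()
        self.bert = nn.Linear(4, 8)        # stand-in for the loaded encoder
        self.classifier = nn.Linear(8, 2)  # new layer that should keep training

model = Tagger()
new_layers = ("classifier",)  # name prefixes of the layers you added

# Freeze everything except parameters whose name starts with a new-layer prefix.
for name, param in model.named_parameters():
    param.requires_grad = name.startswith(new_layers)

trainable = [n for n, p in model.named_parameters() if p.requires_grad]
print("trainable:", trainable)
```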

I suggest you take a look at
https://discuss.pytorch.org/t/how-the-pytorch-freeze-network-in-some-layers-only-the-rest-of-the-training/7088
or
https://discuss.pytorch.org/t/correct-way-to-freeze-layers/26714

Correspondingly, during training the optimizer should only update parameters with requires_grad = True, so pass it only those:

optimizer = torch.optim.Adam(filter(lambda p: p.requires_grad, net.parameters()), lr=lr)
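
A quick way to check that freezing works is to take one optimizer step and compare weights before and after. The sketch below uses a toy `nn.Sequential` network, random input data and an arbitrary learning rate, all of which are assumptions for illustration.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

net = nn.Sequential(nn.Linear(4, 8), nn.ReLU(), nn.Linear(8, 2))

# Freeze the first layer; the last layer stays trainable.
for p in net[0].parameters():
    p.requires_grad = False

lr = 1e-2  # assumed value, for illustration only
optimizer = torch.optim.Adam(
    filter(lambda p: p.requires_grad, net.parameters()), lr=lr
)

frozen_before = net[0].weight.clone()
head_before = net[2].weight.clone()

# One training step on random data.
loss = net(torch.randn(16, 4)).pow(2).mean()
optimizer.zero_grad()
loss.backward()
optimizer.step()

frozen_unchanged = torch.equal(net[0].weight, frozen_before)
head_changed = not torch.equal(net[2].weight, head_before)
print("frozen layer unchanged:", frozen_unchanged)
print("trainable layer changed:", head_changed)
```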

Copyright notice
This article was written by [Little beaver flower made by Rua]. Please include the original link when reposting. Thank you.
https://yzsam.com/2022/177/202206260526389551.html
