当前位置：网站首页>[paper notes] transunet: transformers make strongencoders for medical image segmentation

[paper notes] transunet: transformers make strongencoders for medical image segmentation

2022-07-06 18:52:00 【come from γ Saiya of stars】

Statement
Update your papers from time to time , Easy to understand , Junior Xiaobai can also understand
Scope of coverage ： In depth learning direction , Include CV、NLP、Data Fusion、Digital Twin

Paper title ：

TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation

TransUNet: Transformer Provide a powerful encoder for medical image segmentation

Thesis link ：https://arxiv.org/abs/2102.04306

Paper code ：https://github.com/Beckschen/TransUNet

Time of publication ：2021 year 2 month

Innovation points

1、 introduce Transformer and U-Net Combining the Internet , structure TransUNet The Internet

Abstract

Medical image segmentation is to develop medical care system , In particular, the necessary prerequisites for disease diagnosis and treatment planning . In various medical image segmentation tasks ,U Shape architecture （ also called U-NET） Has become the de facto standard , And it was a great success . However , Due to the inherent locality of convolution ,U-NET It usually shows limitations in explicitly modeling remote dependencies . The converter designed for sequence to sequence prediction has become an alternative architecture with innate global self-attention mechanism , But due to the lack of underlying details , It may lead to limited positioning capability .

In this paper, TransUNet As a powerful alternative to medical image segmentation , It has both Transformers and U-net The advantages of . One side , The converter will convolute Neural Networks (CNN) The marked image block in the feature map is encoded as an input sequence , Used to extract global context . On the other hand , The decoder upsamples the encoded features , Then the coding features are combined with high resolution CNN Feature mapping , For precise positioning .

We think , Transformer can be used as a strong encoder for medical image segmentation tasks , And combine U-NET Enhance finer details by restoring local spatial information .Transunet It has achieved better performance than various competitive methods in medical applications such as multi organ segmentation and heart segmentation . Codes and models can be found in https://github.com/beckschen/transunet get .

Method

First , The input image is down sampled and 3 Iteration of layer convolution , The characteristics of generation , Conduct Flatten operation ;

then ,Flatten After the feature enters 12 layer Transformer, there Transformer Inside the structure is MSA （ The long attention mechanism ）,MLP （ Fully connected layer ） Then the output ;

Here's an explanation , Why convolute first and then Transformer .

Because ,Transformer The drawback is that , Large amount of computation , And there is no spatial information . Advantage is , Have global information .

The disadvantage of convolution is , Unable to synthesize global information , And the advantage is , After convolution , Fewer parameters , And it has local spatial information , Different convolution kernels have different receptive fields .

therefore , The author puts convolution in Transformer In front of the structure , Combine their advantages and disadvantages , Reduced parameters , And has spatial and global information .

Last , Decoder part and U-Net identical ,reshape After that, conduct four times of upper sampling , Then it is compared with the characteristics of the encoder's three down sampling Concatenation operation , Finally, output the segmentation graph .

Experiments

The goal of the experiment ： Different data sets , Comparison results of different codec structures

experimental result ：TransUNet The result is the best

The goal of the experiment ： Split the result graph

The goal of the experiment ： Comparison of different frameworks

experimental result ： TransUNet Have distinct advantages

At the end

Transofrmer Large data sets are required , But medical data sets are not easy to collect , This may be a limitation Transformer One of the problems of development in the medical field ！

原网站

版权声明
本文为[come from γ Saiya of stars]所创，转载请带上原文链接，感谢
https://yzsam.com/2022/187/202207061059222804.html