当前位置：网站首页>Efficient Video Instance Segmentation via Tracklet Query and Proposal

Efficient Video Instance Segmentation via Tracklet Query and Proposal

2022-07-26 03:19:00 【Early lunar month in Pingqiu】

Abstract

VIS Our goal is to classify at the same time , Division , Track multiple target instances in the video . The present clip-level Of VIS Input a short video , Because the timing context information of multiple frames is used . The effect is obviously better than frame-level VIS. But at present, most clip-level Methods are neither end-to-end learnable , Nor can it be real-time .VIS transformer It solves the above two problems , But because of it frame-wise Dense attention calculation , Training time is too long ; and VisTR Cannot learn from end to end for multiple video segments , Manual data association is required , Before and after clips Examples of weeks tracklet link . In this paper, the EfficientVIS Training reasoning is very efficient , And end-to-end learning . The core idea is “tracklet query and tracklet proposal that associate and segment RoIs across space and time by an interative query-video interaction". And further proposed correspondence Study , Make adjacent clips Of tracklets Links can be learned .

Tracklet Query and Proposal

use tracklet queries ${q_i\}_{i=1}^N$ and tracklet proposals ${b_i\}_{i=1}^N$ To jointly represent each object instance in a video .tracklet query $q_i\in R^{T\times C}$ Is the number of channels C Of embedding vector ,tracklet proposal $b_i\in R^{T\times 4}$ It's a space-time Rectangle box .

原网站

版权声明
本文为[Early lunar month in Pingqiu]所创，转载请带上原文链接，感谢
https://yzsam.com/2022/207/202207260312421025.html

当前位置：网站首页>Efficient Video Instance Segmentation via Tracklet Query and Proposal

Efficient Video Instance Segmentation via Tracklet Query and Proposal

Abstract

Tracklet Query and Proposal

边栏推荐

猜你喜欢

随机推荐