当前位置:网站首页>Efficient Video Instance Segmentation via Tracklet Query and Proposal
Efficient Video Instance Segmentation via Tracklet Query and Proposal
2022-07-26 03:19:00 【Early lunar month in Pingqiu】
Abstract
VIS Our goal is to classify at the same time , Division , Track multiple target instances in the video . The present clip-level Of VIS Input a short video , Because the timing context information of multiple frames is used . The effect is obviously better than frame-level VIS. But at present, most clip-level Methods are neither end-to-end learnable , Nor can it be real-time .VIS transformer It solves the above two problems , But because of it frame-wise Dense attention calculation , Training time is too long ; and VisTR Cannot learn from end to end for multiple video segments , Manual data association is required , Before and after clips Examples of weeks tracklet link . In this paper, the EfficientVIS Training reasoning is very efficient , And end-to-end learning . The core idea is “tracklet query and tracklet proposal that associate and segment RoIs across space and time by an interative query-video interaction". And further proposed correspondence Study , Make adjacent clips Of tracklets Links can be learned .
Tracklet Query and Proposal
use tracklet queries { q i } i = 1 N \{q_i\}_{i=1}^N { qi}i=1N and tracklet proposals { b i } i = 1 N \{b_i\}_{i=1}^N { bi}i=1N To jointly represent each object instance in a video .tracklet query q i ∈ R T × C q_i\in R^{T\times C} qi∈RT×C Is the number of channels C Of embedding vector ,tracklet proposal b i ∈ R T × 4 b_i\in R^{T\times 4} bi∈RT×4 It's a space-time Rectangle box .
边栏推荐
- Unity quickly builds urban scenes
- ByteDance (Tiktok) software test monthly salary 23K post, technical two-sided interview questions are newly released
- Classic interview questions -- three characteristics of OOP language
- STM32 - DMA notes
- STM——EXTI外部中断学习笔记
- ELS callback function, exit message
- tensorflow中tf.Variable()函数的用法
- 78. 子集
- 【尤里复裂人】带你轻松理解——深拷贝和浅拷贝
- tf.truncated_normal()用法
猜你喜欢

Unknown-Aware Object Detection:Learning What You Don’t Know from Videos in the Wild(CVPR 2022)

Quick check of OGC WebGIS common service standards (wms/wmts/tms/wfs)

Leetcode · daily question · 919. complete binary tree inserter · hierarchy traversal · BFS

How to reinstall win7 system?

YOLOv3: An Incremental Improvement

canvas——绘制文本——饼图的修改

实现一个方法,找出数组中的第k大和第m大的数字相加之和

爆肝出了4W字的Redis面试教程
![[sql] usage of self connection](/img/92/92474343b4b4e6ea60453b4799cb55.jpg)
[sql] usage of self connection

LeetCode·
随机推荐
万维网、因特网和互联网的区别
[STL]优先级队列priority_queue
Leetcode · 83 biweekly match · 6128. best poker hand · simulation
The difference between the world wide web, the Internet and the Internet
爆肝出了4W字的Redis面试教程
Opening method of win11 microphone permission
LeetCode·
Course notes of single chip microcomputer principle and interface technology for migrant workers majoring in electronic information engineering
hello world驱动(二)-初级版
tf.truncated_normal()用法
Golang log programming system
els 修改光标、修改图标
ue4如何进行静态渲染?5个步骤生成静态渲染
了解预加载和懒加载、学会缓动动画
记一次SQL优化
[noip2001 popularization group] the problem of maximum common divisor and minimum common multiple
canvas——心电图的设计,以及如何清理画布
Summary of Huawei virtualization fusioncompute knowledge points
VR panoramic shooting and production of business center helps businesses effectively attract people
2022-07-21 study notes of group 4 self-cultivation class (every day)