当前位置:网站首页>Resunet tensorrt8.2 speed and video memory record table on Jetson Xavier NX (continuously supplemented later)
Resunet tensorrt8.2 speed and video memory record table on Jetson Xavier NX (continuously supplemented later)
2022-07-02 20:00:00 【Few people take the road_ Mental journey】
ResUnet-TensorRT8.2 Speed and video memory record table
| precision | Pattern | Image size | Number of categories | batch | Number of threads | Reasoning time | Complete processing time | memory |
|---|---|---|---|---|---|---|---|---|
| FP32 | 20W 2CORE | 512*640 | 2 | 1 | 1 | 168ms | 179ms | 2.2G |
| FP32 | 20W 2CORE | 512*640 | 2 | 1 | 2 | 172ms | 184ms | 3.0G |
| FP16 | 20W 2CORE | 512*640 | 2 | 1 | 1 | 58ms | 68ms | 1.6G |
| FP16 | 20W 2CORE | 512*640 | 2 | 1 | 2 | 58ms | 68ms | 1.9G |
| FP32 | 20W 2CORE | 512*612 | 6 | 1 | 1 | 167ms | 209ms | 2.2G |
| FP32 | 20W 2CORE | 512*612 | 6 | 1 | 2 | 170ms | 234ms | 3.0G |
| FP16 | 20W 2CORE | 512*612 | 6 | 1 | 1 | 57ms | 97ms | 1.6G |
| FP16 | 20W 2CORE | 512*612 | 6 | 1 | 2 | 58ms | 106ms | 1.9G |
explain :
1. Pattern means Jetson Power consumption mode of the device , For my Jetson XAVIER NX Come on , All in all 8 Patterns , If you want to achieve maximum reasoning speed , choice 20W 2CORE Pattern . There is a... In the upper right corner of the main interface MODE The choice of , choice 20W 2CORE Mode is enough .
( My choice 20W 6CORE Test and follow 20W 2CORE almost , Only soon 1ms, So choose 20W 2CORE that will do )

2. Reasoning time refers to the average time of each graph doInference( The perform cudaMemcpyAsync) The reasoning time required .
Complete processing time reasoning time plus pre-processing and post-processing time .
3. about Jetson For equipment ,CPU and GPU share , So video memory is memory . about Jetson XAVIER NX Total memory 8G.
The viewing method cannot be used directly nvidia-smi The command line , Must install jetson-stats.
For specific operation methods, please refer to the following blog .
Jetson View video memory on device ( Memory )——jetson-stats
4. Why do I have 8 A model , And under this system TensorRT yes 8.2.1.8 edition , No 7 edition , Guess the reason is that the image used in the initial burning system is relatively new .
And compared to TensorRT7 edition , The speed is fast 20ms, See my previous blog for details .
Jetson XAVIER NX On ResUnet-TensorRT7 Speed and video memory record table ( Continuous follow-up )
边栏推荐
- 攻防世界pwn题:Recho
- 自动生成VGG图像注释文件
- Solution: vs2017 cannot open the source file stdio h main. H header document [easy to understand]
- Esp32c3 crash analysis
- How can testers do without missing tests? Seven o'clock is enough
- In depth understanding of modern web browsers (I)
- 职场四象限法则:时间管理四象限与职场沟通四象限「建议收藏」
- Correspondence between pytoch version, CUDA version and graphics card driver version
- Py's interpret: a detailed introduction to interpret, installation, and case application
- 从20s优化到500ms,我用了这三招
猜你喜欢

GCC: Graph Contrastive Coding for Graph Neural NetworkPre-Training

Basic concept of database, installation and configuration of database, basic use of MySQL, operation of database in the project

高并发下如何避免产生重复数据?

SBT tutorial

B-end e-commerce - reverse order process

有时候只查询一行语句,执行也慢

Génération automatique de fichiers d'annotation d'images vgg

After eight years of test experience and interview with 28K company, hematemesis sorted out high-frequency interview questions and answers

After 65 days of closure and control of the epidemic, my home office experience sharing | community essay solicitation

Data Lake (XII): integration of spark3.1.2 and iceberg0.12.1
随机推荐
HDL design peripheral tools to reduce errors and help you take off!
AcWing 181. Turnaround game solution (search ida* search)
API documentation tool knife4j usage details
KT148A语音芯片ic的用户端自己更换语音的方法,上位机
台湾SSS鑫创SSS1700替代Cmedia CM6533 24bit 96KHZ USB音频编解码芯片
Overview of browser caching mechanism
Spark source code compilation, cluster deployment and SBT development environment integration in idea
从20s优化到500ms,我用了这三招
功能、作用、效能、功用、效用、功效
Implementation of online shopping mall system based on SSM
【Hot100】23. 合并K个升序链表
Use IDM to download Baidu online disk files (useful for personal testing) [easy to understand]
Common problems and description of kt148a voice chip IC development
[JS] get the search parameters of URL in hash mode
八年测开经验,面试28K公司后,吐血整理出高频面试题和答案
NMF-matlab
Sometimes only one line of statements are queried, and the execution is slow
编写完10万行代码,我发了篇长文吐槽Rust
[译]深入了解现代web浏览器(一)
职场四象限法则:时间管理四象限与职场沟通四象限「建议收藏」