当前位置:网站首页>Jetson XAVIER NX上ResUnet-TensorRT8.2速度與顯存記錄錶(後續不斷補充)
Jetson XAVIER NX上ResUnet-TensorRT8.2速度與顯存記錄錶(後續不斷補充)
2022-07-02 19:59:00 【少有人走的路_心智旅程】
ResUnet-TensorRT8.2速度與顯存記錄錶
| 精度 | 模式 | 圖像尺寸 | 類別數 | 批次 | 線程數 | 推理時間 | 完整處理時間 | 顯存 |
|---|---|---|---|---|---|---|---|---|
| FP32 | 20W 2CORE | 512*640 | 2 | 1 | 1 | 168ms | 179ms | 2.2G |
| FP32 | 20W 2CORE | 512*640 | 2 | 1 | 2 | 172ms | 184ms | 3.0G |
| FP16 | 20W 2CORE | 512*640 | 2 | 1 | 1 | 58ms | 68ms | 1.6G |
| FP16 | 20W 2CORE | 512*640 | 2 | 1 | 2 | 58ms | 68ms | 1.9G |
| FP32 | 20W 2CORE | 512*612 | 6 | 1 | 1 | 167ms | 209ms | 2.2G |
| FP32 | 20W 2CORE | 512*612 | 6 | 1 | 2 | 170ms | 234ms | 3.0G |
| FP16 | 20W 2CORE | 512*612 | 6 | 1 | 1 | 57ms | 97ms | 1.6G |
| FP16 | 20W 2CORE | 512*612 | 6 | 1 | 2 | 58ms | 106ms | 1.9G |
說明:
1.模式是指Jetson設備的功耗模式,對於本人的Jetson XAVIER NX來說,總共有8種模式,如果想達到最大推理速度的話,選擇20W 2CORE模式。在主界面的右上角有個MODE的選擇,選擇20W 2CORE模式即可。
(本人選擇20W 6CORE測試下來跟20W 2CORE差不多,只快了1ms,所以選擇20W 2CORE即可)

2.推理時間是指平均每張圖進行doInference(即執行cudaMemcpyAsync)所需要的推理時間。
完整處理時間推理時間加上前處理與後處理時間。
3.對於Jetson設備來說,CPU和GPU共用,所以顯存就是內存。對於Jetson XAVIER NX來說內存總共8G。
而查看的方式不能直接使用nvidia-smi的命令行,必須安裝jetson-stats。
具體操作方式可參考以下博客。
Jetson設備上查看顯存(內存)——jetson-stats
4.為什麼本人會有8個模式,而且這個系統下的TensorRT是8.2.1.8版本,不是7版本,猜測原因是在最初燒錄系統的時候使用的鏡像是比較新的。
而且相比TensorRT7版本,速度快了近20ms,具體可以看本人之前的博客。
Jetson XAVIER NX上ResUnet-TensorRT7速度與顯存記錄錶(後續不斷補充)
边栏推荐
- In depth understanding of modern web browsers (I)
- pytorch 模型保存的完整例子+pytorch 模型保存只保存可训练参数吗?是(+解决方案)
- After 65 days of closure and control of the epidemic, my home office experience sharing | community essay solicitation
- esp32c3 crash分析
- Dictionaries
- Summary of interview experience, escort your offer, full of knowledge points
- rxjs Observable 自定义 Operator 的开发技巧
- API documentation tool knife4j usage details
- NMF-matlab
- Istio部署:快速上手微服务,
猜你喜欢

One side is volume, the other side is layoff. There are a lot of layoffs in byte commercialization department. What do you think of this wave?

HDL design peripheral tools to reduce errors and help you take off!

Introduction to program ape (XII) -- data storage

Why do I have a passion for process?

Zabbix5 client installation and configuration

CRM Customer Relationship Management System

SQLite 3.39.0 release supports right external connection and all external connection

Summary of interview experience, escort your offer, full of knowledge points

AcWing 1126. Minimum cost solution (shortest path Dijkstra)

How to avoid duplicate data in gaobingfa?
随机推荐
Self-Improvement! Daliangshan boys all award Zhibo! Thank you for your paper
Sometimes only one line of statements are queried, and the execution is slow
RPD出品:Superpower Squad 保姆级攻略
After writing 100000 lines of code, I sent a long article roast rust
At compilation environment setup -win
Zabbix5 client installation and configuration
通信人的经典语录,第一条就扎心了……
基于SSM实现网上购物商城系统
Burp install license key not recognized
分享几个图床网址,便于大家分享图片
Development skills of rxjs observable custom operator
简书自动阅读
What are the benefits of multi terminal applet development? Covering Baidu applet, Tiktok applet, wechat applet development, and seizing the multi platform traffic dividend
for(auto a : b)和for(auto &a : b)用法
职场四象限法则:时间管理四象限与职场沟通四象限「建议收藏」
[JS] get the search parameters of URL in hash mode
Cuckoo filter
Jetson XAVIER NX上ResUnet-TensorRT8.2速度与显存记录表(后续不断补充)
Use IDM to download Baidu online disk files (useful for personal testing) [easy to understand]
【Hot100】21. 合并两个有序链表