当前位置:网站首页>Hardware factors such as CPU, memory, and graphics card also affect the performance of your deep learning model
Hardware factors such as CPU, memory, and graphics card also affect the performance of your deep learning model
2022-08-04 21:41:00 【The mountain of ignorance, the valley of despair, the slope of 】
Reference: https://posts.careerengine.us/p/5dcb5f53bb8d8844ea19cd0f
Data flow path
Assuming that we now have a batch of image sets on the hard disk, which are to be read into the memory and sent to the GPU for computing, the following process will generally be experienced:
The cpu issues a read command, finds the image data from the hard disk, and stores it inIn memory;
The cpu takes out a batch of data from memory, converts it into numpy array, and performs data preprocessing/enhancement operations, such as flipping, translation, color transformation, etc.Return to memory after processing.
Each of the cpu memory (hereinafter referred to as memory) and the gpu memory (hereinafter referred to as video memory) open up a buffer, and a batch of data in the memory is transferred to the video memory through the PCIe channel.
The GPU core obtains data from the video memory for parallel computing, and returns the calculation results to the video memory.
The calculated result will be returned from the video memory to the memory through the PCIe channel.
These 5 steps involve several links that affect data transfer speed:
HDD read speed;
PCIe transfer speed;
Memory read/write speed;
CPU frequency.
Let's talk about hard drives first.
After the hardware bottleneck is multi-threading at the software level
边栏推荐
- UnicodeDecodeError: ‘utf-8‘ codec can‘t decode byte 0xd6 in position 120: invalid continuation byte
- 中大型商业银行堡垒机升级改造方案!必看!
- 【ubuntu20.04安装MySQL以及MySQL-workbench可视化工具】
- 【QT】回调函数的实现
- 2022年江苏省大学生电子设计竞赛(TI杯)B题 飞机 省级一等奖记录 “一个摆烂人的独白”
- 七夕特制:《牛郎会织女》
- Unknown point cloud structure file conversion requirements
- 模拟对抗之红队免杀开发实践
- 打卡第 1 天:正则表达式学习总结
- buu web
猜你喜欢

【uiautomation】微信好友列表获取(存储到txt中)

开发deepstram的自定义插件,使用gst-dseaxmple插件进行扩充,实现deepstream图像输出前的预处理,实现图像自定义绘制图(精四)

Is the International Project Manager PMP certificate worth taking?

DSPE-PEG-Aldehyde, DSPE-PEG-CHO, Phospholipid-Polyethylene Glycol-Aldehyde A hydrophobic 18-carbon phospholipid

Cocoa Application-基础

搬走地下空间开发利用“绊脚石” 中地数码取得地下空间透明化技术突破

Qiangwang Cup 2022 - WEB

PMP证书在哪些行业有用?

JdbcTemplate概述和测试

【SQL之降龙十八掌】01——亢龙有悔:入门10题
随机推荐
未知点云结构文件转换需求
Exploration and Practice of Database Governance
《剑指offer》刷题分类
模拟对抗之红队免杀开发实践
SPSS-System Clustering Software Practice
[Linear Algebra 03] Elimination method display and 4 solutions of AX=b
webmine网页挖矿木马分析与处置
Yolov7:Trainable bag-of-freebies sets new state-of-the-art for real-time objectdetectors
dotnet enables JIT multi-core compilation to improve startup performance
【ubuntu20.04安装MySQL以及MySQL-workbench可视化工具】
OD-Model [6]: YOLOv2
LayaBox---knowledge point
AXI interface application of Zynq Fpga image processing - the use of axi_lite interface
如何一键重装Win11系统 一键重装系统方法
PowerCLi import license to vCenter 7
【uiautomation】微信好友列表获取(存储到txt中)
ES6高级-async的用法
LeetCode 199: 二叉树的右视图
Milvus configuration related
国内的PMP证书含金量到底如何