当前位置:网站首页>Hardware factors such as CPU, memory, and graphics card also affect the performance of your deep learning model
Hardware factors such as CPU, memory, and graphics card also affect the performance of your deep learning model
2022-08-04 21:41:00 【The mountain of ignorance, the valley of despair, the slope of 】
Reference: https://posts.careerengine.us/p/5dcb5f53bb8d8844ea19cd0f
Data flow path
Assuming that we now have a batch of image sets on the hard disk, which are to be read into the memory and sent to the GPU for computing, the following process will generally be experienced:
The cpu issues a read command, finds the image data from the hard disk, and stores it inIn memory;
The cpu takes out a batch of data from memory, converts it into numpy array, and performs data preprocessing/enhancement operations, such as flipping, translation, color transformation, etc.Return to memory after processing.
Each of the cpu memory (hereinafter referred to as memory) and the gpu memory (hereinafter referred to as video memory) open up a buffer, and a batch of data in the memory is transferred to the video memory through the PCIe channel.
The GPU core obtains data from the video memory for parallel computing, and returns the calculation results to the video memory.
The calculated result will be returned from the video memory to the memory through the PCIe channel.
These 5 steps involve several links that affect data transfer speed:
HDD read speed;
PCIe transfer speed;
Memory read/write speed;
CPU frequency.
Let's talk about hard drives first.
After the hardware bottleneck is multi-threading at the software level
边栏推荐
- 【社媒营销】WhatsApp Business API:您需要知道的一切
- 【ubuntu20.04安装MySQL以及MySQL-workbench可视化工具】
- ini怎么使用? C#教程
- Red team kill-free development practice of simulated confrontation
- docker 搭建mysql 主从复制
- 如何一键重装win7系统?重装win7系统详细教程
- 数电快速入门(三)(卡诺图化简法的介绍)
- ES6高级-Promise的用法
- js data type, throttling/anti-shake, click event delegation optimization, transition animation
- webmine网页挖矿木马分析与处置
猜你喜欢
随机推荐
Spss-一元回归实操
Qiangwang Cup 2022 - WEB
【分布式】分布式ID生成策略
大势所趋之下的nft拍卖,未来艺术品的新赋能
Win11如何开启Telnet客户端?
如何为Web3.0世界启动完美的DAO
Codeforces Round #811 (Div. 3)
UnicodeDecodeError: ‘utf-8‘ codec can‘t decode byte 0xd6 in position 120: invalid continuation byte
MySQL查询为啥慢了?
Hands-on Deep Learning_NiN
OC-协议
ini怎么使用? C#教程
Yolov7:Trainable bag-of-freebies sets new state-of-the-art for real-time objectdetectors
js data type, throttling/anti-shake, click event delegation optimization, transition animation
Is the International Project Manager PMP certificate worth taking?
模拟对抗之红队免杀开发实践
[Linear Algebra 03] Elimination method display and 4 solutions of AX=b
LayaBox---TypeScript---结构
matlab drawing
七夕特制:《牛郎会织女》








