当前位置:网站首页>CUDA details GPU architecture
CUDA details GPU architecture
2022-07-29 02:36:00 【Autumn ink】
Each thread has its own private local memory (Local Memory), Each thread block contains shared memory (Shared Memory), Can be shared by all threads in the thread block , Its life cycle is consistent with the thread block . Besides , All threads have access to global memory (Global Memory). You can also access some read-only memory blocks : Constant memory (Constant Memory) And texture memory (Texture Memory). Memory structure involves program optimization , They are not discussed in depth here .

GPU One of the core components of hardware is SM, I've said that before ,SM The English name is Streaming Multiprocessor, Stream multiprocessor .SM Its core components include CUDA The core , Shared memory , Register, etc ,SM Hundreds of threads can be executed concurrently , Concurrency depends on SM Number of resources owned . When one kernel When executed , its gird Thread blocks in are allocated to SM On , A thread block can only be in one SM Was dispatched to .SM In general, multiple thread blocks can be scheduled , It depends SM Own ability . So it's possible that one kernel Each thread block of is allocated multiple SM, therefore grid It's just logic , and SM It's the physical layer of execution .SM It's using SIMT (Single-Instruction, Multiple-Thread, Single instruction multithreading ) framework , The basic execution unit is
边栏推荐
- Waiting queue wait_ queue
- When I look at the source code, what am I thinking?
- What should I do if excel opens a CSV file containing Chinese characters and there is garbled code?
- How to migrate thinkphp5 projects to Alibaba cloud function computing to cope with traffic peaks?
- 主从复制及其原理
- Explain asynchronous tasks in detail: task status and lifecycle management
- 物联网组件
- 网络安全漏洞管理的探索与实践
- 矿山开采虚拟现实vr安全培训提升员工警惕性和防护意识
- Cuda-npp image and video processing
猜你喜欢

What should I do if excel opens a CSV file containing Chinese characters and there is garbled code?

Split, an avalanche caused by connection pool parameters

The outsourcing company "mixed" for two years, and I only did five things seriously. Now I get byte offer smoothly.

多边形点测试

Read the recent trends of okaleido tiger and tap the value and potential behind it

Production scheme and advantages of online 3D digital exhibition hall

当Synchronized遇到这玩意儿,有个大坑,要注意

Remember error scheduler once Asynceventqueue: dropping event from queue shared causes OOM

Shell 脚本 快速入门 -01

一文读懂Okaleido Tiger近期动态,挖掘背后价值与潜力
随机推荐
The financing demand of 129 million yuan was released, and the roadshow of the Dake city project continued to irrigate the "good seedlings" of scientific innovation
0728~面试题梳理
Rust 列表(Vec)复制
Split, an avalanche caused by connection pool parameters
KBxxxxxx不一定是补丁,也可能是某个问题解决方案的描述
Three expiration strategies
线上3d数字展厅制作方案及优点
“两个披萨”团队的分支管理实践
TCP重传机制有哪些?
After 4 years of testing experience, I finally entered Alibaba. Two months later, I chose to resign naked
MySQL驱动中关于时间的坑
Work queue_ queue
Altium designer outputs Gerber and other production documents
Explanation of engineering economics terms
Thermistor temperature calculation formula program
如何快速设计一套支持渲染富文本内容的跨端组件
NPM install reports an error: eperm: operation not permitted, rename
Cuda-npp image and video processing
How does the Devops team defend against API attacks?
Rust list (VEC) replication