当前位置:网站首页>[CUDA study notes] First acquaintance with CUDA
[CUDA study notes] First acquaintance with CUDA
2022-07-31 14:56:00 【Pastry chef learns AI】
Article table of contents
Foreword
This article is my study notes for the CUDA course, mainly for myself to review the past and learn new things in the future.It's an honor to be of any help to you.If you are a beginner in CUDA, please correct me if I am wrong.If there is any infringement, please contact the author to delete it.
I. What is GPU?
Graphics Processing Unit (English: Graphics Processing Unit, abbreviation: GPU; also known as display core, graphics card, visual processor, display chip or graphics chip) is a specialized computer, workstation, game console and some mobile devices.Microprocessors (such as tablets, smartphones, etc.) that perform drawing operations.
Second, what is CUDA?
CUDA (Compute Unified Device Architecture, unified computing architecture) is an integrated technology launched by NVIDIA, which is the company's solution to GPGPU (English: General-purpose computing on graphics processing units, referred to as GPGPU or GP²U))official name.Through this technology, users can use NVIDIA's GPU to perform operations other than image processing. It is also the first time that a GPU can be used as a development environment for a C-compiler.The CUDA development kit (CUDA Toolkit) can only compile its own CUDA C-language (only the function of linking to OpenCL), that is, the part executed on the GPU is compiled into the PTX intermediate language or the machine code of a specific NVIDIA GPU architecture (NVIDIA official nameIt is "device code"); and the C/C++ code (officially called "host code" by NVIDIA) that executes on the central processing unit still depends on an external compiler, such as Microsoft Visual Studio under Microsoft Windows; under Linux, it is mainlyDepends on GCC.
Three. Graphics card structure diagram
Below is a schematic of a newer graphics card (TESLA V1000).The green ones are all cores, with a total of more than 5,000 cores.
Fourth, CPU vs GPU
- GPU can be dozens of times faster than CPU: cpu: general computer has 4-8 cpu cores, desktop computers can reach 24; GPU: can reach 5, 6 thousand.runs very fast,
- GPUs have great potential for scientific computing.

V. Cooperation between CPU and GPU:
- The GPU is connected to the CPU through a high-speed interface: PCIe
- GPU as acceleration component.

Six, GPU application:

边栏推荐
猜你喜欢

Analysis of the startup source code of hyperf (2) - how the request reaches the controller

I summed up the bad MySQL interview questions

OAuth2:单点登陆客户端

自适应控制——仿真实验三 用超稳定性理论设计模型参考自适应系统
![Recommendation System - Recall Phase - 2013: DSSM (Twin Towers Model) [Embedding (Semantic Vector) Recall] [Microsoft]](/img/40/b567780ed2cf04f1f1336922816f86.png)
Recommendation System - Recall Phase - 2013: DSSM (Twin Towers Model) [Embedding (Semantic Vector) Recall] [Microsoft]

Sentinel安装与部署

leetcode303场周赛复盘

乡村基冲刺港交所:5个月期内亏2224万 SIG与红杉中国是股东

2021 OWASP TOP 10 漏洞指南

消息队列消息数据存储MySQL表设计
随机推荐
The use of thread pool two
DeepLab Series Learning
redhat/openssl generates a self-signed ca certificate and uses it
Motion capture system for end-positioning control of flexible manipulators
大健云仓冲刺美股:增营收反减利润 京东与DCM是股东
Redis与分布式:主从复制
In the future, the interviewer asks you why it is not recommended to use Select *, please answer him out loud!
MANIFEST.MF文件(PDB文件)
49. The copy constructor and overloaded 】
Ubantu专题4:xshell、xftp连接接虚拟机以及设置xshell复制粘贴快捷键
NC | 斯坦福申小涛等开发数据可重复分析计算框架TidyMass
Description of Hikvision camera streaming RTSP address rules
LeetCode二叉树系列——222.完全二叉树的节点个数
[Pytorch] F.softmax() method description
如何进行需求分析评审
svn安装及使用(身体功能手册)
QGIS 加载WMS数据,重新投影
安装Xshell并使用其进行Ymodem协议的串口传输
AVH部署实践 (一) | 在Arm虚拟硬件上部署飞桨模型
Sentinel安装与部署