当前位置:网站首页>[CUDA study notes] First acquaintance with CUDA
[CUDA study notes] First acquaintance with CUDA
2022-07-31 14:56:00 【Pastry chef learns AI】
Article table of contents
Foreword
This article is my study notes for the CUDA course, mainly for myself to review the past and learn new things in the future.It's an honor to be of any help to you.If you are a beginner in CUDA, please correct me if I am wrong.If there is any infringement, please contact the author to delete it.
I. What is GPU?
Graphics Processing Unit (English: Graphics Processing Unit, abbreviation: GPU; also known as display core, graphics card, visual processor, display chip or graphics chip) is a specialized computer, workstation, game console and some mobile devices.Microprocessors (such as tablets, smartphones, etc.) that perform drawing operations.
Second, what is CUDA?
CUDA (Compute Unified Device Architecture, unified computing architecture) is an integrated technology launched by NVIDIA, which is the company's solution to GPGPU (English: General-purpose computing on graphics processing units, referred to as GPGPU or GP²U))official name.Through this technology, users can use NVIDIA's GPU to perform operations other than image processing. It is also the first time that a GPU can be used as a development environment for a C-compiler.The CUDA development kit (CUDA Toolkit) can only compile its own CUDA C-language (only the function of linking to OpenCL), that is, the part executed on the GPU is compiled into the PTX intermediate language or the machine code of a specific NVIDIA GPU architecture (NVIDIA official nameIt is "device code"); and the C/C++ code (officially called "host code" by NVIDIA) that executes on the central processing unit still depends on an external compiler, such as Microsoft Visual Studio under Microsoft Windows; under Linux, it is mainlyDepends on GCC.
Three. Graphics card structure diagram
Below is a schematic of a newer graphics card (TESLA V1000).The green ones are all cores, with a total of more than 5,000 cores.
Fourth, CPU vs GPU
- GPU can be dozens of times faster than CPU: cpu: general computer has 4-8 cpu cores, desktop computers can reach 24; GPU: can reach 5, 6 thousand.runs very fast,
- GPUs have great potential for scientific computing.

V. Cooperation between CPU and GPU:
- The GPU is connected to the CPU through a high-speed interface: PCIe
- GPU as acceleration component.

Six, GPU application:

边栏推荐
- sentinel与nacos持久化
- ML, DL, CV common problems sorting
- Architecture actual combat battalion module 8 message queue table structure design
- c语言hello world代码(代码编程入门)
- R语言ggstatsplot包ggbarstats函数可视化条形图、并添加假设检验结果(包含样本数、统计量、效应大小及其置信区间、显著性、组间两两比较、贝叶斯假设)、检验结果报告符合APA标准
- 安装Xshell并使用其进行Ymodem协议的串口传输
- The 232-layer 3D flash memory chip is here: the single-chip capacity is 2TB, and the transmission speed is increased by 50%
- Nuget打包并上传教程
- thread_local 变量的析构顺序
- 高等数学——常用不定积分公式
猜你喜欢

深入浅出边缘云 | 4. 生命周期管理

Spark学习(2)-Spark环境搭建-Local

Synchronized和volatile 面试简单汇总

The paper manual becomes 3D animation in seconds, the latest research of Wu Jiajun of Stanford University, selected for ECCV 2022

2021 OWASP TOP 10 漏洞指南

Ubantu专题5:设置静态ip地址

OAuth2:四种授权方式

TCP详解

Spark学习(3)-Spark环境搭建-Standalone

消息队列消息数据存储MySQL表设计
随机推荐
安装Xshell并使用其进行Ymodem协议的串口传输
[Pytorch] F.softmax() method description
Node实现数据加密
LeetCode二叉树系列——226.翻转二叉树
Why do we need to sub-library and sub-table?
435. 无重叠区间
OpenShift 4 - 用 Operator 部署 Redis 集群
高等数学——常用不定积分公式
搭建私有的的Nuget包服务器教程
I summed up the bad MySQL interview questions
2021 OWASP TOP 10 Vulnerability Guide
abaqus find contact pairs报错:surface name is already in use
Motion capture system for end-positioning control of flexible manipulators
BigDecimal 简介,常用方法
TRACE32——常用操作
学习笔记12--路径-速度分解法之局部路径搜索
Spark学习(3)-Spark环境搭建-Standalone
C language basic practice (nine-nine multiplication table) and printing different asterisk patterns
ASP.NET Core 产生连续 Guid
常用工具命令速查表