当前位置:网站首页>[CUDA study notes] First acquaintance with CUDA
[CUDA study notes] First acquaintance with CUDA
2022-07-31 14:56:00 【Pastry chef learns AI】
Article table of contents
Foreword
This article is my study notes for the CUDA course, mainly for myself to review the past and learn new things in the future.It's an honor to be of any help to you.If you are a beginner in CUDA, please correct me if I am wrong.If there is any infringement, please contact the author to delete it.
I. What is GPU?
Graphics Processing Unit (English: Graphics Processing Unit, abbreviation: GPU; also known as display core, graphics card, visual processor, display chip or graphics chip) is a specialized computer, workstation, game console and some mobile devices.Microprocessors (such as tablets, smartphones, etc.) that perform drawing operations.
Second, what is CUDA?
CUDA (Compute Unified Device Architecture, unified computing architecture) is an integrated technology launched by NVIDIA, which is the company's solution to GPGPU (English: General-purpose computing on graphics processing units, referred to as GPGPU or GP²U))official name.Through this technology, users can use NVIDIA's GPU to perform operations other than image processing. It is also the first time that a GPU can be used as a development environment for a C-compiler.The CUDA development kit (CUDA Toolkit) can only compile its own CUDA C-language (only the function of linking to OpenCL), that is, the part executed on the GPU is compiled into the PTX intermediate language or the machine code of a specific NVIDIA GPU architecture (NVIDIA official nameIt is "device code"); and the C/C++ code (officially called "host code" by NVIDIA) that executes on the central processing unit still depends on an external compiler, such as Microsoft Visual Studio under Microsoft Windows; under Linux, it is mainlyDepends on GCC.
Three. Graphics card structure diagram
Below is a schematic of a newer graphics card (TESLA V1000).The green ones are all cores, with a total of more than 5,000 cores.
Fourth, CPU vs GPU
- GPU can be dozens of times faster than CPU: cpu: general computer has 4-8 cpu cores, desktop computers can reach 24; GPU: can reach 5, 6 thousand.runs very fast,
- GPUs have great potential for scientific computing.
V. Cooperation between CPU and GPU:
- The GPU is connected to the CPU through a high-speed interface: PCIe
- GPU as acceleration component.
Six, GPU application:
边栏推荐
- R语言ggplot2可视化:使用ggpubr包的ggboxplot函数可视化分组箱图、使用ggpar函数改变图形化参数(caption、添加、修改可视化图像的题注、脚注内容)
- Getting started with UnityShader (3) - Unity's Shader
- 微信聊天记录中搜索红包
- 看交互设计如何集成到Scrum敏捷流程中
- Asynchronous processing business using CompletableFuture
- 架构实战营模块8消息队列表结构设计
- LeetCode二叉树系列——222.完全二叉树的节点个数
- OpenShift 4 - Deploy Redis Cluster with Operator
- Why do we need to sub-library and sub-table?
- Jmeter常用的十大组件
猜你喜欢
随机推荐
Redis与分布式:集群搭建
OpenShift 4 - 用 Operator 部署 Redis 集群
力扣:738.单调递增的数字
Spark学习(3)-Spark环境搭建-Standalone
R语言ggplot2可视化:使用ggpubr包的ggboxplot函数可视化分组箱图、使用ggpar函数改变图形化参数(caption、添加、修改可视化图像的题注、脚注内容)
女性服务社群产品设计
763.划分字母区间——之打开新世界
Getting started with UnityShader (3) - Unity's Shader
abaqus find contact pairs报错:surface name is already in use
Redis与分布式:主从复制
TCP详解
Sentinel流量控制
element-plus虚拟表格virtual-list组件中是怎么实现清理lodash.memoize缓存的?
R语言检验样本是否符合正态性(检验样本是否来自一个正态分布总体):shapiro.test函数检验样本是否符合正态分布(normality test)
Web自动化实战——Selenium4(自动化测试环境的搭建)
sentinel与nacos持久化
分成两栏后文字顺序混乱的问题解决【写期刊论文时】
Synchronized和volatile 面试简单汇总
LeetCode二叉树系列——110.平衡二叉树
Small test knife: Go reflection helped me convert Excel to Struct