当前位置:网站首页>Network equipment hard core technology insider router Chapter 16 dpdk and its prequel (I)
Network equipment hard core technology insider router Chapter 16 dpdk and its prequel (I)
2022-07-27 15:28:00 【User 8289326】
In the last topic , We mentioned that ,NFV As SDN One of the genres of , With vBRAS And other forms are promoting the reconstruction of operators' Networks . in fact , Not only in the field of operators , meanwhile , stay IDC in , Based on network element virtualization vSwitch as well as LB/ Firewall and other value-added services , It's also popular . these NFV The platform on which the software runs , Is based on x86 Multi core processor platform .
because x86 The tradition of running on Linux The architecture limitations of ,x86 Multi core has been unable to achieve high-performance data forwarding , therefore , Led by Intel DPDK(Data Plane Development Kit) Has become NFV Standard configuration of , Major Internet enterprises recruit network business development 、 Architecture Engineer , Will also be familiar with DPDK Written into the recruitment requirements .DPDK With the container 、SD-WAN、 Concepts such as edge computing have become the hottest ICT technology .
but , What you don't know is , As early as 2006 year , Similar projects have been carried out on multi-core processors in China DPDK Research and development .
2006 year , Multi core processors are the hottest in the Jianghu Spicy chicken Fried chicken Yes MIPS Multicore processor .RMI( Has been Broadcom Merger and acquisition (m&a) ) and Cavium( Has been Marvell Merger and acquisition (m&a) ) A fierce multi-core processor arms race broke out . And China is at the forefront of science and Technology H3C The company has a keen insight into the trend that multi-core general-purpose and semi general-purpose processors will become the mainstream processors of network devices in the future , Using multi-core processors, we have developed a series of medium and high-end routers and security products .
SR6602 Namely H3C The representative work of router based on multi-core technology . It is based on the most advanced 8 nucleus 32 Hardware thread processor , The internal block diagram of the processor is shown in the figure below :
It can be seen that , The processor has the highest 24Gbps Network of IO processing capacity , It can be expanded TCAM Interface , There is a fast message ring network inside (Fast Messaging Network) From the network interface 、 The data packet of the security encryption algorithm engine quickly notifies the corresponding CPU/ Threads , And each CPU/ Threads can also pass FMN Communicate with each other .
H3C During the development process, the engineers found , Simply put the soft forwarding module in the network operating system in multiple CPU Up operation , There is no linear improvement in performance . As before 1 Get on threads 200kbps Forwarding capability of ,28 The forwarding performance of threads is only 1500kbps, It is much lower than the expectation of product planning . Why is that ?
original , For Parallel Computing , There is an important law ——Amdahl Law of acceleration ratio .
Amdahl The content of the law of acceleration ratio is , The speedup ratio available to complete complex work is limited , Limited by the part of this work that must be executed serially . in other words , Even if there is a small amount of computing work that requires multiple processors to execute in series , The overall performance will also be greatly reduced .
actually , When multi-core processors process packets in parallel , In most cases , Each packet can be based on hash Algorithm , The load is shared to different processors for processing .
however , Notice the part framed by the red line :
All processor cores share L2 cache , such , Once two or more processor cores share a cache area , This will cause multiple cores to queue up to access this cache area .
The default cache organization of the processor , So cache lines (cache line) Organized in a way . Such as MIPS The caching behavior of 32 byte , And packet caching (MBuf) The size is 4KB, yes 32 Bytes of 128 times . such , It is very easy to cause continuous MBuf Use conflict , When accepting the next packet , Just kick out the previous packet cacheline.
The solution is simple . Distribute MBuf when , With 4KB+32 Bytes are units . such , Can guarantee every two MBuf Between , The header will not map to the same one cacheline.
in addition , When implementing a router with multiple cores , There is another place that is prone to cache conflicts , Do you know where it is ?
边栏推荐
- Four kinds of relay schemes driven by single chip microcomputer
- LeetCode 456. 132模式 单调栈/medium
- How to edit a framework resource file separately
- The mobile terminal uses the list component of vantui. When multiple tab items are switched back and forth, the list is loaded many times, resulting in the failure of normal display of data
- Leetcode 191. number of 1 bits bit operation /easy
- Principle of MOS tube to prevent reverse connection of power supply
- The design method of integral operation circuit is introduced in detail
- 资本频频加码,急于上市的和府捞面有多“疯狂”?
- 网络设备硬核技术内幕 路由器篇 3 贾宝玉梦游太虚幻境 (中)
- ad7606与stm32连接电路介绍
猜你喜欢

Do you really understand CMS garbage collector?

光电隔离电路设计方案(六款基于光耦、AD210AN的光电隔离电路图)

generic paradigm

Unity性能优化------渲染优化(GPU)之Occlusion culling(遮挡剔除)

What is the breakthrough point of digital transformation in the electronic manufacturing industry? Lean manufacturing is the key

The mobile terminal uses the list component of vantui. When multiple tab items are switched back and forth, the list is loaded many times, resulting in the failure of normal display of data

Adaptation verification new occupation is coming! Huayun data participated in the preparation of the national vocational skill standard for information system adaptation verifiers

华为鸿蒙模拟器去除顶部导航栏方法

The design method of integral operation circuit is introduced in detail

Tools - common methods of markdown editor
随机推荐
Introduction to STM32 learning can controller
Code coverage statistical artifact -jacobo tool practice
网络设备硬核技术内幕 路由器篇 3 贾宝玉梦游太虚幻境 (中)
基于stm32的数字示波器设计方案
Leetcode-1737-满足三条件之一需改变的最少字符数
网络设备硬核技术内幕 路由器篇 21 可重构的路由器
《剑指Offer》两个链表的第一个公共结点
Problem solving in magic tower project
Unity performance optimization ----- LOD (level of detail) of rendering optimization (GPU)
Sword finger offer cut rope
DIY制作示波器的超详细教程:(一)我不是为了做一个示波器
3.3-5v转换
事务_基本演示和事务_默认自动提交&手动提交
LeetCode 191. Number of 1 Bits(位1的个数) 位运算/easy
LeetCode 1143. 最长公共子序列 动态规划/medium
The reverse order pairs in the "sword finger offer" array
反射
网络设备硬核技术内幕 路由器篇 (10) CISCO ASR9900拆解 (四)
仅做两项修改,苹果就让StyleGANv2获得了3D生成能力
Network equipment hard core technology insider router Chapter 11 Cisco asr9900 disassembly (V)