当前位置:网站首页>Analysis of usage scenarios of mutex, read-write lock, spin lock, and atomic operation instructions xaddl and cmpxchg
Analysis of usage scenarios of mutex, read-write lock, spin lock, and atomic operation instructions xaddl and cmpxchg
2022-08-04 00:47:00 【cheers~】
互斥锁、读写锁、自旋锁,and atomic operation instructionsxaddl、cmpxchgAnalysis of usage scenarios
- 前言
- 临界资源
- 什么是临界资源
- Multithreaded operation of critical resources without locking
- i++不是原子操作,i++Corresponding to the three assembly instruction
- Multi-threaded operation of critical resources and mutual exclusion lock
- Multithreaded operations critical resources and add read-write lock
- Multi-threaded operation of critical resources and spin locks
- 原子操作
- Three kinds of locksapi介绍
- Three lock usage scenarios
- Interface for atomic operations
前言
This article introduces the usage scenarios of locks, critical resources and atomic operations.
本专栏知识点是通过零声教育的线上课学习,进行梳理总结写下文章,对c/c++linux课程感兴趣的读者,可以点击链接 C/C++后台高级服务器课程介绍 详细查看课程的服务.
临界资源
什么是临界资源
Critical resources are used by multiple threads/进程共享,But only one thread can be used at a time/进程所使用的资源.
The following is a classic case(多线程同时进行i++)Introduce three kinds of locks,以及cpuSupport atomic operation and instruction setCAS.
Create ten threads after the main thread starts,And from the main threadcountVariables as parameters within the child thread,That is to say, ten threads operate on a shared resource at the same time.count,子线程执行10w次count++
操作,The main thread prints every two secondscount的值.Let's take a look at the difference between locking and not locking.
Multithreaded operation of critical resources without locking
//
// Created by 68725 on 2022/8/3.
//
#include <stdio.h>
#include <pthread.h>
#include <unistd.h>
#include <sys/mman.h>
#define THREAD_SIZE 10
//callback
void *func(void *arg) {
int *pcount = (int *) arg;
int i = 0;
while (i++ < 100000) {
(*pcount)++;
usleep(1);
}
}
int main() {
pthread_t th_id[THREAD_SIZE] = {
0};
int i = 0;
int count = 0;
for (i = 0; i < THREAD_SIZE; i++) {
int ret = pthread_create(&th_id[i], NULL, func, &count);
if (ret) {
break;
}
}
for (i = 0; i < 100; i++) {
printf("count --> %d\n", count);
sleep(2);
}
}
我们预期countfinally reach100w,Why did not achieve the expected effect when unlocking?很明显,count++不是原子操作
i++不是原子操作,i++Corresponding to the three assembly instruction
如果i++是原子操作,Then it will definitely add up to100w,那么i++What are the steps corresponding to??
下面以idx++举例,idxThe value is stored in memory,First of all from memoryMOV到eax寄存器里面,Then auto-increment through the register,最后再从eax写回到内存中.without the compiler doing any optimizations,idx++corresponds to these three steps.
在大多数情况下,The program is executed like this
But there are also the following two situations.线程1首先将idxThe value is read into the register,然后cpu执行线程2,线程2After performing the three steps,又回到线程1,线程1Then perform the remaining two steps.有人可能会想,Don't both threads finish executing??有什么不同?
首先,在线程1让出后,线程1的上下文(比如这里的eax),is stored in the thread1里面的,线程1恢复后,And the contextload回去.这里就涉及到yield和resume了,see in detail纯c协程框架NtyCo实现与原理In the second quarter and the third quarter. After understanding context switching,就容易理解了,有没有发现,两次++操作,Eventually leak.
所以在多线程中,When working with critical resources,Then the critical resources is the atom,So don't lock,Or must be locked,If the above problem occurs!
So what does locking mean??is to turn these three assembly instructions into an atomic operation,只要有一个线程lock加锁了,Other threads cannot execute,until the locked thread is unlocked,Other threads can lock.Then these three assembly instructions are atomic.下面将介绍3中锁以及2个原子操作.
Multi-threaded operation of critical resources and mutual exclusion lock
Here are two ways to lock,Both can run100w,But the granularity of these two locking is different,在这个程序中,谁是临界资源?是pcount,而不是while,So the second kind of lock can run through,But its locking granularity is too large,就本程序而言,The second mode of locking it and what is the difference between single thread to run?所以We need to lock critical resources,Not lock-free for critical resources,Control the granularity of locks
//正确加锁
while (i++ < 100000) {
pthread_mutex_lock(&mutex);
(*pcount)++;
pthread_mutex_unlock(&mutex);
}
//错误加锁
pthread_mutex_lock(&mutex);
while (i++ < 100000) {
(*pcount)++;
}
pthread_mutex_unlock(&mutex);
//
// Created by 68725 on 2022/8/3.
//
#include <stdio.h>
#include <pthread.h>
#include <unistd.h>
#include <sys/mman.h>
#define THREAD_SIZE 10
pthread_mutex_t mutex;
//callback
void *func(void *arg) {
int *pcount = (int *) arg;
int i = 0;
while (i++ < 100000) {
pthread_mutex_lock(&mutex);
(*pcount)++;
pthread_mutex_unlock(&mutex);
usleep(1);
}
}
int main() {
pthread_t th_id[THREAD_SIZE] = {
0};
pthread_mutex_init(&mutex, NULL);
int i = 0;
int count = 0;
for (i = 0; i < THREAD_SIZE; i++) {
int ret = pthread_create(&th_id[i], NULL, func, &count);
if (ret) {
break;
}
}
for (i = 0; i < 100; i++) {
printf("count --> %d\n", count);
sleep(2);
}
}
After the lock can be seen,Successfully meet our expectations
Multithreaded operations critical resources and add read-write lock
读写锁,顾名思义,Add read locks when reading critical resources,Writing the critical resources and write locks.适用于读多写少的场景.
- AThread added read lock,BThreads can continue to add read locks,但是不能加写锁.
- AThread added write locks,BThread cannot add read lock,Can't add write lock.
//
// Created by 68725 on 2022/8/3.
//
#include <stdio.h>
#include <pthread.h>
#include <unistd.h>
#include <sys/mman.h>
#define THREAD_SIZE 10
pthread_rwlock_t rwlock;
//callback
void *func(void *arg) {
int *pcount = (int *) arg;
int i = 0;
while (i++ < 100000) {
pthread_rwlock_wrlock(&rwlock);
(*pcount)++;
pthread_rwlock_unlock(&rwlock);
usleep(1);
}
}
int main() {
pthread_t th_id[THREAD_SIZE] = {
0};
pthread_rwlock_init(&rwlock, NULL);
int i = 0;
int count = 0;
for (i = 0; i < THREAD_SIZE; i++) {
int ret = pthread_create(&th_id[i], NULL, func, &count);
if (ret) {
break;
}
}
for (i = 0; i < 100; i++) {
pthread_rwlock_rdlock(&rwlock);
printf("count --> %d\n", count);
pthread_rwlock_unlock(&rwlock);
sleep(2);
}
}
Multi-threaded operation of critical resources and spin locks
spinlock与mutex一样,mutex在哪里加锁,spinlocklock right there,使用方法是一样的,But its internal behavior is different.那么mutex和spinlock的区别在哪呢?
- When the mutex cannot acquire the lock,会进入休眠,Wake up while waiting for release.会让出CPU.
- Spinlocks when does not lock,一直等待,There will be no process in the waiting process,线程切换.just keep waiting,死等.
The usage scenarios of mutex and spin lock are described below.
//
// Created by 68725 on 2022/8/3.
//
#include <stdio.h>
#include <pthread.h>
#include <unistd.h>
#include <sys/mman.h>
#define THREAD_SIZE 10
pthread_spinlock_t spinlock;
//callback
void *func(void *arg) {
int *pcount = (int *) arg;
int i = 0;
while (i++ < 100000) {
pthread_spin_lock(&spinlock);
(*pcount)++;
pthread_spin_unlock(&spinlock);
usleep(1);
}
}
int main() {
pthread_t th_id[THREAD_SIZE] = {
0};
pthread_spin_init(&spinlock, PTHREAD_PROCESS_SHARED);
int i = 0;
int count = 0;
for (i = 0; i < THREAD_SIZE; i++) {
int ret = pthread_create(&th_id[i], NULL, func, &count);
if (ret) {
break;
}
}
for (i = 0; i < 100; i++) {
printf("count --> %d\n", count);
sleep(2);
}
}
原子操作
we found the lock,都是将i++Corresponding to the three steps of assembly,become atomic.So is there a way we can directlyi++对应的汇编指令,变成一条指令?可以,我们使用xaddl
这条指令.
Intel X86指令集提供了指令前缀lock⽤to lock the front string⾏总线FSB,Guarantee the execution of the command⾏Will not receive other processors⼲扰.
所谓原子操作,it is not a specific instruction,它是CPU支持的指令集,都是原子操作.比如说CAS,CAS是原子操作的一种,It cannot be said that atomic operations areCAS.
xaddl -----> Inc
- xaddl:The second parameter plus the first parameter,and store the value in the first parameter
//
// Created by 68725 on 2022/8/3.
//
#include <stdio.h>
#include <pthread.h>
#include <unistd.h>
#include <sys/mman.h>
#define THREAD_SIZE 10
int inc(int *value, int add) {
int old;
__asm__ volatile (
"lock; xaddl %2, %1;"
: "=a" (old)
: "m" (*value), "a" (add)
: "cc", "memory"
);
return old;
}
//callback
void *func(void *arg) {
int *pcount = (int *) arg;
int i = 0;
while (i++ < 100000) {
inc(pcount, 1);
usleep(1);
}
}
int main() {
pthread_t th_id[THREAD_SIZE] = {
0};
int i = 0;
int count = 0;
for (i = 0; i < THREAD_SIZE; i++) {
int ret = pthread_create(&th_id[i], NULL, func, &count);
if (ret) {
break;
}
}
for (i = 0; i < 100; i++) {
printf("count --> %d\n", count);
sleep(2);
}
}
cmpxchg-----> CAS
- CAS:Compare And Swap,先比较,再赋值,Translated into code is the following
if(a==b){
//Compare
a=c;//Swap
}
CPUThe instruction set supports compare-before-assign instructions,叫cmpxchg.正因为CPUexecuted this command,It's an atomic operation.
// Perform atomic 'compare and swap' operation on the pointer.
// The pointer is compared to 'cmp' argument and if they are
// equal, its value is set to 'val'. Old value of the pointer is returned.
inline T *cas (T *cmp_, T *val_)
{
T *old;
__asm__ volatile (
"lock; cmpxchg %2, %3"
: "=a" (old), "=m" (ptr)
: "r" (val_), "m" (ptr), "0" (cmp_)
: "cc");
return old;
}
Three kinds of locksapi介绍
互斥锁 mutex
有两个特殊的api,pthread_mutex_trylock
尝试加锁,如果没有获取到锁则返回,而不是休眠.pthread_mutex_timedlock
等待一段时间,Timeout also did not get lock is returned.
/* Mutex handling. */
/* Initialize a mutex. */
extern int pthread_mutex_init (pthread_mutex_t *__mutex,
const pthread_mutexattr_t *__mutexattr)
__THROW __nonnull ((1));
/* Destroy a mutex. */
extern int pthread_mutex_destroy (pthread_mutex_t *__mutex)
__THROW __nonnull ((1));
/* Try locking a mutex. */
extern int pthread_mutex_trylock (pthread_mutex_t *__mutex)
__THROWNL __nonnull ((1));
/* Lock a mutex. */
extern int pthread_mutex_lock (pthread_mutex_t *__mutex)
__THROWNL __nonnull ((1));
#ifdef __USE_XOPEN2K
/* Wait until lock becomes available, or specified time passes. */
extern int pthread_mutex_timedlock (pthread_mutex_t *__restrict __mutex,
const struct timespec *__restrict
__abstime) __THROWNL __nonnull ((1, 2));
#endif
/* Unlock a mutex. */
extern int pthread_mutex_unlock (pthread_mutex_t *__mutex)
__THROWNL __nonnull ((1));
读写锁 rdlock
Read-write lock is suitable for more reading and less writing,Otherwise, use a mutex.
- AThread added read lock,BThreads can continue to add read locks,但是不能加写锁.
- AThread added write locks,BThread cannot add read lock,Can't add write lock.
/* Functions for handling read-write locks. */
/* Initialize read-write lock RWLOCK using attributes ATTR, or use the default values if later is NULL. */
extern int pthread_rwlock_init (pthread_rwlock_t *__restrict __rwlock,
const pthread_rwlockattr_t *__restrict
__attr) __THROW __nonnull ((1));
/* Destroy read-write lock RWLOCK. */
extern int pthread_rwlock_destroy (pthread_rwlock_t *__rwlock)
__THROW __nonnull ((1));
/* Acquire read lock for RWLOCK. */
extern int pthread_rwlock_rdlock (pthread_rwlock_t *__rwlock)
__THROWNL __nonnull ((1));
/* Try to acquire read lock for RWLOCK. */
extern int pthread_rwlock_tryrdlock (pthread_rwlock_t *__rwlock)
__THROWNL __nonnull ((1));
# ifdef __USE_XOPEN2K
/* Try to acquire read lock for RWLOCK or return after specfied time. */
extern int pthread_rwlock_timedrdlock (pthread_rwlock_t *__restrict __rwlock,
const struct timespec *__restrict
__abstime) __THROWNL __nonnull ((1, 2));
# endif
/* Acquire write lock for RWLOCK. */
extern int pthread_rwlock_wrlock (pthread_rwlock_t *__rwlock)
__THROWNL __nonnull ((1));
/* Try to acquire write lock for RWLOCK. */
extern int pthread_rwlock_trywrlock (pthread_rwlock_t *__rwlock)
__THROWNL __nonnull ((1));
# ifdef __USE_XOPEN2K
/* Try to acquire write lock for RWLOCK or return after specfied time. */
extern int pthread_rwlock_timedwrlock (pthread_rwlock_t *__restrict __rwlock,
const struct timespec *__restrict
__abstime) __THROWNL __nonnull ((1, 2));
# endif
/* Unlock RWLOCK. */
extern int pthread_rwlock_unlock (pthread_rwlock_t *__rwlock)
__THROWNL __nonnull ((1));
自旋锁 spinlock
The biggest feature of spin locks is that,Can't get the lock and keep waiting,即使CPUThe switch will not happen when the time slice is used up,死等.The above two locks are different,If you can't get it, it will sleep,让出CPU时间片,Switch to another thread or process execution.
/* Functions to handle spinlocks. */
/* Initialize the spinlock LOCK. If PSHARED is nonzero the spinlock can be shared between different processes. */
extern int pthread_spin_init (pthread_spinlock_t *__lock, int __pshared)
__THROW __nonnull ((1));
/* Destroy the spinlock LOCK. */
extern int pthread_spin_destroy (pthread_spinlock_t *__lock)
__THROW __nonnull ((1));
/* Wait until spinlock LOCK is retrieved. */
extern int pthread_spin_lock (pthread_spinlock_t *__lock)
__THROWNL __nonnull ((1));
/* Try to lock spinlock LOCK. */
extern int pthread_spin_trylock (pthread_spinlock_t *__lock)
__THROWNL __nonnull ((1));
/* Release spinlock LOCK. */
extern int pthread_spin_unlock (pthread_spinlock_t *__lock)
__THROWNL __nonnull ((1));
Three lock usage scenarios
For example read a file,就使用mutex.And if it is a simple addition, subtraction and subtraction operations,就是用spinlock.If the system provides atomic operation interface,对于i++这种操作来说,With atomic operation is more appropriate.
spinlock:临界资源操作简单/没有发生系统调用/持续时间较短(自旋锁就主要用在临界区持锁时间非常短且CPU资源不紧张的情况下,consume while waitingcpu资源较多,自旋锁一般用于多核的服务器.)
mutex:Critical resource operations are complex/发生系统调用/持续时间比较长
- 临界区有IO操作
- 临界区代码复杂或者循环量大
- 临界区竞争非常激烈
- 单核处理器
- 原子操作:The usage scene is small,必须需要CPUThe instruction set of support.(Atomic operations are suitable for simple mathematical operations such as addition and subtraction,Belong to the grain size minimum operation.For example, adding a node to a linked list,Atomic operations can make?不行,因为CPUInstruction set no multiple assignment instructions at the same time.cas Competition when multiple threads are not particularly high efficiency,If mutex and spin lock can meet the requirements, try not to use itcas)
Interface for atomic operations
c
对于gcc、g++编译器来讲,它们提供了⼀组APIto do the original⼦操作:
type __sync_fetch_and_add (type *ptr, type value, ...)
type __sync_fetch_and_sub (type *ptr, type value, ...)
type __sync_fetch_and_or (type *ptr, type value, ...)
type __sync_fetch_and_and (type *ptr, type value, ...)
type __sync_fetch_and_xor (type *ptr, type value, ...)
type __sync_fetch_and_nand (type *ptr, type value, ...)
bool __sync_bool_compare_and_swap (type *ptr, type oldval type newval, ...)
type __sync_val_compare_and_swap (type *ptr, type oldval type newval, ...)
type __sync_lock_test_and_set (type *ptr, type value, ...)
void __sync_lock_release (type *ptr, ...)
详细⽂档⻅:https://gcc.gnu.org/onlinedocs/gcc-4.1.1/gcc/Atomic-Builtins.html#AtomicBuiltins
c++
对于c++11来讲,也有⼀组atomic的接⼝:https://en.cppreference.com/w/cpp/atomic
边栏推荐
猜你喜欢
【详细教程】一文参透MongoDB聚合查询
一文参透分布式存储系统Ceph的架构设计、集群搭建(手把手)
The problem of disorganized data output by mnn model
Read FastDFS in one article
c语言分层理解(c语言指针(上))
LeetCode 19:删除链表的倒数第 N 个结点
Analysis: What makes the Nomad Bridge hack unique
Salesforce's China business may see new changes, rumors may be closing
2022-08-03:以下go语言代码输出什么?A:2;B:3;C:1;D:0。 package main import “fmt“ func main() { slice := []i
typescript57 - Array generic interface
随机推荐
七夕佳节即将来到,VR全景云游为你神助攻
Analysis: What makes the Nomad Bridge hack unique
咱们500万条数据测试一下,如何合理使用索引加速?
typescript58 - generic classes
Salesforce的中国区业务可能出现新变化,传言可能正在关闭
C# WPF设备监控软件(经典)-下篇
typescript53-泛型约束
如何通过单步调试的方式找到引起 Fiori Launchpad 路由错误的原因试读版
typescript53 - generic constraints
js中常用的几种遍历处理数据的方法梳理
现货白银需要注意八大事项
boot issue
一文参透分布式存储系统Ceph的架构设计、集群搭建(手把手)
XSS - Bypass for loop filtering
BGP实验(含MPLS)
Vant3—— 点击对应的name名称跳转到下一页对应的tab栏的name的位置
七夕活动浪漫上线,别让网络拖慢和小姐姐的开黑时间
XSS-绕过for循环过滤
Node.js的基本使用(三)数据库与身份认证
高斯推断推导