当前位置:网站首页>Is AI more fair than people in the distribution of wealth? Research on multiplayer game from deepmind
Is AI more fair than people in the distribution of wealth? Research on multiplayer game from deepmind
2022-07-07 19:07:00 【qubit 】
Yi Pavilion From the Aofei temple qubits | official account QbitAI
DeepMind No chess this time , And don't play video games , Instead, I studied a multiplayer game .
The latest development of “Democratic AI”—— Learn human values through training , Then we can allocate resources fairly according to everyone's contribution .
To demonstrate this concept ,DeepMind Designed a simple investment game , from AI And human beings serve as referees respectively , Let players choose the preferred allocation rules ,Democratic AI Even got a higher support rate than human referees .
AI Referees are more popular than humans
When a group of people decide to concentrate their money on investment , How to distribute the income is a big problem that we must face .
A simple strategy is to distribute returns equally among investors , But this is likely to be unfair , Because some people contribute more than others .
The second option is , We can allocate according to the initial investment of each person , That sounds fair , But what if people start with different asset levels ?
If two people contribute the same amount , But one is a small part of their available funds , The other contributed all his assets , Should they get the same share of income ?
To meet this challenge ,DeepMind Created a simple multiplayer investment game .
Game involves 4 Players , Share in 10 round .
Each player will be allocated initial funds , In each round , Players can make choices according to their own wishes : keep , Or invest it in a common pool .
The investment will definitely pay off , But there is a risk —— Players don't know how the final revenue will be distributed .
besides , They were told , front 10 There is a referee in the round (A) Make allocation decisions , Then 10 round , By different judges (B) To take over .
At the end of the game , They will vote for A or B, To decide which referee you want to play with again .
And the revenue of this last game can be retained by the players themselves , This will enable players to choose the most impartial referee in their hearts more actively .
in fact , One of the judges performs according to the preset allocation rules , On the other side is by Democratic AI make designs of one's own .
When we study the voting of these players , We found that AI Designed rules are more popular than standard allocation rules .
meanwhile ,DeepMind Also invited a human referee , And introduce him to the rules 、 Let him try to achieve fair distribution to win votes , But the final vote showed , He still lost to Democratic AI.
Democratic AI Why can we win ?
stay DeepMind The latest is published in Nature Sub issue Nature Human Behaviour Papers , It records the researchers' understanding of Democratic AI Training process of .
First , They let 4000 Many human players participate in the game many times under different allocation rules , And vote to choose which distribution method you prefer .
These data are used for training AI To imitate human behavior in the game , Including the way players vote .
secondly , The researchers made these AI Players compete with each other in thousands of games , And another one. AI System basis AI Players' voting methods continue to adjust the redistribution rules .
therefore , At the end of the process ,AI Redistribution rules that are very close to fairness have been established :
First ,AI Choose to allocate according to the proportion of relative contribution rather than absolute contribution . It means , When reallocating funds ,AI We will consider the initial amount of each player and their willingness to invest .
secondly ,AI The system specially rewards players who contribute more generously , To encourage others to do the same . It is important to , AI can only discover these rules by maximizing human voting rate .
Can this method be extended to reality ?
although DeepMind The game test of has achieved brilliant results , But to transform this approach from a simple four person game to a large-scale economy , It is still a huge challenge , At present, it is uncertain how it will develop in the real world .
secondly , The researchers themselves found several potential problems .
Democratic One problem is that it may develop into “ Tyranny of the majority ”, This will lead to the persistence of existing discrimination or unfair patterns against minorities .
AI More work needs to be done to understand how design allows everyone's voice to be heard .
in addition , The researchers also raised people's concerns about AI The question of trust :
Whether people will trust by AI Designed mechanisms to replace humans ? If people know the identity of the referee , Will it affect the final voting result ?
If you want to Democratic AI Designed solutions are used to solve real-world dilemmas , This is crucial .
Reference link : [1]https://www.deepmind.com/publications/human-centred-mechanism-design-with-democratic-ai [2]https://www.nature.com/articles/s41562-022-01383-x [3]https://singularityhub.com/2022/07/04/deepminds-new-ai-may-be-better-at-distributing-societys-resources-than-humans-are/
— End —
「 Artificial intelligence 」、「 Smart car 」 Wechat community invites you to join !
Welcome to AI 、 Smart car partners join us , And AI Practitioners exchange 、 Compare notes , Don't miss the latest industry development & Technological progress .
ps. Please note your name when adding friends - company - Position oh ~
Focus on me here , Remember to mark the star ~
One key, three links 「 Share 」、「 give the thumbs-up 」 and 「 Looking at 」
The frontier of science and technology meets day by day ~
边栏推荐
- 高温火烧浑不怕,钟薛高想留清白在人间
- 虚拟数字人里的生意经
- [tpm2.0 principle and Application guide] Chapter 16, 17 and 18
- 脑洞从何而来?加州大学最新研究:有创造力的人神经连接会「抄近道」
- SD_ DATA_ SEND_ SHIFT_ REGISTER
- POJ 2392 Space Elevator
- Redis
- 线程池和单例模式以及文件操作
- Usage of PHP interview questions foreach ($arr as $value) and foreach ($arr as $value)
- Basic operation of chain binary tree (implemented in C language)
猜你喜欢
App capture of charles+postern
Comparison and selection of kubernetes Devops CD Tools
二叉树的基本概念和性质
Short selling, overprinting and stock keeping, Oriental selection actually sold 2.66 million books in Tiktok in one month
Nunjuks template engine
Skills of embedded C language program debugging and macro use
基于图像和激光的多模态点云融合与视觉定位
直播预约通道开启!解锁音视频应用快速上线的秘诀
數據驗證框架 Apache BVal 再使用
企业展厅设计中常用的三种多媒体技术形式
随机推荐
How to implement safety practice in software development stage
云安全日报220707:思科Expressway系列和网真视频通信服务器发现远程攻击漏洞,需要尽快升级
PTA 1101 B是A的多少倍
Five network IO models
嵌入式面试题(算法部分)
【塔望方法论】塔望3W消费战略 - U&A研究法
Yunjing network technology interview question [Hangzhou multi tester] [Hangzhou multi tester _ Wang Sir]
PTA 1102 teaching Super Champion volume
低代码助力企业数字化转型会让程序员失业?
Nunjuks template engine
2022上半年朋友圈都在传的10本书,找到了
POJ 2392 Space Elevator
[software test] from the direct employment of the boss of the enterprise version, looking at the resume, there is a reason why you are not covered
咋吃都不胖的朋友,Nature告诉你原因:是基因突变了
"Decryption" Huawei machine vision Corps: Huawei is moving up and the industry is moving forward
基于图像和激光的多模态点云融合与视觉定位
How much does it cost to develop a small program mall?
强化学习-学习笔记8 | Q-learning
Draw squares with Obama (Lua)
How many times is PTA 1101 B than a