Debiasing word embeddings | On word embeddings and removing bias
2022-07-01 17:42:00 【LolitaAnn】
This article shares my notes. The content mainly comes from Andrew Ng's deep learning course. [1]
The existence of stereotypes
Word embeddings have a very important influence on how well our models generalize, so we should also make sure they are not affected by unintended forms of bias, such as sexism, racial discrimination, religious discrimination, and so on.
Of course, I think the word "discrimination" is a little strong. Here we can understand it as stereotype.
For example:
My father is a doctor, my mother is _______ .
My father is a company employee, my mother is _______ .
Boys like _______ . Girls like _______ .
The first blank is likely to be filled with "nurse". The second is likely to be "housewife". The third is likely to be "Transformers". The fourth is likely to be "Barbie dolls".
What is this? This is the so-called gender stereotype. These stereotypes are related to socio-economic status.
Learning algorithms themselves hold no stereotypes, but the text written by human beings does, and word embeddings can learn these stereotypes "very well".
So we need to modify the learning algorithm to reduce, or ideally eliminate, these unintended types of bias.
Over many decades, over many centuries, I think humanity has made progress in reducing these types of bias. And I think maybe fortunately for AI, I think we actually have better ideas for quickly reducing the bias in AI than for quickly reducing the bias in the human race. Although I think we are by no means done for AI as well, and there is still a lot of research and hard work to be done to reduce these types of biases in our learning algorithms.
Removing stereotypes from word embeddings
We use the method from arXiv:1607.06520 [2].
It is mainly divided into the following three steps:
- Identify bias direction.
- Neutralize: For every word that is not definitional, project to get rid of bias.
- Equalize pairs.
Suppose we now have a word embedding that has already been learned.
As in our previous examples, it uses 300-dimensional features, which we then map to a two-dimensional plane. The distribution of the words on the plane is shown in the figure.
1. Identify the bias direction
To find the main direction of the stereotype between two words, we use a method mentioned earlier when discussing the properties of word embeddings: subtract the two vectors to get the main dimension along which they differ.
After subtracting the word pairs, you will find that their differences lie mainly along the gender dimension.
Then take the average of these difference vectors.
We can get the following result:
This gives us the main direction of the stereotype bias. We can then also find a direction that is not related to this particular bias.
Note: In this case, we treat the bias direction "gender" as a one-dimensional subspace, and the remaining unrelated directions as a 299-dimensional subspace. This is a simplification compared with the original paper. For details, see the references provided at the end of the article.
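A minimal sketch of this step may help. It assumes the simplified one-dimensional version described in the note and uses plain Python lists so it runs without dependencies. The word pairs (he/she, male/female), the 3-dimensional toy vectors, and the helper names `subtract` and `bias_direction` are all made up for illustration; they do not come from a trained model or from the paper.

```python
def subtract(u, v):
    """Element-wise difference of two equal-length vectors."""
    return [a - b for a, b in zip(u, v)]


def bias_direction(embeddings, pairs):
    """Average the difference vectors of the given word pairs."""
    diffs = [subtract(embeddings[a], embeddings[b]) for a, b in pairs]
    dim = len(diffs[0])
    return [sum(d[i] for d in diffs) / len(diffs) for i in range(dim)]


# Toy 3-dimensional embeddings (hypothetical numbers, not real model output).
embeddings = {
    "he": [1.0, 0.2, 0.1],
    "she": [-1.0, 0.2, 0.1],
    "male": [0.8, -0.3, 0.5],
    "female": [-0.8, -0.3, 0.5],
}
pairs = [("he", "she"), ("male", "female")]

# g approximates the "gender" direction; here it points along the first axis.
g = bias_direction(embeddings, pairs)
print(g)
```

In this toy data the pairs differ only in the first coordinate, so the averaged difference has no component in the other two coordinates.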
2. Neutralize
Some words are clearly gender-specific, but other words should be treated fairly, with no gender distinction.
Gender-specific words include grandmother and grandfather. Words that should carry no gender distinction include nurse and doctor. We should neutralize this second kind of word, that is, reduce its horizontal distance along the bias direction.
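Neutralizing can be sketched as a projection: subtract from the word vector its component along the bias direction, which in the one-dimensional setting is $e^{debiased} = e - \frac{e \cdot g}{\lVert g \rVert^2}\, g$. The code below is only an illustration under that assumption; the vectors for `g` and `nurse` and the helper names are hypothetical.

```python
def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def neutralize(e, g):
    """Remove the component of e that lies along the bias direction g."""
    scale = dot(e, g) / dot(g, g)
    return [ei - scale * gi for ei, gi in zip(e, g)]


# Hypothetical bias direction and word vector (toy numbers).
g = [1.8, 0.0, 0.0]
nurse = [-0.6, 0.4, 0.3]

nurse_debiased = neutralize(nurse, g)

# After neutralizing, the vector has no component along g,
# while the coordinates orthogonal to g are left unchanged.
print(nurse_debiased)
print(dot(nurse_debiased, g))
```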
3. Equalize
The second step dealt with gender-neutral words. So what is the problem with gender-specific words?
We can see it clearly in the figure above. For the word nurse, its distance to girl is noticeably smaller than its distance to boy. So when text is generated and nurse is mentioned, girl is more likely to appear. We therefore need to equalize the distances through computation.
After the computation, the words are shifted so that the gender-neutral word is at an equal distance from both words of the gender-specific pair.
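One way to sketch the equalize step, assuming a one-dimensional bias direction and unit-length embeddings, is to keep the part of the pair's mean that is orthogonal to the bias direction and give both words bias components of equal size and opposite sign. This follows the equalization formulas as I understand them from the referenced paper; the vectors and helper names below are hypothetical, and the code is not a definitive implementation.

```python
import math


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def add(u, v):
    return [a + b for a, b in zip(u, v)]


def sub(u, v):
    return [a - b for a, b in zip(u, v)]


def scale(u, c):
    return [c * a for a in u]


def project(u, g):
    """Component of u along the bias direction g."""
    return scale(g, dot(u, g) / dot(g, g))


def equalize(e1, e2, g):
    """Make a word pair symmetric about the subspace orthogonal to g.

    Assumes e1 and e2 are unit-length and differ along g.
    """
    mu = scale(add(e1, e2), 0.5)      # midpoint of the pair
    mu_b = project(mu, g)             # midpoint's bias component
    mu_perp = sub(mu, mu_b)           # midpoint's bias-free component
    size = math.sqrt(abs(1.0 - dot(mu_perp, mu_perp)))

    def rebuild(e):
        offset = sub(project(e, g), mu_b)
        unit = scale(offset, 1.0 / math.sqrt(dot(offset, offset)))
        return add(scale(unit, size), mu_perp)

    return rebuild(e1), rebuild(e2)


# Hypothetical unit-length toy vectors.
g = [1.0, 0.0, 0.0]
girl = [-0.6, 0.8, 0.0]
boy = [0.8, 0.6, 0.0]
nurse = [0.0, 0.6, 0.8]  # already neutralized: no component along g

girl_eq, boy_eq = equalize(girl, boy, g)
print(dot(girl_eq, nurse), dot(boy_eq, nurse))
```

Because both equalized vectors share the same bias-free part and differ only in the sign of their bias component, any neutralized word such as `nurse` ends up equally similar to both.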