当前位置:网站首页>Introduction to Cosine Distance
Introduction to Cosine Distance
2022-08-03 19:11:00 【xiaozheng123121】
目录
作者:CSDN博主「深度学习视觉」
原文链接:https://blog.csdn.net/lucky_kai/article/details/89514868
————————————————
概述: 在机器学习领域中,通常将特征表示为向量的形式,所以在分析两个特征向量之间的相似性时,Commonly used cosine similarity representation.
For example, two articles are vectorized,Cosine distance can avoid large distances due to different lengths of articles,The cosine distance only considers the angle between the vectors generated by the two articles.
余弦相似度的取值范围是[-1,1],相同两个向量的之间的相似度为1.
余弦距离的取值范围是[0,2].
The definition formula of cosine similarity is c o s ( A , B ) = A ⋅ B ∥ A ∥ 2 ∥ B ∥ 2 cos(A,B)=\frac{A\cdot B}{\left\|A \right\|_2\left\|B \right\|_2} cos(A,B)=∥A∥2∥B∥2A⋅B
归一化后: ∥ A ∥ 2 = 1 , ∥ B ∥ 2 = 1 , ∥ A ∥ 2 ∥ B ∥ 2 = 1 \left\|A\right\|_2=1, \left\|B\right\|_2=1, \left\|A\right\|_2\left\|B\right\|_2=1 ∥A∥2=1,∥B∥2=1,∥A∥2∥B∥2=1
余弦距离: d i s t ( A , B ) = 1 − c o s ( A , B ) = ∥ A ∥ 2 ∥ B ∥ 2 − A ⋅ B ∥ A ∥ 2 ∥ B ∥ 2 dist(A,B)=1-cos(A,B)=\frac{\left\|A \right\|_2\left\|B \right\|_2-A\cdot B}{\left\|A \right\|_2\left\|B \right\|_2} dist(A,B)=1−cos(A,B)=∥A∥2∥B∥2∥A∥2∥B∥2−A⋅B,distance is greater than0
欧式距离:
It can be seen from the formula that after normalization,There is a monotonic relationship between Euclidean distance and cosine distance.At this time, the value range of both distances is [0,2].
Euclidean distance vs cosine distance:
1.欧式距离的数值受到维度的影响,余弦相似度在高维的情况下也依然保持低维完全相同时相似度为1等性质.
2.欧式距离体现的是距离上的绝对差异,The cosine distance reflects the relative difference in direction.
边栏推荐
猜你喜欢
随机推荐
[Azure Event Hub] Create Event Hub Consume Client + Custom Event Position with Azure AD Authentication
MVC vs MVP
力扣刷题之有效的正方形(每日一题7/29)
Web项目Controller统一返回实体类
Postgresql快照优化Globalvis新体系分析(性能大幅增强)
MD5是对称加密还是非对称加密,有什么优缺点
Climbing Stairs (7/30)
读取 resources 目录下的文件路径的九种方式,你知道多少?
分享即时通讯开发之WebSocket:概念、原理、易错常识、动手实践
WEB 渗透之RCE
idea——同一项目开启多个实例(不同端口)
软件测试技术之如何编写测试用例(3)
实时渲染器不止lumion,Chaos Vantage你值得一试
货比四家 version tb1.63
【C语言学习笔记(六)】分支与跳转(if、else、continue、break、switch)
【ORACLE】什么时候ROWNUM等于0和ROWNUM小于0,两个条件不等价?
[数据集][VOC]老鼠数据集voc格式3001张
mysql跨库关联查询(dblink)
vulnhub pyexp: 1
阿里巴巴政委体系-第七章、阿里政委培育









