当前位置:网站首页>Cognitive fallacy: what is dimensional curse
Cognitive fallacy: what is dimensional curse
2022-07-03 21:37:00 【Jiedao jdon】
The more detailed your data , The less insight it has . Add only to the drawing 1 Additional parameters will cause the volume of the graph to increase exponentially , Scatter the contained data points and delete meaningful associations between them .
The phenomenon of dimensional curse appears in numerical analysis 、 sampling 、 Combinatorics 、 machine learning 、 Data mining, database and other fields . The common theme of these issues is , As dimensions increase , The growth of volume and space is so fast , So that the available data becomes sparse . In order to obtain reliable results , The amount of data required usually grows exponentially with the dimension .
The phrase , Due to Richard Bellman, Is to express the use of brute force ( Also known as grid search ) To optimize functions with too many input variables .
In today's big data world , It can also refer to several other potential problems when your data has a large number of dimensions .
- If we have more features than observations , We run the risk of large-scale over fitting models -- This usually leads to poor off sample performance .
- When we have too many characteristics , Observations will become more difficult to cluster -- Believe it or not , Too many dimensions will cause each observation in your data set to be equidistant from other observations . Because clustering uses distance measurement methods such as Euclidean distance to quantify the similarity between observations , So this is a big problem . If the distances are approximately equal , Then all the observations look the same ( The same difference ), Can't form meaningful clustering .
Refer to machine learning PCIA
边栏推荐
- TiDB 之 TiCDC6.0 初体验
- 全网都在疯传的《老板管理手册》(转)
- (5) User login - services and processes - History Du touch date stat CP
- Dahua series books
- Set, weakset, map, weakmap in ES6
- MySQL - idea connects to MySQL
- Mysql database ----- common commands of database (based on database)
- MySQL——索引
- Xai+ network security? Brandon University and others' latest "interpretable artificial intelligence in network security applications" overview, 33 page PDF describes its current situation, challenges,
- MySQL——索引
猜你喜欢
Implementation principle of inheritance, encapsulation and polymorphism
[vulnhub shooting range] impulse: lupinone
Why use pycharm to run the use case successfully but cannot exit?
Hcie security Day11: preliminarily learn the concepts of firewall dual machine hot standby and vgmp
Yiwen teaches you how to choose your own NFT trading market
Decompile and modify the non source exe or DLL with dnspy
(5) Web security | penetration testing | network security operating system database third-party security, with basic use of nmap and masscan
Transformer structure analysis and the principle of blocks in it
Capturing and sorting out external articles -- autoresponder, composer, statistics [III]
Notes on MySQL related knowledge points (startup, index)
随机推荐
Last week's content review
Software testing skills, JMeter stress testing tutorial, obtaining post request data in x-www-form-urlencoded format (24)
UI automation test: selenium+po mode +pytest+allure integration
MySQL——索引
Goodbye 2021, how do programmers go to the top of the disdain chain?
上周内容回顾
Borui data and Sina Finance released the 2021 credit card industry development report
Etcd raft Based Consistency assurance
2022-02-15 Daily: 2022 AAAI fellow release
MySQL - database backup
Custom view incomplete to be continued
MySQL——索引
Leetcode daily question 540 A single element in an ordered array Valentine's Day special article looking for a single dog in a pile of lovers ~ the clown is myself
Let me ask you a question. Have you ever used the asynchronous io of Flink SQL to associate dimension tables in MySQL? I set various settings according to the official website
Study diary: February 14th, 2022
Service discovery and load balancing mechanism -service
What is the maximum number of concurrent TCP connections for a server? 65535?
Global and Chinese market of recycled yarn 2022-2028: Research Report on technology, participants, trends, market size and share
Set, weakset, map, weakmap in ES6
Persistence of Nacos