当前位置:网站首页>[academic related] why can't many domestic scholars' AI papers be reproduced?
[academic related] why can't many domestic scholars' AI papers be reproduced?
2022-07-29 08:03:00 【51CTO】
Teacher Wu Enda once said , The key to reading a paper , It's the author's algorithm .
However , Many papers can't be reproduced at all , Why is that ?
One 、 Data relation
Because the data used by the author is private , Most people don't get , In this case , Even if the author provides the source code , But readers don't get the data , There's no way to reproduce the algorithm .
This situation is very common in domestic academic circles , No one else has the data , It's like an Olympiad math teacher , I have a Mathematical Olympiad problem , Find out for yourself , Then I wrote a paper about the process of solving the problem , This kind of paper is often not convincing enough , The story is not strong enough .
Two 、 Hardware reasons
Many algorithms for deep learning , It's done by doing miracles with great efforts . For example, Google. 、facebook Some of the algorithms , Rely on powerful hardware training out .
Ordinary researchers don't have that powerful hardware resources , I don't think it can reach their 1% Calculation power , There's no way to reproduce the algorithm .
3、 ... and 、 Data division and training methods
Some papers have made the code public , And it's open data , But the paper does not mention the problem of data division , If the data is small , Different divisions lead to different results .
Four 、 Well known reasons
We all know the reason , I don't understand , This situation appears in the papers of many domestic authors . This is rare in public data .
Many papers published by domestic scholars , The usual routine is :
1. Define a very new but meaningless problem ;
2. oriented github Programming ;
3. Add some to the network attention,module,normalization,loss, Until it doesn't collapse ;
4. Make up a story , produce a novel , It seems that the logic is quite clear , But don't give people a chance to reproduce .
What is the ideal paper like ?
1. The effect can be reproduced , The logic of every experiment in the paper is very clear , The logical chain formed by all the experiments is complete , Using public data sets , The results are basically the same as the paper .
It's the big guys in the field who can achieve this , Like Chen Tianqi 、 He Kaiming .
2. Using public data , Open code , The details of the paper are clear , It can reproduce the effect of the paper . Although the authors of many papers can't explain why the network designed in this way works well , This should be the reason why deep learning can't be explained . Because they made the code public , It works well on public data sets , Can reproduce the effect , So it's also a good paper .
Recommendation at the end of the paper
Last , Recommend a website :paperswithcode.com/, Many excellent papers can be found in the code .

边栏推荐
- Network Security Learning chapter
- Mqtt server setup and mqtt.fx testing
- Unity beginner 4 - frame animation and protagonist attack (2D)
- Why don't you like it? It's easy to send email in cicd
- Explanation and closing method of server 135, 137, 138, 139, 445 and other ports
- [deep learning] data preparation -pytorch custom image segmentation data set loading
- String class
- An Optimal Buffer Management Scheme with Dynamic Thresholds论文总结
- What are the common error types and solutions of black box testing?
- 【NOI模拟赛】计算几何(凸包,暴力,并查集)
猜你喜欢

IonIcons图标大全

Do you want to meet all the needs of customers
![[beauty of software engineering - column notes] 22 | how to do a good job in technology selection for the project?](/img/1a/72bfb3fef59c54188a823ead3a5390.png)
[beauty of software engineering - column notes] 22 | how to do a good job in technology selection for the project?

Tcp/ip five layer reference model and corresponding typical devices and IPv6

STM32 detection signal frequency

Realize the effect of changing some colors of a paragraph of text

JVM garbage collection mechanism (GC)
![[cryoelectron microscope | paper reading] interpretation of sub fault average m software: multi particle cryo EM refining with M](/img/5e/69987afcd1e50ba37bc49441dd3a50.png)
[cryoelectron microscope | paper reading] interpretation of sub fault average m software: multi particle cryo EM refining with M

Amaze UI 图标查询

10 common software architecture modes
随机推荐
The new colleague wrote a few pieces of code, broke the system, and was blasted by the boss!
CDM - code division multiplexing (easy to understand)
Taiyuan bus route crawling
My entrepreneurial neighbors
C language problems
工业互联网行至深水区,落地的路要怎么走?
postman接口测试|js脚本之阻塞休眠和非阻塞休眠
[flask introduction series] installation and configuration of flask Sqlalchemy
Unity beginner 2 - tile making and world interaction (2D)
In JS, 0 means false, and non-0 means true
[密码学实验] 0x00 安装NTL库
[skill accumulation] common expressions when writing emails
Space shooting Lesson 17: game over (end)
[skill accumulation] presentation practical skill accumulation, common sentence patterns
阿里巴巴政委体系-第一章、政委建在连队上
Go, how to become a gopher, and find work related to go language in 7 days, Part 1
[beauty of software engineering - column notes] 25 | what methods can improve development efficiency?
在js中,0表示false,非0表示true
[beauty of software engineering - column notes] 24 | technical debt: continue to make do with it, or overthrow it and start over?
[cryoelectron microscope] relion4.0 pipeline command summary (self use)