当前位置:网站首页>UC伯克利助理教授Jacob Steinhardt预测AI基准性能:AI在数学等领域的进展比预想要快,但鲁棒性基准性能进展较慢
UC伯克利助理教授Jacob Steinhardt预测AI基准性能:AI在数学等领域的进展比预想要快,但鲁棒性基准性能进展较慢
2022-07-06 18:33:00 【智源社区】
- Forecasters’ predictions were not very good in general: two out of four forecasts were outside the 90% credible intervals.
- However, they were better than my personal predictions, and I suspect better than the median prediction of ML researchers (if the latter had been preregistered).
- Specifically, progress on ML benchmarks happened significantly faster than forecasters expected. But forecasters predicted faster progress than I did personally, and my sense is that I expect somewhat faster progress than the median ML researcher does.
- Progress on a robustness benchmark was slower than expected, and was the only benchmark to fall short of forecaster predictions. This is somewhat worrying, as it suggests that machine learning capabilities are progressing quickly, while safety properties are progressing slowly.
边栏推荐
- CISP-PTE之命令注入篇
- Appium automation test foundation uiautomatorviewer positioning tool
- 将截断字符串或二进制数据
- 爬虫实战(六):爬笔趣阁小说
- AcWing 1140. Shortest network (minimum spanning tree)
- AcWing 345. Cattle station solution (nature and multiplication of Floyd)
- ZOJ problem set – 2563 long dominoes [e.g. pressure DP]
- Zabbix 5.0:通过LLD方式自动化监控阿里云RDS
- Reptile practice (VI): novel of climbing pen interesting Pavilion
- AcWing 1148. Secret milk transportation problem solution (minimum spanning tree)
猜你喜欢
Baidu flying general BMN timing action positioning framework | data preparation and training guide (Part 1)
Errors made in the development of merging the quantity of data in the set according to attributes
Integrated navigation: product description and interface description of zhonghaida inav2
ROS learning (24) plugin
新工作感悟~辞旧迎新~
ROS learning (XX) robot slam function package -- installation and testing of rgbdslam
ROS学习(25)rviz plugin插件
我如何编码8个小时而不会感到疲倦。
传感器:土壤湿度传感器(XH-M214)介绍及stm32驱动代码
Flir Blackfly S 工业相机 介绍
随机推荐
蓝桥杯2022年第十三届省赛真题-积木画
新工作感悟~辞旧迎新~
JS ES5也可以创建常量?
Get to know MySQL for the first time
C language [23] classic interview questions [Part 2]
POJ 3177 redundant paths POJ 3352 road construction (dual connection)
Reptile practice (VI): novel of climbing pen interesting Pavilion
Unicode string converted to Chinese character decodeunicode utils (tool class II)
ROS学习(25)rviz plugin插件
2022 system integration project management engineer examination knowledge point: Mobile Internet
JS ES5也可以創建常量?
场景实践:基于函数计算快速搭建Wordpress博客系统
2022/0524/bookstrap
Analyze "C language" [advanced] paid knowledge [End]
Halcon knowledge: segment_ contours_ XLD operator
The use of video in the wiper component causes full screen dislocation
刨析《C语言》【进阶】付费知识【完结】
BigDecimal 的正确使用方式
Baidu flying general BMN timing action positioning framework | data preparation and training guide (Part 1)
微服务架构介绍