当前位置:网站首页>在寻求人类智能AI的过程中,Meta将赌注押向了自监督学习
在寻求人类智能AI的过程中,Meta将赌注押向了自监督学习
2022-07-04 00:37:00 【智源社区】

Meta AI 用于计算机视觉的掩码自编码器是在大部分被遮挡的图像上训练的 [左]。 然而,它的重建 [中心] 非常接近原始图像 [右]。
Meta 的首席 AI 科学家 Yann LeCun 曾表达过这样一个愿景, “我们想建造像动物和人类一样学习的智能机器”。为了实现这一愿景,Meta将宝押在了自监督学习(SSL)上。
前段时间,一篇名为MAE的论文展示了自监督系统如何从非常零散和不完整的数据中重建图像。然而,MAE其实并不是一个新概念,Meta早已将这项工作扩展到了新的领域。
比如在视频MAE中,每个视频帧的掩码率高达95%,因为帧之间的相似性意味着视频信号比静态图像具有更多的冗余。而MAE通过屏蔽高达 95% 的每一帧,将计算成本降低了高达 95%。
又如在音频MAE中,Meta AI团队将声音文件转换为频谱图,对其部分遮蔽以进行训练。重建后的音频令人印象深刻,尽管该模型目前只能处理几秒钟的剪辑。该团队表示将很快在arxiv.org上发布音频MAE的相关工作。
边栏推荐
- 查询效率提升10倍!3种优化方案,帮你解决MySQL深分页问题
- 打印菱形图案
- 手机异步发送短信验证码解决方案-Celery+redis
- 功能:求5行5列矩阵的主、副对角线上元素之和。注意, 两条对角线相交的元素只加一次。例如:主函数中给出的矩阵的两条对角线的和为45。
- A dichotomy of Valentine's Day
- Wechat official account and synchronization assistant
- Regular expression of shell script value
- A Kuan food rushed to the Shenzhen Stock Exchange: with annual sales of 1.1 billion, Hillhouse and Maotai CCB are shareholders
- Function: write function fun to find s=1^k+2^k +3^k ++ The value of n^k, (the cumulative sum of the K power of 1 to the K power of n).
- CSP window
猜你喜欢
![[cloud native topic -48]:kubesphere cloud Governance - operation - overview of multi tenant concept](/img/b4/961b3b44e9ecbfd4bddd04318b663a.jpg)
[cloud native topic -48]:kubesphere cloud Governance - operation - overview of multi tenant concept

A dichotomy of Valentine's Day

OS interrupt mechanism and interrupt handler

A-Frame虚拟现实开发入门

查询效率提升10倍!3种优化方案,帮你解决MySQL深分页问题

Pytest unit test framework: simple and easy to use parameterization and multiple operation modes
![CesiumJS 2022^ 源码解读[8] - 资源封装与多线程](/img/d2/99932660298b4a4cddd7e5e69faca1.png)
CesiumJS 2022^ 源码解读[8] - 资源封装与多线程

From functional testing to automated testing, how did I successfully transform my salary to 15K +?

Function: find the sum of the elements on the main and sub diagonal of the matrix with 5 rows and 5 columns. Note that the elements where the two diagonals intersect are added only once. For example,

@EnableAsync @Async
随机推荐
The difference between objects and objects
机器学习基础:用 Lasso 做特征选择
How to trade spot gold safely?
Understanding of Radix
Global and Chinese market of melting furnaces 2022-2028: Research Report on technology, participants, trends, market size and share
Global and Chinese markets of distributed control system (DCS) consumption 2022-2028: Research Report on technology, participants, trends, market size and share
What is the potential of pocket network, which is favored by well-known investors?
[GNN] hard core! This paper combs the classical graph network model
功能:将主函数中输入的字符串反序存放。例如:输入字符串“abcdefg”,则应输出“gfedcba”。
How to set the response description information when the response parameter in swagger is Boolean or integer
[leetcode] interview question 17.08 Circus tower
Global and Chinese markets for coronary artery disease treatment devices 2022-2028: Research Report on technology, participants, trends, market size and share
Data mining vs Machine Learning: what is the difference between them? Which is more suitable for you to learn
Why use get/set instead of exposing properties
[error record] configure NDK header file path in Visual Studio
Print diamond pattern
gslb(global server load balance)技术的一点理解
Future source code view -juc series
Unity Shader入门精要读书笔记 第三章 Unity Shader基础
Delete all elements with a value of Y. The values of array elements and y are entered by the main function through the keyboard.