In Its Pursuit of Human-Level AI, Meta Bets on Self-Supervised Learning
2022-07-04 00:37:00 【智源社区】
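Meta AI's masked autoencoder for computer vision is trained on images that are mostly masked out [left]. Even so, its reconstructions [center] come very close to the original images [right].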
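Yann LeCun, Meta's chief AI scientist, has described the vision this way: "We want to build intelligent machines that learn like animals and humans." To realize it, Meta is betting on self-supervised learning (SSL).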
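A while ago, a paper known as MAE (masked autoencoder) showed how a self-supervised system can reconstruct images from very sparse and incomplete data. MAE is not really a new concept, however, and Meta has already extended the work to new domains.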
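In video MAE, for example, up to 95% of each video frame is masked, because the similarity between frames means that a video signal carries more redundancy than a static image. Masking up to 95% of each frame also cuts the computational cost by as much as 95%.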
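The link between masking ratio and compute can be illustrated with a minimal sketch. It is not Meta's implementation. It assumes that each frame is split into a fixed grid of patches (the 14 x 14 grid is a hypothetical choice), that patches are masked by uniform random sampling, and that the encoder only processes visible patches. The function names `random_mask` and `encoder_token_fraction` are made up for this example.

```python
import random


def random_mask(num_patches, mask_ratio, seed=None):
    """Split patch indices into (visible, masked) by uniform random sampling.

    Assumption: masking is uniform over patches; real systems may use
    other sampling schemes.
    """
    if not 0.0 <= mask_ratio < 1.0:
        raise ValueError("mask_ratio must be in [0, 1)")
    rng = random.Random(seed)
    num_masked = int(round(num_patches * mask_ratio))
    # Always keep at least one patch visible so the encoder has input.
    num_masked = min(num_masked, num_patches - 1)
    order = list(range(num_patches))
    rng.shuffle(order)
    masked = sorted(order[:num_masked])
    visible = sorted(order[num_masked:])
    return visible, masked


def encoder_token_fraction(num_patches, mask_ratio):
    """Fraction of patches the encoder would still have to process."""
    visible, _ = random_mask(num_patches, mask_ratio, seed=0)
    return len(visible) / num_patches


if __name__ == "__main__":
    patches_per_frame = 14 * 14  # hypothetical patch grid
    visible, masked = random_mask(patches_per_frame, 0.95, seed=0)
    print(len(visible), "visible patches,", len(masked), "masked patches")
    print("encoder input fraction:", encoder_token_fraction(patches_per_frame, 0.95))
```

Under these assumptions the encoder sees only about 5% of the patches in each frame. The actual saving depends on the model: attention cost grows faster than linearly with the number of tokens, and the decoder still has to reconstruct the masked patches, so the figure of up to 95% is best read as an upper bound.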
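In audio MAE, likewise, the Meta AI team converts sound files into spectrograms and masks parts of them for training. The reconstructed audio is impressive, although the model can currently handle only clips a few seconds long. The team says it will soon publish its audio MAE work on arxiv.org.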