Let's talk about multimodality, the hot topic of the moment
2022-07-27 07:02:00 【Xixiaoyao】
Multimodal machine learning (MultiModal Machine Learning, MMML) aims to use machine learning methods to process and understand information from multiple modalities.
Each source or form of information can be called a modality. For example, humans have the senses of touch, hearing, sight, and smell; information is carried by media such as speech, video, and text; and there are many kinds of sensors, such as radar, infrared, and accelerometers. Each of these can be called a modality.
Modality can also be defined very broadly. For example, two different languages can be treated as two modalities, and even datasets collected under two different conditions can be treated as two modalities.
Today, multimodal technology has a wide range of application scenarios, such as Taobao search, AI subtitles, AI virtual digital humans, human-like interaction, intelligent assistants, product recommendation and information-feed advertising, image vector retrieval over video frames and face bounding boxes, and voice interaction.
We are honored to invite instructor Peng, who holds multiple patents and has many years of experience as an algorithm engineer at major tech companies, to walk you systematically through multimodal technology in about 2 hours.
PART 01 Live sharing
Day 1 live broadcast
01 Development and future of multimodality
02 Overview reading of a paper: CLIP, a pioneering work in the multimodal field
03 Recommended learning path

Day 2 live broadcast
Close reading of a paper: CLIP, a pioneering work in the multimodal field
01 Research background
02 Introduction
03 Model
04 Experiments
05 Conclusion
PART 02 Lecturer

PART 03 Live time
July 28 (Thursday), 20:00-21:00
July 29 (Friday), 20:00-21:00
If you are interested in multimodal technology, scan the QR code below to reserve a spot for the live broadcast.

Scan the code and pay 0.1 yuan to complete your reservation.
Staff will contact you around the time of the live broadcast.
PART 04 Multimodal learning path

01 Fundamentals of multimodal theory
Study papers on multimodal pre-training: CLIP, ALIGN, ViLT
02 Self-supervised algorithms
Learn self-supervised methods that may be used in multimodal pre-training: MAE, DINO, MoCo
03 Introduction to multimodal downstream tasks
Mainly understand the VQA and NLVR tasks
04 Multimodal applications
Case studies: image captioning and Alibaba e-commerce cross-modal retrieval. Each covers the task introduction, building a baseline, model optimization, and presenting results.
05 Multimodal projects
AI smart copywriting; mobile photo album management and retrieval based on a multimodal pre-trained model; AI lip reading; autonomous driving based on deep multimodal object detection and semantic segmentation
