当前位置:网站首页>Talk about the multimodal project of fire
Talk about the multimodal project of fire
2022-06-21 09:59:00 【woshicver】
Multimodal machine learning , English full name MultiModal Machine Learning (MMML), The aim is to achieve the ability to process and understand multi-source modal information by means of machine learning .
Each source or form of information , You can call it a mode . for example , People have a sense of touch , auditory , Vision , The sense of smell ; The message has voice 、 video 、 Words and other media ; A variety of sensors , Such as radar 、 infrared 、 Accelerometer, etc. . Each of the above can be called a mode .
Modes can also be very broadly defined , For example, we can think of two different languages as two modes , Even the data sets collected in two different cases , You can think of it as two modes .
The present , Multimodal technology has a wide range of application scenarios , Such as Taobao Search 、AI subtitle 、AI Virtual digital human 、 Humanoid interaction 、 Intelligent assistant 、 Product recommendation and information flow advertising 、 Image vector retrieval of video frame and face frame 、 Voice interaction, etc .
We are honored to invite in-service senior algorithm researchers Clark teacher , utilize 1 About an hour or so , Systematically sort out multimodal technology for you .
Live sharing
01
PART
01 The development trend of multimodal models
02 Multimodal data set
03 Common multimodal downstream tasks
02
PART
Lecturer

Live time
03
PART
6 month 22 Friday night 20:00-21:00
Students interested in multimodal Technology , Scan the QR code below , Reservation live broadcast .

Sweep code payment 0.1 Yuan means the appointment is successful
Live broadcast when the party staff contact you ~
04
PART
Multimodal learning path

01 Fundamentals of multimodal theory
Study multimodal pre training related papers ——CLIP、ALIGN、VILT
02 Self supervised algorithm
Learn some self-monitoring schemes that may be used in multimodal pre training ——MAE、DINO、MOCO
03 Introduction to multimodal downstream tasks
Mainly understand VQA The tasks and nlvr Mission
04 Multimodal applications
Image Captioning Case study 、 Alibaba e-commerce cross modal retrieval case . Understand the task introduction 、baseline build 、 Model optimization 、 Result display .
05 Multimodal project
AI Smart copywriting 、 Mobile photo album management and retrieval based on multimodal pre training model 、AI Lip recognition 、 Automatic driving based on deep multimodal target detection and semantic segmentation
6 month 22 Friday night 20:00-21:00
Students interested in multimodal Technology , Scan the QR code below , Reservation live broadcast .

Sweep code payment 0.1 Yuan means the appointment is successful
Live broadcast when the party staff contact you ~

边栏推荐
- Introduction to ground plane in unity
- 118. summary of basic knowledge of typescript (data type, interface, abstract class, inheritance, attribute encapsulation, modifier)
- 获取配置文件properties中的数据
- Comparison between JWT and session
- stm32mp1 Cortex M4开发篇9:扩展板空气温湿度传感器控制
- Alibaba cloud OSS uploading and intelligent image recognition garbage recognition
- 2022年中总结-一步一个脚印,踩出柳暗花明
- Classification of ram and ROM storage media
- R language through rprofile Site file, user-defined configuration of R language development environment startup parameters, shutdown parameters, user-defined specified cran local image source download
- Celsius 的暴雷,会是加密领域的“雷曼时刻”吗?
猜你喜欢

Alibaba cloud OSS uploading and intelligent image recognition garbage recognition

安全百强 中坚力量!美创科技入选《2022年中国数字安全百强报告》

character string

stm32mp1 Cortex M4开发篇10:扩展板数码管控制

leetcode:715. Range 模块【无脑segmentTree】

简易的安卓天气app(三)——城市管理、数据库操作

Lei niukesi --- basis of embedded AI

基因型填充前的质控条件简介

字符串

Lodash real on demand approach
随机推荐
[practice] stm32mp157 development tutorial FreeRTOS system 3: FreeRTOS counting semaphore
Mobile applications introduce static Cordova according to different platforms
Eureka的TimedSupervisorTask类(自动调节间隔的周期性任务)
AI越进化越跟人类大脑像!Meta找到了机器的“前额叶皮层”,AI学者和神经科学家都惊了...
EIG和沙特阿美签署谅解备忘录,扩大能源合作
异常
Optional classes, convenience functions, creating options, optional object operations, and optional streams
Electron checks the CPU and memory performance when the module is introduced
Telecommuting Market Research Report
DSP online upgrade (1) -- understand the startup process of DSP chip
并发底层原理:线程、资源共享、volatile 关键字
聊聊大火的多模态项目
Inner class
109. use of usereducer in hooks (counter case)
Polymorphic & class object & registered factory & Reflection & dynamic proxy
Clipboard learning records and pit encountered
Lei niukesi --- basis of embedded AI
如何选择嵌入式练手项目、嵌入式开源项目大全
【实战】STM32 FreeRTOS移植系列教程4:FreeRTOS 软件定时器
stm32mp1 Cortex M4开发篇8:扩展板LED灯控制实验