当前位置:网站首页>Voice assistant - overall architecture and design
Voice assistant - overall architecture and design
2022-06-12 07:31:00 【Turned_ MZ】
In this chapter, let's take a look at the overall architecture and design of voice assistant .
In general , A relatively perfect voice assistant can be divided into : Central control part + BOT part , For one BOT for , Its essence is a service that can run independently , Include your own central control , Its interior is a small Sunday , The existence of central control is to deal with some problems for each BOT In terms of public treatment , And each BOT Distribution of 、 Sorting and other functions . Here's the picture :

The blue part , For each BOT, For different system types ,BOT The interior design is also different , There are three typical BOT: gossip BOT、 Mission BOT、 Question and answer BOT, As for each BOT The interior design , We will go into more detail in the following chapters . Here we mainly explain the design of the central control system . The rest of the figure , It contains :QU、 Operation intervention layer 、 Dialogue management and post-processing policy layer . The following briefly describes the functions of each module :
One 、QU:
QU For basic semantic understanding , Here is the input query Do some common basic semantic understanding , Include entity identification 、 Text classification 、 Semantic Role Labeling 、 Semantic retrieval 、 Text rewriting, etc , Central control and BOT It will be processed according to the results of semantic understanding , For example, semantic role annotation 、 Text classification 、 Semantic retrieval 、 Entity recognition can be used in conversation management BOT Distribution and sorting of 、 Text rewriting is used to query Error correction , Improve downstream identification effect .
Two 、 Operation intervention layer :
Operation intervention layer , seeing the name of a thing one thinks of its function , It is mainly used for operation intervention , It can be used in two cases :
1. Some scripts , Don't want to use its own semantics , Instead, we want to give it special semantics to achieve specific effects , for example :“ Who is the most beautiful person in the world ”, This is a question , You should search the corresponding Q & A results , But operators sometimes want to turn on the front camera , As a little egg , At this point, we need here to query Intervene .
2、 Some scripts , As a result, we found that it was not carried out as expected , At this point, we can intervene in the results here , Or right query To rewrite , To correct the results .
Of course , In addition to operational availability , There are many things we can do on this floor , For some online questions , It can be handled quickly through intervention here , Avoid impacting more users , For example, you can do BOT Distributed interventions , Interventions that return results, etc .
3、 ... and 、DM layer :
Dialogue management , The dialog management here mainly includes two functions :BOT Distribution and sorting of , Multiple rounds of dialogue . It will take advantage of the current query The result of semantic understanding 、 Historical context 、 The environment is decided by the context Action( Executive action ) And the next state , Here's the picture :

About the distribution and sorting of central control , Multi round conversation , These will be explained in detail in the following chapters .
Four 、 Pendula BOT layer :
Pendula bot layer , It includes each independent BOT, Deal with the contents of their respective fields separately , Like chatting BOT It mainly deals with small talk , Chat with users , Question and answer BOT It mainly deals with the dialogue of knowledge retrieval type , Help users search for knowledge , alarm clock BOT It mainly deals with the semantics related to the alarm clock , music BOT It mainly deals with music related semantics and so on . Different types of BOT Internal detailed design of , Let's talk about it later , It's not going to unfold here .
5、 ... and 、 Post processing strategy layer :
The post-processing policy layer contains : Client interaction , Recommended services , Dialogue strategies, etc .
Client interaction , That is, the final skill result , encapsulate , Get the structure that the client can execute .
Recommended services , According to the knowledge map 、 User portrait 、 Product strategy, etc. make some recommendations for current user scripts , Such as user query:“ Who is Jay Chou ”, You can make related recommendations :“ Jay Chou's itinerary ”,“ Who is Jay Chou's wife ”,“ Play Jay Chou's song ” wait , It can also be based on the user's usage habits and current location 、 Time to make some personalized recommendations , such as : The current night 10 spot , You can recommend “ Tomorrow morning 8 An alarm clock at ”. Recommendation service is also a separate content , In the following chapters, we will talk about it separately .

Dialogue strategy , This is mainly for some special post-processing , For example, in some scenes , Some skills need to be closed .
边栏推荐
- RT thread studio learning (I) new project
- R语言使用epiDisplay包的summ函数计算dataframe中指定变量在不同分组变量下的描述性统计汇总信息并可视化有序点图、使用dot.col参数设置不同分组数据点的颜色
- R语言glm函数构建泊松回归模型(possion)、epiDisplay包的poisgof函数对拟合的泊松回归模型进行拟合优度检验、即模型拟合的效果、验证模型是否有过度离散overdispersion
- tmux 和 vim 的快捷键修改
- 私有协议的解密游戏:从秘文到明文
- Detailed explanation of 8086/8088 system bus (sequence analysis + bus related knowledge)
- RT thread studio learning (x) mpu9250
- Acwing - 4269 school anniversary
- Talk about vscode configuration settings JSON knows why (with a large number of configurations)
- Thyristor, it is a very important AC control device
猜你喜欢

2022起重机械指挥考试题模拟考试平台操作

Esp8266 firmware upgrade method (esp8266-01s module)

鸿蒙os-第一次培训

Federated meta learning with fast convergence and effective communication

Static coordinate transformation in ROS (analysis + example)

晶闸管,它是很重要的,交流控制器件
![‘CMRESHandler‘ object has no attribute ‘_timer‘,socket.gaierror: [Errno 8] nodename nor servname pro](/img/de/6756c1b8d9b792118bebb2d6c1e54c.png)
‘CMRESHandler‘ object has no attribute ‘_timer‘,socket.gaierror: [Errno 8] nodename nor servname pro

Pyhon的第四天

Federated reconnaissance: efficient, distributed, class incremental learning paper reading + code analysis

Kali and programming: how to quickly build the OWASP website security test range?
随机推荐
GD32F4(5):GD32F450时钟配置为200M过程分析
RT thread studio learning summary
Pyhon的第六天
Exploring shared representations for personalized federated learning paper notes + code interpretation
Pyhon的第四天
右击文件转圈卡住、刷新、白屏、闪退、桌面崩溃的通用解决方法
R语言rnorm函数生成正太分布数据、使用epiDisplay包的summ函数计算向量数据的描述性统计汇总信息并可视化有序点图(名称、有效值个数、均值、中位数、标准差、最大值、最小值)
Formatting the generalization forgetting trade off in continuous learning
2022r2 mobile pressure vessel filling test question simulation test platform operation
Unity uses shaders to highlight the edges of ugu I pictures
VS2019 MFC IP Address Control 控件继承CIPAddressCtrl类重绘
Adaptive personalized federated learning paper interpretation + code analysis
Interview computer network - transport layer
Kali and programming: how to quickly build the OWASP website security test range?
2022 electrician (elementary) examination question bank and simulation examination
Hongmeng OS first training
Study on display principle of seven segment digital tube
Imx6q PWM drive
Class as a non type template parameter of the template
Detailed explanation of coordinate tracking of TF2 operation in ROS (example + code)