当前位置:网站首页>Chapter7-13_ Dialogue State Tracking (as Question Answering)
Chapter7-13_ Dialogue State Tracking (as Question Answering)
2022-06-13 02:13:00 【zjuPeco】
This article is for teacher lihongyi 【Dialogue State Tracking (as Question Answering)】 My course notes , Course video youtube Address , spot here ( Need to climb over the wall ).
The pictures used in the following paragraphs are all from Mr. lihongyi PPT, If there is infringement , Must delete .
Article index :
Part 1 - 7-12 Controllable Chatbot
The next part - nothing
1 What is? Dialogue State Tracking
Dialogue State Tracking yes Task-oriented Dialogue A submodule . Let's talk about it first Task-oriented Dialogue What is it? .Task-oriented Dialogue The input of is the current words spoken by the user and the historical dialogue record , If the user's input is voice , There will be an extra ASR The module converts input into text , There will also be an extra TTS The module converts the output to voice . The user's input goes through an understanding module NLU(Natural language understanding), The output will be given to state tracker, That is the core content of this paper .state tracker It will extract useful information from the current input , preserved . At the same time, historical input will save useful information in the same form . The stored information will be in the form of state To the decision-making module (Policy).Policy Will tell the generation module NLG(natural language generation) What should be output . Last NLG The output responds to the user's words .
Said a lot , In fact, to put it simply Dialogue State Tracking It's up there state tracker, The goal is to simplify what users say into a useful information base , It can also be understood as a summary . The following figure vividly illustrates this process . So many conversations , It turns out to be key-value The key/value pair .

these keys It is the content defined in advance , and value It's all multiple choice questions , It is also a well-defined option . Because of the present Dialogue State There are often many keys, therefore key There will be domain and slot Two parts , Used to distinguish and classify different keys, Convenient for model processing , It is also about the migration of models .
All in all ,Dialogue State Tracking According to the conversation , Fill in the preset keys Of values It's a question of what .
2 Data sets
Dialogue State Tracking Some of the most famous datasets are MultiWOZ 2.0,SGD,CrossWOZ wait . This one here WOZ From the wizard of oz (The Wizard of Oz), The final big in the wizard of oz BOSS An old man was behind the scenes . The data set here is also disguised as AI The data from talking to people , That's right. It's people talking to people ,AAI(Artificial Artificial Intelligence).

3 Two challenges
Dialogue State Tracking Of values There are many options to make because the previous schemes regard this as a classification problem , The output is a category , But like the phone number , Time and other options are hard to enumerate , This method doesn't work , This is a challenge .
Another challenge is to have new keys Come in , The model will be re train too , The data has to be reconstructed , Migration costs are high .
To sum up :
- There are some value Inexhaustible
- new key Come in , Migration costs are high
These two questions , Both can be used. QA(Question Answering) How to solve this problem .QA The structure of is shown in the figure below , We put Dialogue history As source Input , And then according to key Ask the appropriate questions and let the model answer them . So the model goes back from source One of them span As the answer , It's solved value An inexhaustible question . And then for domain Different ,slot Same question , Change the problem domain that will do , Solved the problem of migration . Of course, the premise is this QA The model is ok Of .

4 Classic models
Dialogue State Tracking The classic model is TRADE, The structure shown below .Dialogue Will pass by a encoder Extracting features , meanwhile key Of domain and slot I'll take it separately embedding, Then add them together as question Part of the input , It will also be with Dialogue Of encoder Output as a attention.attention The result will be a slot gate, It was decided that PTR( Extract one from the conversation span), still DONTCARE( Users don't care ), still NONE( I haven't mentioned this yet key). According to this decision ,state generator The final result will be output .
because key Will be extracted separately domain and slot Of embedding, So this method can be applied to key Of , This also means that it can be solved zero-shot Problem. .
There are some key There will be some. description, Extract these description Of embedding It's also a solution zero-shot The way of the problem .
There are other variations , such as Slot Carryover Prediction I think that every word I say should put all keys It takes too much time and effort to ask again , So I designed a “ Same as above ” Of classifier To help avoid asking questions that have been known and have not been modified keys;DST QA Think some keys Between values It's related , For example, I know which hotel to book , Then the destination of the taxi is probably this hotel , Then a module is designed to solve this problem ;SimpleTOD be based on GPT-2 Designed a end-to-end Methods , The user said a word , The model outputs one sentence , The effect is surprisingly good .
边栏推荐
- 传感器:MQ-5燃气模块测量燃气值(底部附代码)
- Learning notes 51 single chip microcomputer keyboard (non coding keyboard and coding keyboard, scanning mode of non coding keyboard, independent keyboard, matrix keyboard)
- Sensorless / inductive manufacturing of brushless motor drive board based on stm32
- 1000 fans ~
- Introduction to arm Cortex-M learning
- LabVIEW large project development tools to improve quality
- Sensor: sht30 temperature and humidity sensor testing ambient temperature and humidity experiment (code attached at the bottom)
- [pytorch] kaggle large image dataset data analysis + visualization
- js-dom
- 拍拍贷母公司信也季报图解:营收24亿 净利5.3亿同比降10%
猜你喜欢

ROS learning -5 how function packs with the same name work (workspace coverage)

Huawei equipment configures private IP routing FRR

I didn't expect that the index occupies several times as much space as the data MySQL queries the space occupied by each table in the database, and the space occupied by data and indexes. It is used i

Sensor: sht30 temperature and humidity sensor testing ambient temperature and humidity experiment (code attached at the bottom)

Mac使用Docker安装Oracle

Barrykay electronics rushes to the scientific innovation board: it is planned to raise 360million yuan. Mr. and Mrs. Wang Binhua are the major shareholders

Stm32 mpu6050 servo pan tilt support follow

如何解决通过new Date()获取时间写出数据库与当前时间相差8小时问题【亲测有效】

Opencv camera calibration (1): internal and external parameters, distortion coefficient calibration and 3D point to 2D image projection

Looking at Qianxin's "wild prospect" of network security from the 2021 annual performance report
随机推荐
What are the differences in cache/tlb?
[work with notes] MFC solves the problem that pressing ESC and enter will automatically exit
[the second day of actual combat of smart lock project based on stm32f401ret6 in 10 days] GPIO and register
Opencv camera calibration (1): internal and external parameters, distortion coefficient calibration and 3D point to 2D image projection
Stm32 mpu6050 servo pan tilt support follow
Vscode configuration header file -- Take opencv and its own header file as an example
Combining strings and numbers using ssstream
如何解决通过new Date()获取时间写出数据库与当前时间相差8小时问题【亲测有效】
Share three stories about CMDB
Paipai loan parent company Xinye quarterly report diagram: revenue of RMB 2.4 billion, net profit of RMB 530million, a year-on-year decrease of 10%
Gome's ambition of "folding up" app
C语言压缩字符串保存到二进制文件,从二进制文件读取压缩字符串后解压。
[learning notes] xr872 GUI littlevgl 8.0 migration (file system)
Huawei equipment configures private IP routing FRR
Use of Arduino series pressure sensors and detected data displayed by OLED (detailed tutorial)
万字讲清 synchronized 和 ReentrantLock 实现并发中的锁
Looking at Qianxin's "wild prospect" of network security from the 2021 annual performance report
Sensor: sht30 temperature and humidity sensor testing ambient temperature and humidity experiment (code attached at the bottom)
[the second day of actual combat of smart lock project based on stm32f401ret6 in 10 days] (lighting with library function and register respectively)
Huawei equipment is configured with CE dual attribution