当前位置:网站首页>Gary Marcus wrote: three perspectives from linguists that AI researchers need to know
Gary Marcus wrote: three perspectives from linguists that AI researchers need to know
2022-06-23 13:21:00 【Zhiyuan community】
Everybody knows that large language models like GPT-3 and LaMDA have made tremendous strides, at least in some respects, and powered past many benchmarks, and Cosmo recently described DALL-E but most in the field also agree that something is still missing. A group of engineers at Facebook, for example, wrote in 2019 that:
A growing body of evidence shows that state-of-the-art models learn to exploit spurious statistical patterns in datasets... instead of learning meaning in the flexible and generalizable way that humans do."
Since then, the results on benchmarks have gotten better, but there’s still something missing.
If we had to put our finger on what is still missing, we would focus on these three key elements:
Reference: Words and sentence don’t exist in isolation. Language is about a connection between words (or sentence) and the world; the sequences of words that large language models utter lack connection to the external world.
Cognitive models: The ultimate goal of a language system should be to update a persisting but dynamic sense of the world. Large language models don’t produce such cognitive models, at least not in a way that anybody has been able to make reliable use of.
Compositionality: Complex wholes are (mostly) systematically interpreted in terms of their parts, and how these parts are arranged. Systems like DALL-E face clear challenges when it comes to compositionality. (LLM’s like GPT produce well-formed prose but do not produce interpretable representations of utterances that reflect structured relationships between the parts of those sentences.)
In our view, inadequate attention to these three factors has serious consequences, including:
(a) the tendency of large language models to lose coherence over time, drifting into “empty” language with no clear connection to reality;
(b) the difficulty of large language models in distinguishing truth from falsehoods;
(c) the struggle in these models to avoid perpetuating bias and toxic speech.
Now here’s the thing: none of these three elements we have been stressing are news to linguists. In fact, at least since the work of Gottlob Frege in the late 19th century, they have been pretty central to what many linguists worry about. To be sure, none of these three issues has been solved so far; for example, there is still debate about “how much” of our everyday language use actually relies on compositionality, and what the right cognitive models of language should be. But we do think that linguistics has a lot to offer in terms of formulating and thinking about these questions.
边栏推荐
- kubernetes comfig subpath
- How do the top ten securities firms open accounts? Is online account opening safe?
- Overview of national parks in the United States
- js: 获取页面最大的zIndex(z-index)值
- What should testers do if the requirements need to be changed when the project is half tested?
- R语言将距离矩阵输入给hclust函数进行层次聚类分析,使用cutree函数进行层次聚类簇的划分、参数k指定聚类簇的个数、给每个样本都分配了簇标签
- < Sicily> 1001. Rails
- Is it safe for flush to open an account online? What should we pay attention to
- In flinksql, the Kafka flow table and MySQL latitude flow table are left joined, and the association is made according to I'd. false
- 64 channel PCM telephone optical transceiver 64 channel telephone +2-channel 100M Ethernet telephone optical transceiver 64 channel telephone PCM voice optical transceiver
猜你喜欢

Excel-vba quick start (I. macros, VBA, procedures, types and variables, functions)

华三交换机配置SSH远程登录

Architecture design methods in technical practice

Getting started with reverse debugging - learn about PE structure files

OS的常见用法(图片示例)

「开发者说」钉钉连接器+OA审批实现学校学生假勤场景数字化

4E1 PDH optical transceiver 19 inch rack type single fiber transmission 20km E1 interface optical network optical transceiver
![[website architecture] the unique skill of 10-year database design, practical design steps and specifications](/img/f2/061fa6dd42e57a121401e4f0cf1865.png)
[website architecture] the unique skill of 10-year database design, practical design steps and specifications

How to test the third-party payment interface?

Solution: argument type 'string' expected to be an instance of a class or class constrained type
随机推荐
Unity learning day14 -- collaboration and WWW
理财产品长期是几年?新手最好买长期还是短期?
Broadcast level E1 to aes-ebu audio codec E1 to stereo audio XLR codec
Have you ever encountered incompatibility between flink1.15.0 and Flink CDC MySQL 2.2.1? f
Online text filter less than specified length tool
Solve "thread 1:" -[*.collectionnormalcellview isselected]: unrecognized selector sent to instance 0x7F "
Qunhui 10 Gigabit network configuration and test
Photon network framework
Follow the promotional music MV of domestic tour in Thailand and travel to Bangkok like local people
ExpressionChangedAfterItHasBeenCheckedError: Expression has changed after it was checked.
Service stability governance
React query tutorial ④ - cache status and debugging tools
Solution: argument type 'string' expected to be an instance of a class or class constrained type
美国的国家公园概览
20000 words + 30 pictures | MySQL log: what is the use of undo log, redo log and binlog?
Windows install MySQL
Principle analysis of three methods for exchanging two numbers
< Sicily> 1000. number reversal
What are the risks of opening a mobile account? Is it safe to open an account?
Network foundation and framework