Day 8. Developing Simplified Chinese Psychological Linguistic Analysis Dictionary for Microblog
2022-07-27 05:50:00 【Ignorant graduate student】
Title:
Developing Simplified Chinese Psychological Linguistic Analysis Dictionary for Microblog
Developing a Simplified Chinese psycholinguistic analysis dictionary for Weibo
Keywords:
LIWC, Traditional Chinese, Simplified Chinese, microblog, text analysis
Abstract:
The words that people use can reveal their emotional states, intentions, thinking styles, individual differences, etc. LIWC (Linguistic Inquiry and Word Count) has been widely used for psychological text analysis, and its dictionary is its core. A Traditional Chinese version of the LIWC dictionary has been released; it is a translation of the English LIWC dictionary. However, Simplified Chinese, the world's most widely used language, differs subtly from Traditional Chinese. Furthermore, both the English LIWC dictionary and the Traditional Chinese dictionary were developed for relatively formal text. Microblogs have become more and more popular in China. The original LIWC dictionaries give little consideration to popular microblog words, which makes them less applicable to text analysis on microblogs. In this study, a Simplified Chinese LIWC dictionary is established according to the LIWC categories. After the Traditional Chinese dictionary was translated into Simplified Chinese, the five thousand words most frequently used on microblogs were added to the dictionary. Four graduate students of psychology rated whether each word belonged in a category. The reliability and validity of the Simplified Chinese LIWC dictionary were tested by these four judges. This new dictionary could contribute to future text analysis on microblogs.
The words people use can reveal their emotional state, intentions, way of thinking, individual differences, and so on. Linguistic Inquiry and Word Count (LIWC) is widely used in psychological text analysis, and the dictionary is its core. The Traditional Chinese version of the LIWC dictionary has been released; it is a translation of the English LIWC dictionary. However, Simplified Chinese, as the most widely used language in the world, differs subtly from Traditional Chinese. In addition, both the English LIWC dictionary and the Traditional Chinese dictionary were developed for relatively formal text. Weibo is now becoming more and more popular in China. The original LIWC dictionaries give little consideration to popular Weibo words, so they are less suitable for analyzing Weibo text. This study establishes a Simplified Chinese LIWC dictionary based on the LIWC categories. After the Traditional Chinese dictionary was translated into Simplified Chinese, the 5000 most frequently used words on Weibo were added to the dictionary. Four psychology graduate students rated whether each word belonged to a category. The reliability and validity of the Simplified Chinese LIWC dictionary were tested by these four judges. This new dictionary will help future text analysis on Weibo.
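The step of collecting the most frequent microblog words can be sketched roughly as follows. This is a minimal illustration, not the authors' pipeline: it assumes the posts have already been segmented into tokens, and the function names, the toy posts, and the category labels in `existing_dictionary` are all hypothetical. The study used a cutoff of five thousand words and had four judges rate category membership; the cutoff of 2 here is only for demonstration.

```python
from collections import Counter


def most_frequent_words(posts, top_n):
    """Return the top_n most frequent tokens across pre-segmented posts."""
    counts = Counter(token for post in posts for token in post)
    return [word for word, _ in counts.most_common(top_n)]


def words_needing_rating(frequent_words, existing_dictionary):
    """Keep only frequent words that the dictionary does not cover yet."""
    return [word for word in frequent_words if word not in existing_dictionary]


# Toy, hand-made data (assumption): each post is a list of tokens.
posts = [
    ["今天", "开心", "给力"],
    ["给力", "开心", "开心"],
    ["神马", "给力", "给力"],
]

# Hypothetical existing entries mapping a word to its categories.
existing_dictionary = {"开心": ["affect", "posemo"]}

frequent = most_frequent_words(posts, top_n=2)
to_rate = words_needing_rating(frequent, existing_dictionary)

print(frequent)  # frequent words, most common first
print(to_rate)   # words that human judges would still need to categorize
```

Words in `to_rate` are the ones that would be handed to human judges, since frequency alone says nothing about which psychological category a word belongs to.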
Conclusion:
The percentage of words captured by the SCLIWC dictionary indicates that word usage in internet environments such as Sina microblog is much more diverse than in formal text materials [9, 14]. The percentage of words captured by the SCMBWC dictionary improves by more than 10 percent; in particular, it captures more words in the category of psychological processes and its subcategories, such as social processes, affective processes, and cognitive processes. The internal reliability and external validity of the two dictionaries are well supported by four groups of judges. SCLIWC bridges the gap between the LIWC software and Simplified Chinese. Moreover, SCMBWC suggests a promising approach for further text analysis of Simplified Chinese in various internet environments.
The percentage of words captured by the SCLIWC dictionary indicates that vocabulary usage on Sina Weibo and in other online environments is more diverse than in formal text materials [9, 14]. The percentage of words captured by the SCMBWC dictionary increased by more than 10%, with more words captured especially in the psychological processes category and its subcategories, such as social processes and affective processes. The internal reliability and external validity of the two dictionaries are well supported by four groups of judges. SCLIWC bridges the gap between the LIWC software and Simplified Chinese. In addition, SCMBWC provides a promising method for further analysis of Simplified Chinese text in various online environments.
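The "percentage of words captured" measure can be illustrated with a small sketch. It assumes the text is already segmented into tokens and treats a dictionary simply as a set of words; the function name `coverage`, the token list, and both word sets are made-up examples, so the numbers below are not the paper's results.

```python
def coverage(tokens, dictionary_words):
    """Percentage of tokens that appear in the dictionary (0.0 for empty input)."""
    if not tokens:
        return 0.0
    hits = sum(1 for token in tokens if token in dictionary_words)
    return 100.0 * hits / len(tokens)


# Toy microblog text, pre-segmented (assumption).
tokens = ["今天", "开心", "给力", "给力", "神马"]

# Hypothetical stand-ins for a base dictionary and a microblog-extended one.
base_words = {"今天", "开心"}
extended_words = base_words | {"给力"}

base_pct = coverage(tokens, base_words)
extended_pct = coverage(tokens, extended_words)

print(f"base: {base_pct:.1f}%, extended: {extended_pct:.1f}%")
print(f"gain: {extended_pct - base_pct:.1f} percentage points")
```

Comparing the two percentages on the same text is the kind of contrast the conclusion draws between SCLIWC and SCMBWC, here reduced to a toy scale.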