Fake news detection paper reading (3): Semi-supervised Content-based Detection of Misinformation via Tensor Embeddings
2022-07-29 08:16:00 【Quinn-ntmy】
Paper title: Semi-supervised Content-based Detection of Misinformation via Tensor Embeddings
Source of the paper: IEEE 2018
#Content-based (news text), #Semi-supervised, #Graph structure
I. Motivation
Most existing work focuses on hand-crafted feature extraction and on supervised classification models trained on a large number of labeled articles. In practice, however, labeled data is scarce and only a few labels are available.
II. Main work
The authors model this low-label setting. They represent the collection of news articles as a multi-dimensional tensor, use tensor decomposition to derive compact article embeddings that capture each article's spatial/contextual information, and use these embeddings to build an article-by-article graph on which the limited labels are propagated.
In short, the method combines a tensor representation and decomposition of news articles with a carefully constructed KNN graph, and propagates the information from a few labeled articles to infer labels for a much larger set. The contributions are:
(1) A graph representation of news articles derived from tensor-based article embeddings;
(2) A semi-supervised method that propagates known labels over the graph to infer unknown labels;
(3) A large dataset of misinformation and real news articles publicly shared on social media;
(4) An evaluation of the proposed method on real datasets.
1. Representing article content as a tensor (tensor decomposition)
Two tensor construction methods are proposed:
(1) Frequency-based tensor: a three-mode tensor (word, word, news) is built. For each news article a co-occurrence matrix is created, whose non-zero entries hold the co-occurrence counts of word pairs.
(2) Binary tensor: the same (word, word, news) tensor is built, but its entries are Boolean values indicating whether the i-th and j-th words appear together at least once within a predefined window.
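The two constructions can be sketched as follows. This is a minimal illustration, not the authors' code: the function name `cooccurrence_tensor`, the default window size, the use of a sliding window for the frequency variant, the choice to skip self-pairs, and the dense NumPy array are all assumptions made here for readability. A real corpus would need a sparse tensor.

```python
import numpy as np

def cooccurrence_tensor(articles, window=5, binary=False):
    """Build a (word, word, news) tensor from tokenized articles.

    articles: list of token lists, one per news article.
    window:   hypothetical co-occurrence window size (an assumption).
    binary:   if True, entries only record whether a pair co-occurred.
    """
    vocab = sorted({tok for doc in articles for tok in doc})
    index = {tok: i for i, tok in enumerate(vocab)}
    X = np.zeros((len(vocab), len(vocab), len(articles)))
    for m, doc in enumerate(articles):
        ids = [index[tok] for tok in doc]
        for pos, i in enumerate(ids):
            # Pair word i with each word that follows it inside the window.
            for j in ids[pos + 1 : pos + 1 + window]:
                if i == j:
                    continue  # assumption: ignore a word paired with itself
                X[i, j, m] += 1
                X[j, i, m] += 1  # keep every article slice symmetric
    if binary:
        X = (X > 0).astype(float)
    return X, vocab

# Usage: two toy articles, frequency-based and binary variants.
docs = [["fake", "news", "spreads", "fast"], ["real", "news", "report"]]
X_freq, vocab = cooccurrence_tensor(docs, window=2)
X_bin, _ = cooccurrence_tensor(docs, window=2, binary=True)
```

Each frontal slice `X[:, :, m]` is the co-occurrence matrix of article `m`, so the third mode indexes the news articles.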
2. KNN graph of news articles
The tensor embeddings computed in step 1 are used to build a graph representation of the news articles. Specifically, the factor matrix C is used to build a KNN graph G of the articles. Each row of C is the representation of the corresponding news article in the latent topic space, so building a KNN graph on C finds articles that are similar in this space.
The procedure:
(1) Treat each row of C as a point in R-dimensional space;
(2) Compute the $l_2$ distance between news articles and find the k nearest neighbors of each point in C [in practice the number of news articles is very large, so a kd-tree (k-dimensional tree) can be used to find the k nearest neighbors of each article more efficiently];
(3) Each node of G represents a news article, and each edge connects two articles that are similar in the embedding space.
The resulting graph G is undirected and symmetric, and each node is connected to at least k nodes. The graph is represented compactly as an M×M adjacency matrix.
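The graph construction above can be sketched with SciPy's kd-tree. This is an illustrative sketch under stated assumptions: the function name `knn_graph` is hypothetical, the matrix `C` in the usage lines is random stand-in data rather than a real factor matrix, and symmetrizing by keeping an edge when either endpoint selects the other is one plausible reading of "undirected and symmetric". It also assumes M is larger than k.

```python
import numpy as np
from scipy.sparse import csr_matrix
from scipy.spatial import cKDTree

def knn_graph(C, k):
    """Return a symmetric M x M sparse adjacency matrix built from the rows of C."""
    M = C.shape[0]
    tree = cKDTree(C)  # kd-tree over the M article embeddings
    # Ask for k + 1 neighbours because each point is its own nearest neighbour.
    _, nbrs = tree.query(C, k=k + 1)
    rows = np.repeat(np.arange(M), k + 1)
    cols = nbrs.ravel()
    keep = rows != cols  # drop self-loops
    A = csr_matrix((np.ones(keep.sum()), (rows[keep], cols[keep])), shape=(M, M))
    # Symmetrise: keep an edge if either article picked the other as a neighbour.
    return A.maximum(A.T)

# Usage with stand-in embeddings (6 articles, R = 3 latent dimensions).
C = np.random.default_rng(0).normal(size=(6, 3))
A = knn_graph(C, k=2)
```

Because the union of both directions is kept, every node ends up with at least k neighbours, and some nodes end up with more.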
3. Belief propagation
A belief propagation algorithm that assumes homophily is used, specifically the fast, linearized variant FaBP:
(1) The algorithm has been shown to be insensitive to the number of known labels, so the authors expect FaBP to perform well with only a few known labels;
(2) FaBP is linear in the number of edges of G, so it scales to large graphs.
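A rough sketch of the label propagation step follows. It assumes the linear-system form of FaBP as described in the FaBP paper (Koutra et al., 2011), $[I + aD - c'A]\,b = \phi$ with $a = 4h^2/(1-4h^2)$ and $c' = 2h/(1-4h^2)$, solved by fixed-point iteration so that each pass costs time proportional to the number of edges. The function name `fabp`, the particular choice of the homophily factor `h`, and the ±1 encoding of the priors are assumptions of this sketch, not details taken from the notes above.

```python
import numpy as np
from scipy.sparse import csr_matrix, diags

def fabp(A, priors, max_iter=200, tol=1e-9):
    """Propagate prior beliefs over a graph with a FaBP-style linear iteration.

    A:      symmetric adjacency matrix (dense or sparse).
    priors: vector with > 0 for one class, < 0 for the other, 0 for unlabeled.
    """
    A = csr_matrix(A, dtype=float)
    priors = np.asarray(priors, dtype=float)
    d = np.asarray(A.sum(axis=1)).ravel()  # node degrees
    # Assumption: pick h safely below 1 / (2 + 2 * max degree) so the iteration converges.
    h = 0.9 / (2.0 + 2.0 * d.max())
    a = 4 * h**2 / (1 - 4 * h**2)
    c = 2 * h / (1 - 4 * h**2)
    W = c * A - a * diags(d)
    b = priors.copy()
    for _ in range(max_iter):
        b_next = priors + W @ b  # one sparse mat-vec: linear in the number of edges
        if np.max(np.abs(b_next - b)) < tol:
            return b_next
        b = b_next
    return b

# Usage: a 4-node path graph with one labeled article at each end.
A = np.array([[0, 1, 0, 0], [1, 0, 1, 0], [0, 1, 0, 1], [0, 0, 1, 0]], dtype=float)
beliefs = fabp(A, [1.0, 0.0, 0.0, -1.0])
predicted = np.sign(beliefs)  # the sign of the final belief gives the predicted class
```

The unlabeled nodes take the sign of the nearer labeled node, which is the homophily assumption at work.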
III. Datasets
1. Political (150 political news articles);
2. Bulgarian (68 real news articles and 69 fake news articles).