当前位置:网站首页>Reading papers on false news detection (5): a semi supervised learning method for fake news detection in social media
Reading papers on false news detection (5): a semi supervised learning method for fake news detection in social media
2022-07-29 06:12:00 【Quinn-ntmy】
Paper title :A Semi-supervised Learning Method for Fake News Detection in Social Media
date :IEEE2020
# Based on news text (+ Image multimode )、# Semi supervision 、# Pseudo label 、#LDA
One 、 The basic content
utilize LDA Method to fake label the unlabeled data , So as to better train unmarked CNN Model , Good results have been achieved .
Two 、 Main work
SLD-CNN
Based on semi supervised learning framework , Use CNN For tagged and unlabeled data .
(1) use first CNN Extract various features of text and image data ;
(2) Using linear discriminant analysis (LDA) Predict the categories of unclassified data ;
(3) A method is proposed to calculate the fitness function , To improve the effect of prediction categories in each step .
3、 ... and 、 Model framework

- CNN Need to tag data to optimize the network , And unmarked data cannot be in CNN Use in . Therefore, the use is based on LDA Method to predict unlabeled data .
- LDA Linear discriminant analysis :
A concept close to variance analysis and regression analysis . In every statistical method , The dependent variable is modeled as a combination of other variables . But in variance analysis and regression analysis , The dependent variable is the distance type , And in the LDA in , Dependent variables are nominal or ordered .
The paper assumes that each features Can be modeled as a A random variable .
Covariance matrix is a general form of variance of numerical variables in different directions , And because variance represents the distribution of values of random variables near the mean . therefore , n n n Covariance matrix of variables Express Around the mean vector n n n Probability distribution in dimensional space .
If there is n n n Random variables { h 1 , … , h n } {\{h_1,…,h_n}\} { h1,…,hn} Make each variable contain m m m An example ( And stored in dimension m × n m×n m×n Matrix D D D in , Among them the first i i i Column number 1 j j j The element of the row represents x i x_i xi and y i y_i yi The covariance between ).
∑ [ i , j ] = C o v ( x i , y i ) ∑[i,j] =Cov(x_i,y_i) ∑[i,j]=Cov(xi,yi)
C o v ( x i , y i ) = 1 m ∑ l = 1 m [ ( D ( l , i ) − μ i ) ( D ( i , j ) − μ j ) ] {Cov(x_i,y_i )} = {1\over m} {∑_{l=1}^m[(D(l,i)-μ_i)(D(i,j)-μ_j)]} Cov(xi,yi)=m1∑l=1m[(D(l,i)−μi)(D(i,j)−μj)]
μ i μ_i μi and μ j μ_j μj Is the second of the matrix i i i That's ok 、 The first j j j The variable mean of the column , according to Fisher Linear discriminant theory , When the difference between the mean values is the largest , The difference between the two parties is the smallest , The discrimination effect is the best :
W = ( ∑ 0 + ∑ 1 ) − 1 ( μ ‾ 1 ) − ( μ ‾ 0 ) W=(∑_0+∑_1 )^{-1} (\overline{μ}_1 ) -(\overline{μ}_0 ) W=(∑0+∑1)−1(μ1)−(μ0)
Due to the problem 2 Categories ( Real articles and fake articles ), So the index is 0 and 1.
Because for every class , The mean is the vector of each mean of each feature . This means that the two class separation levels are at their maximum , We can reduce the dimension ( That is, the number of features ) And retain the important features . This process applies to data without labels . - Semi supervision LDA(SLDA)
SLDA It's an iterative algorithm , Get the number of features and classes and iteratively maximize the discrimination class ( Two classes :fake—0、real—1), therefore SLDA The output of is between 0 and 1 Number between .
- combination CNN
There may be errors in estimating the value of the tag , In order to control its effect on CNN The impact of the classification process , The influence factor is introduced α:
θ ∗ = a r g m i n [ L l a b e l e d ( y ^ , y ) + α L u n l a b e l e d ( y ^ , y ) ] θ^*=argmin[L_{labeled} (\hat{y},y)+αL_{unlabeled} (\hat{y},y)] θ∗=argmin[Llabeled(y^,y)+αLunlabeled(y^,y)]
【 The activation functions of the first and second convolution layers are ReLU, The activation function of the output layer is Softmax, Use Adam Optimizer 】
Parameter information used :
Four 、 Data sets
Multimode ( Text 、 Images ) Data sets come from well-known websites :20015 News articles , among 11941 Article is false ,8074 This article is correct .
边栏推荐
- Configuration and use of Nacos external database
- 京微齐力:基于HMEP060的心率血氧模块开发(1:FPGA发送多位指令)
- 噪音监测传感系统
- ML4自学笔记
- 华为云14天鸿蒙设备开发-Day7WIFI功能开发
- 第一周任务 深度学习和pytorch基础
- 华为云14天鸿蒙设备开发-Day1源码获取
- How to perform POC in depth with full flash distribution?
- 2、 Multi concurrent interface pressure test
- AttributeError: module ‘tensorflow‘ has no attribute ‘placeholder‘
猜你喜欢

噪音监测传感系统

ML11-SKlearn实现支持向量机

第2周学习:卷积神经网络基础

迁移学习—— Transfer Feature Learning with Joint Distribution Adaptation

电力电子:单项逆变器设计(MATLAB程序+AD原理图)

Pytorch's data reading mechanism

ML4自学笔记

Beijing Baode & taocloud jointly build the road of information innovation
![[convolution kernel design] scaling up your kernels to 31x31: revising large kernel design in CNN](/img/71/f3fdf677cd5fddefffd4715e747297.png)
[convolution kernel design] scaling up your kernels to 31x31: revising large kernel design in CNN

基于STM32开源:磁流体蓝牙音箱(包含源码+PCB)
随机推荐
【Transformer】SOFT: Softmax-free Transformer with Linear Complexity
Wechat built-in browser prohibits caching
Beijing Baode & taocloud jointly build the road of information innovation
基于STC51:四轴飞控开源项目原理图与源码(入门级DIY)
torch.nn.Parameter()函数理解
Improve quality with intelligence financial imaging platform solution
ML4自学笔记
1、 Combine multiple txt files into one TXT file
HAL学习笔记 - 7 定时器之基本定时器
Typical cases of xdfs & China Daily Online Collaborative Editing Platform
[semantic segmentation] Introduction to mapillary dataset
2022春招——芯动科技FPGA岗技术面(一面心得)
[convolution kernel design] scaling up your kernels to 31x31: revising large kernel design in CNN
迁移学习笔记——Adapting Component Analysis
研究生新生培训第一周:深度学习和pytorch基础
2022春招——上海安路FPGA岗面经(以及乐鑫SOC面试)
迁移学习—— Transfer Feature Learning with Joint Distribution Adaptation
Tf.get in tensorflow_ Detailed explanation of variable() function
CNOOC, desktop cloud & network disk storage system application case
2022春招——芯动科技FPGA开发岗笔试题(原题以及心得)