当前位置:网站首页>Dgraph: large scale dynamic graph dataset
Dgraph: large scale dynamic graph dataset
2022-07-02 16:57:00 【Aitime theory】
Click on the blue words
Pay attention to our
AI TIME Welcome to everyone AI Fans join in !
Webpage: https://dgraph.xinye.com/
GitHub: https://github.com/DGraphXinye/
In recent days, , Yang Yang's scientific research group of Zhejiang University (yangy.org) Hexin also jointly released a large-scale dynamic graph data set DGraph, Aimed at service graph neural network 、 Graph mining 、 Social networks 、 Researchers in the direction of anomaly detection , Provide large-scale data of real scenes .DGraph On the one hand, it can be used as the standard data to verify the performance of the correlation graph model , On the other hand, it can also be used to carry out user portrait 、 Network analysis and other research work .
Data set description
DGraph The source data of is provided by Xinye Technology .DGraph It is a directed dynamic graph with no right , Contains more than 370 Ten thousand nodes and 430 Ten thousand dynamic edges . As shown in the figure below ,DGraph The node in represents the financial lending user of Xinye technology service , A directed edge indicates an urgent contact relationship , Each node contains the attribute characteristics after desensitization , And a label indicating whether it is a financial fraud user .
Data features
The scene is real
DGraph It comes from the real financial business scenario , Its construction logic is close to the industrial landing , It provides an opportunity for users of data sets to explore how to extend the graph model to the financial field . To be specific ,DGraph The proportion of abnormal and normal users in is about 1:100, Its “ The label is unbalanced ” The characteristics of the are in line with the real scene , Support exception detection 、 Research on classification of unbalanced nodes .
Structural dynamics
DGraph User relationships in are sampled from across 27 A business scenario for months , And the network structure will evolve over time , It provides data support for the current dynamic graph model and mining research .
Large scale
DGraph contain 370 Thousands of desensitized real financial lending users and 430 Ten thousand dynamic relationships , Its scale is about the largest dynamic graph data in the financial field Elliptic Of 17 times , Support the research and evaluation of large-scale graph models . Besides ,DGraph Contained in the 60% Of “ Background node ”, That is, it is not a classification or analysis object, but it actually exists 、 Nodes that have an indirect impact on business logic . These nodes play an important role in maintaining the connectivity of the network , Widely exists in industry . Reasonable processing of background nodes can effectively improve the storage space of data and the operation efficiency of the model in large-scale data scenarios .DGraph It contains more than 200 10000 background nodes , It can support researchers to explore the properties of background nodes .
Open source community maintenance
Ranking List
DGraph Users can submit at any time 、 Refreshed performance leaderboard (leaderboard), To track the research progress of the latest graph model . The list provides a unified evaluation process , All results are open and transparent .
Research results
DGraph It has rich characteristics , Support graph research in multiple directions .
Algorithm contest
Xinye technology revolves around DGraph The seventh Xinye Technology Cup algorithm competition was held , Task and DGraph The fraud user identification in is consistent . The competition is open to the whole society , Colleges and universities at home and abroad 、 Scientific research institutes 、 Internet enterprises can sign up for the competition , The bonus pool is abundant , total 31 Thousands of yuan .
Welcome interested colleagues 「 Scan the qr code below 」 Patronize DGraph Public data website , Work together to provide rich application data for the field of artificial intelligence , Work together to build an open digital ecosystem .
Dataset home page
Match Links
carry
Wake up
Related papers :
DGraph: A Large-Scale Financial Dataset for Graph Anomaly Detection. Xuanwen Huang, Yang Yang*, Yang Wang, Chunping Wang, Zhisheng Zhang, Jiarong Xu, and Lei Chen. Preprint.
Thesis link :
http://yangy.org/works/dgraph/dgraph_2022.pdf
Cooperation platform
Excellent articles in the past are recommended
Remember to pay attention to us ! There is new knowledge every day !
About AI TIME
AI TIME From 2019 year , It aims to carry forward the spirit of scientific speculation , Invite people from all walks of life to the theory of artificial intelligence 、 Explore the essence of algorithm and scenario application , Strengthen the collision of ideas , Link the world AI scholars 、 Industry experts and enthusiasts , I hope in the form of debate , Explore the contradiction between artificial intelligence and human future , Explore the future of artificial intelligence .
so far ,AI TIME Has invited 600 Many speakers at home and abroad , Held more than 300 An event , super 210 10000 people watch .
I know you.
Looking at
Oh
~
Click on Read the original entrants !
边栏推荐
- AcWing 300. Task arrangement
- 国内比较好的OJ平台[通俗易懂]
- System Verilog implements priority arbiter
- 深度学习图像数据自动标注[通俗易懂]
- 机器学习-感知机模型
- Global and Chinese market of jacquard looms 2022-2028: Research Report on technology, participants, trends, market size and share
- lsf基础命令
- 大廠面試總結大全
- John blasting appears using default input encoding: UTF-8 loaded 1 password hash (bcrypt [blowfish 32/64 x3])
- Where can I open computer administrator permissions
猜你喜欢
电脑自带软件使图片底色变为透明(抠图白底)
DGraph: 大规模动态图数据集
Atcoder beginer contest 169 (B, C, D unique decomposition, e mathematical analysis f (DP))
Easy language ABCD sort
OpenPose的使用
What if the win11 app store cannot load the page? Win11 store cannot load page
OpenHarmony如何启动远程设备的FA
大廠面試總結大全
Bib | graph representation based on heterogeneous information network learning to predict drug disease association
只是巧合?苹果iOS16的神秘技术竟然与中国企业5年前产品一致!
随机推荐
How openharmony starts FA of remote devices
[error record] error -32000 received from application: there are no running service protocol
配置基于接口的ARP表项限制和端口安全(限制用户私自接入傻瓜交换机或非法主机接入)
电脑自带软件使图片底色变为透明(抠图白底)
TCP congestion control details | 2 background
L'explosion de John utilise l'encodage d'entrée par défaut: UTF - 8 Loaded 1 password Hash (bcrypt [blowfish 32 / 64 X3])
Global and Chinese market of oil analyzers 2022-2028: Research Report on technology, participants, trends, market size and share
Student course selection system (curriculum design of Shandong Agricultural University)
How to solve the failure of printer driver installation of computer equipment
Just a coincidence? The mysterious technology of apple ios16 is even consistent with the products of Chinese enterprises five years ago!
IP地址转换地址段
二、mock平台的扩展
一文看懂:数据指标体系的4大类型
VMware install win10 image
P6774 [NOI2020] 时代的眼泪(分块)
618 reprise en profondeur: la méthode gagnante de la famille Haier Zhi
Global and Chinese market of jacquard looms 2022-2028: Research Report on technology, participants, trends, market size and share
Unity使用UGUI设置一个简单多级水平方向下拉菜单(不需要代码)
DigiCert SSL证书支持中文域名申请吗?
PCL least median square method fitting plane