DGraph: a large-scale dynamic graph dataset
2022-07-02 16:57:00 【Aitime theory】
Webpage: https://dgraph.xinye.com/
GitHub: https://github.com/DGraphXinye/
Recently, Yang Yang's research group at Zhejiang University (yangy.org) and Xinye Technology jointly released DGraph, a large-scale dynamic graph dataset. It aims to provide large-scale data from a real-world scenario to researchers working on graph neural networks, graph mining, social networks, and anomaly detection. DGraph can serve as a standard benchmark for evaluating the performance of graph models, and it can also support research such as user profiling and network analysis.
Dataset description
The source data of DGraph is provided by Xinye Technology. DGraph is a directed, unweighted dynamic graph containing more than 3.7 million nodes and 4.3 million dynamic edges. As shown in the figure accompanying the original article, each node represents a user of Xinye Technology's financial lending services, and each directed edge represents an emergency-contact relationship. Every node carries anonymized (desensitized) attribute features and a label indicating whether the user is a financial fraudster.
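The structure described above can be sketched as a small in-memory container. This is only an illustration, assuming each edge carries an integer timestamp and each node a feature list and a binary fraud label; the class and method names (`DynamicGraph`, `add_node`, `add_edge`, `out_neighbors`) are hypothetical and are not taken from the DGraph release.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class DynamicGraph:
    """Toy container for a directed, unweighted, dynamic graph (illustrative only)."""
    features: dict = field(default_factory=dict)  # node id -> list of floats
    labels: dict = field(default_factory=dict)    # node id -> 1 (fraud) or 0 (normal)
    edges: list = field(default_factory=list)     # (source, target, timestamp)
    _out: dict = field(default_factory=lambda: defaultdict(list))

    def add_node(self, node, feats, label):
        self.features[node] = feats
        self.labels[node] = label

    def add_edge(self, src, dst, timestamp):
        # Directed: src names dst as an emergency contact at `timestamp`.
        self.edges.append((src, dst, timestamp))
        self._out[src].append(dst)

    def out_neighbors(self, node):
        return list(self._out[node])


g = DynamicGraph()
g.add_node("u1", [0.2, 1.0], 0)
g.add_node("u2", [0.7, 0.1], 1)
g.add_edge("u1", "u2", timestamp=5)
print(g.out_neighbors("u1"))  # ['u2']
print(g.out_neighbors("u2"))  # [] because the edge is directed
```

Edges carry no weight field because the graph is unweighted; the timestamp is what makes it dynamic.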
Data features
Real-world scenario
DGraph comes from a real financial business scenario, and its construction logic stays close to industrial deployment. This gives users of the dataset an opportunity to explore how graph models can be extended to the financial domain. Specifically, the ratio of abnormal to normal users in DGraph is about 1:100. This label imbalance matches what is seen in real scenarios and supports research on anomaly detection and imbalanced node classification.
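To see why a ratio of about 1:100 matters for evaluation, consider a trivial classifier that labels every user as normal. The sketch below uses made-up toy labels at that ratio (not DGraph data), and the helper `accuracy_and_recall` is a hypothetical name introduced for illustration.

```python
def accuracy_and_recall(y_true, y_pred):
    """Return (accuracy, recall on the positive class) for binary labels."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    true_pos = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    positives = sum(y_true)
    return correct / len(y_true), true_pos / positives


# 1 abnormal user per 100 normal users (the approximate ratio stated above).
y_true = [1] * 10 + [0] * 1000
always_normal = [0] * len(y_true)

acc, rec = accuracy_and_recall(y_true, always_normal)
print(round(acc, 4), rec)  # 0.9901 0.0
```

The classifier reaches about 99% accuracy while detecting no fraud at all, which is why imbalanced tasks are usually judged with metrics that focus on the minority class rather than plain accuracy.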
Dynamic structure
The user relationships in DGraph are sampled from a business scenario spanning 27 months, and the network structure evolves over time. This provides data support for current research on dynamic graph models and dynamic graph mining.
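One simple way to study a graph that evolves over time is to take snapshots: keep only the edges that have appeared up to a given point. The sketch below assumes edges are stored as `(source, target, month)` tuples; both that layout and the function name `snapshot` are assumptions for illustration, and the toy edges are made up.

```python
def snapshot(edges, until):
    """Return the edges whose timestamp is <= `until`."""
    return [(s, d, t) for (s, d, t) in edges if t <= until]


# (source, target, month index): made-up toy data, not DGraph records.
edges = [("a", "b", 1), ("b", "c", 9), ("a", "c", 20), ("c", "d", 27)]

sizes = [len(snapshot(edges, m)) for m in (1, 12, 27)]
print(sizes)  # [1, 2, 4]
```

Comparing consecutive snapshots shows how the structure grows, which is the kind of signal dynamic graph models try to exploit.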
Large scale
DGraph contains 3.7 million anonymized real financial lending users and 4.3 million dynamic relationships. It is about 17 times the size of Elliptic, previously the largest dynamic graph dataset in the financial domain, and so supports the research and evaluation of large-scale graph models. In addition, about 60% of the nodes in DGraph are "background nodes": nodes that are not themselves targets of classification or analysis but that really exist and indirectly affect the business logic. Such nodes play an important role in maintaining the connectivity of the network and are widespread in industry. Handling background nodes sensibly can reduce the storage cost of the data and improve model efficiency in large-scale settings. With more than 2 million background nodes, DGraph allows researchers to explore their properties.
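The connectivity role of background nodes can be illustrated with a tiny reachability check. This is a sketch on made-up data, assuming edges are `(source, target)` pairs; the function name `reachable` and the node names are hypothetical.

```python
from collections import deque


def reachable(edges, start, allowed):
    """Nodes reachable from `start` via directed edges, visiting only `allowed` nodes."""
    adj = {}
    for s, d in edges:
        adj.setdefault(s, []).append(d)
    seen, queue = {start}, deque([start])
    while queue:
        node = queue.popleft()
        for nxt in adj.get(node, []):
            if nxt in allowed and nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return seen


edges = [("f1", "b1"), ("b1", "f2")]  # f* = foreground, b* = background
foreground = {"f1", "f2"}
everything = foreground | {"b1"}

print(sorted(reachable(edges, "f1", everything)))  # ['b1', 'f1', 'f2']
print(sorted(reachable(edges, "f1", foreground)))  # ['f1']
```

Dropping the background node `b1` disconnects the two foreground users, so discarding background nodes outright saves space at the cost of structural information. A common compromise is to keep them in the graph but exclude them from the training loss.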
Open-source community maintenance
Leaderboard
DGraph provides a performance leaderboard that users can submit to at any time and that is refreshed with new results, making it possible to track the progress of the latest graph models. The leaderboard uses a unified evaluation process, and all results are open and transparent.
Research results
DGraph has rich properties and supports graph research in multiple directions.
Algorithm competition
Xinye Technology is holding the 7th Xinye Technology Cup algorithm competition around DGraph. The task is the same as the fraud user identification task in DGraph. The competition is open to the public: universities, research institutes, and internet companies in China and abroad can all sign up. The total prize pool is 310,000 yuan.
Interested colleagues are welcome to visit the DGraph public dataset website and to work with us to provide rich application data for the field of artificial intelligence and build an open digital ecosystem.
Related paper:
Xuanwen Huang, Yang Yang*, Yang Wang, Chunping Wang, Zhisheng Zhang, Jiarong Xu, and Lei Chen. DGraph: A Large-Scale Financial Dataset for Graph Anomaly Detection. Preprint.
Paper link:
http://yangy.org/works/dgraph/dgraph_2022.pdf
About AI TIME
AI TIME was founded in 2019. It aims to promote the spirit of scientific debate by inviting people from all walks of life to explore the fundamental questions of artificial intelligence theory, algorithms, and applications. It seeks to encourage the exchange of ideas and to connect AI scholars, industry experts, and enthusiasts worldwide, using debate to examine the tensions between artificial intelligence and the future of humanity and to explore where the field is heading.
To date, AI TIME has invited more than 600 speakers from China and abroad and has held more than 300 events, which have drawn more than 2.1 million views.