当前位置:网站首页>Integration of ongdb graph database and spark
Integration of ongdb graph database and spark
2022-07-04 16:40:00 【Ma Chao's blog】
Quickly explore graph data and graph calculation
Graph computing is the study of anything in the objective world and the relationship between things , Make a complete description of it 、 A technique of calculation and analysis . Graph computation depends on the underlying graph data model , Calculate and analyze on the basis of graph data model Spark Is a very popular and mature and stable computing engine . The following article starts from ONgDB And Spark Start of integration 【 Use TensorFlow The scheme of analyzing graph data with equal depth learning framework is beyond the scope of this paper , Only from the field of graph database Spark The integration of is a popular solution , You can do some calculation and pre training of basic map data and submit it to TensorFlow】, Introduce the specific integration implementation scheme . Downloading the source code of the case project can help novices quickly start exploring , No need to step on the pit . The general process is first Spark Cluster integration diagram database plug-in , Then use specific API Build graph data analysis code .
stay Spark Cluster installation neo4j-spark plug-in unit
- Download components
https://github.com/ongdb-contrib/neo4j-spark-connector/releases/tag/2.4.1-M1
- Download components on spark Installation directory jars Folder
E:\software\ongdb-spark\spark-2.4.0-bin-hadoop2.7\jars
The basic component depends on information
- Version information
Spark 2.4.0 http://archive.apache.org/dist/spark/spark-2.4.0/
ONgDB 3.5.x
Neo4j-Java-Driver 1.7.5
Scala 2.11
JDK 1.8
hadoop-2.7.7
https://mirrors.tuna.tsinghua.edu.cn/apache/hadoop/common/
neo4j-spark-connector-full-2.4.1-M1 https://github.com/neo4j-contrib/neo4j-spark-connector
- Download the installation package for
hadoop-2.7.7
spark-2.4.0-bin-hadoop2.7
winutils
neo4j-spark-connector-full-2.4.1-M1 【 hold jar Put the bag in spark/jars Folder 】
scala-2.11.12
Create test data
UNWIND range(1,100) as id
CREATE (p:Person {id:id}) WITH collect(p) as people
UNWIND people as p1
UNWIND range(1,10) as friend
WITH p1, people[(p1.id + friend) % size(people)] as p2
CREATE (p1)-[:KNOWS {years: abs(p2.id - p2.id)}]->(p2)
FOREACH (x in range(1,1000000) | CREATE (:Person {name:"name"+x, age: x%100}));
UNWIND range(1,1000000) as x
MATCH (n),(m) WHERE id(n) = x AND id(m)=toInt(rand()*1000000)
CREATE (n)-[:KNOWS]->(m);
remarks
- Case project 【 To avoid stepping on this under the pit Java-Scala The mixed case project can be referred to 】
https://github.com/ongdb-contrib/ongdb-spark-java-scala-example
If there is a problem downloading the dependent package, please check whether the following website can be downloaded normally Spark dependent JAR package
http://dl.bintray.com/spark-packages/maven
- Screenshot of case project 【 Start locally before use Spark】
- Please read the original text for the installation of relevant components and other references
边栏推荐
- 2021 Google vulnerability reward program review
- Accounting regulations and professional ethics [7]
- Variable cannot have type 'void'
- Laravel simply realizes Alibaba cloud storage + Baidu AI Cloud image review
- QT graphical view frame: element movement
- The 17 year growth route of Zhang Liang, an open source person, can only be adhered to if he loves it
- 高度剩余法
- Understand Alibaba cloud's secret weapon "dragon architecture" in the article "science popularization talent"
- Common knowledge of unity Editor Extension
- Transformer中position encoding实践
猜你喜欢

Function test - knowledge points and common interview questions

Opencv learning -- geometric transformation of image processing

The 17 year growth route of Zhang Liang, an open source person, can only be adhered to if he loves it

Talking about Net core how to use efcore to inject multiple instances of a context annotation type for connecting to the master-slave database

Qt---error: ‘QObject‘ is an ambiguous base of ‘MyView‘

Working group and domain analysis of Intranet
![[North Asia data recovery] a database data recovery case where the disk on which the database is located is unrecognized due to the RAID disk failure of HP DL380 server](/img/79/3fab19045e1ab2f5163033afaa4309.jpg)
[North Asia data recovery] a database data recovery case where the disk on which the database is located is unrecognized due to the RAID disk failure of HP DL380 server

MySQL learning notes - data type (2)

Software Engineer vs Hardware Engineer

Filtered off site request to
随机推荐
How to save the contents of div as an image- How to save the contents of a div as a image?
基于check-point实现图数据构建任务
. Net delay queue
China tall oil fatty acid market trend report, technical dynamic innovation and market forecast
error: ‘connect‘ was not declared in this scope connect(timer, SIGNAL(timeout()), this, SLOT(up
Model fusion -- stacking principle and Implementation
Accounting regulations and professional ethics [9]
Interpretation of the champion scheme of CVPR 2020 night target detection challenge
Four point probe Industry Research Report - market status analysis and development prospect prediction
Detailed process of DC-2 range construction and penetration practice (DC range Series)
MySQL learning notes - data type (2)
Interface fonctionnelle, référence de méthode, Widget de tri de liste implémenté par lambda
After the eruption of Tonga volcano, we analyzed the global volcanic distribution and found that the area with the most volcanoes is here!
Common knowledge of unity Editor Extension
时序图数据建模与产业链分析
Feature extraction and detection 15-akaze local matching
Market trend report, technical innovation and market forecast of tetrabromophthalate (pht4 diol) in China
Selenium element interaction
System. Currenttimemillis() and system Nanotime (), which is faster? Don't use it wrong!
Market trend report, technical innovation and market forecast of China's hair repair therapeutic apparatus