当前位置:网站首页>Named entity recognition in natural language processing: tanford core LP ner (1)
Named entity recognition in natural language processing: tanford core LP ner (1)
2020-11-06 01:28:00 【Elementary school students in IT field】
Reprint please indicate the source :https://blog.csdn.net/HHTNAN
brief introduction
CoreNLP The project is Stanford Developed a set of open source NLP System . Include tokenize, pos , parse And so on , And SpaCy similar .SpaCy Claims to be the fastest NLP System , And provide ready-made python Interface , But the disadvantage is that it does not support Chinese processing at present , CoreNLP The Chinese model is included , It can be directly used to process Chinese , but CoreNLP Use Java Development ,python It's a little bit cumbersome to call .
Stanford CoreNLP It is a powerful natural language processing tool , Many models are trained based on deep learning .
First attach a link to its official website :
https://stanfordnlp.github.io/CoreNLP/index.html
https://nlp.stanford.edu/nlp/javadoc/javanlp/
https://github.com/stanfordnlp/CoreNLP
install Installation
windows 10 Environmental Science
Installation dependency
1. First you need to configure JDK, install JDK 1.8 And above ..
2. Later on https://stanfordnlp.github.io/CoreNLP/history.html Download the corresponding jar package .
Decompress the package to get the directory , Then the language of jar Put the package in this directory .
3. download Stanford CoreNLP file :http://stanfordnlp.github.io/CoreNLP/download.html
4. Download the Chinese model jar package ( Be sure to download this file , Otherwise, it is handled in English by default ).
5. Next py install stanfordcorenlp
6. Unzip configuration
When the download is complete, the two files add up 1G+ When the download is complete, the two files add up 1G+
Take the decompressed Stanford CoreNLP Folder downloaded Stanford-chinese-corenlp-2018—models.jar Put it in the same directory ( Be careful : Be sure to be in the same directory , Otherwise, the execution will report an error )
7. stay Python Reference model in , Execute the following statement :
from stanfordcorenlp import StanfordCoreNLP
nlp=StanfordCoreNLP(r’D:\D:\stanford_nlp\stanford-corenlp-full-2018-10-05’,lang=‘zh’)
application
#encoding="utf-8"
from stanfordcorenlp import StanfordCoreNLP
import os
if os.path.exists('D:\\stanford_nlp\\stanford-corenlp-full-2018-10-05'):
print("corenlp exists")
else:
print("corenlp not exists")
nlp=StanfordCoreNLP('D:\\stanford_nlp\\stanford-corenlp-full-2018-10-05',lang='zh')
sentence = ' Wang Ming is a graduate student of Tsinghua University '
print(nlp.ner(sentence))
Output :
corenlp exists
[(‘ Wang Ming ’, ‘PERSON’), (‘ yes ’, ‘O’), (‘ tsinghua ’, ‘ORGANIZATION’), (‘ university ’, ‘ORGANIZATION’), (‘ Of ’, ‘O’), (‘ One ’, ‘NUMBER’), (‘ individual ’, ‘O’), (‘ Graduate student ’, ‘O’)]
3、 ... and 、 See part of speech tagging
Access in a browser :http://localhost:9000/
data:image/s3,"s3://crabby-images/5462b/5462b01b2ed42a21720996f8ab558e8181ec7db9" alt="WeChat ID"
Reprint please indicate the source :https://blog.csdn.net/HHTNAN
版权声明
本文为[Elementary school students in IT field]所创,转载请带上原文链接,感谢
边栏推荐
- Nodejs crawler captures ancient books and records, a total of 16000 pages, experience summary and project sharing
- Python + appium automatic operation wechat is enough
- Process analysis of Python authentication mechanism based on JWT
- Network security engineer Demo: the original * * is to get your computer administrator rights! 【***】
- 加速「全民直播」洪流,如何攻克延时、卡顿、高并发难题?
- Synchronous configuration from git to consult with git 2consul
- Building and visualizing decision tree with Python
- Relationship between business policies, business rules, business processes and business master data - modern analysis
- The difference between Es5 class and ES6 class
- Do not understand UML class diagram? Take a look at this edition of rural love class diagram, a learn!
猜你喜欢
Did you blog today?
前端基础牢记的一些操作-Github仓库管理
Python filtering sensitive word records
The road of C + + Learning: from introduction to mastery
I think it is necessary to write a general idempotent component
EOS创始人BM: UE,UBI,URI有什么区别?
PN8162 20W PD快充芯片,PD快充充电器方案
What to do if you are squeezed by old programmers? I don't want to quit
It's so embarrassing, fans broke ten thousand, used for a year!
这个项目可以让你在几分钟快速了解某个编程语言
随机推荐
It's so embarrassing, fans broke ten thousand, used for a year!
Mongodb (from 0 to 1), 11 days mongodb primary to intermediate advanced secret
Summary of common algorithms of binary tree
ES6学习笔记(五):轻松了解ES6的内置扩展对象
Programmer introspection checklist
Vuejs development specification
From zero learning artificial intelligence, open the road of career planning!
What is the side effect free method? How to name it? - Mario
Vue.js Mobile end left slide delete component
Keyboard entry lottery random draw
Skywalking series blog 1 - install stand-alone skywalking
Wechat applet: prevent multiple click jump (function throttling)
[event center azure event hub] interpretation of error information found in event hub logs
ES6 essence:
5.4 static resource mapping
This article will introduce you to jest unit test
ipfs正舵者Filecoin落地正当时 FIL币价格破千来了
Working principle of gradient descent algorithm in machine learning
Natural language processing - BM25 commonly used in search
Mac installation hanlp, and win installation and use