Open-Information-Extraction-System: builds an open knowledge graph of SPO (subject-predicate-object) triples with pyltp (version==3.4.0)

Last update: Nov 02, 2022

Overview

Open-Information-Extraction-System

A Chinese open information extraction system that builds an open knowledge graph of SPO (subject-predicate-object) triples with pyltp (version==3.4.0).

Source code analysis

A rule-based Chinese open information extraction system built on LTP dependency parsing (DP).

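It adds handling for coordinate, left-adjunct, and right-adjunct relations (implemented recursively).
The dependency parser used here only suits short, simple sentences; on overly long or colloquial sentences the parse quality is poor, which badly affects downstream extraction.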
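
The rule-based idea described above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the function names `expand` and `extract_spo` are hypothetical, and the two parses are hand-written toy inputs rather than real LTP output. It assumes LTP-style relation labels (SBV, VOB, ATT, COO, LAD, RAD) and 1-based head indices with 0 as the root, which is how pyltp reports arcs.

```python
from typing import List, Tuple

# LTP relation labels folded into an argument phrase:
# attribute, coordinate, left adjunct, right adjunct.
KEEP = ("ATT", "COO", "LAD", "RAD")


def expand(idx: int, words: List[str], heads: List[int], rels: List[str]) -> str:
    """Recursively attach the kept dependents of token idx, in surface order."""
    kids = [i for i, h in enumerate(heads) if h - 1 == idx and rels[i] in KEEP]
    left = "".join(expand(k, words, heads, rels) for k in kids if k < idx)
    right = "".join(expand(k, words, heads, rels) for k in kids if k > idx)
    return left + words[idx] + right


def extract_spo(words: List[str], heads: List[int],
                rels: List[str]) -> List[Tuple[str, str, str]]:
    """Emit one (subject, predicate, object) triple per SBV/VOB pair of a verb."""
    triples = []
    for verb in range(len(words)):
        subjects = [i for i, h in enumerate(heads)
                    if h - 1 == verb and rels[i] == "SBV"]
        objects = [i for i, h in enumerate(heads)
                   if h - 1 == verb and rels[i] == "VOB"]
        for s in subjects:
            for o in objects:
                triples.append((expand(s, words, heads, rels),
                                words[verb],
                                expand(o, words, heads, rels)))
    return triples


# Hand-written toy parses (assumptions for illustration, NOT real LTP output).
# heads are 1-based; 0 marks the root token.
words_a = ["河南", "省会", "是", "郑州"]
heads_a = [2, 3, 0, 3]
rels_a = ["ATT", "SBV", "HED", "VOB"]

words_b = ["灰姑娘", "穿", "衣服", "和", "鞋"]
heads_b = [2, 0, 2, 5, 3]
rels_b = ["SBV", "HED", "VOB", "LAD", "COO"]

print(extract_spo(words_a, heads_a, rels_a))
print(extract_spo(words_b, heads_b, rels_b))
```

In this sketch the recursion is what turns a bare head noun into a fuller phrase: the attribute on the subject and the coordinated noun with its left adjunct are pulled into the argument text. Because every rule reads the dependency arcs directly, a wrong arc from the parser propagates straight into the triple, which is consistent with the caveat about long or colloquial sentences.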

Results (partial)

{
    "ques": "郑州是那个省的",
    "answer": [
        "河南"
    ],
    "desc": "郑州是河南省省会城市，周边有洛阳、开封、新郑、新密、许昌等城市",
    "SPO": [
        [
            "郑州",
            "是",
            "那个省"
        ]
    ]
},
{
    "ques": "格林童话《灰姑娘》中,灰姑娘参加舞会时所做的车是由哪种植物变成的?",
    "answer": [
        "南瓜"
    ],
    "desc": "这时，有一位仙女出现了，帮助她摇身一变成为高贵的千金小姐，并将老鼠变成马夫，南瓜变成马车，又变了一套漂亮的衣服和一双水晶（玻璃）鞋给灰姑娘穿上。",
    "SPO": [
        [
            "灰姑娘",
            "参加",
            "舞会"
        ],
        [
            "灰姑娘",
            "参加",
            "舞会"
        ],
        [
            "做车",
            "是",
            "变成"
        ]
    ]
},
{
    "ques": "中国农历的哪个节气有着北方吃饺子、南方吃汤圆的习俗?",
    "answer": [
        "冬至"
    ],
    "desc": "在冬至节，中国北方有冬至日吃饺子的习俗，南方某些地方有冬至日吃汤圆、粉糍粑的习俗，传说在汉朝的医圣张仲景体念家乡乡民在寒冬中工作的辛苦，在冬至那天利用羊肉等祛寒的药材包在面皮中，作成耳朵的样子，给乡民们治病补身，这个药方的名字...",
    "SPO": [
        [
            "中国农历哪个节气",
            "有着",
            "吃饺子习俗"
        ],
        [
            "北方",
            "吃",
            "饺子"
        ],
        [
            "南方",
            "吃",
            "汤圆"
        ]
    ]
}
