当前位置:网站首页>NLP text summary: data set introduction and preprocessing [New York Times annotated corpus]
NLP text summary: data set introduction and preprocessing [New York Times annotated corpus]
2022-06-30 02:50:00 【Ninja luantaro】
New York Time Description of the corpus :
- 1.8 million The article
- exceed 650k Manually written article summaries
- exceed 1.5 million Manually tagged articles , Tags include figure , place , organization , title , The theme
- exceed 275k Use algorithms to generate tagged articles
- For parsing xml Of documents java Tools
There are... In the corpus 650k A manually written article summary , This can be used to evaluate the document summarization algorithm ,
Reference material :
New York Times Corpus Introduce ( To be continued )
The New York Times Annotated Corpus
边栏推荐
- Entering Jiangsu writers and poets carmine Jasmine World Book Day
- Unity3d ugui force refresh of layout components
- Mysql表数据比较大情况下怎么修改添加字段
- Distributed file storage system fastdfs hands on how to do it
- How to set password complexity and timeout exit function in Oracle
- 外汇交易平台哪个好?有监管的资金就安全吗?
- How to use redis to realize the like function
- Xunwei NXP itop-imx6 development platform
- Cmake tutorial series -02- generating binaries using cmake code
- 什么是X.509证书?X.509证书工作原理及应用?
猜你喜欢

How to use redis to realize the like function

Recursion frog jumping steps problem

【postgres】postgres 数据库迁移

Raki's notes on reading paper: Leveraging type descriptions for zero shot named entity recognition and classification

Several key points recorded after reviewing redis design and Implementation

Welfare lottery | what are the highlights of open source enterprise monitoring zabbix6.0

How to prevent duplicate submission under concurrent requests

What files does a CA digital certificate contain? How to view SSL certificate information?

uniapp 地址转换经纬度

Creating exquisite skills in maker Education
随机推荐
Playful palette: an interactive parametric color mixer for artists
Redis+AOP怎么自定义注解实现限流
Wechat applet page Jump and parameter transfer
Lua Basics
打造創客教育中精湛技藝
什么是自签名证书?自签名SSL证书的优缺点?
uniapp 地址转换经纬度
学术汇报(academic presentation)/PPT应该怎么做?
What is certificate transparency CT? How to query CT logs certificate logs?
Global and Chinese market for defense network security 2022-2028: Research Report on technology, participants, trends, market size and share
How to use vant to realize data paging and drop-down loading
How can redis+aop customize annotations to achieve flow restriction
C console format code
[dry goods sharing] the latest WHQL logo certification application process
How to switch ipykernel to a different CONDA virtual environment in jupyterlab?
(图论) 连通分量(模板) + 强连通分量(模板)
Creating exquisite skills in maker Education
【npm】解决使用npm安装TypeORM的报错问题
隐藏在科技教育中的steam元素
外汇交易平台哪个好?有监管的资金就安全吗?