当前位置:网站首页>[database] differences among structured data, unstructured data and semi-structured data
[database] differences among structured data, unstructured data and semi-structured data
2022-06-10 18:24:00 【bandaoyu】
Linzhonglu
Structured data can obtain corresponding information through inherent key values , And the data format is fixed , Such as RDBMS data

Semi structured data can obtain corresponding information through flexible key value adjustment , And the format of the data is not fixed , Such as json, The information stored under the same key value may be numeric , It may be textual , It could also be a dictionary or a list
<person>
<name>A</name>
<age>13</age>
<gender>female</gender>
</person><person>
<name>B</name>
<gender>male</gender>
</person>
Unstructured data cannot obtain corresponding information through key value .
In fact, we can't say that documents are unstructured data in general , It depends on the level of information you want to get , For example, I just want to count the company's financial statements , And take the financial report as a whole as my information unit , So the financial report here is also structured data ; And if I want to get the specific information in the financial report , Such as total income , So here, the financial report is unstructured data .
link :https://www.zhihu.com/question/50986354/answer/276099917
Ai Play badminton and call me :
structured :SQL
Semi structured :json
Unstructured : picture
SQL: Structured query language . Its data is structured . Get the corresponding information through the intrinsic key value
json: Through flexible key value adjustment , Get the information
NOSQL database : It does not require structured data design . So its fault tolerance is very strong , Nor is there too strict a design , the
It is easy to expand and modify in the future .
NOSQL The concept of relation does not exist in the database , If you want to achieve a relationship , for instance 1 Yes 1, One to many , Many to many , You need to program it , Instead of using the database itself .
https://www.zhihu.com/question/50986354/answer/1849823026
Jidao Technology ( Beijing ) Co., LTD.
Structured data
Fixed data model Schema, A data combination of a specific data type , For example, database tables
• Stored in RDBMS perhaps Spreadsheet in
• advantage : Association query and modification are simple
• shortcoming : The table structure is fixed , Adding an attribute change table structure is difficult . The introduction of table Association will bring more application troubles .
• application :CRM,ERP, Flight and train reservation system
Unstructured data
• There is no fixed data structure and type , There is no fixed data model schema
• Stored in file storage system or object storage system
• advantage : Follow one's inclinations , All kinds of
• shortcoming : There is no standard , Difficult to standardize management , retrieval , Inquire about
• application :BLOB, Audio and video files , Office documents , report form , journal
Semi-structured data
• There is a format but no fixed data model Schema, Have self describing attribute information to express data content .
• Store in key value pairs ,JSON/XML Similar documentation , Stored in a file or document shaped database , Or stored in the graph database as a graph , In the data warehouse .
• advantage : Good scalability , Any data can be added 、 Delete 、 Change description information , It can be retrieved and analyzed through specific algorithms and methods .
• shortcoming : Specific usage for specific application scenarios , Not good at storing BLOB
link :https://www.zhihu.com/question/50986354/answer/2214580874
Brother Piao :
summary :
- structured 、 Semi structured 、 Unstructured is actually classified according to data format .
- Structured data belongs to unstructured data , Is a special case of unstructured data .
- Both structured and semi-structured data are data with basic fixed structure patterns
- There is only domain overlap between semi-structured and unstructured data and the current popular big data , In essence, there is no necessary connection between the two .
- The industry regards big data as semi-structured / Unstructured data , It is because big data technology initially played a role in the field of semi-structured data , Its essence is to confuse data processing technology with data format , It's not true .
link :https://www.zhihu.com/question/50986354/answer/683873405
边栏推荐
- C language -- 14 loop statement for
- Abbexa 1,3-二棕榈素 CLIA 试剂盒解决方案
- Abbexa 细菌基因组 DNA 试剂盒介绍
- Can the "no password era" that apple is looking forward to really come true?
- 堆利用之chunk extend: HITCON tranining lab13
- 领导提拔你的原因,只有这点最真实,其他都是瞎扯!
- nfs网络挂载制作服务器镜像
- Detailed explanation of MySQL windowing function
- QtMqtt 源码编译设置KeepAlive后ping包超时错误不返回问题修复(QMQTT::MqttNoPingResponse,QMQTT::ClientPrivate::onPingTimeo)
- 基础提升---树形DP补充
猜你喜欢
随机推荐
软考不通过能不能补考?解答来了
Abbexa 1,3-二棕榈素 CLIA 试剂盒解决方案
Aesthetic experience (episode 237) Luo Guozheng
[CEPH] CEPH configuration source code analysis | common/config*
踩坑了,BigDecimal 使用不当,造成P0事故!
作为程序员,对于底层原理真的有那么重要吗?
AOV network topology sorting
Library for adding progress bar during training --tqdm
期货网上开户安全吗,具体怎么开户的
QtMqtt 源码编译设置KeepAlive后ping包超时错误不返回问题修复(QMQTT::MqttNoPingResponse,QMQTT::ClientPrivate::onPingTimeo)
微信小程序仿陶票票课程设计
关于目前CIM(BIM+GIS)行业的一些看法
数据的软删除—什么时候需要?又如何去实现?
搭建在线帮助中心,轻松帮助客户解决问题
Container containing the most water
(CVPR 2020) RandLA-Net: Efficient Semantic Segmentation of Large-Scale Point Clouds
内存池原理一(基于整块)
XML&Xpath解析
阅读micropyton源码-添加C扩展类模块(4)
XML & XPath parsing








