当前位置:网站首页>Advantages of time series database and traditional database
Advantages of time series database and traditional database
2022-07-26 17:13:00 【51CTO】
The problem of storing time series data in traditional relational databases
With time series data , Where should I store it ? First, let's look at the problems that traditional relational database solutions will encounter when storing temporal data .
Many people may think that adding a timestamp column to a traditional relational database can be used as a temporal database . When the amount of data is small, it's no problem . But time series data are often generated by millions or even tens of millions of terminal devices , Write concurrency is high , It belongs to a massive data scenario .
MySQL In the scenario of massive time series data, there are the following problems :
Storage costs are high : For time series data compression is not good , It takes up a lot of machine resources ;
Maintenance costs are high : Stand alone system , Need to be in the upper manual sub database sub table , Maintenance costs are high ;
Low write throughput : Single machine write throughput is low , It is difficult to meet the write pressure of tens of millions of sequential data ;
Poor query performance : For transaction processing , The aggregation analysis performance of massive data is poor .
in addition , Use Hadoop ecology (Hadoop、Spark etc. ) There are the following problems when storing timing data :
High data latency : Offline batch system , Data from generation to analyzability , It takes hours 、 Even heaven level ;
Poor query performance : Can't make good use of index , rely on MapReduce Mission , The query time is usually at the minute level .
It can be seen that the following problems need to be solved in the temporal database :
Writing of time series data : How to support the writing of hundreds of millions of data points per second .
Reading of time series data : How to support the grouping and aggregation of hundreds of millions of data at the second level .
Cost sensitive : What is brought by massive data storage is the cost problem . How to store these data at a lower cost , It will become the most important problem to be solved in time series database .
**** The advantages of temporal databases
*** The invention of temporal database products is to solve the deficiencies and defects of traditional relational database in temporal data storage and analysis , Such products are uniformly classified as time series database .*** According to the characteristics of timing data, write 、 Storage 、 Query and other processes have been optimized , These optimizations are closely related to the characteristics of time series data :
Storage costs :
Use time to increase 、 Dimension repeat 、 The characteristics of smooth change of indicators , Choose a reasonable coding and compression algorithm , Improve data compression ratio ;
By pre reducing the accuracy , Aggregate historical data , Save storage space .
High concurrent write :
Write data in bulk , Reduce network overhead ;
Data is written to memory first , Re cyclical dump Store for immutable files .
Low query latency , High query concurrency :
Optimize common query patterns , Reduce query latency through indexing and other technologies ;
Through caching 、routing And other technologies to improve query concurrency .
Comparison of open source time series database
At present, the more popular open source time series database products in the industry are InfluxDB、OpenTSDB、Prometheus、Graphite etc. , The comparison of product characteristics is shown in the figure below :

边栏推荐
- [ctfshow web] deserialization
- Interface comparator
- About the idea plug-in I wrote that can generate service and mapper with one click (with source code)
- 2022-2023 topic recommendation of information management graduation project
- 2022软件测试技能 Postman+newman+jenkins 持续集成 实战教程
- Relationship between standardization, normalization and regularization
- The Ministry of Public Security issued a traffic safety warning for summer tourism passenger transport: hold the steering wheel and tighten the safety string
- Batch normalization batch_ normalization
- 快速学会配置yum的本地源和网络源,并学会yum的使用
- The first case in Guangdong! A company in Guangzhou was punished by the police for failing to fulfill its obligation of data security protection
猜你喜欢

Win11怎么重新安装系统?

About the difference between BigDecimal conversion string toengineeringstring, toplainstring and toString

操作系统迁移实战之在openEuler上部署MySQL数据库

Implementing DDD based on ABP -- aggregation and aggregation root practice

PyQt5快速开发与实战 3.2 布局管理入门 and 3.3 Qt Designer实战应用

什么是分布式定时任务框架?

营销指南 | 几种常见的微博营销打法

Packet capturing and streaming software and network diagnosis

IDEA 阿里云多模块部署

Singleton mode
随机推荐
Response object - response character data
Threads and processes
Realizing DDD based on ABP -- related concepts of DDD
What does it mean to lock financial products regularly? Can financial products be redeemed during the lock-in period?
怎么使用C语言嵌套链表实现学生成绩管理系统
中金证券vip账户找谁开安全啊?
Create MySQL function: access denied; you need (at least one of) the SUPER privilege(s) for this operation
Win11怎么自动清理回收站?
【飞控开发基础教程3】疯壳·开源编队无人机-串口(基础收发)
Who is safe to open the VIP account of CICC securities?
The first self-developed embedded 40nm industrial scale memory chip in China was released, breaking the status quo that the localization rate is zero
mysql锁机制(举例说明)
Review the past and know the new MySQL isolation level
京东三面:我要查询千万级数据量的表,怎么操作?
Response对象-响应字符数据
“青出于蓝胜于蓝”,为何藏宝计划(TPC)是持币生息最后的一朵白莲花
Current limiting comparison: how to choose sentinel vs hystrix?
Pyqt5 rapid development and practice 3.4 signal and slot correlation
[flight control development basic tutorial 3] crazy shell · open source formation UAV - serial port (basic transceiver)
Win11怎么重新安装系统?