当前位置:网站首页>Apache Doris just "graduated": why should we pay attention to this kind of SQL data warehouse?
Apache Doris just "graduated": why should we pay attention to this kind of SQL data warehouse?
2022-07-07 15:57:00 【Ink Sky Wheel】
translator : Bugatti
Doris It's based on SQL Large scale parallel processing (MPP) Open source analysis data warehouse , is Apache Incubator(Apache The incubator ) Development . Now? ,Doris Among the top projects , According to the Apache Software foundation (ASF) claim , It means “ It has proved that proper autonomy can be achieved ”.
The data warehouse recently ushered in a version 1.0, This is its eighth version developed in the incubator ( Six more Connector edition ). It is designed to support online analytical processing (OLAP) The workload , Usually used in data science scenarios .
Doris Original name Palo, Born in Baidu, a Chinese Internet search giant , It is the data warehouse system of its advertising business ,2017 In open source ,2018 in Apache The incubator .
Doris Rooted in Apache Impala and Google Mesa
According to the Apache The software foundation claims ,Doris be based on Google Mesa and Apache Impala Integrate ,Apache Impala yes 2012 Open source developed in MPP SQL Query engine , be based on Google F1 The basis of .
Mesa stay 2014 It was designed as a highly scalable analytical data warehouse system around , It is used to store key measurement data related to Google's Internet advertising business .
According to Baidu and Apache The developers of the incubator claim ,Doris Provides a simple design architecture , At the same time, it provides high availability 、 reliability 、 Fault tolerance and scalability .
“ Easy to ( Development 、 Deploy and use ), And a single system to meet the needs of many data services , This is a Doris Two characteristics of ”,Apache The software foundation said in a statement , He added that the data warehouse supports multidimensional reports 、 User portrait 、 Ad hoc queries and real-time dashboards .
Doris Other features of include column storage 、 Parallel execution 、 Vectorization Technology 、 Query optimization 、ANSI SQL, And by facing Apache Flink、Apache Hive、Apache Hudi、Apache Iceberg、Apache Spark、 Elasticsearch And the connectors of other systems are integrated with the big data ecosystem .
Usage of open source databases is expected to grow
The usage of enterprise level open source databases is expected to grow . Consultancy, Gartner stay 《2019 In open source DBMS Market conditions 》 The report predicts , To 2022 end of the year , exceed 70% The new internal application will be in the open source database management system (OSDBMS) Or based on OSDBMS Database platform as a service (dbPaaS) On the development .
Besides , As data surges and enterprises increasingly need real-time analysis , A simple large-scale parallel processing open source database has become the current need .
Ventana Research Director of research David Menninger say :“ As the volume of data continues to grow ,MPP Database has become the only practical way to process data fast enough or low enough to meet the needs of the organization .”
Cloud Architecture has inspired organizations to MPP Database interest
Menninger Express , Push MPP Other trends in database development are the relatively inexpensive cloud based server instances , These examples can be used as MPP Part of the configuration , Therefore, the organization does not need to purchase and install the physical hardware used by these systems .
Menninger Think Doris There is great hope , Although there are many MPP The database is optional , Some of them are open source , But in fact, there is no open source MPP MySQL Alternatives .
“MySQL Itself and the MariaDB Has been extended , It can support a larger analysis workload , But they were originally designed for transaction processing ”,Menninger say , He added that open source can be used PostreSQL database Greenplum as well as Google BigQuery、Amazon RedShift and Microsoft Synapse And other super large-scale services are regarded as Doris competitors .
Besides ,Gartner Vice president of big data and pre analysis research Sanjeev Mohan Express , Can also be ClickHouse、Apache Druid and Apache Pinot As a competitor .
According to the Apache The foundation claims , Use Doris There may be many advantages , For example, simple architecture and faster query time .
Doris One of the simple reasons is , It does not rely on multiple components to complete class management 、 Tasks such as synchronization and communication . Fast query time can be attributed to vectorization , This method allows programs or algorithms to operate on multiple values at once rather than a single value .
According to the Apache The developers of the foundation claim , Another benefit of this data warehouse is Doris Ultra high concurrency support , This means that it can simultaneously process processing data proposed by thousands of users 、 Request for insight from the database .
Because most organizations allow their employees to access data , So that they can use data to gain insight , Not only executives can enjoy analytical tools , Nowadays, the demand for high concurrency has increased .
Source of the article :https://baijiahao.baidu.com/s?id=1737572791176015816&wfr=spider&for=pc
边栏推荐
- Steps to create P8 certificate and warehousing account
- Use of SVN
- 航运船公司人工智能AI产品成熟化标准化规模应用,全球港航人工智能/集装箱人工智能领军者CIMC中集飞瞳,打造国际航运智能化标杆
- Three. JS introductory learning notes 08:orbitcontrols JS plug-in - mouse control model rotation, zoom in, zoom out, translation, etc
- 融云斩获 2022 中国信创数字化办公门户卓越产品奖!
- AE learning 01: AE complete project summary
- Shipping companies' AI products are mature, standardized and applied on a large scale. CIMC, the global leader in port and shipping AI / container AI, has built a benchmark for international shipping
- Spin animation of Cocos performance optimization
- Use moviepy Editor clips videos and intercepts video clips in batches
- Unity3D_ Class fishing project, bullet rebound effect is achieved
猜你喜欢
Cut ffmpeg as needed, and use emscripten to compile and run
The download button and debug button in keil are grayed out
LeetCode3_ Longest substring without duplicate characters
SPI master rx time out中断
Eye of depth (VI) -- inverse of matrix (attachment: some ideas of logistic model)
TS as a general cache method
喜讯!科蓝SUNDB数据库与鸿数科技隐私数据保护管理软件完成兼容性适配
Vertex shader to slice shader procedure, varying variable
Annexb and avcc are two methods of data segmentation in decoding
Application example of infinite list [uigridview]
随机推荐
TS typescript type declaration special declaration field number is handled when the key key
How does geojson data merge the boundaries of regions?
航運船公司人工智能AI產品成熟化標准化規模應用,全球港航人工智能/集裝箱人工智能領軍者CIMC中集飛瞳,打造國際航運智能化標杆
The download button and debug button in keil are grayed out
Streaming end, server end, player end
航天宏图信息中标乌鲁木齐某单位数据库系统研发项目
Three. JS introductory learning notes 18: how to export JSON files with Blender
Three. JS introductory learning notes 15: threejs frame animation module
用手机在通达信上开户靠谱吗?这样炒股有没有什么安全隐患
Particle effect for ugui
The difference between full-time graduate students and part-time graduate students!
A JS script can be directly put into the browser to perform operations
Keil5 does not support online simulation of STM32 F0 series
Clang compile link ffmpeg FAQ
Points for attention in porting gd32 F4 series programs to gd32 F3 series
Eye of depth (VI) -- inverse of matrix (attachment: some ideas of logistic model)
招标公告:盘锦市人民医院盘锦医院数据库维保项目
15. Using the text editing tool VIM
Three. Introduction to JS learning notes 17: mouse control of 3D model rotation of JSON file
Postman generate timestamp, future timestamp