当前位置:网站首页>Bridging the gap between open source databases and database services
Bridging the gap between open source databases and database services
2022-06-22 15:55:00 【Ink Sky Wheel】

It is relatively easy for a group of people to create a new database management system or a new data store . The reason why we know this , Because in the past 5 In the calculation of the year , The proliferation of tools that provide structure for data has increased , And it seems to be growing faster and faster . To a large extent, thanks to the innovation of large-scale enterprises, cloud builders and scholars , They just like to wander around the database to prove a point .
however , Turn an open source database or data storage project into a business that can provide enterprise level matching and complete and support a wider range of use cases and customer types and sizes , This is quite another matter . It's hard work , It takes a lot of people 、 attention 、 Money and luck .
This is a Dipti Borkar、Steven Mih and David Simmen stay Launched two years ago Ahana In order to Facebook Created Presto Distributed SQL Engine PrestoDB The task of commercializing variants , It is no coincidence that , With the original Presto The creator of used 了 PrestoSQL, Now known as Trinio, This is a Presto A variation of the , Commercialized by their company , be called Starburst. In any case , these Presto Variations are federated with databases and datastores , And provide a general SQL layer , Allow them to query in place —— This is a very powerful function , This is necessary due to the persistence of legacy databases and the importance of data .
It's too difficult to move them all to one place to query it , This is how companies try to create data warehouses . even so , Data warehouses usually only have summary data , Although once the data enters the warehouse, it has the advantage of convenience , But getting data in the warehouse ( And make sure it's not garbage ) It's a very painful thing . In short , As we said a few months ago , You want to do data analysis without a data warehouse , This is the darling of the database industry Snowflake Using its cloud data warehouse does the exact opposite .
More and more companies want to use PrestoDB To query the location of the data . That's why Ahana Can extend its last year 8 Announced in A Reasons for round financing ,Google Ventures、Lux Capital、Third Point Ventures and Leslie Capital In which... Was raised 2720 Thousands of dollars , In order to increase Ahana Raised 480 Million dollars of seed funds to obtain in 2020 Year begins . With A Extension of the series ,Liberty Global Ventures It is the venture capital Department of the telecom company , More involvement by the same company doing business across Europe and Google ventures , Re funding the series 720 Ten thousand dollars a kitten .( We strongly suspect Liberty Global yes Ahana The customer , But the CEO Steven Mih Will not comment on this .) This brings the total revenue so far to 3200 Thousands of dollars ,Mih Add ,Ahana Not going to raise money . In the current economic environment , We joked that , If someone gives you money , You will find a reason to accept it .
In the first round A In the ten months of round financing ,Ahana The number of employees has more than doubled , Less than 50 people , And has downloaded more than 100,000 Copy its Ahana Implemented PrestoDB copy .Mih It is impossible to say that it is at the commercial level of the database Ahana Cloud How many paying customers are there in the implementation .
As for increasing the company's payroll ,Mih Your caution is understandable .“ We want to know what is happening in the global economy and the possible adverse factors associated with it ,”Mih tell The Next Platform, Avoid using R word .“ If some potential problems do not occur , Then our development speed will be very fast .”
This growth is driven by the need for joint queries across database platforms , The concept of multimodal data processing makes this even more obvious ,Matt Bornstein、Jennifer Li and Martin Casado( One of them OpenFlow The founder and Nicira One of the co founders of ,Nicira by VMware Provides NSX Virtual network stack ), All these people are working around the world for Andreesen Horowitz Make good technology investment .
The core of this modern data processing architecture is the so-called data lake —— Part of the data warehouse comes from the past , Part of the data Lake comes from Hadoop Time , But it's really just cheap and deep storage , No need to use MapReduce To solve the problem of unstructured data on machine clusters .
This is from Mih The chart of the chart summarizes the center of the chart more clearly and easily :
“ As you know , A lot of data is injected into the data lake , Including semi-structured 、 Structured and unstructured data ,”Mih explains .“ As everything is commercialized , People will ask why they should put data in another proprietary store , For example, data warehouse , And why they should keep it in an open format . If they do try to put this data into commercial storage , So computing on the data warehouse is proprietary . The idea of the data lake library is to use open source computing , And for SQL Query processing Presto Is one of the main options . And then for non SQL Queries and workloads , You can use ML and AI Frame calculation , And use Parquet Equiform . Storage is the commodity of the lakeside cottage , The computing layer is really where the cost lies ,
The entire multimodal data processing architecture has many mobile parts , If Ahana To succeed in PrestoDB Commercialize and federate distributed query engines in various relational data stores , It must become easier to install and test data Lake libraries SQL The core . This is the new Ahana Cloud for Presto The full meaning of the Community Edition . It is a free and unrestricted database version , Can run on any single cluster , Regardless of the size .( majority Presto Customers have multiple clusters , This is where the subscription will begin .) The following is the community version and the complete Ahana Cloud for Presto The difference between versions :

Community Edition stay Amazon Web Services Run on the cloud , It's like Presto The production of Ahana Cloud equally , As long as it only runs on a single cluster —— No matter how many EC2 Instance drives it ——Community Edition It's all free . There are some warnings . The Community Edition doesn't support Graviton、Graviton2 or Graviton3 example , It has only community support . If you want Ahana Cloud for Presto Enterprise version , You can seamlessly upgrade to it , Then you can have any number of clusters and run them in any AWS Run on instance type , Include AWS Created for it Graviton Arm The server CPU Series for your own use . The production version also has higher security 、 Performance enhancements ( for example AWS Autoscale on ), Of course, Ahana Technical support provided by hired real people .
Now? ,Ahana It allows people to quickly start using Presto, And save them setting up on the data lake Presto It takes days or weeks . Just grab this container , To open it , Point it to the data Lake , Then start using SQL Query to attack it . every last Community Edition Users can use it permanently , And before you have a second cluster or need enhanced security or performance , Never pay for it .
author :Timothy Prickett Morgan
Source of the article :https://www.nextplatform.com/2022/06/17/bridging-the-gap-between-open-source-database-and-database-business/
边栏推荐
- 【VTK】模型旋转平移
- 还整成这样
- 『忘了再学』Shell流程控制 — 38、while循环和until循环介绍
- Common operations in Visual Studio development
- "Software defines the world, open source builds the future" 2022 open atom global open source summit will open at the end of July
- uni开发微信小程序自定义相机自动检测(人像+身份证)
- Hello, big guys. Error reporting when using MySQL CDC for the first time
- 关于 GIN 的路由树
- Ros2 pre basic tutorial | using cmakelists Txt compile ros2 node
- 模板特例化 template<>
猜你喜欢

84.(cesium篇)cesium模型在地形上运动

mysql的concat()函数如何用

HMS Core新闻行业解决方案:让技术加上人文的温度

C语言学习-18-makefile文件编写例子以及如何生成、调用动态库

The bank card identification function of Huawei machine learning service enables bank card identification and binding with one click

Recommend several AI Intelligent Platforms

Jenkins 通过检查代码提交自动触发编译

标准化、最值归一化、均值归一化应用场景的进阶思考

SDVO:LDSO+语义,直接法语义SLAM(RAL 2022)

How to use the concat() function of MySQL
随机推荐
String的模拟实现
Is it difficult for flush to open an account? Is it safe to open an account online?
润迈德医疗通过聆讯:年内亏损6.3亿 平安资本是股东
C语言学生成绩排名系统
壹连科技冲刺深交所:年营收14亿 65%收入来自宁德时代
对领域驱动设计DDD理解
华为机器学习服务银行卡识别功能,一键实现银行卡识别与绑定
How can ordinary people make 1million yuan a year?
Discover the number of media new users can insert
向量1(类和对象)
After 100 days, Xiaoyu built a robot communication community!! Now invite moderators!
推進兼容適配,使能協同發展 GBase 5月適配速遞
架构师之路,从「存储选型」起步
Discourse 的信任级别
各位学弟学妹,别再看教材了,时间复杂度看这篇就好了
Scala language learning-05-a comparison of the efficiency of recursion and tail recursion
Quickly play ci/cd graphical choreography
[single chip microcomputer] [make buzzer sound] know the buzzer and let it make the sound you want
洛谷P2466 [SDOI2008] Sue 的小球 题解
Ros2 pre basic tutorial | using cmakelists Txt compile ros2 node

