当前位置:网站首页>Connect the Snowflake of CKAN tutorial CKAN to release to open data portal
Connect the Snowflake of CKAN tutorial CKAN to release to open data portal
2022-08-03 01:32:00 【Knowledge is big and fat】
The State of California leverages CKAN data management products to provide California citizens with extensive datasets from government agencies on the California Open Data Portal.These datasets are public and can be downloaded by anyone.The current state details how State can extract and load data through a python application.This small application requires an Azure virtual machine instance that needs to be brought online or manually turned on periodically to extract and load.This requires maintenance for small applications that they want to avoid.In this article, I detailed how to use an external function to push a dataset directly from Snowflake to CKAN.
Current Status
The current state requires managed servers that the infrastructure team must monitor.To avoid some of the work associated with this small server, the team is interested in establishing a more native Snowflake connection on the OpenGov-managed CKAN server.The deployed python code runs periodically and is triggered by a Windows OS task.
The python code checks for updates once a day and uploads data to CKAN.To do this, the python code extracts the data from Snowflake and puts it on the local file system as a csv.Then a stream of bytes pushes the data to CKAN.This means we have a file copy in Snowflake, a temporary data copy on the python server, and a data copy in CKAN.Every step in the process physically moves data.
Future Status
We want to build a pipeline that removes as much infrastructure from the equation as possible.So we're going to use Snowflake as our source of truth and connect with OpenGov in some direct way.as external
边栏推荐
猜你喜欢
随机推荐
2022第十一届财经峰会:优炫软件斩获双项大奖
Cholesterol-PEG-Amine,CLS-PEG-NH2,胆固醇-聚乙二醇-氨基脂两亲性脂质衍生物
No code development platform data ID introductory tutorial
Strict feedback nonlinear systems based on event trigger preset since the immunity of finite time tracking control
工业元宇宙的价值和发展
VMware workstation program starts slowly
数字化转型巨浪拍岸,成长型企业如何“渡河”?
mysql根据多字段分组——group by带两个或多个参数
openssl源码下载
Pytest配置项-pytest.ini
最近公共祖先(LCA)学习笔记 | P3379 【模板】最近公共祖先(LCA)题解
漫画:怎么证明sleep不释放锁,而wait释放锁?
学习Autodock分子对接
d实验新异常
聚乙二醇衍生物4-Arm PEG-DSPE,四臂-聚乙二醇-磷脂
典型相关分析CCA计算过程
2022中国眼博会,山东眼健康展,视力矫正仪器展,护眼产品展
【斯坦福计网CS144项目】Lab5: NetworkInterface
记一次mysql查询慢的优化历程
MDL 内存描述符链表