30: Simulating JSON data generation and transmission with Kafka
2022-06-13 01:36:00 【Python's path to becoming a God】
A key step in computing PV and UV is cleaning the log data. The same applies to other workloads: in order statistics, for example, we also need to filter out "dirty data".
"Dirty data" is data that does not match the standard structure we have defined, or data we simply do not need. During ETL cleaning, each record is deserialized, parsed, and mapped to a Java class. A dirty record makes deserialization fail during this mapping, which causes the job to fail and restart. For large jobs, restarts make the job unstable, and too much dirty data makes the job report errors frequently until it finally fails outright.
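The filtering idea above can be sketched in a few lines. This is a minimal illustration, not the article's actual implementation: it uses Python's standard `json` module instead of a Java class mapping, and the schema in `REQUIRED_FIELDS` (`user_id`, `page`, `ts`) is a hypothetical one chosen for the example. The point is that a record which fails to parse or does not match the expected structure is dropped rather than allowed to raise and crash the job.

```python
import json
from typing import Iterable, List, Optional

# Hypothetical schema for illustration only; the real log format is not
# given in the article.
REQUIRED_FIELDS = {"user_id": str, "page": str, "ts": int}


def parse_record(raw: str) -> Optional[dict]:
    """Return the parsed record, or None if the record is dirty."""
    try:
        record = json.loads(raw)
    except json.JSONDecodeError:
        # Not valid JSON at all: treat as dirty instead of failing the job.
        return None
    if not isinstance(record, dict):
        return None
    for name, expected_type in REQUIRED_FIELDS.items():
        value = record.get(name)
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(value, bool) or not isinstance(value, expected_type):
            return None
    return record


def clean(lines: Iterable[str]) -> List[dict]:
    """Keep only the records that match the expected structure."""
    parsed = (parse_record(line) for line in lines)
    return [record for record in parsed if record is not None]
```

In a real job it is usually worth counting or logging the dropped records as well, so that a sudden rise in dirty data is visible instead of silently discarded.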
Architecture

As mentioned, the overall data processing architecture for computing PV and UV uses Flume to collect business data and send it to Kafka. Before computing PV and UV, we therefore need to consume the data in Kafka and filter out the dirty data.
In real business, after consuming and processing the raw Kafka log data, we also write the detailed data to a query engine such as Elasticsearch, and write the summary data to a database such as HBase or Redis for front-end query and display. At the same time, the data is written back to Kafka for other businesses to consume.
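The fan-out described above can be sketched as follows. This is a hedged illustration only: the three sinks are plain in-memory Python containers standing in for Elasticsearch, HBase or Redis, and a downstream Kafka topic, and no real client library is called. The record fields (`user_id`, `page`) and the choice to aggregate per page are assumptions made for the example. PV is counted as the number of views and UV as the number of distinct users.

```python
from collections import defaultdict
from typing import Dict, List


def summarize(records: List[dict]) -> Dict[str, Dict[str, int]]:
    """Compute PV (view count) and UV (distinct users) for each page."""
    pv = defaultdict(int)
    users = defaultdict(set)
    for record in records:
        pv[record["page"]] += 1
        users[record["page"]].add(record["user_id"])
    return {page: {"pv": pv[page], "uv": len(users[page])} for page in pv}


def fan_out(records: List[dict],
            detail_sink: List[dict],
            summary_sink: Dict[str, Dict[str, int]],
            downstream_sink: List[dict]) -> None:
    """Route cleaned records to the three hypothetical destinations."""
    # Detailed data: stand-in for a query engine such as Elasticsearch.
    detail_sink.extend(records)
    # Summary data: stand-in for a store such as HBase or Redis.
    summary_sink.update(summarize(records))
    # Cleaned data written back out: stand-in for a downstream Kafka topic.
    downstream_sink.extend(records)


if __name__ == "__main__":
    cleaned = [
        {"user_id": "u1", "page": "/home"},
        {"user_id": "u1", "page": "/home"},
        {"user_id": "u2", "page": "/home"},
    ]
    details, summary, downstream = [], {}, []
    fan_out(cleaned, details, summary, downstream)
    print(summary)
```

Keeping the aggregation in its own function makes it easy to test the PV and UV logic without any of the storage systems running.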